Accounting for nonmonotone missing data using inverse probability weighting

Stat Med. 2023 Oct 15;42(23):4282-4298. doi: 10.1002/sim.9860. Epub 2023 Jul 31.

Abstract

Inverse probability weighting can be used to correct for missing data. New estimators for the weights in the nonmonotone setting were introduced in 2018. These estimators are the unconstrained maximum likelihood estimator (UMLE) and the constrained Bayesian estimator (CBE), an alternative if UMLE fails to converge. In this work we describe and illustrate these estimators, and examine performance in simulation and in an applied example estimating the effect of anemia on spontaneous preterm birth in the Zambia Preterm Birth Prevention Study. We compare performance with multiple imputation (MI) and focus on the setting of an observational study where inverse probability of treatment weights are used to address confounding. In simulation, weighting was less statistically efficient at the smallest sample size and lowest exposure prevalence examined (n = 1500, 15% respectively) but in other scenarios statistical performance of weighting and MI was similar. Weighting had improved computational efficiency taking, on average, 0.4 and 0.05 times the time for MI in R and SAS, respectively. UMLE was easy to implement in commonly used software and convergence failure occurred just twice in >200 000 simulated cohorts making implementation of CBE unnecessary. In conclusion, weighting is an alternative to MI for nonmonotone missingness, though MI performed as well as or better in terms of bias and statistical efficiency. Weighting's superior computational efficiency may be preferred with large sample sizes or when using resampling algorithms. As validity of weighting and MI rely on correct specification of different models, both approaches could be implemented to check agreement of results.

Keywords: imputation; missing data; nonmonotone; simulation; weighting.

Publication types

  • Observational Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Bayes Theorem
  • Computer Simulation
  • Data Interpretation, Statistical
  • Female
  • Humans
  • Infant, Newborn
  • Models, Statistical
  • Premature Birth* / epidemiology
  • Probability