The role of prediction modeling in propensity score estimation: an evaluation of logistic regression, bCART, and the covariate-balancing propensity score

Am J Epidemiol. 2014 Sep 15;180(6):645-55. doi: 10.1093/aje/kwu181. Epub 2014 Aug 20.

Abstract

The covariate-balancing propensity score (CBPS) extends logistic regression to simultaneously optimize covariate balance and treatment prediction. Although the CBPS has been shown to perform well in certain settings, its performance has not been evaluated in settings specific to pharmacoepidemiology and large database research. In this study, we use both simulations and empirical data to compare the performance of the CBPS with logistic regression and boosted classification and regression trees. We simulated various degrees of model misspecification to evaluate the robustness of each propensity score (PS) estimation method. We then applied these methods to compare the effect of initiating glucagonlike peptide-1 agonists versus sulfonylureas on cardiovascular events and all-cause mortality in the US Medicare population in 2007-2009. In simulations, the CBPS was generally more robust in terms of balancing covariates and reducing bias compared with misspecified logistic PS models and boosted classification and regression trees. All PS estimation methods performed similarly in the empirical example. For settings common to pharmacoepidemiology, logistic regression with balance checks to assess model specification is a valid method for PS estimation, but it can require refitting multiple models until covariate balance is achieved. The CBPS is a promising method to improve the robustness of PS models.

Keywords: cardiovascular disease; covariate balance; diabetes; epidemiologic methods; propensity score; regression; simulation.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Bias
  • Cardiovascular Diseases / epidemiology
  • Cardiovascular Diseases / prevention & control
  • Comorbidity
  • Computer Simulation
  • Confounding Factors, Epidemiologic
  • Diabetes Mellitus / drug therapy
  • Diabetes Mellitus / epidemiology
  • Glucagon-Like Peptide 1 / agonists
  • Humans
  • Likelihood Functions
  • Logistic Models*
  • Pharmacoepidemiology
  • Propensity Score*
  • Sulfonylurea Compounds / therapeutic use

Substances

  • Sulfonylurea Compounds
  • Glucagon-Like Peptide 1