Metrics for covariate balance in cohort studies of causal effects

Stat Med. 2014 May 10;33(10):1685-99. doi: 10.1002/sim.6058. Epub 2013 Dec 9.

Abstract

Inferring causation from non-randomized studies of exposure requires that exposure groups can be balanced with respect to prognostic factors for the outcome. Although there is broad agreement in the literature that balance should be checked, there is confusion regarding the appropriate metric. We present a simulation study that compares several balance metrics with respect to the strength of their association with bias in estimation of the effect of a binary exposure on a binary, count, or continuous outcome. The simulations utilize matching on the propensity score with successively decreasing calipers to produce datasets with varying covariate balance. We propose the post-matching C-statistic as a balance metric and found that it had consistently strong associations with estimation bias, even when the propensity score model was misspecified, as long as the propensity score was estimated with sufficient study size. This metric, along with the average standardized difference and the general weighted difference, outperformed all other metrics considered in association with bias, including the unstandardized absolute difference, Kolmogorov-Smirnov and Lévy distances, overlapping coefficient, Mahalanobis balance, and L1 metrics. Of the best-performing metrics, the C-statistic and general weighted difference also have the advantage that they automatically evaluate balance on all covariates simultaneously and can easily incorporate balance on interactions among covariates. Therefore, when combined with the usual practice of comparing individual covariate means and standard deviations across exposure groups, these metrics may provide useful summaries of the observed covariate imbalance.

Keywords: bias; confounding factors; covariate balance; matching; propensity score.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aged
  • Anti-Inflammatory Agents, Non-Steroidal / pharmacology
  • Celecoxib
  • Cohort Studies*
  • Computer Simulation
  • Data Interpretation, Statistical*
  • Gastrointestinal Tract / metabolism
  • Humans
  • Models, Statistical*
  • Monte Carlo Method
  • Myocardial Infarction / prevention & control
  • Propensity Score*
  • Pyrazoles / pharmacology
  • Sulfonamides / pharmacology

Substances

  • Anti-Inflammatory Agents, Non-Steroidal
  • Pyrazoles
  • Sulfonamides
  • Celecoxib