Principal Component Analysis Reduces Collider Bias in Polygenic Score Effect Size Estimation

Behav Genet. 2022 Sep;52(4-5):268-280. doi: 10.1007/s10519-022-10104-z. Epub 2022 Jun 8.

Abstract

In this study, we test principal component analysis (PCA) of measured confounders as a method to reduce collider bias in polygenic association models. We present results from simulations and application of the method in the Collaborative Study of the Genetics of Alcoholism (COGA) sample with a polygenic score for alcohol problems, DSM-5 alcohol use disorder as the target phenotype, and two collider variables: tobacco use and educational attainment. Simulation results suggest that assumptions regarding the correlation structure and availability of measured confounders are complementary, such that meeting one assumption relaxes the other. Application of the method in COGA shows that PC covariates reduce collider bias when tobacco use is used as the collider variable. Application of this method may improve PRS effect size estimation in some cases by reducing the effect of collider bias, making efficient use of data resources that are available in many studies.

Keywords: Collider Bias; Polygenic scores; Principal component analysis.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Alcohol Drinking / genetics
  • Alcoholism* / genetics
  • Bias
  • Genome-Wide Association Study
  • Humans
  • Multifactorial Inheritance* / genetics
  • Principal Component Analysis