An approach to incorporate linkage disequilibrium structure into genomic association analysis

J Genet Genomics. 2008 Jun;35(6):381-5. doi: 10.1016/S1673-8527(08)60055-7.

Abstract

In this study, we propose to use the principal component analysis (PCA) and regression model to incorporate linkage disequilibrium (LD) in genomic association data analysis. To accommodate LD in genomic data and reduce multiple testing, we suggest performing PCA and extracting the PCA score to capture the variation of genomic data, after which regression analysis is used to assess the association of the disease with the principal component score. An empirical analysis result shows that both genotype-based correlation matrix and haplotype-based LD matrix can produce similar results for PCA. Principal component score seems to be more powerful in detecting genetic association because the principal component score is quantitatively measured and may be able to capture the effect of multiple loci.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Disease / genetics
  • Genome-Wide Association Study / methods*
  • Genomics / methods*
  • Haplotypes
  • Linkage Disequilibrium*
  • Logistic Models
  • Polymorphism, Single Nucleotide
  • Principal Component Analysis