A New Method for Conditional Gene-Based Analysis Effectively Accounts for the Regional Polygenic Background

Genes (Basel). 2024 Sep 7;15(9):1174. doi: 10.3390/genes15091174.

Abstract

Gene-based association analysis is a powerful tool for identifying genes that explain trait variability. An essential step of this analysis is a conditional analysis. It aims to eliminate the influence of SNPs outside the gene, which are in linkage disequilibrium with intragenic SNPs. The popular conditional analysis method, GCTA-COJO, accounts for the influence of several top independently associated SNPs outside the gene, correcting the z statistics for intragenic SNPs. We suggest a new TauCOR method for conditional gene-based analysis using summary statistics. This method accounts the influence of the full regional polygenic background, correcting the genotype correlations between intragenic SNPs. As a result, the distribution of z statistics for intragenic SNPs becomes conditionally independent of distribution for extragenic SNPs. TauCOR is compatible with any gene-based association test. TauCOR was tested on summary statistics simulated under different scenarios and on real summary statistics for a 'gold standard' gene list from the Open Targets Genetics project. TauCOR proved to be effective in all modelling scenarios and on real data. The TauCOR's strategy showed comparable sensitivity and higher specificity and accuracy than GCTA-COJO on both simulated and real data. The method can be successfully used to improve the effectiveness of gene-based association analyses.

Keywords: conditional distribution; gene-based association analysis; random-effects model; summary statistics.

MeSH terms

  • Genome-Wide Association Study / methods
  • Genotype
  • Humans
  • Linkage Disequilibrium*
  • Models, Genetic
  • Multifactorial Inheritance* / genetics
  • Polymorphism, Single Nucleotide* / genetics