Network-guided sparse regression modeling for detection of gene-by-gene interactions

Bioinformatics. 2013 May 15;29(10):1241-9. doi: 10.1093/bioinformatics/btt139. Epub 2013 Apr 18.

Abstract

Motivation: Genetic variants identified by genome-wide association studies to date explain only a small fraction of total heritability. Gene-by-gene interaction is one important potential source of unexplained total heritability. We propose a novel approach to detect such interactions that uses penalized regression and sparse estimation principles, and incorporates outside biological knowledge through a network-based penalty.

Results: We tested our new method on simulated and real data. Simulation showed that with reasonable outside biological knowledge, our method performs noticeably better than stage-wise strategies (i.e. selecting main effects first, and interactions second, from those main effects selected) in finding true interactions, especially when the marginal strength of main effects is weak. We applied our method to Framingham Heart Study data on total plasma immunoglobulin E (IgE) concentrations and found a number of interactions among different classes of human leukocyte antigen genes that may interact to influence the risk of developing IgE dysregulation and allergy.

Availability: The proposed method is implemented in R and available at http://math.bu.edu/people/kolaczyk/software.html.

Contact: chenlu@bu.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Simulation
  • Epistasis, Genetic*
  • Genome-Wide Association Study
  • HLA Antigens / genetics
  • Humans
  • Hypersensitivity / blood
  • Hypersensitivity / genetics
  • Immunoglobulin E / blood
  • Immunoglobulin E / genetics
  • Polymorphism, Single Nucleotide
  • Regression Analysis*

Substances

  • HLA Antigens
  • Immunoglobulin E