Whole-genome sequencing of individuals from a founder population identifies candidate genes for asthma

PLoS One. 2014 Aug 12;9(8):e104396. doi: 10.1371/journal.pone.0104396. eCollection 2014.

Abstract

Asthma is a complex genetic disease caused by a combination of genetic and environmental risk factors. We sought to test classes of genetic variants largely missed by genome-wide association studies (GWAS), including copy number variants (CNVs) and low-frequency variants, by performing whole-genome sequencing (WGS) on 16 individuals from asthma-enriched and asthma-depleted families. The samples were obtained from an extended 13-generation Hutterite pedigree with reduced genetic heterogeneity due to a small founding gene pool and reduced environmental heterogeneity as a result of a communal lifestyle. We sequenced each individual to an average depth of 13-fold, generated a comprehensive catalog of genetic variants, and tested the most severe mutations for association with asthma. We identified and validated 1960 CNVs, 19 nonsense or splice-site single nucleotide variants (SNVs), and 18 insertions or deletions that were out of frame. As follow-up, we performed targeted sequencing of 16 genes in 837 cases and 540 controls of Puerto Rican ancestry and found that controls carry a significantly higher burden of mutations in IL27RA (2.0% of controls; 0.23% of cases; nominal p = 0.004; Bonferroni p = 0.21). We also genotyped 593 CNVs in 1199 Hutterite individuals. We identified a nominally significant association (p = 0.03; Odds ratio (OR) = 3.13) between a 6 kbp deletion in an intron of NEDD4L and increased risk of asthma. We genotyped this deletion in an additional 4787 non-Hutterite individuals (nominal p = 0.056; OR = 1.69). NEDD4L is expressed in bronchial epithelial cells, and conditional knockout of this gene in the lung in mice leads to severe inflammation and mucus accumulation. Our study represents one of the early instances of applying WGS to complex disease with a large environmental component and demonstrates how WGS can identify risk variants, including CNVs and low-frequency variants, largely untested in GWAS.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Alleles
  • Asthma / genetics*
  • Chromosome Mapping
  • Comparative Genomic Hybridization
  • DNA Copy Number Variations
  • Endosomal Sorting Complexes Required for Transport / genetics
  • Female
  • Founder Effect*
  • Gene Frequency
  • Genetic Predisposition to Disease*
  • Genetic Variation
  • Genome, Human*
  • Genome-Wide Association Study*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Introns
  • Male
  • Nedd4 Ubiquitin Protein Ligases
  • Polymorphism, Single Nucleotide
  • Population Groups / genetics
  • Sequence Deletion
  • Ubiquitin-Protein Ligases / genetics

Substances

  • Endosomal Sorting Complexes Required for Transport
  • Nedd4 Ubiquitin Protein Ligases
  • Nedd4L protein, human
  • Nedd4l protein, mouse
  • Ubiquitin-Protein Ligases

Associated data

  • dbGaP/PHS000599.V1.P1