A Survey of Rare Epigenetic Variation in 23,116 Human Genomes Identifies Disease-Relevant Epivariations and CGG Expansions

Am J Hum Genet. 2020 Oct 1;107(4):654-669. doi: 10.1016/j.ajhg.2020.08.019. Epub 2020 Sep 15.

Abstract

There is growing recognition that epivariations, most often recognized as promoter hypermethylation events that lead to gene silencing, are associated with a number of human diseases. However, little information exists on the prevalence and distribution of rare epigenetic variation in the human population. In order to address this, we performed a survey of methylation profiles from 23,116 individuals using the Illumina 450k array. Using a robust outlier approach, we identified 4,452 unique autosomal epivariations, including potentially inactivating promoter methylation events at 384 genes linked to human disease. For example, we observed promoter hypermethylation of BRCA1 and LDLR at population frequencies of ∼1 in 3,000 and ∼1 in 6,000, respectively, suggesting that epivariations may underlie a fraction of human disease which would be missed by purely sequence-based approaches. Using expression data, we confirmed that many epivariations are associated with outlier gene expression. Analysis of variation data and monozygous twin pairs suggests that approximately two-thirds of epivariations segregate in the population secondary to underlying sequence mutations, while one-third are likely sporadic events that occur post-zygotically. We identified 25 loci where rare hypermethylation coincided with the presence of an unstable CGG tandem repeat, validated the presence of CGG expansions at several loci, and identified the putative molecular defect underlying most of the known folate-sensitive fragile sites in the genome. Our study provides a catalog of rare epigenetic changes in the human genome, gives insight into the underlying origins and consequences of epivariations, and identifies many hypermethylated CGG repeat expansions.

Keywords: DNA methylation; epimutation; epivariation; folate-sensitive fragile site; repeat expansion.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • BRCA1 Protein / genetics*
  • BRCA1 Protein / metabolism
  • DNA Methylation
  • Epigenesis, Genetic*
  • Female
  • Folic Acid / metabolism
  • Gene Silencing
  • Genetic Diseases, Inborn / diagnosis
  • Genetic Diseases, Inborn / genetics*
  • Genetic Diseases, Inborn / pathology
  • Genetic Loci
  • Genetic Variation
  • Genome, Human*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Male
  • Promoter Regions, Genetic
  • Receptors, LDL / genetics*
  • Receptors, LDL / metabolism
  • Trinucleotide Repeat Expansion*
  • Twins, Monozygotic

Substances

  • BRCA1 Protein
  • BRCA1 protein, human
  • LDLR protein, human
  • Receptors, LDL
  • Folic Acid