Identification of DNA motifs that regulate DNA methylation

Nucleic Acids Res. 2019 Jul 26;47(13):6753-6768. doi: 10.1093/nar/gkz483.

Abstract

DNA methylation is an important epigenetic mark but how its locus-specificity is decided in relation to DNA sequence is not fully understood. Here, we have analyzed 34 diverse whole-genome bisulfite sequencing datasets in human and identified 313 motifs, including 92 and 221 associated with methylation (methylation motifs, MMs) and unmethylation (unmethylation motifs, UMs), respectively. The functionality of these motifs is supported by multiple lines of evidence. First, the methylation levels at the MM and UM motifs are respectively higher and lower than the genomic background. Second, these motifs are enriched at the binding sites of methylation modifying enzymes including DNMT3A and TET1, indicating their possible roles of recruiting these enzymes. Third, these motifs significantly overlap with "somatic QTLs" (quantitative trait loci) of methylation and expression. Fourth, disruption of these motifs by mutation is associated with significantly altered methylation level of the CpGs in the neighbor regions. Furthermore, these motifs together with somatic mutations are predictive of cancer subtypes and patient survival. We revealed some of these motifs were also associated with histone modifications, suggesting a possible interplay between the two types of epigenetic modifications. We also found some motifs form feed forward loops to contribute to DNA methylation dynamics.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Binding Sites
  • CpG Islands
  • DNA (Cytosine-5-)-Methyltransferases / metabolism
  • DNA / genetics*
  • DNA / metabolism
  • DNA Methylation / genetics*
  • DNA Methyltransferase 3A
  • DNA, Neoplasm / genetics
  • Datasets as Topic
  • Epigenesis, Genetic / genetics*
  • Histone Code
  • Humans
  • Kaplan-Meier Estimate
  • Mixed Function Oxygenases / metabolism
  • Models, Genetic
  • Neoplasms / genetics
  • Neoplasms / mortality
  • Promoter Regions, Genetic / genetics
  • Proto-Oncogene Proteins / metabolism
  • Quantitative Trait Loci
  • Sequence Analysis, DNA

Substances

  • DNA, Neoplasm
  • DNMT3A protein, human
  • Proto-Oncogene Proteins
  • DNA
  • Mixed Function Oxygenases
  • TET1 protein, human
  • DNA (Cytosine-5-)-Methyltransferases
  • DNA Methyltransferase 3A