The amniote-conserved DNA-binding domain of CGGBP1 restricts cytosine methylation of transcription factor binding sites in proximal promoters to regulate gene expression

BMC Genom Data. 2024 Nov 18;25(1):98. doi: 10.1186/s12863-024-01282-2.

Abstract

CGGBP1 is a GC-rich DNA-binding protein which is important for genomic integrity, gene expression and epigenome maintenance through regulation of CTCF occupancy and cytosine methylation. It has remained unclear how CGGBP1 integrates multiple diverse functions with its simple architecture of only a DNA-binding domain tethered to a C-terminal tail with low structural rigidity. We have used truncated forms of CGGBP1 with or without the DNA-binding domain (DBD) to assay cytosine methylation and global gene expression. Proximal promoters of CGGBP1-repressed genes, although significantly GC-poor, contain GC-rich transcription factor binding motifs and exhibit base compositions indicative of low C-T transition rates due to prevention of cytosine methylation. Genome-wide analyses of cytosine methylation and binding of CGGBP1 DBD show that CGGBP1 restricts cytosine methylation in a manner that depends on its DBD and its DNA-binding. The CGGBP1-repressed genes show an increase in promoter cytosine methylation alongside a decrease in transcript abundance when the DBD-deficient CGGBP1 is expressed. Our findings suggest that CGGBP1 protects transcription factor binding sites (TFBS) from cytosine methylation-associated loss and thereby regulates gene expression. By analysing orthologous promoter sequences we show that restriction of cytosine methylation is a function of CGGBP1 progressively acquired during vertebrate evolution. A superimposition of our results and evolution of CGGBP1 suggests that mitigation of cytosine methylation is majorly achieved by its N-terminal DBD. Our results position CGGBP1 DNA-binding as a major evolutionarily acquired mechanism through which it keeps cytosine methylation under check and regulates TFBS retention and gene activity.

Keywords: CGGBP1; DNA sequence motifs; Evolution; TFBS.

MeSH terms

  • Animals
  • Binding Sites
  • Cytosine* / metabolism
  • DNA Methylation* / genetics
  • DNA-Binding Proteins* / chemistry
  • DNA-Binding Proteins* / genetics
  • DNA-Binding Proteins* / metabolism
  • Gene Expression Regulation*
  • Humans
  • Promoter Regions, Genetic* / genetics
  • Protein Binding
  • Protein Domains / genetics
  • Transcription Factors* / chemistry
  • Transcription Factors* / genetics
  • Transcription Factors* / metabolism

Substances

  • Cytosine
  • DNA-Binding Proteins
  • Transcription Factors
  • CGGBP1 protein, human