The G-Box Transcriptional Regulatory Code in Arabidopsis

Plant Physiol. 2017 Oct;175(2):628-640. doi: 10.1104/pp.17.01086. Epub 2017 Sep 1.

Abstract

Plants have significantly more transcription factor (TF) families than animals and fungi, and plant TF families tend to contain more genes; these expansions are linked to adaptation to environmental stressors. Many TF family members bind to similar or identical sequence motifs, such as G-boxes (CACGTG), so it is difficult to predict regulatory relationships. We determined that the flanking sequences near G-boxes help determine in vitro specificity but that this is insufficient to predict the transcription pattern of genes near G-boxes. Therefore, we constructed a gene regulatory network that identifies the set of bZIPs and bHLHs that are most predictive of the expression of genes downstream of perfect G-boxes. This network accurately predicts transcriptional patterns and reconstructs known regulatory subnetworks. Finally, we present Ara-BOX-cis (araboxcis.org), a Web site that provides interactive visualizations of the G-box regulatory network, a useful resource for generating predictions for gene regulatory relations.

MeSH terms

  • Arabidopsis / genetics*
  • Basic Helix-Loop-Helix Transcription Factors / genetics*
  • Basic-Leucine Zipper Transcription Factors / genetics*
  • G-Box Binding Factors / genetics*
  • Gene Expression Regulation, Plant / genetics*
  • Gene Regulatory Networks*
  • Nucleotide Motifs*
  • Plant Proteins / genetics

Substances

  • Basic Helix-Loop-Helix Transcription Factors
  • Basic-Leucine Zipper Transcription Factors
  • G-Box Binding Factors
  • Plant Proteins