Machine learning reveals the transcriptional regulatory network and circadian dynamics of Synechococcus elongatus PCC 7942

Proc Natl Acad Sci U S A. 2024 Sep 17;121(38):e2410492121. doi: 10.1073/pnas.2410492121. Epub 2024 Sep 13.

Abstract

Synechococcus elongatus is an important cyanobacterium that serves as a versatile and robust model for studying circadian biology and photosynthetic metabolism. Its transcriptional regulatory network (TRN) is of fundamental interest, as it orchestrates the cell's adaptation to the environment, including its response to sunlight. Despite the previous characterization of constituent parts of the S. elongatus TRN, a comprehensive layout of its topology remains to be established. Here, we decomposed a compendium of 300 high-quality RNA sequencing datasets of the model strain PCC 7942 using independent component analysis. We obtained 57 independently modulated gene sets, or iModulons, that explain 67% of the variance in the transcriptional response and 1) accurately reflect the activity of known transcriptional regulations, 2) capture functional components of photosynthesis, 3) provide hypotheses for regulon structures and functional annotations of poorly characterized genes, and 4) describe the transcriptional shifts under dynamic light conditions. This transcriptome-wide analysis of S. elongatus provides a quantitative reconstruction of the TRN and presents a knowledge base that can guide future investigations. Our systems-level analysis also provides a global TRN structure for S. elongatus PCC 7942.

Keywords: carbon fixation; circadian rhythm; cyanobacteria; machine learning; transcriptional regulatory network.

MeSH terms

  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism
  • Circadian Rhythm* / genetics
  • Circadian Rhythm* / physiology
  • Gene Expression Regulation, Bacterial*
  • Gene Regulatory Networks*
  • Machine Learning*
  • Photosynthesis / genetics
  • Synechococcus* / genetics
  • Synechococcus* / metabolism
  • Transcriptome

Substances

  • Bacterial Proteins

Supplementary concepts

  • Synechococcus elongatus