Prioritization of risk genes in colorectal cancer by integrative analysis of multi-omics data and gene networks

Sci China Life Sci. 2024 Jan;67(1):132-148. doi: 10.1007/s11427-023-2439-7. Epub 2023 Sep 21.

Abstract

Genome-wide association studies (GWASs) have identified over 140 colorectal cancer (CRC)-associated loci; however, target genes at the majority of loci and underlying molecular mechanisms are poorly understood. Here, we utilized a Bayesian approach, integrative risk gene selector (iRIGS), to prioritize risk genes at CRC GWAS loci by integrating multi-omics data. As a result, a total of 105 high-confidence risk genes (HRGs) were identified, which exhibited strong gene dependencies for CRC and enrichment in the biological processes implicated in CRC. Among the 105 HRGs, CEBPB, located at the 20q13.13 locus, acted as a transcription factor playing critical roles in cancer. Our subsequent assays indicated the tumor promoter function of CEBPB that facilitated CRC cell proliferation by regulating multiple oncogenic pathways such as MAPK, PI3K-Akt, and Ras signaling. Next, by integrating a fine-mapping analysis and three independent case-control studies in Chinese populations consisting of 8,039 cases and 12,775 controls, we elucidated that rs1810503, a putative functional variant regulating CEBPB, was associated with CRC risk (OR=0.90, 95%CI=0.86-0.93, P=1.07×10-7). The association between rs1810503 and CRC risk was further validated in three additional multi-ancestry populations consisting of 24,254 cases and 58,741 controls. Mechanistically, the rs1810503 A to T allele change weakened the enhancer activity in an allele-specific manner to decrease CEBPB expression via long-range promoter-enhancer interactions, mediated by the transcription factor, REST, and thus decreased CRC risk. In summary, our study provides a genetic resource and a generalizable strategy for CRC etiology investigation, and highlights the biological implications of CEBPB in CRC tumorigenesis, shedding new light on the etiology of CRC.

Keywords: CEBPB; GWAS; gene screening models; long-range promoter-enhancer interactions; multi-omics; susceptibility genes.

MeSH terms

  • Bayes Theorem
  • Colorectal Neoplasms* / metabolism
  • Gene Regulatory Networks*
  • Genetic Predisposition to Disease
  • Genome-Wide Association Study
  • Humans
  • Multiomics
  • Phosphatidylinositol 3-Kinases / genetics
  • Polymorphism, Single Nucleotide
  • Transcription Factors / genetics

Substances

  • Phosphatidylinositol 3-Kinases
  • Transcription Factors