CRCDB: A comprehensive database for integrating and analyzing multi-omics data of early-onset and late-onset colorectal cancer

Comput Struct Biotechnol J. 2024 Jun 2:23:2507-2515. doi: 10.1016/j.csbj.2024.05.051. eCollection 2024 Dec.

Abstract

The incidence of early-onset colorectal cancer (EOCRC) has increased significantly worldwide. Uncovering biomarkers that are unique to EOCRC is of great importance to facilitate the prevention and detection of this growing cancer subtype. Although efforts have been made in the data curation about CRC, there is no integrated platform that gives access to data specifically related to young CRC patients. Here, we constructed a user-friendly open integrated resource called CRCDB (URL: http://crcdb-hust.com) which contains multi-omics data of 785 EOCRC, 4898 late-onset CRCs (LOCRC), and 1110 normal control samples from tissue, whole blood, platelets, and serum exosomes. CRCDB manages the differential analysis, survival analysis, co-expression analysis, and immune cell infiltration comparison analysis results in different CRC groups. Meta-analysis results were also provided for users for further data interpretation. Using the resource in CRCDB, we identified that genes associated with the metabolic process were less expressed in EOCRC patients, while up regulated genes most associated with the mitosis process might play an important role in the molecular pathogenesis of LOCRC. Survival-related genes were most enriched in oxidoreduction pathways in EOCRC while in immune-related pathways in LOCRC. With all the data gathered and processed, we anticipate that CRCDB could be a practical data mining platform to help explore potential applications of omics data and develop effective prevention and therapeutic strategies for the specific group of CRC patients.

Keywords: Colorectal cancer; Data mining platform; Early-onset colorectal cancer; Multi-omics data.