Purpose: Using the prostate, lung, colorectal, and ovarian (PLCO) Cancer Screening Trial samples, we identified cell-free DNA (cfDNA) candidate biomarkers bearing the epigenetic mark 5-hydroxymethylcytosine (5hmC) that detected occult colorectal cancer (CRC) up to 36 months before clinical diagnosis.
Materials and methods: We performed the 5hmC-seal assay and sequencing on ≤8 ng cfDNA extracted from PLCO study participant plasma samples, including n = 201 cases (diagnosed with CRC within 36 months of blood collection) and n = 401 controls (no cancer diagnosis on follow-up). We conducted association studies and machine learning modeling to analyze the genome-wide 5hmC profiles within training and validation groups that were randomly selected at a 2:1 ratio.
Results: We successfully obtained 5hmC profiles from these decades-old samples. A weighted Cox model of 32 5hmC-modified gene bodies showed a predictive detection value for CRC as early as 36 months before overt tumor diagnosis (training set AUC, 77.1% [95% CI, 72.2 to 81.9] and validation set AUC, 72.8% [95% CI, 65.8 to 79.7]). Notably, the 5hmC-based predictive model showed comparable performance regardless of sex and race/ethnicity, and significantly outperformed risk factors such as age and obesity (assessed as BMI). Finally, when splitting cases at median weighted prediction scores, Kaplan-Meier analyses showed significant risk stratification for CRC occurrence in both the training set (hazard ratio, [HR], 3.3 [95% CI, 2.6 to 5.8]) and validation set (HR, 3.1 [95% CI, 1.8 to 5.8]).
Conclusion: Candidate 5hmC biomarkers and a scoring algorithm have the potential to predict CRC occurrence despite the absence of clinical symptoms and effective predictors. Developing a minimally invasive clinical assay that detects 5hmC-modified biomarkers holds promise for improving early CRC detection and ultimately patient outcomes.