Machine Learning-based Differentiation of Benign and Premalignant Colorectal Polyps Detected with CT Colonography in an Asymptomatic Screening Population: A Proof-of-Concept Study

Radiology. 2021 May;299(2):326-335. doi: 10.1148/radiol.2021202363. Epub 2021 Feb 23.

Abstract

Background: CT colonography does not enable definite differentiation between benign and premalignant colorectal polyps.

Purpose: To perform radiomics-based machine learning differentiation of benign and premalignant colorectal polyps detected with CT colonography in an average-risk asymptomatic colorectal cancer screening sample, with external validation.

Materials and Methods: In this secondary analysis of a prospective trial, colorectal polyps of all size categories and morphologies were manually segmented on CT colonographic images and were classified as benign (hyperplastic polyp or regular mucosa) or premalignant (adenoma) according to the histopathologic reference standard. Quantitative image features characterizing shape (n = 14), gray-level histogram statistics (n = 18), and image texture (n = 68) were extracted from segmentations after applying 22 image filters, resulting in 1906 feature-filter combinations. Based on these features, a random forest classification algorithm was trained to predict the individual polyp character. Diagnostic performance was validated in an external test set.

Results: The random forest model was fitted using a training set consisting of 107 colorectal polyps in 63 patients (mean age, 63 years ± 8 [standard deviation]; 40 men) comprising 169 segmentations on CT colonographic images. The external test set included 77 polyps in 59 patients comprising 118 segmentations. Random forest analysis yielded an area under the receiver operating characteristic curve of 0.91 (95% CI: 0.85, 0.96), a sensitivity of 82% (65 of 79) (95% CI: 74%, 91%), and a specificity of 85% (33 of 39) (95% CI: 72%, 95%) in the external test set. In two subgroup analyses of the external test set, the area under the receiver operating characteristic curve was 0.87 in the size category of 6-9 mm and 0.90 in the size category of 10 mm or larger. The most important image feature for decision making (relative importance of 3.7%) quantified first-order gray-level histogram statistics.

Conclusion: In this proof-of-concept study, machine learning-based image analysis enabled noninvasive differentiation of benign and premalignant colorectal polyps with CT colonography.

© RSNA, 2021. Online supplemental material is available for this article.
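
The counts reported in the abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not the authors' code. It assumes that the 14 shape features are computed once from the segmentation mask, while the histogram and texture features are recomputed for each of the 22 filters; the abstract does not state this, but the assumption reproduces the reported total of 1906. The function and variable names are hypothetical. The sensitivity and specificity denominators (79 + 39) sum to 118, which matches the number of segmentations in the external test set.

```python
# Illustrative arithmetic only; names are hypothetical, counts come from the abstract.

SHAPE_FEATURES = 14       # shape descriptors
HISTOGRAM_FEATURES = 18   # gray-level histogram statistics
TEXTURE_FEATURES = 68     # image texture features
IMAGE_FILTERS = 22        # image filters applied before extraction


def feature_filter_combinations(shape, histogram, texture, filters):
    """Count feature-filter combinations.

    Assumption (not stated in the abstract): shape features do not depend
    on the filter, so they are counted once; intensity-based features are
    counted once per filter.
    """
    return shape + (histogram + texture) * filters


def proportion(hits, total):
    """Return hits / total as a fraction between 0 and 1."""
    return hits / total


total_features = feature_filter_combinations(
    SHAPE_FEATURES, HISTOGRAM_FEATURES, TEXTURE_FEATURES, IMAGE_FILTERS
)
sensitivity = proportion(65, 79)  # premalignant segmentations correctly flagged
specificity = proportion(33, 39)  # benign segmentations correctly flagged

print(total_features)        # 1906
print(f"{sensitivity:.0%}")  # 82%
print(f"{specificity:.0%}")  # 85%
```

The confidence intervals quoted in the abstract are not recomputed here because the abstract does not say which interval method was used.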

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aged
  • Colonic Neoplasms / diagnostic imaging*
  • Colonic Neoplasms / pathology
  • Colonic Polyps / diagnostic imaging*
  • Colonic Polyps / pathology
  • Colonography, Computed Tomographic*
  • Contrast Media
  • Diagnosis, Differential
  • Female
  • Humans
  • Image Interpretation, Computer-Assisted
  • Machine Learning*
  • Male
  • Mass Screening
  • Middle Aged
  • Precancerous Conditions / diagnostic imaging*
  • Precancerous Conditions / pathology
  • Proof of Concept Study
  • Prospective Studies

Substances

  • Contrast Media