Pathologist Validation of a Machine Learning-Derived Feature for Colon Cancer Risk Stratification

Vincenzo L'Imperio; Ellery Wulczyn; Markus Plass; Heimo Müller; Nicolò Tamini; Luca Gianotti; Nicola Zucchini; Robert Reihs; Greg S Corrado; Dale R Webster; Lily H Peng; Po-Hsuan Cameron Chen; Marialuisa Lavitrano; Yun Liu; David F Steiner; Kurt Zatloukal; Fabio Pagni

doi:10.1001/jamanetworkopen.2022.54891

Pathologist Validation of a Machine Learning-Derived Feature for Colon Cancer Risk Stratification

JAMA Netw Open. 2023 Mar 1;6(3):e2254891. doi: 10.1001/jamanetworkopen.2022.54891.

Authors

Vincenzo L'Imperio¹, Ellery Wulczyn², Markus Plass³, Heimo Müller³, Nicolò Tamini⁴, Luca Gianotti⁴, Nicola Zucchini¹, Robert Reihs³, Greg S Corrado², Dale R Webster², Lily H Peng², Po-Hsuan Cameron Chen², Marialuisa Lavitrano¹, Yun Liu², David F Steiner², Kurt Zatloukal³, Fabio Pagni¹

Affiliations

¹ Department of Medicine and Surgery, Pathology, University of Milano-Bicocca, IRCCS (Scientific Institute for Research, Hospitalization and Healthcare) Fondazione San Gerardo dei Tintori, Monza, Italy.
² Google Health, Google LLC, Palo Alto, California.
³ Medical University of Graz, Diagnostic and Research Institute of Pathology, Graz, Austria.
⁴ Department of Surgery, San Gerardo Hospital, Monza, Italy.

Abstract

Importance: Identifying new prognostic features in colon cancer has the potential to refine histopathologic review and inform patient care. Although prognostic artificial intelligence systems have recently demonstrated significant risk stratification for several cancer types, studies have not yet shown that the machine learning-derived features associated with these prognostic artificial intelligence systems are both interpretable and usable by pathologists.

Objective: To evaluate whether pathologist scoring of a histopathologic feature previously identified by machine learning is associated with survival among patients with colon cancer.

Design, setting, and participants: This prognostic study used deidentified, archived colorectal cancer cases from January 2013 to December 2015 from the University of Milano-Bicocca. All available histologic slides from 258 consecutive colon adenocarcinoma cases were reviewed from December 2021 to February 2022 by 2 pathologists, who conducted semiquantitative scoring for tumor adipose feature (TAF), which was previously identified via a prognostic deep learning model developed with an independent colorectal cancer cohort.

Main outcomes and measures: Prognostic value of TAF for overall survival and disease-specific survival as measured by univariable and multivariable regression analyses. Interpathologist agreement in TAF scoring was also evaluated.

Results: A total of 258 colon adenocarcinoma histopathologic cases from 258 patients (138 men [53%]; median age, 67 years [IQR, 65-81 years]) with stage II (n = 119) or stage III (n = 139) cancer were included. Tumor adipose feature was identified in 120 cases (widespread in 63 cases, multifocal in 31, and unifocal in 26). For overall survival analysis after adjustment for tumor stage, TAF was independently prognostic in 2 ways: TAF as a binary feature (presence vs absence: hazard ratio [HR] for presence of TAF, 1.55 [95% CI, 1.07-2.25]; P = .02) and TAF as a semiquantitative categorical feature (HR for widespread TAF, 1.87 [95% CI, 1.23-2.85]; P = .004). Interpathologist agreement for widespread TAF vs lower categories (absent, unifocal, or multifocal) was 90%, corresponding to a κ metric at this threshold of 0.69 (95% CI, 0.58-0.80).

Conclusions and relevance: In this prognostic study, pathologists were able to learn and reproducibly score for TAF, providing significant risk stratification on this independent data set. Although additional work is warranted to understand the biological significance of this feature and to establish broadly reproducible TAF scoring, this work represents the first validation to date of human expert learning from machine learning in pathology. Specifically, this validation demonstrates that a computationally identified histologic feature can represent a human-identifiable, prognostic feature with the potential for integration into pathology practice.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Adenocarcinoma*
Aged
Artificial Intelligence
Colonic Neoplasms* / diagnosis
Humans
Machine Learning
Male
Pathologists
Risk Assessment