Developing a clinical utility framework to evaluate prediction models in radiogenomics

Yirong Wu; Jie Liu; Alejandro Munoz Del Rio; David C Page; Oguzhan Alagoz; Peggy Peissig; Adedayo A Onitilo; Elizabeth S Burnside

doi:10.1117/12.2081954

Developing a clinical utility framework to evaluate prediction models in radiogenomics

Proc SPIE Int Soc Opt Eng. 2015 Feb 21:9416:941617. doi: 10.1117/12.2081954. Epub 2015 Mar 17.

Authors

Yirong Wu¹, Jie Liu², Alejandro Munoz Del Rio³, David C Page², Oguzhan Alagoz⁴, Peggy Peissig⁵, Adedayo A Onitilo⁶, Elizabeth S Burnside¹

Affiliations

¹ Dept. of Radiology, UW Madison, WI, USA.
² Dept. of Computer Science, UW Madison, WI, USA.
³ Dept. of Radiology, UW Madison, WI, USA; Dept. Of Medical Physics, UW Madison, WI, USA.
⁴ Dept. of Industrial and Systems Engineering, UW Madison, WI, USA.
⁵ Marshfield Clinic Research Foundation, Marshfield, WI, USA.
⁶ Marshfield Clinic Research Foundation, Marshfield, WI, USA; Dept. of Hematology/Oncology, Marshfield Clinic Weston Center, Weston, WI, USA.

Abstract

Combining imaging and genetic information to predict disease presence and behavior is being codified into an emerging discipline called "radiogenomics." Optimal evaluation methodologies for radiogenomics techniques have not been established. We aim to develop a clinical decision framework based on utility analysis to assess prediction models for breast cancer. Our data comes from a retrospective case-control study, collecting Gail model risk factors, genetic variants (single nucleotide polymorphisms-SNPs), and mammographic features in Breast Imaging Reporting and Data System (BI-RADS) lexicon. We first constructed three logistic regression models built on different sets of predictive features: (1) Gail, (2) Gail+SNP, and (3) Gail+SNP+BI-RADS. Then, we generated ROC curves for three models. After we assigned utility values for each category of findings (true negative, false positive, false negative and true positive), we pursued optimal operating points on ROC curves to achieve maximum expected utility (MEU) of breast cancer diagnosis. We used McNemar's test to compare the predictive performance of the three models. We found that SNPs and BI-RADS features augmented the baseline Gail model in terms of the area under ROC curve (AUC) and MEU. SNPs improved sensitivity of the Gail model (0.276 vs. 0.147) and reduced specificity (0.855 vs. 0.912). When additional mammographic features were added, sensitivity increased to 0.457 and specificity to 0.872. SNPs and mammographic features played a significant role in breast cancer risk estimation (p-value < 0.001). Our decision framework comprising utility analysis and McNemar's test provides a novel framework to evaluate prediction models in the realm of radiogenomics.

Keywords: ROC methodology; breast imaging; expected utility; genetics; mammography.

Abstract

Grants and funding