Development and Optimization of a Machine-Learning Prediction Model for Acute Desquamation After Breast Radiation Therapy in the Multicenter REQUITE Cohort

Mahmoud Aldraimli; Sarah Osman; Diana Grishchuck; Samuel Ingram; Robert Lyon; Anil Mistry; Jorge Oliveira; Robert Samuel; Leila E A Shelley; Daniele Soria; Miriam V Dwek; Miguel E Aguado-Barrera; David Azria; Jenny Chang-Claude; Alison Dunning; Alexandra Giraldo; Sheryl Green; Sara Gutiérrez-Enríquez; Carsten Herskind; Hans van Hulle; Maarten Lambrecht; Laura Lozza; Tiziana Rancati; Victoria Reyes; Barry S Rosenstein; Dirk de Ruysscher; Maria C de Santis; Petra Seibold; Elena Sperk; R Paul Symonds; Hilary Stobart; Begoña Taboada-Valadares; Christopher J Talbot; Vincent J L Vakaet; Ana Vega; Liv Veldeman; Marlon R Veldwijk; Adam Webb; Caroline Weltens; Catharine M West; Thierry J Chaussalet; Tim Rattay; REQUITE consortium

doi:10.1016/j.adro.2021.100890

Adv Radiat Oncol. 2022 Jan 3;7(3):100890. doi: 10.1016/j.adro.2021.100890. eCollection 2022 May-Jun.

Authors

Mahmoud Aldraimli¹, Sarah Osman², Diana Grishchuck³, Samuel Ingram⁴, Robert Lyon⁵, Anil Mistry⁶, Jorge Oliveira⁷, Robert Samuel⁸, Leila E A Shelley⁹, Daniele Soria¹⁰, Miriam V Dwek¹¹, Miguel E Aguado-Barrera¹² ¹³, David Azria¹⁴, Jenny Chang-Claude¹⁵ ¹⁶, Alison Dunning¹⁷, Alexandra Giraldo¹⁸, Sheryl Green¹⁹, Sara Gutiérrez-Enríquez²⁰, Carsten Herskind²¹, Hans van Hulle²², Maarten Lambrecht²³, Laura Lozza²⁴, Tiziana Rancati²⁵, Victoria Reyes¹⁸, Barry S Rosenstein²⁶, Dirk de Ruysscher²⁷, Maria C de Santis²⁴, Petra Seibold¹⁵, Elena Sperk²¹, R Paul Symonds²⁸, Hilary Stobart²⁹, Begoña Taboada-Valadares¹² ³⁰, Christopher J Talbot³¹, Vincent J L Vakaet²², Ana Vega¹² ¹³ ³², Liv Veldeman²², Marlon R Veldwijk²¹, Adam Webb³¹, Caroline Weltens²³, Catharine M West³³, Thierry J Chaussalet¹, Tim Rattay²⁸; REQUITE consortium

Affiliations

¹ Health Innovation Ecosystem, University of Westminster, London, United Kingdom.
² Patrick G. Johnston Centre for Cancer Research, Queen's University Belfast, Belfast, United Kingdom.
³ Imperial College Healthcare NHS Trust, London, United Kingdom.
⁴ Division of Cancer Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, United Kingdom.
⁵ Department of Computer Science, Edge Hill University, Ormskirk, Lancashire, United Kingdom.
⁶ Guy's and St. Thomas' NHS Foundation Trust, London, United Kingdom.
⁷ Mirada Medical, Oxford, United Kingdom.
⁸ University of Leeds, Leeds Cancer Centre, St. James's University Hospital, Leeds, United Kingdom.
⁹ Edinburgh Cancer Centre, Western General Hospital, Edinburgh, United Kingdom.
¹⁰ School of Computing, University of Kent, Canterbury, United Kingdom.
¹¹ School of Life Sciences, University of Westminster, London, United Kingdom.
¹² Fundación Publica Galega Medicina Xenomica, Santiago de Compostela, Spain.
¹³ Instituto de Investigación Sanitaria de Santiago (IDIS), Servicio Galego de Saúde (SERGAS), Santiago de Compostela, Spain.
¹⁴ University of Montpellier, Montpellier, France.
¹⁵ Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany.
¹⁶ UKE University Cancer Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
¹⁷ Centre for Cancer Genetic Epidemiology, University of Cambridge, Strangeways Research Laboratory, Worts Causeway, Cambridge, United Kingdom.
¹⁸ Radiation Oncology Department, Vall d'Hebron Hospital Universitari, Vall d'Hebron Hospital Campus, Barcelona, Spain.
¹⁹ Department of Radiation Oncology, Icahn School of Medicine at Mount Sinai, New York, New York.
²⁰ Hereditary Cancer Genetics Group, Vall d'Hebron Institute of Oncology (VHIO), Vall d'Hebron Hospital Campus, Barcelona, Spain.
²¹ Department of Radiation Oncology, Universitätsmedizin Mannheim, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany.
²² Department of Human Structure and Repair, Ghent University, Ghent, Belgium.
²³ Department of Radiation Oncology, University Hospital, Leuven, Belgium.
²⁴ Department of Radiation Oncology 1, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy.
²⁵ Prostate Cancer Program, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy.
²⁶ Icahn School of Medicine at Mount Sinai, New York, New York.
²⁷ Maastricht University Medical Center, Department of Radiation Oncology (Maastro), GROW, Maastricht, The Netherlands.
²⁸ Cancer Research Centre, University of Leicester, Leicester, United Kingdom.
²⁹ Independent Cancer Patients' Voice, London, United Kingdom.
³⁰ Department of Radiation Oncology, Complexo Hospitalario Universitario de Santiago, Servicio Galego de Saúde (SERGAS), Santiago de Compostela, Spain.
³¹ Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom.
³² Biomedical Network on Rare Diseases (CIBERER), Madrid, Spain.
³³ University of Manchester, Christie Hospital, Manchester, United Kingdom.

Abstract

Purpose: Some patients with breast cancer treated by surgery and radiation therapy experience clinically significant toxicity, which may adversely affect cosmesis and quality of life. There is a paucity of validated clinical prediction models for radiation toxicity. We used machine learning (ML) algorithms to develop and optimize a clinical prediction model for acute breast desquamation after whole breast external beam radiation therapy in the prospective multicenter REQUITE cohort study.

Methods and materials: Using demographic and treatment-related features (m = 122) from patients (n = 2058) at 26 centers, we trained 8 ML algorithms with 10-fold cross-validation in a 50:50 random-split data set with class stratification to predict acute breast desquamation. Based on performance in the validation data set, the logistic model tree, random forest, and naïve Bayes models were taken forward to cost-sensitive learning optimization.
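As an illustration of the 50:50 random split with class stratification, the following is a minimal sketch in plain Python, not the authors' pipeline. The function name `stratified_half_split`, the seed, and the synthetic label vector are assumptions for illustration; the sketch also assumes that the 192 reported events are counted among all 2058 patients.

```python
import random
from collections import defaultdict


def stratified_half_split(labels, seed=0):
    """Split indices 50:50 so each class is divided as evenly as possible.

    Hypothetical helper for illustration; returns (train_idx, valid_idx).
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)

    train_idx, valid_idx = [], []
    for idx in by_class.values():
        rng.shuffle(idx)              # random assignment within each class
        half = len(idx) // 2
        train_idx.extend(idx[:half])  # first half goes to training
        valid_idx.extend(idx[half:])  # remainder goes to validation
    return sorted(train_idx), sorted(valid_idx)


# Synthetic labels (assumption): 192 events among 2058 patients.
labels = [1] * 192 + [0] * (2058 - 192)
train_idx, valid_idx = stratified_half_split(labels)

n_train_pos = sum(labels[i] for i in train_idx)
n_valid_pos = sum(labels[i] for i in valid_idx)
print(len(train_idx), len(valid_idx), n_train_pos, n_valid_pos)
```

Under these assumptions each half holds 1029 patients, 96 of them with desquamation, so the roughly 9% event rate is preserved in both the training and validation sets.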

Results: One hundred and ninety-two patients experienced acute desquamation. Resampling and cost-sensitive learning optimization improved classification performance. Based on maximizing sensitivity (true positives), the "hero" model was the cost-sensitive random forest algorithm with a false-negative:false-positive misclassification penalty of 90:1 containing m = 114 predictive features. Model sensitivity and specificity were 0.77 and 0.66, respectively, with an area under the curve of 0.77 in the validation cohort.
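To make the effect of a 90:1 false-negative:false-positive penalty concrete, here is a minimal sketch of one common way to apply a misclassification cost ratio: minimum-expected-cost thresholding of predicted probabilities. This is an assumption for illustration and is not necessarily how the authors' cost-sensitive random forest was implemented. The function names, predicted probabilities, and labels below are hypothetical toy values, not study data.

```python
def min_cost_threshold(cost_fn, cost_fp):
    """Probability cutoff that minimizes expected misclassification cost.

    Predict positive when p * cost_fn > (1 - p) * cost_fp,
    which rearranges to p > cost_fp / (cost_fp + cost_fn).
    """
    return cost_fp / (cost_fp + cost_fn)


def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary 0/1 labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


# 90:1 penalty as in the abstract; a false negative costs 90 times a false positive.
threshold = min_cost_threshold(cost_fn=90, cost_fp=1)

# Hypothetical toy predictions (not study data).
y_true = [1, 1, 1, 0, 0, 0]
probs = [0.30, 0.02, 0.005, 0.04, 0.008, 0.001]
y_pred = [1 if p > threshold else 0 for p in probs]

sens, spec = sensitivity_specificity(y_true, y_pred)
print(round(threshold, 4), y_pred, round(sens, 2), round(spec, 2))
```

With a 90:1 penalty the cutoff falls to 1/91 (about 0.011), so even low predicted risks are flagged as positive. This is the mechanism by which a heavy false-negative penalty trades specificity for sensitivity.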

Conclusions: ML algorithms with resampling and cost-sensitive learning generated clinically valid prediction models for acute desquamation using patient demographic and treatment features. Further external validation and inclusion of genomic markers in ML prediction models are worthwhile, to identify patients at increased risk of toxicity who may benefit from supportive intervention or even a change in treatment plan.