Background and objectives: The aim of this study is to develop a radiomic and deep learning-based signature for survival analysis of patients with Non-Small Cell Lung Cancer.
Methods: Four-hundred twenty-two patients from "Lung1" dataset were included in the study. A 3D convolutional autoencoder (AE) was built and features from the latent space extracted for further analysis. Radiomic features were derived from the 3D volume of the tumor region using PyRadiomics. Both radiomic and AE-based features underwent feature selection, by removing: i) highly correlated and ii) constant features. The selected variables were then used to derive both mono-domain (radiomics, AE and clinic) and multi-domain signatures fitting a Cox Proportional Hazard model with LASSO penalization and evaluated considering the concordance (C)-index as performance metric.
Results: Both mono-domain and multi-domain signatures could significantly differentiate high risk from low risk patients. Among the mono-domain signatures, the highest hazard ratio (HR) in the test set was obtained using radiomics (HR = 1.5428) followed by the AE-based signature (HR = 1.5012) and the clinical signature (HR = 1.4770). The best overall performance was achieved by combining all three signatures, resulting in the highest HR (HR = 1.7383), while the combination of AE-based and clinical signatures yielded the highest C-index (C-index = 0.6309).
Conclusions: These preliminary results show that combining information carried by AE, radiomic and clinical domain shows potential for improving the prediction of overall survival in NSCLC patients.
Keywords: 3D autoencoder; Machine learning; Non-small cell lung cancer; Radiomics; Survival analysis.
Copyright © 2024. Published by Elsevier B.V.