Background: This study aimed to design a pulmonary ground-glass nodule (GGN) classification method based on computed tomography (CT) radiomics and machine learning to predict invasiveness in early-stage ground-glass opacity (GGO) pulmonary adenocarcinoma.
Methods: This retrospective study included patients with pulmonary GGNs that were histologically confirmed as adenocarcinoma in situ (AIS), minimally invasive adenocarcinoma (MIA), or invasive adenocarcinoma (IAC) from 2020 to 2023. CT images of all patients were automatically segmented, and 107 radiomic features were extracted for each patient. Classification models were developed using random forest (RF) with cross-validation, including three one-versus-others models and one three-class model. For each model, features were ranked by normalized Gini importance, and the minimal subset whose cumulative importance exceeded 0.9 was selected. These selected features were then used to train the final models. Performance metrics, including area under the curve (AUC), accuracy, sensitivity, and specificity, were computed for each model. AUC and accuracy were compared to determine the optimal method.
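The feature-selection rule described above (rank by normalized importance, keep the smallest top-ranked subset whose cumulative importance exceeds 0.9) can be sketched as follows. This is a minimal illustration under stated assumptions, not the study's implementation: the function name `select_min_subset` and the feature names in the usage example are hypothetical, and the importance values are made up for demonstration.

```python
def select_min_subset(importances, threshold=0.9):
    """Return the smallest list of top-ranked features whose cumulative
    normalized importance exceeds `threshold`.

    `importances` maps feature name -> non-negative importance score
    (for example, Gini importances from a trained random forest).
    """
    total = sum(importances.values())
    if total <= 0:
        raise ValueError("importances must sum to a positive value")

    # Rank features from most to least important.
    ranked = sorted(importances.items(), key=lambda kv: kv[1], reverse=True)

    selected = []
    cumulative = 0.0
    for name, value in ranked:
        selected.append(name)
        cumulative += value / total  # normalize so all scores sum to 1
        if cumulative > threshold:
            break
    return selected


# Hypothetical importances for four placeholder features (illustrative only).
example_importances = {
    "feature_a": 0.50,
    "feature_b": 0.30,
    "feature_c": 0.15,
    "feature_d": 0.05,
}
example_selected = select_min_subset(example_importances, threshold=0.9)
```

With these made-up scores, the cumulative importance passes 0.9 only once the third-ranked feature is added, so the fourth feature is dropped.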
Results: The study comprised 193 patients (mean age 54 ± 11 years, 65 men), including 65 with AIS, 54 with MIA, and 74 with IAC, divided into a training cohort (N = 154) and a test cohort (N = 39). The final three-class RF model outperformed the three individual one-versus-others models in distinguishing each class from the other two. For the multiclass classification model, the AUC, accuracy, sensitivity, and specificity were 0.87, 0.79, 0.62, and 0.88 for AIS; 0.90, 0.79, 0.54, and 0.89 for MIA; and 0.87, 0.69, 0.73, and 0.67 for IAC, respectively.
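Per-class accuracy, sensitivity, and specificity for a three-class model are commonly obtained by treating each class in turn as "positive" and the other two as "negative". The sketch below shows that one-versus-rest calculation from a confusion matrix. It is an assumption about how such per-class figures are typically derived, not a description of the study's code; the function name `one_vs_rest_metrics` is hypothetical, and the confusion matrix is invented for illustration and does not reproduce the study's data.

```python
def one_vs_rest_metrics(confusion, class_index):
    """Compute one-versus-rest sensitivity, specificity, and accuracy.

    `confusion[i][j]` is the number of samples whose true class is i
    and whose predicted class is j.
    """
    n_classes = len(confusion)
    total = sum(sum(row) for row in confusion)

    tp = confusion[class_index][class_index]
    fn = sum(confusion[class_index]) - tp  # true class missed
    fp = sum(confusion[r][class_index] for r in range(n_classes)) - tp
    tn = total - tp - fn - fp

    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("class must have both positive and negative samples")

    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / total,
    }


# Invented 3x3 confusion matrix (rows: true class, columns: predicted class).
example_confusion = [
    [8, 2, 0],
    [1, 6, 3],
    [0, 2, 8],
]
example_metrics = one_vs_rest_metrics(example_confusion, class_index=0)
```

Because the negatives pool two classes, specificity and accuracy for a given class can be high even when its sensitivity is modest.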
Conclusions: A radiomics-based multiclass RF model could effectively differentiate the three types of pulmonary GGN, which may facilitate early diagnosis of GGO pulmonary adenocarcinoma.
Keywords: Computed tomography; Ground-glass nodules; Lung adenocarcinoma; Machine learning; Radiomics.
© 2024. The Author(s).