Background: Obstructive sleep apnea (OSA) is a highly prevalent sleep disorder associated with daytime sleepiness, fatigue, and increased all-cause mortality risk in patients with cancer. Existing screening tools for OSA do not account for the interaction of cancer-related features that may increase OSA risk.
Study design and methods: This is a retrospective study of patients with cancer at a single tertiary cancer institution who underwent a home sleep apnea test (HSAT) to evaluate for OSA. Unsupervised machine learning (ML) was used for dimensionality reduction and to extract significant features associated with OSA. ML classifiers were applied to the principal components, and model hyperparameters were optimized using k-fold cross-validation. The trained models for OSA were subsequently tested and compared with the STOP-Bang questionnaire on a prospective unseen test set of patients who underwent an HSAT.
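The workflow described in the methods (dimension reduction, a classifier applied to the components, and hyperparameters tuned by k-fold cross-validation) can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the authors' implementation: the data are synthetic, and the kernel choice, the parameter grid, the number of folds, the scoring metric, and the held-out split are all assumptions made for the example.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import KernelPCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, StratifiedKFold, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic placeholder data (NOT the study data); sizes are arbitrary.
X, y = make_classification(
    n_samples=300, n_features=20, n_informative=8, random_state=0
)
# A random held-out split stands in for an unseen test set (assumption).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Scale -> kernel PCA (8 components) -> random forest classifier.
pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("kpca", KernelPCA(n_components=8, kernel="rbf")),  # kernel is assumed
    ("rf", RandomForestClassifier(random_state=0)),
])

# Hypothetical search grid; the abstract does not list the tuned values.
param_grid = {
    "kpca__gamma": [0.01, 0.1],
    "rf__n_estimators": [50, 100],
    "rf__max_depth": [3, None],
}

# 5-fold stratified cross-validation (the value of k is assumed).
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
search = GridSearchCV(pipeline, param_grid, cv=cv, scoring="roc_auc")
search.fit(X_train, y_train)

# ROC-AUC of the refit best pipeline on the held-out split.
test_auc = search.score(X_test, y_test)
print(search.best_params_, round(test_auc, 3))
```

Placing the scaler and kernel PCA inside the pipeline means they are refit on each training fold, so the cross-validated scores are not contaminated by information from the validation fold.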
Results: From a training dataset of 249 patients, kernel principal component analysis (PCA) reduced the feature set to eight components that explained 98% of the variance. Predictors of OSA were smoking, asthma, chronic kidney disease, STOP-Bang score, race, diabetes, radiation to head/neck/thorax (RT-HNT), type of cancer, and cancer metastases. Of the ML models, PCA + random forest (RF) had the highest sensitivity (96.8%), specificity (92.3%), negative predictive value (92%), F1 score (0.93), and ROC-AUC score (0.88). The PCA + RF screening algorithm also performed better than the STOP-Bang questionnaire alone when tested on a prospective unseen test set.
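For readers less familiar with the screening metrics reported above, the following sketch shows how sensitivity, specificity, negative predictive value, and F1 score are derived from a confusion matrix. The function name `screening_metrics` and the counts passed to it are hypothetical and chosen only for illustration; they are not the study's data.

```python
def screening_metrics(tp, fp, tn, fn):
    """Compute common screening metrics from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)   # true positive rate (recall)
    specificity = tn / (tn + fp)   # true negative rate
    npv = tn / (tn + fn)           # negative predictive value
    precision = tp / (tp + fp)     # positive predictive value
    # F1 is the harmonic mean of precision and sensitivity.
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "npv": npv,
        "f1": f1,
    }


# Hypothetical counts for illustration only (not from the study).
metrics = screening_metrics(tp=40, fp=10, tn=45, fn=5)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

For a screening tool, sensitivity and negative predictive value are usually the quantities of most interest, since a negative screen is meant to rule out disease with few missed cases.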
Conclusions: The PCA + RF ML model had the highest accuracy in screening for OSA in patients with cancer. History of RT-HNT, cancer metastases, and type of cancer were identified as cancer-related risk factors for OSA.
Keywords: OSA-clinical assessment; cancer; machine learning.
© The Author(s) 2023. Published by Oxford University Press on behalf of Sleep Research Society.