Exploring vision transformers and XGBoost as deep learning ensembles for transforming carcinoma recognition

Sci Rep. 2024 Dec 3;14(1):30052. doi: 10.1038/s41598-024-81456-1.

Abstract

Early detection of colorectal carcinoma (CRC), one of the most prevalent forms of cancer worldwide, significantly enhances the prognosis of patients. This research presents a new method for improving CRC detection using a deep learning ensemble with the Computer Aided Diagnosis (CADx). The method involves combining pre-trained convolutional neural network (CNN) models, such as ADaRDEV2I-22, DaRD-22, and ADaDR-22, using Vision Transformers (ViT) and XGBoost. The study addresses the challenges associated with imbalanced datasets and the necessity of sophisticated feature extraction in medical image analysis. Initially, the CKHK-22 dataset comprised 24 classes. However, we refined it to 14 classes, which led to an improvement in data balance and quality. This improvement enabled more precise feature extraction and improved classification results. We created two ensemble models: the first model used Vision Transformers to capture long-range spatial relationships in the images, while the second model combined CNNs with XGBoost to facilitate structured data classification. We implemented DCGAN-based augmentation to enhance the dataset's diversity. The tests showed big improvements in performance, with the ADaDR-22 + Vision Transformer group getting the best results, with a testing accuracy of 93.4% and an AUC of 98.8%. In contrast, the ADaDR-22 + XGBoost model had an AUC of 97.8% and an accuracy of 92.2%. These findings highlight the efficacy of the proposed ensemble models in detecting CRC and highlight the importance of using well-balanced, high-quality datasets. The proposed method significantly enhances the clinical diagnostic accuracy and the capabilities of medical image analysis or early CRC detection.

Keywords: CKHK-22 dataset; Colorectal Carcinoma (CRC); Ensemble models; Integrated CNNs; Vision Transformers; XGBoost.

MeSH terms

  • Colorectal Neoplasms* / diagnosis
  • Databases, Factual
  • Deep Learning*
  • Diagnosis, Computer-Assisted / methods
  • Early Detection of Cancer / methods
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Neural Networks, Computer