Lung cancer lesion detection in histopathology images using graph-based sparse PCA network

Sundaresh Ram; Wenfei Tang; Alexander J Bell; Ravi Pal; Cara Spencer; Alexander Buschhaus; Charles R Hatt; Marina Pasca diMagliano; Alnawaz Rehemtulla; Jeffrey J Rodríguez; Stefanie Galban; Craig J Galban

doi:10.1016/j.neo.2023.100911

Neoplasia. 2023 Aug:42:100911. doi: 10.1016/j.neo.2023.100911. Epub 2023 Jun 1.

Affiliations

¹ Departments of Radiology, and Biomedical Engineering, University of Michigan, Ann Arbor, MI 48109, USA. Electronic address: sundarer@umich.edu.
² Department of Computer Science and Engineering, University of Michigan, Ann Arbor, MI 48109, USA.
³ Departments of Radiology, and Biomedical Engineering, University of Michigan, Ann Arbor, MI 48109, USA.
⁴ Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA.
⁵ Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.
⁶ Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA; Imbio LLC, Minneapolis, MN 55405, USA.
⁷ Departments of Surgery, and Cell and Developmental Biology, University of Michigan, Ann Arbor, MI 48109, USA.
⁸ Departments of Radiology, and Radiation Oncology, University of Michigan, Ann Arbor, MI 48109, USA.
⁹ Departments of Electrical and Computer Engineering, and Biomedical Engineering, The University of Arizona, Tucson, AZ 85721, USA.

Abstract

Early detection of lung cancer is critical for improving patient survival. To address the clinical need for efficacious treatments, genetically engineered mouse models (GEMMs) have become integral to identifying and evaluating the molecular underpinnings of this complex disease that may be exploited as therapeutic targets. Assessing GEMM tumor burden on histopathological sections by manual inspection is both time-consuming and prone to subjective bias. Computer-aided diagnostic tools are therefore needed for accurate and efficient analysis of these histopathology images. In this paper, we propose a simple machine learning approach, the graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained with hematoxylin and eosin (H&E). Our method comprises four steps: 1) cascaded graph-based sparse PCA, 2) PCA binary hashing, 3) block-wise histograms, and 4) support vector machine (SVM) classification. In the proposed architecture, graph-based sparse PCA is used to learn the filter banks of the multiple stages of a convolutional network. This is followed by PCA binary hashing and block-wise histograms for indexing and pooling. The features extracted by the GS-PCA network are then fed to an SVM classifier. We evaluate the proposed algorithm on H&E slides obtained from an inducible K-ras^G12D lung cancer mouse model using precision/recall rates, F_β-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operating characteristic (ROC), and show that our algorithm is efficient and provides improved detection accuracy compared to existing algorithms.
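
As a rough illustration of the feature-extraction steps named in the abstract, the following is a minimal single-stage sketch in Python with NumPy. It is not the authors' implementation: ordinary PCA stands in for graph-based sparse PCA, only one filter stage is shown rather than a cascade, and all function names, the patch size, the number of filters, and the block size are assumptions chosen for illustration. The resulting feature vectors would then be passed to an SVM classifier (for example scikit-learn's `SVC`), which is omitted here.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def extract_patches(img, k):
    """Return every k x k patch of a 2-D image as a row, with the patch mean removed."""
    patches = sliding_window_view(img, (k, k)).reshape(-1, k * k)
    return patches - patches.mean(axis=1, keepdims=True)


def learn_pca_filters(images, k, n_filters):
    """Learn k x k filters as the leading principal components of image patches.

    ASSUMPTION: plain PCA is used here in place of graph-based sparse PCA.
    """
    X = np.vstack([extract_patches(im, k) for im in images])
    cov = X.T @ X / X.shape[0]
    _, eigvecs = np.linalg.eigh(cov)          # eigenvalues in ascending order
    top = eigvecs[:, ::-1][:, :n_filters]     # leading components first
    return top.T.reshape(n_filters, k, k)


def filter_responses(img, filters):
    """Correlate the image with each filter ('valid' region only)."""
    k = filters.shape[-1]
    windows = sliding_window_view(img, (k, k))          # (H-k+1, W-k+1, k, k)
    return np.einsum("ijkl,fkl->fij", windows, filters)  # (n_filters, H-k+1, W-k+1)


def binary_hash(responses):
    """Binarize each response map at zero and pack the bits into one integer map."""
    bits = (responses > 0).astype(np.int64)
    weights = 2 ** np.arange(bits.shape[0])
    return np.tensordot(weights, bits, axes=1)


def block_histograms(code_map, block, n_bins):
    """Concatenate histograms of hash codes over non-overlapping blocks."""
    h, w = code_map.shape
    feats = [
        np.bincount(code_map[r:r + block, c:c + block].ravel(), minlength=n_bins)
        for r in range(0, h - block + 1, block)
        for c in range(0, w - block + 1, block)
    ]
    return np.concatenate(feats)


def extract_features(img, filters, block):
    """Single-stage pipeline: filtering, binary hashing, block-wise histograms."""
    codes = binary_hash(filter_responses(img, filters))
    return block_histograms(codes, block, n_bins=2 ** filters.shape[0])
```

With, say, four 5 x 5 filters learned from a handful of grayscale tiles, `extract_features(tile, filters, block=7)` yields one fixed-length histogram vector per tile. A multi-stage variant would repeat the filter-learning step on the response maps of the previous stage before hashing.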

Keywords: Cancer lesion detection; Computational imaging; Graph-based sparse PCA; Image analysis; Machine learning.

Publication types

Research Support, Non-U.S. Gov't
Research Support, N.I.H., Extramural

MeSH terms

Algorithms*
Animals
Lung
Lung Neoplasms* / diagnosis
Machine Learning
Mice
Treatment Outcome

Grants and funding

R01 HL139690/HL/NHLBI NIH HHS/United States