scCaT: An explainable capsulating architecture for sepsis diagnosis transferring from single-cell RNA sequencing

Xubin Zheng; Dian Meng; Duo Chen; Wan-Ki Wong; Ka-Ho To; Lei Zhu; JiaFei Wu; Yining Liang; Kwong-Sak Leung; Man-Hon Wong; Lixin Cheng

doi:10.1371/journal.pcbi.1012083

scCaT: An explainable capsulating architecture for sepsis diagnosis transferring from single-cell RNA sequencing

PLoS Comput Biol. 2024 Oct 21;20(10):e1012083. doi: 10.1371/journal.pcbi.1012083. eCollection 2024 Oct.

Authors

Xubin Zheng^{1

2

3}, Dian Meng¹, Duo Chen^{4

5}, Wan-Ki Wong⁶, Ka-Ho To⁶, Lei Zhu⁷, JiaFei Wu⁸, Yining Liang⁹, Kwong-Sak Leung^{6

10}, Man-Hon Wong⁶, Lixin Cheng²

Affiliations

¹ School of Computing and Information Technology, Great Bay University, Guangdong, China.
² Department of Critical Care Medicine, Shenzhen People's Hospital, the First Affiliated Hospital of Southern University of Science and Technology, the Second Clinical Medicine College of Jinan University, Shenzhen, China.
³ Guangdong Provincial Key Laboratory of Mathematical and Neural Dynamical Systems, Guangzhou, China.
⁴ Department of Respiratory and Critical Care Medicine, Beijing Institute of Respiratory Medicine and Beijing Chao-Yang Hospital, Capital Medical University, Beijing, China.
⁵ Department of Nosocomial Infection Prevention and Control, Beijing Institute of Respiratory Medicine and Beijing Chao-Yang Hospital, Capital Medical University, Beijing, China.
⁶ Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, Hong Kong.
⁷ Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong.
⁸ SenseTime Research, Zhejiang, China.
⁹ Southern University of Science and Technology, Guangdong, China.
¹⁰ Department of Applied Data Science, Shue Yan University, North Point, Hong Kong.

Abstract

Sepsis is a life-threatening condition characterized by an exaggerated immune response to pathogens, leading to organ damage and high mortality rates in the intensive care unit. Although deep learning has achieved impressive performance on prediction and classification tasks in medicine, it requires large amounts of data and lacks explainability, which hinder its application to sepsis diagnosis. We introduce a deep learning framework, called scCaT, which blends the capsulating architecture with Transformer to develop a sepsis diagnostic model using single-cell RNA sequencing data and transfers it to bulk RNA data. The capsulating architecture effectively groups genes into capsules based on biological functions, which provides explainability in encoding gene expressions. The Transformer serves as a decoder to classify sepsis patients and controls. Our model achieves high accuracy with an AUROC of 0.93 on the single-cell test set and an average AUROC of 0.98 on seven bulk RNA cohorts. Additionally, the capsules can recognize different cell types and distinguish sepsis from control samples based on their biological pathways. This study presents a novel approach for learning gene modules and transferring the model to other data types, offering potential benefits in diagnosing rare diseases with limited subjects.

Copyright: © 2024 Zheng et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

MeSH terms

Computational Biology* / methods
Deep Learning*
Humans
Sepsis* / diagnosis
Sepsis* / genetics
Sequence Analysis, RNA* / methods
Single-Cell Analysis* / methods

Grants and funding

This work was supported in part by National Natural Science Foundation of China (32370711 and 32300554) received by LC and XZ, Shenzhen Medical Research Fund (A2303033) received by LC and Shenzhen Science and Technology Program (JCYJ20220530152409020) received by LC. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.