DA-TransUNet: integrating spatial and channel dual attention with transformer U-net for medical image segmentation

Guanqun Sun; Yizhi Pan; Weikun Kong; Zichang Xu; Jianhua Ma; Teeradaj Racharak; Le-Minh Nguyen; Junyi Xin

doi:10.3389/fbioe.2024.1398237

DA-TransUNet: integrating spatial and channel dual attention with transformer U-net for medical image segmentation

Front Bioeng Biotechnol. 2024 May 16:12:1398237. doi: 10.3389/fbioe.2024.1398237. eCollection 2024.

Authors

Guanqun Sun^{1

2}, Yizhi Pan¹, Weikun Kong³, Zichang Xu⁴, Jianhua Ma⁵, Teeradaj Racharak², Le-Minh Nguyen², Junyi Xin^{1

6

7}

Affiliations

¹ School of Information Engineering, Hangzhou Medical College, Hangzhou, China.
² School of Information Science, Japan Advanced Institute of Science and Technology, Nomi, Japan.
³ Department of Electronic Engineering, Tsinghua University, Beijing, China.
⁴ Department of Systems Immunology, Immunology Frontier Research Institute (IFReC), Osaka University, Suita, Japan.
⁵ Faculty of Computer and Information Sciences, Hosei University, Tokyo, Japan.
⁶ Zhejiang Engineering Research Center for Brain Cognition and Brain Diseases Digital Medical Instruments, Hangzhou Medical College, Hangzhou, China.
⁷ Academy for Advanced Interdisciplinary Studies of Future Health, Hangzhou Medical College, Hangzhou, China.

Abstract

Accurate medical image segmentation is critical for disease quantification and treatment evaluation. While traditional U-Net architectures and their transformer-integrated variants excel in automated segmentation tasks. Existing models also struggle with parameter efficiency and computational complexity, often due to the extensive use of Transformers. However, they lack the ability to harness the image's intrinsic position and channel features. Research employing Dual Attention mechanisms of position and channel have not been specifically optimized for the high-detail demands of medical images. To address these issues, this study proposes a novel deep medical image segmentation framework, called DA-TransUNet, aiming to integrate the Transformer and dual attention block (DA-Block) into the traditional U-shaped architecture. Also, DA-TransUNet tailored for the high-detail requirements of medical images, optimizes the intermittent channels of Dual Attention (DA) and employs DA in each skip-connection to effectively filter out irrelevant information. This integration significantly enhances the model's capability to extract features, thereby improving the performance of medical image segmentation. DA-TransUNet is validated in medical image segmentation tasks, consistently outperforming state-of-the-art techniques across 5 datasets. In summary, DA-TransUNet has made significant strides in medical image segmentation, offering new insights into existing techniques. It strengthens model performance from the perspective of image features, thereby advancing the development of high-precision automated medical image diagnosis. The codes and parameters of our model will be publicly available at https://github.com/SUN-1024/DA-TransUnet.

Keywords: U-net; deep learning; dual attention; medical image segmentation; transformer.

Grants and funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by The Soft Science Research Planning Project of Zhejiang Province under Grant 2024C35064 for the project “Study on Performance Evaluation and Optimization Path of Digital Aging Transformation Driven by User Experience.”