Unleashing the strengths of unlabelled data in deep learning-assisted pan-cancer abdominal organ quantification: the FLARE22 challenge

Jun Ma; Yao Zhang; Song Gu; Cheng Ge; Shihao Mae; Adamo Young; Cheng Zhu; Xin Yang; Kangkang Meng; Ziyan Huang; Fan Zhang; Yuanke Pan; Shoujin Huang; Jiacheng Wang; Mingze Sun; Rongguo Zhang; Dengqiang Jia; Jae Won Choi; Natália Alves; Bram de Wilde; Gregor Koehler; Haoran Lai; Ershuai Wang; Manuel Wiesenfarth; Qiongjie Zhu; Guoqiang Dong; Jian He; FLARE Challenge Consortium; Bo Wang

doi:10.1016/S2589-7500(24)00154-7

Unleashing the strengths of unlabelled data in deep learning-assisted pan-cancer abdominal organ quantification: the FLARE22 challenge

Lancet Digit Health. 2024 Nov;6(11):e815-e826. doi: 10.1016/S2589-7500(24)00154-7.

Authors

Jun Ma¹, Yao Zhang², Song Gu³, Cheng Ge⁴, Shihao Mae⁵, Adamo Young⁶, Cheng Zhu⁷, Xin Yang⁸, Kangkang Meng⁹, Ziyan Huang¹⁰, Fan Zhang¹¹, Yuanke Pan¹², Shoujin Huang¹², Jiacheng Wang¹³, Mingze Sun¹⁴, Rongguo Zhang¹⁵, Dengqiang Jia¹⁶, Jae Won Choi¹⁷, Natália Alves¹⁸, Bram de Wilde¹⁸, Gregor Koehler¹⁹, Haoran Lai²⁰, Ershuai Wang²¹, Manuel Wiesenfarth²², Qiongjie Zhu²³, Guoqiang Dong²⁴, Jian He²⁵; FLARE Challenge Consortium; Bo Wang³⁵

Collaborators

FLARE Challenge Consortium:
Junjun He²⁶, Hua Yang²⁷, Bingding Huang²⁸, Mengye Lyu²⁹, Yongkang Ma³⁰, Heng Guo³¹, Weixin Xu³², Klaus Maier-Hein³³, Yajun Wu³⁴

Affiliations

¹ University Health Network, Toronto, ON, Canada; Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada; Vector Institute, Toronto, ON, Canada.
² AI Lab, Lenovo Research, Beijing, China.
³ Department of Image Reconstruction, Nanjing Anke Medical Technology, Nanjing, China.
⁴ School of Medicine and Pharmacy, Ocean University of China, Qingdao, China.
⁵ University Health Network, Toronto, ON, Canada; Department of Computer Science, University of Toronto, Toronto, ON, Canada; Vector Institute, Toronto, ON, Canada.
⁶ Department of Computer Science, University of Toronto, Toronto, ON, Canada; Vector Institute, Toronto, ON, Canada; Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada.
⁷ Beijing Tinavi Medical Technologies, Beijing, China.
⁸ School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China.
⁹ University of Science and Technology Beijing, Beijing, China.
¹⁰ Institute of Medical Robotics, Shanghai Jiao Tong University, Shanghai, China; Shanghai AI Laboratory, Shanghai, China.
¹¹ Department of Radiological Algorithm, Fosun Aitrox Information Technology, Shanghai, China.
¹² Shenzhen Technology University, Shenzhen, China.
¹³ Xiamen University, Xiamen, China.
¹⁴ Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, Shenzhen, China.
¹⁵ Academy for Multidisciplinary Studies, Capital Normal University, Beijing, China.
¹⁶ Hong Kong Centre for Cerebro-cardiovascular Health Engineering, Hong Kong Special Administrative Region, China.
¹⁷ Department of Radiology, Seoul National University Hospital, Seoul, South Korea.
¹⁸ Department of Radiology, Radboud University Medical Center, Nijmegen, Netherlands.
¹⁹ Division of Medical Image Computing, German Cancer Research Center Heidelberg, Heidelberg, Germany; German Cancer Research Center Heidelberg, Heidelberg, Germany; Helmholtz Information and Data Science School for Health, Karlsruhe/Heidelberg, Germany.
²⁰ Southern Medical University, Guangzhou, China.
²¹ Shenzhen Yorktal DMIT, Shenzhen, China.
²² Division of Biostatistics, German Cancer Research Center Heidelberg, Heidelberg, Germany.
²³ Department of Radiology, Shidong Hospital affiliated to University of Shanghai for Science and Technology, Shanghai, China.
²⁴ Department of Interventional Radiology, The Second Affiliated Hospital of Bengbu Medical University, Bengbu, China.
²⁵ Department of Nuclear Medicine, Nanjing Drum Tower Hospital, Nanjing, China.
²⁶ Shanghai AI Laboratory, Shanghai, China.
²⁷ Department of Radiological Algorithm, Fosun Aitrox Information Technology Co., Ltd., Shanghai, China.
²⁸ College of Big Data and Internet, Shenzhen Technology University, Shenzhen, China.
²⁹ College of Health Science and Environmental Engineering, Shenzhen Technology University, Shenzhen, China.
³⁰ Manteia Technologies Co., Ltd., Xiamen, China.
³¹ Alibaba DAMO Academy, Hangzhou, China.
³² Infervision Medical Technology Co., Ltd., Beijing, China.
³³ Department of Medical Image Computing, German Cancer Research Center, Heidelberg, Germany; with the Pattern Analysis and Learning Group, Department of Radiation Oncology, Heidelberg University Hospital, Heidelberg, Germany.
³⁴ Department of Research and Development, ShenZhen Yorktal DMIT Co., Ltd., Beijing, China.
³⁵ University Health Network, Toronto, ON, Canada; Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada; Department of Computer Science, University of Toronto, Toronto, ON, Canada; Vector Institute, Toronto, ON, Canada; University Health Network AI Hub, Toronto, ON, Canada. Electronic address: bowang@vectorinstitute.ai.

PMID: 39455194
DOI: 10.1016/S2589-7500(24)00154-7

Abstract

Deep learning has shown great potential to automate abdominal organ segmentation and quantification. However, most existing algorithms rely on expert annotations and do not have comprehensive evaluations in real-world multinational settings. To address these limitations, we organised the FLARE 2022 challenge to benchmark fast, low-resource, and accurate abdominal organ segmentation algorithms. We first constructed an intercontinental abdomen CT dataset from more than 50 clinical research groups. We then independently validated that deep learning algorithms achieved a median dice similarity coefficient (DSC) of 90·0% (IQR 87·4-91·3%) by use of 50 labelled images and 2000 unlabelled images, which can substantially reduce manual annotation costs. The best-performing algorithms successfully generalised to holdout external validation sets, achieving a median DSC of 89·4% (85·2-91·3%), 90·0% (84·3-93·0%), and 88·5% (80·9-91·9%) on North American, European, and Asian cohorts, respectively. These algorithms show the potential to use unlabelled data to boost performance and alleviate annotation shortages for modern artificial intelligence models.

Publication types

Review

MeSH terms

Abdomen / diagnostic imaging
Algorithms*
Deep Learning*
Humans
Tomography, X-Ray Computed*