A robust image segmentation and synthesis pipeline for histopathology

Muhammad Jehanzaib; Yasin Almalioglu; Kutsev Bengisu Ozyoruk; Drew F K Williamson; Talha Abdullah; Kayhan Basak; Derya Demir; G Evren Keles; Kashif Zafar; Mehmet Turan

doi:10.1016/j.media.2024.103344

A robust image segmentation and synthesis pipeline for histopathology

Med Image Anal. 2025 Jan:99:103344. doi: 10.1016/j.media.2024.103344. Epub 2024 Sep 11.

Authors

Affiliations

¹ Department of Computer Engineering, Bogazici University, Istanbul, Turkey; Department of Computer Science, FAST-NUCES, Lahore, Pakistan.
² Computer Science Department, Oxford University, England, United Kingdom.
³ National Cancer Institute, Bethesda, MD, USA.
⁴ Department of Pathology, Brigham and Women's Hospital, USA; Harvard Medical School, Boston, MA, USA.
⁵ Saglık Bilimleri University, Kartal Dr.Lutfi Kırdar City Hospital, Department of Pathology, Istanbul, Turkey.
⁶ Faculty of Medicine, Department of Pathology, Ege University, Izmir, Turkey.
⁷ Virasoft Corporation, New York, NY, USA.
⁸ Department of Computer Science, FAST-NUCES, Lahore, Pakistan.
⁹ Department of Computer Engineering, Bogazici University, Istanbul, Turkey. Electronic address: mehmet.turan@boun.edu.tr.

PMID: 39265361
DOI: 10.1016/j.media.2024.103344

Abstract

Significant diagnostic variability between and within observers persists in pathology, despite the fact that digital slide images provide the ability to measure and quantify features much more precisely compared to conventional methods. Automated and accurate segmentation of cancerous cell and tissue regions can streamline the diagnostic process, providing insights into the cancer progression, and helping experts decide on the most effective treatment. Here, we evaluate the performance of the proposed PathoSeg model, with an architecture comprising of a modified HRNet encoder and a UNet++ decoder integrated with a CBAM block to utilize attention mechanism for an improved segmentation capability. We demonstrate that PathoSeg outperforms the current state-of-the-art (SOTA) networks in both quantitative and qualitative assessment of instance and semantic segmentation. Notably, we leverage the use of synthetic data generated by PathopixGAN, which effectively addresses the data imbalance problem commonly encountered in histopathology datasets, further improving the performance of PathoSeg. It utilizes spatially adaptive normalization within a generative and discriminative mechanism to synthesize diverse histopathological environments dictated through semantic information passed through pixel-level annotated Ground Truth semantic masks.Besides, we contribute to the research community by providing an in-house dataset that includes semantically segmented masks for breast carcinoma tubules (BCT), micro/macrovesicular steatosis of the liver (MSL), and prostate carcinoma glands (PCG). In the first part of the dataset, we have a total of 14 whole slide images from 13 patients' liver, with fat cell segmented masks, totaling 951 masks of size 512 × 512 pixels. In the second part, it includes 17 whole slide images from 13 patients with prostate carcinoma gland segmentation masks, amounting to 30,000 masks of size 512 × 512 pixels. In the third part, the dataset contains 51 whole slides from 36 patients, with breast carcinoma tubule masks totaling 30,000 masks of size 512 × 512 pixels. To ensure transparency and encourage further research, we will make this dataset publicly available for non-commercial and academic purposes. To facilitate reproducibility and encourage further research, we will also make our code and pre-trained models publicly available at https://github.com/DeepMIALab/PathoSeg.

Keywords: Deep learning; Instance segmentation; Semantic segmentation; Synthetic data.

MeSH terms

Algorithms
Breast Neoplasms / diagnostic imaging
Breast Neoplasms / pathology
Female
Humans
Image Interpretation, Computer-Assisted* / methods
Image Processing, Computer-Assisted / methods
Male
Prostatic Neoplasms / diagnostic imaging
Prostatic Neoplasms / pathology