Pan-tumor T-lymphocyte detection using deep neural networks: Recommendations for transfer learning in immunohistochemistry

Frauke Wilm; Christian Ihling; Gábor Méhes; Luigi Terracciano; Chloé Puget; Robert Klopfleisch; Peter Schüffler; Marc Aubreville; Andreas Maier; Thomas Mrowiec; Katharina Breininger

doi:10.1016/j.jpi.2023.100301

J Pathol Inform. 2023 Feb 27:14:100301. doi: 10.1016/j.jpi.2023.100301. eCollection 2023.

Authors

Frauke Wilm¹,²,³, Christian Ihling², Gábor Méhes⁴, Luigi Terracciano⁵, Chloé Puget⁶, Robert Klopfleisch⁶, Peter Schüffler⁷,⁸, Marc Aubreville⁹, Andreas Maier¹, Thomas Mrowiec², Katharina Breininger³

Affiliations

¹ Pattern Recognition Lab, Department of Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.
² Merck Healthcare KGaA, Darmstadt, Germany.
³ Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.
⁴ Department of Pathology, University of Debrecen, Debrecen, Hungary.
⁵ Research Department Pathology, Universitätsspital Basel, Basel, Switzerland.
⁶ Institute of Veterinary Pathology, Freie Universität Berlin, Berlin, Germany.
⁷ Institute of General and Surgical Pathology, Technical University of Munich, Munich, Germany.
⁸ School of Computation, Information and Technology, Technical University of Munich, Munich, Germany.
⁹ Technische Hochschule Ingolstadt, Ingolstadt, Germany.

Abstract

The success of immuno-oncology treatments promises long-term cancer remission for an increasing number of patients. The response to checkpoint inhibitor drugs has shown a correlation with the presence of immune cells in the tumor and tumor microenvironment. An in-depth understanding of the spatial localization of immune cells is therefore critical for understanding the tumor's immune landscape and predicting drug response. Computer-aided systems are well suited for efficiently quantifying immune cells in their spatial context. Conventional image analysis approaches are often based on color features and therefore require a high level of manual interaction. More robust image analysis methods based on deep learning are expected to decrease this reliance on human interaction and improve the reproducibility of immune cell scoring. However, these methods require sufficient training data, and previous work has reported low robustness of these algorithms when they are tested on out-of-distribution data from different pathology labs or on samples from different organs. In this work, we used a new image analysis pipeline to explicitly evaluate the robustness of marker-labeled lymphocyte quantification algorithms depending on the number of training samples before and after being transferred to a new tumor indication. For these experiments, we adapted the RetinaNet architecture for the task of T-lymphocyte detection and employed transfer learning to bridge the domain gap between tumor indications and reduce the annotation costs for unseen domains. On our test set, we achieved human-level performance for almost all tumor indications with an average precision of 0.74 in-domain and 0.72–0.74 cross-domain. From our results, we derive recommendations for model development regarding annotation extent, training sample selection, and label extraction for the development of robust algorithms for immune cell scoring. By extending the task of marker-labeled lymphocyte quantification to a multi-class detection task, the prerequisite for subsequent analyses, e.g., distinguishing lymphocytes in the tumor stroma from tumor-infiltrating lymphocytes, is met.
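The abstract reports detection quality as average precision. As a rough illustration of how such a value can be computed from ranked detections, the following minimal sketch uses all-point interpolation of the precision-recall curve. It is not the authors' evaluation code: the function name `average_precision`, the input format, and the interpolation variant are assumptions, and the criterion that decides whether a detection counts as a true positive (for example, an overlap threshold with an annotated cell) is not stated in the abstract and is taken here as already decided.

```python
from typing import List, Tuple


def average_precision(
    detections: List[Tuple[float, bool]], num_ground_truth: int
) -> float:
    """Area under the interpolated precision-recall curve.

    detections: hypothetical (confidence, is_true_positive) pairs, where the
        matching of detections to annotated cells has already been done.
    num_ground_truth: number of annotated cells in the evaluated images.
    """
    if num_ground_truth == 0:
        return 0.0

    # Rank detections from most to least confident.
    ranked = sorted(detections, key=lambda d: d[0], reverse=True)

    precisions: List[float] = []
    recalls: List[float] = []
    true_pos = 0
    false_pos = 0
    for _, is_tp in ranked:
        if is_tp:
            true_pos += 1
        else:
            false_pos += 1
        precisions.append(true_pos / (true_pos + false_pos))
        recalls.append(true_pos / num_ground_truth)

    # Interpolate: precision at each rank becomes the maximum precision
    # reached at that rank or any later (higher-recall) rank.
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Sum precision weighted by each increase in recall.
    ap = 0.0
    prev_recall = 0.0
    for precision, recall in zip(precisions, recalls):
        ap += (recall - prev_recall) * precision
        prev_recall = recall
    return ap


# Toy example (made-up values, not data from the paper):
# three detections, two annotated cells.
example = [(0.9, True), (0.8, False), (0.7, True)]
ap_example = average_precision(example, num_ground_truth=2)
```

In this toy example, the false positive ranked between the two correct detections lowers the precision at full recall, so the result falls below 1.0. Missed cells enter only through `num_ground_truth`, which caps the recall that can be reached.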

Keywords: Deep learning; Domain adaptation; Immuno-oncology; Immunohistochemistry; Transfer learning; Tumor-infiltrating lymphocytes.