Large language model use in clinical oncology

Nicolas Carl; Franziska Schramm; Sarah Haggenmüller; Jakob Nikolas Kather; Martin J Hetz; Christoph Wies; Maurice Stephan Michel; Frederik Wessels; Titus J Brinker

doi:10.1038/s41698-024-00733-4

NPJ Precis Oncol. 2024 Oct 23;8(1):240. doi: 10.1038/s41698-024-00733-4.

Affiliations

¹ Department of Digital Prevention, Diagnostics and Therapy Guidance, German Cancer Research Center (DKFZ), Heidelberg, Germany.
² Department of Urology and Urological Surgery, University Medical Center Mannheim, Ruprecht-Karls University Heidelberg, Mannheim, Germany.
³ Else Kroener Fresenius Center for Digital Health, Medical Faculty Carl Gustav Carus, Technical University Dresden, Dresden, Germany.
⁴ Medical Faculty, Ruprecht-Karls University Heidelberg, Heidelberg, Germany.
⁵ Department of Digital Prevention, Diagnostics and Therapy Guidance, German Cancer Research Center (DKFZ), Heidelberg, Germany. titus.brinker@dkfz.de.

# Contributed equally.

Abstract

Large language models (LLMs) are under intensive investigation across many healthcare domains. This systematic review and meta-analysis assesses the current applications, methodologies, and performance of LLMs in clinical oncology. A mixed-methods approach was used to extract, summarize, and compare methodological approaches and outcomes. The review includes 34 studies. LLMs are primarily evaluated on their ability to answer oncologic questions across various domains. The meta-analysis shows significant variance in performance, driven by diverse methodologies and evaluation criteria. Differences in inherent model capabilities, prompting strategies, and oncological subdomains contribute further to this heterogeneity. Because standardized, LLM-specific reporting protocols are rarely used, methodological disparities arise; these must be addressed to ensure comparability in LLM research and, ultimately, to enable the reliable integration of LLM technologies into clinical practice.