Advancing medical imaging with language models: featuring a spotlight on ChatGPT

Phys Med Biol. 2024 May 3;69(10):10TR01. doi: 10.1088/1361-6560/ad387d.

Abstract

This review paper aims to serve as a comprehensive guide and instructional resource for researchers seeking to effectively implement language models in medical imaging research. First, we presented the fundamental principles and evolution of language models, dedicating particular attention to large language models. We then reviewed the current literature on how language models are being used to improve medical imaging, emphasizing a range of applications such as image captioning, report generation, report classification, findings extraction, visual question response systems, interpretable diagnosis and so on. Notably, the capabilities of ChatGPT were spotlighted for researchers to explore its further applications. Furthermore, we covered the advantageous impacts of accurate and efficient language models in medical imaging analysis, such as the enhancement of clinical workflow efficiency, reduction of diagnostic errors, and assistance of clinicians in providing timely and accurate diagnoses. Overall, our goal is to have better integration of language models with medical imaging, thereby inspiring new ideas and innovations. It is our aspiration that this review can serve as a useful resource for researchers in this field, stimulating continued investigative and innovative pursuits of the application of language models in medical imaging.

Keywords: BERT; ChatGPT; large language model; medical imaging; multimodal learning.

Publication types

  • Review

MeSH terms

  • Diagnostic Imaging* / methods
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Language