Clinical risk prediction using language models: benefits and considerations

Angeela Acharya; Sulabh Shrestha; Anyi Chen; Joseph Conte; Sanja Avramovic; Siddhartha Sikdar; Antonios Anastasopoulos; Sanmay Das

doi:10.1093/jamia/ocae030

J Am Med Inform Assoc. 2024 Sep 1;31(9):1856-1864. doi: 10.1093/jamia/ocae030.

Authors

Angeela Acharya¹, Sulabh Shrestha¹, Anyi Chen², Joseph Conte², Sanja Avramovic¹, Siddhartha Sikdar¹, Antonios Anastasopoulos¹, Sanmay Das¹

Affiliations

¹ George Mason University, Fairfax, VA, United States.
² Staten Island Performing Provider System, Staten Island, NY, United States.

PMID: 38412328
PMCID: PMC11339498 (available on 2025-02-27)
DOI: 10.1093/jamia/ocae030

Abstract

Objective: The use of electronic health records (EHRs) for clinical risk prediction is on the rise. However, in many practical settings, the limited availability of task-specific EHR data can restrict the application of standard machine learning pipelines. In this study, we investigate the potential of leveraging language models (LMs) as a means to incorporate supplementary domain knowledge for improving the performance of various EHR-based risk prediction tasks.

Methods: We propose two novel LM-based methods, namely "LLaMA2-EHR" and "Sent-e-Med." Our focus is on utilizing the textual descriptions within structured EHRs to make risk predictions about future diagnoses. We conduct a comprehensive comparison with previous approaches across various data types and sizes.
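As a rough illustration of what "utilizing the textual descriptions within structured EHRs" can look like in practice, the sketch below converts a coded diagnostic history into plain text that could be passed to a language model. It is not the LLaMA2-EHR or Sent-e-Med implementation, which the abstract does not detail. The `Visit` type, the `CODE_DESCRIPTIONS` lookup, the helper names, and the prompt wording are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical lookup from diagnosis code to its textual description.
# A real pipeline would load a full vocabulary; these entries are illustrative.
CODE_DESCRIPTIONS: Dict[str, str] = {
    "F11.20": "Opioid dependence, uncomplicated",
    "M54.5": "Low back pain",
    "F32.9": "Major depressive disorder, single episode, unspecified",
}


@dataclass
class Visit:
    date: str          # ISO date string, e.g. "2020-01-15"
    codes: List[str]   # diagnosis codes recorded at this visit


def describe_history(visits: List[Visit]) -> str:
    """Render visits in chronological order, one line of descriptions per visit."""
    lines = []
    for visit in sorted(visits, key=lambda v: v.date):
        # Codes missing from the lookup are kept visible rather than dropped.
        names = [CODE_DESCRIPTIONS.get(c, f"unknown code {c}") for c in visit.codes]
        lines.append(f"{visit.date}: " + "; ".join(names))
    return "\n".join(lines)


def build_prompt(visits: List[Visit], outcome: str) -> str:
    """Assemble a hypothetical yes/no risk question around the text history."""
    return (
        "Patient diagnostic history:\n"
        + describe_history(visits)
        + f"\nQuestion: Is this patient at risk of {outcome}? Answer yes or no."
    )


if __name__ == "__main__":
    history = [
        Visit("2021-03-01", ["M54.5"]),
        Visit("2020-01-15", ["F32.9"]),
    ]
    print(build_prompt(history, "opioid use disorder"))
```

Working from descriptions rather than raw codes is what would let a model draw on its general knowledge of medical terms, including terms that never appeared in the task-specific training data.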

Results: Experiments across 6 different methods and 3 separate risk prediction tasks reveal that employing LMs to represent structured EHRs, such as diagnostic histories, results in significant performance improvements when evaluated using standard metrics such as the areas under the receiver operating characteristic (ROC) and precision-recall (PR) curves. Additionally, LMs offer benefits such as few-shot learning, the ability to handle previously unseen medical concepts, and adaptability to various medical vocabularies. However, outcomes can be sensitive to the specific prompt used.
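The two evaluation metrics named above can be computed from predicted risk scores as in the minimal sketch below. It makes several assumptions: binary labels (1 = outcome occurred), higher scores meaning higher risk, and average precision used as the estimate of the area under the PR curve. The abstract does not state which estimator the authors used, and the function names are illustrative.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve, computed as the probability that a randomly
    chosen positive is scored above a randomly chosen negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Step-wise estimate of the area under the PR curve: the mean of the
    precision values at the rank of each positive. Assumes no tied scores."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    total = 0.0
    for rank, i in enumerate(order, start=1):
        if labels[i] == 1:
            hits += 1
            total += hits / rank  # precision among the top `rank` predictions
    if hits == 0:
        raise ValueError("need at least one positive label")
    return total / hits


if __name__ == "__main__":
    y_true = [1, 0, 1, 0]
    y_score = [0.9, 0.8, 0.7, 0.1]
    print(roc_auc(y_true, y_score))
    print(average_precision(y_true, y_score))
```

The pairwise ROC computation is quadratic in the number of patients, which is fine for a small illustration; for large cohorts a rank-based implementation from an established library would be the more practical choice.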

Conclusion: LMs encompass extensive embedded knowledge, making them valuable for the analysis of EHRs in the context of risk prediction. Nevertheless, they should be applied with caution, because safety concerns related to LMs persist and require continuous consideration.

Keywords: electronic health records; large language models; opioid use disorder; risk prediction; substance use disorder.

MeSH terms

Electronic Health Records*
Humans
Machine Learning*
Natural Language Processing
ROC Curve
Risk Assessment / methods
