A foundational vision transformer improves diagnostic performance for electrocardiograms

Akhil Vaid; Joy Jiang; Ashwin Sawant; Stamatios Lerakis; Edgar Argulian; Yuri Ahuja; Joshua Lampert; Alexander Charney; Hayit Greenspan; Jagat Narula; Benjamin Glicksberg; Girish N Nadkarni

doi:10.1038/s41746-023-00840-9

A foundational vision transformer improves diagnostic performance for electrocardiograms

NPJ Digit Med. 2023 Jun 6;6(1):108. doi: 10.1038/s41746-023-00840-9.

Authors

Akhil Vaid^{1

2

3

4}, Joy Jiang^{5

6}, Ashwin Sawant⁷, Stamatios Lerakis^{8

9}, Edgar Argulian^{8

9}, Yuri Ahuja¹⁰, Joshua Lampert^{8

9}, Alexander Charney^{5

11

12

13}, Hayit Greenspan¹⁴, Jagat Narula^{8

9}, Benjamin Glicksberg^{11

15}, Girish N Nadkarni^{5

6

11

15

16}

Affiliations

¹ The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA. akhil.vaid@mssm.edu.
² Mount Sinai Clinical Intelligence Center, Icahn School of Medicine at Mount Sinai, New York, NY, USA. akhil.vaid@mssm.edu.
³ Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA. akhil.vaid@mssm.edu.
⁴ The Hasso Plattner Institute for Digital Health at Mount Sinai, New York, NY, USA. akhil.vaid@mssm.edu.
⁵ The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁶ Mount Sinai Clinical Intelligence Center, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁷ Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁸ Mount Sinai Heart, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁹ Department of Cardiology, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
¹⁰ Department of Medicine, NYU Langone Health, New York, NY, USA.
¹¹ Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
¹² The Pamela Sklar Division of Psychiatric Genomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
¹³ Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
¹⁴ Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, 6997801, Israel.
¹⁵ The Hasso Plattner Institute for Digital Health at Mount Sinai, New York, NY, USA.
¹⁶ Division of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.

Abstract

The electrocardiogram (ECG) is a ubiquitous diagnostic modality. Convolutional neural networks (CNNs) applied towards ECG analysis require large sample sizes, and transfer learning approaches for biomedical problems may result in suboptimal performance when pre-training is done on natural images. We leveraged masked image modeling to create a vision-based transformer model, HeartBEiT, for electrocardiogram waveform analysis. We pre-trained this model on 8.5 million ECGs and then compared performance vs. standard CNN architectures for diagnosis of hypertrophic cardiomyopathy, low left ventricular ejection fraction and ST elevation myocardial infarction using differing training sample sizes and independent validation datasets. We find that HeartBEiT has significantly higher performance at lower sample sizes compared to other models. We also find that HeartBEiT improves explainability of diagnosis by highlighting biologically relevant regions of the EKG vs. standard CNNs. Domain specific pre-trained transformer models may exceed the classification performance of models trained on natural images especially in very low data regimes. The combination of the architecture and such pre-training allows for more accurate, granular explainability of model predictions.

Abstract

Grants and funding