Actionable absolute risk prediction of atherosclerotic cardiovascular disease based on the UK Biobank

PLoS One. 2022 Feb 11;17(2):e0263940. doi: 10.1371/journal.pone.0263940. eCollection 2022.

Abstract

Cardiovascular diseases (CVDs) are the primary cause of all death globally. Timely and accurate identification of people at risk of developing an atherosclerotic CVD and its sequelae is a central pillar of preventive cardiology. One widely used approach is risk prediction models; however, currently available models consider only a limited set of risk factors and outcomes, yield no actionable advice to individuals based on their holistic medical state and lifestyle, are often not interpretable, were built with small cohort sizes or are based on lifestyle data from the 1960s, e.g. the Framingham model. The risk of developing atherosclerotic CVDs is heavily lifestyle dependent, potentially making many occurrences preventable. Providing actionable and accurate risk prediction tools to the public could assist in atherosclerotic CVD prevention. Accordingly, we developed a benchmarking pipeline to find the best set of data preprocessing and algorithms to predict absolute 10-year atherosclerotic CVD risk. Based on the data of 464,547 UK Biobank participants without atherosclerotic CVD at baseline, we used a comprehensive set of 203 consolidated risk factors associated with atherosclerosis and its sequelae (e.g. heart failure). Our two best performing absolute atherosclerotic risk prediction models provided higher performance, (AUROC: 0.7573, 95% CI: 0.755-0.7595) and (AUROC: 0.7544, 95% CI: 0.7522-0.7567), than Framingham (AUROC: 0.680, 95% CI: 0.6775-0.6824) and QRisk3 (AUROC: 0.725, 95% CI: 0.7226-0.7273). Using a subset of 25 risk factors identified with feature selection, our reduced model achieves similar performance (AUROC 0.7415, 95% CI: 0.7392-0.7438) while being less complex. Further, it is interpretable, actionable and highly generalizable. The model could be incorporated into clinical practice and might allow continuous personalized predictions with automated intervention suggestions.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Algorithms
  • Atherosclerosis / complications
  • Atherosclerosis / epidemiology*
  • Biological Specimen Banks
  • Cardiovascular Diseases / epidemiology*
  • Cardiovascular Diseases / etiology
  • Cohort Studies
  • Female
  • Humans
  • Life Style
  • Male
  • Middle Aged
  • Risk Assessment
  • Sample Size
  • United Kingdom

Grants and funding

This research was funded by Ada Health GmbH and has been conducted using the UK Biobank under application id 34802.