Stratifying risk for dementia onset using large-scale electronic health record data: A retrospective cohort study

Alzheimers Dement. 2020 Mar;16(3):531-540. doi: 10.1016/j.jalz.2019.09.084. Epub 2020 Jan 16.

Abstract

Introduction: Preventing dementia, or modifying disease course, requires identification of presymptomatic or minimally symptomatic high-risk individuals.

Methods: We used longitudinal electronic health records from two large academic medical centers and applied a validated natural language processing tool to estimate cognitive symptomatology. We used survival analysis to examine the association of cognitive symptoms with incident dementia diagnosis during up to 8 years of follow-up.

Results: Among 267,855 hospitalized patients with 1,251,858 patient years of follow-up data, 6516 (2.4%) received a new diagnosis of dementia. In competing risk regression, an increasing cognitive symptom score was associated with earlier dementia diagnosis (HR 1.63; 1.54-1.72). Similar results were observed in the second hospital system and in subgroup analysis of younger and older patients.

Discussion: A cognitive symptom measure identified in discharge notes facilitated stratification of risk for dementia up to 8 years before diagnosis.

Keywords: Alzheimer's disease; Cognition; Data mining; Dementia; Electronic health record; Machine learning; Natural language processing; Phenotype; Research domain criteria.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Dementia / diagnosis*
  • Disease Progression*
  • Early Diagnosis*
  • Electronic Health Records / statistics & numerical data*
  • Female
  • Humans
  • Longitudinal Studies
  • Male
  • Middle Aged
  • Natural Language Processing
  • Retrospective Studies