Natural Language Processing for the Accurate Identification of Colorectal Cancer Mismatch Repair Status in Lynch Syndrome Screening

Clin Gastroenterol Hepatol. 2021 Mar;19(3):610-612.e1. doi: 10.1016/j.cgh.2020.01.040. Epub 2020 Feb 7.

Abstract

Lynch syndrome (LS) is the most common type of hereditary colorectal cancer (CRC) syndrome caused by pathogenic variants in mismatch repair (MMR) genes [1]. Current multisociety guidelines recommend screening all CRC tumors for LS [2,3]. The most widely adopted screening method is MMR immunohistochemistry (IHC) followed by germline analysis if indicated [2,3]. However, the text-based nature of pathology and IHC reports used for LS screening results impedes creation of an efficient tracking system for identifying affected patients and screening outcomes [4]. In this study, we developed and validated a natural language processing (NLP) tool for extracting MMR IHC results in LS screening in a large, diverse, multicenter, community-based setting [5].

Publication types

  • Multicenter Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Colorectal Neoplasms* / diagnosis
  • Colorectal Neoplasms, Hereditary Nonpolyposis* / diagnosis
  • Colorectal Neoplasms, Hereditary Nonpolyposis* / genetics
  • DNA Mismatch Repair
  • Early Detection of Cancer
  • Humans
  • Microsatellite Instability
  • Natural Language Processing