Common pre-diagnostic features in individuals with different rare diseases represent a key for diagnostic support with computerized pattern recognition?

PLoS One. 2019 Oct 10;14(10):e0222637. doi: 10.1371/journal.pone.0222637. eCollection 2019.

Abstract

Background: Rare diseases (RD) result in a wide variety of clinical presentations, and this creates a significant diagnostic challenge for health care professionals. We hypothesized that there exist a set of consistent and shared phenomena among all individuals affected by (different) RD during the time before diagnosis is established.

Objective: We aimed to identify commonalities between different RD and developed a machine learning diagnostic support tool for RD.

Methods: 20 interviews with affected individuals with different RD, focusing on the time period before their diagnosis, were performed and qualitatively analyzed. Out of these pre-diagnostic experiences, we distilled key phenomena and created a questionnaire which was then distributed among individuals with the established diagnosis of i.) RD, ii.) other common non-rare diseases (NRO) iii.) common chronic diseases (CD), iv.), or psychosomatic/somatoform disorders (PSY). Finally, four combined single machine learning methods and a fusion algorithm were used to distinguish the different answer patterns of the questionnaires.

Results: The questionnaire contained 53 questions. A total sum of 1763 questionnaires (758 RD, 149 CD, 48 PSY, 200 NRO, 34 healthy individuals and 574 not evaluable questionnaires) were collected. Based on 3 independent data sets the 10-fold stratified cross-validation method for the answer-pattern recognition resulted in sensitivity values of 88.9% to detect the answer pattern of a RD, 86.6% for NRO, 87.7% for CD and 84.2% for PSY.

Conclusion: Despite the great diversity in presentation and pathogenesis of each RD, patients with RD share surprisingly similar pre-diagnosis experiences. Our questionnaire and data-mining based approach successfully detected unique patterns in groups of individuals affected by a broad range of different rare diseases. Therefore, these results indicate distinct patterns that may be used for diagnostic support in RD.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Artificial Intelligence
  • Chronic Disease / epidemiology*
  • Data Mining
  • Female
  • Health Personnel / statistics & numerical data
  • Health Status
  • Humans
  • Machine Learning*
  • Male
  • Patients
  • Rare Diseases / classification
  • Rare Diseases / diagnosis*
  • Rare Diseases / epidemiology*
  • Surveys and Questionnaires
  • Young Adult

Grants and funding

This study was supported the funding organization, Robert Bosch Stiftung, Germany (https://www.bosch-stiftung.de/en). LG, FK, and WL are co-founders of the commercial company, Improved Medical Diagnostics IMD GmbH. Improved Medical Diagnostics IMD GmbH does not currently employ LG, FK, and WL, nor did it provide funding for the current study. Neither the funding organization, Robert Bosch Stiftung, Germany, nor Improved Medical Diagnostics IMD GmbH had any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.