A Bayesian hierarchical model for prediction of latent health states from multiple data sources with application to active surveillance of prostate cancer

Biometrics. 2017 Jun;73(2):625-634. doi: 10.1111/biom.12577. Epub 2016 Aug 22.

Abstract

In this article, we present a Bayesian hierarchical model for predicting a latent health state from longitudinal clinical measurements. Model development is motivated by the need to integrate multiple sources of data to improve clinical decisions about whether to remove or irradiate a patient's prostate cancer. Existing modeling approaches are extended to accommodate measurement error in cancer state determinations based on biopsied tissue, clinical measurements possibly not missing at random, and informative partial observation of the true state. The proposed model enables estimation of whether an individual's underlying prostate cancer is aggressive, requiring surgery and/or radiation, or indolent, permitting continued surveillance. These individualized predictions can then be communicated to clinicians and patients to inform decision-making. We demonstrate the model with data from a cohort of low-risk prostate cancer patients at Johns Hopkins University and assess predictive accuracy among a subset for whom true cancer state is observed. Simulation studies confirm model performance and explore the impact of adjusting for informative missingness on true state predictions. R code is provided in an online supplement and at http://github.com/rycoley/prediction-prostate-surveillance.

Keywords: Latent class analysis; Missing data; Precision medicine; Prostate cancer prognosis; Risk classification.

MeSH terms

  • Bayes Theorem
  • Biopsy
  • Humans
  • Information Storage and Retrieval
  • Male
  • Prostatic Neoplasms*