Problems in detecting misfit of latent class models in diagnostic research without a gold standard were shown

Maarten van Smeden; Daniel L Oberski; Johannes B Reitsma; Jeroen K Vermunt; Karel G M Moons; Joris A H de Groot

doi:10.1016/j.jclinepi.2015.11.012

Problems in detecting misfit of latent class models in diagnostic research without a gold standard were shown

J Clin Epidemiol. 2016 Jun:74:158-66. doi: 10.1016/j.jclinepi.2015.11.012. Epub 2015 Nov 25.

Authors

Maarten van Smeden¹, Daniel L Oberski², Johannes B Reitsma³, Jeroen K Vermunt², Karel G M Moons³, Joris A H de Groot³

Affiliations

¹ Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Universiteitsweg 100, 3584 CG Utrecht, The Netherlands. Electronic address: M.vanSmeden@umcutrecht.nl.
² Department of Methodology and Statistics, Tilburg University, PO Box 90153, 5000 LE Tilburg, The Netherlands.
³ Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Universiteitsweg 100, 3584 CG Utrecht, The Netherlands.

PMID: 26628335
DOI: 10.1016/j.jclinepi.2015.11.012

Abstract

Objectives: The objective of this study was to evaluate the performance of goodness-of-fit testing to detect relevant violations of the assumptions underlying the criticized "standard" two-class latent class model. Often used to obtain sensitivity and specificity estimates for diagnostic tests in the absence of a gold reference standard, this model relies on assuming that diagnostic test errors are independent. When this assumption is violated, accuracy estimates may be biased: goodness-of-fit testing is often used to evaluate the assumption and prevent bias.

Study design and setting: We investigate the performance of goodness-of-fit testing by Monte Carlo simulation. The simulation scenarios are based on three empirical examples.

Results: Goodness-of-fit tests lack power to detect relevant misfit of the standard two-class latent class model at sample sizes that are typically found in empirical diagnostic studies. The goodness-of-fit tests that are based on asymptotic theory are not robust to the sparseness of data. A parametric bootstrap procedure improves the evaluation of goodness of fit in the case of sparse data.

Conclusion: Our simulation study suggests that relevant violation of the local independence assumption underlying the standard two-class latent class model may remain undetected in empirical diagnostic studies, potentially leading to biased estimates of sensitivity and specificity.

Keywords: Goodness of fit; Latent class analysis; Local independence assumption; No gold standard; Sensitivity and specificity; Simulation.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Bias
Data Interpretation, Statistical*
Diagnostic Errors
Diagnostic Tests, Routine / standards*
Diagnostic Tests, Routine / statistics & numerical data*
Humans
Models, Statistical*
Reference Standards
Sensitivity and Specificity