Purpose: Estimate incidence of autoimmune conditions in a population who received HPV4 vaccine and a comparison unvaccinated population. Electronic health record (EHR) data may contain inaccurate or incomplete coding, while manual chart review of all cases may not be feasible. We propose a method to estimate incidence using EHR data and case review for a sample.
Methods: Suspected incident cases were identified using ICD-9 codes, laboratory results and medications related to the condition. A random sample of charts was reviewed to confirm the diagnosis and determine disease onset date. Multiple imputation, using a Monte Carlo model including age and disease indicators was used to impute case status of non-reviewed cases. Incidence rate was calculated in each imputed dataset, with median and percentiles giving a distribution for the estimated incidence rate. Sensitivity analyses compared modeled results to results without imputation and results where imputation was applied to the subset of cases identified using specific ICD-9 codes.
Results: The model accounted for differential case confirmation rates by age and method of case identification, identifying a potential safety signal that was missed relying on EHR data alone.
Conclusions: This method may be useful for computing incidence when full case review is not feasible.
Keywords: Data accuracy; Epidemiologic methods; Incidence; Review of reported cases.
Copyright © 2017 Elsevier Ltd. All rights reserved.