For medical applications, ground truth is typically ascertained through manual labelling by clinical experts. However, significant inter-observer variability and various human biases limit accuracy. A probabilistic framework addresses these issues by comparing aggregated human and automated labels to provide a reliable ground truth, requiring no prior knowledge of individual annotator performance. As an alternative to mean or median voting strategies, novel contextual features (signal quality and physiology) were introduced to allow the Probabilistic Label Aggregator (PLA) to weight each algorithm or human according to its estimated performance. As a proof of concept, the PLA was applied to estimation of the QT interval (a pro-arrhythmic indicator) from the electrocardiogram, using labels from 20 humans and 48 algorithms crowd-sourced from the PhysioNet/Computers in Cardiology Challenge 2006 database. For automated annotations, the root mean square error of the PLA was 13.97 ± 0.46 ms, significantly outperforming the best Challenge entry (16.36 ms) as well as the mean and median voting strategies (17.67 ± 0.56 ms and 14.44 ± 0.52 ms respectively, p < 0.05). When only three annotators were selected, the PLA improved annotation accuracy over median aggregation by 10.7% for human annotators and 14.4% for automated algorithms. The PLA could therefore provide an improved "gold standard" for medical annotation tasks, even when no ground truth is available.
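To make the aggregation idea concrete, the sketch below implements a generic precision-weighted EM scheme for fusing continuous-valued labels, in which each annotator's noise variance is estimated jointly with the fused label so that noisier annotators receive less weight. This is a minimal illustration of the weighting principle only, not the paper's PLA (which additionally conditions on contextual features such as signal quality); the function name `aggregate_labels`, the Gaussian noise model, and the toy data are assumptions introduced here for illustration.

```python
import numpy as np

def aggregate_labels(labels, n_iter=50, eps=1e-9):
    """EM-style fusion of continuous labels (a generic sketch, not the PLA).

    labels : (n_samples, n_annotators) array, e.g. QT estimates in ms.
    Returns the fused estimates and per-annotator noise variances.
    """
    # Initialise the fused estimate with the median (a robust starting point).
    y = np.median(labels, axis=1)
    for _ in range(n_iter):
        # M-step: each annotator's noise variance around the current estimate.
        var = np.mean((labels - y[:, None]) ** 2, axis=0) + eps
        # E-step: precision-weighted average; noisier annotators get less weight.
        w = 1.0 / var
        y = (labels * w).sum(axis=1) / w.sum()
    return y, var

# Toy usage: 200 beats annotated by 5 annotators with differing noise levels.
rng = np.random.default_rng(0)
true_qt = rng.normal(400, 20, size=200)             # synthetic "true" QT in ms
noise_sd = np.array([5.0, 10.0, 15.0, 25.0, 40.0])  # per-annotator noise (ms)
obs = true_qt[:, None] + rng.normal(0, noise_sd, size=(200, 5))

est, var = aggregate_labels(obs)
print("RMSE, median voting:", np.sqrt(np.mean((np.median(obs, axis=1) - true_qt) ** 2)))
print("RMSE, weighted EM  :", np.sqrt(np.mean((est - true_qt) ** 2)))
```

On synthetic data of this kind, the precision-weighted estimate typically beats the unweighted median because it down-weights the high-variance annotators, which mirrors the abstract's finding that performance-aware aggregation outperforms mean and median voting.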