Automated Diabetic Retinopathy Image Assessment Software: Diagnostic Accuracy and Cost-Effectiveness Compared with Human Graders

Adnan Tufail; Caroline Rudisill; Catherine Egan; Venediktos V Kapetanakis; Sebastian Salas-Vega; Christopher G Owen; Aaron Lee; Vern Louw; John Anderson; Gerald Liew; Louis Bolter; Sowmya Srinivas; Muneeswar Nittala; SriniVas Sadda; Paul Taylor; Alicja R Rudnicka

doi:10.1016/j.ophtha.2016.11.014

Automated Diabetic Retinopathy Image Assessment Software: Diagnostic Accuracy and Cost-Effectiveness Compared with Human Graders

Ophthalmology. 2017 Mar;124(3):343-351. doi: 10.1016/j.ophtha.2016.11.014. Epub 2016 Dec 23.

Authors

Adnan Tufail¹, Caroline Rudisill², Catherine Egan³, Venediktos V Kapetanakis⁴, Sebastian Salas-Vega², Christopher G Owen⁴, Aaron Lee⁵, Vern Louw³, John Anderson⁶, Gerald Liew³, Louis Bolter⁶, Sowmya Srinivas⁷, Muneeswar Nittala⁷, SriniVas Sadda⁷, Paul Taylor⁸, Alicja R Rudnicka⁴

Affiliations

¹ Moorfields Biomedical Research Centre, Moorfields Eye Hospital, London, United Kingdom. Electronic address: Adnan.tufail@moorfields.nhs.uk.
² Department of Social Policy, LSE Health, London School of Economics and Political Science, London, United Kingdom.
³ Moorfields Biomedical Research Centre, Moorfields Eye Hospital, London, United Kingdom.
⁴ Population Health Research Institute, St George's, University of London, Cranmer Terrace, London, United Kingdom.
⁵ Moorfields Biomedical Research Centre, Moorfields Eye Hospital, London, United Kingdom; University of Washington, Department of Ophthalmology, Seattle, Washington.
⁶ Homerton University Hospital, Homerton Row, London, United Kingdom.
⁷ Doheny Eye Institute, Los Angeles, California.
⁸ Centre for Health Informatics and Multiprofessional Education, Institute of Health Informatics, University College London, London, United Kingdom.

PMID: 28024825
DOI: 10.1016/j.ophtha.2016.11.014

Abstract

Objective: With the increasing prevalence of diabetes, annual screening for diabetic retinopathy (DR) by expert human grading of retinal images is challenging. Automated DR image assessment systems (ARIAS) may provide clinically effective and cost-effective detection of retinopathy. We aimed to determine whether ARIAS can be safely introduced into DR screening pathways to replace human graders.

Design: Observational measurement comparison study of human graders following a national screening program for DR versus ARIAS.

Participants: Retinal images from 20 258 consecutive patients attending routine annual diabetic eye screening between June 1, 2012, and November 4, 2013.

Methods: Retinal images were manually graded following a standard national protocol for DR screening and were processed by 3 ARIAS: iGradingM, Retmarker, and EyeArt. Discrepancies between manual grades and ARIAS results were sent to a reading center for arbitration.

Main outcome measures: Screening performance (sensitivity, false-positive rate) and diagnostic accuracy (95% confidence intervals of screening-performance measures) were determined. Economic analysis estimated the cost per appropriate screening outcome.

Results: Sensitivity point estimates (95% confidence intervals) of the ARIAS were as follows: EyeArt 94.7% (94.2%-95.2%) for any retinopathy, 93.8% (92.9%-94.6%) for referable retinopathy (human graded as either ungradable, maculopathy, preproliferative, or proliferative), 99.6% (97.0%-99.9%) for proliferative retinopathy; Retmarker 73.0% (72.0 %-74.0%) for any retinopathy, 85.0% (83.6%-86.2%) for referable retinopathy, 97.9% (94.9%-99.1%) for proliferative retinopathy. iGradingM classified all images as either having disease or being ungradable. EyeArt and Retmarker saved costs compared with manual grading both as a replacement for initial human grading and as a filter prior to primary human grading, although the latter approach was less cost-effective.

Conclusions: Retmarker and EyeArt systems achieved acceptable sensitivity for referable retinopathy when compared with that of human graders and had sufficient specificity to make them cost-effective alternatives to manual grading alone. ARIAS have the potential to reduce costs in developed-world health care economies and to aid delivery of DR screening in developing or remote health care settings.

Publication types

Comparative Study
Observational Study

MeSH terms

Adolescent
Adult
Aged
Aged, 80 and over
Child
Cost-Benefit Analysis*
Decision Trees
Diabetic Retinopathy / diagnosis*
Diabetic Retinopathy / economics*
Economics, Medical
False Negative Reactions
Female
Humans
Image Interpretation, Computer-Assisted* / methods
Male
Mass Screening / methods
Middle Aged
Physical Examination / methods
Predictive Value of Tests
Reproducibility of Results
Sensitivity and Specificity
Software

Grants and funding

11/21/02/DH_/Department of Health/United Kingdom