Purpose: To evaluate the reliability of manual annotation when quantifying cornea anatomical and microbial keratitis (MK) morphological features on slit-lamp photography (SLP) images.
Methods: Prospectively enrolled patients with MK underwent SLP at the initial encounter at 2 academic eye hospitals. Patients who presented with an epithelial defect (ED) were eligible for analysis. Features, including the ED, corneal limbus (L), pupil (P), stromal infiltrate (SI), white blood cell (WBC) infiltration at the SI edge, and hypopyon (H), were annotated independently by 2 physicians on SLP images. Intraclass correlation coefficients (ICCs) were used to assess reliability; Dice similarity coefficients (DSCs) were used to quantify the area overlap between readers.
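As an illustrative sketch (not the study's actual analysis code), the DSC between two annotation masks can be computed as twice the intersection area divided by the sum of the two areas; the mask values below are hypothetical:

```python
import numpy as np

def dice_similarity(mask_a: np.ndarray, mask_b: np.ndarray) -> float:
    """DSC = 2|A ∩ B| / (|A| + |B|) for two binary annotation masks."""
    a = mask_a.astype(bool)
    b = mask_b.astype(bool)
    total = a.sum() + b.sum()
    if total == 0:
        return 1.0  # both masks empty: treat as perfect agreement
    return 2.0 * np.logical_and(a, b).sum() / total

# Two hypothetical 4x4 masks of the same feature by different graders
# (1 = pixel annotated as belonging to the feature)
grader1 = np.array([[0, 1, 1, 0],
                    [0, 1, 1, 0],
                    [0, 0, 0, 0],
                    [0, 0, 0, 0]])
grader2 = np.array([[0, 1, 1, 0],
                    [0, 1, 0, 0],
                    [0, 0, 0, 0],
                    [0, 0, 0, 0]])
print(round(dice_similarity(grader1, grader2), 3))  # 2*3/(4+3) ≈ 0.857
```

A DSC of 1.0 indicates pixel-perfect agreement between the two annotations, and 0.0 indicates no overlap.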
Results: Seventy-five patients with MK and an ED underwent SLP. DSCs indicated good annotation overlap between graders (L = 0.97, P = 0.80, ED = 0.94, SI = 0.82, H = 0.82, WBC = 0.83) and between repeat annotations by the same grader (L = 0.97, P = 0.81, ED = 0.94, SI = 0.85, H = 0.84, WBC = 0.82). ICCs showed good intergrader (L = 0.98, P = 0.78, ED = 1.00, SI = 0.67, H = 0.97, WBC = 0.86) and intragrader (L = 0.99, P = 0.92, ED = 0.99, SI = 0.94, H = 0.99, WBC = 0.92) reliability. When reliability statistics for the annotated SI area were recalculated in the subset of cases where both graders agreed that WBC infiltration was present or absent, intergrader ICC improved to 0.91 and DSC to 0.86; intragrader ICC was unchanged, whereas DSC improved to 0.87.
Conclusions: Manual annotation supports the usefulness of area quantification in evaluating MK. However, variability is intrinsic to the task, underscoring the need to optimize annotation protocols. Future directions include using multiple annotators per image or automated annotation software.