Comparison of deep learning and human observer performance for detection and characterization of simulated lesions

Ruben De Man; Grace J Gang; Xin Li; Ge Wang

doi:10.1117/1.JMI.6.2.025503

Comparison of deep learning and human observer performance for detection and characterization of simulated lesions

J Med Imaging (Bellingham). 2019 Apr;6(2):025503. doi: 10.1117/1.JMI.6.2.025503. Epub 2019 Jun 21.

Authors

Ruben De Man¹, Grace J Gang², Xin Li³, Ge Wang⁴

Affiliations

¹ Stony Brook University, Department of Biochemistry and Cell Biology, Stony Brook, New York, United States.
² Johns Hopkins University, Department of Biomedical Engineering, Baltimore, Maryland, United States.
³ GE Global Research, Radiation Imaging Sciences, Niskayuna, New York, United States.
⁴ Rensselaer Polytechnic Institute, Department of Biomedical Engineering, Troy, New York, United States.

Abstract

Detection and characterization of abnormalities in clinical imaging are of utmost importance for patient diagnosis and treatment. We present a comparison of convolutional neural network (CNN) and human observer performance on a simulated lesion detection and characterization task. We apply both conventional performance metrics, including accuracy and nonconventional metrics such as lift charts to perform qualitative and quantitative comparisons of each type of observer. It is determined that the CNN generally outperforms the human observers, particularly at high noise levels. However, high noise correlation reduces the relative performance of the CNN, and human observer performance is comparable to CNN under these conditions. These findings extend into the field of diagnostic radiology, where the adoption of deep learning is starting to become widespread. Consideration of the applications for which deep learning is most effective is of critical importance to this development.

Keywords: artificial intelligence; detection; image analysis; image quality; noise.