Objective evaluation of deep uncertainty predictions for COVID-19 detection

Hamzeh Asgharnezhad; Afshar Shamsi; Roohallah Alizadehsani; Abbas Khosravi; Saeid Nahavandi; Zahra Alizadeh Sani; Dipti Srinivasan; Sheikh Mohammed Shariful Islam

doi:10.1038/s41598-022-05052-x

Objective evaluation of deep uncertainty predictions for COVID-19 detection

Sci Rep. 2022 Jan 17;12(1):815. doi: 10.1038/s41598-022-05052-x.

Authors

Hamzeh Asgharnezhad^#¹, Afshar Shamsi^#², Roohallah Alizadehsani³, Abbas Khosravi³, Saeid Nahavandi³, Zahra Alizadeh Sani⁴, Dipti Srinivasan⁵, Sheikh Mohammed Shariful Islam⁶

Affiliations

¹ Individual researcher, Tehran, Iran.
² Individual researcher, Tehran, Iran. afshar.shamsi.j@gmail.com.
³ Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Melbourne, VIC, Australia.
⁴ Omid Hospital, Iran University of Medical Sciences, Tehran, Iran.
⁵ Department of Electrical and Computer Engineering, National University of Singapore, Singapore, Singapore.
⁶ Institute of Physical Activity and Nutrition, Deakin University, Melbourne, VIC, Australia.

^# Contributed equally.

Abstract

Deep neural networks (DNNs) have been widely applied for detecting COVID-19 in medical images. Existing studies mainly apply transfer learning and other data representation strategies to generate accurate point estimates. The generalization power of these networks is always questionable due to being developed using small datasets and failing to report their predictive confidence. Quantifying uncertainties associated with DNN predictions is a prerequisite for their trusted deployment in medical settings. Here we apply and evaluate three uncertainty quantification techniques for COVID-19 detection using chest X-Ray (CXR) images. The novel concept of uncertainty confusion matrix is proposed and new performance metrics for the objective evaluation of uncertainty estimates are introduced. Through comprehensive experiments, it is shown that networks pertained on CXR images outperform networks pretrained on natural image datasets such as ImageNet. Qualitatively and quantitatively evaluations also reveal that the predictive uncertainty estimates are statistically higher for erroneous predictions than correct predictions. Accordingly, uncertainty quantification methods are capable of flagging risky predictions with high uncertainty estimates. We also observe that ensemble methods more reliably capture uncertainties during the inference. DNN-based solutions for COVID-19 detection have been mainly proposed without any principled mechanism for risk mitigation. Previous studies have mainly focused on on generating single-valued predictions using pretrained DNNs. In this paper, we comprehensively apply and comparatively evaluate three uncertainty quantification techniques for COVID-19 detection using chest X-Ray images. The novel concept of uncertainty confusion matrix is proposed and new performance metrics for the objective evaluation of uncertainty estimates are introduced for the first time. Using these new uncertainty performance metrics, we quantitatively demonstrate when we could trust DNN predictions for COVID-19 detection from chest X-rays. It is important to note the proposed novel uncertainty evaluation metrics are generic and could be applied for evaluation of probabilistic forecasts in all classification problems.

Objective evaluation of deep uncertainty predictions for COVID-19 detection

Authors

Affiliations

Abstract

Publication types

MeSH terms

Grants and funding