CAManim: Animating end-to-end network activation maps

PLoS One. 2024 Jun 18;19(6):e0296985. doi: 10.1371/journal.pone.0296985. eCollection 2024.

Abstract

Deep neural networks have been widely adopted across numerous domains owing to their high performance and their accessibility to developers and application-specific end-users. Fundamental to image-based applications are Convolutional Neural Networks (CNNs), which can automatically extract features from data. However, comprehending these complex models and their learned representations, which typically comprise millions of parameters and numerous layers, remains a challenge for both developers and end-users, largely because of the absence of interpretable and transparent tools for making sense of black-box models. A growing body of Explainable Artificial Intelligence (XAI) literature, including a collection of methods known as Class Activation Maps (CAMs), seeks to demystify what representations a model learns from the data, how those representations inform a given prediction, and why the model at times performs poorly on certain tasks. We propose a novel XAI visualization method, denoted CAManim, that seeks to simultaneously broaden and focus end-user understanding of CNN predictions by animating CAM-based network activation maps through all layers, depicting end to end how a model progressively arrives at the final-layer activation. Herein, we demonstrate that CAManim works with any CAM-based method and various CNN architectures. Beyond qualitative model assessment, we additionally propose a novel quantitative assessment that expands upon the Remove and Debias (ROAD) metric, pairing the qualitative end-to-end visual explanations with our quantitative "yellow brick ROAD" assessment (ybROAD). This builds upon prior research to address the increasing demand for interpretable, robust, and transparent model assessment methodology, ultimately improving an end-user's trust in a given model's predictions. Examples and source code can be found at: https://omni-ml.github.io/pytorch-grad-cam-anim/.
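
To make the layer-by-layer idea concrete, the following sketch approximates the core loop with off-the-shelf tools: it computes a Grad-CAM heatmap for every convolutional layer of a torchvision ResNet-50, scores each heatmap with the ROAD metric (one way to realize a per-layer, ybROAD-style curve), and stitches the overlays into an animation. This is a minimal sketch under stated assumptions, not the authors' implementation: the model, target class index, placeholder image, ROADCombined scoring step, and use of imageio are illustrative choices, and a recent version of the pytorch-grad-cam package is assumed; the published pytorch-grad-cam-anim code at the link above provides the end-to-end version.

    import numpy as np
    import torch
    import imageio.v2 as imageio
    from torchvision.models import resnet50, ResNet50_Weights
    from pytorch_grad_cam import GradCAM
    from pytorch_grad_cam.utils.model_targets import ClassifierOutputTarget
    from pytorch_grad_cam.utils.image import show_cam_on_image
    from pytorch_grad_cam.metrics.road import ROADCombined

    model = resnet50(weights=ResNet50_Weights.DEFAULT).eval()

    # Placeholder image: float32 HxWx3 in [0, 1]; a real use case would load an
    # actual image and apply the model's preprocessing transforms instead.
    rgb_img = np.random.rand(224, 224, 3).astype(np.float32)
    input_tensor = torch.from_numpy(rgb_img).permute(2, 0, 1).unsqueeze(0)

    targets = [ClassifierOutputTarget(281)]            # hypothetical target class index
    road = ROADCombined(percentiles=[20, 40, 60, 80])  # per-layer scores give a ybROAD-style curve

    frames, ybroad = [], []
    # Approximate "all layers" with every convolutional layer, shallow to deep.
    conv_layers = [m for m in model.modules() if isinstance(m, torch.nn.Conv2d)]
    for layer in conv_layers:
        with GradCAM(model=model, target_layers=[layer]) as cam:
            grayscale_cam = cam(input_tensor=input_tensor, targets=targets)  # shape (1, H, W)
        frames.append(show_cam_on_image(rgb_img, grayscale_cam[0], use_rgb=True))
        ybroad.append(float(road(input_tensor, grayscale_cam, targets, model)[0]))

    imageio.mimsave("cam_anim_sketch.gif", frames, duration=0.2)  # layer-by-layer animation
    print(ybroad)                                                 # quantitative trace across layers

In practice, one would replace the random placeholder with a properly normalized input image and might restrict the layer list (for example, to block outputs) to keep the animation and the repeated ROAD evaluations tractable.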

MeSH terms

  • Algorithms
  • Artificial Intelligence
  • Deep Learning
  • Humans
  • Neural Networks, Computer*

Grants and funding

The author(s) received no specific funding for this work.