Addressing multi-label imbalance problem of surgical tool detection using CNN

Int J Comput Assist Radiol Surg. 2017 Jun;12(6):1013-1020. doi: 10.1007/s11548-017-1565-x. Epub 2017 Mar 29.

Abstract

Purpose: A fully automated surgical tool detection framework is proposed for endoscopic video streams. State-of-the-art surgical tool detection methods rely on supervised one-vs-all or multi-class classification techniques, completely ignoring the co-occurrence relationship of the tools and the associated class imbalance.

Methods: In this paper, we formulate tool detection as a multi-label classification task where tool co-occurrences are treated as separate classes. In addition, imbalance on tool co-occurrences is analyzed and stratification techniques are employed to address the imbalance during convolutional neural network (CNN) training. Moreover, temporal smoothing is introduced as an online post-processing step to enhance runtime prediction.

Results: Quantitative analysis is performed on the M2CAI16 tool detection dataset to highlight the importance of stratification, temporal smoothing and the overall framework for tool detection.

Conclusion: The analysis on tool imbalance, backed by the empirical results, indicates the need and superiority of the proposed framework over state-of-the-art techniques.

Keywords: CNN; Laparoscopic videos; Multi-label learning; Surgical tool detection; Transfer learning.

MeSH terms

  • Algorithms
  • Endoscopy / methods*
  • Foreign Bodies / diagnostic imaging*
  • Humans
  • Image Processing, Computer-Assisted / methods*
  • Neural Networks, Computer*
  • Pattern Recognition, Automated / methods*