Automatic identification of novel bacteria using Raman spectroscopy and Gaussian processes

Anal Chim Acta. 2013 Sep 10:794:29-37. doi: 10.1016/j.aca.2013.07.051. Epub 2013 Jul 31.

Abstract

Raman spectroscopy is successfully used for the reliable classification of complex biological samples. Much effort concentrates on the accurate prediction of known categories for highly relevant tasks in a wide area of applications such as cancer detection and bacteria recognition. However, the resulting recognition systems cannot always be directly used in practice since unseen samples might not belong to classes present in the training set. Our work aims to tackle this problem of novelty detection using a recently proposed approach based on Gaussian processes. By learning novelty scores for a large bacteria Raman dataset comprising 50 different strains, we analyze the behavior of this method on an independent dataset which includes known as well as unknown categories. Our experiment reveals that non-parametric methods such as Gaussian processes can be successfully applied to the task of finding unknown bacterial strains, leading to encouraging results motivating their further utilization in this area.

Keywords: Bacteria recognition; Gaussian processes; Novelty detection; One-class classification; Raman spectroscopy.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria / chemistry
  • Bacteria / classification*
  • Bacterial Typing Techniques / methods*
  • Pattern Recognition, Automated
  • Principal Component Analysis
  • Spectrum Analysis, Raman*