A machine learning algorithm for high throughput identification of FTIR spectra: Application on microplastics collected in the Mediterranean Sea

Chemosphere. 2019 Nov:234:242-251. doi: 10.1016/j.chemosphere.2019.05.113. Epub 2019 Jun 6.

Abstract

The development of methods to automatically determine the chemical nature of microplastics by FTIR-ATR spectra is an important challenge. A machine learning method, named k-nearest neighbors classification, has been applied on spectra of microplastics collected during Tara Expedition in the Mediterranean Sea (2014). To realize these tests, a learning database composed of 969 microplastic spectra has been created. Results show that the machine learning process is very efficient to identify spectra of classical polymers such as poly(ethylene), but also that the learning database must be enhanced with less common microplastic spectra. Finally, this method has been applied on more than 4000 spectra of unidentified microplastics. The verification protocol showed less than 10% difference in the results between the proposed automated method and a human expertise, 75% of which can be very easily corrected.

Keywords: FTIR spectra; Machine learning; Microplastic; Tara mediterranean campaign; k-nearest neighbor classification.

MeSH terms

  • Algorithms*
  • Environmental Monitoring / methods
  • Humans
  • Machine Learning*
  • Mediterranean Sea
  • Plastics / analysis*
  • Plastics / chemistry*
  • Water Pollutants, Chemical / analysis*
  • Water Pollutants, Chemical / chemistry*

Substances

  • Plastics
  • Water Pollutants, Chemical