Untargeted time-pattern analysis of LC-HRMS data to detect spills and compounds with high fluctuation in influent wastewater

J Hazard Mater. 2019 Jan 5:361:19-29. doi: 10.1016/j.jhazmat.2018.08.073. Epub 2018 Aug 23.

Abstract

Peak prioritization plays a key role in non-target analysis of complex samples in order to focus the elucidation efforts on potentially relevant substances. The present work shows the development of a computational workflow capable of detecting compounds that exhibit large variation in intensity over time. The developed approach is based on three open-source R packages (xcms, CAMERA and TIMECOURSE) and includes the use of the statistical test Multivariate Empirical Bayes Approach to rank the compounds based on the Hotelling T2 coefficient, which is an indicator of large concentration variations of unknown components. The approach was applied to replicate series of 24 h composite flow-proportional influent wastewater samples collected during 8 consecutive days. 60 events involving unknown substances with high fluctuation over time were successfully prioritized. 14 of those compounds were tentatively identified using HRMS/MS libraries, chemical databases, in-silico fragmentation tools, and retention time prediction models. Four compounds were confirmed with standards from which two never reported before in wastewater.

Keywords: Multivariate Empirical Bayes Approach; Non-target screening; Peak prioritization workflow; Trend analysis; Wastewater.