Peak prioritization plays a key role in non-target analysis of complex samples in order to focus the elucidation efforts on potentially relevant substances. The present work shows the development of a computational workflow capable of detecting compounds that exhibit large variation in intensity over time. The developed approach is based on three open-source R packages (xcms, CAMERA and TIMECOURSE) and includes the use of the statistical test Multivariate Empirical Bayes Approach to rank the compounds based on the Hotelling T2 coefficient, which is an indicator of large concentration variations of unknown components. The approach was applied to replicate series of 24 h composite flow-proportional influent wastewater samples collected during 8 consecutive days. 60 events involving unknown substances with high fluctuation over time were successfully prioritized. 14 of those compounds were tentatively identified using HRMS/MS libraries, chemical databases, in-silico fragmentation tools, and retention time prediction models. Four compounds were confirmed with standards from which two never reported before in wastewater.
Keywords: Multivariate Empirical Bayes Approach; Non-target screening; Peak prioritization workflow; Trend analysis; Wastewater.
Copyright © 2018 Elsevier B.V. All rights reserved.