Instrumentation technology for metabolomics has advanced drastically in recent years in terms of sensitivity and specificity. Despite these technical advances, data analytical strategies are still in their infancy in comparison with other 'omics'. Plants are known to possess an immense diversity of secondary metabolites. Typically, more than 70% of metabolomics data are not amenable to systems biological interpretation because of poor database coverage. Here, we propose a new general strategy for mass-spectrometry-based metabolomics that incorporates all exact mass features with known sum formulas into the evaluation and interpretation of metabolomics studies. We extend the use of mass differences, commonly used for feature annotation, by redefining them as variables that reflect the remaining 'omic' domains. The strategy uses exact mass difference network analyses exemplified for the metabolomic description of two grey poplar (Populus × canescens) genotypes that differ in their capability to emit isoprene. This strategy established a direct connection between the metabotype and the non-isoprene-emitting phenotype, as mass differences pertaining to prenylation reactions were over-represented in non-isoprene-emitting poplars. Not only was the analysis of mass differences able to grasp the known chemical biology of poplar, but it also improved the interpretability of yet unknown biochemical relationships.
Keywords: Populus × canescens; mass difference analysis; metabolomics; networks; systems chemical biology.
© 2016 John Wiley & Sons Ltd.