A flexible statistical model for alignment of label-free proteomics data--incorporating ion mobility and product ion information

Ashlee M Benjamin; J Will Thompson; Erik J Soderblom; Scott J Geromanos; Ricardo Henao; Virginia B Kraus; M Arthur Moseley; Joseph E Lucas

doi:10.1186/1471-2105-14-364

A flexible statistical model for alignment of label-free proteomics data--incorporating ion mobility and product ion information

BMC Bioinformatics. 2013 Dec 16:14:364. doi: 10.1186/1471-2105-14-364.

Authors

Ashlee M Benjamin¹, J Will Thompson, Erik J Soderblom, Scott J Geromanos, Ricardo Henao, Virginia B Kraus, M Arthur Moseley, Joseph E Lucas

Affiliation

¹ Institute for Genome Sciences and Policy, Duke University Medical Center, Durham, North Carolina, USA. amb103@duke.edu.

Abstract

Background: The goal of many proteomics experiments is to determine the abundance of proteins in biological samples, and the variation thereof in various physiological conditions. High-throughput quantitative proteomics, specifically label-free LC-MS/MS, allows rapid measurement of thousands of proteins, enabling large-scale studies of various biological systems. Prior to analyzing these information-rich datasets, raw data must undergo several computational processing steps. We present a method to address one of the essential steps in proteomics data processing--the matching of peptide measurements across samples.

Results: We describe a novel method for label-free proteomics data alignment with the ability to incorporate previously unused aspects of the data, particularly ion mobility drift times and product ion information. We compare the results of our alignment method to PEPPeR and OpenMS, and compare alignment accuracy achieved by different versions of our method utilizing various data characteristics. Our method results in increased match recall rates and similar or improved mismatch rates compared to PEPPeR and OpenMS feature-based alignment. We also show that the inclusion of drift time and product ion information results in higher recall rates and more confident matches, without increases in error rates.

Conclusions: Based on the results presented here, we argue that the incorporation of ion mobility drift time and product ion information are worthy pursuits. Alignment methods should be flexible enough to utilize all available data, particularly with recent advancements in experimental separation methods.

Publication types

Comparative Study
Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

DNA-Binding Proteins / chemistry
DNA-Binding Proteins / genetics
Escherichia coli Proteins / chemistry
Escherichia coli Proteins / genetics
Hepatitis C / genetics
Hepatitis C / metabolism
Humans
Ions / chemistry
Models, Genetic*
Osteoarthritis / genetics
Osteoarthritis / metabolism
Peptide Fragments / chemistry*
Peptide Fragments / genetics
Proteomics / methods*
Proteomics / statistics & numerical data
Sequence Alignment / methods*
Sequence Alignment / statistics & numerical data
Spectrometry, Mass, Electrospray Ionization* / methods
Tandem Mass Spectrometry / methods

Substances

DNA-Binding Proteins
Escherichia coli Proteins
Ions
Peptide Fragments

Grants and funding

1UL1 RR024128-01/RR/NCRR NIH HHS/United States