Analysis of protein complexes through model-based biclustering of label-free quantitative AP-MS data

Mol Syst Biol. 2010 Jun 22:6:385. doi: 10.1038/msb.2010.41.

Abstract

Affinity purification followed by mass spectrometry (AP-MS) has become a common approach for identifying protein-protein interactions (PPIs) and complexes. However, data analysis and visualization often rely on generic approaches that do not take advantage of the quantitative nature of AP-MS. We present a novel computational method, nested clustering, for biclustering of label-free quantitative AP-MS data. Our approach forms bait clusters based on the similarity of quantitative interaction profiles and identifies submatrices of prey proteins showing consistent quantitative association within bait clusters. In doing so, nested clustering effectively addresses the problem of overrepresentation of interactions involving baits proteins as compared with proteins only identified as preys. The method does not require specification of the number of bait clusters, which is an advantage against existing model-based clustering methods. We illustrate the performance of the algorithm using two published intermediate scale human PPI data sets, which are representative of the AP-MS data generated from mammalian cells. We also discuss general challenges of analyzing and interpreting clustering results in the context of AP-MS data.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • ATPases Associated with Diverse Cellular Activities
  • Carrier Proteins / metabolism
  • Chromatography, Affinity / methods*
  • Cluster Analysis
  • DNA Helicases / metabolism
  • Databases, Protein*
  • Humans
  • Mass Spectrometry / methods*
  • Models, Biological*
  • Multiprotein Complexes / metabolism*
  • Protein Binding
  • Protein Interaction Mapping
  • Protein Phosphatase 2 / metabolism
  • Staining and Labeling

Substances

  • Carrier Proteins
  • Multiprotein Complexes
  • Protein Phosphatase 2
  • ATPases Associated with Diverse Cellular Activities
  • DNA Helicases
  • RUVBL1 protein, human
  • RUVBL2 protein, human