MemPype: a pipeline for the annotation of eukaryotic membrane proteins

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W375-80. doi: 10.1093/nar/gkr282. Epub 2011 May 4.

Abstract

MemPype is a Python-based pipeline including previously published methods for the prediction of signal peptides (SPEP), glycophosphatidylinositol (GPI) anchors (PredGPI), all-alpha membrane topology (ENSEMBLE), and a recent method (MemLoci) that specifically discriminates the localization of eukaryotic membrane proteins in: 'cell membrane', 'internal membranes', 'organelle membranes'. MemLoci scores with accuracy of 70% and generalized correlation coefficient (GCC) of 0.50 on a rigorous homology-unbiased validation set and overpasses other predictors for subcellular localization. The annotation process is based both on inheritance through homology and computational methods. Each submitted protein first retrieves, when available, up to 25 similar proteins (with sequence identity ≥50% and alignment coverage ≥50% on both sequences). This helps the identification of membrane-associated proteins and detailed localization tags. Each protein is also filtered for the presence of a GPI anchor [0.8% false positive rate (FPR)]. A positive score of GPI anchor prediction labels the sequence as exposed to 'Cell surface'. Concomitantly the sequence is analysed for the presence of a signal peptide and classified with MemLoci into one of three discriminated classes. Finally the sequence is filtered for predicting its putative all-alpha protein membrane topology (FPR <1%). The web server is available at: http://mu2py.biocomp.unibo.it/mempype.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • GPI-Linked Proteins / chemistry
  • Membrane Proteins / analysis
  • Membrane Proteins / chemistry*
  • Molecular Sequence Annotation*
  • Protein Sorting Signals
  • Protein Structure, Tertiary
  • Sequence Homology, Amino Acid
  • Software*

Substances

  • GPI-Linked Proteins
  • Membrane Proteins
  • Protein Sorting Signals