Environment specific substitution tables improve membrane protein alignment

Bioinformatics. 2011 Jul 1;27(13):i15-23. doi: 10.1093/bioinformatics/btr230.

Abstract

Motivation: Membrane proteins are both abundant and important in cells, but the small number of solved structures restricts our understanding of them. Here we consider whether membrane proteins undergo different substitutions from their soluble counterparts and whether these can be used to improve membrane protein alignments, and therefore improve prediction of their structure.

Results: We construct substitution tables for different environments within membrane proteins. As data is scarce, we develop a general metric to assess the quality of these asymmetric tables. Membrane proteins show markedly different substitution preferences from soluble proteins. For example, substitution preferences in lipid tail-contacting parts of membrane proteins are found to be distinct from all environments in soluble proteins, including buried residues. A principal component analysis of the tables identifies the greatest variation in substitution preferences to be due to changes in hydrophobicity; the second largest variation relates to secondary structure. We demonstrate the use of our tables in pairwise sequence-to-structure alignments (also known as 'threading') of membrane proteins using the FUGUE alignment program. On average, in the 10-25% sequence identity range, alignments are improved by 28 correctly aligned residues compared with alignments made using FUGUE's default substitution tables. Our alignments also lead to improved structural models.

Availability: Substitution tables are available at: http://www.stats.ox.ac.uk/proteins/resources.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Membrane Proteins / chemistry*
  • Models, Molecular
  • Principal Component Analysis
  • Protein Structure, Secondary
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein / methods*
  • Software*
  • Structural Homology, Protein*

Substances

  • Membrane Proteins