Understanding the physical properties that control protein crystallization by analysis of large-scale experimental data

Nat Biotechnol. 2009 Jan;27(1):51-7. doi: 10.1038/nbt.1514.

Abstract

Crystallization is the most serious bottleneck in high-throughput protein-structure determination by diffraction methods. We have used data mining of the large-scale experimental results of the Northeast Structural Genomics Consortium and experimental folding studies to characterize the biophysical properties that control protein crystallization. This analysis leads to the conclusion that crystallization propensity depends primarily on the prevalence of well-ordered surface epitopes capable of mediating interprotein interactions and is not strongly influenced by overall thermodynamic stability. We identify specific sequence features that correlate with crystallization propensity and that can be used to estimate the crystallization probability of a given construct. Analyses of entire predicted proteomes demonstrate substantial differences in the amino acid-sequence properties of human versus eubacterial proteins, which likely reflect differences in biophysical properties, including crystallization propensity. Our thermodynamic measurements do not generally support previous claims regarding correlations between sequence properties and protein stability.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Animals
  • Biophysics / methods
  • Computational Biology / methods
  • Crystallization*
  • Entropy
  • Epitopes / chemistry
  • Humans
  • Models, Statistical
  • Protein Folding
  • Proteins / chemistry*
  • Surface Properties
  • Thermodynamics

Substances

  • Epitopes
  • Proteins