Gene identification and comparative molecular modeling of a Trypanosoma rangeli major surface protease

J Mol Model. 2013 Aug;19(8):3053-64. doi: 10.1007/s00894-013-1834-8. Epub 2013 Apr 13.

Abstract

Trypanosoma rangeli is a hemoflagellate parasite which is able to infect humans. Distinct from Trypanosoma cruzi, the causative agent of Chagas disease, T. rangeli is non-pathogenic to the vertebrate host. The manner by which the T. rangeli interacts with the host is still unknown, but it certainly depends on the surface molecules. Major surface proteins (MSP) are GPI-anchored, zinc-dependent metalloproteases present in the surface of all trypanosomatids studied so far, which are implicated as virulence factors in pathogenic trypanosomatids, such as Leishmania spp and T. cruzi. The aims of this work were to generate the complete sequence of a T. rangeli MSP (TrMSP) gene and to determine the 3D-structure of the predicted protein by homology modeling. The plasmid bearing a complete copy of a TrMSP gene was completely sequenced and the predicted protein was modeled using Modeller software. Results indicate that TrMSP open reading frame (ORF) codes for a predicted 588 amino acid protein and shows all elements required for its posttranslational processing. Multiple sequence alignment of TrMSP with other trypanosomatids' MSPs showed an extensive conservation of the N-terminal and central regions and a more divergent C-terminal region. Leishmania major MSP (LmMSP), which had its crystal structure previously determined, has an overall 35% identity with TrMSP. This identity allowed the comparative molecular modeling of TrMSP, which demonstrated a high degree of structural conservation between MSPs from other trypanosomatids (TrypMSPs). All modeled MSPs have a conserved folding pattern, apart from structural divergences in the C-domain and discrete differences of charge and topology in the catalytic cleft, and present the same geometry of the canonical HEXXH zinc-binding motif. The determination of surface charges of the molecules revealed that TrMSP is a predominantly positive protein, whereas LmMSP and Trypanosoma cruzi MSP (TcMSP) are negative proteins, suggesting that substrates recognized by TcMSP and LmMSP could not interact with TrMSP. Moreover, the comparison between TrMSP and TcMSP protein sequences has revealed 45 non-neutral amino acid substitutions, which can be further assessed through protein engineering. The characteristics of TrMSP could explain, at least in part, the lack of pathogenicity of T. rangeli to humans and point to the necessity of identifying the biological targets of this enzyme.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Conserved Sequence
  • DNA, Protozoan / chemistry*
  • DNA, Protozoan / genetics
  • Humans
  • Leishmania major / chemistry
  • Leishmania major / enzymology
  • Leishmania major / genetics
  • Membrane Proteins / chemistry*
  • Membrane Proteins / genetics
  • Membrane Proteins / metabolism
  • Metalloproteases / chemistry*
  • Metalloproteases / genetics
  • Metalloproteases / metabolism
  • Models, Molecular*
  • Molecular Sequence Data
  • Open Reading Frames
  • Protein Folding
  • Protozoan Proteins / chemistry*
  • Protozoan Proteins / genetics
  • Protozoan Proteins / metabolism
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Sequence Homology, Amino Acid
  • Substrate Specificity
  • Trypanosoma cruzi / chemistry
  • Trypanosoma cruzi / enzymology
  • Trypanosoma cruzi / genetics
  • Trypanosoma rangeli / chemistry*
  • Trypanosoma rangeli / enzymology
  • Trypanosoma rangeli / genetics

Substances

  • DNA, Protozoan
  • Membrane Proteins
  • Protozoan Proteins
  • Metalloproteases