Discovery of noncanonical translation initiation sites through mass spectrometric analysis of protein N termini

Genome Res. 2018 Jan;28(1):25-36. doi: 10.1101/gr.226050.117. Epub 2017 Nov 21.

Abstract

Translation initiation generally occurs at AUG codons in eukaryotes, although it has been shown that non-AUG or noncanonical translation initiation can also occur. However, the evidence for noncanonical translation initiation sites (TISs) is largely indirect and based on ribosome profiling (Ribo-seq) studies. Here, using a strategy specifically designed to enrich N termini of proteins, we demonstrate that many human proteins are translated at noncanonical TISs. The large majority of TISs that mapped to 5' untranslated regions were noncanonical and led to N-terminal extension of annotated proteins or translation of upstream small open reading frames (uORF). It has been controversial whether the amino acid corresponding to the start codon is incorporated at the TIS or methionine is still incorporated. We found that methionine was incorporated at almost all noncanonical TISs identified in this study. Comparison of the TISs determined through mass spectrometry with ribosome profiling data revealed that about two-thirds of the novel annotations were indeed supported by the available ribosome profiling data. Sequence conservation across species and a higher abundance of noncanonical TISs than canonical ones in some cases suggests that the noncanonical TISs can have biological functions. Overall, this study provides evidence of protein translation initiation at noncanonical TISs and argues that further studies are required for elucidation of functional implications of such noncanonical translation initiation.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • 5' Untranslated Regions*
  • HEK293 Cells
  • Human Umbilical Vein Endothelial Cells / metabolism
  • Humans
  • Mass Spectrometry*
  • Open Reading Frames*
  • Peptide Chain Initiation, Translational*
  • Protein Domains
  • Ribosomes / genetics
  • Ribosomes / metabolism*

Substances

  • 5' Untranslated Regions