Genome annotation improvements from cross-phyla proteogenomics and time-of-day differences in malaria mosquito proteins using untargeted quantitative proteomics

PLoS One. 2019 Jul 29;14(7):e0220225. doi: 10.1371/journal.pone.0220225. eCollection 2019.

Abstract

The malaria mosquito, Anopheles stephensi, and other mosquitoes modulate their biology to match the time of day. In the present work, we used a non-hypothesis-driven approach (untargeted proteomics) to identify proteins in mosquito tissue, and then quantified the relative abundance of the identified proteins from An. stephensi bodies. Using these quantified protein levels, we then analyzed the data for proteins that were detectable only at certain times of day, highlighting the need to consider time of day in experimental design. Further, we extended our time-of-day analysis to look for proteins that cycle in a rhythmic 24-hour ("circadian") manner, identifying 31 rhythmic proteins. Finally, to maximize the utility of our data, we performed a proteogenomic analysis to improve the genome annotation of An. stephensi. We compared peptides that were detected using mass spectrometry but are 'missing' from the An. stephensi predicted proteome to reference proteomes from 38 other species, primarily human disease vectors. We found 239 such peptide matches, showing that genome annotation can be improved using proteogenomic analysis with taxonomically diverse reference proteomes. Examination of 'missing' peptides revealed reading frame errors, errors in gene-calling, overlapping gene models, and suspected gaps in the genome assembly.
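
The comparison of 'missing' peptides against reference proteomes can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name, the dictionary layout, and the toy sequences are all hypothetical, and exact substring matching is an assumption made for brevity (a real search would, for example, need to handle the isobaric residues leucine and isoleucine).

```python
def find_missing_peptide_matches(peptides, target_proteome, reference_proteomes):
    """Map each peptide absent from the target proteome to its reference hits.

    peptides: iterable of peptide sequences (hypothetical input format).
    target_proteome: {protein_id: sequence} for the species being annotated.
    reference_proteomes: {species: {protein_id: sequence}} for other species.
    Returns {peptide: [(species, protein_id), ...]}.
    """
    target_seqs = list(target_proteome.values())
    matches = {}
    for pep in peptides:
        # A peptide found in the target proteome is already explained.
        if any(pep in seq for seq in target_seqs):
            continue
        # Otherwise look for an exact occurrence in any reference protein.
        hits = [
            (species, prot_id)
            for species, proteome in reference_proteomes.items()
            for prot_id, seq in proteome.items()
            if pep in seq
        ]
        if hits:
            matches[pep] = hits
    return matches


# Toy data, invented purely for illustration.
target = {"AS1": "MKTLLVAGGR"}
references = {
    "species_A": {"A1": "MSTPEPTIDEKQ"},
    "species_B": {"B1": "MKTLLVAGGR", "B2": "AAPEPTIDEKGG"},
}
result = find_missing_peptide_matches(
    ["TLLVAGGR", "PEPTIDEK", "WWWWK"], target, references
)
```

In this toy run, only the peptide that is absent from the target proteome but present in a reference protein is reported; each such hit points to a candidate locus where the target annotation may need review.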

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Anopheles / genetics
  • Anopheles / metabolism*
  • Humans
  • India
  • Insect Proteins / chemistry
  • Insect Proteins / genetics*
  • Insect Proteins / metabolism*
  • Malaria / transmission
  • Mass Spectrometry
  • Mosquito Vectors / genetics
  • Mosquito Vectors / metabolism
  • Peptides / analysis
  • Proteogenomics / methods*
  • Proteomics / methods
  • Sequence Analysis, DNA

Substances

  • Insect Proteins
  • Peptides

Associated data

  • Dryad/10.5061/dryad.8p20m31