Broad-scale phylogenomics provides insights into retrovirus-host evolution

Proc Natl Acad Sci U S A. 2013 Dec 10;110(50):20146-51. doi: 10.1073/pnas.1315419110. Epub 2013 Nov 25.

Abstract

Genomic data provide an excellent resource to improve understanding of retrovirus evolution and the complex relationships among viruses and their hosts. In conjunction with broad-scale in silico screening of vertebrate genomes, this resource offers an opportunity to complement data on the evolution and frequency of past retroviral spread and so evaluate future risks and limitations for horizontal transmission between different host species. Here, we develop a methodology for extracting phylogenetic signal from large endogenous retrovirus (ERV) datasets by collapsing information to facilitate broad-scale phylogenomics across a wide sample of hosts. Starting with nearly 90,000 ERVs from 60 vertebrate host genomes, we construct phylogenetic hypotheses and draw inferences regarding the designation, host distribution, origin, and transmission of the Gammaretrovirus genus and associated class I ERVs. Our results uncover remarkable depths in retroviral sequence diversity, supported within a phylogenetic context. This finding suggests that current infectious exogenous retrovirus diversity may be underestimated, adding credence to the possibility that many additional exogenous retroviruses may remain to be discovered in vertebrate taxa. We demonstrate a history of frequent horizontal interorder transmissions from a rodent reservoir and suggest that rats may have acted as important overlooked facilitators of gammaretrovirus spread across diverse mammalian hosts. Together, these results demonstrate the promise of the methodology used here to analyze large ERV datasets and improve understanding of retroviral evolution and diversity for utilization in wider applications.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence
  • Disease Transmission, Infectious
  • Evolution, Molecular*
  • Genetic Variation*
  • Host-Pathogen Interactions / genetics*
  • Mice
  • Molecular Sequence Data
  • Phylogeny*
  • Rats
  • Retroviridae / genetics*
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Species Specificity
  • Vertebrates / genetics*