The Harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes

Genome Biol. 2014;15(11):524. doi: 10.1186/s13059-014-0524-x.

Abstract

Whole-genome sequences are now available for many microbial species and clades, however existing whole-genome alignment methods are limited in their ability to perform sequence comparisons of multiple sequences simultaneously. Here we present the Harvest suite of core-genome alignment and visualization tools for the rapid and simultaneous analysis of thousands of intraspecific microbial strains. Harvest includes Parsnp, a fast core-genome multi-aligner, and Gingr, a dynamic visual platform. Together they provide interactive core-genome alignments, variant calls, recombination detection, and phylogenetic trees. Using simulated and real data we demonstrate that our approach exhibits unrivaled speed while maintaining the accuracy of existing methods. The Harvest suite is open-source and freely available from: http://github.com/marbl/harvest.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Bacteria / genetics*
  • Genome, Bacterial / genetics*
  • Phylogeny*
  • Polymorphism, Single Nucleotide
  • Sequence Alignment*
  • Sequence Analysis, DNA
  • Software