SKESA: strategic k-mer extension for scrupulous assemblies

Genome Biol. 2018 Oct 4;19(1):153. doi: 10.1186/s13059-018-1540-z.

Abstract

SKESA is a DeBruijn graph-based de-novo assembler designed for assembling reads of microbial genomes sequenced using Illumina. Comparison with SPAdes and MegaHit shows that SKESA produces assemblies that have high sequence quality and contiguity, handles low-level contamination in reads, is fast, and produces an identical assembly for the same input when assembled multiple times with the same or different compute resources. SKESA has been used for assembling over 272,000 read sets in the Sequence Read Archive at NCBI and for real-time pathogen detection. Source code for SKESA is freely available at https://github.com/ncbi/SKESA/releases .

Keywords: Contamination; De-novo assembly; DeBruijn graphs; Illumina reads; Sequence quality.

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Algorithms
  • Base Pairing / genetics
  • Base Sequence
  • Sequence Analysis, DNA / methods*
  • Software*
  • Time Factors