CanSNPer: a hierarchical genotype classifier of clonal pathogens

Adrian Lärkeryd; Kerstin Myrtennäs; Edvin Karlsson; Chinmay Kumar Dwibedi; Mats Forsman; Pär Larsson; Anders Johansson; Andreas Sjödin

doi:10.1093/bioinformatics/btu113

CanSNPer: a hierarchical genotype classifier of clonal pathogens

Bioinformatics. 2014 Jun 15;30(12):1762-4. doi: 10.1093/bioinformatics/btu113. Epub 2014 Feb 25.

Authors

Adrian Lärkeryd¹, Kerstin Myrtennäs¹, Edvin Karlsson¹, Chinmay Kumar Dwibedi², Mats Forsman¹, Pär Larsson¹, Anders Johansson¹, Andreas Sjödin²

Affiliations

¹ Department of Clinical Microbiology, Umeå University, Division of CBRN Security and Defence, FOI, Swedish Defence Research Agency, Department of Clinical Microbiology, The Laboratory for Molecular Infection Medicine Sweden (MIMS) and Department of Chemistry, Computational Life Science Cluster (CLiC), Umeå University, Umeå, Sweden.
² Department of Clinical Microbiology, Umeå University, Division of CBRN Security and Defence, FOI, Swedish Defence Research Agency, Department of Clinical Microbiology, The Laboratory for Molecular Infection Medicine Sweden (MIMS) and Department of Chemistry, Computational Life Science Cluster (CLiC), Umeå University, Umeå, SwedenDepartment of Clinical Microbiology, Umeå University, Division of CBRN Security and Defence, FOI, Swedish Defence Research Agency, Department of Clinical Microbiology, The Laboratory for Molecular Infection Medicine Sweden (MIMS) and Department of Chemistry, Computational Life Science Cluster (CLiC), Umeå University, Umeå, Sweden.

PMID: 24574113
DOI: 10.1093/bioinformatics/btu113

Abstract

Summary: Advances in typing methodologies have recently reformed the field of molecular epidemiology of pathogens. The falling cost of sequencing technologies is creating a deluge of whole genome sequencing data that burdens bioinformatics resources and tool development. In particular, single nucleotide polymorphisms in core genomes of pathogens are recognized as the most important markers for inferring genetic relationships because they are evolutionarily stable and amenable to high-throughput detection methods. Sequence data will provide an excellent opportunity to extend our understanding of infectious disease when the challenge of extracting knowledge from available sequence resources is met. Here, we present an efficient and user-friendly genotype classification pipeline, CanSNPer, based on an easily expandable database of predefined canonical single nucleotide polymorphisms.

Availability and implementation: All documentation and Python-based source code for the CanSNPer are freely available at http://github.com/adrlar/CanSNPer.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Genomics / methods
Genotype
Genotyping Techniques / methods*
Molecular Typing / methods*
Polymorphism, Single Nucleotide
Software*