RepeatCraft: a meta-pipeline for repetitive element de-fragmentation and annotation

Wai Yee Wong; Oleg Simakov

doi:10.1093/bioinformatics/bty745

RepeatCraft: a meta-pipeline for repetitive element de-fragmentation and annotation

Bioinformatics. 2019 Mar 15;35(6):1051-1052. doi: 10.1093/bioinformatics/bty745.

Authors

Wai Yee Wong¹, Oleg Simakov¹

Affiliation

¹ Department of Molecular Evolution and Development, Faculty of Life Science, University of Vienna, A-1090 Vienna, Austria.

Abstract

Summary: Repetitive elements comprise large proportion of many genomes. They have impact on both genome evolution and regulation. Their classification and the study of evolutionary history is a major emerging field. Various software exist to-date to classify and map repeats across genomes. The major unresolved drawback, however, is the fragmented nature of many identified repeat loci. This ultimately makes the classification of novel repeats and their evolutionary analyses difficult. To improve on this, we developed a pipeline (RepeatCraft) that integrates results from several repeat element classification tools based on both sequence similarity and structural features. The pipeline de-fragments closely spaced repeat loci in the genomes, reconstructing longer copies, thus allowing for a better annotation and sequence comparisons. The pipeline also includes a user interface that can run in a web browser allowing for an easy access and exploration of the repeat data.

Availability and implementation: RepeatCraft is implemented in Python and the web application is implemented in R. Download and documentation is freely available at https://github.com/niccw/repeatCraftp.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Genome
Genomics*
Software*
Web Browser

Grants and funding

P 30686/FWF_/Austrian Science Fund FWF/Austria