Comparative Genomics Analysis of Repetitive Elements in Ten Gymnosperm Species: "Dark Repeatome" and Its Abundance in Conifer and Gnetum Species

Avi Titievsky; Yuliya A Putintseva; Elizaveta A Taranenko; Sofya Baskin; Natalia V Oreshkova; Elia Brodsky; Alexandra V Sharova; Vadim V Sharov; Julia Panov; Dmitry A Kuzmin; Leonid Brodsky; Konstantin V Krutovsky

doi:10.3390/life11111234

Life (Basel). 2021 Nov 15;11(11):1234. doi: 10.3390/life11111234.

Authors

Avi Titievsky¹, Yuliya A Putintseva², Elizaveta A Taranenko¹ ², Sofya Baskin¹, Natalia V Oreshkova² ³ ⁴ ⁵, Elia Brodsky⁶, Alexandra V Sharova¹ ⁷, Vadim V Sharov¹ ³ ⁷, Julia Panov¹, Dmitry A Kuzmin⁷, Leonid Brodsky¹, Konstantin V Krutovsky² ⁴ ⁵ ⁸ ⁹ ¹⁰

Affiliations

¹ Tauber Bioinformatics Research Center, University of Haifa, Haifa 3498838, Israel.
² Laboratory of Forest Genomics, Genome Research and Education Center, Institute of Fundamental Biology and Biotechnology, Siberian Federal University, 660036 Krasnoyarsk, Russia.
³ Laboratory of Genomic Research and Biotechnology, Federal Research Center "Krasnoyarsk Science Center of the Siberian Branch of the Russian Academy of Sciences", 660036 Krasnoyarsk, Russia.
⁴ Department of Genomics and Bioinformatics, Institute of Fundamental Biology and Biotechnology, Siberian Federal University, 660074 Krasnoyarsk, Russia.
⁵ Scientific and Methodological Center, G. F. Morozov Voronezh State University of Forestry and Technologies, 394087 Voronezh, Russia.
⁶ Pine Biotech Inc., New Orleans, LA 70112, USA.
⁷ Department of High Performance Computing, Institute of Space and Information Technologies, Siberian Federal University, 660074 Krasnoyarsk, Russia.
⁸ Department of Forest Genetics and Forest Tree Breeding, Georg-August University of Göttingen, 37077 Göttingen, Germany.
⁹ Center for Integrated Breeding Research, Georg-August University of Göttingen, 37075 Göttingen, Germany.
¹⁰ Laboratory of Population Genetics, N. I. Vavilov Institute of General Genetics, Russian Academy of Sciences, 119333 Moscow, Russia.

Abstract

Repetitive elements (RE) and transposons (TE) can comprise up to 80% of some plant genomes and may be essential for regulating their evolution and adaptation. The "repeatome" information is often unavailable in assembled genomes because repeat-rich genomic regions are challenging to assemble and are often missing from the final assembly. However, raw genomic sequencing data contain rich information about RE/TEs. Here, raw genomic NGS reads of 10 gymnosperm species were studied for the content and abundance patterns of their "repeatome". We combined alignment against databases of repetitive elements with de novo assembly of highly repetitive sequences from genomic sequencing reads to characterize and calculate the abundance of known and putative repetitive elements in the genomes of 10 gymnosperm species: Pinus taeda, Pinus sylvestris, Pinus sibirica, Picea glauca, Picea abies, Abies sibirica, Larix sibirica, Juniperus communis, Taxus baccata, and Gnetum gnemon. We found that genome abundances of known and newly discovered putative repeats are specific to phylogenetically close groups of species and match biological taxa. The grouping of species based on abundances of known repeats closely matches the grouping based on abundances of newly discovered putative repeats (kChains), and both match the known taxonomic relations.

Keywords: gymnosperms; principal component analysis; repetitive elements.
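
The abstract describes grouping species by repeat abundance, and the keywords name principal component analysis. As a rough illustration only, the sketch below shows one way such a grouping might be computed from a species-by-repeat count matrix. The function name `pca_on_abundance`, the reads-per-million normalization, the log transform, and the toy numbers are all assumptions made for this example; they are not taken from the paper and do not reproduce the authors' pipeline.

```python
import numpy as np


def pca_on_abundance(counts, total_reads, n_components=2):
    """Project species onto principal components of repeat abundance.

    counts: (n_species, n_repeats) reads assigned to each repeat (hypothetical input).
    total_reads: (n_species,) total sequenced reads per species.
    """
    counts = np.asarray(counts, dtype=float)
    total_reads = np.asarray(total_reads, dtype=float)
    # Assumed normalization: reads per million, so sequencing depth does not dominate.
    rpm = counts / total_reads[:, None] * 1e6
    # Assumed transform: log1p dampens the effect of a few very abundant repeats.
    logged = np.log1p(rpm)
    # Center each repeat (column) before PCA.
    centered = logged - logged.mean(axis=0)
    # SVD of the centered matrix yields the principal axes and species scores.
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    scores = u[:, :n_components] * s[:n_components]
    explained = (s ** 2) / np.sum(s ** 2)
    return scores, explained[:n_components]


# Synthetic toy values (not from the paper): 4 hypothetical species, 3 repeats.
toy_counts = [[900, 100, 50], [850, 120, 40], [100, 900, 300], [120, 950, 280]]
toy_totals = [1e6, 1e6, 1e6, 1e6]
scores, explained = pca_on_abundance(toy_counts, toy_totals)
```

In this toy input the first two rows share one abundance profile and the last two share another, so the first principal component separates the two pairs. With real data, the same projection could be computed separately for known repeats and for putative repeats and the resulting groupings compared.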

Grants and funding