Background: Sequencing and annotating genomes of non-model organisms helps to understand genome architecture, the genetic processes underlying species traits, and how these genes have evolved in closely-related taxa, among many other biological processes. However, many metazoan groups, such as the extremely diverse molluscs, are still underrepresented in the number of sequenced and annotated genomes. Although sequencing techniques have recently improved in quality and quantity, molluscs are still neglected due to difficulties in applying standardized protocols for obtaining genomic data.
Results: In this study, we present the chromosome-level genome assembly and annotation of the sacoglossan sea slug species Elysia timida, known for its ability to store the chloroplasts of its food algae. In particular, by optimizing the long-read and chromosome conformation capture library preparations, the genome assembly was performed using PacBio HiFi and Arima HiC data. The scaffold and contig N50s, at 41.8 Mb and 1.92 Mb, respectively, are approximately 30-fold and fourfold higher compared to other published sacoglossan genome assemblies. Structural annotation resulted in 19,904 protein-coding genes, which are more contiguous and complete compared to publicly available annotations of Sacoglossa with respect to metazoan BUSCOs. We found no evidence for horizontal gene transfer (HGT), i.e. no photosynthetic genes encoded in the sacoglossan nucleus genome. However, we detected genes encoding polyketide synthases in E. timida, indicating that polypropionates are produced. HPLC-MS/MS analysis confirmed the presence of a large number of polypropionates, including known and yet uncharacterised compounds.
Conclusions: We can show that our methodological approach helps to obtain a high-quality genome assembly even for a "difficult-to-sequence" organism, which may facilitate genome sequencing in molluscs. This will enable a better understanding of complex biological processes in molluscs, such as functional kleptoplasty in Sacoglossa, by significantly improving the quality of genome assemblies and annotations.
Keywords: Elysia timida; Arima HiC; Biosynthesis; High-quality reference genome; Kleptoplasty; Mollusca; PacBio HiFi; Polyketides; Sacoglossa.
© 2024. The Author(s).