Chromosome-scale genome assembly of Codonopsis pilosula and comparative genomic analyses shed light on its genome evolution

Front Plant Sci. 2024 Nov 4:15:1469375. doi: 10.3389/fpls.2024.1469375. eCollection 2024.

Abstract

Introduction: Codonopsis pilosula is a significant plant in traditional Chinese medicine, valued for its edible and medicinal properties. However, the lack of available genomic resources has hindered further research.

Methods: This study presents the first chromosome-scale genome assembly of C. pilosula using PacBio CLR reads and Hi-C scaffolding technology. Additionally, Ks analysis and syntenic depth analysis were performed to elucidate its evolutionary history.

Results: The final assembly yielded a high-quality genome of 679.20 Mb, which was anchored to 8 pseudo-chromosomes with an anchoring rate of 96.5% and a scaffold N50 of 80.50 Mb. The genome assembly showed a high completeness of 97.6% based on Benchmarking with Universal Single-Copy Orthologs (BUSCO) analysis. Repetitive elements constituted approximately 76.8% of the genome, with long terminal repeat retrotransposons (LTRs) accounting for about 39.17%. Ks and syntenic depth analyses revealed that the polyploidization history of three platycodonoid clade species involved only the γ-WGT event. Karyotype evolutionary analysis identified an ancestral karyotype with 9 protochromosomes for the three platycodonoid clade species. Moreover, non-WGD genes, particularly those arising from tandem duplications, were found to contribute significantly to gene family expansion.

Discussion: These findings provide essential insights into the genetic diversity and evolutionary biology of C. pilosula, aiding its conservation and sustainable use.

Keywords: Codonopsis pilosula; assembly; comparative genomics; gene family; genome evolution.

Associated data

  • figshare/10.6084/m9.figshare.26799025

Grants and funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was supported by National Key R&D Program of China from the Ministry of Science and Technology of China (grant No. 2019YFC1711100) and Digitalization and Utilization of Biological Resources (grant No. 202002AA100007).