The genome and population genomics of allopolyploid Coffea arabica reveal the diversification history of modern coffee cultivars

Nat Genet. 2024 Apr;56(4):721-731. doi: 10.1038/s41588-024-01695-w. Epub 2024 Apr 15.

Abstract

Coffea arabica, an allotetraploid hybrid of Coffea eugenioides and Coffea canephora, is the source of approximately 60% of coffee products worldwide, and its cultivated accessions have undergone several population bottlenecks. We present chromosome-level assemblies of a di-haploid C. arabica accession and modern representatives of its diploid progenitors, C. eugenioides and C. canephora. The three species exhibit largely conserved genome structures between diploid parents and descendant subgenomes, with no obvious global subgenome dominance. We find evidence for a founding polyploidy event 350,000-610,000 years ago, followed by several pre-domestication bottlenecks, resulting in narrow genetic variation. A split between wild accessions and cultivar progenitors occurred ~30.5 thousand years ago, followed by a period of migration between the two populations. Analysis of modern varieties, including lines historically introgressed with C. canephora, highlights their breeding histories and loci that may contribute to pathogen resistance, laying the groundwork for future genomics-based breeding of C. arabica.

MeSH terms

  • Coffea* / genetics
  • Coffee
  • Genome, Plant / genetics
  • Metagenomics
  • Plant Breeding

Substances

  • Coffee