A reference quality, fully annotated diploid genome from a Saudi individual

Sci Data. 2024 Nov 23;11(1):1278. doi: 10.1038/s41597-024-04121-2.

Abstract

We sequenced the genome of a volunteer from Saudi Arabia using multiple sequencing approaches. We used the resulting data to generate a de novo assembly of the genome and applied different computational approaches to refine it. As a result, we provide a contiguous assembly, labeled KSA001, of the complete genome of an individual from Saudi Arabia, covering all chromosomes except chromosome Y. We transferred genome annotations from reference genomes to fully annotate KSA001, and we make all primary sequencing data, the assembly, and the genome annotations freely available in public databases in accordance with the FAIR data principles. KSA001 is the first telomere-to-telomere-assembled genome from a Saudi individual that is freely available for any purpose.

Publication types

  • Dataset

MeSH terms

  • Diploidy
  • Genome, Human*
  • Humans
  • Molecular Sequence Annotation
  • Saudi Arabia