A small NGS-SNP panel of ancestry inference designed to distinguish African, European, East, and South Asian populations

Electrophoresis. 2020 May;41(9):649-656. doi: 10.1002/elps.201900231. Epub 2020 Mar 24.

Abstract

In this study, a small set of ancestry-informative SNPs (AISNPs) was selected to differentiate African, European, East Asian, and South Asian samples and was genotyped by next-generation sequencing. A total of 127 Chinese Shaanxi Han individuals were collected as test samples. No statistically significant linkage disequilibrium between any pair of loci and no departure from Hardy-Weinberg equilibrium at any locus was observed in the test population. To evaluate the performance of ancestry assignment with this panel, admixture analysis, principal component analysis, and likelihood ratio calculations were conducted on the 1000 Genomes data and the test samples. All populations clustered into four groups (African, European, South Asian, and East Asian), consistent with their geographical origins. The pairwise fixation index (FST) between populations from different continental groups ranged from 0.140 to 0.621 (mean 0.415), whereas the pairwise FST between populations from the same continent ranged from 0.000 to 0.056 (mean 0.012). The likelihood ratio results for 125 test individuals indicated that their ancestry was most likely East Asian. In conclusion, this small set of ancestry-informative SNPs can be used as a reliable tool to identify and quantify the ancestry components of unknown samples.
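The abstract does not say how the likelihood ratios were computed. The sketch below shows one common way such a ratio could be formed: the probability of a multi-locus genotype under one candidate population divided by its probability under another, assuming Hardy-Weinberg proportions and independent loci (in line with the reported absence of HWE departures and linkage disequilibrium). The function names, the three-SNP genotype, and the allele frequencies are all hypothetical and are not taken from the study.

```python
import math

def genotype_log_likelihood(genotype, freqs):
    """Log-probability of a multi-locus genotype in one population.

    genotype: per-SNP count (0, 1, or 2) of the reference allele.
    freqs:    per-SNP reference-allele frequency in that population.
    Assumes Hardy-Weinberg proportions and independence between loci.
    """
    total = 0.0
    for g, p in zip(genotype, freqs):
        # Clamp frequencies so a fixed allele never yields log(0).
        p = min(max(p, 0.001), 0.999)
        q = 1.0 - p
        # HWE genotype probabilities: 2 copies -> p^2, 1 -> 2pq, 0 -> q^2.
        prob = (p * p, 2 * p * q, q * q)[2 - g]
        total += math.log(prob)
    return total

def likelihood_ratio(genotype, freqs_a, freqs_b):
    """Ratio P(genotype | population A) / P(genotype | population B)."""
    return math.exp(
        genotype_log_likelihood(genotype, freqs_a)
        - genotype_log_likelihood(genotype, freqs_b)
    )

# Hypothetical allele frequencies for three SNPs (illustrative only).
POP_A = [0.9, 0.8, 0.1]
POP_B = [0.2, 0.3, 0.7]
sample = [2, 2, 0]  # reference-allele counts at each SNP

lr = likelihood_ratio(sample, POP_A, POP_B)
print(f"LR (A vs B) = {lr:.1f}")
```

A ratio well above 1 favors population A as the source of the sample, and a ratio well below 1 favors population B. Working in log space keeps the product over many loci numerically stable.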

Keywords: AISNPs; Ancestry inference; NGS.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • China
  • Databases, Genetic
  • Ethnicity / classification
  • Ethnicity / genetics
  • Gene Frequency / genetics
  • Genetics, Population
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Polymorphism, Single Nucleotide / genetics*
  • Principal Component Analysis
  • Racial Groups* / classification
  • Racial Groups* / genetics