Statistical methods for discrimination of STR genotypes using high resolution melt curve data

Int J Legal Med. 2024 Nov;138(6):2281-2288. doi: 10.1007/s00414-024-03289-x. Epub 2024 Jul 13.

Abstract

Despite the improvements in forensic DNA quantification methods that allow for the early detection of low template/challenged DNA samples, complicating stochastic effects are not revealed until the final stage of the DNA analysis workflow. An assay that would provide genotyping information at the earlier stage of quantification would allow examiners to make critical adjustments prior to STR amplification allowing for potentially exclusionary information to be immediately reported. Specifically, qPCR instruments often have dissociation curve and/or high-resolution melt curve (HRM) capabilities; this, coupled with statistical prediction analysis, could provide additional information regarding STR genotypes present. Thus, this study aimed to evaluate Qiagen's principal component analysis (PCA)-based ScreenClust® HRM® software and a linear discriminant analysis (LDA)-based technique for their abilities to accurately predict genotypes and similar groups of genotypes from HRM data. Melt curves from single source samples were generated from STR D5S818 and D18S51 amplicons using a Rotor-Gene® Q qPCR instrument and EvaGreen® intercalating dye. When used to predict D5S818 genotypes for unknown samples, LDA analysis outperformed the PCA-based method whether predictions were for individual genotypes (58.92% accuracy) or for geno-groups (81.00% accuracy). However, when a locus with increased heterogeneity was tested (D18S51), PCA-based prediction accuracy rates improved to rates similar to those obtained using LDA (45.10% and 63.46%, respectively). This study provides foundational data documenting the performance of prediction modeling for STR genotyping based on qPCR-HRM data. In order to expand the forensic applicability of this HRM assay, the method could be tested with a more commonly utilized qPCR platform.

Keywords: High resolution melt curves; Linear Discriminant Analysis (LDA); Prediction modeling; Principal component analysis (PCA); qPCR genotyping.

MeSH terms

  • DNA Fingerprinting* / methods
  • Discriminant Analysis
  • Genotype*
  • Humans
  • Microsatellite Repeats*
  • Principal Component Analysis*
  • Real-Time Polymerase Chain Reaction* / methods
  • Software