Identification of allele-specific KIV-2 repeats and impact on Lp(a) measurements for cardiovascular disease risk

bioRxiv [Preprint]. 2023 Apr 27:2023.04.24.538128. doi: 10.1101/2023.04.24.538128.

Abstract

The abundance of Lp(a) protein holds significant implications for the risk of cardiovascular disease (CVD), which is directly impacted by the copy number (CN) of KIV-2, a 5.5 kbp sub-region. KIV-2 is highly polymorphic in the population and accurate analysis is challenging. In this study, we present the DRAGEN KIV-2 CN caller, which utilizes short reads. Data across 166 WGS show that the caller has high accuracy, compared to optical mapping and can further phase ~50% of the samples. We compared KIV-2 CN numbers to 24 previously postulated KIV-2 relevant SNVs, revealing that many are ineffective predictors of KIV-2 copy number. Population studies, including USA-based cohorts, showed distinct KIV-2 CN, distributions for European-, African-, and Hispanic-American populations and further underscored the limitations of SNV predictors. We demonstrate that the CN estimates correlate significantly with the available Lp(a) protein levels and that phasing is highly important.

Keywords: Illumina; LPA; Next Generation sequencing; cardiovascular disease; copy number variation.

Publication types

  • Preprint