Pharmacogenomics is a key component of personalized medicine that promises safer and more effective drug treatment by individualizing drug choice and dose based on genetic profiles. In clinical practice, genetic biomarkers are used to categorize patients into *-alleles to predict CYP450 enzyme activity and adjust drug dosages accordingly. However, this approach leaves a large part of variability in drug response unexplained. Here, we present a proof-of-concept approach that uses continuous-scale (instead of categorical) assignments to predict enzyme activity. We used full CYP2D6 gene sequences obtained with long-read amplicon-based sequencing and cytochrome P450 (CYP) 2D6-mediated tamoxifen metabolism data from a prospective study of 561 patients with breast cancer to train a neural network. The model explained 79% of interindividual variability in CYP2D6 activity compared to 54% with the conventional *-allele approach, assigned enzyme activities to known alleles with previously reported effects, and predicted the activity of previously uncharacterized combinations of variants. The results were replicated in an independent cohort of tamoxifen-treated patients (model adjusted R² = 0.66 versus *-allele adjusted R² = 0.35) and a cohort of patients treated with the CYP2D6 substrate venlafaxine (model adjusted R² = 0.64 versus *-allele adjusted R² = 0.55). Human embryonic kidney cells were used to confirm the effect of five genetic variants on metabolism of the CYP2D6 substrate bufuralol in vitro. These results demonstrate the advantage of a continuous scale and a completely phased genotype for prediction of CYP2D6 enzyme activity and could potentially enable more accurate prediction of individual drug response.
Copyright © 2021 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.