DeepFace: Deep-learning-based framework to contextualize orofacial-cleft-related variants during human embryonic craniofacial development

HGG Adv. 2024 Jul 18;5(3):100312. doi: 10.1016/j.xhgg.2024.100312. Epub 2024 May 24.

Abstract

Orofacial clefts (OFCs) are among the most common human congenital birth defects. Previous multiethnic studies have identified dozens of associated loci for both cleft lip with or without cleft palate (CL/P) and cleft palate alone (CP). Although several nearby genes have been highlighted, the "casual" variants are largely unknown. Here, we developed DeepFace, a convolutional neural network model, to assess the functional impact of variants by SNP activity difference (SAD) scores. The DeepFace model is trained with 204 epigenomic assays from crucial human embryonic craniofacial developmental stages of post-conception week (pcw) 4 to pcw 10. The Pearson correlation coefficient between the predicted and actual values for 12 epigenetic features achieved a median range of 0.50-0.83. Specifically, our model revealed that SNPs significantly associated with OFCs tended to exhibit higher SAD scores across various variant categories compared to less related groups, indicating a context-specific impact of OFC-related SNPs. Notably, we identified six SNPs with a significant linear relationship to SAD scores throughout developmental progression, suggesting that these SNPs could play a temporal regulatory role. Furthermore, our cell-type specificity analysis pinpointed the trophoblast cell as having the highest enrichment of risk signals associated with OFCs. Overall, DeepFace can harness distal regulatory signals from extensive epigenomic assays, offering new perspectives for prioritizing OFC variants using contextualized functional genomic features. We expect DeepFace to be instrumental in accessing and predicting the regulatory roles of variants associated with OFCs, and the model can be extended to study other complex diseases or traits.

Keywords: SNP activity difference prediction; convolutional neural network; epigenomic assay; genome-wide association studies; human embryonic craniofacial development; noncoding variant; orofacial clefts; variant function.

MeSH terms

  • Cleft Lip* / embryology
  • Cleft Lip* / genetics
  • Cleft Palate* / embryology
  • Cleft Palate* / genetics
  • Deep Learning*
  • Embryonic Development / genetics
  • Epigenomics / methods
  • Humans
  • Neural Networks, Computer
  • Polymorphism, Single Nucleotide*