TriTan: an efficient triple nonnegative matrix factorization method for integrative analysis of single-cell multiomics data

Brief Bioinform. 2024 Nov 22;26(1):bbae615. doi: 10.1093/bib/bbae615.

Abstract

Single-cell multiomics have opened up tremendous opportunities for understanding gene regulatory networks underlying cell states by simultaneously profiling transcriptomes, epigenomes, and proteomes of the same cell. However, existing computational methods for integrative analysis of these high-dimensional multiomics data are either computationally expensive or limited in interpretation. These limitations pose challenges in the implementation of these methods in large-scale studies and hinder a more in-depth understanding of the underlying regulatory mechanisms. Here, we propose TriTan (Triple inTegrative fast non-negative matrix factorization), an efficient joint factorization method for single-cell multiomics data. TriTan implements a highly efficient factorization algorithm, greatly improving its computational performance. Three matrix factorization produced by TriTan helps in clustering cells, identifying signature features for each cell type, and uncovering feature associations across omics, which facilitates the identification of domains of regulatory chromatin and the prediction of cell-type-specific regulatory networks. We applied TriTan to the single-cell multiomics data obtained from different technologies and benchmarked it against the state-of-the-art methods where it shows highly competitive performance. Furthermore, we showed a range of downstream analyses conducted utilizing TriTan outputs, highlighting its capacity to facilitate interpretation in biological discovery.

Keywords: gene regulation; machine learning; multi-omics; single cell.

MeSH terms

  • Algorithms*
  • Cluster Analysis
  • Computational Biology / methods
  • Gene Expression Profiling / methods
  • Gene Regulatory Networks
  • Humans
  • Multiomics
  • Single-Cell Analysis* / methods
  • Transcriptome