Motivation: Single-cell/nuclei RNA-sequencing (scRNA-seq) technologies can simultaneously quantify gene expression in thousands of cells across the genome. However, the majority of the noncoding RNAs, such as microRNAs (miRNAs), cannot currently be profiled at the same scale. MiRNAs are a class of small noncoding RNAs and play an important role in gene regulation. MiRNAs originate from the processing of primary transcripts, known as primary-microRNAs (pri-miRNAs). The pri-miRNA transcripts, independent of their cognate miRNAs, can also function as long noncoding RNAs, code for micropeptides or even interact with DNA, acting like enhancers. Therefore, it is apparent that the significance of scRNA-seq pri-miRNA profiling expands beyond using pri-miRNA as proxies of mature miRNAs. However, there are no computational methods that allow profiling and quantification of pri-miRNAs at the single-cell-type resolution.
Results: We have developed a simple yet effective computational framework to profile pri-MiRNAs from single-cell RNA-sequencing datasets (PPMS). Based on user input, PPMS can profile pri-miRNAs at cell-type resolution. PPMS can be applied to both newly produced and publicly available datasets obtained via single cell or single-nuclei RNA-seq. It allows users to (i) investigate the distribution of pri-miRNAs across cell types and cell states and (ii) establish a relationship between the number of cells/reads sequenced and the detection of pri-miRNAs. Here, to demonstrate its efficacy, we have applied PPMS to publicly available scRNA-seq data generated from (i) individual chambers (ventricles and atria) of the human heart, (ii) human pluripotent stem cells during their differentiation into cardiomyocytes (the heart beating cells) and (iii) hiPSCs-derived cardiomyocytes infected with severe acute respiratory syndrome coronavirus 2.
© The Author(s) 2022. Published by Oxford University Press.