SCMarker: Ab initio marker selection for single cell transcriptome profiling

PLoS Comput Biol. 2019 Oct 28;15(10):e1007445. doi: 10.1371/journal.pcbi.1007445. eCollection 2019 Oct.

Abstract

Single-cell RNA-sequencing data generated by a variety of technologies, such as Drop-seq and SMART-seq, can reveal simultaneously the mRNA transcript levels of thousands of genes in thousands of cells. It is often important to identify informative genes or cell-type-discriminative markers to reduce dimensionality and achieve informative cell typing results. We present an ab initio method that performs unsupervised marker selection by identifying genes that have subpopulation-discriminative expression levels and are co- or mutually-exclusively expressed with other genes. Consistent improvements in cell-type classification and biologically meaningful marker selection are achieved by applying SCMarker on various datasets in multiple tissue types, followed by a variety of clustering algorithms. The source code of SCMarker is publicly available at https://github.com/KChen-lab/SCMarker.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Base Sequence / genetics
  • Biomarkers
  • Cluster Analysis
  • Computational Biology / methods*
  • Gene Expression Profiling / methods*
  • Humans
  • RNA / genetics
  • Sequence Analysis, RNA / methods
  • Single-Cell Analysis / methods*
  • Software
  • Transcriptome / genetics

Substances

  • Biomarkers
  • RNA