Quantifying transcriptome diversity: a review

Brief Funct Genomics. 2024 Mar 20;23(2):83-94. doi: 10.1093/bfgp/elad019.

Abstract

Following the central dogma of molecular biology, gene expression heterogeneity can aid in predicting and explaining the wide variety of protein products, functions and, ultimately, heterogeneity in phenotypes. There is currently overlapping terminology used to describe the types of diversity in gene expression profiles, and overlooking these nuances can misrepresent important biological information. Here, we describe transcriptome diversity as a measure of the heterogeneity in (1) the expression of all genes within a sample or a single gene across samples in a population (gene-level diversity) or (2) the isoform-specific expression of a given gene (isoform-level diversity). We first overview modulators and quantification of transcriptome diversity at the gene level. Then, we discuss the role alternative splicing plays in driving transcript isoform-level diversity and how it can be quantified. Additionally, we overview computational resources for calculating gene-level and isoform-level diversity for high-throughput sequencing data. Finally, we discuss future applications of transcriptome diversity. This review provides a comprehensive overview of how gene expression diversity arises, and how measuring it determines a more complete picture of heterogeneity across proteins, cells, tissues, organisms and species.

Keywords: gene expression; gene-level diversity; isoform-level diversity; transcript diversity; transcriptional variation; transcriptome diversity.

Publication types

  • Review

MeSH terms

  • Alternative Splicing / genetics
  • Gene Expression Profiling*
  • High-Throughput Nucleotide Sequencing
  • Protein Isoforms / genetics
  • Sequence Analysis, RNA
  • Transcriptome* / genetics

Substances

  • Protein Isoforms