The use of exome capture RNA-seq for highly degraded RNA with application to clinical cancer sequencing

Genome Res. 2015 Sep;25(9):1372-81. doi: 10.1101/gr.189621.115. Epub 2015 Aug 7.

Abstract

RNA-seq by poly(A) selection is currently the most common protocol for whole transcriptome sequencing as it provides a broad, detailed, and accurate view of the RNA landscape. Unfortunately, the utility of poly(A) libraries is greatly limited when the input RNA is degraded, which is the norm for research tissues and clinical samples, especially when specimens are formalin-fixed. To facilitate the use of RNA sequencing beyond cell lines and in the clinical setting, we developed an exome-capture transcriptome protocol with greatly improved performance on degraded RNA. Capture transcriptome libraries enable measuring absolute and differential gene expression, calling genetic variants, and detecting gene fusions. Through validation against gold-standard poly(A) and Ribo-Zero libraries from intact RNA, we show that capture RNA-seq provides accurate and unbiased estimates of RNA abundance, uniform transcript coverage, and broad dynamic range. Unlike poly(A) selection and Ribo-Zero depletion, capture libraries retain these qualities regardless of RNA quality and provide excellent data from clinical specimens including formalin-fixed paraffin-embedded (FFPE) blocks. Systematic improvements across key applications of RNA-seq are shown on a cohort of prostate cancer patients and a set of clinical FFPE samples. Further, we demonstrate the utility of capture RNA-seq libraries in a patient with a highly malignant solitary fibrous tumor (SFT) enrolled in our clinical sequencing program called MI-ONCOSEQ. Capture transcriptome profiling from FFPE revealed two oncogenic fusions: the pathognomonic NAB2-STAT6 inversion and a therapeutically actionable BRAF fusion, which may drive this specific cancer's aggressive phenotype.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Line, Tumor
  • Exome*
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic
  • Gene Library
  • Genomics / methods
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • Neoplasms / genetics*
  • Neoplasms / pathology
  • Oncogene Proteins, Fusion / genetics
  • RNA Stability*
  • Reproducibility of Results
  • Sequence Analysis, RNA*
  • Transcriptome

Substances

  • Oncogene Proteins, Fusion

Associated data

  • GEO/GSE64113