Statistical analysis of MPSS measurements: application to the study of LPS-activated macrophage gene expression

Proc Natl Acad Sci U S A. 2005 Feb 1;102(5):1402-7. doi: 10.1073/pnas.0406555102. Epub 2005 Jan 24.

Abstract

Massively Parallel Signature Sequencing (MPSS), a recently developed high-throughput transcription profiling technology, can profile almost every transcript in a sample without requiring prior knowledge of the sequence of the transcribed genes. As with DNA microarrays, effective data analysis depends crucially on understanding how noise affects measurements. We analyze the sources of noise in MPSS and present a quantitative model describing the variability between replicate MPSS assays. We use this model to construct statistical hypothesis tests of whether an observed change in gene expression in a pairwise comparison is significant. We then extend this analysis to determine the significance of changes in expression levels measured over a time series. We apply these analytic techniques to a time series of MPSS gene expression measurements on LPS-stimulated macrophages. To evaluate our statistical significance metrics, we compare our results with published data on macrophage activation measured with Affymetrix GeneChips.
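
The abstract does not spell out the noise model, so the following is only a generic illustration of what a pairwise significance test on signature counts can look like. It is a minimal sketch that assumes purely binomial sampling noise and applies a pooled two-proportion z-test; it is not the model developed in the paper, and it would overstate significance wherever replicate variability exceeds counting noise. The function name and the example counts are hypothetical.

```python
import math


def two_proportion_z(count_a, total_a, count_b, total_b):
    """Pooled two-proportion z-test for one signature in two samples.

    Assumption (not from the paper): counts follow binomial sampling,
    with no additional replicate-to-replicate noise.
    Returns (z, two_sided_p).
    """
    p_pool = (count_a + count_b) / (total_a + total_b)
    se = math.sqrt(p_pool * (1.0 - p_pool) * (1.0 / total_a + 1.0 / total_b))
    if se == 0.0:
        # Signature absent (or saturating) in both samples: no evidence of change.
        return 0.0, 1.0
    z = (count_a / total_a - count_b / total_b) / se
    # Two-sided p-value under the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p


# Hypothetical counts: one signature seen 50 times in 1,000,000 sequenced
# signatures before stimulation and 150 times in 1,000,000 afterwards.
z, p = two_proportion_z(50, 1_000_000, 150, 1_000_000)
print(f"z = {z:.2f}, p = {p:.3g}")
```

A test of this kind treats each signature independently; applying it across thousands of signatures would also require a multiple-testing correction, which this sketch omits.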

MeSH terms

  • Base Sequence*
  • Breast Neoplasms
  • Cell Line, Tumor
  • Cells, Cultured
  • Cluster Analysis
  • DNA, Complementary / chemistry
  • Female
  • Gene Expression Regulation / physiology*
  • Humans
  • Lipopolysaccharides / pharmacology*
  • Macrophage Activation / drug effects
  • Macrophage Activation / physiology*
  • Macrophages / drug effects
  • Macrophages / physiology*
  • Models, Genetic
  • Oligonucleotide Array Sequence Analysis / methods*
  • Reproducibility of Results

Substances

  • DNA, Complementary
  • Lipopolysaccharides