Integrative analyses for omics data: a Bayesian mixture model to assess the concordance of ChIP-chip and ChIP-seq measurements

J Toxicol Environ Health A. 2012;75(8-10):461-70. doi: 10.1080/15287394.2012.674914.

Abstract

The analysis of different variations in genomics, transcriptomics, epigenomics, and proteomics has increased considerably in recent years. This is especially due to the success of microarray and, more recently, sequencing technology. Apart from understanding mechanisms of disease pathogenesis on a molecular basis, for example in cancer research, the challenge of analyzing such different data types in an integrated way has become increasingly important also for the validation of new sequencing technologies with maximum resolution. For this purpose, a methodological framework for their comparison with microarray techniques in the context of smallest sample sizes, which result from the high costs of experiments, is proposed in this contribution. Based on an adaptation of the externally centered correlation coefficient ( Schäfer et al. 2009 ), it is demonstrated how a Bayesian mixture model can be applied to compare and classify measurements of histone acetylation that stem from chromatin immunoprecipitation combined with either microarray (ChIP-chip) or sequencing techniques (ChIP-seq) for the identification of DNA fragments. Here, the murine hematopoietic cell line 32D, which was transduced with the oncogene BCR-ABL, the hallmark of chronic myeloid leukemia, was characterized. Cells were compared to mock-transduced cells as control. Activation or inhibition of other genes by histone modifications induced by the oncogene is considered critical in such a context for the understanding of the disease.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Bayes Theorem
  • Capillary Electrochromatography
  • Chromatin Immunoprecipitation
  • DNA / chemistry
  • DNA / genetics
  • Data Interpretation, Statistical
  • Epigenomics / methods*
  • Epigenomics / statistics & numerical data
  • Fusion Proteins, bcr-abl / genetics
  • Genomics / methods*
  • Genomics / statistics & numerical data
  • Hematopoietic Stem Cells / metabolism
  • Histones / genetics
  • Histones / metabolism
  • Leukemia, Myelogenous, Chronic, BCR-ABL Positive / genetics
  • Markov Chains
  • Mice
  • Microarray Analysis
  • Models, Statistical
  • Monte Carlo Method
  • Oncogenes / genetics
  • Proteomics / methods*
  • Proteomics / statistics & numerical data
  • Sample Size
  • Sequence Analysis, DNA / methods
  • Sequence Analysis, DNA / statistics & numerical data*
  • Transduction, Genetic

Substances

  • Histones
  • DNA
  • Fusion Proteins, bcr-abl