Motif-blind, genome-wide discovery of cis-regulatory modules in Drosophila and mouse

Dev Cell. 2009 Oct;17(4):568-79. doi: 10.1016/j.devcel.2009.09.002.

Abstract

We present new approaches to cis-regulatory module (CRM) discovery in the common scenario where relevant transcription factors and/or motifs are unknown. Beginning with a small list of CRMs mediating a common gene expression pattern, we search genome-wide for CRMs with similar functionality, using new statistical scores and without requiring known motifs or accurate motif discovery. We cross-validate our predictions on 31 regulatory networks in Drosophila and through correlations with gene expression data. Five predicted modules tested using an in vivo reporter gene assay all show tissue-specific regulatory activity. We also demonstrate our methods' ability to predict mammalian tissue-specific enhancers. Finally, we predict human CRMs that regulate early blood and cardiovascular development. In vivo transgenic mouse analysis of two predicted CRMs demonstrates that both have appropriate enhancer activity. Overall, 7/7 predictions were validated successfully in vivo, demonstrating the effectiveness of our approach for insect and mammalian genomes.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Animals, Genetically Modified
  • Computational Biology*
  • Drosophila / embryology
  • Drosophila / genetics*
  • Enhancer Elements, Genetic
  • Genome*
  • Heart / embryology
  • Hematopoietic Stem Cells / physiology*
  • Humans
  • Mice / embryology
  • Mice / genetics*
  • Regulatory Elements, Transcriptional / genetics*
  • Species Specificity
  • Transcription Factors / genetics

Substances

  • Transcription Factors