Scale-invariant structure of strongly conserved sequence in genomic intersections and alignments

Proc Natl Acad Sci U S A. 2006 Aug 29;103(35):13121-5. doi: 10.1073/pnas.0605735103. Epub 2006 Aug 21.

Abstract

The lengths of perfectly conserved sequences from mouse/human whole-genome intersection and alignment are shown to follow a power-law distribution. Spatial correlations of these elements within the mouse genome are studied. It is argued that these power-law distributions and correlations consist in part of functional noncoding sequence and ought to be accounted for when estimating the statistical significance of apparent sequence conservation. These inter-genomic correlations of conservation are placed in the context of previously observed intra-genomic correlations, and their possible origins and consequences are discussed.
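
As an illustration of the kind of measurement described, the sketch below collects the lengths of perfectly conserved runs in a toy pairwise alignment and estimates a power-law exponent from them. It is not the authors' procedure: the function names, the `-` gap convention, the cutoff `l_min`, and the continuous-approximation maximum-likelihood estimator are all assumptions made for this example.

```python
import math
from itertools import groupby


def conserved_run_lengths(seq_a, seq_b):
    """Lengths of maximal runs of identical, ungapped columns.

    Assumes two rows of a pairwise alignment with '-' marking gaps
    (an illustrative convention, not taken from the paper).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    # A column counts as conserved if both bases agree and neither is a gap.
    matches = [a == b and a != "-" for a, b in zip(seq_a, seq_b)]
    return [
        sum(1 for _ in group)
        for is_match, group in groupby(matches)
        if is_match
    ]


def power_law_exponent(lengths, l_min=1):
    """Estimate alpha in p(L) ~ L**(-alpha) for L >= l_min.

    Uses the common continuous approximation for integer data,
    alpha = 1 + n / sum(ln(L_i / (l_min - 0.5))), chosen here
    only for brevity.
    """
    if l_min < 1:
        raise ValueError("l_min must be at least 1")
    tail = [x for x in lengths if x >= l_min]
    if not tail:
        raise ValueError("no run lengths at or above l_min")
    log_sum = sum(math.log(x / (l_min - 0.5)) for x in tail)
    return 1.0 + len(tail) / log_sum


# Toy alignment: one mismatch and one gap column split the matching runs.
runs = conserved_run_lengths("ACGT-ACGTT", "ACGA-ACGTT")
print(runs)  # [3, 5]
```

On real whole-genome data the estimate depends strongly on the choice of `l_min` and on how repeats are handled, so a fitted exponent from a sketch like this should be read as exploratory only.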

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Base Sequence
  • Cluster Analysis
  • Conserved Sequence / genetics*
  • Genome / genetics*
  • Genomics
  • Humans
  • Mice
  • Repetitive Sequences, Nucleic Acid / genetics
  • Sequence Alignment*