Conservation of the sizes of 53 introns and over 100 intronic sequences for the binding of common transcription factors in the human and mouse genes for type II procollagen (COL2A1)

Biochem J. 1995 Jun 15;308 ( Pt 3)(Pt 3):923-9. doi: 10.1042/bj3080923.

Abstract

Over 11,000 bp of previously undefined sequences of the human COL2A1 gene were defined. The results made it possible to compare the intron structures of a highly complex gene from man and mouse. Surprisingly, the sizes of the 53 introns of the two genes were highly conserved with a mean difference of 13%. After alignment of the sequences, 69% of the intron sequences were identical. The introns contained consensus sequences for the binding of over 100 different transcription factors that were conserved in the introns of the two genes. The first intron of the gene contained 80 conserved consensus sequences and the remaining 52 introns of the gene contained 106 conserved sequences for the binding of transcription factors. The 5'-end of intron 2 in both genes had a potential for forming a stem loop in RNA transcripts.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Base Sequence
  • Binding Sites / genetics
  • Chickens
  • Conserved Sequence / genetics*
  • DNA-Binding Proteins / metabolism
  • Electronic Data Processing
  • Exons / genetics
  • Humans
  • Introns / genetics
  • Mice
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • Procollagen / genetics*
  • Sequence Alignment
  • Software
  • Transcription Factors / chemistry*
  • Transcription Factors / genetics

Substances

  • DNA-Binding Proteins
  • Procollagen
  • Transcription Factors

Associated data

  • GENBANK/L10347