The exon structure of the mouse alpha 2(IX) collagen gene shows unexpected divergence from the chick gene

J Biol Chem. 1994 Feb 18;269(7):5064-71.

Abstract

One cosmid and two overlapping phage clones covering the entire mouse alpha 2(IX) collagen gene including 12 kilobase pairs (kb) of 5'- and 8 kb of 3'-flanking sequences were isolated from two genomic libraries. The overall gene structure was determined by restriction mapping and nucleotide sequencing. The gene spans 16 kb from the start of transcription to the polyadenylation site and contains 32 exons. It codes for a mRNA of 3 kb that translates into a polypeptide of 688 amino acids. The intron-exon junctions and mRNA structure were confirmed by amplification of cDNA made for mouse cartilage RNA. The coding sequence of the mouse alpha 2(IX) collagen gene shows marked similarities to those for other type IX collagen chains. Although the overall exon-intron organization of the mouse gene is very similar to the chick alpha 2(IX) gene, some unexpected differences were observed at the splice junctions. Split codons characteristic for the central triple helical domain of the chick were not found in the mouse gene that thus exhibited a long stretch of exons with sizes that are multiples of 9 base pairs in this domain. The promoter of the mouse alpha 2(IX) collagen gene contains some G + C-rich elements including three Sp1 consensus recognition sites and a far upstream CCAAT box but no TATAA box. Both primer extension and RNase protection assays revealed several transcription start sites within 418 base pairs of the promoter. The present study reports the first complete nucleotide sequence of any type IX collagen gene and forms the basis for comparative structural studies on this collagen type and for experiments involving transgenic mice.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Chickens / genetics*
  • Collagen / genetics*
  • Consensus Sequence
  • DNA Primers
  • Exons*
  • Genes, Regulator*
  • Genetic Variation*
  • Genomic Library
  • Humans
  • Introns*
  • Liver / metabolism
  • Mice
  • Mice, Inbred BALB C / genetics*
  • Molecular Sequence Data
  • Polymerase Chain Reaction
  • Restriction Mapping
  • Sequence Homology, Amino Acid

Substances

  • DNA Primers
  • Collagen

Associated data

  • GENBANK/Z22923