The 4.4-kilodalton proline-rich polypeptides of the rat ventral prostate are the proteolytic products of a 637-kilodalton protein displaying highly repetitive sequences and encoded in a single exon

J Biol Chem. 1992 May 15;267(14):9884-94.

Abstract

We have determined the complete sequence of the 637-kilodalton precursor for the proline-rich polypeptides (PRPs). This protein is encoded in one large exon of a single copy gene. The acidic precursor of 5761 residues comprises a signal peptide and three large domains displaying a high proline content (11-15%). The sequence of domain A (928 residues) is unique and contains several small clusters of acidic amino acids. Domain B (830 residues) exhibits seven tandem repeats, four of them displaying a strongly diverged sequence. In domain C (3914 residues) 39 units, of which only 8 are degenerate, occur in a tandem repeat. Their sequence of 100 amino acids shows a high structural similarity (76-92%) and contains all the PRP variants which are produced by specific proteolytic processing. The COOH-terminal part (35 residues) is basic. Two variant PRP-precursor alleles occur which slightly differ in the number of repeats in domain C. The high degree of sequence conservation within the repeat regions suggests that the gene presumably evolved by multiple amplification and dispersion of two internal segments. In the 5097-base pair genomic region 5' upstream from the translation start, several control elements for transcription are recognized. A potential binding site for the Sp1 factor (GGGCGG) separated by 47 nucleotides from an initiator motif, most probably elements of the promoter, is detected in the vicinity of the ATG codon. Several putative androgen response elements (TGTYCT) are found in the 5' adjacent region and far upstream two Alu type III repeats and two (CA)n repeats are located. These results provide the basis for a detailed study of the androgen-regulated and tissue-specific expression of the PRP-precursor gene.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Biological Evolution
  • Cloning, Molecular
  • DNA / genetics
  • DNA / isolation & purification
  • Endopeptidases / metabolism
  • Exons*
  • Gene Library
  • Genes
  • Male
  • Molecular Sequence Data
  • Molecular Weight
  • Peptides / chemistry
  • Peptides / genetics*
  • Proline-Rich Protein Domains
  • Prostate / metabolism*
  • RNA, Messenger / genetics
  • Rats
  • Repetitive Sequences, Nucleic Acid
  • Restriction Mapping
  • Sequence Homology, Nucleic Acid

Substances

  • Peptides
  • RNA, Messenger
  • DNA
  • Endopeptidases

Associated data

  • GENBANK/M74319
  • GENBANK/M74320
  • GENBANK/M74321
  • GENBANK/M74322
  • GENBANK/M74323
  • GENBANK/M74324
  • GENBANK/M76378
  • GENBANK/M85301
  • GENBANK/M86514
  • GENBANK/M86526