A detailed transcription map of the 320-kb region containing the HSD17B locus on chromosome 17 was generated. Thirty unique cDNA fragments, retrieved following the hybridization of immobilized YACs to primary pools of cDNAs prepared from RNA of mammary gland, ovary, placenta, and the Caco-2 cell line, were aligned into 10 transcription units by physical mapping and hybridization to RNAs of a series of tissues. The cDNAs were then further characterized by sequencing and used to screen mammary gland cDNA libraries. Fragments corresponding to the broadly expressed gamma-tubulin and Ki antigen genes were identified. A full-length cDNA clone encoding a 117-amino-acid protein homologous to the rat ribosomal protein L34 was isolated. Portions of genes with restricted patterns of expression were also obtained, including the previously characterized HSD17B1. One new gene, for which a full-length cDNA was isolated, was found to have a tissue-specific pattern of expression, with abundant mRNA in the colon, the testis, and the mammary carcinoma cell line BT-474. This contrasted with the barely detectable levels observed in several other tissues, including normal mammary gland. Of the five additional transcription units identified, one showed no similarity to known sequences, two showed identity to human expressed sequences, and two displayed similarity to genes of animal species by amino acid alignment. These latter cDNA clones include potential homologues of a rat nuclear tyrosine phosphatase and of a Drosophila factor known to be involved in the negative regulation of transcription of segment identity genes.