Classification, prediction, and verification of the regioselectivity of fungal polyketide synthase product template domains

J Biol Chem. 2010 Jul 23;285(30):22764-73. doi: 10.1074/jbc.M110.128504. Epub 2010 May 17.

Abstract

The fungal iterative nonreducing polyketide synthases (NRPKSs) synthesize aromatic polyketides, many of which have important biological activities. The product template domains (PT) embedded in the multidomain NRPKSs mediate the regioselective cyclization of the highly reactive polyketide backbones and dictate the final structures of the products. Understanding the sequence-activity relationships of different PT domains is therefore an important step toward the prediction of polyketide structures from NRPKS sequences and can enable the genome mining of hundreds of cryptic NRPKSs uncovered via genome sequencing. In this work, we first performed phylogenetic analysis of PT domains from NRPKSs of known functions and showed that the PT domains can be classified into five groups, with each group corresponding to a unique product size or cyclization regioselectivity. Group V contains the formerly unverified PT domains that were identified as C6-C11 aldol cyclases. The regioselectivity of PTs from this group were verified by product-based assays using the PT domain excised from the asperthecin AptA NRPKS. When combined with dissociated PKS4 minimal PKS, or replaced the endogenous PKS4 C2-C7 PT domain in a hybrid NRPKS, AptA-PT directed the C6-C11 cyclization of the nonaketide backbone to yield a tetracyclic pyranoanthraquinone 4. Extensive NMR analysis verified that the backbone of 4 was indeed cyclized with the expected regioselectivity. The PT phylogenetic analysis was then expanded to include approximately 100 PT sequences from unverified NRPKSs. Using the assays developed for AptA-PT, the regioselectivities of additional PT domains were investigated and matched to those predicted by the phylogenetic classifications.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Computational Biology*
  • Fungi / enzymology*
  • Macrolides / chemistry
  • Macrolides / metabolism
  • Phylogeny
  • Polyketide Synthases / chemistry*
  • Polyketide Synthases / classification
  • Polyketide Synthases / metabolism*
  • Protein Structure, Tertiary
  • Reproducibility of Results
  • Stereoisomerism
  • Structure-Activity Relationship
  • Substrate Specificity

Substances

  • Macrolides
  • Polyketide Synthases