Orthology-based analysis helps map evolutionary diversification and predict substrate class use of BAHD acyltransferases

Plant J. 2022 Sep;111(5):1453-1468. doi: 10.1111/tpj.15902. Epub 2022 Jul 27.

Abstract

Large enzyme families catalyze metabolic diversification by virtue of their ability to use diverse chemical scaffolds. How enzyme families attain such functional diversity is not clear. Furthermore, duplication and promiscuity in such enzyme families limits their functional prediction, which has produced a burgeoning set of incompletely annotated genes in plant genomes. Here, we address these challenges using BAHD acyltransferases as a model. This fast-evolving family expanded drastically in land plants, increasing from one to five copies in algae to approximately 100 copies in diploid angiosperm genomes. Compilation of >160 published activities helped visualize the chemical space occupied by this family and define eight different classes based on structural similarities between acceptor substrates. Using orthologous groups (OGs) across 52 sequenced plant genomes, we developed a method to predict BAHD acceptor substrate class utilization as well as origins of individual BAHD OGs in plant evolution. This method was validated using six novel and 28 previously characterized enzymes and helped improve putative substrate class predictions for BAHDs in the tomato genome. Our results also revealed that while cuticular wax and lignin biosynthetic activities were more ancient, anthocyanin acylation activity was fixed in BAHDs later near the origin of angiosperms. The OG-based analysis enabled identification of signature motifs in anthocyanin-acylating BAHDs, whose importance was validated via molecular dynamic simulations, site-directed mutagenesis and kinetic assays. Our results not only describe how BAHDs contributed to evolution of multiple chemical phenotypes in the plant world but also propose a biocuration-enabled approach for improved functional annotation of plant enzyme families.

Keywords: BAHD acyltransferase; comparative genomics; enzyme family; evolutionary biochemistry; gene duplication; protein function; protein structure analysis.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Acyltransferases* / metabolism
  • Anthocyanins / metabolism
  • Genome, Plant / genetics
  • Phylogeny
  • Plants / metabolism
  • Solanum lycopersicum* / genetics

Substances

  • Anthocyanins
  • Acyltransferases