Conserved globulin gene across eight grass genomes identify fundamental units of the loci encoding seed storage proteins

Funct Integr Genomics. 2010 Mar;10(1):111-22. doi: 10.1007/s10142-009-0135-x. Epub 2009 Aug 26.

Abstract

The wheat high molecular weight (HMW) glutenins are important seed storage proteins that determine bread-making quality in hexaploid wheat (Triticum aestivum). In this study, detailed comparative sequence analyses of large orthologous HMW glutenin genomic regions from eight grass species, representing a wide evolutionary history of grass genomes, reveal a number of lineage-specific sequence changes. These lineage-specific changes, which resulted in duplications, insertions, and deletions of genes, are the major forces disrupting gene colinearity among grass genomes. Our results indicate that the presence of the HMW glutenin gene in Triticeae genomes was caused by lineage-specific duplication of a globulin gene. This tandem duplication event is shared by Brachypodium and Triticeae genomes, but is absent in rice, maize, and sorghum, suggesting the duplication occurred after Brachypodium and Triticeae genomes diverged from the other grasses ~35 Ma ago. Aside from their physical location in tandem, the sequence similarity, expression pattern, and conserved cis-acting elements responsible for endosperm-specific expression further support the paralogous relationship between the HMW glutenin and globulin genes. While the duplicated copy in Brachypodium has apparently become nonfunctional, the duplicated copy in wheat has evolved to become the HMW glutenin gene by gaining a central prolamin repetitive domain.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Conserved Sequence*
  • DNA, Intergenic / genetics
  • Evolution, Molecular
  • Genes, Plant / genetics*
  • Genetic Loci / genetics*
  • Globulins / genetics*
  • Glutens / chemistry
  • Models, Genetic
  • Molecular Sequence Data
  • Molecular Weight
  • Poaceae / genetics*
  • Promoter Regions, Genetic / genetics
  • Seed Storage Proteins / genetics*
  • Sequence Homology, Nucleic Acid

Substances

  • DNA, Intergenic
  • Globulins
  • Seed Storage Proteins
  • Glutens
  • glutenin