Amplification of prolamin storage protein genes in different subfamilies of the Poaceae
- Cite this article as:
- Xu, JH. & Messing, J. Theor Appl Genet (2009) 119: 1397. doi:10.1007/s00122-009-1143-x
Prolamins are seed storage proteins in cereals and represent an important source of essential amino acids for feed and food. The genes encoding these proteins arose through dispersed and tandem amplification. While previous studies have concentrated on protein sequences from different grass species, we can now add a new perspective on their relationships by asking how their genes are shared by ancestry and copied in different lineages of the same family of species. These differences are derived from alignments of chromosomal regions, where collinearity is used to identify prolamin genes in syntenic positions, also called orthologous gene copies. New, or paralogous, gene copies are inserted in tandem or at new locations in the same genome. More importantly, one can detect the loss of older genes. We analyzed chromosomal intervals containing prolamin genes from rice, sorghum, wheat, barley, and Brachypodium, representing different subfamilies of the Poaceae. The Poaceae, commonly known as the grasses, include three major subfamilies: the Ehrhartoideae (rice), Pooideae (wheat, barley, and Brachypodium), and Panicoideae (millets, maize, sorghum, and switchgrass). Based on chromosomal position and sequence divergence, it becomes possible to infer the order of gene amplification events. Furthermore, the loss of older genes in different subfamilies seems to permit a faster pace of divergence of paralogous genes. Changes in protein structure affect the physical properties, subcellular location, and amino acid composition of the prolamins. On the other hand, regulatory sequence elements and the corresponding transcriptional activators of new gene copies are more conserved than coding sequences, consistent with the tissue-specific expression of these genes.
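The collinearity criterion described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name, the gene names, and the two-label scheme are all hypothetical, and the sketch assumes that the genes flanking a prolamin gene in a reference interval already have known orthologs (anchors) in the target interval. A copy lying between the two anchors is treated as a candidate orthologous (syntenic) copy; any other copy is treated as a candidate paralog at a new location.

```python
from typing import Dict, List, Tuple


def classify_copies(
    ref_flanks: Tuple[str, str],   # genes flanking the prolamin gene in the reference
    target_order: List[str],       # gene order along the target chromosomal interval
    anchor_map: Dict[str, str],    # reference flanking gene -> its ortholog in the target
    copies: List[str],             # candidate prolamin copies found in the target genome
) -> Dict[str, str]:
    """Label each copy 'syntenic' or 'non-syntenic' (hypothetical labels)."""
    # Position of every gene along the target interval.
    pos = {gene: i for i, gene in enumerate(target_order)}

    # Locate the orthologs of both flanking genes in the target interval.
    left = pos.get(anchor_map.get(ref_flanks[0]))
    right = pos.get(anchor_map.get(ref_flanks[1]))
    if left is None or right is None:
        raise ValueError("both flanking anchors need an ortholog in the target interval")

    # Sorting the anchor positions tolerates an inverted interval.
    lo, hi = sorted((left, right))

    labels = {}
    for copy in copies:
        i = pos.get(copy)  # None when the copy lies outside this interval
        in_interval = i is not None and lo < i < hi
        labels[copy] = "syntenic" if in_interval else "non-syntenic"
    return labels


# Hypothetical example: p1 sits between the anchors a1 and a2, p2 does not,
# and p3 is located outside the interval altogether.
labels = classify_copies(
    ref_flanks=("A1", "A2"),
    target_order=["a1", "p1", "a2", "x", "p2"],
    anchor_map={"A1": "a1", "A2": "a2"},
    copies=["p1", "p2", "p3"],
)
print(labels)
```

If no copy receives the syntenic label, the interval would be a candidate for loss of the older gene in that lineage. Separating tandem from dispersed paralogs, or ordering amplification events, would additionally require copy adjacency and sequence divergence, which this sketch leaves out.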