Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns

Kawaura, Kanako; Mochida, Keiichi; Enju, Akiko; Totoki, Yasushi; Toyoda, Atsushi; Sakaki, Yoshiyuki; Kai, Chikatoshi; Kawai, Jun; Hayashizaki, Yoshihide; Seki, Motoaki; Shinozaki, Kazuo; Ogihara, Yasunari

doi:10.1186/1471-2164-10-271

Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns

Research article
Open access
Published: 18 June 2009

Volume 10, article number 271, (2009)
Cite this article

Download PDF

You have full access to this open access article

BMC Genomics Aims and scope Submit manuscript

Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns

Download PDF

Kanako Kawaura¹,
Keiichi Mochida²,
Akiko Enju²,
Yasushi Totoki³,
Atsushi Toyoda³,
Yoshiyuki Sakaki³,
Chikatoshi Kai³,
Jun Kawai³,
Yoshihide Hayashizaki³,
Motoaki Seki²,
Kazuo Shinozaki² &
…
Yasunari Ogihara¹

9733 Accesses
40 Citations
6 Altmetric
Explore all metrics

Abstract

Background

Wheat is an allopolyploid plant that harbors a huge, complex genome. Therefore, accumulation of expressed sequence tags (ESTs) for wheat is becoming particularly important for functional genomics and molecular breeding. We prepared a comprehensive collection of ESTs from the various tissues that develop during the wheat life cycle and from tissues subjected to stress. We also examined their expression profiles in silico. As full-length cDNAs are indispensable to certify the collected ESTs and annotate the genes in the wheat genome, we performed a systematic survey and sequencing of the full-length cDNA clones. This sequence information is a valuable genetic resource for functional genomics and will enable carrying out comparative genomics in cereals.

Results

As part of the functional genomics and development of genomic wheat resources, we have generated a collection of full-length cDNAs from common wheat. By grouping the ESTs of recombinant clones randomly selected from the full-length cDNA library, we were able to sequence 6,162 independent clones with high accuracy. About 10% of the clones were wheat-unique genes, without any counterparts within the DNA database. Wheat clones that showed high homology to those of rice were selected in order to investigate their expression patterns in various tissues throughout the wheat life cycle and in response to abiotic-stress treatments. To assess the variability of genes that have evolved differently in wheat and rice, we calculated the substitution rate (Ka/Ks) of the counterparts in wheat and rice. Genes that were preferentially expressed in certain tissues or treatments had higher Ka/Ks values than those in other tissues and treatments, which suggests that the genes with the higher variability expressed in these tissues is under adaptive selection.

Conclusion

We have generated a high-quality full-length cDNA resource for common wheat, which is essential for continuation of the ongoing curation and annotation of the wheat genome. The data for each clone's expression in various tissues and stress treatments and its variability in wheat and rice as a result of their diversification are valuable tools for functional genomics in wheat and for comparative genomics in cereals.

Genome-wide identification of the MED25 BINDING RING-H2 PROTEIN gene family in foxtail millet (Setaria italica L.) and the role of SiMBR2 in resistance to abiotic stress in Arabidopsis

Article 07 June 2024

High-density mapping of durable and broad-spectrum stripe rust resistance gene Yr30 in wheat

Article 08 June 2024

The first high-quality genome assembly and annotation of Lantana camara, an important ornamental plant and a major invasive species

Article Open access 10 May 2024

Background

Wheat is mainly cultivated in temperate zones and is one of the world's main staple foods. Wheat is polyploid and common wheat is an allohexaploid that has three homoeologous genomes. These genomes have been designated as A, B and D, with the coding regions of the homoeologous genes sharing more than 90% homology. Due to the huge genome size of the hexaploid (17 Gbp) [1] along with the high content of repeat sequences, it is very difficult to carry out complete sequencing of the entire genome or to perform forward genetics in polyploid wheat. Although common wheat genetic maps with molecular markers and cytological maps with deletion mutants of the chromosome segments have been constructed [2, 3], the number of mapped DNA markers for both map-based cloning and anchoring of the genome positions remains restricted. Comparison of wheat and rice genomic sequences to corresponding full-length cDNAs can provide information on exon-intron boundaries. With this information, it should be possible to set up primers for PCR-based markers such as the simple-sequence repeats [4]. An international effort has been made to determine the DNA sequence of the entire chromosome of group 3 in bread wheat and in its ancestor, Aegilops tauschii (the D genome donor) [5]. However, transformation of wheat is still difficult, and tagged lines with transposable elements or T-DNA are not yet available. While tools for forward genetics have proven insufficient for wheat, a recent report [6] suggested that RNA interference can suppress the action of three homoeologous genes, even in polyploid wheat. Therefore, reverse genetic approaches based on functional genomics might be quite useful. We thus have been collecting expressed sequence tags (ESTs) from single limited strains of common wheat [7, 8]. Simultaneous efforts from both our own and other laboratories have generated data on more than one million wheat ESTs http://www.ncbi.nlm.nih.gov./dbEST. With the recent innovations that have been developed for the systematic collection and in silico display of comprehensive ESTs from a number of tissues, including abiotic-stressed tissues [8], it has become possible to characterize the expression profiles of target genes in these particular tissues and treatments.

Sequence-verified full-length cDNA clones with high accuracy that harbor protein coding sequences are critical for advances in structural, functional and comparative genomic studies. Using full-length cDNA sequence data, the protein-coding regions in the genome can be precisely predicted. In the wheat genome, gene annotation using full-length cDNA sequence data is essential, because there are many repetitive sequences and retrotransposons that can cause confusion when trying to predict the gene regions in genomic sequences [see examples in reference [9]]. In addition, functional annotation should be more reliable for converted amino acid sequence (coding sequence or CDS) predicted from full-length cDNA sequence data. Employing CDS data should enable inference of functional roles based on gene ontology from model plants such as rice and Arabidopsis (InterPro: http://www.ebi.ac.uk/interpro/) [10].

CDS data can also be used to search for counterparts in related plant species. Wheat and rice both belong to the grass family, the Gramineae, and CDS data can reveal characteristic breeding behavior and the ability to adapt to environmental conditions that led to their diversification more than 50 million years ago [11]. Wheat, which originated in temperate zones, grows on dry land, is a long-day plant, has a shortened rachis, accumulates gluten-rich flour, and its germination is promoted in response to cold temperatures [12]. In contrast, rice, which is cultivated in tropical or subtropical areas, grows in water-rich conditions, is a short-day plant, has an elongated rachis, accumulates storage proteins via two types of protein bodies, and its germination is promoted in response to high temperatures [13]. Although it has long been theorized that selection pressure controls the distinct growth habits of wheat and rice, there has been no systematic investigation on the evolutionary rates of change for a genome-wide set of growth habit-related genes for these plants. The main reason for this is that DNA sequence information for the wheat genome is still limited.

Here we report on the construction of a full-length cDNA library for Chinese Spring wheat and its resulting quality as a starting resource for the complete sequencing of 6,162 independent full-length cDNA clones. We describe the overall characteristics of these full-length cDNA clones and their annotation compared to other model plants. From these clones, we selected 3,487 genes for which expression patterns could be traced with EST data in 28 tissues that are either observed during the wheat life cycle or were also stress-treated [7, 8]. Based on hierarchical cluster classification of expression pattern, the substitution rates (Ka/Ks) of genes common to wheat and rice were calculated in order to assess the variability of the gene grouping in each cluster. Genes preferentially expressed in certain tissues or stress treatments showed higher Ka/Ks values, suggesting that molecular selection occurred during the diversification of wheat and rice.

Results and Discussion

Sequence determination and functional annotation of full-length wheat cDNAs

The CAP-trapper method [14, 15] was used to construct a full-length cDNA library from pooled RNAs derived from the common wheat (Triticum aestivum cv. Chinese Spring). The library included 17 tissues that are either formed during the wheat life cycle or were subjected to an abiotic stress (Table 1). A total of 19,968 randomly selected recombinant clones from the full-length cDNA library have been sequenced from both ends. By grouping these one-path sequences, we obtained 7,149 independent gene clusters, which made it possible to group the homoeogenes expressed from each of the three common wheat genomes (A, B and D genomes) [16]. The largest clones were representative of each gene cluster and harbored the CDSs. After they were rearrayed, we were able to determine the full-length sequences of 6,162 clones. Figure 1 presents a detailed description of the sequence data for the full-length cDNA clones and the size distribution of the inserted DNAs. The full-length wheat cDNA data were used to carry out a blast homology search versus the public DNA database. About 10% of the clones were determined to be wheat-specific genes without any counterparts in the DNA database. Molecular functions of these sequenced cDNAs were predicted using the InterPro database [10]. The distribution of the wheat genes that were classified into various categories of the InterPro database was similar to that for rice and Arabidopsis. These results indicate that the selection of full-length wheat cDNAs from the pooled RNAs reflects their expression within the tissues (Figure 2).

Table 1 Wheat tissues used for construction of the full-length cDNA library

Full size table

Comparison of gene nucleotide substitution rates in coding regions in wheat and rice

To compare the gene nucleotide substitution rates of wheat and rice, we selected 4,321 wheat genes that had deduced amino acid sequences with more than 80% overlapping homology as determined by the blastx method. In contrast, when the homology search was carried out against the wheat EST database using selected rice genes as the query, the corresponding wheat genes had the highest ranking. In order to examine the expression profiles in various tissues of wheat, including abiotic-stressed tissues, we used blastn (E < 1e-60) to search the wheat EST database (MUGEST: http://www.shigen.nig.ac.jp/wheat/komugi/top/top.jsp) for corresponding ESTs of the 4,321 wheat cDNAs described in references 7 and 8. We selected 3,487 wheat genes to compare their amino acid sequences in wheat and rice (Additional file 1). In addition, we were also able to trace their expression profiles in 28 normal or stress-treated tissues of common wheat (Table 2). Since the most common way to estimate selection constraints on protein evolution is using the ratio of the nonsynonymous (Ka) and synonymous (Ks) substitution rates [17], we calculated the Ka/Ks ratio for each gene to compare the values for the wheat and rice counterparts. The Ka/Ks value varied from 1.3 to 0, with the average being 0.2363 (Figure 3). The likely occurrence of adaptive evolution for a given gene is indicated by a Ka/Ks value greater than 1 [17]. Out of the 3,487 wheat genes, 12 had Ka/Ks values greater than 1. Although the identity of most of these 12 genes is yet unknown, five have been annotated, including one gene as involved in lipid transfer, one with homology to heat shock protein HSP20, one to proteinase inhibitor, one to thionin and one to an RNA binding protein. Genes for lipid transfer proteins and HSP20 have been demonstrated to be involved in stress responses, such as plant defense [18] and heat shock [19]. A proteinase inhibitor gene has been reported to regulate plant development [20] and drought tolerance [21]. Thionin genes are specifically expressed in the seed and control plant defense against Fusarium head blight [22, 23]. RNA binding protein also has a role in both the development of flowers and seeds [24]. Taken together, this evidence suggests that positive selection may have a role for beneficial functions of these genes. Therefore, to some extent these genes characterize the adaptation that must occur in order for growth to continue within the specific habitats for wheat and rice.

Table 2 Wheat tissues and treatments selected to evaluate expression patterns of full-length cDNAs using ESTs homologous to their gene counterparts

Full size table

Comparative and functional genomic analysis of full-length wheat cDNAs

In order to examine nucleotide substitution rates of the genes in relation to their expression patterns for the various tissues and stress treatments, global expression profiles of the 3,487 full-length cDNA genes in the 28 tissues and treatments were estimated using the full-length cDNAs as scaffolds to cluster ESTs. Expression patterns of the clustered wheat ESTs (contigs) have been previously reported [7, 8]. The number of EST constituents assigned to the 28 cDNA libraries was scored for each contig, which produced a two-way expression profile, i.e., contig vs. library. As seen in Figure 3, hierarchical clustering was constructed based on the EST constituent matrix [25]. To determine the counterparts of the 3,487 full-length cDNAs in the wheat ESTs, we performed a search with blastn (E < 1e-60) against the contigs [7, 8]. Based on the expression patterns of their counterpart contigs after hierarchical clustering, the 3,487 genes were classified into 30 major groups. In addition to the genes that were ubiquitously or nearly ubiquitously expressed in the 28 tissues and treatments (designated as ubiquitous or "ub"), we also classified the genes that were dominantly expressed in each of these 28 tissues and treatments (Figure 3 and Table 2). Out of the 28 cDNA libraries, 13 were assigned to a life cycle group, while 15 were assigned to the stressed-tissue group (Figure 3) [8]. During the clustering, a group was found that showed dominant expression in all of the stress-treated tissues (Figure 3). These genes were designated stress constant or "strc". The number of genes classified into each category ranged from 44 (heat shock) to 182 (dried roots), with a mean of 109.9.

In order to assess the relationship between the variability of the genes and their expression profiles, the Ka/Ks value of each gene was plotted against its expression pattern (listed above the expression profile of each contig in Figure 3). The mean Ka/Ks values for the genes classified in each group are shown in Figure 4. In addition, the Ka/Ks values of the life cycle and stressed-tissue groups are respectively displayed in Figure 4A and 4B. Analysis of variance indicated a significant difference in the Ka/Ks value between the groups. Notably, the Ka/Ks values of the "ub" and "strc" genes were significantly lower than the average of the 3,487 genes, which suggests a selective constraint on these subsets of the genes [26]. Genes preferentially expressed in the callus (Figure 4B) also had significantly lower Ka/Ks values than the average. On the other hand, the 35 genes expressed in six tissues and in the stressed tissues, namely, root (r), the spike at booting (o), the spike at flowering (f), the developing seed 30 days post-anthesis (DPA) (sl), seedlings subjected to a cold treatment (kv) and desiccated seedlings (vd), all had significantly higher mean Ka/Ks values compared to the overall average (Figure 4A, B). This shows that there was a loose selection constraint against these genes during the evolutionary divergence of wheat and rice. Table 3 lists the annotated genes that displayed higher Ka/Ks values (greater than 0.5) for the genes expressed in these six tissues and treatments (for further explanation, see Additional file 2). These six tissues and treatments characterize the growth habit or speciation requirements of wheat and rice. For example, rice roots are usually covered by water, which creates anaerobic conditions around the roots [27]. In contrast, since wheat plants prefer dry land, anaerobic conditions do not occur around the wheat roots [28]. Among the genes that display higher Ka/Ks values, genes encoding transcription factors [29, 30], beta-glucanase [31] and nodulin [32], which are regulated by abiotic as well as biotic stresses, were found. Additionally, radial oxygen loss is a critical marker for adaptation of roots to anaerobic conditions [33, 34]. Two genes related to radial oxygen loss had higher Ka/Ks values (Table 1 and Additional file 2). The evidence accumulating from these observations suggests that genes expressed in the roots that are related to signal perception or transduction, transcription regulation, or stress responses have all been exposed, to some extent, to the adaptive selection that occurred during the diversification of wheat and rice.

Table 3 Tissues and genes that showed higher variation as a result of wheat and rice diversification

Full size table

The spike at the booting stage of both wheat and rice, during which the critical process of meiosis occurs, generates the gametes [35, 36]. Among the four annotated genes in this tissue, two genes are related to lipid metabolism for cellular activity [37, 38] and two genes are associated with redox reactions in photosynthesis [39] and respiration [40]. All of these had relatively higher Ka/Ks values (Table 3 and Additional file 2), suggesting that the spike at the booting stage is under highly stressed conditions for dry (wheat) and moist (rice) habitats.

The spike that occurs at flowering supplies the field for pollination. Thus, genes that control compatibility for fertilization need to work at this particular stage [41]. Among the genes that were preferentially expressed at this stage, the RNA binding protein showed Ka/Ks values that were higher than 1, which suggests a key role in the pollination by the spikes during flowering [24]. The DNA binding histone-like protein that is specifically expressed in this tissue has a higher Ka/Ks value. Additionally, a methyltransferase involved in epigenetic regulation has highly diverged. Certain stress-related proteins such as glutathione S-transferase [42], osmotin [43, 44], pectate lyase [45] and glucan endo-1,3-beta-D-glucosidase [46] were also characteristically expressed in this tissue. Likewise, a gene encoding a receptor-like kinase is also uniquely expressed in this tissue. In addition, genes that encode pollen allergen-related protein [47, 48] and proteinase inhibitor [49] were expressed in the spike at flowering. All of these data suggest that selection for recognition between male and female gametes, modulation of nucleic acids, stress responses and signal transduction in the field during pollination might operate on these genes.

In response to internal and external environments, developing seeds at 30 DPA enter into dormancy [50, 51]. Genes for thionin 1 and 2, polysaccharide-related proteins, lipid transfer protein and ankyrins are all involved in controlling plant defense at this stage (Additional file 2). Stage-specific proteins that play substantial roles in the seeds, such as late embryo abundant proteins I and II [52], stress-related protein [53] and α-amylase [54], all showed higher Ka/Ks values during this part of the life cycle. This suggests that genes for signal perception during seed dormancy, along with those for stage-specific and stress-related conditions, might characterize the growth habits of wheat and rice in situations where they are adapting to environmental conditions.

Genes expressed in cold-treated and desiccated seedlings showed higher Ka/Ks values than the others. Genes found to be involved after these treatments are normally associated with cell membranes, cell walls, stress and senescence.

It is assumed that adaptive selection works on a large number of genes, some of which show rapid evolution. Unfortunately, in genes that have rapidly evolved, there is great divergence in the plants that have resulted from genetic divergence, which makes it difficult to trace their orthology. In the current study, after deducing the complete sequence of full-length cDNA clones, we compared coding sequences of common wheat genes to their rice counterparts. The expression patterns in various tissues during the wheat life cycle as well as in stress-treated tissues can be systematically monitored [8]. Thus, based on mean Ka/Ks values, it may be possible to infer the variability of genes in their sequences that will be expressed in their respective tissues or treatments to the plants. Genes ubiquitously expressed throughout the majority of the tissues or stress treatments exhibited lower Ka/Ks values, whereas significantly higher Ka/Ks values were noted both for the genes that were characteristically expressed in four tissues of the wheat life cycle (the root, the spike at the booting stage, the spike at flowering and the seed at 30 DPA), and for the genes expressed in response to the two stress treatments (seedlings undergoing cold treatment and desiccated seedlings). These findings of gene expression patterns in response to stresses are also reported for mammalian genes [26]. Therefore, to some extent, positive selection might play a role in the gene expression that occurs in response to environmental changes.

Conclusion

We have developed a resource of a large number of sequenced full-length cDNAs of common wheat that covers the majority of the functional annotations deduced from gene ontology of rice and Arabidopsis. This full-length wheat cDNA resource is indispensable for gene annotations in the wheat genome for which sequencing is still ongoing, and for the functional analysis of these genes and their products. The information presented here on the full-length wheat cDNA sequences, their variability during the evolution of cereals and their expression profiles in various tissues during both the life cycle and in response to stress treatments should facilitate functional genomics and genome breeding of wheat and other cereals.

Methods

Construction of full-length wheat cDNA library, DNA sequencing, and selection of independent cDNA clones

The CAP-trapper method [14, 15] was used to construct a full-length cDNA library from pooled RNAs derived from 17 samples of common wheat (Triticum aestivum cv. Chinese Spring) tissues, a combination of those formed during the wheat life cycle and those subjected to abiotic stresses (Table 1). Subsequently, 39,936 cDNA clones were randomly selected from the library and sequenced by a one-path method from both ends of the inserts. These DNA sequences were assembled with the Phrap method (University of Washington Genome Center; http://www.phrap.org) using the program new_ace-penalty-5 – mismatch 50 – minscore 100. To construct the gene clusters (E < 1e-30), the resultant contigs were clustered using the blastn method [55]. Representative clones from each gene cluster that was predicted to harbor CDSs were then rearrayed. Subsequently, 6,162 full-length cDNA clones were selected, and their inserts were completely sequenced using the primer walking method.

Data mining and characterization of full-length wheat cDNA clones

The wheat cDNAs were annotated using the blast method against the public database (E < 1e-5). Rice counterparts for each of the wheat cDNAs were selected from the public database when a blastx search determined that there was more than 80% overlapping homology in the deduced amino acids. The CDSs of the counterparts were automatically aligned. The alignments were needed to be corrected manually. Then, the Ka/Ks values [17] were calculated for the codons that corresponded between the wheat and rice homologues.

cDNA expression patterns were monitored using a tissue expression map of wheat, which included 28 cDNA libraries (Table 2). The blastn program (with E < 1e-60) was then used to search the full-length cDNAs for their correct counterparts in the wheat tissue expression map, with the full-length cDNAs used as scaffolds to cluster ESTs. Similarities between the resultant 3,487 cDNAs and the libraries were estimated using Pearson's correlation coefficient [25]. Hierarchical clustering http://rana.lbl.gov/eisen/ was applied in order to compare these cDNAs expression profiles among the 28 wheat libraries. Expression profiles were displayed based on the number of constituents in the cDNAs along with their Ka/Ks value.

Full-length cDNA data

The full-length cDNA data have been deposited in the DDBJ under accession nos. AK330135 to AK336296, and are available at http://www.shigen.nig.ac.jp/wheat/komugi/top/top.jsp.

References

Bennett MD, Leitch IJ: Nuclear DNA amounts in angiosperms. Ann Bot. 1995, 76: 113-176. 10.1006/anbo.1995.1085.
Article CAS Google Scholar
Xue S, Zhang Z, Lin F, Kong Z, Cao Y, Li C, Yi H, Mei M, Zhu H, Wu J, Xu H, Zhao D, Tian D, Zhang C, Ma Z: A high-density intervarietal map of the wheat genome enriched with markers derived from expressed sequence tags. Theor Appl Genet. 2008, 117: 181-189. 10.1007/s00122-008-0764-9.
Article CAS PubMed Google Scholar
Qi LL, Echalier B, Chao S, Lazo GR, Butler GE, Anderson OD, Akhunov ED, Dvorák J, Linkiewicz AM, Ratnasiri A, Dubcovsky J, Bermudez-Kandianis CE, Greene RA, Kantety R, La Rota CM, Munkvold JD, Sorrells SF, Sorrells ME, Dilbirligi M, Sidhu D, Erayman M, Randhawa HS, Sandhu D, Bondareva SN, Gill KS, Mahmoud AA, Ma XF, Miftahudin , Gustafson JP, Conley EJ, Nduati V, Gonzalez-Hernandez JL, Anderson JA, Peng JH, Lapitan NL, Hossain KG, Kalavacharla V, Kianian SF, Pathan MS, Zhang DS, Nguyen HT, Choi DW, Fenton RD, Close TJ, McGuire PE, Qualset CO, Gill BS: A chromosome bin map of 16,000 expressed sequence tag loci and distribution of genes among the three genomes of polyploid wheat. Genetics. 2004, 168: 701-712. 10.1534/genetics.104.034868.
Article PubMed Central CAS PubMed Google Scholar
Ishikawa G, Yonemaru J, Saito M, Nakamura T: PCR-based landmark unique gene (PLUG) markers effectively assign homoeologous wheat genes to A, B and D genomes. BMC Genomics. 2007, 8: 135-10.1186/1471-2164-8-135.
Article PubMed Central PubMed Google Scholar
Gill BS, Appels R, Botha-Oberholster A-M, Buell CR, Bennetzen JL, Chalhoub B, Chumley F, Dvorak J, Iwanaga M, Keller B, Li W, McCombie WR, Ogihara Y, Quetier F, Sasaki T: A workshop report on wheat genome sequencing: International genome research on wheat consortium. Genetics. 2004, 168: 1087-1096. 10.1534/genetics.104.034769.
Article PubMed Central PubMed Google Scholar
Fu D, Uauy C, Blechl A, Dubcovsky J: RNA interference for wheat functional gene analysis. Transgenic Res. 2007, 16: 689-701. 10.1007/s11248-007-9150-7.
Article CAS PubMed Google Scholar
Ogihara Y, Mochida K, Nemoto Y, Murai K, Yamazaki Y, Shin-I T, Kohara Y: Correlated clustering and virtual display of gene expression patterns in the wheat life cycle by large-scale statistical analyses of expressed sequence tags. Plant J. 2003, 33: 1001-1011. 10.1046/j.1365-313X.2003.01687.x.
Article PubMed Google Scholar
Mochida K, Kawaura K, Shimosaka E, Kawakami N, Shin-I T, Kohara Y, Yamazaki Y, Ogihara Y: Tissue expression map of comprehensive expressed sequence tags and its application to in silico screening of stress response genes in common wheat. Mol Genet Genomics. 2006, 276: 304-312. 10.1007/s00438-006-0120-1.
Article CAS PubMed Google Scholar
Salse J, Bolot S, Throude M, Jouffe V, Piegu B, Quraishi UM, Calcagno T, Cooke R, Delseny M, Feuillet C: Identification and characterization of shared duplications between rice and wheat provide new insight into grass genome evolution. Plant Cell. 2008, 20: 11-24. 10.1105/tpc.107.056309.
Article PubMed Central CAS PubMed Google Scholar
Apweiler R, Attwood TK, Bairoch A, Bateman A, Birney E, Biswas M, Bucher P, Cerutti L, Corpet F, Croning MD, Durbin R, Falquet L, Fleischmann W, Gouzy J, Hermjakob H, Hulo N, Jonassen I, Kahn D, Kanapin A, Karavidopoulou Y, Lopez R, Marx B, Mulder NJ, Oinn TM, Pagni M, Servant F, Sigrist CJ, Zdobnov EM: The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res. 2001, 29: 37-40. 10.1093/nar/29.1.37.
Article PubMed Central CAS PubMed Google Scholar
Gaut BS: Evolutionary dynamics of grass genomes. New Phytol. 2002, 154: 15-28. 10.1046/j.1469-8137.2002.00352.x.
Article CAS Google Scholar
Zohary D, Hopf M: Domestication of Plants in the Old World. 2001, Oxford University Press, New York
Google Scholar
Oka HI: Origin of Cultivated Rice. 1988, Elsevier Publications, Amsterdam
Google Scholar
Seki M, Narusaka M, Kamiya A, Ishida J, Satou M, Sakurai T, Nakajima M, Enju A, Akiyama K, Oono Y, Muramatsu M, Hayashizaki Y, Kawai J, Carninci P, Itoh M, Ishii Y, Arakawa T, Shibata K, Shinagawa A, Shinozaki K: Functional annotation of a full-length Arabidopsis cDNA collection. Science. 2002, 296: 141-145. 10.1126/science.1071006.
Article PubMed Google Scholar
Ogihara Y, Mochida K, Kawaura K, Murai K, Seki M, Kamiya A, Shinozaki K, Carninci P, Hayashizaki Y, Shin-I T, Kohara Y, Yamazaki Y: Construction of a full-length cDNA library from young spikelets of hexaploid wheat and its characterization by large-scale sequencing of expressed sequence tags. Genes Genet Syst. 2004, 79: 227-232. 10.1266/ggs.79.227.
Article PubMed Google Scholar
Mochida K, Yamazaki Y, Ogihara Y: Discernment of homoeologous gene expression in hexaploid wheat by SNP analysis of contigs grouped from a large number of expressed sequence tags. Mol Genet Genomics. 2003, 270: 371-377. 10.1007/s00438-003-0939-7.
Article CAS PubMed Google Scholar
Li WH: Unbiased estimation of the rates of synonymous and nonsynonymous substitutions. J Mol Evol. 1993, 36: 96-99. 10.1007/BF02407308.
Article CAS PubMed Google Scholar
Cheng CS, Samuel D, Liu YJ, Shyu JC, Lai SM, Liu KF, Lyu PC: Binding mechanism of nonspecific lipid transfer proteins and their role in plant defense. Biochemistry. 2004, 43: 13628-13636. 10.1021/bi048873j.
Article CAS PubMed Google Scholar
Swindell WR, Huebner M, Weber AP: Transcriptional profiling of Arabidopsis heat shock proteins and transcription factors reveals extensive overlap between heat and non-heat stress response pathways. BMC Genomics. 2007, 8: 125-10.1186/1471-2164-8-125.
Article PubMed Central PubMed Google Scholar
Wang J, Shi ZY, Wan XS, Sheu GZ, Zhang JL: The expression pattern of a rice proteinase inhibitor gene OsP18-1 implies its role in plant development. J Plant Physiol. 2008, 165: 1519-1529. 10.1016/j.jplph.2007.08.008.
Article CAS PubMed Google Scholar
Huang Y, Xiao B, Xiong L: Characterization of a stress responsive proteinase inhibitor gene with positive effect in improving drought resistance in rice. Planta. 2007, 226: 73-85. 10.1007/s00425-006-0469-8.
Article CAS PubMed Google Scholar
Poppenberger B, Berthiller F, Lucyshyn D, Sieberer T, Schuhmacher R, Krska R, Kuchlerk , Glössl J, Luschnig C, Adam G: Detoxification of the Fusarium mycotoxin deoxynivalenol by a UDP-glucosyltransferase from Arabidopsis thaliana. J Biol Chem. 2003, 278: 47905-47914. 10.1074/jbc.M307552200.
Article CAS PubMed Google Scholar
Sepulveda-Jimenez G, Rueda-Benitez P, Porta H, Rocha-Sosa M: A red beet (Beta vulgaris) UDP-glucosyl transferase gene induced by wounding, bacterial infiltration and oxidative stress. J Exp Bot. 2005, 56: 605-611. 10.1093/jxb/eri036.
Article CAS PubMed Google Scholar
Fusaro AF, Bocca SN, Ramos RL, Barroco RM, Magioli C, Jorge VC, Couthinho TC, Rangel-Lima CM, De Rycke R, Inze D, Engler G, Sachetto-Martins G: AtGRP2, a cold-induced nucleo-cytoplasmic RNA-binding protein, has a role in flower and seed development. Planta. 2007, 225: 1339-1351. 10.1007/s00425-006-0444-4.
Article CAS PubMed Google Scholar
Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA. 1998, 95: 14863-14868. 10.1073/pnas.95.25.14863.
Article PubMed Central CAS PubMed Google Scholar
Duret L, Mouchiroud D: Determinants of substitution rates in mammalian genes: Expression pattern affects selection intensity but not mutation rate. Mol Biol Evol. 2000, 17: 68-70.
Article CAS PubMed Google Scholar
Tsuji H, Saika H, Tsutsumi N, Hirai A, Nakazono M: Dynamic and reversible changes in histone H3-Lys4 methylation and H3 acetylation occurring at submergence-inducible genes in rice. Plant Cell Physiol. 2006, 47: 995-1003. 10.1093/pcp/pcj072.
Article CAS PubMed Google Scholar
Mohammadi M, Kav NN, Deyholos MK: Transcriptional profiling of hexaploid wheat (Triticum aestivum L.) roots identifies novel, dehydration-responsive genes. Plant Cell Environ. 2007, 30: 630-645. 10.1111/j.1365-3040.2007.01645.x.
Article CAS PubMed Google Scholar
Minglin L, Yuxiu Z, Tuanyao C: Identification of genes up-regulated in response to Cd exposure in Brassica juncea L. Gene. 2005, 363: 151-158. 10.1016/j.gene.2005.07.037.
Article PubMed Google Scholar
Zhang Y, Shih DS: Isolation of an osmotin-like protein gene from strawberry and analysis of the response of this gene to abiotic stresses. J Plant Physiol. 2007, 164: 68-77. 10.1016/j.jplph.2006.02.002.
Article CAS PubMed Google Scholar
Simmons CR, Litts JC, Huang N, Rodriguez RL: Structure of a rice beta-glucanase gene regulated by ethylene, cytokinin, wounding, salicylic acid and fungal elicitors. Plant Mol Biol. 1992, 18: 33-45. 10.1007/BF00018454.
Article CAS PubMed Google Scholar
Kottapalli KR, Kottapalli P, Agrawal GK, Kikuchi S, Rakwal R: Recessive bacterial leaf blight resistance in rice: complexity, challenges and strategy. Biochem Biophys Res Commun. 2007, 355: 295-301. 10.1016/j.bbrc.2007.01.134.
Article CAS PubMed Google Scholar
Colmer TD: Long-distance transport of gases in plants: a perspective on internal aeration and radial oxygen loss from roots. Plant Cell Environ. 2003, 26: 17-36. 10.1046/j.1365-3040.2003.00846.x.
Article CAS Google Scholar
Malik AI, Colmer TD, Lambers , Schortemeyer : Aerenchyma formation and radial O₂ loss along adventitious roots of wheat with only the apical root portion exposed to O₂ deficiency. Plant Cell Environ. 2003, 26: 1713-1722. 10.1046/j.1365-3040.2003.01089.x.
Article Google Scholar
Martinez-Perez E, Moore G: To check or not to check? The application of meiotic studies to plant breeding. Curr Opin Plant Biol. 2008, 11: 222-227.
Article PubMed Google Scholar
Nonomura KI, Nakano M, Fukuda T, Eguchi M, Miyao A, Hirochika H, Kurata N: The novel gene HOMOLOGOUS PAIRING ABERRATION IN RICE MEIOSIS 1 of rice encodes a putative coiled-coil protein required for homologous chromosome pairing in meiosis. Plant Cell. 2004, 16: 1008-1020. 10.1105/tpc.020701.
Article PubMed Central CAS PubMed Google Scholar
Clauss K, Baumert A, Nimtz M, Milkowski C, Strack D: Role of a GDSL lipase-like protein as sinapine esterase in Brassicaceae. Plant J. 2008, 53: 802-813. 10.1111/j.1365-313X.2007.03374.x.
Article CAS PubMed Google Scholar
Salas JJ, Youssar L, Martinez-Force E, Garces R: The biochemical characterization of a high-stearic acid sunflower mutant reveals the coordinated regulation of stearoyl-acyl carrier protein desaturases. Plant Physiol Biochem. 2008, 46: 109-116. 10.1016/j.plaphy.2007.10.007.
Article CAS PubMed Google Scholar
Marri L, Trost P, Trivelli X, Gonnelli L, Pupillo P, Sparla F: Spontaneous assembly of photosynthetic supramolecular complexes as mediated by the intrinsically unstructured protein CP12. J Biol Chem. 2008, 283: 1831-1838. 10.1074/jbc.M705650200.
Article CAS PubMed Google Scholar
Martsinkovskaya AI, Poghosyan ZP, Haralampidis K, Murphy DJ, Hatzopoulos P: Temporal and spatial gene expression of cytochrome B5 during flower and fruit development in olives. Plant Mol Biol. 1999, 40: 79-90. 10.1023/A:1026417710320.
Article CAS PubMed Google Scholar
Schierup MH, Vekemans X: Genomic consequences of selection on self-incompatibility genes. Curr Opin Plant Biol. 2008, 11: 116-122. 10.1016/j.pbi.2008.01.003.
Article CAS PubMed Google Scholar
Jacquard C, Mazeyrat-Gourbeyre F, Devaux P, Boutilier K, Baillieul F, Clement C: Microspore embryogenesis in barley: anther pre-treatment stimulates plant defence gene expression. Planta. 2009, 229: 393-402. 10.1007/s00425-008-0838-6.
Article CAS PubMed Google Scholar
D'Angeli S, Altamura MM: Osmotin induces cold protection in olive trees by affecting programmed cell death and cytoskeleton organization. Planta. 2007, 225: 1147-1163. 10.1007/s00425-006-0426-6.
Article PubMed Google Scholar
Zhang Y, Shih DS: Isolation of an osmotin-like protein gene from strawberry and analysis of the response of this gene to abiotic stresses. J Plant Physiol. 2007, 164: 68-77. 10.1016/j.jplph.2006.02.002.
Article CAS PubMed Google Scholar
Palusa SG, Golovkin M, Shin SB, Richardson DN, Reddy AS: Organ-specific, developmental, hormonal and stress regulation of expression of putative pectate lyase genes in Arabidopsis. New Phytol. 2007, 174: 537-550. 10.1111/j.1469-8137.2007.02033.x.
Article CAS PubMed Google Scholar
Opassiri R, Pomthong B, Akiyama T, Nakphaichit M, Onkoksoong T, Ketudata Cairns M, Ketudat Cains M: A stress-induced rice (Oryza sativa L.) beta-glucosidase represents a new subfamily of glycosyl hydrolase family 5 containing a fascin-like domain. Biochem J. 2007, 408: 241-249. 10.1042/BJ20070734.
Article PubMed Central CAS PubMed Google Scholar
Balzer HJ, Borisiuk L, Meyer HM, Matzk F, Baumlein H: A pollen allergen-encoding gene is expressed in wheat ovaries. Plant Mol Biol. 1996, 32: 435-445. 10.1007/BF00019095.
Article CAS PubMed Google Scholar
Jiang SY, Jasmin PX, Ting YY, Ramachandran S: Genome-wide identification and molecular characterization of Ole_e_I, Allerg_1 and Allerg_2 domain-containing pollen-allergen-like genes in Oryza sativa. DNA Res. 2005, 12: 167-179. 10.1093/dnares/dsi005.
Article CAS PubMed Google Scholar
Arai S, Matsumoto I, Emori Y, Abe K: Plant seed cystatins and their target enzymes of endogenous and exogenous origin. J Agric Food Chem. 2002, 50: 6612-6617. 10.1021/jf0201935.
Article CAS PubMed Google Scholar
Wilson ID, Barker GL, Lu C, Coghill JA, Beswick RW, Lenton JR, Edwards KJ: Alteration of the embryo transcriptome of hexaploid winter wheat (Triticum aestivum cv. Mercia) during maturation and germination. Funct Integr Genomics. 2005, 5: 144-154. 10.1007/s10142-005-0137-2.
Article PubMed Google Scholar
Gu XY, Kainian SF, Foley ME: Dormancy genes from weedy rice respond divergently to seed development environments. Genetics. 2006, 172: 1199-1211. 10.1534/genetics.105.049155.
Article PubMed Central PubMed Google Scholar
Delseny M, Bies-Etheve N, Carles C, Hull G, Vicient C, Raynal M, Grellet F, Aspart L: Late embryogenesis abundant (LEA) protein gene regulation during Arabidopsis seed maturation. J Plant Physiol. 2001, 158: 419-427. 10.1078/0176-1617-00353.
Article CAS Google Scholar
Tai HH, Tai GCC, Beardmore T: Dynamic histone acetylation of late embryonic genes during seed germination. Plant Mol Biol. 2005, 59: 909-925. 10.1007/s11103-005-2081-x.
Article CAS PubMed Google Scholar
Mrva K, Wallwork M, Mares DJ: α-Amylase and programmed cell death in aleurone of ripening wheat grains. J Exp Bot. 2006, 57: 877-885. 10.1093/jxb/erj072.
Article CAS PubMed Google Scholar
Karlin S, Altschul SF: Applications and statistics for multiple high-scoring segments in molecular sequences. Proc Natl Acad Sci USA. 1993, 90: 5873-5877. 10.1073/pnas.90.12.5873.
Article PubMed Central CAS PubMed Google Scholar

Download references

Acknowledgements

We would like to thank the members of the Wheat Genomics Consortium of Japan for supplying the original wheat RNAs and their kind support throughout this project. This work was supported by Grants-in-Aid for Scientific Research on Priority Areas in the field of Comparative Genomics, and the National BioResource Project from the Ministry of Education, Culture, Sports, Science and Technology of Japan.

Author information

Authors and Affiliations

Kihara Institute for Biological Research, Yokohama City University, Maioka-cho 641-12, Yokohama, 244-0813, Japan
Kanako Kawaura & Yasunari Ogihara
RIKEN Plant Science Center, 1-7-22, Suehiro-cho, Yokohama, 230-0045, Japan
Keiichi Mochida, Akiko Enju, Motoaki Seki & Kazuo Shinozaki
RIKEN Genomic Science Center, 1-7-22, Suehiro-cho, Yokohama, 230-0045, Japan
Yasushi Totoki, Atsushi Toyoda, Yoshiyuki Sakaki, Chikatoshi Kai, Jun Kawai & Yoshihide Hayashizaki

Authors

Kanako Kawaura
View author publications
You can also search for this author in PubMed Google Scholar
Keiichi Mochida
View author publications
You can also search for this author in PubMed Google Scholar
Akiko Enju
View author publications
You can also search for this author in PubMed Google Scholar
Yasushi Totoki
View author publications
You can also search for this author in PubMed Google Scholar
Atsushi Toyoda
View author publications
You can also search for this author in PubMed Google Scholar
Yoshiyuki Sakaki
View author publications
You can also search for this author in PubMed Google Scholar
Chikatoshi Kai
View author publications
You can also search for this author in PubMed Google Scholar
Jun Kawai
View author publications
You can also search for this author in PubMed Google Scholar
Yoshihide Hayashizaki
View author publications
You can also search for this author in PubMed Google Scholar
Motoaki Seki
View author publications
You can also search for this author in PubMed Google Scholar
Kazuo Shinozaki
View author publications
You can also search for this author in PubMed Google Scholar
Yasunari Ogihara
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yasunari Ogihara.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

This study was conceived and directed by YO, MS and KS. The full-length cDNA library was constructed by AE, YT and CK under the guidance of JK and YH. Selection of cDNA clones and DNA sequencing were conducted by AE and YT with the help of KM, AT and YS. Data were analyzed by KK and KM. AT, YS, JK, YH, MS, KS and YO participated in the design and coordination of the study. All authors read and approved the final manuscript.

Kanako Kawaura, Keiichi Mochida contributed equally to this work.

Electronic supplementary material

12864_2008_2155_MOESM1_ESM.pdf

Additional file 1: Annotation of the 3407 full-length wheat cDNA genes and their Ka/Ks values. The 3407 full-length wheat cDNA genes are annotated and their Ka/Ks values are calculated. (PDF 768 KB)

12864_2008_2155_MOESM2_ESM.doc

Additional file 2: Adaptational genes showing relatively faster evolution during diversification between wheat and rice. Adaptational genes showing relatively faster evolution are listed and their annotation is given with their Ka/Ks values. (DOC 102 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Kawaura, K., Mochida, K., Enju, A. et al. Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns. BMC Genomics 10, 271 (2009). https://doi.org/10.1186/1471-2164-10-271

Download citation

Received: 21 August 2008
Accepted: 18 June 2009
Published: 18 June 2009
DOI: https://doi.org/10.1186/1471-2164-10-271

Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns

Abstract

Background

Results

Conclusion

Similar content being viewed by others

Background

Results and Discussion

Sequence determination and functional annotation of full-length wheat cDNAs

Comparison of gene nucleotide substitution rates in coding regions in wheat and rice

Comparative and functional genomic analysis of full-length wheat cDNAs

Conclusion

Methods

Construction of full-length wheat cDNA library, DNA sequencing, and selection of independent cDNA clones

Data mining and characterization of full-length wheat cDNA clones

Full-length cDNA data

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors' contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation