Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes

Lenka, Sangram K.; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C.

doi:10.1007/s11103-008-9423-4

Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes

Open access
Published: 08 November 2008

Volume 69, pages 261–271, (2009)
Cite this article

Download PDF

You have full access to this open access article

Plant Molecular Biology Aims and scope Submit manuscript

Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes

Download PDF

Sangram K. Lenka¹,
Bikash Lohia¹,
Abhay Kumar¹,
Viswanathan Chinnusamy² &
…
Kailash C. Bansal¹

3010 Accesses
44 Citations
Explore all metrics

Abstract

Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.

Genome-wide identification, classification and expression analysis of GHMP genes family in Arabidopsis thaliana

Article 30 April 2015

Genome-wide identification and expression pattern analysis of the kiwifruit GRAS transcription factor family in response to salt stress

Article Open access 02 January 2024

Genome-wide identification, characterization, interaction network and expression profile of GRAS gene family in sweet orange (Citrus sinensis)

Article Open access 15 February 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Abiotic stresses like drought, salinity, cold and high temperature are the predominant environmental factors limiting productivity of crop plants (Breshears et al. 2005; Schroter et al. 2005). Plants respond to these environmental cues at molecular level by altering expression of different sets of genes (Qureshi et al. 2007; Tran et al. 2007). Expression of such genes is mainly regulated through transcriptional control process, while post-transcriptional and post-translational processes also play a crucial role. Transcriptional control machinery appears to be conserved among plant species (Hakimi et al. 2000; Hirt et al. 1990). It is well established from different experiments over past decades that promoters containing a particular cis-element respond to a specific trigger (Chinnusamy et al. 2003; Viswanathan and Zhu 2002; Yamaguchi-Shinozaki and Shinozaki 2005; Zhou et al. 2007). Combinatorial interactions of cis-acting DNA elements in the promoters with trans-acting protein factors are key processes governing spatio-temporal gene expression (Bustos et al. 1991; Hartmann et al. 2005; Hauffe et al. 1993). At an organism level, vast array of molecular genetic networks are operational in a very complex and dynamic mode. Complete understanding of the molecular genetic networks is a long cherished goal of system biologists (Chinnusamy et al. 2004; Li et al. 2006a). Targeted modification of molecular genetic networks has a tremendous potential for engineering tailor made elite genotypes “by-design.”

Availability of genome sequence of a crop plant like rice offers a new challenge and opportunities to explore the genetic mechanisms that regulate gene expression in response to various developmental and environmental cues. Rice is also closely related to other crop plants like wheat, maize, barley, sugarcane, oat, and sorghum, etc. A high degree of genomic synteny is conserved across different member species in the family Gramineae (Buell et al. 2005; Goff et al. 2002). Hence, rice is an ideal model to study complex gene regulation data coupled with comparative sequence information using computational tools. This type of study will give an insight to map, predict and decipher gene regulation mechanisms and functional classification of genes. Recently, a fare amount of large scale gene expression datasets have become available in rice in response to various stresses (Rabbani et al. 2003; Yazaki et al. 2003). However, the available data is not sufficient to do meta-analysis. Nevertheless, the rice gene expression data can be suitably integrated with promoter structure to find out it’s possible correlations (Benedict et al. 2006; Li et al. 2006b).

Roles of ABA in physiological, developmental and adaptive processes in plants are well known. Endogenous level of ABA is induced in response to various biotic (pathogen attack) and abiotic stresses (Fujita et al. 2006; Verslues and Zhu 2007). Exogenous application of ABA to plants mimics various stresses in term of co-expression of different sets of genes (Destefano-Beltran et al. 2006; Loik and Nobel 1993). The gene regulation in response to the elevated levels of ABA is mainly modulated by transcriptional control. Various studies suggest that co-expressed genes are likely involved in a common biological process. Integration of co-expression data with promoter structure in plants shows that promoters of co-expressed genes share a common cis-regulatory element (Kim and Kim 2006; Reiss et al. 2006; Werner 2001). Various algorithms such as expectation maximization (MEME) and Gibbs sampling have been used to search motifs that are over-represented within the set of related biological sequences (Bailey and Elkan 1994; Bailey et al. 2006; Thompson et al. 2003). Evidences from the promoter dissection and transcription factor binding experiments are the main references to evaluate the strength and confidence of computational methods. A handful of molecular dissection experiment like deletion and linker scanning analysis have pinpointed the ABA responsive elements (ABREs), also termed as G-box, C-box or G-box/C-box hybrids within promoters of the ABA responsive genes. ABRE contains ACGT as a core nucleotide sequence, which acts as a binding site for bZIP family transcription factors governing transcriptional regulation of ABA responsive genes (Guiltinan et al. 1990; Hattori et al. 2002; Mundy et al. 1990; Ross and Shen 2006; Shen and Ho 1995). ABREs are also coupled to the non-ACGT coupling elements like CE1, CE3, DRE, O2S, motif III, or ACGT core containing ABRE itself (Hobo et al. 1999; Shen et al. 1996; Singh 1998). As ABA plays crucial role in various signaling processes, it is logical to expect that other stress responsive integration points within ABA responsive promoters also govern gene regulation. In case of rice, genome-wide binding experiments like chromatin-immunoprecipitation coupled with microarray (Chip–chip) are lacking. Several databases like PLACE and Plant-CARE among others provide experimental evidences regarding cis-elements and transcription factors in plants (Higo et al. 1999; Lescot et al. 2002).

We present here, a targeted gene finding approach on a genome-wide scale in rice. Our prediction is based on over-represented ACGT core containing consensus motif found in co-expressed ABA responsive genes in vegetative tissues. Experimental verification by RT-PCR proves high accuracy (80%) of our integrated prediction method in the rice genome. Database mining suggested expression of the predicted genes in response to abiotic stresses as well. Among the diverse functional categories of genes, GO analysis showed the enrichment of the stress related and ABA signaling pathway genes among the genes predicted in this study.

Materials and methods

ABA responsive genes and random sequence datasets

From published microarray data, 105 genes showing two fold or more up-regulation in rice seedlings in response to ABA treatment were identified (Rabbani et al. 2003; Yazaki et al. 2003). Sequences of these genes were downloaded from NCBI database (http://www.ncbi.nlm.nih.gov/) and blasted with TIGR rice c-DNA sequences and corresponding loci were listed. 1 kb upstream sequences (promoters) from translational start site ATG, were downloaded from TIGR (http://www.tigr.org/plantProjects.shtml) Oryza sativa (Release 4.0; January 12, 2006) (Ouyang et al. 2007). Similarly other genomic sequences of rice and Arabidopsis used here were retrieved from the TIGR and TAIR databases, respectively. ABA up-regulated genes (692) in Arabidopsis identified by Li et al. (2006a, b), were considered in this study to search the presence of predicted CGMCACGTGB motif (Li et al. 2006b). Scuffled sequences were generated by randomly taking five ABA responsive promoters and scuffled 100 times using “Sequence Manipulation Suite” (http://www.bioinformatics.org/sms2/shuffle_dna.html). Other random sequence data sets used here were also generated by using “Sequence Manipulation Suite” (http://www.bioinformatics.org/sms2/random_dna.html).

Motif discovery and motif search

From several algorithms available, we chose the expectation maximization method MEME (Version 3.5.7) (http://meme.nbcr.net/meme/intro.html) for motif discovery using its default setting for a minimum and maximum width of a single motif as 10. The relevance of discovered motifs was analyzed using PLACE (http://www.dna.affrc.go.jp/PLACE/) (Bailey and Elkan 1994; Higo et al. 1999). The motifs obtained from analyzed sequences were plotted according to their positions within the regions and their consensus sequences were graphed using WebLogo3: Public Beta (http://weblogo.berkeley.edu/logo.cgi) (Crooks et al. 2004). Perl and JavaScript were used to search the perfect match of the target motifs within the rice, Arabidopsis, random and scuffled sequences. We have considered only one orientation while searching motifs here.

Analysis of gene ontology of predicted genes

To determine whether the occurrence of the discovered motifs were associated with specific gene functions, we retrieved the Plant GOSlim Assignment of rice Proteins from TIGR Database (http://www.tigr.org/tdb/e2k1/osa1/GO.retrieval.shtml) and correlated the annotated molecular function, biological process or cellular component of predicted and control data sets of rice genes.

Plant growth conditions, RNA sampling and RT-PCR analysis

Rice cultivar Nagina-22 (Oryza sativa) seeds were grown for 14 days at 28°C, 80% RH, and 12/12 h light/dark period in phytotron glass house. Sterile absorbent cotton soaked with Hoagland’s solution was used as seed bed for growing rice seedlings. Plants were irrigated with Hoagland’s solution at 3 days interval as supplemental irrigation.

Leaf and root samples from seedling were collected after 3, 5, and 24 h of 100 μM ABA treatment. ABA was also sprayed at every 2 h intervals from the first spraying. For treating roots, plants were submerged up to 1 cm above seed bed level in 100 μM ABA for 3, 5, and 24 h. RNA was extracted using TRI Reagent (Ambion, Inc. USA) and pooled from at least 20 independent controls and treated plant samples, respectively and was treated with DNase-I (QIAGEN GmbH, Germany). Subsequently RNA cleanup was carried out using RNeasy Plant Mini Kit (QIAGEN GmbH, Germany).

For RT-PCR analysis first strand c-DNA was synthesized by using 2 μg of total RNA using Superscript-III reverse transcriptase (Invitrogen, USA) with oligo(dT)₂₀ primer following manufacturer’s instructions. Two microliter of c-DNA was used in 25 μl of reaction volume with the following PCR conditions to study the gene expression: 30 cycles of 94°C for 1 min, annealing temperature according to melting temperature of primers for 1 min, and 72°C for 1 min, and then final extension at 72°C for 10 min. List of primer sets used in the study are given in the supplemental table (Additional Table 1). Out of the predicted 402 genes, randomly selected 45 genes tested here for RT-PCR analysis were not from the list of initial expression datasets (except LOC_Os01g02120 and LOC_Os02g43330 which are used as test control); hence their responsiveness to ABA is virtually un-known. Similarly 15 genes were used as negative control without having CGMCACGTGB motif. Quantitative estimation of RT-PCR amplicon on the gel was calculated as integrated density value (IDV) using AlphaEaseFC^™ software. Accuracy percentage of our prediction was calculated using the conversion:

Accuracy (%) = (Number of genes responsive to ABA detected through RT-PCR/total number of genes tested for RT-PCR) × 100.

Stress related expression data mining and phenotype searching

To analyze the expression of predicted genes in response to cold, drought and salt stresses, the physical position of these loci in the rice pseudomolecules were retrieved and searched against rice in the PlantQTL-GE database (http://www.scbit.org/qtl2gene/new/plantqtl-ge.html) (Zeng et al. 2007). All the above genes were searched for available phenotype in the Rice Tos17 Insertion Mutant Database, if there is an insertion mutation (http://tos.nias.affrc.go.jp/) (Miyao et al. 2007).

Results

ABRE consensus motif discovery and gene prediction

Response of plants to a particular trigger might be mediated by a common transcriptional regulatory mechanism, hardwired by cis-acting elements as proven in other model species (GuhaThakurta et al. 2002; Wolfsberg et al. 1999; Zhang et al. 2005). The cis-acting DNA elements are generally degenerative in nature and difficult to discover from the background but, ACGT-core containing ABRE was defined as ACGTGKC, which matched very well with the consensus derived from sequence comparison of ABA-responsive promoters in rice (Hattori et al. 2002). A genome wide computational prediction has successfully classified ABA responsive genes in Arabidopsis (Zhang et al. 2005). The modular arrangement of ABRE with its one of the coupling elements CE3 shows a clear divergence pattern in Arabidopsis and rice genome (Gomez-Porras et al. 2007). The ACGT core of ABRE is conserved in promoters of ABA responsive genes across monocot and dicot plant species. For example, for rice, maize and Arabidopsis ABRE are CGTACGTGTC, GACGTG, and CCACGTGG, respectively. However, this is not an exclusive list of ABREs in these species. We considered ABA responsive genes showing two fold or more up-regulation from the published expression profiling data for promoter analysis (Rabbani et al. 2003; Yazaki et al. 2003). These genes were aligned to TIGR gene model and the loci showing a perfect match (identity = 100%) were considered (Additional Table 2). The 1 kb upstream region from translational start site (ATG) of the selected genes was analyzed individually by PLACE, and among many other putative stress responsive cis-elements, ACGT core containing ABREs were found to be maximum (data not shown). The occurrence of PLACE-derived ABREs was highest within the 400 bp upstream region from the translational start site (Fig. 1). PLACE has documented 39 ABREs so far. Additional Fig. 1 shows the PLACE derived ABRE consensus which implicate that it is difficult to derive a clear cut consensus beyond ACGT core in the PLACE reported ABREs. We discovered over represented motifs in the promoters of co-expressed genes using expectation maximization algorithm MEME, which is considered as one of the best motif-sampling tool (Bailey and Elkan 1994). Using MEME, the ACGT core containing CGMCACGTGB motif was discovered as the top and best suited motif for genome-wide prediction of ABA responsive genes by partial interactive approach from the data sets used in this study (Fig. 2). PLACE analysis of this motif revealed that it consists of ABREs described for Arabidopsis thaliana, Oryza sativa, Lycopersicon esculentum, Triticum aestivum, Zea mays, Brassica napus, and Phaseolus vulgaris in PLACE (Table 1). Data mining from literature confirmed the sampled motif to be a strong ABRE (Busk et al. 1999; Shen and Ho 1995). The sampled ABRE motifs from promoters of co-expressed genes were used together and plotted according to their positions within the motif regions and their consensus was derived and plotted using WebLogo (Additional Fig. 2) (Crooks et al. 2004).

Table 1 PLACE description of ABRE motifs in the consensus MEME derived ABRE

Full size table

Here we explored a consensus of nucleotides flanking the ACGT core which was a decamer having a typical G-box (CACGTG) (position 4–9 WebLogo, (Additional Fig. 2). Using this top CGMCACGTGB motif, we predicted 402 protein coding genes as potential ABA responsive genes in the TIGR Rice Annotation (Release-4) model using Perl script. As this prediction strategy was stringent and only based on perfect match of cis-element, among these 402 predicted genes 392 genes were unique and independent from initial co-expressed gene data set. Table 2 shows MEME generated motifs from co-expressed genes, PLACE description of these motifs, and occurrences among the 402 predicted ABA responsive genes. Two other top motifs such as WWTTTTTYTW and SSSYGGCGSC were sampled as over-represented and found to be present in at-least one-third of the predicted genes (Table 2). However, motif WWTTTTTYTW and SSSYGGCGSC appears to be a common feature of rice genome as these motifs were sampled considerably from other control data sets such as randomly generated sequences, introns, coding DNA sequences (CDS) and intergenic region of rice (data not shown). Hence we did not consider WWTTTTTYTW and SSSYGGCGSC as real motifs for targeted gene prediction here.

Table 2 MEME generated top three motifs from the co-expressed genes

Full size table

Analysis of structural and occurrence rareness of the ABRE

To investigate the structural and occurrence rareness of ABRE motif, we have analyzed different sets of sequence such as 1 kb upstream sequences of co-expressed genes, predicted genes, 1 kb scuffled ABA responsive promoter sequence, 1 kb rice CDS, and 1 kb randomly sampled Arabidopsis promoters. The distribution of ABRE (ACGTG) was found to be similar between initial co-expressed genes used to derive ABRE-consensus and predicted genes based on ABRE-consensus; whereas the distribution pattern of ABRE differs among 1 kb scuffled ABA responsive promoter sequence of rice, 1 kb rice CDS and 1 kb randomly sampled Arabidopsis promoters as expected. However, enriched PLACE derived ABRE motif (ACGTG) shows a biased distribution in the promoters of predicted genes with a similar pattern of distribution as compared to co-expressed promoter sets. The occurrence of ABRE motifs was found to be higher within the 400 nucleotide upstream to translational start site (Fig. 1). Subsequently, we have checked the occurrence of the predicted motif within the 5′-UTR of the co-expressed and predicted genes. There was no CGMCACGTGB motif found within the 5′-UTR of the co-expressed genes, whereas only 7 predicted genes (1.7%) contain this motif in 5′-UTR. Hence it shows another common structural similarity between co-expressed and predicted genes. We considered the 1 kb upstream region from translational start site (ATG) of published ABA up-regulated (692) Arabidopsis genes (Li et al. 2006b) to analyze the enrichment of predicted CGMCACGTGB motif. Only 7.5% of the ABA up-regulated Arabidopsis genes contain this motif predicted for ABA responsive genes in rice. To further confirm the rareness of ABRE (CGMCACGTGB) motif, sets of 402 sequences each were analyzed within the 1 kb length such as randomly generated sequences, introns, CDS and intergenic region of rice. As compared to predicted promoters (100%) the occurrence of ABRE (CGMCACGTGB) was found to be 0.4, 0.4, 0.7, and 1.7% within the sets of randomly generated sequences, introns, CDS and intergenic region of rice, respectively, used here. Hence, this study confirms that ABRE (CGMCACGTGB) cis-element is really specific to promoters of ABA-responsive genes and not just a common feature of the rice genome.

Expression analysis of the predicted genes

To study the accuracy of prediction of ABA responsive genes in this study, out of 402 predicted genes, we randomly selected 45 genes whose response to ABA was unknown (except LOC_Os01g02120 and LOC_Os02g43330, which are used as test control) for expression analysis by RT-PCR. As high as 80% of the predicted ABA responsive genes showed induction in response to exogenous ABA (Table 3, Additional Fig. 3). A set of 15 genes without the predicted ABRE in their promoters were tested as negative control for prediction accuracy (Table 3). Ubiqutin was checked as internal control for RT-PCR. Expression analysis of each gene was confirmed in at least three independent RT-PCR reactions. Hence, occurrence of predicted ABRE (CGMCACGTGB) cis-element is important for ABA induction. A differential expression pattern was observed in different tissues at different time points of ABA treatment (Table 3). Although exogenous application of ABA mimics the expression of several stress and endogenous ABA responsive genes, plants are less sensitive to exogenous ABA under normal growth conditions than to endogenous ABA during stress (Sharp 2002). Expression analysis at different time points (3, 5, and 24 h) and tissues (leaf and root) illustrate that plants respond differentially to exogenous ABA. PlantQTL-GE database mining result showed (Additional Table 3) that many of the predicted ABA responsive genes in this study are also expressed under abiotic stresses such as cold, salt and osmotic stresses, which induce ABA accumulation (Zeng et al. 2007).

Table 3 RT-PCR analysis of predicted ABA responsive genes in different time points and vegetative tissues of rice

Full size table

Functional classification and ontology analysis of genes

ABA is the most versatile plant hormone involved in regulation of varied groups of genes. Functional annotation among these 402 predicted ABA responsive genes as per TIGR revealed diverse functional categories including important stress related signaling components (Additional Fig. 4). Details of these annotations are in the supplemental table (Additional Table 4).

Diverse gene ontology (GO) categories were enriched among these predicted ABA inducible genes (Additional Table 5). This is in consistent with the diverse roles of ABA in regulating biological processes (Ashburner et al. 2000; Hirayama and Shinozaki 2007). Among these important GO categories; considerable enrichments were obtained in different functional classes (Fig. 3). A set of randomly chosen 402 genes apart from predicted genes were analyzed for the GO analysis, where less GO enrichment was observed (Additional Fig. 5). But this study does not rule out the enrichment of GO functional categories in co-expressed genes under other environmental conditions. These results highlight the importance of conservation of ABA responsive genes and signaling pathways.

Discussion

The novel rice specific consensus for ABRE motif, CGMCACGTGB generated in this study is beyond the ACGT core and is distinct over PLACE derived ABRE motif for ABA responsive genes. This motif can be considered for finding ABA responsive genes in related species. In a related study cis-elements were discovered using correlated expression and sequence conservation between Arabidopsis and Brassica oleracea (Haberer et al. 2006). ABA responsive genes predicted here are not exhaustive, but represent considerable number of genes with a similar cis-regulatory element. The variability in ABRE and other over-presented motifs organization might be a reason for multiple signal integration points in combinatorial cis–trans interaction and versatile gene action under varied conditions (Suzuki et al. 2005). In our study, genes predicted by using only ABRE (CGMCACGTGB) cis-element showed 80% prediction accuracy among the randomly selected 45 genes of top 402 genes. RT-PCR expression analysis of 27 genes among the top 40 genes prediction by using ABRE–CE module shows that only 63.0% (17) genes were responsive to exogenous ABA in Arabidopsis (Zhang et al. 2005). Thus, ABRE motif identified in this study appears to be a better predictor of ABA-responsive genes in rice. The possibility of ABA-induction of the remaining 20% genes tested is not ruled out as they may express at different tissues/developmental stages/ABA concentration/time points. The differential ABA responsiveness of genes in different tissues and time points as revealed by RT-PCR analysis (Table 3, Additional Fig. 3) suggest the possible involvement of tissue, duration and developmental stage-specific ABRE interacting cis-elements or trans-acting factors in gene regulation. GO analysis showed the enrichments of signal transduction, stress-related and development-related genes among other categories in the predicted ABA regulated genes (Additional Table 5) and thus signifying the diverse role of ABA responsive genes.

Previous studies in different model systems have proven the importance of over re-represented cis-motifs in gene regulation. Integrating expression profile data with cis-motif consensus pattern had a much higher selectivity than only consensus pattern and matrix-based searches designed to predict cis-acting transcriptional regulatory sequences (Fujibuchi et al. 2001). The influence of other external and internal cues apart from the treatment under study cannot be disregarded. Fine-tuning of transcriptional regulation under multitude conditions is most important aspect of plant adaptation process. Combinatorial control of transcription by multiple transcription factors has been reported in plant system (Lara et al. 2003; Narusaka et al. 2003). Integration of ABA responsive tissue-specific gene expression with promoter structure is a challenge to understand the universal and organism level molecular networking (Ma and Bohnert 2007). Binding experiments at different time points and developmental stages to design and verify system models might in turn give direct evidences with respect to dynamics of the molecular genetic network. Large scale cloning and characterization of predicted ABA-induced genes will help to unravel the role of ABA-regulated genes in genome wide chromatin structure, transcription, protein–protein/DNA–protein interaction, post-transcriptional and post-translational regulations.

We found direct Tos17 insertion in two of the predicted ABA responsive loci and searched its phenotypic impact using Tos17 mutant panel database (Miyao et al. 2007). Disruption of the predicted ABA regulated MYB-like DNA-binding domain gene, Os01g09280, by Tos17 insertion resulted in a dwarf and late flowering phenotype, supporting the key role of ABA-dependent components on plant development. Mutation in another predicted ABA regulated gene encoding leucine rich repeat family protein Os02g06130, gives a quite encouraging agronomical phenotype with comparatively high yield. We presume that Os02g06130 might have some adverse impact on yield under normal growth conditions. To validate the role of these loci on yield and plant development, we are developing siRNA knock-out lines and over expression lines.

In the post-genomic era, ability to deduce genome function has become an increasingly important task. For many genomes, the functional annotation immediately available will be based on computational predictions and comparisons with functional elements in related species. Targeted prediction of genes based on cis-motif is quite effective in functional categorization of genes that are most likely to be involved in a common molecular genetic network (Goda et al. 2004; Wang et al. 2007). Role of a particular CRE in a functional category is well demonstrated in yeast by sequencing and comparison to identify genes (Kellis et al. 2003). This method will help in functional annotation of genes predicted through ORF based approach, as ORF based gene prediction does not classify genes into functional categories. However, knowledge gained through sampling of over-represented cis-motifs from co-expressed genes responsive to a particular signal is useful to design genome-wide binding studies like Chip–chip, which in turn will help to unravel the complete molecular genetic network in biological systems. The genome level experimental knowledge of accurate dynamic spatio-temporal gene regulation integrated with promoter architecture is not available for ABA regulated genes. Computational prediction method provides a viable option to design suitable experiments and understand the dynamics of complex molecular genetic networks (Additional Fig. 6).

Conclusions

Identifying the key cis-elements and promoter architecture that regulate the expression of plant genome is a complex task that will require a series of complementary methods such as prediction, extensive experimental validation and proper understanding of the role of cis-elements in combinatorial control of plant gene expression. The ABRE (CGMCACGTGB) identified in this study is novel, rice-specific and can be used for functional classification of ABA responsive genes in related species. This cis- element based targeted gene finding approach will act as a supplemental tool for the classic ORF based gene prediction method for functional classification of genes. We advocate that the overall strategy will be cost effective and efficient for application in related plant species, where information is primarily limited to Genomic Survey Sequences (GSS).

Abbreviations

ABA:: Abscisic acid
CRE:: Cis-regulatory element
ABRE:: ABA responsive element
GO:: Gene ontology
CDS:: Coding DNA sequence

References

Ashburner M, Ball CA, Blake JA et al (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25:25–29. doi:10.1038/75556
Article CAS PubMed Google Scholar
Bailey TL, Elkan C (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol 2:28–36
CAS PubMed Google Scholar
Bailey TL, Williams N, Misleh C et al (2006) MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res 34:W369–W373. doi:10.1093/nar/gkl303
Article CAS PubMed Google Scholar
Benedict C, Geisler M, Trygg J et al (2006) Consensus by democracy. Using meta-analyses of microarray and genomic data to model the cold acclimation signaling pathway in Arabidopsis. Plant Physiol 141:1219–1232. doi:10.1104/pp.106.083527
Article CAS PubMed Google Scholar
Breshears DD, Cobb NS, Rich PM et al (2005) Regional vegetation die-off in response to global-change-type drought. Proc Natl Acad Sci USA 102:15144–15148. doi:10.1073/pnas.0505734102
Article CAS PubMed Google Scholar
Buell CR, Yuan Q, Ouyang S et al (2005) Sequence, annotation, and analysis of synteny between rice chromosome 3 and diverged grass species. Genome Res 15:1284–1291
Article CAS PubMed Google Scholar
Busk PK, Pujal J, Jessop A et al (1999) Constitutive protein–DNA interactions on the abscisic acid-responsive element before and after developmental activation of the rab28 gene. Plant Mol Biol 41:529–536. doi:10.1023/A:1006345113637
Article CAS PubMed Google Scholar
Bustos MM, Begum D, Kalkan FA et al (1991) Positive and negative cis-acting DNA domains are required for spatial and temporal regulation of gene expression by a seed storage protein promoter. EMBO J 10:1469–1479
CAS PubMed Google Scholar
Chinnusamy V, Ohta M, Kanrar S et al (2003) ICE1: a regulator of cold-induced transcriptome and freezing tolerance in Arabidopsis. Genes Dev 17:1043–1054. doi:10.1101/gad.1077503
Article CAS PubMed Google Scholar
Chinnusamy V, Schumaker K, Zhu JK (2004) Molecular genetic perspectives on cross-talk and specificity in abiotic stress signaling in plants. J Exp Bot 55:225–236. doi:10.1093/jxb/erh005
Article CAS PubMed Google Scholar
Crooks GE, Hon G, Chandonia JM, Brenner SE (2004) WebLogo: a sequence logo generator. Genome Res 14:1188–1190. doi:10.1101/gr.849004
Article CAS PubMed Google Scholar
Destefano-Beltran L, Knauber D, Huckle L et al (2006) Chemically forced dormancy termination mimics natural dormancy progression in potato tuber meristems by reducing ABA content and modifying expression of genes involved in regulating ABA synthesis and metabolism. J Exp Bot 57:2879–2886. doi:10.1093/jxb/erl050
Article CAS PubMed Google Scholar
Fujibuchi W, Anderson JS, Landsman D (2001) PROSPECT improves cis-acting regulatory element prediction by integrating expression profile data with consensus pattern searches. Nucleic Acids Res 29:3988–3996
CAS PubMed Google Scholar
Fujita M, Fujita Y, Noutoshi Y et al (2006) Crosstalk between abiotic and biotic stress responses: a current view from the points of convergence in the stress signaling networks. Curr Opin Plant Biol 9:436–442. doi:10.1016/j.pbi.2006.05.014
Article PubMed Google Scholar
Goda H, Sawa S, Asami T et al (2004) Comprehensive comparison of auxin-regulated and brassinosteroid-regulated genes in Arabidopsis. Plant Physiol 134:1555–1573. doi:10.1104/pp.103.034736
Article CAS PubMed Google Scholar
Goff SA, Ricke D, Lan TH et al (2002) A draft sequence of the rice genome (Oryza Sativa L. ssp. japonica). Science 296:92–100
Article CAS PubMed Google Scholar
Gomez-Porras JL, Riano-Pachon DM, Dreyer I et al (2007) Genome-wide analysis of ABA-responsive elements ABRE and CE3 reveals divergent patterns in Arabidopsis and rice. BMC Genomics 8:260. doi:10.1186/1471-2164-8-260
Article PubMed Google Scholar
GuhaThakurta D, Palomar L, Stormo GD et al (2002) Identification of a novel cis-regulatory element involved in the heat shock response in Caenorhabditis elegans using microarray gene expression and computational methods. Genome Res 12:701–712. doi:10.1101/gr.228902
Article CAS PubMed Google Scholar
Guiltinan MJ, Marcotte WR Jr, Quatrano RS (1990) A plant leucine zipper protein that recognizes an abscisic acid response element. Science 250:267–271. doi:10.1126/science.2145628
Article CAS PubMed Google Scholar
Haberer G, Mader MT, Kosarev P et al (2006) Large-scale cis-element detection by analysis of correlated expression and sequence conservation between Arabidopsis and Brassica oleracea. Plant Physiol 142:1589–1602. doi:10.1104/pp.106.085639
Article CAS PubMed Google Scholar
Hakimi MA, Privat I, Valay JG et al (2000) Evolutionary conservation of C-terminal domains of primary sigma (70)-type transcription factors between plants and bacteria. J Biol Chem 275:9215–9221. doi:10.1074/jbc.275.13.9215
Article CAS PubMed Google Scholar
Hartmann U, Sagasser M, Mehrtens F et al (2005) Differential combinatorial interactions of cis-acting elements recognized by R2R3-MYB, BZIP, and BHLH factors control light-responsive and tissue-specific activation of phenylpropanoid biosynthesis genes. Plant Mol Biol 57:155–171. doi:10.1007/s11103-004-6910-0
Article CAS PubMed Google Scholar
Hattori T, Totsuka M, Hobo T et al (2002) Experimentally determined sequence requirement of ACGT-containing abscisic acid response element. Plant Cell Physiol 43:136–140. doi:10.1093/pcp/pcf014
Article CAS PubMed Google Scholar
Hauffe KD, Lee SP, Subramaniam R, Douglas CJ (1993) Combinatorial interactions between positive and negative cis-acting elements control spatial patterns of 4CL-1 expression in transgenic tobacco. Plant J 4:235–253. doi:10.1046/j.1365-313X.1993.04020235.x
Article CAS PubMed Google Scholar
Higo K, Ugawa Y, Iwamoto M, Korenaga T (1999) Plant cis-acting regulatory DNA elements (PLACE) database: 1999. Nucleic Acids Res 27:297–300. doi:10.1093/nar/27.1.297
Article CAS PubMed Google Scholar
Hirayama T, Shinozaki K (2007) Perception and transduction of abscisic acid signals: keys to the function of the versatile plant hormone ABA. Trends Plant Sci 12:343–351. doi:10.1016/j.tplants.2007.06.013
Article CAS PubMed Google Scholar
Hirt H, Kogl M, Murbacher T et al (1990) Evolutionary conservation of transcriptional machinery between yeast and plants as shown by the efficient expression from the CaMV 35S promoter and 35S terminator. Curr Genet 17:473–479. doi:10.1007/BF00313074
Article CAS PubMed Google Scholar
Hobo T, Asada M, Kowyama Y et al (1999) ACGT-containing abscisic acid response element (ABRE) and coupling element 3 (CE3) are functionally equivalent. Plant J 19:679–689. doi:10.1046/j.1365-313x.1999.00565.x
Article CAS PubMed Google Scholar
Kellis M, Patterson N, Endrizzi M et al (2003) Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature 423:241–254. doi:10.1038/nature01644
Article CAS PubMed Google Scholar
Kim SY, Kim Y (2006) Genome-wide prediction of transcriptional regulatory elements of human promoters using gene expression and promoter analysis data. BMC Bioinformatics 7:330. doi:10.1186/1471-2105-7-330
Article PubMed Google Scholar
Lara P, Onate-Sanchez L, Abraham Z et al (2003) Synergistic activation of seed storage protein gene expression in Arabidopsis by ABI3 and two bZIPs related to OPAQUE2. J Biol Chem 278:21003–21011. doi:10.1074/jbc.M210538200
Article CAS PubMed Google Scholar
Lescot M, Dehais P, Thijs G et al (2002) PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res 30:325–327. doi:10.1093/nar/30.1.325
Article CAS PubMed Google Scholar
Li S, Assmann SM, Albert R (2006a) Predicting essential components of signal transduction networks: a dynamic model of guard cell abscisic acid signaling. PLoS Biol 4:e312. doi:10.1371/journal.pbio.0040312
Article PubMed Google Scholar
Li Y, Lee KK, Walsh S et al (2006b) Establishing glucose- and ABA-regulated transcription networks in Arabidopsis by microarray analysis and promoter classification using a relevance vector machine. Genome Res 16:414–427. doi:10.1101/gr.4237406
Article CAS PubMed Google Scholar
Loik ME, Nobel PS (1993) Exogenous abscisic acid mimics cold acclimation for cacti differing in freezing tolerance. Plant Physiol 103:871–876
CAS PubMed Google Scholar
Ma S, Bohnert HJ (2007) Integration of Arabidopsis thaliana stress-related transcript profiles, promoter structures, and cell-specific expression. Genome Biol 8:R49. doi:10.1186/gb-2007-8-4-r49
Article PubMed Google Scholar
Miyao A, Iwasaki Y, Kitano H et al (2007) A large-scale collection of phenotypic data describing an insertional mutant population to facilitate functional analysis of rice genes. Plant Mol Biol 63:625–635. doi:10.1007/s11103-006-9118-7
Article CAS PubMed Google Scholar
Mundy J, Yamaguchi-Shinozaki K, Chua NH (1990) Nuclear proteins bind conserved elements in the abscisic acid-responsive promoter of a rice rab gene. Proc Natl Acad Sci USA 87:1406–1410. doi:10.1073/pnas.87.4.1406
Article CAS PubMed Google Scholar
Narusaka Y, Nakashima K, Shinwari ZK et al (2003) Interaction between two cis-acting elements, ABRE and DRE, in ABA-dependent expression of Arabidopsis rd29A gene in response to dehydration and high-salinity stresses. Plant J 34:137–148. doi:10.1046/j.1365-313X.2003.01708.x
Article CAS PubMed Google Scholar
Ouyang S, Zhu W, Hamilton J et al (2007) The TIGR rice genome annotation resource: improvements and new features. Nucleic Acids Res 35:D883–D887. doi:10.1093/nar/gkl976
Article CAS PubMed Google Scholar
Qureshi MI, Qadir S, Zolla L (2007) Proteomics-based dissection of stress-responsive pathways in plants. J Plant Physiol 164:1239–1260. doi:10.1016/j.jplph.2007.01.013
Article CAS PubMed Google Scholar
Rabbani MA, Maruyama K, Abe H et al (2003) Monitoring expression profiles of rice genes under cold, drought, and high-salinity stresses and abscisic acid application using cDNA microarray and RNA gel-blot analyses. Plant Physiol 133:1755–1767. doi:10.1104/pp.103.025742
Article CAS PubMed Google Scholar
Reiss DJ, Baliga NS, Bonneau R (2006) Integrated biclustering of heterogeneous genome-wide datasets for the inference of global regulatory networks. BMC Bioinformatics 7:280. doi:10.1186/1471-2105-7-280
Article PubMed Google Scholar
Ross C, Shen QJ (2006) Computational prediction and experimental verification of HVA1-like abscisic acid responsive promoters in rice (Oryza sativa). Plant Mol Biol 62:233–246. doi:10.1007/s11103-006-9017-y
Article CAS PubMed Google Scholar
Schroter D, Cramer W, Leemans R et al (2005) Ecosystem service supply and vulnerability to global change in Europe. Science 310:1333–1337. doi:10.1126/science.1115233
Article PubMed Google Scholar
Sharp RE (2002) Interaction with ethylene: changing views on the role of abscisic acid in root and shoot growth responses to water stress. Plant Cell Environ 25:211–222. doi:10.1046/j.1365-3040.2002.00798.x
Article CAS PubMed Google Scholar
Shen Q, Ho TH (1995) Functional dissection of an abscisic acid (ABA)-inducible gene reveals two independent ABA-responsive complexes each containing a G-box and a novel cis-acting element. Plant Cell 7:295–307
Article CAS PubMed Google Scholar
Shen Q, Zhang P, Ho TH (1996) Modular nature of abscisic acid (ABA) response complexes: composite promoter units that are necessary and sufficient for ABA induction of gene expression in barley. Plant Cell 8:1107–1119
Article CAS PubMed Google Scholar
Singh KB (1998) Transcriptional regulation in plants: the importance of combinatorial control. Plant Physiol 118:1111–1120. doi:10.1104/pp.118.4.1111
Article CAS PubMed Google Scholar
Suzuki M, Ketterling MG, McCarty DR (2005) Quantitative statistical analysis of cis-regulatory sequences in ABA/VP1- and CBF/DREB1-regulated genes of Arabidopsis. Plant Physiol 139:437–447. doi:10.1104/pp.104.058412
Article CAS PubMed Google Scholar
Thompson W, Rouchka EC, Lawrence CE (2003) Gibbs recursive sampler: finding transcription factor binding sites. Nucleic Acids Res 31:3580–3585. doi:10.1093/nar/gkg608
Article CAS PubMed Google Scholar
Tran LS, Nakashima K, Shinozaki K et al (2007) Plant gene networks in osmotic stress response: from genes to regulatory networks. Methods Enzymol 428:109–128. doi:10.1016/S0076-6879(07)28006-1
Article CAS PubMed Google Scholar
Verslues PE, Zhu JK (2007) New developments in abscisic acid perception and metabolism. Curr Opin Plant Biol 10:447–452. doi:10.1016/j.pbi.2007.08.004
Article CAS PubMed Google Scholar
Viswanathan C, Zhu JK (2002) Molecular genetic analysis of cold-regulated gene transcription. Philos Trans R Soc Lond B Biol Sci 357:877–886. doi:10.1098/rstb.2002.1076
Article CAS PubMed Google Scholar
Wang D, Pei K, Fu Y et al (2007) Genome-wide analysis of the auxin response factors (ARF) gene family in rice (Oryza sativa). Gene 394:13–24. doi:10.1016/j.gene.2007.01.006
Article CAS PubMed Google Scholar
Werner T (2001) Cluster analysis and promoter modelling as bioinformatics tools for the identification of target genes from expression array data. Pharmacogenomics 2:25–36. doi:10.1517/14622416.2.1.25
Article CAS PubMed Google Scholar
Wolfsberg TG, Gabrielian AE, Campbell MJ et al (1999) Candidate regulatory sequence elements for cell cycle-dependent transcription in Saccharomyces cerevisiae. Genome Res 9:775–792
CAS PubMed Google Scholar
Yamaguchi-Shinozaki K, Shinozaki K (2005) Organization of cis-acting regulatory elements in osmotic- and cold-stress-responsive promoters. Trends Plant Sci 10:88–94. doi:10.1016/j.tplants.2004.12.012
Article CAS PubMed Google Scholar
Yazaki J, Kishimoto N, Nagata Y et al (2003) Genomics approach to abscisic acid- and gibberellin-responsive genes in rice. DNA Res 10:249–261. doi:10.1093/dnares/10.6.249
Article CAS PubMed Google Scholar
Zeng H, Luo L, Zhang W et al (2007) PlantQTL-GE: a database system for identifying candidate genes in rice and Arabidopsis by gene expression and QTL information. Nucleic Acids Res 35:D879–D882. doi:10.1093/nar/gkl814
Article CAS PubMed Google Scholar
Zhang W, Ruan J, Ho TH et al (2005) Cis-regulatory element based targeted gene finding: genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana. Bioinformatics 21:3074–3081. doi:10.1093/bioinformatics/bti490
Article CAS PubMed Google Scholar
Zhou J, Wang X, Jiao Y et al (2007) Global genome expression analysis of rice in response to drought and high-salinity stresses in shoot, flag leaf, and panicle. Plant Mol Biol 63:591–608. doi:10.1007/s11103-006-9111-1
Article CAS PubMed Google Scholar

Download references

Acknowledgments

S. K. Lenka and A. Kumar thank University Grants Commission (UGC) and Council of Scientific and Industrial Research (CSIR) for CSIR-UGC Junior Research Fellowship grant. This work was supported by the Indian Council of Agricultural Research (ICAR)-sponsored Network Project on Transgenics in Crops (NPTC).

Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Author information

Authors and Affiliations

National Research Centre on Plant Biotechnology, Indian Agricultural Research Institute, New Delhi, 110012, India
Sangram K. Lenka, Bikash Lohia, Abhay Kumar & Kailash C. Bansal
Water Technology Center, Indian Agricultural Research Institute, New Delhi, 110012, India
Viswanathan Chinnusamy

Authors

Sangram K. Lenka
View author publications
You can also search for this author in PubMed Google Scholar
Bikash Lohia
View author publications
You can also search for this author in PubMed Google Scholar
Abhay Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Viswanathan Chinnusamy
View author publications
You can also search for this author in PubMed Google Scholar
Kailash C. Bansal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kailash C. Bansal.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Additional Fig. 1

MOESM1 (JPG 15 kb)

Additional Fig. 2

MOESM2 (JPG 21 kb)

Additional Fig. 3

MOESM3 (PPT 3,820 kb)

Additional Fig. 4

MOESM4 (PNG 46 kb)

Additional Fig. 5

MOESM5 (JPG 39 kb)

Additional Fig. 6

MOESM6 (JPG 59 kb)

Additional Table 1

MOESM7 (DOC 79 kb)

Additional Table 2

MOESM8 (XLS 58 kb)

Additional Table 3

MOESM9 (DOC 86 kb)

Additional Table 4

MOESM10 (XLS 58 kb)

Additional Table 5

MOESM11 (DOC 89 kb)

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

Lenka, S.K., Lohia, B., Kumar, A. et al. Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes. Plant Mol Biol 69, 261–271 (2009). https://doi.org/10.1007/s11103-008-9423-4

Download citation

Received: 19 May 2008
Accepted: 16 October 2008
Published: 08 November 2008
Issue Date: February 2009
DOI: https://doi.org/10.1007/s11103-008-9423-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes

Abstract

Similar content being viewed by others

Introduction

Materials and methods

ABA responsive genes and random sequence datasets

Motif discovery and motif search

Analysis of gene ontology of predicted genes

Plant growth conditions, RNA sampling and RT-PCR analysis

Stress related expression data mining and phenotype searching

Results

ABRE consensus motif discovery and gene prediction

Analysis of structural and occurrence rareness of the ABRE

Expression analysis of the predicted genes

Functional classification and ontology analysis of genes

Discussion

Conclusions

Abbreviations

References

Acknowledgments

Open Access

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation