Abstract
Arabidopsis Thaliana HARDY (AtHRD) is a gene with an APETELA 2 / Ethylene Responsive Factor (AP2/ERF) domain linked to improved performance under drought in rice. We hypothesized that the sorghum genome could possess a similar gene product and were motivated to conduct a computational genome scale mining for the protein and analyse its structural and functional properties. AtHRD sequence was used as a query to BLAST against the sorghum genome dataset followed by multiple alignment analysis. A homology model of the target was built using a template detected based on the pair-wise comparison of hidden Markov models for alignments. DNA docking with a matrix of homologous interface contacts was done. Functional and structural analysis of the query and target was conducted using various online servers.
A High-scoring segment pair from Chromosome 6 of the sorghum genome in the region between 54948120 and 54948668 had 68 amino acid similarities out of the 184 residues and was 1.4% above twilight zone threshold. The homology model showed 86.8% residues in most favoured regions. The target protein which had an AP2/ERF domain when docked with GCC box DNA motif had conserved residues involved in binding; it had a long unstructured region beyond the AP2 domain with several motifs for the recognition of serine/threonine protein kinase group. The protein model showed that it could bind to a GCC box which is present in several drought responsive genes. The presence of possible signalling domains and intrinsic disorder in the target protein suggest that this could play a role in drought tolerance which is an inherent character of sorghum. These results offer a jumpstart for validation experiments which could pave the way for cis/trans genic improvement of a range of crops.
Similar content being viewed by others
References
Abogadallah, G.M., Nada, R.M., Malinowski, R., Quick, P. 2011. Overexpression of HARDY, an AP2/ERF gene from Arabidopsis, improves drought and salt tolerance by reducing transpiration and sodium uptake in transgenic Trifolium alexandrinum L. Planta 233, 1264–1276.
Benkert, P., Kunzli, M., Schwede, T. 2009. QMEAN server for protein model quality estimation. Nucleic Acids Res 37, W510–W514.
Bordoli, L., Kiefer, F., Arnold, K., Benkert, P., Battey, J., Schwede, T. 2008. Protein structure homology modeling using SWISS-MODEL workspace. Nature Protocols 4, 1–13.
Botha, G.M., Viljoen, C.D. 2008. Can GM sorghum impact Africa? Trends Biotechnol 26(2), 64–69.
Buchanan, C.D., Lim, S., Salzman, R.A., Kagiampakis, I., Morishige, D.T., Weers, B.D., Klein, R.R., Pratt, L.H., Cordonnier-Pratt, M.M., Klein, P.E., Mullet, J.E. 2005. Sorghum bicolor’s transcriptome response to dehydration, high salinity and ABA. Plant Mol Biol 58, 699–720.
Chenna, R., Sugawara, H., Koike, T., Lopez, R., Gibson, T.J., Higgins, D.G., Thompson, J.D. 2003. Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res 31, 3497–3500.
Contreras-Moreira, B., Branger, P.-A., Collado-Vides, J. 2007. TFmodeller: Comparative modelling of protein-DNA complexes. Bioinformatics 23, 1694–1696.
Daggett, V., Fersht, A.R. 2009. Protein folding and binding: moving into unchartered territory. Curr Opin Struct Biol 19, 1–2.
Dev Sharma, A., Kumar, S., Singh, P. 2006. Expression analysis of a stress-modulated transcript in drought tolerant and susceptible cultivars of sorghum (Sorghum bicolor). J Plant Physiol 163, 570–576.
Eisenberg, D., Lüthy, R., Bowie, J.U. 1997. VERIFY3D: Assessment of protein models with three dimensional profiles. Methods Enzymol 277, 396–404.
Fujii, H., Chinnusamy, V., Rodrigues, A., Rubio, S., Antoni, R., Park, S.Y., Cutler, S.R., Sheen, J., Rodriguez, P.L., Zhu, J.K. 2009. In vitro reconstitution of an abscisic acid signalling pathway. Nature 462, 660–664.
Karaba, A., Dixit, S., Greco, R., Aharoni, A., Trijatmiko, K.R., Marsch-Martinez, N., Krishnan, A., Nataraja, K.N., Udayakumar, M., Pereira, A. 2007. Improvement of water use efficiency in rice by expression of HARDY, an Arabidopsis drought and salt tolerance gene. Proceedings of the National Academy of Sciences 104, 15270–15275.
Kaufmann, K., Nagasaki, M., Jáuregui, R. 2010. Modelling the molecular interactions in the flower developmental network of Arabidopsis thaliana. In Silico Biology 10, 125–143.
Kim, S., Soltis, P.S., Wall, K., Soltis, D.E. 2006. Phylogeny and domain evolution in the APETALA2-like gene family. Mol Biol Evol 23, 107–120.
Laskowski, R.A., Watson, J.D., Thornton, J.M. 2005. ProFunc: A server for predicting protein function from 3D structure. Nucleic Acids Res 33, W89–W93.
Linding, R., Jensen, L.J., Diella, F., Bork, P., Gibson, T.J., Russell, R.B. 2003. Protein disorder prediction: implications for structural proteomics. Structure 11, 1453–1459.
Magnani, E., Sjölander, K., Hake, S. 2004. From endonucleases to transcription factors: Evolution of the AP2 DNA binding domain in plants. Plant Cell 16, 2265–2277.
Marmorstein, R., Fitzgerald, M.X. 2003. Modulation of DNA-binding domains for sequence-specific DNA recognition. Gene 304, 1–12.
Mullet, J.E., Klein, R.R., Klein, P.E. 2002. Sorghum bicolor — an important species for comparative grass genomics and a source of beneficial genes for agriculture. Curr Opin Plant Biol 5, 118–121.
Nakashima, K., Ito, Y., Yamaguchi-Shinozaki, K. 2009. Transcriptional regulatory networks in response to abiotic stresses in Arabidopsis and grasses. Plant Physiol 149, 88–95.
Obenauer, J.C., Cantley, L.C., Yaffe, M.B. 2003. Scansite 2.0: Proteome-wide prediction of cell signaling interactions using short sequence motifs. Nucleic Acids Res 31, 3635–3641.
Ortiz, A.R., Strauss, C., Olmea, O. 2002. MAMMOTH (matching molecular models obtained from theory): An automated method for model comparison. Protein Sci 11, 2606–2621.
Paliy, O., Gargac, S.M., Cheng, Y., Uversky, V.N., Dunker, A.K. 2008. Protein disorder is positively correlated with gene expression in Escherichia coli. J Proteome Res 7, 2234–2245.
Paterson, A.H., Bowers, J.E., Bruggmann, R., Dubchak, I., Grimwood, J., Gundlach, H., Haberer, G., Hellsten, U., Mitros, T., Poliakov, A., Schmutz, J., Spannagl, M., Tang, H., Wang, X., Wicker, T., Bharti, A.K., Chapman, J., Feltus, F.A., Gowik, U., Grigoriev, I.V., Lyons, E., Maher, C.A., Martis, M., Narechania, A., Otillar, R.P., Penning, B.W., Salamov, A.A., Wang, Y., Zhang, L., Carpita, N.C., Freeling, M., Gingle, A.R., Hash, C.T., Keller, B., Klein, P., Kresovich, S., McCann, M.C., Ming, R., Peterson, D.G., Mehboob-ur-Rahman, W.D., Westhoff, P., Mayer, K.F., Messing, J., Rokhsar, D.S. 2009. The Sorghum bicolor genome and the diversification of grasses. Nature 457, 551–556.
Pratt, L.H., Liang, C., Shah, M., Sun, F., Wang, H., Reid, S.P., Gingle, A.R., Paterson, A.H., Wing, R., Dean, R., Klein, R., Nguyen, H.T., Ma, H.M., Zhao, X., Morishige, D.T., Mullet, J.E., Cordonnier-Pratt, M.M. 2005. Sorghum expressed sequence tags identify signature genes for drought, pathogenesis, and skotomorphogenesis from a milestone set of 16,801 unique transcripts. Plant Physiol 139, 869–884.
Prilusky, J., Felder, C.E., Zeev-Ben-Mordehai, T., Rydberg, E.H., Man, O., Beckmann, J.S., Silman, I., Sussman, J.L. 2005. FoldIndex: A simple tool to predict whether a given protein sequence is intrinsically unfolded. Bioinformatics 21, 3435–3438.
Sanchez, A.C., Subudhi, P.K., Rosenow, D.T., Nguyen, H.T. 2002. Mapping QTLs associated with drought resistance in sorghum (Sorghum bicolor L. Moench). Plant Mol Biol 48, 713–726.
Schuppler, U. 1998. Effect of water stress on cell division and cell-division-cycle 2-like cell-cycle kinase activity in wheat leaves. Plant Physiol 117, 667–678.
Shao, H.B., Song, W.Y., Chu, L.Y. 2008. Advances of calcium signals involved in plant anti-drought. C R Biol 331, 587–596.
Sharoni, A.M., Nuruzzaman, M., Satoh, K., Moumeni, A., Attia, K., Venuprasad, R., Serraj, R., Kumar, A., Leung, H., Islam, A.K., Kikuchi, S. 2012. Comparative transcriptome analysis of AP2/EREBP gene family under normal and hormone treatments, and under two drought stresses in NILs setup by Aday Selection and IR64. Mol Genet Genomics 284, 1–19.
Söding, J. 2005. Protein homology detection by HMMHMM comparison. Bioinformatics 21, 951–960.
Tamura, K., Dudley, J., Nei, M., Kumar, S. 2007. MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol 24, 1596–1599.
Tompa, P. 2005. The interplay between structure and function in intrinsically unstructured proteins. FEBS Lett 579, 3346–3354.
Wright, P.E., Dyson, H.J. 2009. Linking folding and binding. Curr Opin Struct Biol 19, 31–38.
Zdobnov, E.M., Apweiler, R. 2001. InterProScan — an integration platform for the signature-recognition methods in InterPro. Bioinformatics 17, 847–848.
Zhang, Z.G., Zhou, H.L., Chen, T., Gong, Y., Cao, W.H., Wang, Y.J., Zhang, J.S., Chen, S.Y. 2004. Evidence for serine/threonine and histidine kinase activity in the tobacco ethylene receptor protein NTHK2. Plant Physiol 136, 2971–2981.
Zhuang, J., Cai, B., Peng, R.H., Zhu, B., Jin, X.F., Xue, Y., Gao, F., Fu, X.Y., Tian, Y.S., Zhao, W., Qiao, Y.S., Zhang, Z., Xiong, A.S., Yao, Q.H. 2008. Genome-wide analysis of the AP2/ERF gene family in Populus trichocarpa. Biochem. Biophys Res Commun 371, 468–474.
Zhuang, J., Peng, R.H., Cheng, Z.M., Zhang, J., Caia, B., Zhang, Z., Gao, F., Bo, Z., Fu, X.Y., Jin, X.F., Chen, J.M., Qiao, Y.S., Xiong, A.S., Yao, Q.G. 2009. Genome-wide analysis of the putative AP2/ERF family genes in Vitis vinifera. Scientia Horticulturae 123, 73–81.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Rights and permissions
About this article
Cite this article
Shanker, A.K., Maddaala, A., Kumar, M.A. et al. In silico targeted genome mining and comparative modelling reveals a putative protein similar to an Arabidopsis drought tolerance DNA binding transcription factor in Chromosome 6 of Sorghum bicolor genome. Interdiscip Sci Comput Life Sci 4, 133–141 (2012). https://doi.org/10.1007/s12539-012-0121-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12539-012-0121-1