In silico targeted genome mining and comparative modelling reveals a putative protein similar to an Arabidopsis drought tolerance DNA binding transcription factor in Chromosome 6 of Sorghum bicolor genome

Interdisciplinary Sciences: Computational Life Sciences

Abstract

Arabidopsis thaliana HARDY (AtHRD) is a gene with an APETALA 2 / Ethylene Responsive Factor (AP2/ERF) domain that has been linked to improved performance under drought in rice. We hypothesized that the sorghum genome could encode a similar gene product, and therefore mined the genome computationally for the protein and analysed its structural and functional properties. The AtHRD sequence was used as a BLAST query against the sorghum genome dataset, followed by multiple alignment analysis. A homology model of the target was built using a template detected by pairwise comparison of hidden Markov models of alignments. DNA docking was performed with a matrix of homologous interface contacts. Functional and structural analyses of the query and target were conducted using various online servers.
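The abstract does not say which BLAST program was used. When a protein query is compared against a nucleotide genome, the genomic sequence is typically translated in all six reading frames before comparison. The following is a minimal sketch of that translation step only, assuming the standard genetic code; the function names and the toy sequence are hypothetical illustrations and are not taken from the article.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translations(dna):
    """Return a dict mapping frame labels (+1..+3, -1..-3) to protein strings."""
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


if __name__ == "__main__":
    toy = "ATGGCTAGCTAA"  # hypothetical toy sequence, not sorghum data
    for label, protein in six_frame_translations(toy).items():
        print(label, protein)
```

Each translated frame could then be aligned against the AtHRD protein sequence; the alignment and scoring themselves are left to the search tool.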

A high-scoring segment pair from Chromosome 6 of the sorghum genome, in the region between 54948120 and 54948668, had 68 amino acid similarities out of 184 residues and was 1.4% above the twilight zone threshold. The homology model showed 86.8% of residues in the most favoured regions. The target protein, which has an AP2/ERF domain, showed conserved residues involved in binding when docked with the GCC box DNA motif; it also has a long unstructured region beyond the AP2 domain with several motifs recognised by the serine/threonine protein kinase group. The protein model showed that it could bind to a GCC box, which is present in several drought-responsive genes. The presence of possible signalling domains and intrinsic disorder in the target protein suggests that it could play a role in drought tolerance, an inherent trait of sorghum. These results offer a starting point for validation experiments that could pave the way for cis/transgenic improvement of a range of crops.
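To illustrate what "a GCC box present in a gene" means in practice, the sketch below scans a DNA sequence for the motif on both strands. It assumes the commonly cited GCC box core AGCCGCC, which the abstract does not spell out; the function names and the toy promoter are hypothetical, and this is not the docking procedure used in the article.

```python
# Assumed GCC box core sequence; the abstract itself does not give the motif.
GCC_BOX = "AGCCGCC"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def find_motif(seq, motif=GCC_BOX):
    """Return sorted (0-based start, strand) pairs for every motif occurrence.

    Hits on the '-' strand are reported at the position of the motif's
    reverse complement in the forward sequence.
    """
    seq = seq.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(pattern, start + 1)  # allow overlapping hits
    return sorted(hits)


if __name__ == "__main__":
    toy_promoter = "TTAGCCGCCATTTGGCGGCTAA"  # hypothetical sequence
    print(find_motif(toy_promoter))
```

An exact-match scan like this only flags candidate sites; whether a transcription factor binds them is what the docking and subsequent validation experiments are meant to address.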

Corresponding author

Correspondence to Arun K. Shanker.

Cite this article

Shanker, A.K., Maddaala, A., Kumar, M.A. et al. In silico targeted genome mining and comparative modelling reveals a putative protein similar to an Arabidopsis drought tolerance DNA binding transcription factor in Chromosome 6 of Sorghum bicolor genome. Interdiscip Sci Comput Life Sci 4, 133–141 (2012). https://doi.org/10.1007/s12539-012-0121-1

  • DOI: https://doi.org/10.1007/s12539-012-0121-1
