Seed Dormancy pp 403-419 | Cite as

Seed Bioinformatics

  • George W. Bassel
  • Michael J. Holdsworth
  • Nicholas J. ProvartEmail author
Part of the Methods in Molecular Biology book series (MIMB, volume 773)


Analysis of gene expression data sets is a potent tool for gene function prediction, cis-element discovery, and hypothesis generation for the model plant Arabidopsis thaliana, and more recently for other agriculturally relevant species. In the case of Arabidopsis thaliana, experiments conducted by individual researchers to document its transcriptome have led to large numbers of data sets being made publicly available for data mining by the so-called “electronic northerns,” co-expression analysis and other methods. Given that approximately 50% of the genes in Arabidopsis have no function ascribed to them by “conventional” homology searches, and that only around 10% of the genes have had their function experimentally determined in the laboratory, these analyses can accelerate the identification of potential gene function at the click of a mouse. This chapter covers the use of bioinformatic data mining tools available at the Bio-Array Resource ( and elsewhere for hypothesis generation in the context of seed biology.

Key words

Gene expression analysis Co-expression analysis Microarrays Data mining Gene ­function prediction Arabidopsis thaliana Seed biology 


  1. 1.
    Lee, H. K., Hsu, A. K., Sajdak, J., Qin, J., and Pavlidis, P. (2004) Coexpression analysis of human genes across many microarray data sets. Genome Res. 14, 1085–94.Google Scholar
  2. 2.
    Wolfe, C., Kohane, I., and Butte, A. (2005) Systematic survey reveals general applicability of “guilt-by-association” within gene coexpression networks. BMC Bioinformatics 6, 227.Google Scholar
  3. 3.
    Zhang, W., Morris, Q., Chang, R., Shai, O., Bakowski, M., Mitsakakis, N., Mohammad, N., Robinson, M., Zirngibl, R., Somogyi, E., Laurin, N., Eftekharpour, E., Sat, E., Grigull, J., Pan, Q., Peng, W.-T., Krogan, N., Greenblatt, J., Fehlings, M., van der Kooy, D., Aubin, J., Bruneau, B., Rossant, J., Blencowe, B., Frey, B., and Hughes, T. (2004) The functional landscape of mouse gene expression. J. Biology 3, 21.Google Scholar
  4. 4.
    Schmid, M., Davison, T. S., Henz, S. R., Pape, U. J., Demar, M., Vingron, M., Scholkopf, B., Weigel, D., and Lohmann, J. U. (2005) A gene expression map of Arabidopsis thaliana development. Nature Genet. 37, 501–06.Google Scholar
  5. 5.
    Kilian, J., Whitehead, D., Horak, J., Wanke, D., Weinl, S., Batistic, O., D’Angelo, C., Bornberg-Bauer, E., Kudla, J., and Harter, K. (2007) The AtGenExpress global stress expression data set: protocols, evaluation and model data analysis of UV-B light, drought and cold stress responses. Plant J. 50, 347–63.Google Scholar
  6. 6.
    Edgar, R., Domrachev, M., and Lash, A. E. (2002) Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucl. Acids Res. 30, 207–10.Google Scholar
  7. 7.
    Rocca-Serra, P., Brazma, A., Parkinson, H., Sarkans, U., Shojatalab, M., Contrino, S., Vilo, J., Abeygunawardena, N., Mukherjee, G., Holloway, E., Kapushesky, M., Kemmeren, P., Lara, G. G., Oezcimen, A., and Sansone, S. A. (2003) ArrayExpress: a public database of gene expression data at EBI. Current Res. Biol. 326, 1075–78.Google Scholar
  8. 8.
    Craigon, D. J., James, N., Okyrere, J., Higgins, J., Jotham, J., and May, S. (2004) NASCArrays: a reposiotory for microarray data generated by NASC’s transciptomics service. Nucleic Acids Res. 32, D575-77.Google Scholar
  9. 9.
    Garcia-Hernandez, M., Berardini, T. Z., Chen, G., Crist, D., Doyle, A., Huala, E., Knee, E., Lambrecht, M., Miller, N., Mueller, L. A., Mundodi, S., Reiser, L., Rhee, S. Y., Scholl, R., Tacklind, J., Weems, D. C., Wu, Y., Xu, I., Yoo, D., Yoon, J., and Zhang, P. (2002) TAIR: a resource for integrated Arabidopsis data. Funct. Integr. Genomics 2, 239–53.Google Scholar
  10. 10.
    Zimmermann, P., Hirsch-Hoffman, M., Hennig, L., and Gruissem, W. (2004) GENEVESTIGATOR. Arabidopsis microarray database and analysis toolbox. Plant Physiol. 136, 2621–32.Google Scholar
  11. 11.
    Toufighi, K., Brady, M., Austin, R., Ly, E., and Provart, N. (2005) The Botany Array Resource: e-Northerns, Expression Angling, and Promoter Analyses. Plant J. 43, 153–63.Google Scholar
  12. 12.
    Manfield, I. W., Jen, C.-H., Pinney, J. W., Michalopoulos, I., Bradford, J. R., Gilmartin, P. M., and Westhead, D. R. (2006) Arabidopsis Co-expression Tool (ACT): web server tools for microarray-based gene expression analysis. Nucl. Acids Res. 34, W504-09.Google Scholar
  13. 13.
    Obayashi, T., Kinoshita, K., Nakai, K., Shibaoka, M., Hayashi, S., Saeki, M., Shibata, D., Saito, K., and Ohta, H. (2007) ATTED-II: a database of co-expressed genes and cis elements for identifying co-regulated gene groups in Arabidopsis. Nucl. Acids Res. 35, D863-69.Google Scholar
  14. 14.
    Steinhauser, D., Usadel, B., Luedemann, A., Thimm, O., and Kopka, J. (2004) CSB.DB: a comprehensive systems-biology database. Bioinformatics 20, 3647–51.Google Scholar
  15. 15.
    Aoki, K., Ogata, Y., and Shibata, D. (2007) Approaches for extracting practical information from gene co-expression networks in plant biology. Plant Cell Physiol. 48, 381–90.Google Scholar
  16. 16.
    Wille, A., Zimmermann, P., Vranova, E., Furholz, A., Laule, O., Bleuler, S., Hennig, L., Prelic, A., von Rohr, P., Thiele, L., Zitzler, E., Gruissem, W., and Buhlmann, P. (2004) Sparse graphical Gaussian modeling of the isoprenoid gene network in Arabidopsis thaliana. Genome Biol. 5, R92.Google Scholar
  17. 17.
    Ehlting, J., Provart, N. J., and Werck-Reichhart, D. (2006) Functional annotation of the Arabidopsis P450 superfamily based on large-scale co-expression analysis. Biochem. Soc. Trans. 34, 1192–98.Google Scholar
  18. 18.
    Winter, D., Vinegar, B., Nahal, H., Ammar, R., Wilson, G. V., and Provart, N. J. (2007) An ‘Electronic Fluorescent Pictograph’ browser for exploring and analyzing large-scale biological data sets. PLoS One 2, e718.Google Scholar
  19. 19.
    Bassel, G. W., Fung, P., Chow, T.-f.  F., Foong, J. A., Provart, N. J., and Cutler, S. R. (2008) Elucidating the germination transcriptional program using small molecules. Plant Physiol. 147, 143–55.Google Scholar
  20. 20.
    Carrera, E., Holman, T., Medhurst, A., Dietrich, D., Footitt, S., Theodoulou, F. L., and Holdsworth, M. J. (2008) Seed after-ripening is a discrete developmental pathway associated with specific gene networks in Arabidopsis. Plant J. 53, 214–24.Google Scholar
  21. 21.
    Carrera, E., Holman, T., Medhurst, A., Peer, W., Schmuths, H., Footitt, S., Theodoulou, F. L., and Holdsworth, M. J. (2007) Gene expression profiling reveals defined functions of the ATP-binding cassette transporter COMATOSE late in phase II of germination. Plant Physiol. 143, 1669–79.Google Scholar
  22. 22.
    Holdsworth, M. J., Finch-Savage, W. E., Grappin, P., and Job, D. (2008) Post-genomics dissection of seed dormancy and germination. Trends Plant Sci. 13, 7–13.Google Scholar
  23. 23.
    Howell, K. A., Narsai, R., Carroll, A., Ivanova, A., Lohse, M., Usadel, B., Millar, A. H., and Whelan, J. (2008) Mapping metabolic and transcript temporal switches during germination in Oryza sativa highlights specific transcription factors and the role of RNA instability in the germination process. Plant Physiol. pp.108.129874. Not sure what this means? Please put vol, page nos- thanks.Google Scholar
  24. 24.
    Sreenivasulu, N., Usadel, B., Winter, A., Radchuk, V., Scholz, U., Stein, N., Weschke, W., Strickert, M., Close, T. J., Stitt, M., Graner, A., and Wobus, U. (2008) Barley grain maturation and germination: Metabolic pathway and regulatory network commonalities and differences highlighted by new MapMan/PageMan profiling tools. Plant Physiol. 146, 1738–58.Google Scholar
  25. 25.
    Gentleman, R., Carey, V., Bates, D., Bolstad, B., Dettling, M., Dudoit, S., Ellis, B., Gautier, L., Ge, Y., Gentry, J., Hornik, K., Hothorn, T., Huber, W., Iacus, S., Irizarry, R., Leisch, F., Li, C., Maechler, M., Rossini, A., Sawitzki, G., Smith, C., Smyth, G., Tierney, L., Yang, J., and Zhang, J. (2004) Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 5, R80.Google Scholar
  26. 26.
    Tusher, V. G., Tibshirani, R., and Chu, G. (2001) Significance analysis of microarrays applied to the ionizing radiation response. Proc.Natl. Acad. Sci. USA 98, 5116–21.Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2011

Authors and Affiliations

  • George W. Bassel
    • 1
  • Michael J. Holdsworth
    • 2
  • Nicholas J. Provart
    • 3
    Email author
  1. 1.Department of HorticultureOregon State UniversityCorvallisUSA
  2. 2.Division of Plant and Crop Sciences, School of Biosciences, Centre for Plant Integrative BiologyUniversity of NottinghamSutton BoningtonUK
  3. 3.Department of Cell & Systems Biology/Centre for the Analysis of Genome Evolution and FunctionUniversity of TorontoTorontoCanada

Personalised recommendations