Plant Molecular Biology

, Volume 57, Issue 2, pp 203–224 | Cite as

Large-scale statistical analysis of secondary xylem ESTs in pine

  • Nathalie PavyEmail author
  • Jérôme Laroche
  • Jean Bousquet
  • John Mackay


A computational analysis of pine transcripts was conducted to contribute to the functional annotation of conifer sequences. A statistical analysis of expressed sequential tags(ESTs) belonging the 7732 contigs in the TIGR Pinus Gene Index (PGI1.0) identified 260 differentially represented gene sequences across six cDNA libraries from loblolly pine secondary xylem. Cluster analysis of this subset of contigs resulted in five groups representing genes preferentially represented in one of the xylem samples (compression wood, plannings, root xylem, latewood) and one group containing mostly genes simultaneously present in compression and side wood libraries. To complement the sequence annotation, 27 cDNA clones representing selected transcripts were completely sequenced. Several genes were identified that could represent putative markers for xylem from different organs, at different stages of development. Several sequences encoding regulatory proteins were over-represented in root xylem as opposed to the other xylem samples. Some of them belonged to known families of plant transcription factors, but two genes were previously uncharacterized in plants. One transcript was homologous to the gene encoding the Smad4 interacting factor, a key co-activator in TGFβ (transforming growth factor) signalling in animals. Thus, the digital analysis of pine ESTs highlighted a putative gene function of potentially broad interest but that has yet to be investigated in plants. More generally, this study showed that the application of numerical approaches to EST databases should be helpful in establishing priorities among genes to consider for targeted functional studies. Thus, we illustrated the potential of extracting information from conifer sequences already accessible through well-structured public databases.


computational analysis EST gene discovery pine xylem 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Ahern-Djamali, S.M., Comer, A.R., Bachmann, C., Kastenmeier, A.S., Reddy, S.K., Beckerle, M.C., Walter, U., Hoffmann, F.M. 1998Mutations in Drosophila enabled and rescue by human vasodilator-stimulated phosphoprotein (VASP) indicate important functional roles for Ena/VASP homology domain 1 (EVH1) and EVH2 domainsMol. Biol. Cell921572171Google Scholar
  2. Allona, I., Quinn, M., Shoop, E., Swope, K., St Cyr, S., Carlis, J., Riedl, J., Retzel, E., Campbell, M., Sederoff, R., Whetten, R.W. 1998Analysis of xylem formation in pine by cDNA sequencingProc. Natl. Acad. Sci. USA9596939698Google Scholar
  3. Altschul, S.F., Maden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J. 1997Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Res.2533893402CrossRefPubMedGoogle Scholar
  4. Audic, S., Claverie, J.M. 1997The significance of digital gene expression profilesGenome Res.7986995Google Scholar
  5. Bai, R.Y., Koester, C., Ouyang, T., Hahn, S.A., Hammerschmidt, M., Peschel, C., Duyster, J. 2002SMIF, a Smad4-interacting protein that functions as a co-activator in TGFβ signallingNature Cell Biol.4181190Google Scholar
  6. Ball, L.J., Jarchau, T., Oschkinat, H., Walter, U. 2002EVH1 domains: structure, function and interactionsFEBS Lett.194552Google Scholar
  7. Bedon, F., Blais, S., Roy, V., Bérubé, H., Morency, M.J., Mackay, J. 2004Characterization, expression and phylogeny of xylem expressed R2R3-MYB genes of spruce and pine treesPlant & Animal Genomes XII ConferenceSan Diego, CAGoogle Scholar
  8. Ben-Yaacov, S., Le Borgne, R., Abramson, I., Schweisguth, F., Schejter, E.D. 2001Wasp, the Drosophila Wiskott–Aldrich syndrome gene homologue, is required for cell fate decisions mediated by Notch signallingJ. Cell Biol152113Google Scholar
  9. Bonke, M., Thitamadee, S., Mahonen, A.P., Hauser, M.T., Helariutta, Y. 2003APL regulates vascular tissue identity in ArabidopsisNature426181186Google Scholar
  10. Bortoluzzi, S., d’Alessi, F., Romualdi, C., Danieli, G.A. 2001Differential expression of genes coding for ribosomal proteins in different human tissuesBioinformatics1711521157Google Scholar
  11. Bortoluzzi, S., d’Alessi, F., Danieli, G.A. 2000A computational reconstruction of the adult human heart transcriptional profileJ. Mol. Cell Cardiol3219311938Google Scholar
  12. Bortoluzzi, S., Danieli, G.A. 1999Towards an in silico analysis of transcription patternsTrends Genet15118119Google Scholar
  13. Brenner, E.D., Stevenson, D.W., McCombie, R.W., Katari, M.S., Rudd, S.A., Mayer, K.F., Palenchar, P.M., Runko, S.J., Twigg, R.W., Dai, G.,  et al. 2003Expressed sequence tag analysis in Cycas, the most primitive living seed plantGenome Biol.4R78Google Scholar
  14. Callebaut, I. 2002An EVH1/WH1 domain as a key actor in TGFbeta signallingFEBS Lett22178180Google Scholar
  15. Chang, S., Pureyear, J., Cairney, J. 1993A simple and efficient method for isolating RNA from pine treesPlant Mol. Biol. Rep.11113116Google Scholar
  16. Claverie, J.M. 1999Computational methods for the identification of differential and coordinated gene expressionHum. Mol. Genet.818211832Google Scholar
  17. Dubos, C., Plomion, C. 2003Identification of water-deficit responsive genes in maritime pine (Pinus pinaster Ait.) rootsPlant Mol. Biol.51249262Google Scholar
  18. Eisen, M.B., Spellman, P.T., Brown, P.O., Botstein, D. 1998Cluster analysis and display of genome-wide expression patternsProc. Natl. Acad. Sci. USA951486314868Google Scholar
  19. Ewing, R.M., Kahla, A.B., Poirot, O., Lopez, F., Audic, S., Claverie, J.M. 1999Large-scale statistical analyses of rice ESTs reveal correlated patterns of gene expressionGenome Res.9950959Google Scholar
  20. Felsenstein J. (1993). PHYLIP (Phylogeny Inference Package) version 3.6a2. Distributed by the author. Department of Genetics, University of Washington, Seattle.Google Scholar
  21. Florin, R. 1963The distribution of conifer and taxad genera in time and spaceActa Horti. Bergiani20121312Google Scholar
  22. Geourjon, C., Deleage, G. 1994SOPM: a self-optimized method for protein secondary structure predictionProtein Eng.7157164Google Scholar
  23. Greller, L.D., Tobin, F.L. 1999Detecting selective expression of genes and proteinsGenome Res9282296Google Scholar
  24. Hertzberg, M., Aspeborg, H., Schrader, J., Andersson, A., Erlandsson, R., Blomqvist, K., Bhalerao, R., Uhlen, M., Teeri, T.T., Lundeberg, J.,  et al. 2001A transcriptional roadmap to wood formationProc. Natl. Acad. Sci. USA981473214737Google Scholar
  25. Karlowski, W.M., Hirsch, A.M. 2003The over-expression of an alfalfa RING-H2 gene induces pleiotropic effects on plant growth and developmentPlant Mol. Biol.52121133Google Scholar
  26. Kirst, M., Johnson, A.F., Baucom, C., Ulrich, E., Hubbard, K., Staggs, R., Paule, C., Retzel, E., Whetten, R., Sederoff, R. 2003Apparent homology of expressed genes from wood-forming tissues of loblolly pine (Pinus taeda L.) with Arabidopsis thalianaProc. Natl. Acad. Sci. USA10073837388Google Scholar
  27. Kosarev, P., Mayer, K.F. and Hardtke, C.S. 2002. Evaluation and classification of RING-finger domains encoded by the Arabidopsis genome. Genome Biol. 3, research0016.1-0016.12.Google Scholar
  28. Kranz, H.D., Denekamp, M., Greco, R., Jin, H., Levya, A., Meissner, R.C., Petroni, K., Urzainqui, A., Bevan, M., Martin, C.,  et al. 1998Towards functional characterisation of the members of the R2R3-MYB gene family from Arabidopsis thalianaPlant J.16263276Google Scholar
  29. Mata, J., Bahler, J. 2003Correlations between gene expression and gene conservation in fission yeastGenome Res.1326862690Google Scholar
  30. McDougall, G.J. 2000A comparison of proteins from the developing xylem of compression and non-compression wood of branches of Sitka spruce (Picea sitchensis) reveals a differentially expressed laccaseJ. Exp. Bot.5113951401Google Scholar
  31. Megy, K., Audic, S. and Claverie, J.M. 2002. Heart-specific genes revealed by expressed sequence tag (EST) sampling. Genome Biol. 16: 3(9).Google Scholar
  32. Mellerowicz, E.J., Baucher, M., Sundberg, B., Boerjan, W. 2001Unravelling cell wall formation in the woody dicot stemPlant Mol. Biol.47239274Google Scholar
  33. Mercy, I.S., Meeley, R.B., Nichols, S.E., Olsen, O.A. 2003Zea mays ZmMybst1 cDNA, encodes a single Myb-repeat protein with the VASHAQKYF motifJ. Exp. Bot.5411171119Google Scholar
  34. Nagano, Y., Furuhashi, H., Inaba, T., Sasaki, Y. 2001A novel class of plant-specific zinc-dependent DNA-binding protein that binds to A/T-rich DNA sequencesNucleic Acids Res.2940974105Google Scholar
  35. Notredame, C., Higgins, D.G., Heringa, J. 2000T-Coffee: A novel method for fast and accurate multiple sequence alignmentJ. Mol. Biol.302205217Google Scholar
  36. Ogihara, Y., Mochida, K., Nemoto, Y., Murai, K., Yamazaki, Y., Shin, I.T., Kohara, Y. 2003Correlated clustering and virtual display of gene expression patterns in the wheat life cycle by large-scale statistical analyses of expressed sequence tagsPlant J.3310011011Google Scholar
  37. Olivier, S.G. 1996From DNA sequence to biological functionNature379597600Google Scholar
  38. Patzlaff, A., McInnis, S., Courtenay, A., Surman, C., Newman, L.J., Smith, C., Bevan, M.W., Mansfield, S., Whetten, R.W., Sederoff, R.R., Campbell, M.M. 2003Characterization of a pine MYB that regulates lignificationPlant J.36743754Google Scholar
  39. Quackenbush, J., Liang, F., Holt, I., Pertea, G., Upton, J. 2000The TIGR Gene Indices: reconstruction and representation of expressed gene sequencesNucleic Acids Res.28141145Google Scholar
  40. Romualdi, C., Bortoluzzi, S., Danieli, G.A. 2001Detecting differentially expressed genes in multiple tag sampling experiments: comparative evaluation of statistical testsHum. Mol. Genet.1021332141Google Scholar
  41. Ronning, C.M., Stegalkina, S.S., Ascenzi, R.A., Bougri, O., Hart, A.L., Utterbach, T.R., Vanaken, S.E., Riedmuller, S.B., White, J.A., Cho, J.,  et al. 2003Comparative analyses of potato expressed sequence tag librariesPlant Physiol.131419429Google Scholar
  42. Rost, B., Sander, C. 1993Prediction of protein secondary structure at better than 70 accuracyJ. Mol. Biol232584599Google Scholar
  43. Rozen, S., Skaletsky, H.J. 2000Primer3 on the WWW for general users and for biologist programmersKrawetz, S.Misener, S. eds. Bioinformatics Methods and Protocols: Methods in Molecular Biology.Humana PressTotowa, NJ365386Google Scholar
  44. Savard, L., Li, P., Strauss, S.H., Chase, M.W., Michaud, M., Bousquet, J. 1994Chloroplast and nuclear gene sequences indicate late Pennsylvanian time for the last common ancestor of extant seed plantsProc. Natl. Acad. Sci. USA9151635167Google Scholar
  45. Stekel, G., Git, Y., Falciani, F. 2000The comparison of gene expression from multiple cDNA librariesGenome Res.1020552061Google Scholar
  46. Sterky, F., Regan, S., Karlsson, J., Hertzberg, M., Rohde, A., Holmberg, A., Amini, B., Bhalerao, R., Larsson, M., Villarroel, R.,  et al. 1998Gene discovery in the wood-forming tissues of poplar: analysis of 5,692 expressed sequence tagsProc. Natl. Acad. Sci. USA951333013335Google Scholar
  47. Tamagnone, L., Merida, A., Parr, A., Mackay, S., Culianez-Macia, F.A., Roberts, K., Martin, C. 1998The AmMYB308 and AmMYB330 transcription factors from Antirrhinum regulate phenylpropanoid and lignin biosynthesis in transgenic tobaccoPlant Cell.10135154Google Scholar
  48. Whetten, R., Sun, Y.H., Zhang, Y., Sederoff, R. 2001Functional genomics and cell wall biosynthesis in loblolly pinePlant Mol. Biol.47275291Google Scholar
  49. Wu, X.M., Lim, S.H., Yang, W.C. 2003Characterization, expression and phylogenetic study of R2R3-MYB genes in orchidPlant Mol. Biol.51959972Google Scholar
  50. Zhang, Y., Sederoff, R.R., Allona, I. 2000Differential expression of genes encoding cell wall proteins in vascular tissues from vertical and bent loblolly pine treesTree Physiol.20457466Google Scholar

Copyright information

© Springer 2005

Authors and Affiliations

  • Nathalie Pavy
    • 1
    Email author
  • Jérôme Laroche
    • 3
  • Jean Bousquet
    • 1
    • 2
  • John Mackay
    • 1
  1. 1.ARBOREA and Centre de Recherche en Biologie ForestiéreUniversitè LavalQueCanada
  2. 2.Chaire de Recherche du Canada en Génomique Forestiére et EnvironnementaleUniversité LavalQueCanada
  3. 3.Centre de bio-informatiqueUniversité LavalQueCanada

Personalised recommendations