Abstract
Putative single-copy genes and conserved ortholog sets (COS) were identified in model plant species thale cress (Arabidopsis thaliana), rice (Oryza sativa ssp. japonica), and poplar [black cottonwood, Populus trichocarpa (Torr. & Gray ex Brayshaw)] and used to find putative COS in four conifers (the Coniferales order). Using expressed sequence tag sequences, unique transcript sets were assembled in loblolly pine (Pinus taeda L.), white spruce [Picea glauca (Moench) Voss], Douglas-fir [Pseudotsuga menziesii (Mirb.) Franco var. menziesii], and sugi [Cryptomeria japonica (Thunberg ex Linnaeus f.) D. Don]. They were compared with COS sets identified in three model plant species using comparative sequence analysis. Almost half of the single-copy genes in herbaceous species (Arabidopsis and rice) had additional copies and homologs in poplar and conifers. The identified tentative COS sets have many applications in evolutionary genomics studies, phylogenetic analysis, and comparative mapping.
Similar content being viewed by others
References
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Bennett MD, Smith JB (1991) Nuclear DNA amounts in angiosperms. Philos Trans R Soc Lond, B 334:309–345
Bradshaw HD, Stettler RF (1993) Molecular genetics of growth and development in Populus. I. Triplody in hybrid poplars. Theor Appl Genet 86:301–307
Brown GR, Kadel EE III, Bassoni DL, Kiehne KL, Temesgen B, van Buijtenen JP, Sewell MM, Marshall KA, Neale DB (2001) Anchored reference loci in loblolly pine (Pinus taeda L.) for integrating pine genomics. Genetics 159:799–809
DiFazio SP (2005) A pioneer perspective on adaptation. Functional genomics of environmental adaptation in Populus: the 12th New Phytologist Symposium, Gatlinburg, TN, USA, October 2004. New Phytol 165:661–664
Dong Q, Schlueter SD, Brendel V (2004) PlantGDB, plant genome database and analysis tools. Nucleic Acids Res 32:D354–D359
Frankis MP (1989) Generic inter-relationships in Pinaceae. Notes Roy Bot Gard Edinburgh 45:527–548
Fulton TM, van der Hoeven R, Eannetta NT, Tanksley SD (2002) Identification, analysis, and utilization of conserved ortholog set markers for comparative genomics in higher plants. Plant Cell 14:1457–1467
Guillet-Claude C, Isabel N, Pelgas B, Bousquet J (2004) The evolutionary implications of knox-I gene duplications in conifers: correlated evidence from phylogeny, gene mapping, and analysis of functional divergence. Mol Biol Evol 21:2232–2245
Gupta PK, Rustgi S (2004) Molecular markers from the transcribed/expressed region of the genome in higher plants. Funct Integr Genomics 4:139–162
Hizume M, Kondo T, Shibata F, Ishizuka R (2001) Flow cytometric determination of genome size in the Taxodiaceae, Cupressaceae sensu stricto and Sciadopityaceae. Cytologia 66:307–311
Huang X, Madan A (1999) CAP3: a DNA sequence assembly program. Genome Res 9:868–877
International Rice Genome Sequencing Project (2005) The map-based sequence of the rice genome. Nature 436:793–800
Krutovsky KV, Troggio M, Brown GR, Jermstad KD, Neale DB (2004) Comparative mapping in the Pinaceae. Genetics 168:447–461
Morton NE (1991) Parameters of the human genome. Proc Natl Acad Sci USA 88:7474–7476
Neale DB, Krutovsky KV (2004) Comparative genetic mapping in trees: the group of conifers. In: Lörz H, Wenzel G (eds) Biotechnology in agriculture and forestry: molecular marker systems. Springer, Berlin Heidelberg New York, pp 267–277
O’Brien IEW, Smith DR, Gardner RC, Murray BG (1996) Flow cytometric determination of genome size in Pinus. Plant Sci 115:91–99
Ohri D, Khoshoo TN (1986) Genome size in gymnosperms. Plant Syst Evol 153:119–132
Pertea G, Huang X, Liang F, Antonescu V, Sultana R, Karamycheva S, Lee Y, White J, Cheung F, Parvizi B, Tsai J, Quackenbush J (2003) TIGR gene indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics 19:651–652
Rudd S, Schoof H, Mayer K (2005) PlantMarkers—a database of predicted molecular markers from plants. Nucleic Acids Res 33(Suppl 1):D628–D632
Tatusov RL, Galperin MY, Natale DA, Koonin EV (2000) The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res 28:33–36
Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, Rao BS, Smirnov S, Sverdlov AV, Vasudevan S, Wolf YI, Yin JJ, Natale DA (2003) The COG database: an updated version includes eukaryotes. BMC Bioinformatics 4:41
Temesgen B, Brown GR, Harry DE, Kinlaw CS, Sewell MM, Neale DB (2001) Genetic mapping of expressed sequence tag polymorphism (ESTP) markers in loblolly pine (Pinus taeda L.). Theor Appl Genet 102:664–675
Zhang Z, Schwartz S, Wagner L, Miller W (2000) A greedy algorithm for aligning DNA sequences. J Comput Biol 7:203–214
Acknowledgments
We thank Glenn Howe and Dana Howe (Oregon State University, USA) for providing additional Douglas-fir EST sequences and Stephen DiFazio (West Virginia University, Morgantown, WV, USA) for the poplar predicted protein set. We also thank Santiago C. González-Martínez (Center of Forest Research, Madrid, Spain), Glenn Howe, Jean Bousquet (Université Laval, Canada) and anonymous reviewers for thorough reviewing of the manuscript and useful recommendations that greatly helped us improve the paper. Funding for this project was provided by the USDA Plant Genome National Research Initiative (grant no. 00-35300-9316) and the Pacific Southwest Research Station, the USDA Forest Service within the American Forest & Paper Association Agenda 2020 program. Trade names and commercial products or enterprises are mentioned solely for information and no endorsement by the USDA is implied.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by S. Aitken
Electronic supplementary material
Below is the link to the electronic supplementary material.
These files are unfortunately not in the Publisher's archive anymore:
-
C_japonica_55-COS-genes-shared-with-ARP (FASTA 38 kb)
-
P_glauca_359-COS-genes-shared-with-ARP (FASTA 355 kb)
-
P_menziesii_90-COS-genes-shared-with-ARP (FASTA 56 kb)
-
P_taeda_216-COS-genes-shared-with-ARP (FASTA 175 kb)
-
P_trichocarpa-753-COS-genes-shared-with-ARP (FASTA 338 kb)
-
Poplar-9605-single-copy-genes-locat-annot (FASTA 3290 kb)
-
rice-12004-single-hits (FASTA 3849 kb)
Table 1S
contigs ESTs ID (XLS 10230 kb)
Table 2S
COS summary (XLS 9884 kb)
Table 3S
26 COS annotation (XLS 46 kb)
Table 4S
753 COS trees (XLS 526 kb)
Rights and permissions
About this article
Cite this article
Krutovsky, K.V., Elsik, C.G., Matvienko, M. et al. Conserved ortholog sets in forest trees. Tree Genetics & Genomes 3, 61–70 (2006). https://doi.org/10.1007/s11295-006-0052-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11295-006-0052-2