Gene Orthology Assessment with OrthologID

  • Mary Egan
  • Ernest K. Lee
  • Joanna C. Chiu
  • Gloria Coruzzi
  • Rob DeSalle
Part of the Methods in Molecular Biology book series (MIMB, volume 537)


OrthologID ( allows for the rapid and accurate identification of gene orthology within a character-based phylogenetic framework. The Web application has two functions – an orthologous group search and a query orthology classification. The former determines orthologous gene sets for complete genomes and identifies diagnostic characters that define each orthologous gene set; and the latter allows for the classification of unknown query sequences to orthology groups. The first module of the Web application, the gene family generator, uses an E-value based approach to sort genes into gene families. An alignment constructor then aligns members of gene families and the resulting gene family alignments are submitted to the tree builder to obtain gene family guide trees. Finally, the diagnostics generator extracts diagnostic characters from guide trees and these diagnostics are used to determine gene orthology for query sequences.

Key words

Single linkage cluster orthology phylogeny alignment diagnosis genomics 


  1. 1.
    Chiu, J. C., Lee, E. K., Egan, M. G., Sarkar, I. N., Coruzzi, G. M., and DeSalle, R. (2006) OrthologID: automation of genome-scale ortholog identification within a parsimony framework. Bioinformatics 22, 699–707.PubMedCrossRefGoogle Scholar
  2. 2.
    Koski, L. B., and Golding, G. B. (2001) The closest BLAST hit is often not the nearest neighbor. J Mol Evol 52, 540–42.PubMedGoogle Scholar
  3. 3.
    Sarkar, I. N., Thornton, J. W., Planet, P. J., Figurski, D. H., Schierwater, B., and DeSalle, R. (2002) An automated phylogenetic key for classifying homeoboxes. Mol Phylogenet Evol 24, 388–99.PubMedCrossRefGoogle Scholar
  4. 4.
    Brower, A. V. Z., and Schawaroch, V. (1996) Three steps of homology assessment. Cladistics 12, 265–72.Google Scholar
  5. 5.
    Altschul, S. F., Gish, W., Miller, W., Meyers, E. W., and Lipman, D. J. (1990) Basic local alignment search tool. J Mol Biol 215, 403–10.PubMedGoogle Scholar
  6. 6.
    Tatusov, R. L., Galperin, M. Y., Natale, D. A., and Koonin, E. V. (2000) The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res 28, 33–6.PubMedCrossRefGoogle Scholar
  7. 7.
    Dehal, P. S., and Boore, J. L. (2006) A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs) Database. BMC Bioinformatics 7, 201.PubMedCrossRefGoogle Scholar
  8. 8.
    Katoh, K., Kuma, K., Toh, H., and Miyata, T. (2005) MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res 33, 511–18.PubMedCrossRefGoogle Scholar
  9. 9.
    Swofford, D. L. (2003) PAUP* Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4. Sinauer Associates, Sunderland, Massachusetts.Google Scholar
  10. 10.
    Sarkar, I. N., Planet, P. J., Bael, T. E., Stanley, S. E., Siddall, M., DeSalle, R., and Figurski, D. H. (2002) Characteristic attributes in cancer microarrays. J Biomed Inform Apr/May; 35(2), 111–22PubMedCrossRefGoogle Scholar
  11. 11.
    Eisen, J. A. (1998) Phylogenomics: improving functional predictions for uncharacterized genes by evolutionary analysis. Genome Res 8, 163–67.PubMedGoogle Scholar
  12. 12.
    Thornton, J. W., and DeSalle, R. (2000) Phylogenetics meets genomics: homology and evolution in gene families. Annu Rev Genomics Hum Gene 1, 43–72.Google Scholar
  13. 13.
    DePinna, M. C. C. (1991) Concepts and tests of homology in the cladistic paradigm. Cladistics 7, 367–94.CrossRefGoogle Scholar
  14. 14.
    Lienau, E. K., DeSalle, R., Rosenfeld, J. A., and Planet, P. J. (2006) Reciprocal illumination in the gene content tree of life. Syst Biol 55, 441–53.PubMedCrossRefGoogle Scholar
  15. 15.
    Fitch, W. M. (1970) Distinguishing homologous from analogous proteins. Syst Zool 19, 99–113.PubMedCrossRefGoogle Scholar
  16. 16.
    Davis, J. J., and Nixon, K.C. (1992) Populations, genetic variation and the delimitation of phylogenetic species. Syst Biol 41, 121–35.Google Scholar

Copyright information

© Humana Press, a part of Springer Science+Business Media, LLC 2009

Authors and Affiliations

  • Mary Egan
    • 1
  • Ernest K. Lee
    • 2
  • Joanna C. Chiu
    • 3
  • Gloria Coruzzi
    • 2
  • Rob DeSalle
    • 4
  1. 1.Department of BiologyMontclair State UniversityMontclairUSA
  2. 2.Department of BiologyNew York UniversityNew YorkUSA
  3. 3.Department of Molecular Biology and BiochemistryRutgers UniversityPiscatawayUSA
  4. 4.Sackler Institute of Comparative Genomics, American Museum of Natural History New YorkNew YorkUSA

Personalised recommendations