Skip to main content

Exploring Protein Functional Relationships Using Genomic Information and Data Mining Techniques

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2714))

Abstract

Anapproach that uses both supervised and unsupervised learning methods for exploring protein functional relationships is reported; we refer to this as Maximum Contrast (MC) tree. The tree is constructed by performing a hierarchical decomposition of the feature space; this step is performed regardless of complex nature of protein functions, i.e. it performs this decomposition even without knowledge of the protein functional class labels. In order to test our algorithm, we have constructed a library of Protein Phylogenetic Profiles for the proteins in the yeast Saccharomyces Cerevisiae with 60 species. Results showed our algorithm compares favorably to other classification algorithms such as the decision tree algorithms C4.5, C5, and to support vector machines.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D.J. (1997) “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.” Nucleic Acids Res. 25:3389–3402.

    Article  Google Scholar 

  2. Brown, M. P. S., Grundy, W. N., Lin, D., Cristianini, N., Sugnet, C.W., Furey, T. S., Ares M. J., and Haussler, D. (2000), Knowledge-based analysis of microarray gene expression data by using support vector machines, PNAS 97, p. 262–267.

    Google Scholar 

  3. Cover, T. M. and Hart, P. E. (1967) “Nearest Neighbor Pattern Classification” IEEE Trans. IT Vol. 13.No.1 P21–27, 1967.

    Article  MATH  Google Scholar 

  4. Ersoy, O K., Choe W, Bina M (2000) “Neural network schemes for detecting rare events in human genomic DNA” Bioinformatics, Vol. 16no 12 Pages 1062–1072.

    Article  Google Scholar 

  5. Ersoy, O.K., Deng, S.W. (1995). “Parallel, Self-Organizing Neural Networks with Continuous Inputs and Outputs”, IEEE Transactions on Neural Networks Volume 6Number 4, pp. 1037–1044.

    Article  Google Scholar 

  6. Ersoy, O. K. et al (1998) in Algorithm and Architectures (Leondes, C. T. editor) Pages 364–401, Academic Press 1998 (ISBN: 012443861X).

    Google Scholar 

  7. Marcotte, E. M., Pellegrini, M., Thompson, M. J., Yeates, T. O., and Eisenberg, D. (1999), A combined algorithm for genome-wide prediction of protein function, Nature 402, p.83–86.

    Article  Google Scholar 

  8. Pavlidis, Paul, Jason Weston, Jinsong Cai and William Noble Grundy. “Learning Gene Functional Classification from Multiple Data Types”. J. of Computational Biology, Vol 9. pp. 401–444.

    Google Scholar 

  9. Pellegrini, M., Marcotte, E. M., Thompson, M. J., Eisenberg, D., and Yeates, T. O. (1999), Assigning protein functions by comparative genome analysis: Protein phylogenetic profiles, PNAS 96, p. 4285–4288.

    Google Scholar 

  10. Yang, Jack, Yang, Mary and Ersoy, O.K. (2002) “Gene finding and protein functional determination by protein phylogenetic profile and computational intelligence,” Intelligent Engineering Systems through Neural Networks, Vol 12. Page 733–740 ASME Press (ISBN: 0791801918)

    Google Scholar 

  11. Vert J.(2002) “A tree kernel to analyze phylogenetic profiles”, Bioinformatics, Vol 18Suppl 1. pp. S276–S284.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yang, J.Y., Yang, M.Q., Ersoy, O.K. (2003). Exploring Protein Functional Relationships Using Genomic Information and Data Mining Techniques. In: Kaynak, O., Alpaydin, E., Oja, E., Xu, L. (eds) Artificial Neural Networks and Neural Information Processing — ICANN/ICONIP 2003. ICANN ICONIP 2003 2003. Lecture Notes in Computer Science, vol 2714. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44989-2_128

Download citation

  • DOI: https://doi.org/10.1007/3-540-44989-2_128

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40408-8

  • Online ISBN: 978-3-540-44989-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics