Fuzzy k-Nearest Neighbor Method for Protein Secondary Structure Prediction and Its Parallel Implementation

Kim, Seung-Yeon; Sim, Jaehyun; Lee, Julian

doi:10.1007/11816102_48

Fuzzy k-Nearest Neighbor Method for Protein Secondary Structure Prediction and Its Parallel Implementation

Seung-Yeon Kim²¹,
Jaehyun Sim²² &
Julian Lee²³

Conference paper

1459 Accesses
6 Citations

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 4115))

Abstract

Fuzzy k-nearest neighbor method is a generalization of nearest neighbor method, the simplest algorithm for pattern classification. One of the important areas for application of the pattern classification is the protein secondary structure prediction, an important topic in the field of bioinformatics. In this work, we develop a parallel algorithm for protein secondary structure prediction, based on the fuzzy k-nearest neighbor method, that uses evolutionary profile obtained from PSI-BLAST (Position Specific Iterative Basic Local Sequence Alignment Tool) as the feature vectors.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Kryshtafovych, A., Venclovas, C., Fidelis, K., Moult, J.: Progress over the First Decade of CASP Experiments. Proteins 61, 225–236 (2005)
Article Google Scholar
Lee, J., Kim, S.-Y., Joo, K., Kim, I., Lee, J.: Prediction of Protein Tertiary Structure using PROFESY, a Novel Method Based on Fragment Assembly and Conformational Space Annealing. Proteins 56, 704–714 (2004)
Article Google Scholar
Lee, J., Kim, S.-Y., Lee, J.: Protein Structure Prediction Based on Fragment Assembly and Parameter Optimization. Biophys. Chem. 115, 209–214 (2005)
Article Google Scholar
Lee, J., Kim, S.-Y., Lee, J.: Protein Structure Prediction Based on Fragment Assembly and Beta-strand Pairing Energy Function. J. Korean Phys. Soc. 46, 707–712 (2005)
Google Scholar
Rost, B., Sander, C.: Prediction of Secondary Structure at Better than 70% Accuracy. J. Mol. Biol. 232, 584–599 (1993)
Article Google Scholar
Jones, D.: Protein Secondary Structure Prediction Based on Position-specific Scoring Matrices. J. Mol. Biol. 292, 195–202 (1999)
Article Google Scholar
Ouali, M., King, R.: Cascaded Multiple Classifiers for Secondary Structure Prediction. Protein Science 9, 1162–1176 (1999)
Article Google Scholar
Adamczak, R., Porollo, A., Meller, J.: Combining Prediction of Secondary Structure and Solvent accessibility in proteins. Proteins 59, 467–475 (2005)
Article Google Scholar
Hua, S., Sun, Z.: A Novel Method of Protein Secondary Structure Prediction with High Segment Overlap Measure: Support Vector Machine Approach. J. Mol. Biol. 308, 397–407 (2001)
Article Google Scholar
Kim, K., Park, H.: Protein Secondary Structure Prediction based on improved Support Vector Machines Approach. Protein Eng. 16, 553–560 (2003)
Article Google Scholar
Joo, K., Lee, J., Kim, S.-Y., Kim, I., Lee, S.J., Lee, J.: Profile-based Nearest Neighbor Method for Pattern Recognition. J. Korean Phys. Soc. 44, 599–604 (2004)
Google Scholar
Joo, K., Kim, I., Lee, J., Kim, S.-Y., Lee, S.J., Lee, J.: Prediction of the Secondary Structure of Proteins Using PREDICT, a Nearest Neighbor Method on Pattern Space. J. Korean Phys. Soc. 45, 1441–1449 (2004)
Google Scholar
Pollastri, G., McLysaght, A.: Porter: a new, Accurate Server for Protein Secondary Structure Prediction. Bioinformatics 21, 1719–1720 (2004)
Article Google Scholar
Jiang, F.: Prediction of Protein Secondary Structure with a Reliability Score Estimated by Local Sequence Clustering. Protein Eng. 16, 651–657 (2003)
Article Google Scholar
Salamov, A.A., Solovyev, V.V.: Protein Secondary Structure Prediction Using Local Alignments. J. Mol. Biol. 268, 31–35 (1997)
Article Google Scholar
Kim, H., Park, H.: Prediction of Protein Relative Solvent Accessibility with Support Vector Machines and Long-range Interaction 3D Local Descriptor. Proteins 54, 557–562 (2004)
Article Google Scholar
Kabsch, W., Sander, C.: Dictionary of Protein Secondary Structure: Pattern Recognition of Hydrogen-bonded and Geometrical Features. Biopolymers 22, 2577–2637 (1983)
Article Google Scholar
Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J.: Gapped BLAST and PSI-BLAST: a New Generation of Protein Database Search Programs. Nucleic Acids Res. 25, 3389–3402 (1997)
Article Google Scholar
Keller, J.M., Gray, R., Givens, J.A.: A Fuzzy k-nearest Neighbor Algorithm. IEEE Trans. Systems Man Cybernet. 15, 580–585 (1985)
Google Scholar
Sim, J.H., Kim, S.-Y., Lee, J.: Prediction of Protein Solvent Accessibility Using Fuzzy k-Nearest Neighbor Method. Bioinformatics 21, 2844–2849 (2005)
Article Google Scholar
Brenner, S.E., Koehl, P., Levitt, M.: The ASTRAL Compendium for Protein Structure and Sequence Analysis. Nucleic Acids Res. 28, 254–256 (2000)
Article Google Scholar
Koh, I.Y., Eyrich, V., Marti-Renom, M.A., Przybylski, D., Madhusudhan, M.S., Eswar, N., Grana, O., Pazos, F., Valencia, A., Sali, A., Rost, B.: EVA: Evaluation of Protein Structure Prediction Servers. Nucleic Acids Res. 31, 3311–3315 (2003)
Article Google Scholar
Zemla, A., Venclovas, C., Fidelis, K., Rost, B.: A Modified Definition of Sov, a Segment-Based Measurement for Protein Secondary Structure Prediction Assessment. Proteins 34, 220–223 (1999)
Article Google Scholar
Gorodkin, J.: Comparing two K-category Assignment by a K-category Correlation Coefficient. Comput. Biol. and Chem. 28, 367–374 (2004)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Computer Aided Molecular Design Research Center, Soongsil University, Seoul, 156-743, Korea
Seung-Yeon Kim
School of Dentistry, Seoul National University, Seoul, 110-749, Korea
Jaehyun Sim
Department of Bioinformatics and Life Science, Soongsil University, Seoul, 156-743, Korea
Julian Lee

Authors

Seung-Yeon Kim
View author publications
You can also search for this author in PubMed Google Scholar
Jaehyun Sim
View author publications
You can also search for this author in PubMed Google Scholar
Julian Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei, Anhui, China
De-Shuang Huang
Queen’s University, Belfast, UK
Kang Li & George William Irwin &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, SY., Sim, J., Lee, J. (2006). Fuzzy k-Nearest Neighbor Method for Protein Secondary Structure Prediction and Its Parallel Implementation. In: Huang, DS., Li, K., Irwin, G.W. (eds) Computational Intelligence and Bioinformatics. ICIC 2006. Lecture Notes in Computer Science(), vol 4115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11816102_48

Download citation

DOI: https://doi.org/10.1007/11816102_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37277-6
Online ISBN: 978-3-540-37282-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics