Application of the Multi-modal Relevance Vector Machine to the Problem of Protein Secondary Structure Prediction
The aim of the paper is to experimentally examine the plausibility of Relevance Vector Machines (RVM) for protein secondary structure prediction. We restrict our attention to detecting strands which represent an especially problematic element of the secondary structure. The commonly adopted local principle of secondary structure prediction is applied, which implies comparison of a sliding window in the given polypeptide chain with a number of reference amino-acid sequences cut out of the training proteins as benchmarks representing the classes of secondary structure. As distinct from the classical RVM, the novel version applied in this paper allows for selective combination of several tentative window comparison modalities. Experiments on the RS126 data set have shown its ability to essentially decrease the number of reference fragments in the resulting decision rule and to select a subset of the most appropriate comparison modalities within the given set of the tentative ones.
KeywordsProtein secondary structure prediction machine learning multimodal relational pattern recognition Relevance Vector Machine controlled selectivity of reference objects object-comparison modalities
Unable to display preview. Download preview PDF.
- 1.Branden, C., Tooze, J.: Introduction to Protein Structure, 2nd edn., p. 410. Garland Publishing, Inc., New York (1999)Google Scholar
- 4.Critical Assessment of the Protein Structure Prediction. Protein Structure Prediction Center. Sponsored by the US National Library of Medicine (NIH/NLM), http://predictioncenter.org/http://predictioncenter.org/index.cgi?page=proceedings
- 6.Torshin, I.Y.: Bioinformatics in the Post-Genomic Era: The Role of Biophysics. Nova Biomedical Books, NY (2007) ISBN: 1-60021-048Google Scholar
- 7.Vapnik, V.: Statistical Learning Theory, p. 736. John-Wiley & Sons, Inc. (1998)Google Scholar
- 9.Bishop, C., Tipping, M.: Variational Relevance Vector Machines. In: Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence, pp. 46–53. Morgan Kaufmann (2000)Google Scholar
- 10.Seredin, O., Mottl, V., Tatarchuk, A., Razin, N., Windridge, D.: Convex Support and Relevance Vector Machines for selective multimodal pattern recognition. In: The 21th International Conference on Pattern Recognition, Tsukuba Science City, Japan, November 11-15 (2012)Google Scholar
- 12.Ni, Y., Niranjan, M.: Exploiting long-range dependencies in protein β-sheet secondary structure prediction. In: Proceedings of the 5th IAPR International Conference on Pattern Recognition in Bioinformatics, Nijmegen, The Netherlands, September 22-24, pp. 349–357 (2010)Google Scholar
- 13.Cole, C., Barber, J., Barton, G.: The Jpred 3 secondary structure prediction server. Nucl. Acids Res. 36 (suppl. 2), W197–W201 (2008)Google Scholar
- 16.Dayhoff, M., Schwarts, R., Orcutt, B.: A model of evolutionary change in proteins. Atlas of Protein Sequences and Structures 5(suppl. 3), 345–352 (1978)Google Scholar
- 17.Sulimova, V., Mottl, V., Kulikowski, C., Muchnik, I.: Probabilistic evolutionary model for substitution matrices of PAM and BLOSUM families. DIMACS Technical Report 2008-16, Rutgers University, p. 17 (2008)Google Scholar