Abstract
The prediction of putative RNA-interacting residues in proteins is an important problem in a field of molecular recognition. We suggest a weighted profile based method for predicting RNA-interacting residues, which utilizes the trained neural network. Most neural networks have a learning rule which allows the network to adjust its connection weights in order to correctly classify the training data. We focus on the network weights that are dependent on the training data set and give evidence of which inputs were more influential in the network. A large set of the network weights trained on sequence profiles is analyzed and qualified. We explore the feasibility of utilizing the qualified information to improve the prediction performance for protein-RNA interaction. Our proposed method shows a considerable improvement, which has been applied to the profiles of the PSI-BLAST alignment. Results for predictions using alternative representations of profile are included for comparison.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allers, J., Shamoo, Y.: Structure-based analysis of protein-RNA interactions using the program ENTANGLE. J. Mol. Biol. 311, 75–86 (2001)
Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J.: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997)
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000)
Chen, Y., Kortemme, T., Robertson, T., Baker, D., Varani, G.: A new hydrogen-bonding potential for the design of protein-RNA interactions predicts specific contacts and discriminates decoys. Nucleic Acids Res. 32, 5147–5172 (2004)
Crooks, G.E., Hon, G., Chandonia, J.-M., Brenner, S.E.: WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004)
Cuff, J.A., Barton, G.J.: Application of multiple sequence alignment profiles to improve protein secondary structure prediction. Proteins 40, 502–511 (2000)
Draper, D.E.: Themes in RNA-protein recognition. J. Mol. Biol. 293, 255–270 (1999)
Eddy, S.R.: Profile hidden Markov models. Bioinformatics 14, 755–763 (1998)
Han, L., Cai, C., Lo, S., Chung, M., Chen, Y.: Prediction of RNA-binding proteins from primary sequence by a support vector machine approach. RNA 10, 355–368 (2004)
Hassibi, B., Stork, D.G.: Second order derivatives for network pruning: Optimal brain surgeon. Advances in Neural Information Processing Systems 5, 164–172 (1993)
Henikoff, S., Henikoff, J.G.: Protein family classification based on searching a database of blocks. Genomics 19, 97–107 (1994)
Henikoff, S., Henikoff, J.G.: Using substitution probabilities to improve position-specific scoring matrices. CABIOS 12, 135–143 (1996)
Jeong, E., Chung, I., Miyano, S.: A neural network method for identification of RNA-interacting residues in protein. Genome Informatics 15, 105–116 (2004)
Jones, D.T.: Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 292, 195–202 (1999)
Jones, S., Daley, D.T.A., Luscombe, N.M., Berman, H.M., Thornton, J.M.: Protein-RNA interactions: a structural analysis. Nucleic Acids Res. 29(4), 943–954 (2001)
Kaur, H., Raghava, G.: A neural network method for prediction of β-turn types in proteins using evolutionary information. Bioinformatics 20, 2751–2758 (2004)
Kim, H., Jeong, E., Lee, S.W., Han, K.: Computational analysis of hydrogen bonds in protein-RNA complexes for interaction patterns. FEBS Lett. 552, 231–239 (2003)
Kloczkowski, A., Ting, K.L., Jernigan, R.L., Garnier, G.: Combining the GOR V algorithm with evolutionary information for protein secondary structure prediction from amino acid sequence. Proteins 49, 154–166 (2000)
Krogh, A., Brown, M., Mian, I.S., Sjölander, K., Haussler, D.: Hidden Markov models in computational biology. J. Mol. Biol. 235, 1501–1531 (1994)
Le Cun, Y., Denker, J.S., Solla, S.A.: Optimal brain damage. Advances in Neural Information Processing Systems 2, 598–605 (1990)
Mitchison, G., Durbin, R.: Tree-based maximal likelihood substitution matrices and hidden Markov models. J. Mol. Evol. 41, 1139–1151 (1995)
Mount, W.: Bioinformatics: sequence and genome analysis. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (2001)
Rost, B., Sander, C.: Combining evolutionary information and neural networks to predict protein secondary structure. Proteins 19, 55–72 (1994)
Rost, B., Sander, C.: Conservation and prediction of solvent accessibility in protein families. Proteins 20, 216–226 (1994)
Sayle, R., Milner-White, E.J.: RasMol: Biomolecular graphics for all. Trends Biochem. Sci. 20, 374–376 (1995)
Schneider, T.D., Stephens, R.M.: Sequence logos: a new way to display consensus sequence. Nucleic Acids Res. 18, 6097–6100 (1990)
Sharp, K.A., Honig, B., Harvey, S.C.: Electrical potential of transfer RNAs: codon-anticodon recognition. Biochemistry 29, 340–346 (1990)
Sjölander, K., Karplus, K., Brown, M., Hughey, R., Krogh, A., Mian, I.S., Haussler, D.: Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology. Comput Appl Biosci. 12, 327–345 (1996)
Thompson, J.D., Higgins, D.G., Gibson, T.J.: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 4673–4680 (1994)
Zell, A., Mamier, G.: Stuttgart neural network simulator version 4.2 (1997)
Zhou, H.X., Shan, Y.: Prediction of protein interaction sites from sequence profile and residue neighbor list. Proteins 44, 336–343 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jeong, E., Miyano, S. (2006). A Weighted Profile Based Method for Protein-RNA Interacting Residue Prediction. In: Priami, C., Cardelli, L., Emmott, S. (eds) Transactions on Computational Systems Biology IV. Lecture Notes in Computer Science(), vol 3939. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11732488_11
Download citation
DOI: https://doi.org/10.1007/11732488_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33245-9
Online ISBN: 978-3-540-33248-0
eBook Packages: Computer ScienceComputer Science (R0)