Advertisement

Machine Learning in Bioinformatics

  • Supawan Prompramote
  • Yan Chen
  • Yi-Ping Phoebe Chen

Keywords

Neural Network Genetic Algorithm Support Vector Machine Artificial Neural Network Fuzzy Logic 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Adeli, H. (1995) Machine learning: neural networks, genetic algorithms, and fuzzy systems. New York: Wiley.MATHGoogle Scholar
  2. Adriaenssens, V., Baetsb, B.D., Goethalsa, P.L.M. and Pauwa, N.D. (2004) Fuzzy rule-based models for decision support in ecosystem management. Science of The Total Environment, vol. 319, pp 1–12.CrossRefGoogle Scholar
  3. Agatonovic-Kustrin, S., Beresford, R. (2000). Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research. Journal of Pharmaceutical and Biomedical Analysis 22 (5) 717–727.Google Scholar
  4. Azuaje, F. (2003) A computational evolutionary approach to evolving game strategy and cooperation. IEEE Transactions on Systems, Man, and Cybernetics, Part B, vol. 33, pp 498–503.Google Scholar
  5. Baldi, P. and Brunak, S. (2001) “Bioinformatics The Machine Learning Approach”, The MIT Press.Google Scholar
  6. Baldi, P., Brunak, S., Frasconi, P., Pollastri, G. and Soda, G. (2000) Bidirectional IOHMMs and Recurrent Neural Networks for Protein Secondary Structure Prediction. In: Protein Sequence Analysis in the Genomic Era, R. Casadio and L. Masotti, Eds. CLUEB, Bologna, Italy.Google Scholar
  7. Baldi, P. and Brunak, S. (1998) Bioinformatics: the Machine Learning Approach. MIT Press.Google Scholar
  8. Barton, G.J. and Sternberg, M.J.E. (1987) A strategy for the rapid multiple alignment of protein sequences: Confidence levels from tertiary structure comparisons. Journal of Molecular Biology, vol. 198, pp 327–337.CrossRefGoogle Scholar
  9. Baxevanis, A.D., Ouellette, B.F.F. (2001) Bioinformatics: a practical guide to the analysis of genes and proteins, 2nd ed ed. New York: Wiley-Interscience.Google Scholar
  10. Benedetti, G. and Morosetti, S. (1995) A genetic algorithm to search for optimal and suboptimal RNA secondary structures. Biophysical Chemistry, vol. 55, pp 253–259.CrossRefGoogle Scholar
  11. Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J. and Rapp, B.A., Wheeler DL (2002) GenBank. Nucleic Acids Res., vol. 28, pp 15–18.Google Scholar
  12. Bicciato, S., Pandin, M., Didone, G. and Bello, C.D. (2001) Analysis of an Associative Memory Neural Network for Pattern Identification in Gene Expression Data. 1st Workshop on Data Mining in Bioinformatics (in conjunction with 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining), San Francisco, CA, USA.Google Scholar
  13. Boegl, K., Adlassnig, K.P., Hayashi, Y., Rothenfluh, T.E. and Leitich, H. (2004) Knowledge acquisition in the fuzzy knowledge representation framework of a medical consultation system. Artificial Intelligence in Medicine, vol. 30, pp 1–26.CrossRefGoogle Scholar
  14. Bohr, H., Bohr, J., Brunak, S., Cotteril, R.M., Fredholm, H., Lautrup, B. and Peterson, S.B. (1990) A novel approach to prediction of the 3-dimensional structures of protein backbones by neural networks. FEBS Letters, vol. 261, pp 43–46.CrossRefGoogle Scholar
  15. Brown, M.P.S., Grundy, W, N., et al. (2000) Knowledge-based analysis of microarray gene expression data by using support vector machines. Proceedings of the National Academy of Sciences 97(1): 262–267Google Scholar
  16. Brusic, V., Rudy, G., Honeyman, M., Hammer, J. and Harrison, L. (1998a) Prediction of MHC class-II binding peptides using an evolutionary algorithm and artificial neural network. Bioinformatics 14, 121–130.CrossRefGoogle Scholar
  17. Brusic, V., Van Endert, P., Zeleznikow, J., Daniel, S., Hammer, J. and Petrovsky, N. (1998b) A Neural Network Model Approach to the Study of Human TAP Transporter. Silico biology 1, 0010.Google Scholar
  18. Brusic, V. and Zeleznikow, J. (1999) Knowledge discovery and data mining in biological databases. The Knowledge Engineering Review 14, 257–277.CrossRefGoogle Scholar
  19. Chellapilla, K. and Fogel, G.B. (1999) Multiple sequence alignment using evolutionary programming. Congress on Evolutionary Computation, pp 445–452.Google Scholar
  20. Chen, J-H., Le, S-Y. and Maizel, J.V. (2000) Prediction of common secondary structures of RNAs: a genetic algorithm approach. Nucleic Acids Res., vol. 28, pp 991–999.Google Scholar
  21. Coley, D.A. (1999) An introduction to genetic algorithms for scientists and engineers. Singapore: World Scientific.Google Scholar
  22. Cristianini, N. and Shawe-Taylor, J. (2000) An Introduction to Support Vector Machines (and other kernel-based learning methods), Cambridge University Press.Google Scholar
  23. Darwin, C. (1859) On the Origin of Species by means of natural selection: John Murray, London.Google Scholar
  24. Dopazo J and Carazo JM (1997) Phylogenetic reconstruction using a growing neural work that adopts the topology of a phylogenetic tree. Journal of Moleculer Evolution 44: 226–233.Google Scholar
  25. Fairchild, S., Pachter, R. and Perrin, R. (1995) Protein Structure Analysis and Prediction. The Mathematica Journal, vol. 5.Google Scholar
  26. Feng, D.F. and Doolitle, R.F. (1987) Progressive sequence alignment as a prerequisite to correct phylogenetic trees. Journal of Molecular Evolution, vol. 25, pp 351–360.Google Scholar
  27. Finlay, J. and Dix, A. (1996) Introduction to artificial intelligence: Taylor and Francis.Google Scholar
  28. Forrest, S. (1985) Documentation for prisonner dilemna and norms programs that use the genetic algorithm. University of Michigan, Technical report.Google Scholar
  29. Freeman, J.A. and Skapura, D.M. (1991) Neural networks: algorithms, applications, and programming techniques. Reading, Mass: Addison-Wesley.MATHGoogle Scholar
  30. Fritzke, B (1994) Growing cell structures—a self-organizing network for unsupervised and supervised learning. Neural Networks 7: 1141–1160.CrossRefGoogle Scholar
  31. Fu, L. (1999) Knowledge Discovery Based on Neural Networks. Communications of the ACM (CACM), vol. 42, pp 47–50.CrossRefGoogle Scholar
  32. Furey, T.S., Cristianini, N., et al. (2000) Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics 16(10): 906–914.CrossRefGoogle Scholar
  33. Ghanea-Hercock, R. (2003) Applied evolutionary algorithms in Java. New York: Springer.MATHGoogle Scholar
  34. Goldberg, D.E. (1989) Genetic algorithms in search, optimization, and machine learning: Addison-Wesley Pub. Co.Google Scholar
  35. Goldberg, D.E., Deb, K. (1991) A comparison of selection schemes used in genetic algorithms. In: Foundations of Genetic Algorithms, G.J.E. Rawlins, Edn. pp 69–93.Google Scholar
  36. Golub, T.R. (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science, vol. 286, pp 531–537.CrossRefGoogle Scholar
  37. Granzow, M., Berrar, D., et al. (2001) Tumour classification by gene expression profiling: comparison and validation of five clustering methods. ACM SIGBIO Newsletter 21(1): 16–22.Google Scholar
  38. Grefenstette, J.J., Baker, J.E. (1989) How genetic algorithms work: A critical look at implicit parallelism. Presented at Proc. 3rd Intl. Conf. Genetic Algorithms, San Mateo, CA.Google Scholar
  39. Gultyaev, A.P., Batenburg, F.H.D., Pleij, C.W.A. (1995) The Computer Simulation of RNA Folding Pathways using a Genetic Algorithm. Journal of Molecular Biology, vol. 250, pp 37–51.CrossRefGoogle Scholar
  40. Han, J. and Kamber, M. (2001) Data mining: concepts and techniques. San Francisco, Calif.: Morgan Kaufmann Publishers.Google Scholar
  41. Haykin, S. (1994) Neural networks: a comprehensive foundation. Upper Saddle River, N.J.: Prentice Hall.MATHGoogle Scholar
  42. Herrero, J., Valencia, A. and Dopazo, J. (2001) A hierarchical unsupervised growing neural network for clustering gene expression patterns. Bioinformatics, vol. 17, pp 126–136.CrossRefGoogle Scholar
  43. Herrero, J., Valencia, A., et al. (2001) A hierarchical unsupervised growing neural network for clustering gene expression patterns. Bioinformatics 17(2): 126–136.CrossRefGoogle Scholar
  44. Holland, J.H. (1975) Adaptation in Natural and Artificial Systems: University of Michigan Press, Ann Arbor.Google Scholar
  45. Isokawa, M., Wayama, M. and Shimizu, T. (1996) Multiple sequence alignment using a genetic algorithm. Genome Informatics, vol. 7, pp 176–177.Google Scholar
  46. Jaakkola, T., Diekhans, M., et al. (1999) Using the Fisher kernel method to detect remote protein homologies. 7th International Conference on Intelligent Systems for Molecular Biology, Menlo Park, CA.Google Scholar
  47. Jagota, A. (2000) Data analysis and classification for bioinformatics. California: Bay Press.Google Scholar
  48. Kalate, R.N., Tambe, S.S., Kulkarni, B.D. (2003) Artificial neural networks for prediction of mycobacterial promoter sequences. Comput Biol Chem, vol. 27, pp 555–564.CrossRefGoogle Scholar
  49. Kandel, A. (1992) Fuzzy expert systems. Boca Raton, FL: CRC Press.Google Scholar
  50. Keedwell, E.C., Narayanan, A. (2003) Genetic algorithms for gene expression analysis. Presented at Applications of Evolutionary Computation: Proceedings of the 1st European Workshop on Evolutionary Bioinformatics (EvoBIO 2003).Google Scholar
  51. Kohonen, T. (1982) Self-organized formation of topologically correct feature maps. Biological Cybernetics 43: 59–69.CrossRefMATHMathSciNetGoogle Scholar
  52. Kohonen, T. (1990) The self-organizing map. Proceedings of IEEE 78(9): 1464–1480.CrossRefGoogle Scholar
  53. Kruse, R., Gebhardt, J., Palm, R. (1994) Foundations of fuzzy systems. Chichester, West Sussex, England; New York: Wiley & Sons.Google Scholar
  54. Kruse, R., Gebhardt, J., Palm, R. (1994) Fuzzy systems in computer science. Braunschweig: Vieweg.Google Scholar
  55. Kuonen, D. (2003) Challenges in Bioinformatics for Statistical Data Miners. Bulletin of the Swiss Statistical Society, vol. 46, pp 10–17.Google Scholar
  56. Li, L., Weinberg, C.R., Darden, T.A. and Pedersen, L.G. (2001) Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method. Bioinformatics, vol. 17, pp 1131–1142.Google Scholar
  57. Lippman, R.P. (1987) An introduction to computing with neural nets. IEEE Acoustics, Speech, and Signal Processing Magazine 4(2): 4–22.MathSciNetGoogle Scholar
  58. Mamdani, E.H. and Assilian, S. (1975) An Experiment in Linguistic Synthesis with a Fuzzy Logic Controller. International Journal of Man-Machine Studies, vol. 7, pp 1–13.MATHGoogle Scholar
  59. Minsky, M. and Papert, S. (1969) Perceptrons: MIT Press, Cambridge.MATHGoogle Scholar
  60. Müller, B. and Reinhardt, J. (1990) Neural networks: an introduction. Berlin; New York: Springer-Verlag.MATHGoogle Scholar
  61. Narayanan, A., Keedwell, E., et al. (2003a) Artificial Intelligence Techniques for Bioinformatics. Applied Bioinformatics 1(4): 191–222.Google Scholar
  62. Narayanan, A., Keedwell, E., et al. (2003b) Single-Layer Artificial Neural Networks for Expression Analysis. Special Issue on Bioinformatics of Neurocomputing.Google Scholar
  63. Gene Narayanan, A., Keedwell, E.C. and Olsson, B. (2002) Artificial intelligence techniques for bioinformatics. Applied Bioinformatics, vol. 1, pp 191–222.Google Scholar
  64. Negnevitsky, M. (2002) Artificial Intelligence: A Guide to Intelligent Systems. New York: Addison Wesley.Google Scholar
  65. Ng, S-K. and Wong, L. (2004) Accomplishments and Challenges in Bioinformatics. In: IT pro, pp 12–18.Google Scholar
  66. Nguyen, H.D., Yoshihara, I., Yamamori, K. and Yasunage, M. (2002) A Parallel Hybrid Genetic Algorithm for Multiple Protein Sequence Alignment. Presented at Proceedings of the 2002 Congress on Evolutionary Computation CEC2002, Piscataway, NJ.Google Scholar
  67. Nilsson, N.J. (1996) Introduction to Machine learning.Google Scholar
  68. Notredame, C. and Higgins, D.G. (1996) SAGA: Sequence alignment by genetic algorithm. Nucleic Acids Res., vol. 24, pp 1515–1524.CrossRefGoogle Scholar
  69. Ohno-Machado, L. and Vinterbo, S., Weber, G. (2002) Classification of gene expression data using fuzzy logic. Journal of Intelligent and Fuzzy Systems, vol. 12, pp 19–24.MATHGoogle Scholar
  70. Qian, N. and Sejnowski, T.J. (1988) Predicting the secondary structure of globular proteins using neural network models. Journal of Molecular Biology 202: 865–884.CrossRefGoogle Scholar
  71. Qian, N. and Sejnowski, T.J. (1988) Predicting the secondary structure of globular proteins using neural network models. Journal of Molecular Biology, vol. 202, pp 865–884.CrossRefGoogle Scholar
  72. Riis, S.K. and Krogh, A. (1996) Improving prediction of protein secondary structure using structured neural networks and multiple sequence alignments. J Comput Biol., vol. 3, pp 163–183.CrossRefGoogle Scholar
  73. Rosenblatt, F. (1958) The perceptron: a probabilistic model for information storage and organization in the brain. Psychological Review, vol. 65, pp 386–408.MathSciNetGoogle Scholar
  74. Rost, B. and Sander, C. (1993) Prediction of protein secondary structure at better than 70% accuracy. Journal of Molecular Biology, vol. 232, pp 584–599.CrossRefGoogle Scholar
  75. Saritas, I., Allahverdi, N. and Sert, I.U. (2003) A fuzzy expert system design for diagnosis of prostate cancer. Presented at International Conference on Computer Systems and Technologies, Sofia, Bulgaria.Google Scholar
  76. Sarkar, M. and Leong, T-Y. (2003) Characterization of medical time series using fuzzy similarity-based fractal dimensions. Artificial Intelligence in Medicine, vol. 27, pp 201–222.CrossRefGoogle Scholar
  77. Schneider, J., Peltri, G., Bitterlich, N., Neu, K., Velcovsky, H.G., Morr, H., Katz, N. and Eigenbrodt, E. (2003) Fuzzy logic-based tumor marker profiles including a new marker tumor M2-PK improved sensitivity to the detection of progression in lung cancer patients. Anticancer Res., vol. 23, pp 899–906, 2003.Google Scholar
  78. Seker, H., Odetayo, M.O., Petrovic, D. and Naguib, R.N.G. (2003) A fuzzy logic based-method for prognostic decision making in breast and prostate cancers. Information Technology in Biomedicine, IEEE Transactions, vol. 7, pp 114–122.Google Scholar
  79. Shapiro, B.A. and Navetta, J. (1994) A massively parallel genetic algorithm for RNA secondary structure prediction. Journal of Supercomputing, vol. 8, pp 195–207.MATHGoogle Scholar
  80. Shapiro, B.A., Wu, JC., Bengali, D. and Potts, M.J. (2001) The massively parallel genetic algorithm for RNA folding: MIMD implementation and population variation. Bioinformatics, vol. 17, pp. 137–148.CrossRefGoogle Scholar
  81. Spicker, J.S., Wikman, F., Lu, M-L., Cordon-Cardo, C., Workman, C., Rntoft, T.F., Brunak, S. and Knudsen, S. (2002) Neural network predicts sequence of TP53 gene based on DNA chip. Bioinformatics, vol. 18, pp 1133–1134, 2002.CrossRefGoogle Scholar
  82. Stormo, G.D., Schneider, T.D., Gold, L. and Ehrenfeucht, A. (1982) Use of “Perceptron” algorithm to distinguish translatioanl initiation in E.coli. Nucleic Acids Research 10, 2997–3011.Google Scholar
  83. Synder, E.E. and Stormo, G.D. (1993) Identification of coding regions in genomic DNA sequences: an application of dynamic programming and neural networks. Nucleic Acids Res., vol. 21, pp 607–613.Google Scholar
  84. Synder, E.E. and Stormo, G.D. (1997) Identifying genes in genomic DNA sequences. In: DNA and Protein Sequence, M.J. Bishop and C.J. Rawlings, Edn. NewYork: Oxford University Press, pp 209–224.Google Scholar
  85. Tamayo, P., Slonim, D., Mesirov, J., Zhu, Q., Kitareewan, S., Dmitrovsky, E., Lander, E.S. and Golub, T.R. (1999) Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. Presented at Proc Natl Acad Sci, USA.Google Scholar
  86. Thompson, J.D., Higgins, D.G. and Gibson, T.J. (1994) CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res., vol. 22, pp 4673–4680.Google Scholar
  87. Tomida, S., Hanai, T., Honda, H. and Kobayashi, T. (2001) Gene Expression Analysis Using Fuzzy ART. Genome Informatics, vol. 12, pp 245–246.Google Scholar
  88. Torkkola, K., Mike G.R., Kaysser-Kranich, T. and Ma, C. (2001) Self-organizing maps in mining gene expression data. Information Sciences 139, 79–96.CrossRefMATHGoogle Scholar
  89. Toronen, P., Kolehmainen, M., Wong, C. and Castren, E. (1999) Analysis of gene expression data using self-organizing maps. In: FEBS Letters, vol. 451, pp 142–146.Google Scholar
  90. Toronen, P., Kolehmainen, M., Wong, G. and Castrén, E. (1999) Analysis of gene expression data using self-organizing maps. Federation of European Biochemical Societies Letters 451, 142–146.Google Scholar
  91. Uberbacher, E.C. and Mural, R.J. (1991) Locating Protein Coding Regions in Human DNA Sequences Using a Multiple Sensor-Neural Network Approach. Presented at Proc.Natl. Acad. Sci., USA.Google Scholar
  92. Valentini, G. (2002). Gene expression data analysis of human lymphoma using support vector machines and output coding ensembles. Artificial Intelligence in Medicine 26(3): 281–304.CrossRefGoogle Scholar
  93. Vapnik, V. (1996) The nature of statistical learning theory, Springer.Google Scholar
  94. Virant-Klun, I. and Virant, J. (1999) Fuzzy logic alternative for analysis in the biomedical sciences. Comput Biomed Res, vol. 32, pp 305–321.CrossRefGoogle Scholar
  95. Wang, J.T.L., Ma, Q., Shasha, D. and Wu, C.H. (2000) Application of neural networks to biological data mining: a case study in protein sequence classification. 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 20, 305–309.Google Scholar
  96. Wayama, M., Takahashi, K., Shimizu, T. (1995) An approach to amino acid sequence alignment using a genetic algorithm. Genome Informatics, vol. 6, pp 122–123.Google Scholar
  97. Westhead, D.R., Parish, J.H. and Twyman, R.M. (2002) “Bioinformatics”, Instant Notes in Bioinformatics, BIOS Scientific Publishing.Google Scholar
  98. Wong, L. (2000) Kleisli, a Funcitonal Query System. Journal of Functional Programming, vol. 10, pp 19–56.CrossRefGoogle Scholar
  99. Wong, L. (2002) Technologies for Integrating Biological Data. Briefings in Bioinformatics, vol. 3, pp 389–404, 2002.MATHGoogle Scholar
  100. Woolf, P.J. and Wang, Y. (2000) A fuzzy logic approach to analyzing gene expression data. Physiological Genomics, vol. 3, pp 9–15.Google Scholar
  101. Wu, C. (1993) Classification Neural Networks For Rapid Sequence Annotation And Automated Database Organization. Computer & Chemistry, vol. 17, pp 219–227,.Google Scholar
  102. Wu, C. (1996) Gene Classification Artificial Neural System. In: Methods In Enzymology: Computer Methods for Macromolecular Sequence Analysis, vol. 266, R. Doolittle, Ed, pp 71–88.Google Scholar
  103. Wu, C., Berry M, Shivakumar S, McLarty J (1995) Neural Networks For Full-Scale Protein Sequence Classification: Sequence Encoding With Singular Value Decomposition. Machine Learning, vol. 21.Google Scholar
  104. Wu, C. and Shivakumar, S. (1994) Back-Propagation And Counter-Propagation Neural Networks For Phylogenetic Classification Of Ribosomal RNA Sequences. Nucleic Acids Research, vol. 22, pp 4291–4299.Google Scholar
  105. Wu, C., Whitson, G., McLarty, J., Ermongkonchai, A. and Chang, T. (1992) Protein Classification Artificial Neural System. Protein Science, vol. 1, pp 667–677.CrossRefGoogle Scholar
  106. Wu, C.H. (1997) Artificial neural networks for molecular sequence analysis. Computers and Chemistry 21(4): 237–256.CrossRefGoogle Scholar
  107. Wu, C.H., Berry, M.W., et al. (1993) Neural Networks for Molecular Sequence Classification. Proceeding of 1st ISMB.Google Scholar
  108. Wu, C.H., Berry, M, W., et al. (1995) Neural Networks for Full-Scale Protein Sequence Classification: Sequence Encoding with Singular Value Decomposition. Machine Learning 21(1-2): 177–193.CrossRefGoogle Scholar
  109. Wu, C.H., McLarty, J.W. (2000) Neural Networks and Genome Informatics. Meth Comp Biol Biochem 1.Google Scholar
  110. Wu, J.C. and Shapiro, B.A. (1999) A Boltzmann filter improves RNA folding pathway in a massively parallel genetic algorithm. Journal of Biomolecular Structure and Dynamics, vol. 17, pp 581–595.Google Scholar
  111. Zhang, C. and Wong, A.K.C. (1997) A genetic algorithm for multiple molecular sequence alignment. Computer Application for Bioscience, vol. 13, pp 565–581.Google Scholar

Copyright information

© Springer-Verlag Berlin Hiedelberg 2005

Authors and Affiliations

  • Supawan Prompramote
    • 1
  • Yan Chen
    • 1
  • Yi-Ping Phoebe Chen
    • 1
    • 2
  1. 1.School of Information Technology Faculty of Science and TechnologyDeakin UniversityAustralia
  2. 2.ARC Centre in BioinformaticsAustralia

Personalised recommendations