Abstract
The field of bioinformatics is growing tremendously at the crossroads of biology, medicine, information science, and computer science. Figures clearly demonstrate that bioinformatics research today is as productive as data mining research as a whole. However, most bioinformatics research deals with tasks of prediction, classification, and tree or network induction from data. Bioinformatics tasks consist mainly of similarity-based sequence search, microarray data analysis, 2D or 3D macromolecule shape prediction, and phylogenetic classification. It is therefore interesting to consider how the methods of bioinformatics can contribute pertinent advances to data mining, and to highlight some examples of how these bioinformatics algorithms can potentially be applied to domains outside biology.
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bichindaritz, I. (2010). Bioinformatics Contributions to Data Mining. In: Perner, P. (ed.) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2010. Lecture Notes in Computer Science, vol 6171. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14400-4_2
DOI: https://doi.org/10.1007/978-3-642-14400-4_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14399-1
Online ISBN: 978-3-642-14400-4
eBook Packages: Computer Science, Computer Science (R0)