Abstract
During the last few years there has been a growing need for using computational intelligence techniques to analyze microarray data. The aim of the system presented in this study is to provide innovative decision support techniques for classifying data from microarrays and for extracting knowledge about the classification process. The computational intelligence techniques used in this chapter follow the case-based reasoning paradigm to emulate the steps followed in expression analysis. This work presents a novel filtering technique based on statistical methods, a new clustering technique that uses ESOINN (Enhanced Self-Organizing Incremental Neuronal Network), and a knowledge extraction technique based on the RIPPER algorithm. The system presented within this chapter has been applied to classify CLL patients and extract knowledge about the classification process. The results obtained permit us to conclude that the system provides a notable reduction of the dimensionality of the data obtained from microarrays. Moreover, the classification process takes the detection of relevant and irrelevant probes into account, which is fundamental for subsequent classification and an extraction of knowledge tool with a graphical interface to explain the classification process, and has been much appreciated by the human experts. Finally, the philosophy of the CBR systems facilitates the resolution of new problems using past experiences, which is very appropriate regarding the classification of leukemia.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Tu, Y.J., Zhou, W., Piramuthu, S.: Identifying RFID-embedded objects in pervasive healthcare applications. Decision Support Systems 46(2), 586–593 (2008)
Chakraborty, D., Takahashi, H., Suganuma, T., Takeda, A., Kitagata, G., Hashimoto, K., Shiratori, N.: Context-aware remote healthcare support system based on overlay network. WSEAS Transactions on Computers 7(9), 1505–1514 (2008)
Lina, S., Chien, F.: Cluster analysis of genome-wide expression data for feature extraction. Expert Systems with Applications 36(2-2), 3327–3335 (2009)
Stadlera, Z.K., Come, S.E.: Review of gene-expression profiling and its clinical use in breast cancer. Critical Reviews in Oncology/Hematology 69(1), 1–11 (2009)
Affymetrix. GeneChip® Human Genome U133 Arrays, http://www.affymetrix.com/support/technical/datasheets/hgu133arrays_datasheet.pdf
Sawa, T., Ohno-Machado, L.: A neural network based similarity index for clustering DNA microarray data. Computers in Biology and Medicine 33(1), 1–15 (2003)
Bianchia, D., Calogero, R., Tirozzi, B.: Kohonen neural networks and genetic classification. Mathematical and Computer Modelling 45(1-2), 34–60 (2007)
Baladandayuthapani, V., Ray, S., Mallick, B.K.: Bayesian Methods for DNA Microarray Data Analysis. Handbook of Statistics 25(1), 713–742 (2005)
Avogadri, R., Valentini, G.: Fuzzy ensemble clustering based on random projections for DNA microarray data analysis. Artificial Intelligence in Medicine 45(2-3), 173–183 (2009)
Kolodner, J.: Case-Based Reasoning. Morgan Kaufmann, San Francisco (1993)
Riverola, F., Díaz, F., Corchado, J.M.: Gene-CBR: a case-based reasoning tool for cancer diagnosis using microarray datasets. Computational Intelligence 22(3-4), 254–268 (2006)
Corchado, J.M., De Paz, J.F., Rodríguez, S., Bajo, J.: Model of Experts for decision support in the diagnosis of leukemia patients. Artificial Intelligence in Medicine 46, 179–200 (2009)
Bichindaritz, I.: Role and Significance of Case-based Reasoning in the Health Sciences. KI 23(1), 12–17 (2009)
Bichindaritz, I., Marling, C.: Case-based reasoning in the health sciences: What’s next? Artificial Intelligence in Medicine 36(2), 127–135 (2006)
Furao, S., Ogura, T., Hasegawa, O.: An enhanced self-organizing incremental neural network for online unsupervised learning. Neural Networks 20(8), 893–903 (2007)
Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York (1990)
Saravanan, N., Cholairajana, S., Ramachandran, K.I.: Vibration-based fault diagnosis of spur bevel gear box using fuzzy technique. Expert Systems with Applications 36(2-2), 3119–3135 (2009)
Borg, I., Groenen, P.: Modern multidimensional scaling theory and applications. Springer, Heidelberg (1997)
Kruskal, J.B.: Multidimensional scaling by optimizing goodness of fit to nonmetric hypothesis. Psychometrika 29(1), 1–27 (1964)
Ture, M., Tokatli, F., Kurt, I.: Using Kaplan–Meier analysis together with decision tree methods (C&RT, CHAID, QUEST, C4.5 and ID3) in determining recurrence-free survival of breast cancer patients. Expert Systems with Applications 36(2), 2017–2026 (2009)
Quackenbush, J.: Computational analysis of microarray data. Nature Review Genetics 2(6), 418–427 (2001)
Lipshutz, R.J., Fodor, S.P.A., Gingeras, T.R., Lockhart, D.H.: High density synthetic oligonucleotide arrays. Nature Genetics 21(1), 20–24 (1999)
Taniguchi, M., Guan, L.L., Basarab, J.A., Dodson, M.V., Moore, S.S.: Comparative analysis on gene expression profiles in cattle subcutaneous fat tissues. Comparative Biochemistry and Physiology Part D: Genomics and Proteomics 3(4), 251–256
Avogadri, R., Valentini, G.: Fuzzy ensemble clustering based on random projections for DNA microarray data analysis. Artificial Intelligence in Medicine 45(2-3), 173–183 (2009)
Margalit, O., Somech, R., Amariglio, N., Rechav, G.: Microarray based gene expression profiling of hematologic malignancies: basic concepts and clinical applications. Blood Reviews 4(4), 223–234
Armstrong, N.J., Van de Wiel, M.A.: Microarray data analysis: From hypotheses to conclusions using gene expression data. Cellular Oncology 26(5-6), 279–290 (2004)
Jurisica, I., Glasgow, J.: Applications of case-based reasoning in molecular biology. Artificial Intelligence Magazine, Special issue on Bioinformatics 25(1), 85–95 (2004)
Aaronson, J.S., Juergen, H., Overton, G.C.: Knowledge Discovery in GENBANK. In: Proceedings of the First International Conference on Intelligent Systems for Molecular Biology, pp. 3–11 (1993)
Arshadi, N., Jurisica, I.: Data Mining for Case-Based Reasoning in High-Dimensional Biological Domains. IEEE Transactions on Knowledge and Data Engineering 17(8), 1127–1137 (2005)
Affymetrix. Statistical Algorithms Description Document, http://www.affymetrix.com/support/technical/whitepapers/sadd_whitepaper.pdf
Affymetrix. Guide to Probe Logarithmic Intensity Error (PLIER) Estimation, http://www.affymetrix.com/support/technical/technotes/plier_technote.pdf
Irizarry, R.A., Hobbs, B., Collin, F., Beazer-Barclay, Y.D., Antonellis, K.J.: Exploration, Normalization, and Summaries of High density Oligonucleotide Array Probe Level Data. Biostatistics 4, 249–264 (2003)
Brunelli, R.: Histogram Analysis for Image Retrieval. Pattern Recognition 34, 1625–1637 (2001)
Jurečkováa, J., Picek, J.: Shapiro–Wilk type test of normality under nuisance regression and scale. Computational Statistics & Data Analysis 51(10), 5184–5191 (2007)
Saitou, N., Nie, M.: The neighbor-joining method: A new method for reconstructing phylogenetic trees. Molecular Biology and Evolution 4, 406–425 (1987)
Kohonen, T.: Self-organized formation of topologically correct feature maps. Biological Cybernetics, 59–69 (1982)
Fritzke, B.: A growing neural gas network learns topologies. In: Advances in Neural Information Processing Systems, vol. 7, pp. 625–632 (1995)
Shen, F.: An algorithm for incremental unsupervised learning and topology representation, Tokyo: Ph.D. thesis. Tokyo Institute of Technology (2006)
Redmond, S.J., Heneghan, C.: A method for initialising the K-means clustering algorithm using kd-trees. Pattern Recognition Letters 28(8), 965–973 (2007)
Martinetz, T.: Competitive Hebbian learning rule forms perfectly topology preserving maps. In: ICANN 1993: International Conference on Artificial Neural Networks, pp. 427–434 (1993)
Guinn, B., Gilkes, A.F., Woodward, E., Westwood, N.B., Muftia, G.J., Linchc, D., Burnett, A.K., Mills, K.I.: Microarray analysis of tumour antigen expression in presentation acute myeloid leukaemia. Biochemical and Biophysical Research Communication 333(5), 703–713 (2005)
Corchado, J.M., Bajo, J., De Paz, Y., De Paz, J.F.: Integrating Case Planning and RPTW Neuronal Networks to Construct an Intelligent Environment for Health Care. Expert Systems with Applications 36(3), 5844–5858 (2009)
Holte, R.C.: Very simple classification rules perform well on most commonly used datasets, Machine Learning (1993)
Frank, E., Witten, I.H.: Generating accurate rule sets without global optimization, pp. 144–151. Morgan Kaufmann, San Francisco (1998)
Vogiatzis, D., Tsapatsoulis, N.: Active learning for microarray data. International Journal of Approximate Reasoning 47(1), 85–96 (2008)
Yang, T.Y.: Efficient multi-class cancer diagnosis algorithm, using a global similarity pattern. Computational Statistics & Data Analysis 53(3), 756–765 (2009)
Leng, C.: Sparse optimal scoring for multiclass cancer diagnosis and biomarker detection using microarray data. Computational Biology and Chemisty 32(6), 417–425 (2008)
Foon, K.A., Rai, K.L., Gale, R.P.: Chronic lymphocytic leukemia: new insights into biology and therapy. Annals of Internal Medicine 113(7), 525–539 (1990)
Chronic Lymphocytic Leukemia (2008), The leukemia and lymphoma society, http://www.leukemia-lymphoma.org/all_page.adp?item_id=7059
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
De Paz, J.F., Bajo, J., Rodríguez, S., Corchado, J.M. (2010). Computational Intelligence Techniques for Classification in Microarray Analysis. In: Bichindaritz, I., Vaidya, S., Jain, A., Jain, L.C. (eds) Computational Intelligence in Healthcare 4. Studies in Computational Intelligence, vol 309. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14464-6_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-14464-6_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14463-9
Online ISBN: 978-3-642-14464-6
eBook Packages: EngineeringEngineering (R0)