Skip to main content

Computational Intelligence Techniques for Classification in Microarray Analysis

  • Chapter
Computational Intelligence in Healthcare 4

Part of the book series: Studies in Computational Intelligence ((SCI,volume 309))

Abstract

During the last few years there has been a growing need for using computational intelligence techniques to analyze microarray data. The aim of the system presented in this study is to provide innovative decision support techniques for classifying data from microarrays and for extracting knowledge about the classification process. The computational intelligence techniques used in this chapter follow the case-based reasoning paradigm to emulate the steps followed in expression analysis. This work presents a novel filtering technique based on statistical methods, a new clustering technique that uses ESOINN (Enhanced Self-Organizing Incremental Neuronal Network), and a knowledge extraction technique based on the RIPPER algorithm. The system presented within this chapter has been applied to classify CLL patients and extract knowledge about the classification process. The results obtained permit us to conclude that the system provides a notable reduction of the dimensionality of the data obtained from microarrays. Moreover, the classification process takes the detection of relevant and irrelevant probes into account, which is fundamental for subsequent classification and an extraction of knowledge tool with a graphical interface to explain the classification process, and has been much appreciated by the human experts. Finally, the philosophy of the CBR systems facilitates the resolution of new problems using past experiences, which is very appropriate regarding the classification of leukemia.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Tu, Y.J., Zhou, W., Piramuthu, S.: Identifying RFID-embedded objects in pervasive healthcare applications. Decision Support Systems 46(2), 586–593 (2008)

    Article  Google Scholar 

  2. Chakraborty, D., Takahashi, H., Suganuma, T., Takeda, A., Kitagata, G., Hashimoto, K., Shiratori, N.: Context-aware remote healthcare support system based on overlay network. WSEAS Transactions on Computers 7(9), 1505–1514 (2008)

    Google Scholar 

  3. Lina, S., Chien, F.: Cluster analysis of genome-wide expression data for feature extraction. Expert Systems with Applications 36(2-2), 3327–3335 (2009)

    Article  Google Scholar 

  4. Stadlera, Z.K., Come, S.E.: Review of gene-expression profiling and its clinical use in breast cancer. Critical Reviews in Oncology/Hematology 69(1), 1–11 (2009)

    Article  Google Scholar 

  5. Affymetrix. GeneChip® Human Genome U133 Arrays, http://www.affymetrix.com/support/technical/datasheets/hgu133arrays_datasheet.pdf

  6. Sawa, T., Ohno-Machado, L.: A neural network based similarity index for clustering DNA microarray data. Computers in Biology and Medicine 33(1), 1–15 (2003)

    Article  Google Scholar 

  7. Bianchia, D., Calogero, R., Tirozzi, B.: Kohonen neural networks and genetic classification. Mathematical and Computer Modelling 45(1-2), 34–60 (2007)

    Article  MathSciNet  Google Scholar 

  8. Baladandayuthapani, V., Ray, S., Mallick, B.K.: Bayesian Methods for DNA Microarray Data Analysis. Handbook of Statistics 25(1), 713–742 (2005)

    Article  MathSciNet  Google Scholar 

  9. Avogadri, R., Valentini, G.: Fuzzy ensemble clustering based on random projections for DNA microarray data analysis. Artificial Intelligence in Medicine 45(2-3), 173–183 (2009)

    Article  Google Scholar 

  10. Kolodner, J.: Case-Based Reasoning. Morgan Kaufmann, San Francisco (1993)

    Google Scholar 

  11. Riverola, F., Díaz, F., Corchado, J.M.: Gene-CBR: a case-based reasoning tool for cancer diagnosis using microarray datasets. Computational Intelligence 22(3-4), 254–268 (2006)

    Article  MathSciNet  Google Scholar 

  12. Corchado, J.M., De Paz, J.F., Rodríguez, S., Bajo, J.: Model of Experts for decision support in the diagnosis of leukemia patients. Artificial Intelligence in Medicine 46, 179–200 (2009)

    Article  Google Scholar 

  13. Bichindaritz, I.: Role and Significance of Case-based Reasoning in the Health Sciences. KI 23(1), 12–17 (2009)

    Google Scholar 

  14. Bichindaritz, I., Marling, C.: Case-based reasoning in the health sciences: What’s next? Artificial Intelligence in Medicine 36(2), 127–135 (2006)

    Article  Google Scholar 

  15. Furao, S., Ogura, T., Hasegawa, O.: An enhanced self-organizing incremental neural network for online unsupervised learning. Neural Networks 20(8), 893–903 (2007)

    Article  MATH  Google Scholar 

  16. Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York (1990)

    Google Scholar 

  17. Saravanan, N., Cholairajana, S., Ramachandran, K.I.: Vibration-based fault diagnosis of spur bevel gear box using fuzzy technique. Expert Systems with Applications 36(2-2), 3119–3135 (2009)

    Article  Google Scholar 

  18. Borg, I., Groenen, P.: Modern multidimensional scaling theory and applications. Springer, Heidelberg (1997)

    MATH  Google Scholar 

  19. Kruskal, J.B.: Multidimensional scaling by optimizing goodness of fit to nonmetric hypothesis. Psychometrika 29(1), 1–27 (1964)

    Article  MATH  MathSciNet  Google Scholar 

  20. Ture, M., Tokatli, F., Kurt, I.: Using Kaplan–Meier analysis together with decision tree methods (C&RT, CHAID, QUEST, C4.5 and ID3) in determining recurrence-free survival of breast cancer patients. Expert Systems with Applications 36(2), 2017–2026 (2009)

    Article  Google Scholar 

  21. Quackenbush, J.: Computational analysis of microarray data. Nature Review Genetics 2(6), 418–427 (2001)

    Article  Google Scholar 

  22. Lipshutz, R.J., Fodor, S.P.A., Gingeras, T.R., Lockhart, D.H.: High density synthetic oligonucleotide arrays. Nature Genetics 21(1), 20–24 (1999)

    Article  Google Scholar 

  23. Taniguchi, M., Guan, L.L., Basarab, J.A., Dodson, M.V., Moore, S.S.: Comparative analysis on gene expression profiles in cattle subcutaneous fat tissues. Comparative Biochemistry and Physiology Part D: Genomics and Proteomics 3(4), 251–256

    Google Scholar 

  24. Avogadri, R., Valentini, G.: Fuzzy ensemble clustering based on random projections for DNA microarray data analysis. Artificial Intelligence in Medicine 45(2-3), 173–183 (2009)

    Article  Google Scholar 

  25. Margalit, O., Somech, R., Amariglio, N., Rechav, G.: Microarray based gene expression profiling of hematologic malignancies: basic concepts and clinical applications. Blood Reviews 4(4), 223–234

    Google Scholar 

  26. Armstrong, N.J., Van de Wiel, M.A.: Microarray data analysis: From hypotheses to conclusions using gene expression data. Cellular Oncology 26(5-6), 279–290 (2004)

    Google Scholar 

  27. Jurisica, I., Glasgow, J.: Applications of case-based reasoning in molecular biology. Artificial Intelligence Magazine, Special issue on Bioinformatics 25(1), 85–95 (2004)

    Google Scholar 

  28. Aaronson, J.S., Juergen, H., Overton, G.C.: Knowledge Discovery in GENBANK. In: Proceedings of the First International Conference on Intelligent Systems for Molecular Biology, pp. 3–11 (1993)

    Google Scholar 

  29. Arshadi, N., Jurisica, I.: Data Mining for Case-Based Reasoning in High-Dimensional Biological Domains. IEEE Transactions on Knowledge and Data Engineering 17(8), 1127–1137 (2005)

    Article  Google Scholar 

  30. Affymetrix. Statistical Algorithms Description Document, http://www.affymetrix.com/support/technical/whitepapers/sadd_whitepaper.pdf

  31. Affymetrix. Guide to Probe Logarithmic Intensity Error (PLIER) Estimation, http://www.affymetrix.com/support/technical/technotes/plier_technote.pdf

  32. Irizarry, R.A., Hobbs, B., Collin, F., Beazer-Barclay, Y.D., Antonellis, K.J.: Exploration, Normalization, and Summaries of High density Oligonucleotide Array Probe Level Data. Biostatistics 4, 249–264 (2003)

    Article  MATH  Google Scholar 

  33. Brunelli, R.: Histogram Analysis for Image Retrieval. Pattern Recognition 34, 1625–1637 (2001)

    Article  MATH  Google Scholar 

  34. Jurečkováa, J., Picek, J.: Shapiro–Wilk type test of normality under nuisance regression and scale. Computational Statistics & Data Analysis 51(10), 5184–5191 (2007)

    Article  MathSciNet  Google Scholar 

  35. Saitou, N., Nie, M.: The neighbor-joining method: A new method for reconstructing phylogenetic trees. Molecular Biology and Evolution 4, 406–425 (1987)

    Google Scholar 

  36. Kohonen, T.: Self-organized formation of topologically correct feature maps. Biological Cybernetics, 59–69 (1982)

    Google Scholar 

  37. Fritzke, B.: A growing neural gas network learns topologies. In: Advances in Neural Information Processing Systems, vol. 7, pp. 625–632 (1995)

    Google Scholar 

  38. Shen, F.: An algorithm for incremental unsupervised learning and topology representation, Tokyo: Ph.D. thesis. Tokyo Institute of Technology (2006)

    Google Scholar 

  39. Redmond, S.J., Heneghan, C.: A method for initialising the K-means clustering algorithm using kd-trees. Pattern Recognition Letters 28(8), 965–973 (2007)

    Article  Google Scholar 

  40. Martinetz, T.: Competitive Hebbian learning rule forms perfectly topology preserving maps. In: ICANN 1993: International Conference on Artificial Neural Networks, pp. 427–434 (1993)

    Google Scholar 

  41. Guinn, B., Gilkes, A.F., Woodward, E., Westwood, N.B., Muftia, G.J., Linchc, D., Burnett, A.K., Mills, K.I.: Microarray analysis of tumour antigen expression in presentation acute myeloid leukaemia. Biochemical and Biophysical Research Communication 333(5), 703–713 (2005)

    Article  Google Scholar 

  42. Corchado, J.M., Bajo, J., De Paz, Y., De Paz, J.F.: Integrating Case Planning and RPTW Neuronal Networks to Construct an Intelligent Environment for Health Care. Expert Systems with Applications 36(3), 5844–5858 (2009)

    Article  Google Scholar 

  43. Holte, R.C.: Very simple classification rules perform well on most commonly used datasets, Machine Learning (1993)

    Google Scholar 

  44. Frank, E., Witten, I.H.: Generating accurate rule sets without global optimization, pp. 144–151. Morgan Kaufmann, San Francisco (1998)

    Google Scholar 

  45. Vogiatzis, D., Tsapatsoulis, N.: Active learning for microarray data. International Journal of Approximate Reasoning 47(1), 85–96 (2008)

    Article  MATH  Google Scholar 

  46. Yang, T.Y.: Efficient multi-class cancer diagnosis algorithm, using a global similarity pattern. Computational Statistics & Data Analysis 53(3), 756–765 (2009)

    Article  MATH  Google Scholar 

  47. Leng, C.: Sparse optimal scoring for multiclass cancer diagnosis and biomarker detection using microarray data. Computational Biology and Chemisty 32(6), 417–425 (2008)

    Article  MATH  MathSciNet  Google Scholar 

  48. Foon, K.A., Rai, K.L., Gale, R.P.: Chronic lymphocytic leukemia: new insights into biology and therapy. Annals of Internal Medicine 113(7), 525–539 (1990)

    Google Scholar 

  49. Chronic Lymphocytic Leukemia (2008), The leukemia and lymphoma society, http://www.leukemia-lymphoma.org/all_page.adp?item_id=7059

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

De Paz, J.F., Bajo, J., Rodríguez, S., Corchado, J.M. (2010). Computational Intelligence Techniques for Classification in Microarray Analysis. In: Bichindaritz, I., Vaidya, S., Jain, A., Jain, L.C. (eds) Computational Intelligence in Healthcare 4. Studies in Computational Intelligence, vol 309. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14464-6_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-14464-6_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-14463-9

  • Online ISBN: 978-3-642-14464-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics