Reduction Combination Determination for Efficient Microarray Data Classification with Three Stage Dimensionality Reduction Approach

Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 32)


Classification of microarray data with high dimension and small sample size is a complex task. This work explores the optimal search space appropriate for classification. Here the crush of dimensionality is handled with a three stages dimension reduction technique. At the first stage, statistical measures are used to remove genes that do not contribute for classification. In the second stage, more noisy genes are removed by considering signal to noise ratio (SNR). In the third stage, principal component analysis (PCA) method is used to further reduce the dimension. Further, how much to reduce at each stage is crucial to develop an efficient classifier. Combination of different proportion of reduction at each stage is considered in this study to find appropriate combination for each dataset which maximizes the classifier performance. Help of naïve Bayes classifier is taken here to find appropriate combination of reduction.


Microarray data Data classification Feature selection 


  1. 1.
    Quackenbush, J.: Computational analysis of microarray data. Nat. Rev. Genet. 2(6), 418–427 (2001)CrossRefGoogle Scholar
  2. 2.
    Zhou, X., Tuck, D.P.: MSVM-RFE extensions of SVM-REF for multiclass gene selection on DNA microarray data. Bioinformatics 23(9), 1106–1114 (2007)CrossRefGoogle Scholar
  3. 3.
    Mutch, D.M., Berger, A., Mansourian, R., Rytz, A., Roberts, M.A.: Microarray data analysis. a practical approach for selecting differentially expressed genes. Genome Biol. 2(12) (2001)Google Scholar
  4. 4.
    Chien-Pang, L., Yungho, L.: A novel hybrid feature selection method for microarray data analysis. Appl. Soft Comput. 11, 208–213 (2011)CrossRefGoogle Scholar
  5. 5.
    Dash, R., Misra, B.B., Dehuri, S., Cho, S.B.: Efficient microarray data classification with three stage dimensionality reduction. International Conference on Intelligent Computing, Communication and Devices ICCD (2014)Google Scholar
  6. 6.
    Yan, J., Zhang, B., Ning, L., Shuicheng, Y., Qiansheng, C., Weiguo, F., Qiang, Y., Wensi, X., Zheng, C.: Effective and efficient dimensionality reduction for large-scale and streaming data preprocessing. IEEE Trans. Knowl. Data Eng. 18(3), 320–333 (2006)CrossRefGoogle Scholar
  7. 7.
    Gordon, G.J.: Translation of microarray data into clinically relevant cancer diagnostic tests using gege expression ratios in lung cancer and mesothelioma. Cancer Res. 62, 4963–4967 (2002)Google Scholar
  8. 8.
    Beer, D.G.: Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nat. Med. 8, 816–823 (2002)Google Scholar
  9. 9.
    Pomeroy, S.L.: Prediction of central nervous system embryonal tumour outcome based on gene expression. Nature 415, 436–442 (2002)CrossRefGoogle Scholar
  10. 10.
    Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286(5439), 531–537 (1999)CrossRefGoogle Scholar
  11. 11.
    Alon, U., Barkai, N., Notterman, D.A., Gish, K., Ybarra, S., Mack, D., Levine, A.J.: Broad patterns of gene expression revealed by clustering analysis of tumor and normalcolon tissues probed by oligonucleotide arrays. Proc. Natl. Acad. Sci. USA 96(12), 6745–6750 (1999)CrossRefGoogle Scholar
  12. 12.
    Wigle, D.A.: Molecular profiling of non-small cell lung cancer and correlation with disease-free survival. Cancer Res. 62, 3005–3008 (2002)Google Scholar
  13. 13.
    Alizadeh, A.A.: Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403, 503–511 (2000)CrossRefGoogle Scholar
  14. 14.
    Kent Ridge Bio-medical Dataset. Retrieved 25 Aug 2013, from

Copyright information

© Springer India 2015

Authors and Affiliations

  1. 1.Department of Computer Science and Information Technology, Institute of Technical Education and ResearchSiksha ‘O’ Anusandhan UniversityBhubaneswarIndia
  2. 2.Department of Computer Science and EngineeringSilicon Institute of TechnologyBhubaneswarIndia

Personalised recommendations