Abstract
Microarray gene dataset often contains huge number of attributes many of which are irrelevant and redundant with respect to classification. Presence of such attributes may sometimes reduce the classification accuracy of the dataset. Therefore, the data should be pre-processed to filter out the unimportant attributes before passing them on to the classifier. In the paper, the concepts of Rough Set Theory (RST) and Genetic Algorithm (GA) are used for selecting only the relevant attributes of the dataset. The method constructs relative discernibility matrix to compute the core attributes based on which attributes are encoded to strings used as an initial population for running the genetic algorithm. The method runs each time by adding a single attribute to the initial strings to select only a minimal attribute set known as reduct. The fitness function is defined based on the attribute dependency of the formed rough set. Attribute dependency gives a measure of the degree of influence of the selected attribute subset on the decision. The experimental results show that, the proposed method yields better result than some well-known attribute reduction algorithms for some real-world microarray cancerous datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Garey, M., Johnson, D.: Computers and Intractability- A Guide to the Theory of NP-completeness. Freeman, New York (1979)
Pawlak, Z.: Rough Sets. International Journal of Information and Computer Sciences 11, 341–356 (1982)
Pawlak, Z.: Rough Set Theory and its Applications to Data Analysis. Cybernetics and Systems 29, 661–688 (1998)
Komorowski, J., Pawalk, Z., Polkowski, S.A.: Rough sets: A Tutorial. In: Pal, S.K., Skowron, A. (eds.) Rough Fuzzy Hybridization: A New Trend in Decision-Making, pp. 3–98. Springer, Berlin (1999)
Zhong, N., Dong, J., Ohsuga, S.: Using Rough Sets with Heuristics for Feature Selection. J. Intelligent Information System, 199–214 (2001)
Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning, pp. 432–440. Addison-Wesley (1989)
Beasley, D., Bull, D.R., Martin, R.R.: An Overview of Genetic Algorithms: Part 2 Research Topics. University Computing 15, 170–181 (1993)
Devroye, L., Gyorfi, L., Lugosi, G.: A Probabilistic Theory of Pattern Recognition. Springer, New York (1996)
Pal, S.K., Mitra, S.: Neuro-Fuzzy Pattern Recognition: Methods in Soft Computing. Willey, New York (1999)
Gupta, S.C., Kapoor, V.K.: Fundamental of Mathematical Statistics. Sultan Chand & Sons, A.S. Printing Press, India (1994)
Skowron, A., Rauszer, C.: The Discernibility Matrices and Functions in Information Systems. In: Slowinski, R. (ed.) Intelligent Decision Support. Handbook of Applications and Advances of the Rough Sets Theory. Kluwer, Dordrecht (1992)
Jiang, B., Liang, M., Mei, L.: Attribute Reduction Algorithm based on Discernibility Matrix of Skowron and Itemset Lattice. In: Intl. Conf. AICI (2010)
Yao, Y.Y., Zhao, Y.: Discernibility Matrix Simplification for Constructing at Tribute Reducts. Information Sciences 179(5), 867–882 (2009)
Pappa, G.L., Freitas, A.A., Kaestner, C.A.A.: Attribute Selection with a Multi-objective Genetic Algorithm. In: Bittencourt, G., Ramalho, G.L. (eds.) SBIA 2002. LNCS (LNAI), vol. 2507, pp. 280–290. Springer, Heidelberg (2002)
Shi, H., Fu, J.-Z.: A Heuristic Genetic Algorithm for Attribute Reduction. In: Fifth International Conference on Machine Learning and Cybernetics (2006)
Liu, B.X., Liu, F., Cheng, X.: An Adaptive Genetic Algorithm based on Rough Set Attribute Reduction. In: Third International Conference on Biomedical Engineering and Informatics (2010)
Kerber, R.: ChiMerge: Discretization of Numeric Attributes. In: Proc. of AAAI 1992, Ninth Intl. Conf. Artificial Intelligence, pp. 123–128. AAAI-Press (1992)
WEKA: Machine Learning Software, http://www.cs.waikato.ac.nz/~ml
Hall, M.A.: Correlation-Based Feature Selection for Machine Learning. Ph.D thesis, Dept. of Computer Science, University of Waikato, Hamilton, New Zealand (1998)
Liu, Setiono, R.: A Probabilistic Approach to Feature Selection: A Filter Solution. In: Proceedings of 13th International Conference on Machine Learning, pp. 319–327 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Das, A.K., Chakrabarty, S., Pati, S.K., Sahaji, A.H. (2012). Applying Restrained Genetic Algorithm for Attribute Reduction Using Attribute Dependency and Discernibility Matrix. In: Venugopal, K.R., Patnaik, L.M. (eds) Wireless Networks and Computational Intelligence. ICIP 2012. Communications in Computer and Information Science, vol 292. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31686-9_36
Download citation
DOI: https://doi.org/10.1007/978-3-642-31686-9_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31685-2
Online ISBN: 978-3-642-31686-9
eBook Packages: Computer ScienceComputer Science (R0)