Abstract
Minimal test cost attribute reduction is an important problem in cost-sensitive learning since it reduces the dimensionality of the attributes space. To address this issue, many heuristic algorithms have been used by researchers, however, the effectiveness of these algorithms are often unsatisfactory on large-scale datasets. In this paper, we develop a logarithmic weighted algorithm to tackle the minimal test cost attribute reduction problem. More specifically, two major issues are addressed with regard to the logarithmic weighted algorithm. One relates to a logarithmic strategy that can suggest a way of obtaining the attribute reduction to achieve the best results at the lowest cost. The other relates to the test costs which are normalized to speed up the convergence of the algorithm. Experimental results show that our algorithm attains better cost-minimization performance than the existing a weighted information gain algorithm. Moreover, when the test cost distribution is Normal, the effectiveness of the proposed algorithm is more effective for dealing with relatively medium-sized datasets and large-scale datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Yang, Q., Wu, X.: 10 challenging problems in data mining research. International Journal of Information Technology 5(04), 597–604 (2006)
Min, F., He, H., Qian, Y., Zhu, W.: Test-cost-sensitive attribute reduction. Information Sciences 181(22), 4928–4942 (2011)
Yao, Y.Y., Zhao, Y.: Attribute reduction in decision-theoretic rough set models. Information Sciences 178(17), 3356–3373 (2008)
Min, F., Liu, Q.: A hierarchical model for test-cost-sensitive decision systems. Information Sciences 179, 2442–2452 (2009)
Turney, P.: Cost-sensitive classification: Empirical evaluation of a hybrid genetic decision tree induction algorithm. Journal of Artificial Intelligence Research (JAIR)Â 2 (1995)
Zhou, Z., Liu, X.: Training cost-sensitive neural networks with methods addressing the class imbalance problem. IEEE Transactions on Knowledge and Data Engineering 18(1), 63–77 (2006)
Chai, X., Deng, L., Yang, Q., Ling, C.: Test-cost sensitive naive bayes classification. In: Fourth IEEE International Conference on Data Mining, ICDM 2004, pp. 51–58. IEEE (2004)
Pawlak, Z.: Rough sets. International Journal of Computer and Information Sciences 11, 341–356 (1982)
He, H., Min, F.: Accumulated cost based test-cost-sensitive attribute reduction. In: Kuznetsov, S.O., Ślęzak, D., Hepting, D.H., Mirkin, B.G. (eds.) RSFDGrC 2011. LNCS (LNAI), vol. 6743, pp. 244–247. Springer, Heidelberg (2011)
Zhu, W., Wang, F.: Reduction and axiomization of covering generalized rough sets. Information Sciences 152, 217–230 (2003)
Zhu, W.: Topological approaches to covering rough sets. Information Sciences 177(6), 1499–1508 (2007)
Ziarko, W.: Variable precision rough set model. Journal of Computer and System Sciences 46(1), 39–59 (1993)
Greco, S., Matarazzo, B., Słowiński, R., Stefanowski, J.: Variable consistency model of dominance-based rough sets approach. In: Ziarko, W.P., Yao, Y. (eds.) RSCTC 2000. LNCS (LNAI), vol. 2005, pp. 170–181. Springer, Heidelberg (2001)
Hu, Q., Yu, D., Liu, J., Wu, C.: Neighborhood rough set based heterogeneous feature subset selection. Information Sciences 178(18), 3577–3594 (2008)
Zhao, H., Min, F., Zhu, W.: Cost-sensitive feature selection of numeric data with measurement errors. Journal of Applied Mathematics 2013, 1–13 (2013)
Qian, Y., Liang, J., Pedrycz, W., Dang, C.: Positive approximation: An accelerator for attribute reduction in rough set theory. Artificial Intelligence 174(9), 597–618 (2010)
Min, F., Du, X., Qiu, H., Liu, Q.-H.: Minimal attribute space bias for attribute reduction. In: Yao, J., Lingras, P., Wu, W.-Z., Szczuka, M.S., Cercone, N.J., Ślęzak, D. (eds.) RSKT 2007. LNCS (LNAI), vol. 4481, pp. 379–386. Springer, Heidelberg (2007)
Min, F., Zhu, W.: Attribute reduction with test cost constraint. Journal of Electronic Science and Technology of China 9(2), 97–102 (2011)
Yao, Y., Zhao, Y., Wang, J.: On reduct construction algorithms. In: Wang, G.-Y., Peters, J.F., Skowron, A., Yao, Y. (eds.) RSKT 2006. LNCS (LNAI), vol. 4062, pp. 297–304. Springer, Heidelberg (2006)
Wang, G., Yu, H., Yang, D.: Decision table reduction based on conditional information entropy. Chinese Journal of Computers 2(7), 759–766 (2002)
Min, F., Zhu, W., Zhao, H., Pan, G.: Coser: Cost-senstive rough sets (2011), http://grc.fjzs.edu.cn/~fmin/coser/
Blake, C.L., Merz, C.J.: UCI repository of machine learning databases (1988), http://www.ics.uci.edu/~mlearn/mlrepository/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Niu, J., Zhao, H., Zhu, W. (2014). A Logarithmic Weighted Algorithm for Minimal Test Cost Attribute Reduction. In: Miao, D., Pedrycz, W., Ślȩzak, D., Peters, G., Hu, Q., Wang, R. (eds) Rough Sets and Knowledge Technology. RSKT 2014. Lecture Notes in Computer Science(), vol 8818. Springer, Cham. https://doi.org/10.1007/978-3-319-11740-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-11740-9_13
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11739-3
Online ISBN: 978-3-319-11740-9
eBook Packages: Computer ScienceComputer Science (R0)