Genetic Programming Based on Granular Computing for Classification with High-Dimensional Data
Classification tasks become more challenging under the curse of dimensionality. Recently, the number of datasets with thousands of features has been increasing. Some classification algorithms need feature selection to avoid the curse of dimensionality. Genetic programming (GP) has shown success in classification tasks, and it does not require a separate feature selection step because of its built-in capability to automatically select informative features. However, GP-based methods are often computationally intensive when a good classification accuracy is required. Drawing on perspectives from granular computing (GrC), this paper proposes a new approach to linking features hierarchically for GP-based classification. Experiments on seven high-dimensional datasets show that, compared to baseline methods, the proposed algorithm saves training time and improves classification accuracy.
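The built-in feature selection mentioned above can be illustrated with a minimal sketch. It is not the algorithm proposed in the paper: the tuple-based tree encoding, the function set, and the names `random_tree`, `used_features` and `classify` are all hypothetical, and thresholding the tree output at zero is only one common convention for binary GP classifiers. The point is that a GP tree references only the features that appear at its leaves, so a small tree over thousands of features implicitly selects a small subset.

```python
import random

# Hypothetical toy encoding (an assumption, not the paper's representation):
# a tree is either a terminal ("x", feature_index) or a node (op, left, right).
OPS = {
    "+": lambda a, b: a + b,
    "-": lambda a, b: a - b,
    "*": lambda a, b: a * b,
}

def random_tree(n_features, depth, rng):
    """Grow a random expression tree whose leaves are feature indices."""
    if depth == 0 or rng.random() < 0.3:
        return ("x", rng.randrange(n_features))
    op = rng.choice(sorted(OPS))
    left = random_tree(n_features, depth - 1, rng)
    right = random_tree(n_features, depth - 1, rng)
    return (op, left, right)

def evaluate(tree, sample):
    """Compute the numeric output of the tree for one sample."""
    if tree[0] == "x":
        return sample[tree[1]]
    op, left, right = tree
    return OPS[op](evaluate(left, sample), evaluate(right, sample))

def used_features(tree):
    """Return the set of feature indices the tree actually references."""
    if tree[0] == "x":
        return {tree[1]}
    return used_features(tree[1]) | used_features(tree[2])

def classify(tree, sample):
    """Assumed convention: class 1 if the output is positive, else class 0."""
    return 1 if evaluate(tree, sample) > 0 else 0

# A hand-written tree over a large feature space: (x3 * x10) - x3.
example_tree = ("-", ("*", ("x", 3), ("x", 10)), ("x", 3))
example_selected = used_features(example_tree)

# A random tree of depth at most 4 has at most 2**4 = 16 leaves,
# so it can reference at most 16 of the 5000 available features.
rng = random.Random(0)
tree = random_tree(n_features=5000, depth=4, rng=rng)
selected = used_features(tree)
```

In an evolutionary run, selection pressure on classification accuracy would favour trees whose leaves are informative features, which is where the implicit selection comes from; the cost of evaluating many such trees on every training sample is also why GP-based methods tend to be computationally intensive.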
Keywords: High-dimensional data, Genetic programming, Granular computing, Classification
This work was supported in part by the Marsden Fund of New Zealand Government under Contracts VUW1209, VUW1509 and VUW1615, Huawei Industry Fund E2880/3663, Natural Science Foundation of Jiangsu, China BK20161406, and the University Research Fund at Victoria University of Wellington 209862/3580, and 213150/3662.