Abstract
Data Mining is the process of examining huge pre-existing databases in order to produce new information. Decision Tree is a very popular and practical approach in Data Mining. Decision Trees plays a crucial role in medical field. Ensemble methods create multiple models and produce more accurate solutions than a single model. Feature Selection improves the standard of the data by removing irrelevant attributes, due to that accuracy will increase. In this study experiments are conducted on a Hybrid Method i.e., C4.5 Decision Tree with MultiBoostAB Ensemble technique with different Feature Selection Techniques on Tumor datasets for finding out accuracy, execution time to build a decision tree and size of the tree.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
J. Han, M. Kamber, Data Mining; Concepts and Techniques (Morgan Kaufmann Publishers, 2001)
M. Venkatadri. C. Lokanatha Reddy, A review on data mining from past to the future. Int. J. Comput. Appl. 15(7), 19–22 (2011). ISSN: 0975-8887
Z.H. Zhou, Y. Jiang, Medical diagnosis with C4.5 rule proceeded by artificial neural network ensemble. IEEE Trans. Inf. Technol. Biomed. 7(1), 37–42 (2003)
M. Lundin, J. Lundin, H.B. Burke, S. Toikkanen, L. Pylkkanen, H. Joensuu, Artificial neural networks applied to survival prediction in breastcancer. Oncology 57, 281–286 (1999)
D. Delen, G. Walker, A. Kadam, Predicting breast cancer survivability: a comparison of three data mining methods. Artif. Intell. Med. 34(2), 113–27 (2005)
M. Venkatadri, C. Lokanatha Reddy, A comparative study on decision tree classification algorithms in data mining. IJCAETS 2(2), 24–29 (2010). ISSN: 0974-3596
R. Polikar, Ensemble based systems in decision making. IEEE Circ. Syst. Mag. 6(3), 21–45 (2006)
K. Kourou, T.P. Exarchos, K.P. Exarchos, M.V. Karamouzis, D.I. Fotiadis, Machine learning application in cancer prognosis and prediction. Comput. Struct. Biotech. J. 13, 8–17 (2015)
G. Sujatha, K. Usha Rani, Advanced ensemble technique on decision tree classifiers—an experimental study. Special Issue Comput. Sci., Math. Biol. IJCSME-SCSMB 264–268 (2016)
G. Sujatha, K. Usha Rani, Ensemble decision tree classifier performance with varying committee sizes. Int. J. Comput. Eng. Technol. 9(1), 96–101 (2018)
W. Awada, R. Wald, A. Naplolitano, A review of the stability of feature selection techniques for bio informatics data, in IEEE International Conference on Information Reuse and Integration (IEEE, 2012), pp. 356–363
B. Pes, Feature selection for high dimensional data: the issue of stability, in 26th IEEE Conference, WETICE-2017 (2017), pp. 170–175
X.Y. Chen, B. Liu, Z.F. Zhang, X. Xia, The analysis of GCFS algorithm in medical data processing and mining. Am. J. Softw. Eng. Appl. 3(6), 68–73 (2014). https://doi.org/10.11648/j.ajsea.20140306.11
C. Bala Kumar, A data mining approach on various classifiers in email spam filtering. IJRASET 3(1), 8–14 (2015)
J.D. Novakovic, Support vector machine as feature selection method in classifier ensembles. I.J. Mod. Educ. Comput. Sci. 4, 1–8 (2014)
B.C. Rokach, O. Maimon, Feature selection by combining multiple methods. Adv. Web Intell. Data Min. 295– 304 (2019)
J.T. de Souza, N. Japkowicz, S. Matwin, Stochfs: a framework for combining feature selection outcomes through a stochastic process, in Proceedings of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (2005), pp. 667–674
K. Polat, S. Sahan, H. Kodaz, S. Günes, A new classification method for breast cancer diagnosis: feature selection artificial immune recognition system (FS-AIRS), in Proceedings of ICNC (2)’ (2005), pp. 830–838
C. Deisy, B. Subbulakshmi, S. Baskar, N. Ramaraj, Efficient dimensional reduction approach for feature selection, in Conference on Computational Intelligence and Multimedia Applications (2007)
M.A. Hall, L.A. Smith, Feature subset selection: a correlation based filter approach, in 1997 International Conference on Neural Information Processing and Intelligent Information Systems (1997), pp. 855–858
S. Aruna, S.P. Rajagopalan, A novel SVM based CSSFFS feature selection algorithm for detecting breast cancer. Int. J. Comput. Appl. 31(8), 14–20 (2011). ISSN: 0975-8887
D. Lavanya, K. Usha Rani, Analysis of feature selection with classification: breast cancer datasets. Indian J. Comput. Sci. Eng. (IJCSE) (2011)
V. Kumar, Sonajharia, Feature selection: a literature review. Soft Comput. Rev. 4(3) (2014)
M. Hall, Correlation-based feature selection for machine learning, Tech. Rep., Doctoral Dissertation, University of Waikato, Department of Computer Science, View at Google Scholar (1999)
UCI Irvine Machine Learning Repository, www.ics.uci.edu/~mlearn/MLRepository.html
Bioinformatics Group Seville, http://www.upo.es/eps/bigs/datasets.html
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Sujatha, G., Usha Rani, K. (2020). A Comprehensive Hybrid Ensemble Method with Feature Selection Techniques. In: Jyothi, S., Mamatha, D., Satapathy, S., Raju, K., Favorskaya, M. (eds) Advances in Computational and Bio-Engineering. CBE 2019. Learning and Analytics in Intelligent Systems, vol 15. Springer, Cham. https://doi.org/10.1007/978-3-030-46939-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-46939-9_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-46938-2
Online ISBN: 978-3-030-46939-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)