Abstract
The growing availability of large amounts of multimedia contents in science and industry have made data mining applications such as data classification highly demanding. The contribution of this paper is two-fold. First, we propose an approach for constructing a decision tree based classification model for multimedia contents. Second, in order to speed up the performance of the proposed model, we propose a hybrid CPU-GPU approach for construction of decision tree on Graphic Processing Unit (GPU). Our approach not only accelerates the construction of decision tree via GPU computing, but also does so by considering the power and energy consumption of the GPU. Through the experiments, we demonstrate that the proposed hybrid CPU-GPU approach outperforms CPU-based sequential implementation by several times.
Similar content being viewed by others
References
Basak J, Krishnapuram R (2005) Interpretable hierarchical clustering by constructing an unsupervised decision tree. IEEE Trans Knowl Data Eng 17(1):121–132
Chui FCF, Bindoff I, Williams R (2009) Applying feature extraction for classification problems. J Signal Process Image Process Pattern Recogn 2(1):1–16
Hong TH, Yun CH, Park JW, Lee HG, Jung SH, Lee YW (2013) Big data processing with MapReduce for E-book. Int J Multimed Ubiquit Eng 8(1):151–162
Huang S, Xiao S, Feng W (2009) On the Energy Efficiency of Graphics Processing Units for Scientific Computing. In: Proceeding of the 2009 IEEE International Symposium on Parallel and Distributed Processing, Rome, Italy, pp. 1–8
Jin R, Yang G, Agrawal G (2005) Shared memory parallelization of data mining algorithms: techniques, programming interface, and performance. IEEE Trans Knowl Data Eng 17(1):71–89
Lee YC, Zomaya AY (2009) On effective slack reclamation in task scheduling for energy reduction. J Inf Process Syst 5(4):175–186
Lim N (2007) Classification by ensembles from random partitions using logistic regression models. Stony Brook University, New York, USA
Liu M, Wan C, Wang L (2002) Content-based audio classification and retrieval using a fuzzy logic system: towards multimedia search engines. Soft Comput 6(5):357–364
Ma A, Sethi IK, Patel NV (2009) Multimedia Content Tagging Using Multilabel Decision Tree. In: Proceeding of 11th IEEE International Symposium on Multimedia, San Diego, California, USA, pp. 606–611
Nguyen H (2007) GPU Gems 3. Addison-Wesley Professional, Boston, MA, USA
Park YH, Whang KY, Lee BS, Han WS (2006) Efficient evaluation of linear path expressions on large-scale heterogeneous XML documents using information retrieval techniques. J Syst Softw 79(2):180–190
Quinlan JR (1986) Induction of decision trees. Mach Learn 1(1):81–106
Shafer CJ, Agrawal R, Mehta M (1996) SPRINT: A Scalable Parallel Classifier for Data Mining. In: Proceedings of the 22th International Conference on Very Large Data Bases, Bombay, India, pp. 544–555
Teng Z, Du W (2007) A hybrid multi-group privacy-preserving approach for building decision trees. In: Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining, Nanjing, China, pp. 296–307
Van Der Laan WJ, Jalba AC, Roerdink JBTM (2011) Accelerating wavelet lifting on graphics hardware using CUDA. IEEE Trans Parallel Distrib Syst 22(1):132–146
Zhou ZH (2003) Three perspectives of data mining. Artif Intell 143(1):139–146
Acknowledgments
This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (2012003797).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Nasridinov, A., Ihm, SY. & Park, YH. A hybrid construction of a decision tree for multimedia contents. Multimed Tools Appl 74, 8455–8465 (2015). https://doi.org/10.1007/s11042-013-1614-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-013-1614-6