Abstract
The emergence of cloud computing provide the medium enterprises many low-cost mass data analysis solutions. Decision tree algorithm in which one of the biggest problems is its computational complexity is proportional to the size and training data, resulting in a large number of computing time in constructing Data Set. The article aim at the SPRINT algorithm based on the Hadoop platform, presenting a parallel method of constructing a decision tree and then solving the parallel problem in Hadoop platform.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Miller, M., Lei, J., Ruizhi, S., Yong, X., et al.: Cloud computing. Mechanical Press, Beijing (2009)
Dean, J., Ghemawat, S.: MapReduce: Symplified Date Processing on Large Clusters, vol. 51(1), pp. 107–113. ACM, New York (2009)
Campbell: Data mining concepts and techniques. Mechanical Industry Press, Beijing (2010)
Shafer, J., Agrawal, R., Mehta, M.: SPRINT: A Scalable Parallel Classifier for Data Mining. IBM Almaden Research Center, U.S
Zhu, Z.: Massive data processing and application based on Hadoop model. Beijing University of Posts and Telecommunications, Beijing (2010)
Liu, Y., Wang, L.: Improvement of SPRINT Algorithm. Computer Engineering (2008)
Pen, C., Lou, K.: Improving the Method Used by SPRINT Algorithm to Find the Best Split Point of Continuous Attribute. Computer Engineering and Application (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag GmbH Berlin Heidelberg
About this paper
Cite this paper
Pan, T. (2012). The Performance Improvements of SPRINT Algorithm Based on the Hadoop Platform. In: Jin, D., Lin, S. (eds) Advances in Future Computer and Control Systems. Advances in Intelligent and Soft Computing, vol 160. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29390-0_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-29390-0_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29389-4
Online ISBN: 978-3-642-29390-0
eBook Packages: EngineeringEngineering (R0)