The Performance Improvements of SPRINT Algorithm Based on the Hadoop Platform

Pan, TianMing

doi:10.1007/978-3-642-29390-0_12

TianMing Pan³

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 160))

1200 Accesses
2 Citations

Abstract

The emergence of cloud computing provide the medium enterprises many low-cost mass data analysis solutions. Decision tree algorithm in which one of the biggest problems is its computational complexity is proportional to the size and training data, resulting in a large number of computing time in constructing Data Set. The article aim at the SPRINT algorithm based on the Hadoop platform, presenting a parallel method of constructing a decision tree and then solving the parallel problem in Hadoop platform.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Miller, M., Lei, J., Ruizhi, S., Yong, X., et al.: Cloud computing. Mechanical Press, Beijing (2009)
Google Scholar
Dean, J., Ghemawat, S.: MapReduce: Symplified Date Processing on Large Clusters, vol. 51(1), pp. 107–113. ACM, New York (2009)
Google Scholar
Campbell: Data mining concepts and techniques. Mechanical Industry Press, Beijing (2010)
Google Scholar
Shafer, J., Agrawal, R., Mehta, M.: SPRINT: A Scalable Parallel Classifier for Data Mining. IBM Almaden Research Center, U.S
Google Scholar
Zhu, Z.: Massive data processing and application based on Hadoop model. Beijing University of Posts and Telecommunications, Beijing (2010)
Google Scholar
Liu, Y., Wang, L.: Improvement of SPRINT Algorithm. Computer Engineering (2008)
Google Scholar
Pen, C., Lou, K.: Improving the Method Used by SPRINT Algorithm to Find the Best Split Point of Continuous Attribute. Computer Engineering and Application (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Science and Engineering, East China Normal University, Shanghai, China
TianMing Pan

Authors

TianMing Pan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to TianMing Pan .

Editor information

Editors and Affiliations

Researcher Association, Wuhan Section, International Science & Education, Special No.1, Jiangxia Road of Wuhan, Wuhan, China, People's Republic
David Jin
Researcher Association, Guangzhou Section, International Science & Education, Jinheng Road, Jinbi Garden 85-1102 144, Guang Zhou, China, People's Republic
Sally Lin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pan, T. (2012). The Performance Improvements of SPRINT Algorithm Based on the Hadoop Platform. In: Jin, D., Lin, S. (eds) Advances in Future Computer and Control Systems. Advances in Intelligent and Soft Computing, vol 160. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29390-0_12

Download citation

DOI: https://doi.org/10.1007/978-3-642-29390-0_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29389-4
Online ISBN: 978-3-642-29390-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics