Based on Cloud-Computing’s Web Data Mining

Ruan, Shen

doi:10.1007/978-3-642-31968-6_29

Part of the book series: Communications in Computer and Information Science (CCIS, volume 289)

Abstract

The huge amounts of data generated on the Internet are distributed, heterogeneous, dynamic, and increasingly complex, and existing centralized data mining methods cannot meet application requirements. To solve these problems, this paper proposes a cloud computing-based Web data mining method in which the massive data and the mining tasks are decomposed and processed in parallel on multiple computers. We use the open platform Hadoop to build a parallel association rule mining algorithm based on Apriori, and we test and verify the efficiency of the system. The paper also proposes a design approach that migrates the computation to the storage: computation is carried out on the local storage nodes, which avoids transmitting large amounts of data over the network and does not consume much bandwidth.
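The idea in the abstract can be illustrated with a minimal sketch. The paper's own implementation is not shown in this preview, so the code below is an assumption-laden illustration rather than the author's method: it simulates the map and reduce steps in plain Python instead of using Hadoop, and the names `map_count`, `reduce_counts`, `next_candidates`, and `parallel_apriori` are hypothetical. Each partition stands in for the data held on one storage node, so only small count tables, not raw transactions, would need to cross the network.

```python
from collections import Counter
from itertools import combinations


def map_count(partition, candidates):
    """Map step: count candidate itemsets inside one local partition."""
    counts = Counter()
    for transaction in partition:
        items = set(transaction)
        for candidate in candidates:
            if candidate <= items:  # candidate is a subset of the transaction
                counts[candidate] += 1
    return counts


def reduce_counts(partial_counts, min_support):
    """Reduce step: sum per-partition counts and keep the frequent itemsets."""
    total = Counter()
    for counts in partial_counts:
        total.update(counts)
    return {itemset: n for itemset, n in total.items() if n >= min_support}


def next_candidates(frequent, k):
    """Build k-itemsets whose every (k-1)-subset is frequent (Apriori property)."""
    items = sorted({item for itemset in frequent for item in itemset})
    candidates = set()
    for combo in combinations(items, k):
        if all(frozenset(sub) in frequent for sub in combinations(combo, k - 1)):
            candidates.add(frozenset(combo))
    return candidates


def parallel_apriori(partitions, min_support):
    """Level-wise Apriori with one simulated map/reduce round per itemset size."""
    candidates = {frozenset([item]) for part in partitions for t in part for item in t}
    result = {}
    k = 1
    while candidates:
        # In a real cluster, each map_count call would run on the node storing that partition.
        partial = [map_count(part, candidates) for part in partitions]
        frequent = reduce_counts(partial, min_support)
        result.update(frequent)
        k += 1
        candidates = next_candidates(frequent, k)
    return result


if __name__ == "__main__":
    # Illustrative data only: two partitions standing in for two storage nodes.
    partitions = [
        [["bread", "milk"], ["bread", "beer", "eggs"]],
        [["milk", "beer", "bread"], ["bread", "milk", "beer"], ["milk", "eggs"]],
    ]
    for itemset, support in parallel_apriori(partitions, min_support=3).items():
        print(sorted(itemset), support)
```

The sketch uses an absolute support count as the threshold and runs the map calls sequentially; on a real cluster the framework would schedule them in parallel on the nodes that hold each partition.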

Author information

Authors and Affiliations

Department of Mathematics and Computer Science, Liuzhou Teachers College, Liuzhou, Guangxi, 545004, China
Shen Ruan

Editor information

Editors and Affiliations

Taiyuan University, Xueyuan Road 3, 030051, Taiyuan, Shanxi, China
Maotai Zhao
Xi’an University, Jinhua Road 4, 710032, Xi’an, Shaanxi, China
Junpin Sha

About this paper

Cite this paper

Ruan, S. (2012). Based on Cloud-Computing’s Web Data Mining. In: Zhao, M., Sha, J. (eds) Communications and Information Processing. Communications in Computer and Information Science, vol 289. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31968-6_29

DOI: https://doi.org/10.1007/978-3-642-31968-6_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31967-9
Online ISBN: 978-3-642-31968-6
eBook Packages: Computer Science, Computer Science (R0)
