Big Data Mining in the Cloud

Shi, Zhongzhi

doi:10.1007/978-3-642-32891-6_4

Zhongzhi Shi⁴

Part of the book series: IFIP Advances in Information and Communication Technology ((IFIPAICT,volume 385))

Included in the following conference series:

International Conference on Intelligent Information Processing

2081 Accesses
2 Citations

Abstract

Big Data is the growing challenge that organizations face as they deal with large and fast-growing sources of data or information that also present a complex range of analysis and use problems. Digital data production in many fields of human activity from science to enterprise is characterized by an exponential growth. Big data technologies will become a new generation of technologies and architectures which is beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time.

Massive data sets are hard to understand, and models and patterns hidden within them cannot be identified by humans directly, but must be analyzed by computers using data mining techniques. The world of big data present rich cross-media contents, such as text, image, video, audio, graphics and so on. For cross-media applications and services over the Internet and mobile wireless networks, there are strong demands for cross-media mining because of the significant amount of computation required for serving millions of Internet or mobile users at the same time. On the other hand, with cloud computing booming, new cloud-based cross-media computing paradigm emerged, in which users store and process their cross-media application data in the cloud in a distributed manner. Cross-media is the outstanding characteristics of the age of big data with large scale and complicated processing task. Cloud-based Big Data platforms will make it practical to access massive compute resources for short time periods without having to build their own big data farms. We propose a framework for cross-media semantic understanding which contains discriminative modeling, generative modeling and cognitive modeling. In cognitive modeling, a new model entitled CAM is proposed which is suitable for cross-media semantic understanding. A Cross-Media Intelligent Retrieval System (CMIRS), which is managed by ontology-based knowledge system KMSphere, will be illustrated.

This talk also concerns Cloud systems which can be effectively employed to handle parallel mining since they provide scalable storage and processing services, as well as software platforms for developing and running data analysis environments. We exploit Cloud computing platforms for running big data mining processes designed as a combination of several data analysis steps to be run in parallel on Cloud computing elements. Finally, the directions for further researches on big data mining technology will be pointed out and discussed.

Download to read the full chapter text

Chapter PDF

Multimedia Social Big Data: Mining

Big Web Data: Warehousing and Analytics

Cloud Technologies: A New Level for Big Data Mining

Author information

Authors and Affiliations

Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Zhongzhi Shi

Authors

Zhongzhi Shi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computing Technology, Chinese Academy of Sciences, 100190, Beijing, China
Zhongzhi Shi
Computer Science Department, Indiana University, 47405, Bloomington, IN, USA
David Leake
School of Computing Science and Engineering, University of Salford, M5 4WT, Salford, UK
Sunil Vadera

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shi, Z. (2012). Big Data Mining in the Cloud. In: Shi, Z., Leake, D., Vadera, S. (eds) Intelligent Information Processing VI. IIP 2012. IFIP Advances in Information and Communication Technology, vol 385. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32891-6_4

Download citation

DOI: https://doi.org/10.1007/978-3-642-32891-6_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32890-9
Online ISBN: 978-3-642-32891-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Big Data Mining in the Cloud

Abstract

Chapter PDF

Similar content being viewed by others

Multimedia Social Big Data: Mining

Big Web Data: Warehousing and Analytics

Cloud Technologies: A New Level for Big Data Mining

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Big Data Mining in the Cloud

Abstract

Chapter PDF

Similar content being viewed by others

Multimedia Social Big Data: Mining

Big Web Data: Warehousing and Analytics

Cloud Technologies: A New Level for Big Data Mining

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation