Advertisement

A Review on Big Data Mining in Cloud Computing

  • Bhaludra R. Nadh SinghEmail author
  • B. Raja Srinivasa Reddy
Conference paper
Part of the Lecture Notes in Networks and Systems book series (LNNS, volume 8)

Abstract

Data mining has become indispensable in the wake of ever-growing data in enterprises. The IT departments of organizations have their data mining services. However, the size of data is increased exponentially in such a way that the existing mining algorithms are inadequate to handle such data. The rationale behind this is that the data has become big data with characteristics like volume, variety and velocity. The big data needs to be handled in a distributed environment. Such environment is provided by cloud computing with its rich pool of computing resources. Therefore it is important to understand the dynamics of big data and mining of big data. Aligning IT wings of organizations to handle big data and perform mining on the big data is time consuming and needs investment. For this reason, it is desirable to have a mining service in cloud that can cater to the needs of organizations at an affordable price. This is actually a paradigm shift in thinking of obtaining business intelligence required by organizations. Towards this end this paper reviews literature and provides useful insights that can help in comprehending the present state-of-the-art on big data and possibility of mining service in cloud computing.

Keywords

Big data Cloud computing Data mining Mining as a service (MaaS) 

References

  1. 1.
    C.L. Philip Chen, Chun-Yang Zhang. (2014). Data-intensive applications, challenges, techniques and technologies: A survey on Big Data. ELsevier. p. 32–44.Google Scholar
  2. 2.
    Xindong Wu, Xingquan Zhu, Gong-Qing Wu. (2014). Data Mining with Big Data. IEEE. 26 (1), p. 97–107.Google Scholar
  3. 3.
    I. Kopanas, N. Avouris, and S. Daskalaki, “The Role of Domain Knowledge in a Large Scale Data Mining Project,” Proc. Second Hellenic Conf. AI: Methods and Applications of Artificial Intelligence, I.P. Vlahavas, C.D. Spyropoulos, eds., pp. 288–299, 2002.Google Scholar
  4. 4.
    J. Lorch, B. Parno, J. Mickens, M. Raykova, and J. Schiffman, “Shoroud: Ensuring Private Access to Large-Scale Data in the Data Center,” Proc. 11th USENIX Conf. File and Storage Technologies (FAST’13), 2013.Google Scholar
  5. 5.
    E. Schadt, “The Changing Privacy Landscape in the Era of Big Data,” Molecular Systems, vol. 8, article 612, 2012.Google Scholar
  6. 6.
    A. Machanavajjhala and J.P. Reiter, “Big Privacy: Protecting Confidentiality in Big.Google Scholar
  7. 7.
    IbrahimAbakerTargioHashem, IbrarYaqoob, NorBadrulAnuar, Salimah Mokhtar, AbdullahGani, and SameeUllahKhan. (2015). The rise of “big data” on cloud computing: Review and open research issues. ELsevier. 47 (1), p. 98–115.Google Scholar
  8. 8.
    S. Papadimitriou and J. Sun, “Disco: Distributed Co-Clustering with Map-Reduce: A Case Study Towards Petabyte-Scale End-to End Mining,” Proc. IEEE Eighth Int’l Conf. Data Mining (ICDM’08), pp. 512–521, 2008.Google Scholar
  9. 9.
    C. Ranger, R. Raghuraman, A. Penmetsa, G. Bradski, and C. Kozyrakis, “Evaluating MapReduce for Multi-Core and Multiprocessor Systems,” Proc. IEEE 13th Int’l Symp. High Performance Computer Architecture (HPCA’07), pp. 13–24, 2007.Google Scholar
  10. 10.
    X. Zhu, P. Zhang, X. Lin, and Y. Shi, “Active Learning From Stream Data Using Optimal Weight Classifier Ensemble,” IEEE Trans. Systems, Man, and Cybernetics, Part B, vol. 40, no. 6, pp. 1607– 1621, Dec. 2010.Google Scholar
  11. 11.
    Yingyi Bu, Bill Howe, Magdalena Balazinska and Michael D. Ernst. 2010. HaLoop: Efficient Iterative Data Processing on Large Clusters. USA: IEEE. p 1–12.Google Scholar
  12. 12.
    Sriram Rao, Raghu Ramakrishnan and Adam Silberstein. 2012. Sailfish: A Framework for Large Scale Data Processing. USA: Microsoft. p 1–14.Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2017

Authors and Affiliations

  • Bhaludra R. Nadh Singh
    • 1
    Email author
  • B. Raja Srinivasa Reddy
    • 2
  1. 1.Department of CSEANUGunturIndia
  2. 2.ANUGunturIndia

Personalised recommendations