Soft Computing: Theories and Applications, pp. 637-650
A Rigorous Investigation on Big Data Analytics
Abstract
Big Data has become a major trend in science, technology, business, and marketing. Traditional data analytics techniques cannot be applied directly to big data, so there is a need to develop high-performance platforms that analyze big data more efficiently. It is challenging for organizations to uncover actionable patterns in massive volumes of data, yet doing so enables great improvements in business and technical processes and in customer analytics. Datasets are heterogeneous in granularity and accessibility, and companies face many issues and challenges in storing and handling Big Data. The ability to automatically store, organize, review, and analyze the data is essential. This paper addresses three questions: why is big data needed, why has it attracted so much hype, and why is analytics needed? It describes the need for Big Data and its analysis, presents a brief investigation of big data analytics, and discusses the use of tools such as Hadoop, Hive, Pig, and Spark in summarizing data.
Keywords
Big data analytics; Big data issues and challenges; Analytics techniques; Apache Hadoop; Apache Drill; Project Storm