Abstract
Bigdata came to prominence in the early 2000s and has since become a focus of researchers and data scientists. The main purpose of research and development in the field of Bigdata is to extract meaningful information, and to make predictions, from large amounts of structured and unstructured real-world data. This paper presents a systematic review of the background of Bigdata and of the existing related technologies used by large enterprises, data researchers, and government officials. In addition, it presents the standardized, complex processes used to extract useful information, namely data generation, storage, modeling/analysis, visualization, and interpretation. Finally, it discusses open issues and challenges and points out emerging directions in which researchers can work in the age of Bigdata.
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Tidke, B., Mehta, R., Dhanani, J. (2018). A Comprehensive Survey and Open Challenges of Mining Bigdata. In: Satapathy, S., Joshi, A. (eds) Information and Communication Technology for Intelligent Systems (ICTIS 2017) - Volume 1. ICTIS 2017. Smart Innovation, Systems and Technologies, vol 83. Springer, Cham. https://doi.org/10.1007/978-3-319-63673-3_53
DOI: https://doi.org/10.1007/978-3-319-63673-3_53
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63672-6
Online ISBN: 978-3-319-63673-3
eBook Packages: Engineering, Engineering (R0)