Abstract
In recent years, with the massive growth of data, the world today has entered the era of big data. Big data has brought tremendous value to all fields of today’s society, and it has also brought enormous challenges, which has attracted great attention from all walks of life. Analyze and forecast the research hotspots and future development trends in the field of big data, and understand the development changes and priorities in the field of big data research, which will play a significant role in promoting the development of social development and scientific research. In the era of big data, how to extract information from huge amounts of complex data and present complex information more clearly and clearly, the most effective way is to use visualization technology. The article uses the information visualization software Citespace to study the data related to big data in the Web of Science and CNKI database from 2008 to 2017 for 10 years, from macro to micro to the representative countries of the literature, keywords and co-cited documents. Through visualization analysis, the article clarifies the key research directions, key documents and hot spot frontiers in the field of big data research, forecasts the future development trends in this field, and compares the research situation at home and abroad, in order to provide readers and other researchers with certain reference and help.
Similar content being viewed by others
References
Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z (2007) Dbpedia: a nucleus for a web of open data. The semantic web. Springer, Berlin, pp 722–735
Boyd D, Crawford K (2012) Critical questions for big data. Inf Commun Soc 15:1–18
Brandes U (2001) A faster algorithm for betweenness centrality. J Math Sociol 25(2):163–177
Chen CLP, Zhang CY (2014) Data-intensive applications, challenges, techniques and technologies: A survey on big data. Inf Sci 275(11):314–347
Chen K, Zheng W (2010) Cloud computing: system instances and current research: cloud computing: system instances and current research. J Softw 20:1337–1348
Chen Y, Liu Z, Chen J, Hou J (2008) History and theory of mapping knowledge domains. Stud Sci Sci 26(3):449–460
Chen H, Chiang R, Storey V (2012) Business intelligence and analytics: from big data to big impact. MIS Q 36:1165–1188
Chen M, Mao S, Liu Y (2014) Big data: a survey. Mob Netw Appl 19(2):171–209
Chen X, Chen B, Zhang C, Hao T (2017a) Discovering the recent research in natural language processing field based on a statistical approach. Springer, Berlin, pp 507–517
Chen X, Weng H, Hao T (2017b) A data-driven approach for discovering the recent research status of diabetes in china. Health information science. Springer International Publishing, Cham, pp 89–101
Cheng X, Jin X, Wang Y, Guo J, Zhang T, Li G (2014) Survey on big data system and analytic technology. J Softw 9:1889–1908
Dai S, Dong J, Xue J (2014) Visualization analysis and application of the big data in scientific computing. Eng Eng Interdiscip Perspect 6(3):275–281
Danasingh AA, Tamizhpoonguil B, Epiphany JL (2016) A survey on big data and cloud computing. Int J Recent Innov Trends Comput Commun 4:273–277
Dean J, Ghemawat S (2008) Mapreduce: simplified data processing on large clusters. Commun ACM 51(1):107–113
Fu H, Ho Y, Sui Y, Li Z (2010) A bibliometric analysis of solid waste research during the period 1993–2008. Waste Manag (New York) 30:2410–2417
Gantz J, Reinsei D (2011) Extracting value from chaos. IDC iview 1142(2011):1–12
Guan S, Meng X, Li Z, Liu Y (2015) Big data study on the current situation, problems and countermeasures. J Intell 5:98–104
Hao T, Chen X, Li G, Yan J (2018) A bibliometric analysis of text mining in medical research. Soft Comput 22(23):7875–7892
Hou J, Hu Z (2019) Review on the application of Citespace at home and Abroad. J Mod Inf 33:99–103
Lazer D, Kennedy R, King G, Vespignani A (2014) The parable of Google Flu: traps in big data analysis. Science 343(6176):1203
Li G, Cheng X (2012) Research status and scientific thinking of big data. Bull Chin Acad Sci 6:647–657
Li Z, Nie F, Chang X, Yang Y (2017) Beyond trace ratio: weighted harmonic mean of trace ratios for multiclass discriminant analysis. IEEE Trans Knowl Data Eng 29(10):2100–2110
Liu H, Morstatter F, Tang J, Zafarani R (2016a) The good, the bad, and the ugly: uncovering novel research opportunities in social media mining. Int J Data Sci Anal 1:137–143
Liu Q, Li Y, Duan H, Liu Y, Qin Z (2016b) Knowledge graph construction techniques. J Comput Res Dev 53(3):582–600
Luo M, Chang X, Yang Y, Nie L, Hauptmann A, Zheng Q (2017) Simple to complex cross-modal learning to rank. Comput Vis Image Underst 163:67–77
Manyika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, Hung-Byers A (2011) Big data: the next frontier for innovation, competition, and productivity. McKinsey Global Institute, Available at: https://www.mckinsey.com/business-functions/digital-mckinsey/our-insights/big-data-the-next-frontier-for-innovation
Marcos S, Garcia-Penalvo F (2018) Information retrieval methodology for aiding scientific database search. Soft Comput 10:1–10
Mayerschonberger V, Cukier K (2014) Big data: a revolution that will transform how we live, work, and think. Math Comput Educ 47(17):181–183
McAfee A, Brynjolfsson E (2012) Big data: the management revolution. Harv Bus Rev 90:60–68
Meng X, Ci X (2013) Big data management: concepts, techniques and challenges. J Comput Res Dev 1:146–169
Price D (1965) Networks of scientific papers. Science 149(3683):510–515
Qin C, Hou H (2009) Mapping knowledge domain—a new field of information management and knowledge management. J Acad Libr 1:30–37
Science (2011) A special issue of science: dealing with data. Sci Technol Appl 2(1):93–94
Shi Y, Meng X (2014) A survey of query techniques in cloud data management systems. Chin J Comput 36:209–225
Shneider AM (2009) Four stages of a scientific discipline; four types of scientist. Trends Biochem Sci 34(5):217–223
Tan H, Gao Y (2017) Regularized constraint subspace based method for image set classification. IEEE Access 5:15001–15012
Tan H, Gao Y, Ma Z (2018) Regularized constraint subspace based method for image set classification. Pattern Recogn 76:434–448
Taylor R (2010) An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics. BMC Bioinf 11(Suppl 12):S1
Wang S, Wang H, Tan X (2011) Architecting big data: challenges, studies and forecasts. Chin J Comput 34(10):1741–1752
Wang Y, Jin X, Cheng X (2013) Network big data: present and future. Chin J Comput 36(6):1125–1138
Wei L, Zhao Y (2015) Bibliometric analysis of global environmental assessment research in a 20-year period. Environ Impact Assess Rev 50:158–166
Wu X, Zhu X, Wu G, Wei D (2013) Data mining with big data. IEEE Trans Knowl Data Eng 26(1):97–107
Wu Y, Yang F, Lai G, Lin K (2016) Research progress of knowledge graph learning and reasoning. J Chin Comput Syst 37(9):2007–2013
Yang L, Wei X (2011) Visualization research in foreign social network analysis based on mapping knowledge domain. Inf Sci 29:1041–1048
Zhang Y, Chen M, Liao X (2013) Big data applications: a survey. J Comput Res Dev 50(z2):216–233
Zhang S, Yang Z, Xing X, Gao Y, Xie D, Wong HS (2017) Generalized pair-counting similarity measures for clustering and cluster ensembles. IEEE Access 5:16904–16918
Zheng L (2013) Stride into the era of “big data”. Inf Constr 1(2011):10–13
Acknowledgements
This paper is funded by the National Natural Science Special Fund Project (61340058) and the Zhejiang Provincial Natural Science Fund Key Project (LZ14F020001).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
All Authors declare that they have no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Additional information
Communicated by Mu-Yen Chen.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, W., Lu, C. Visualization analysis of big data research based on Citespace. Soft Comput 24, 8173–8186 (2020). https://doi.org/10.1007/s00500-019-04384-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-019-04384-7