Skip to main content
Log in

Visualization analysis of big data research based on Citespace

  • Focus
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

In recent years, with the massive growth of data, the world today has entered the era of big data. Big data has brought tremendous value to all fields of today’s society, and it has also brought enormous challenges, which has attracted great attention from all walks of life. Analyze and forecast the research hotspots and future development trends in the field of big data, and understand the development changes and priorities in the field of big data research, which will play a significant role in promoting the development of social development and scientific research. In the era of big data, how to extract information from huge amounts of complex data and present complex information more clearly and clearly, the most effective way is to use visualization technology. The article uses the information visualization software Citespace to study the data related to big data in the Web of Science and CNKI database from 2008 to 2017 for 10 years, from macro to micro to the representative countries of the literature, keywords and co-cited documents. Through visualization analysis, the article clarifies the key research directions, key documents and hot spot frontiers in the field of big data research, forecasts the future development trends in this field, and compares the research situation at home and abroad, in order to provide readers and other researchers with certain reference and help.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. http://www.analytictech.com/ucinet/.

  2. http://pajek.imfm.si/doku.php?id=pajek.

  3. http://cluster.cis.drexel.edu/cchen/citespace/.

  4. http://www.umu.se/inforsk/Bibexcel/.

  5. http://gephi.org.

  6. https://www.vosviewer.com/.

References

  • Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z (2007) Dbpedia: a nucleus for a web of open data. The semantic web. Springer, Berlin, pp 722–735

    Google Scholar 

  • Boyd D, Crawford K (2012) Critical questions for big data. Inf Commun Soc 15:1–18

    Google Scholar 

  • Brandes U (2001) A faster algorithm for betweenness centrality. J Math Sociol 25(2):163–177

    MATH  Google Scholar 

  • Chen CLP, Zhang CY (2014) Data-intensive applications, challenges, techniques and technologies: A survey on big data. Inf Sci 275(11):314–347

    Google Scholar 

  • Chen K, Zheng W (2010) Cloud computing: system instances and current research: cloud computing: system instances and current research. J Softw 20:1337–1348

    Google Scholar 

  • Chen Y, Liu Z, Chen J, Hou J (2008) History and theory of mapping knowledge domains. Stud Sci Sci 26(3):449–460

    Google Scholar 

  • Chen H, Chiang R, Storey V (2012) Business intelligence and analytics: from big data to big impact. MIS Q 36:1165–1188

    Google Scholar 

  • Chen M, Mao S, Liu Y (2014) Big data: a survey. Mob Netw Appl 19(2):171–209

    Google Scholar 

  • Chen X, Chen B, Zhang C, Hao T (2017a) Discovering the recent research in natural language processing field based on a statistical approach. Springer, Berlin, pp 507–517

    Google Scholar 

  • Chen X, Weng H, Hao T (2017b) A data-driven approach for discovering the recent research status of diabetes in china. Health information science. Springer International Publishing, Cham, pp 89–101

    Google Scholar 

  • Cheng X, Jin X, Wang Y, Guo J, Zhang T, Li G (2014) Survey on big data system and analytic technology. J Softw 9:1889–1908

    Google Scholar 

  • Dai S, Dong J, Xue J (2014) Visualization analysis and application of the big data in scientific computing. Eng Eng Interdiscip Perspect 6(3):275–281

    Google Scholar 

  • Danasingh AA, Tamizhpoonguil B, Epiphany JL (2016) A survey on big data and cloud computing. Int J Recent Innov Trends Comput Commun 4:273–277

    Google Scholar 

  • Dean J, Ghemawat S (2008) Mapreduce: simplified data processing on large clusters. Commun ACM 51(1):107–113

    Google Scholar 

  • Fu H, Ho Y, Sui Y, Li Z (2010) A bibliometric analysis of solid waste research during the period 1993–2008. Waste Manag (New York) 30:2410–2417

    Google Scholar 

  • Gantz J, Reinsei D (2011) Extracting value from chaos. IDC iview 1142(2011):1–12

    Google Scholar 

  • Guan S, Meng X, Li Z, Liu Y (2015) Big data study on the current situation, problems and countermeasures. J Intell 5:98–104

    Google Scholar 

  • Hao T, Chen X, Li G, Yan J (2018) A bibliometric analysis of text mining in medical research. Soft Comput 22(23):7875–7892

    Google Scholar 

  • Hou J, Hu Z (2019) Review on the application of Citespace at home and Abroad. J Mod Inf 33:99–103

    Google Scholar 

  • Lazer D, Kennedy R, King G, Vespignani A (2014) The parable of Google Flu: traps in big data analysis. Science 343(6176):1203

    Google Scholar 

  • Li G, Cheng X (2012) Research status and scientific thinking of big data. Bull Chin Acad Sci 6:647–657

    Google Scholar 

  • Li Z, Nie F, Chang X, Yang Y (2017) Beyond trace ratio: weighted harmonic mean of trace ratios for multiclass discriminant analysis. IEEE Trans Knowl Data Eng 29(10):2100–2110

    Google Scholar 

  • Liu H, Morstatter F, Tang J, Zafarani R (2016a) The good, the bad, and the ugly: uncovering novel research opportunities in social media mining. Int J Data Sci Anal 1:137–143

    Google Scholar 

  • Liu Q, Li Y, Duan H, Liu Y, Qin Z (2016b) Knowledge graph construction techniques. J Comput Res Dev 53(3):582–600

    Google Scholar 

  • Luo M, Chang X, Yang Y, Nie L, Hauptmann A, Zheng Q (2017) Simple to complex cross-modal learning to rank. Comput Vis Image Underst 163:67–77

    Google Scholar 

  • Manyika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, Hung-Byers A (2011) Big data: the next frontier for innovation, competition, and productivity. McKinsey Global Institute, Available at: https://www.mckinsey.com/business-functions/digital-mckinsey/our-insights/big-data-the-next-frontier-for-innovation

  • Marcos S, Garcia-Penalvo F (2018) Information retrieval methodology for aiding scientific database search. Soft Comput 10:1–10

    Google Scholar 

  • Mayerschonberger V, Cukier K (2014) Big data: a revolution that will transform how we live, work, and think. Math Comput Educ 47(17):181–183

    Google Scholar 

  • McAfee A, Brynjolfsson E (2012) Big data: the management revolution. Harv Bus Rev 90:60–68

    Google Scholar 

  • Meng X, Ci X (2013) Big data management: concepts, techniques and challenges. J Comput Res Dev 1:146–169

    Google Scholar 

  • Price D (1965) Networks of scientific papers. Science 149(3683):510–515

    Google Scholar 

  • Qin C, Hou H (2009) Mapping knowledge domain—a new field of information management and knowledge management. J Acad Libr 1:30–37

    Google Scholar 

  • Science (2011) A special issue of science: dealing with data. Sci Technol Appl 2(1):93–94

    Google Scholar 

  • Shi Y, Meng X (2014) A survey of query techniques in cloud data management systems. Chin J Comput 36:209–225

    Google Scholar 

  • Shneider AM (2009) Four stages of a scientific discipline; four types of scientist. Trends Biochem Sci 34(5):217–223

    Google Scholar 

  • Tan H, Gao Y (2017) Regularized constraint subspace based method for image set classification. IEEE Access 5:15001–15012

    Google Scholar 

  • Tan H, Gao Y, Ma Z (2018) Regularized constraint subspace based method for image set classification. Pattern Recogn 76:434–448

    Google Scholar 

  • Taylor R (2010) An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics. BMC Bioinf 11(Suppl 12):S1

    Google Scholar 

  • Wang S, Wang H, Tan X (2011) Architecting big data: challenges, studies and forecasts. Chin J Comput 34(10):1741–1752

    Google Scholar 

  • Wang Y, Jin X, Cheng X (2013) Network big data: present and future. Chin J Comput 36(6):1125–1138

    Google Scholar 

  • Wei L, Zhao Y (2015) Bibliometric analysis of global environmental assessment research in a 20-year period. Environ Impact Assess Rev 50:158–166

    Google Scholar 

  • Wu X, Zhu X, Wu G, Wei D (2013) Data mining with big data. IEEE Trans Knowl Data Eng 26(1):97–107

    Google Scholar 

  • Wu Y, Yang F, Lai G, Lin K (2016) Research progress of knowledge graph learning and reasoning. J Chin Comput Syst 37(9):2007–2013

    Google Scholar 

  • Yang L, Wei X (2011) Visualization research in foreign social network analysis based on mapping knowledge domain. Inf Sci 29:1041–1048

    Google Scholar 

  • Zhang Y, Chen M, Liao X (2013) Big data applications: a survey. J Comput Res Dev 50(z2):216–233

    Google Scholar 

  • Zhang S, Yang Z, Xing X, Gao Y, Xie D, Wong HS (2017) Generalized pair-counting similarity measures for clustering and cluster ensembles. IEEE Access 5:16904–16918

    Google Scholar 

  • Zheng L (2013) Stride into the era of “big data”. Inf Constr 1(2011):10–13

    Google Scholar 

Download references

Acknowledgements

This paper is funded by the National Natural Science Special Fund Project (61340058) and the Zhejiang Provincial Natural Science Fund Key Project (LZ14F020001).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chang Lu.

Ethics declarations

Conflict of interest

All Authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Additional information

Communicated by Mu-Yen Chen.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, W., Lu, C. Visualization analysis of big data research based on Citespace. Soft Comput 24, 8173–8186 (2020). https://doi.org/10.1007/s00500-019-04384-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-019-04384-7

Keywords

Navigation