Abstract
The extraction of meaningful, accurate, and relevant information is at the core of Big Data research. Furthermore, the ability to obtain an insight is essential in any decision-making process, even though the diverse and complex nature of big data sets raises a multitude of challenges. In this paper, we propose a novel method to address the automated assessment of influence among concepts in big data sets. This is carried out by investigating their mutual co-occurrence, which is determined via topologically reducing the corresponding network. The main motivation is to provide a toolbox to classify and analyse influence properties, which can be used to investigate their dynamical and statistical behaviour, potentially leading to a better understanding and prediction of the properties of the system(s) they model. An evaluation was carried out on two real-world data sets, which were analysed to test the capabilities of our system. The results show the potential of our approach, indicating both accuracy and efficiency.
Similar content being viewed by others
References
Albert R, Barabási AL (2002) Statistical mechanics of complex networks. Rev Mod Phys 74:47
Aviation Safety Reporting System Database. http://asrs.arc.nasa.gov/search/database.html. 1 March 2014
Azar AT, Hassanien AE (2014) Dimensionality reduction of medical big data using neural-fuzzy classifier. Soft Comput. doi:10.1007/s00500-014-1327-4
Blanco E, Castell N, Moldovan D (2008) Causal relation extraction. In: Proceedings of the 6th international conference on language resources and evaluation (LREC’08), 2008
Bollobás B (1998) Modern graph theory. Graduate texts in mathematics, vol 184, Springer, New York
Cormen TH, Leiserson CE, Rivest RL (1990) Introduction to algorithms. MIT Press, Cambridge
De Marneffe MF, MacCartney B, Manning CD (2006) Generating typed dependency parses from phrase structure parses. In: Proceedings of LREC
Ebel H, Mielsch LI, Bornholdt S (2002) Scale-free topology of E-mail networks. Phys Rev E 66:035103
Engelhardt-Nowitzki C, Kryvinska N, Strauss C (2011) Strategic demands on information services in uncertain businesses: a layer-based framework from a value network perspective. The 1st international workshop on frontiers in service transformations and innovations (FSTI-2011), in conjunction with EIDWT 2011, 7–9 Sept 2011, Tirana, Albania, pp 131–136
European-Mediterranean Seismological Centre Database. http://www.emsc-csem.org/. 1 May 2014
Feldman R, Sanger J (2006) The text mining handbook. Cambridge University Press, Cambridge
Francis WN, Kucera H (1979) The Brown Corpus: a standard corpus of present-day. Edited American English. Providence, RI: Department of Linguistics, Brown University [producer and distributor]
Gupta R, Gupta H, Mohania M (2012) Cloud computing and big data analytics: what is new from databases perspective? Big Data Anal. Lecture notes in computer science. Springer, Berlin Heidelberg, pp 42–61
Hein O, Schwind M, Konig W (2006) Scale-free networks. Wirtschaftsinformatik 48:4
Liu B (2012) Sentiment analysis and opinion mining. Morgan and Claypool Publishers
Molnar E, Kryvinska N, Gregus̆ M (2014) Customer driven big-data analytics for the companies’ servitization. In: Baines T, Clegg B, Harrison D (eds) The spring servitization conference 2014 (SSC 2014), 12–14 May 2014, Aston Business School, Aston University, UK, pp 133–140
Sanchez-Graillet O, Poesio M (2004) Acquiring Bayesian networks from text. In: Proceedings of LREC, European Language Resources Association
Schaffer J (2001) Causation, influence, and effluence. Analysis 61:1119. doi:10.1111/1467-8284.00263
Schonbauer R, Sommers P, Misfeld M, Dinov B, Fiedler F, Huo Y, Arya A (2013) Relevant ventricular septal defect caused by steam pop during ablation of premature ventricular contraction. Circulation 127(24):e843–844
Soheilykhah S, Sheikhani A, Sharif AG, Daevaeiha MM (2013) Localization of premature ventricular contraction foci in normal individuals based on multichannel electrocardiogram signals processing. Springerplus 2:486
Trovati M (2015) Reduced topologically real-world networks: a big-data approach. Int J Distrib Syst Technol (in press)
Trovati M, Asimakopoulou E, Bessis N (2014) An analytical tool to map big data to networks with reduced topologies. In: Proceedings of InCoS, pp 411–414
Trovati M, Bagdasar O (2014) Influence discovery in semantic networks: an initial approach. In: Proceedings of UKSim
Trovati M, Bessis N, Huber A, Zelenkauskaite A, Asimakopoulou E (2014) Extraction, identification and ranking of network structures from data sets. In: Proceedings of CISIS, pp 331–337
Watts DJ, Strogatz HS (1998) Collective dynamics of small-world networks. Nature 393:440–442
Weng J, Lim E, Jiang J, He Q (2010) TwitterRank: finding topic-sensitive influential twitterers. In: Proceedings of the 3rd ACM international conference on web search and data mining, pp 261–270
Wren JD (2006) Using fuzzy set theory and scale-free network properties to relate MEDLINE terms. Soft Comput 10(4):374–381
Zelenkauskaite A, Bessis N, Sotiriadis S, Asimakopoulou E (2012) Interconnectedness of complex systems of internet of things through social network analysis for disaster management. In: Proceedings of 4th IEEE INCoS-2012, pp 503–508
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by V. Loia.
Rights and permissions
About this article
Cite this article
Trovati, M., Bessis, N. An influence assessment method based on co-occurrence for topologically reduced big data sets. Soft Comput 20, 2021–2030 (2016). https://doi.org/10.1007/s00500-015-1621-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-015-1621-9