Soft Computing

, Volume 20, Issue 5, pp 2021–2030 | Cite as

An influence assessment method based on co-occurrence for topologically reduced big data sets

  • Marcello Trovati
  • Nik Bessis
Methodologies and Application


The extraction of meaningful, accurate, and relevant information is at the core of Big Data research. Furthermore, the ability to obtain an insight is essential in any decision-making process, even though the diverse and complex nature of big data sets raises a multitude of challenges. In this paper, we propose a novel method to address the automated assessment of influence among concepts in big data sets. This is carried out by investigating their mutual co-occurrence, which is determined via topologically reducing the corresponding network. The main motivation is to provide a toolbox to classify and analyse influence properties, which can be used to investigate their dynamical and statistical behaviour, potentially leading to a better understanding and prediction of the properties of the system(s) they model. An evaluation was carried out on two real-world data sets, which were analysed to test the capabilities of our system. The results show the potential of our approach, indicating both accuracy and efficiency.


Knowledge discovery Large-scale networks Information extraction Data analytics 


  1. Albert R, Barabási AL (2002) Statistical mechanics of complex networks. Rev Mod Phys 74:47MathSciNetCrossRefzbMATHGoogle Scholar
  2. Aviation Safety Reporting System Database. 1 March 2014
  3. Azar AT, Hassanien AE (2014) Dimensionality reduction of medical big data using neural-fuzzy classifier. Soft Comput. doi: 10.1007/s00500-014-1327-4
  4. Blanco E, Castell N, Moldovan D (2008) Causal relation extraction. In: Proceedings of the 6th international conference on language resources and evaluation (LREC’08), 2008Google Scholar
  5. Bollobás B (1998) Modern graph theory. Graduate texts in mathematics, vol 184, Springer, New YorkGoogle Scholar
  6. Cormen TH, Leiserson CE, Rivest RL (1990) Introduction to algorithms. MIT Press, CambridgezbMATHGoogle Scholar
  7. De Marneffe MF, MacCartney B, Manning CD (2006) Generating typed dependency parses from phrase structure parses. In: Proceedings of LRECGoogle Scholar
  8. Ebel H, Mielsch LI, Bornholdt S (2002) Scale-free topology of E-mail networks. Phys Rev E 66:035103CrossRefGoogle Scholar
  9. Engelhardt-Nowitzki C, Kryvinska N, Strauss C (2011) Strategic demands on information services in uncertain businesses: a layer-based framework from a value network perspective. The 1st international workshop on frontiers in service transformations and innovations (FSTI-2011), in conjunction with EIDWT 2011, 7–9 Sept 2011, Tirana, Albania, pp 131–136Google Scholar
  10. European-Mediterranean Seismological Centre Database. 1 May 2014
  11. Feldman R, Sanger J (2006) The text mining handbook. Cambridge University Press, CambridgeCrossRefGoogle Scholar
  12. Francis WN, Kucera H (1979) The Brown Corpus: a standard corpus of present-day. Edited American English. Providence, RI: Department of Linguistics, Brown University [producer and distributor]Google Scholar
  13. Gupta R, Gupta H, Mohania M (2012) Cloud computing and big data analytics: what is new from databases perspective? Big Data Anal. Lecture notes in computer science. Springer, Berlin Heidelberg, pp 42–61Google Scholar
  14. Hein O, Schwind M, Konig W (2006) Scale-free networks. Wirtschaftsinformatik 48:4CrossRefGoogle Scholar
  15. Liu B (2012) Sentiment analysis and opinion mining. Morgan and Claypool PublishersGoogle Scholar
  16. Molnar E, Kryvinska N, Gregus̆ M (2014) Customer driven big-data analytics for the companies’ servitization. In: Baines T, Clegg B, Harrison D (eds) The spring servitization conference 2014 (SSC 2014), 12–14 May 2014, Aston Business School, Aston University, UK, pp 133–140Google Scholar
  17. Sanchez-Graillet O, Poesio M (2004) Acquiring Bayesian networks from text. In: Proceedings of LREC, European Language Resources AssociationGoogle Scholar
  18. Schaffer J (2001) Causation, influence, and effluence. Analysis 61:1119. doi: 10.1111/1467-8284.00263
  19. Schonbauer R, Sommers P, Misfeld M, Dinov B, Fiedler F, Huo Y, Arya A (2013) Relevant ventricular septal defect caused by steam pop during ablation of premature ventricular contraction. Circulation 127(24):e843–844Google Scholar
  20. Soheilykhah S, Sheikhani A, Sharif AG, Daevaeiha MM (2013) Localization of premature ventricular contraction foci in normal individuals based on multichannel electrocardiogram signals processing. Springerplus 2:486Google Scholar
  21. Trovati M (2015) Reduced topologically real-world networks: a big-data approach. Int J Distrib Syst Technol (in press)Google Scholar
  22. Trovati M, Asimakopoulou E, Bessis N (2014) An analytical tool to map big data to networks with reduced topologies. In: Proceedings of InCoS, pp 411–414Google Scholar
  23. Trovati M, Bagdasar O (2014) Influence discovery in semantic networks: an initial approach. In: Proceedings of UKSimGoogle Scholar
  24. Trovati M, Bessis N, Huber A, Zelenkauskaite A, Asimakopoulou E (2014) Extraction, identification and ranking of network structures from data sets. In: Proceedings of CISIS, pp 331–337Google Scholar
  25. Watts DJ, Strogatz HS (1998) Collective dynamics of small-world networks. Nature 393:440–442CrossRefGoogle Scholar
  26. Weng J, Lim E, Jiang J, He Q (2010) TwitterRank: finding topic-sensitive influential twitterers. In: Proceedings of the 3rd ACM international conference on web search and data mining, pp 261–270Google Scholar
  27. Wren JD (2006) Using fuzzy set theory and scale-free network properties to relate MEDLINE terms. Soft Comput 10(4):374–381Google Scholar
  28. Zelenkauskaite A, Bessis N, Sotiriadis S, Asimakopoulou E (2012) Interconnectedness of complex systems of internet of things through social network analysis for disaster management. In: Proceedings of 4th IEEE INCoS-2012, pp 503–508Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2015

Authors and Affiliations

  1. 1.Department of Computing and MathematicsUniversity of DerbyDerbyUK

Personalised recommendations