Skip to main content
Log in

An influence assessment method based on co-occurrence for topologically reduced big data sets

  • Methodologies and Application
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

The extraction of meaningful, accurate, and relevant information is at the core of Big Data research. Furthermore, the ability to obtain an insight is essential in any decision-making process, even though the diverse and complex nature of big data sets raises a multitude of challenges. In this paper, we propose a novel method to address the automated assessment of influence among concepts in big data sets. This is carried out by investigating their mutual co-occurrence, which is determined via topologically reducing the corresponding network. The main motivation is to provide a toolbox to classify and analyse influence properties, which can be used to investigate their dynamical and statistical behaviour, potentially leading to a better understanding and prediction of the properties of the system(s) they model. An evaluation was carried out on two real-world data sets, which were analysed to test the capabilities of our system. The results show the potential of our approach, indicating both accuracy and efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  • Albert R, Barabási AL (2002) Statistical mechanics of complex networks. Rev Mod Phys 74:47

    Article  MathSciNet  MATH  Google Scholar 

  • Aviation Safety Reporting System Database. http://asrs.arc.nasa.gov/search/database.html. 1 March 2014

  • Azar AT, Hassanien AE (2014) Dimensionality reduction of medical big data using neural-fuzzy classifier. Soft Comput. doi:10.1007/s00500-014-1327-4

  • Blanco E, Castell N, Moldovan D (2008) Causal relation extraction. In: Proceedings of the 6th international conference on language resources and evaluation (LREC’08), 2008

  • Bollobás B (1998) Modern graph theory. Graduate texts in mathematics, vol 184, Springer, New York

  • Cormen TH, Leiserson CE, Rivest RL (1990) Introduction to algorithms. MIT Press, Cambridge

    MATH  Google Scholar 

  • De Marneffe MF, MacCartney B, Manning CD (2006) Generating typed dependency parses from phrase structure parses. In: Proceedings of LREC

  • Ebel H, Mielsch LI, Bornholdt S (2002) Scale-free topology of E-mail networks. Phys Rev E 66:035103

    Article  Google Scholar 

  • Engelhardt-Nowitzki C, Kryvinska N, Strauss C (2011) Strategic demands on information services in uncertain businesses: a layer-based framework from a value network perspective. The 1st international workshop on frontiers in service transformations and innovations (FSTI-2011), in conjunction with EIDWT 2011, 7–9 Sept 2011, Tirana, Albania, pp 131–136

  • European-Mediterranean Seismological Centre Database. http://www.emsc-csem.org/. 1 May 2014

  • Feldman R, Sanger J (2006) The text mining handbook. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Francis WN, Kucera H (1979) The Brown Corpus: a standard corpus of present-day. Edited American English. Providence, RI: Department of Linguistics, Brown University [producer and distributor]

  • Gupta R, Gupta H, Mohania M (2012) Cloud computing and big data analytics: what is new from databases perspective? Big Data Anal. Lecture notes in computer science. Springer, Berlin Heidelberg, pp 42–61

  • Hein O, Schwind M, Konig W (2006) Scale-free networks. Wirtschaftsinformatik 48:4

    Article  Google Scholar 

  • Liu B (2012) Sentiment analysis and opinion mining. Morgan and Claypool Publishers

  • Molnar E, Kryvinska N, Gregus̆ M (2014) Customer driven big-data analytics for the companies’ servitization. In: Baines T, Clegg B, Harrison D (eds) The spring servitization conference 2014 (SSC 2014), 12–14 May 2014, Aston Business School, Aston University, UK, pp 133–140

  • Sanchez-Graillet O, Poesio M (2004) Acquiring Bayesian networks from text. In: Proceedings of LREC, European Language Resources Association

  • Schaffer J (2001) Causation, influence, and effluence. Analysis 61:1119. doi:10.1111/1467-8284.00263

  • Schonbauer R, Sommers P, Misfeld M, Dinov B, Fiedler F, Huo Y, Arya A (2013) Relevant ventricular septal defect caused by steam pop during ablation of premature ventricular contraction. Circulation 127(24):e843–844

  • Soheilykhah S, Sheikhani A, Sharif AG, Daevaeiha MM (2013) Localization of premature ventricular contraction foci in normal individuals based on multichannel electrocardiogram signals processing. Springerplus 2:486

  • Trovati M (2015) Reduced topologically real-world networks: a big-data approach. Int J Distrib Syst Technol (in press)

  • Trovati M, Asimakopoulou E, Bessis N (2014) An analytical tool to map big data to networks with reduced topologies. In: Proceedings of InCoS, pp 411–414

  • Trovati M, Bagdasar O (2014) Influence discovery in semantic networks: an initial approach. In: Proceedings of UKSim

  • Trovati M, Bessis N, Huber A, Zelenkauskaite A, Asimakopoulou E (2014) Extraction, identification and ranking of network structures from data sets. In: Proceedings of CISIS, pp 331–337

  • Watts DJ, Strogatz HS (1998) Collective dynamics of small-world networks. Nature 393:440–442

    Article  Google Scholar 

  • Weng J, Lim E, Jiang J, He Q (2010) TwitterRank: finding topic-sensitive influential twitterers. In: Proceedings of the 3rd ACM international conference on web search and data mining, pp 261–270

  • Wren JD (2006) Using fuzzy set theory and scale-free network properties to relate MEDLINE terms. Soft Comput 10(4):374–381

  • Zelenkauskaite A, Bessis N, Sotiriadis S, Asimakopoulou E (2012) Interconnectedness of complex systems of internet of things through social network analysis for disaster management. In: Proceedings of 4th IEEE INCoS-2012, pp 503–508

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nik Bessis.

Additional information

Communicated by V. Loia.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Trovati, M., Bessis, N. An influence assessment method based on co-occurrence for topologically reduced big data sets. Soft Comput 20, 2021–2030 (2016). https://doi.org/10.1007/s00500-015-1621-9

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-015-1621-9

Keywords

Navigation