Advertisement

A roadmap towards implementing parallel aspect level sentiment analysis

  • Syed Muzamil Basha
  • Dharmendra Singh RajputEmail author
Article

Abstract

In the digital era, the importance of extracting the hidden sentiments from user reviews plays a prominent role, to increase the profitability of an organization. The interest among, the research community in Sentiment Analysis (SA) has grown exponentially. But there are enormous challenges still being faced in the field of SA namely: Identification of sarcasm/Irony/Conditional/Modifier statements present in the review, Identification of Aspects and sentiment word as a pair (Data Transformation), Rating the recognized Aspects towards predicting the overall aggregated sentiment, Analyzing and designing issues towards implementing the parallel Aspect Level sentiment. In the present research work, We have addressed each of this challenges using a serial hybridization model, Where, the output of each step, is input to the following stage. First, towards identification sarcasm. In which, the dictionary is updated with the set of sentiment words by manually crafted rules. Next, to mitigate the discovery of sentiment and aspect word pair. In which, Latent Dirichlet Allocation (LDA), Gibbs sampling techniques are used. Next, to present the result of sentiment analysis as the overall rating of data considered, Latent Aspect Rating Regression (LARR) model is proposed (Data Presentation). Finally, addressed the designing issues (deciding numbers of mappers and reducers needed) towards implementing the parallel Aspect Level sentiment Analysis with the objective of improving the resource utilization in Big Data clusters. This work can help the researchers doing research in the field of speech recognition, development of recommended systems. The evaluation Metric used in estimating the performance of each step in our research are F-score, Rand Index, Classification accuracy and Root Mean Absolute Error (RMAE), Throughput. The findings of our research work help the customer to directly use the result obtained from the proposed model in the form of Aspect level rating.

Keywords

Sentiment analysis Aspects Sarcasm F-score Latent Dirichlet allocation Gibbs sampling Rand index Classification accuracy Latent aspect rating regression Root mean absolute error 

Notes

References

  1. 1.
    Bafna K, Toshniwal D (2013) Feature based summarization of customers’ reviews of online products. Procedia Computer Science 22:142–151CrossRefGoogle Scholar
  2. 2.
    Balazs JA, Velásquez JD (2016) Opinion mining and information fusion: a survey. Information Fusion 27:95–110CrossRefGoogle Scholar
  3. 3.
    Bamman D, Smith NA (2015) Contextualized sarcasm detection on twitter. International Conference on Web and Social Media 2:15Google Scholar
  4. 4.
    Basha SM, Rajput DS (2017) Evaluating the impact of feature selection on overall performance of sentiment analysis. In: Proceedings of the 2017 International Conference on Information Technology. ACM, p 96–102Google Scholar
  5. 5.
    Basha SM, Rajput DS (2018) Sentiment analysis: using artificial neural fuzzy inference system. In: Handbook of research on pattern engineering system development for big data analytics. IGI Global, p 130–152Google Scholar
  6. 6.
    Basha SM, Rajput DS, Poluru RK, Bhushan SB, Basha SAK (2018) Evaluating the performance of supervised classification models: decision tree and Naïve Bayes using KNIME. International Journal of Engineering & Technology 7(4.5):248–253CrossRefGoogle Scholar
  7. 7.
    Basha SM, Rajput DS, Vandhan V (2018) Impact of gradient ascent and boosting algorithm in classification. International Journal of Intelligent Engineering and Systems (IJIES) 11(1):41–49CrossRefGoogle Scholar
  8. 8.
    Basha SM, Rajput DS, Iyengar NCS (2019) Conceptual approach to predict loan defaults using decision trees. In: Sentiment analysis and knowledge discovery in contemporary business. IGI Global, p 148–161Google Scholar
  9. 9.
    Bouk SH, Ahmed SH, Kim D, Song H (2017) Named-data-networking-based ITS for smart cities. IEEE Commun Mag 55(1):105–111CrossRefGoogle Scholar
  10. 10.
    Bulkowski BJ, Srinivasan V (2018) Data distribution across nodes of a distributed database base system. United States patent application US 15/488,511Google Scholar
  11. 11.
    Buschmeier K, Cimiano P, Klinger R (2014) An impact analysis of features in a classification approach to irony detection in product reviews. In: Proceedings of the 5th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, p 42–49Google Scholar
  12. 12.
    Chan H (2004) ACE: an emergent algorithm for highly uniform cluster formation. In: European workshop on wireless sensor networks. Berlin, p 154–171Google Scholar
  13. 13.
    Charalampakis B, Spathis D, Kouslis E, Kermanidis K (2016) A comparison between semi-supervised and supervised text mining techniques on detecting irony in Greek political tweets. Eng Appl Artif Intell 51:50–57CrossRefGoogle Scholar
  14. 14.
    Dupuy C, Bach F (2017) Online but accurate inference for latent variable models with local Gibbs sampling. J Mach Learn Res 18(1):4581–4625MathSciNetzbMATHGoogle Scholar
  15. 15.
    Ganesan K, Zhai C (2012) Opinion-based entity ranking. Inf Retr 15(2):116–150CrossRefGoogle Scholar
  16. 16.
    Guo Y, Barnes SJ, Jia Q (2017) Mining meaning from online ratings and reviews: tourist satisfaction analysis using latent Dirichlet allocation. Tour Manag 59:467–483CrossRefGoogle Scholar
  17. 17.
    Hai Z, Cong G, Chang K, Liu W, Cheng P (2014) Coarse-to-fine review selection via supervised joint aspect and sentiment model. In: Proceedings of the 37th International ACM SIGIR conference on Research & development in information retrieval. ACM, p 617–626Google Scholar
  18. 18.
    Hai Z, Chang K, Kim JJ, Yang CC (2014) Identifying features in opinion mining via intrinsic and extrinsic domain relevance. IEEE Trans Knowl Data Eng 26(3):623–634CrossRefGoogle Scholar
  19. 19.
    Hai Z, Cong G, Chang K, Cheng P, Miao C (2017) Analyzing sentiments in one go: a supervised joint topic modeling approach. IEEE Trans Knowl Data Eng 29(6):1172–1185CrossRefGoogle Scholar
  20. 20.
    Hirschberg J, Manning CD (2015) Advances in natural language processing. Science 349(6245):261–266MathSciNetCrossRefGoogle Scholar
  21. 21.
    Hox JJ, Moerbeek M, Van de Schoot R (2017) Multilevel analysis: techniques and applications. RoutledgeGoogle Scholar
  22. 22.
    Huang S, Niu Z, Shi C (2014) Automatic construction of domain-specific sentiment lexicon based on constrained label propagation. Knowl-Based Syst 56:191–200CrossRefGoogle Scholar
  23. 23.
    Hutchins B (2011) The acceleration of media sports culture: twitter, telepresence and online messaging. Inf Commun Soc 14(2):237–257CrossRefGoogle Scholar
  24. 24.
    Ibrahim IA, Bassiouni M (2017) Improving MapReduce performance with progress and feedback based speculative execution. 2017 IEEE International Conference on InSmart Cloud (SmartCloud). IEEE, p 120–125Google Scholar
  25. 25.
    Joshi A, Bhattacharyya P, Carman MJ (2017) Automatic sarcasm detection: a survey. ACM Computing Surveys (CSUR) 50(5):73CrossRefGoogle Scholar
  26. 26.
    Kaplan AM, Haenlein M (2010) Users of the world, unite! The challenges and opportunities of social media. Business horizons 53(1):59–68CrossRefGoogle Scholar
  27. 27.
    Karmiloff-Smith A (2018) Précis of beyond modularity: a developmental perspective on cognitive science. In: Thinking developmentally from constructivism to neuroconstructivism. Routledge, p 64–94Google Scholar
  28. 28.
    Kim Y, Shin H (2017) Finding sentiment dimension in vector space of movie reviews: an unsupervised approach. Journal of Cognitive Science 18(1):85–101MathSciNetCrossRefGoogle Scholar
  29. 29.
    Lau RY, Zhang W, Xu W (2018) Parallel aspect-oriented sentiment analysis for sales forecasting with big data. Prod Oper Manag 27(10):1775–1794Google Scholar
  30. 30.
    Liao X, Qin Z, Ding L (2017) Data embedding in digital images using critical functions. Signal Process Image Commun 58:146–156CrossRefGoogle Scholar
  31. 31.
    Liao X, Guo S, Yin J, Wang H, Li X, Sangaiah AK (2017) New cubic reference table based image steganography. Multimed Tools Appl 1–18Google Scholar
  32. 32.
    Liu Z, Wang Y, Dontcheva M, Hoffman M, Walker S, Wilson A (2017) Patterns and sequences: interactive exploration of clickstreams to understand common visitor paths. IEEE Trans Vis Comput Graph 23(1):321–330CrossRefGoogle Scholar
  33. 33.
    Mathwick C, Mosteller J (2017) Online reviewer engagement: a typology based on reviewer motivations. J Serv Res 20(2):204–218CrossRefGoogle Scholar
  34. 34.
    McGranahan N, Swanton C (2017) Clonal heterogeneity and tumor evolution: past, present, and the future. Cell 168(4):613–628CrossRefGoogle Scholar
  35. 35.
    Miura Y, Sakaki S, Hattori K, Ohkuma T (2014) TeamX: a sentiment analyzer with enhanced lexicon mapping and weighting scheme for unbalanced data. In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), p 628–632Google Scholar
  36. 36.
    Morad TY (2006) Performance, power efficiency and scalability of asymmetric cluster chip multiprocessors. IEEE Comput Archit Lett 5(1):14–17CrossRefGoogle Scholar
  37. 37.
    Papadimitriou D, Koutrika G, Velegrakis Y, Mylopoulos J (2017) Finding related forum posts through content similarity over intention-based segmentation. IEEE Trans Knowl Data Eng 29(9):1860–1873CrossRefGoogle Scholar
  38. 38.
    Rajadesingan A, Zafarani R, Liu H (2015) Sarcasm detection on twitter: a behavioral modeling approach. In: Proceedings of the Eighth ACM International Conference on Web Search and Data Mining. ACM, p 97–106Google Scholar
  39. 39.
    Reyes A, Rosso P, Veale T (2013) A multidimensional approach for detecting irony in twitter. Lang Resour Eval 47(1):239–268CrossRefGoogle Scholar
  40. 40.
    Saif H, He Y, Fernandez M, Alani H (2016) Contextual semantics for sentiment analysis of twitter. Inf Process Manag 52(1):5–19CrossRefGoogle Scholar
  41. 41.
    Schouten K, Frasincar F (2016) Survey on aspect-level sentiment analysis. IEEE Trans Knowl Data Eng 1:1–1Google Scholar
  42. 42.
    Smith AN, Fischer E, Yongjian C (2012) How does brand-related user-generated content differ across YouTube, Facebook, and twitter? J Interact Mark 26(2):102–113CrossRefGoogle Scholar
  43. 43.
    Soleymani M, Garcia D, Jou B, Schuller B, Chang SF (2017) Pantic M. a survey of multimodal sentiment analysis. Image Vis Comput 65:3–14CrossRefGoogle Scholar
  44. 44.
    Tang J, Wu W, Qin X, Feng Y (2016) Structural analysis methods for differential algebraic equations via fixed-point iteration. J Comput Theor Nanosci 13(10):7705–7711CrossRefGoogle Scholar
  45. 45.
    Tzoreff E, Weiss AJ (2017) Expectation-maximization algorithm for direct position determination. Signal Process 133:32–39CrossRefGoogle Scholar
  46. 46.
    Venkatesh S (1997) A steady-state throughput analysis of cluster tools: dual-blade versus single-blade robots. IEEE Trans Semicond Manuf 10(4):418–424CrossRefGoogle Scholar
  47. 47.
    Wang P, Xu B, Xu J, Tian G, Liu CL, Hao H (2016) Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification. Neurocomputing 174:806–814CrossRefGoogle Scholar
  48. 48.
    You L, Tunçer B (2016) Exploring public sentiments for livable places based on a crowd-calibrated sentiment analysis mechanism. In: 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). IEEE, p 693–700Google Scholar
  49. 49.
    You L, Tunçer B (2016) Exploring the utilization of places through a scalable “Activities in Places” analysis mechanism. 2016 IEEE International Conference on In Big Data (Big Data). IEEE, p 3563–3572Google Scholar
  50. 50.
    You L, Tunçer B, Xing H (2018) Harnessing multi-source data about public sentiments and activities for informed design. IEEE Trans Knowl Data EngGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2019

Authors and Affiliations

  1. 1.School of Computer Science and EngineeringVellore Institute of TechnologyVelloreIndia
  2. 2.School of Information Technology and EngineeringVellore Institute of TechnologyVelloreIndia

Personalised recommendations