Advertisement

A Statistical-Semantic PSO Model for Customer Reviews-Based Question Answering Systems

  • Garima Dwivedi
  • Manju Venugopalan
  • Deepa Gupta
Conference paper
  • 17 Downloads
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1118)

Abstract

Often customers refer to product specifications to know if a product fulfills their requirement, but in case they do not find answer to their queries they turn to the online reviews. A knowledge-based system tries to answer user queries using product specifications alone, but specifications available can be insufficient. An idea is introduced of combining question answer and review datasets to find the most relevant reviews corresponding to the question using semantic and statistical similarity measures and optimizing their weights. This is accomplished by using a large volume of already answered queries to find weights of four different similarity metrics (Cosine, BM25, WordNet and Word Embedding) used to find similarity between question and reviews. Similarity measure weights are optimized using PSO-based weight optimization technique where the fitness function is evaluated in terms of how best the sentiment extracted from top reviews agrees with answer of question in Q/A dataset. Results achieved surpass baseline in three out of four domains.

Keywords

Question answering Similarity measures PSO Word embedding WordNet 

References

  1. 1.
    Larsen, B., Aone, C.: Fast and effective text mining using linear-time document clustering. In: Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM (1999)Google Scholar
  2. 2.
    Venugopalan, M., Gupta, D.: Exploring sentiment analysis on twitter data. In: 2015 Eighth International Conference on Contemporary Computing (IC3). IEEE (2015)Google Scholar
  3. 3.
    Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends® Inf. Retr. 2(1–2), 1–135 (2008)Google Scholar
  4. 4.
    Venugopalan, M., Gupta, D.: An enhanced polarity lexicon by learning-based method using related domain knowledge. Int. J. Inf. Process. Manag. 6(2), 61 (2015)Google Scholar
  5. 5.
    Vishnu, K.S., Apoorva, T., Gupta, D.: Learning domain-specific and domain-independent opinion oriented lexicons using multiple domain knowledge. In: 2014 Seventh International Conference on Contemporary Computing (IC3). IEEE (2014)Google Scholar
  6. 6.
    Sarawgi, K., Pathak, V.: Opinion mining: aspect level sentiment analysis using SentiWordNet and amazon web services. Int. J. Comput. Appl. 158(6), 31–36 (2017).  https://doi.org/10.5120/ijca2017912830CrossRefGoogle Scholar
  7. 7.
    Daelemans, W., du Plessis, T., Snyman, C., Teck, L. (eds.) Text mining with information extraction. In: Proceedings of the 4th International MIDP Colloquium, September 2003, pp. 141–160Google Scholar
  8. 8.
    Chahal, P., et al.: Ranking of web documents using semantic similarity. In: 2013 International Conference on Information Systems and Computer Networks (2013).  https://doi.org/10.1109/iciscon.2013.6524191
  9. 9.
    Botev, V., et al.: Word importance-based similarity of documents metric (WISDM). In: Proceedings of the 6th International Workshop on Mining Scientific Publications—WOSP 2017, (2017).  https://doi.org/10.1145/3127526.3127530
  10. 10.
    Kumar Gupta D., Srikanth R.K., Shweta, E.A.: PSOASent: feature selection using particle swarm optimization for aspect based sentiment analysis. In: Biemann, C., Handschuh, S., Freitas, A., Meziane, F., Métais, E. (eds.) Natural Language Processing and Information Systems. NLDB 2015. Lecture Notes in Computer Science, vol. 9103. Springer, Cham (2015)Google Scholar
  11. 11.
    Akhtar, MS., et al.: Feature Selection and ensemble construction: a two-step method for aspect based sentiment analysis. Knowl.-Based Syst. 125, 116–135 (2017)  https://doi.org/10.1016/j.knosys.2017.03.020
  12. 12.
    Shi, Y., Eberhart, R. (n.d.). A modified particle swarm optimizer. In: 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No. 98TH8360).  https://doi.org/10.1109/icec.1998.699146
  13. 13.
    Davis, L. (ed.): Handbook of Genetic Algorithms. Van Nostrand Reinhold, New York, NY (1991)Google Scholar
  14. 14.
    Al-Chalabi, H., Ray, S., Shaalan, K.: Semantic based query expansion for arabic question answering systems. In: 2015 First International Conference on Arabic Computational Linguistics (ACLing) (2015).  https://doi.org/10.1109/acling.2015.25
  15. 15.
    Sneiders, E.: Automated FAQ answering with question-specific knowledge representation for web self-service. In: 2009 2nd Conference on Human System Interactions (2009).  https://doi.org/10.1109/hsi.2009.5090996
  16. 16.
    Moghaddam, S., Ester, M.: AQA: Aspect-based Opinion Question Answering. In: 2011 IEEE 11th International Conference on Data Mining Workshops (2011).  https://doi.org/10.1109/icdmw.2011.34
  17. 17.
    Mcauley, J., Yang, A.: Addressing complex and subjective product-related queries with customer reviews. In: Proceedings of the 25th International Conference on World Wide Web-WWW ‘16, (2016).  https://doi.org/10.1145/2872427.2883044
  18. 18.
    Wan, M., Mcauley, J.: Modeling ambiguity, subjectivity, and diverging viewpoints in opinion question answering systems. In: 2016 IEEE 16th International Conference on Data Mining (ICDM) (2016).  https://doi.org/10.1109/icdm.2016.0060
  19. 19.
    Lahitani, A.R., Permanasari, A.E., Setiawan, N.A.: Cosine similarity to determine similarity measure: study case in online essay assessment. In: 2016 4th International Conference on Cyber and IT Service Management (2016).  https://doi.org/10.1109/citsm.2016.7577578
  20. 20.
    Hong-Minh, T., Smith, D.: Word similarity in WordNet. In: Modeling, Simulation and Optimization of Complex Processes, pp. 293–302 (2008).  https://doi.org/10.1007/978-3-540-79409-7_19
  21. 21.
    Xiang, L., Yu, J., Yang, C., Zeng, D., Shen, X.: A Word-Embedding-Based Steganalysis Method For Linguistic Steganography Via Synonym Substitution. IEEE Access 6, 64131–64141 (2018).  https://doi.org/10.1109/access.2018.2878273CrossRefGoogle Scholar
  22. 22.
    Lv, Y., Zhai, C.: Lower-bounding term frequency normalization. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management—CIKM 11, (2011)  https://doi.org/10.1145/2063576.2063584
  23. 23.
    Eberhart, Y.S.: Particle Swarm optimization: developments, applications and resources. Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546),  https://doi.org/10.1109/cec.2001.934374

Copyright information

© Springer Nature Singapore Pte Ltd. 2020

Authors and Affiliations

  • Garima Dwivedi
    • 1
  • Manju Venugopalan
    • 1
  • Deepa Gupta
    • 1
  1. 1.Department of Computer Science and EngineeringAmrita School of Engineering, Amrita Vishwa VidyapeethamBengaluruIndia

Personalised recommendations