Dictionary-Based Problem Phrase Extraction from User Reviews

  • Valery Solovyev
  • Vladimir Ivanov
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8655)


This paper describes a system for problem phrase extraction from texts that contain users’ reviews of products. In contrast to recent works, this system is based on dictionaries and heuristics, not a machine learning algorithms. We explored two approaches to dictionary construction: manual and automatic. We evaluated the system on a dataset constructed using Amazon Mechanical Turk. Performance values are compared to a machine learning baseline.


natural language processing information extraction 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    De Saeger, S., Torisawa, K., Kazama, J.: Looking for Trouble. In: Proceedings of the 22nd International Conference on Computational Linguistics, vol. 1, pp. 185–192. Association for Computational Linguistics (2008)Google Scholar
  2. 2.
    Fleiss, J.L.: Statistical Methods for Rates and Proportions. Wiley Series in Probability and Mathematical Statistics. Applied Probability and Statistics. Wiley, New York (1981)Google Scholar
  3. 3.
    Gupta, N.: Extracting Descriptions of Problems with Products and Services from Twitter Data. In: Proceedings of the Third Workshop on Social Web Search and Mining, Beijing (2011)Google Scholar
  4. 4.
    Gupta, N.: Extracting Phrases Describing Problems with Products and Services from Twitter Messages. Technical report, Conference on Intelligent Text Processing and Computational Linguistics (2013)Google Scholar
  5. 5.
    Hu, M., Liu, B.: Mining and Summarizing Customer Reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2004, pp. 168–177. ACM, New York (2004a)CrossRefGoogle Scholar
  6. 6.
    Hu, M., Liu, B.: Mining Opinion Features in Customer Reviews. In: Proceedings of the 19th National Conference on Artificial Intelligence (2004b)Google Scholar
  7. 7.
    Indurkhya, N., Damerau, F.J.: Handbook of Natural Language Processing, vol. 2. CRC Press (2010)Google Scholar
  8. 8.
    Liu, B., Hu, M., Cheng, J.: Proceedings of the 14th International Conference on World Wide Web, WWW 2005, pp. 342–351. ACM, New York (2005)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Valery Solovyev
    • 1
  • Vladimir Ivanov
    • 1
    • 2
    • 3
  1. 1.Kazan Federal UniversityKazanRussia
  2. 2.National University of Science and Technology “MISIS”MoscowRussia
  3. 3.Institute of Informatics, Tatarstan Academy of SciencesKazanRussia

Personalised recommendations