Unsupervised Ontology- and Sentiment-Aware Review Summarization

  • Nhat X. T. LeEmail author
  • Neal Young
  • Vagelis Hristidis
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11881)


In this Web 2.0 era, there is an ever increasing number of customer reviews, which must be summarized to help consumers effortlessly make informed decisions. Previous work on reviews summarization has simplified the problem by assuming that aspects (e.g., “display”) are independent of each other and that the opinion for each aspect in a review is Boolean: positive or negative. However, in reality aspects may be interrelated – e.g., “display” and “display color” – and the sentiment takes values in a continuous range – e.g., somewhat vs very positive. We present a novel, unsupervised review summarization framework that advances the state-of-the-art by leveraging a domain hierarchy of concepts to handle the semantic overlap among the aspects, and by accounting for different sentiment levels. We show that the problem is NP-hard and present bounded approximate algorithms to compute the most representative set of sentences or reviews, based on a principled opinion coverage framework. We experimentally evaluate the proposed algorithms on real datasets in terms of their efficiency and effectiveness compared to the optimal algorithms. We also show that our methods generate summaries of superior quality than several baselines in short execution times.


Review summarization Unsupervised extractive summarization Online customer review Aspect based sentiment analysis 



This work was partially supported by NSF grants IIS-1838222, IIS-1619463, IIS-1901379 and IIS-1447826


  1. 1.
    Aronson, A.R.: Effective mapping of biomedical text to the UMLS metathesaurus: the metamap program. In: AMIA (2001)Google Scholar
  2. 2.
    Belica, M.: Sumy: Module for automatic summarization of text documents (2017)Google Scholar
  3. 3.
    Blair-Goldensohn, S., Hannan, K., McDonald, R., Neylon, T., Reis, G.A., Reynar, J.: Building a sentiment summarizer for local service reviews. In: WWW Workshop on NLP in the Information Explosion Era (2008)Google Scholar
  4. 4.
    Chrobak, M., Kenyon, C., Noga, J., Young, N.E.: Oblivious medians via online bidding. In: Correa, J.R., Hevia, A., Kiwi, M. (eds.) LATIN 2006. LNCS, vol. 3887, pp. 311–322. Springer, Heidelberg (2006). Scholar
  5. 5.
    Ding, X., Liu, B., Yu, P.S.: A holistic lexicon-based approach to opinion mining. In: WSDM 2008. ACM (2008)Google Scholar
  6. 6.
    Erkan, G., Radev, D.R.: LexRank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22, 457–479 (2004)CrossRefGoogle Scholar
  7. 7.
    Goldstein, J., Mittal, V., Carbonell, J., Kantrowitz, M.: Multi-document summarization by sentence extraction. In: NAACL-ANLP (2000)Google Scholar
  8. 8.
    Gurobi Optimization: Gurobi optimizer manual (2015).
  9. 9.
    Hu, M., Liu, B.: Mining and summarizing customer reviews. In: KDD (2004)Google Scholar
  10. 10.
    International: SNOMED CT (2016).
  11. 11.
    Jadhav, A., Rajan, V.: Extractive summarization with swap-net: sentences and words from alternating pointer networks. In: ACL (2018)Google Scholar
  12. 12.
    Kim, S., Zhang, J., Chen, Z., Oh, A.H., Liu, S.: A hierarchical aspect-sentiment model for online reviews. In: AAAI (2013)Google Scholar
  13. 13.
    Lappas, T., Crovella, M., Terzi, E.: Selecting a characteristic set of reviews. In: KDD (2012)Google Scholar
  14. 14.
    Le, N.X., Hristidis, V., Young, N.: Ontology- and sentiment-aware review summarization. In: ICDE, pp. 171–174 (2017)Google Scholar
  15. 15.
    Le, Q.V., Mikolov, T.: Distributed representations of sentences and documents. In: ICML (2014)Google Scholar
  16. 16.
    Liu, L., Lu, Y., Yang, M., Qu, Q., Zhu, J., Li, H.: Generative adversarial network for abstractive text summarization. In: AAAI (2017)Google Scholar
  17. 17.
    Lu, Y., Zhai, C., Sundaresan, N.: Rated aspect summarization of short comments. In: World Wide Web 2009 (2009)Google Scholar
  18. 18.
    Mihalcea, R., Tarau, P.: TextRank: bringing order into text. In: EMNLP (2004)Google Scholar
  19. 19.
    Miller, G.A.: WordNet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)CrossRefGoogle Scholar
  20. 20.
    Mukherjee, A., Liu, B.: Aspect extraction through semi-supervised modeling. In: 50th Annual Meeting of ACL (2012)Google Scholar
  21. 21.
    Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: Sentiment classification using machine learning techniques. In: ACL-2002 Conference on EMNLP (2002)Google Scholar
  22. 22.
    Qiu, G., Liu, B., Bu, J., Chen, C.: Opinion word expansion and target extraction through double propagation. Comput. Linguist. 37(1), 9–27 (2011)CrossRefGoogle Scholar
  23. 23.
    Speer, R., Havasi, C.: Representing general relational knowledge in conceptNet 5. In: LREC (2012)Google Scholar
  24. 24.
    Steinberger, J., Jezek, K.: Using latent semantic analysis in text summarization and summary evaluation. In: Proceedings of ISIM 2004 (2004)Google Scholar
  25. 25.
    Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011)CrossRefGoogle Scholar
  26. 26.
    Wolsey, L.A.: An analysis of the greedy algorithm for the submodular set covering problem. Combinatorica 2(4), 385–393 (1982)MathSciNetCrossRefGoogle Scholar
  27. 27.
    Young, N.E.: K-medians, facility location, and the Chernoff-Wald bound. arXiv preprint arXiv:cs/0205047 (2002)

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.Department of Computer Science and EngineeringUniversity of CaliforniaRiversideUSA

Personalised recommendations