Skip to main content

Improving Predictions with an Ensemble of Linguistic Approach and Matrix Factorization

  • Conference paper
Book cover Web Information Systems and Technologies (WEBIST 2015)

Abstract

This paper extends a previous work done by the same authors [1] having the aim of improving the predictions coming from a matrix factorization based on latent factor models through an ensemble with the predictions obtained by an Opinion Mining methodology based on a linguistic approach. The experimental analysis was carried out on the Yelp business dataset, limited to the Restaurant category. An hypothesis of influence of the restaurant average rating on the number of stars given by the users is tested. An analysis of the meaning of some of the latent factors is shown.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 34.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Angioni, M., Clemente, M.L., Tuveri, F.: Combining opinion mining with collaborative filtering. In: Proceedings of WEBIST 2015, 11th International Conference on Web Information Systems and Technologies, Lisbon (2015)

    Google Scholar 

  2. Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1–2), 1–135 (2008). doi:10.1561/1500000011

    Article  Google Scholar 

  3. Ghose, A., Ipeirotis, P.G.: Designing novel review ranking systems: predicting usefulness and impact of reviews. In: International Conference on Electronic Commerce (ICEC) (2007)

    Google Scholar 

  4. Koren, Y., Bell, R., Volinsky, C.: Matrix factorization techniques for recommender systems. Computer 42(8), 42–49 (2009). IEEE Computer Society

    Article  Google Scholar 

  5. Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Item-based collaborative filtering recommendation algorithms. In: Proceedings of IEEE Internet Computing, 10th International World Wide Web Conference (2001)

    Google Scholar 

  6. Hinton, G.E.: A practical guide to training restricted boltzmann machines. In: Montavon, G., Orr, G.B., Müller, K.-R. (eds.) Neural Networks: Tricks of the Trade. LNCS, vol. 7700, 2nd edn, pp. 599–619. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  7. Linden, G., Smith, B., York, J.: Amazon.com recommendations. In: IEEE Internet Computing, vol. 07, no. 1, pp. 76–80 (2003)

    Google Scholar 

  8. Clemente, M.L.: Experimental results on item-based algorithms for independent domain collaborative filtering. In: Proceedings of AXMEDIS 2008, pp. 87–92. IEEE Computer Society (2008)

    Google Scholar 

  9. Tosher, A., Jahrer, M., Bell, R.M.: The BigChaos solution to the Netflix grand prize, Netflix Prize Documentation (2009)

    Google Scholar 

  10. Jahrer, M., Töscher, A., Legenstein, R.: Combining Predictions for Accurate Recommender Systems. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 693–702. ACM (2010)

    Google Scholar 

  11. Koukourikos, A., Stoisis, G., Karampiperis, P.: Sentiment analysis: a tool for rating attribution to content in recommender systems. In: 2nd Workshop on Recommender Systems for Technology Enhances Learning (RecSysTEL 2012). Saarbrucken, Germany (2012)

    Google Scholar 

  12. Quadrana, M.: E-tourism recommender systems (2013). http://hdl.handle.net/10589/84901

  13. Levi, A., Mokryn, O., Diot, C., Taft, N.: Finding a needle in a haystack of reviews: cold start context-based hotel recommender system. In: Proceedings of the Sixth ACM Conference on Recommender Systems, pp. 115–122. ACM (2012)

    Google Scholar 

  14. Wu, Y., Ester, M.: FLAME: a probabilistic model combining aspect based opinion mining and collaborative filtering. In: WSDM 2015, Shanghai, China (2015)

    Google Scholar 

  15. Singh, V.K., Mukherjee, M., Mehta, G.K.: Combining collaborative filtering and sentiment classification for improved movie recommendations. In: Sombattheera, C., Agarwal, A., Udgata, S.K., Lavangnananda, K. (eds.) MIWAI 2011. LNCS, vol. 7080, pp. 38–50. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  16. Huang, J., Rogers, S., Joo, E.: Improving restaurants by extracting subtopics from yelp reviews. SOCIAL MEDIA EXPO (2014). https://www.ideals.illinois.edu/bitstream/handle/2142/48832/Huang-iConference2014-SocialMediaExpo.pdf

  17. Burke, R.: Hybrid recommender systems: survey and experiments. User Model. User-Adap. Inter. 12(3), 331–370 (2002)

    Article  MATH  Google Scholar 

  18. Ganu, G., Kakodkar, Y., Marian, A.: Improving the quality of predictions using textual information in online user reviews. Inf. Syst. (2012). doi:10.1016/j.is.2012.03.001

    Google Scholar 

  19. Ganu, G., Elhadad, N., Marian, A.: Beyond the stars: improving rating predictions using review text content. In: Twelfth International Workshop on the Web and Databases (WebDB 2009), Providence, Rhode Island, USA (2009)

    Google Scholar 

  20. Trevisiol, M., Chiarandini, L., Baeza-Yates, R.: Buon Appetito - Recommending Personalized menus (2014)

    Google Scholar 

  21. Govindarajan, M.: Sentiment analysis of restaurant reviews using hybrid classification method. Int. J. Soft Comput. Artif. Intell. 2, 17–23 (2014)

    Google Scholar 

  22. Schmid, H.: Probabilistic part-of-speech tagging using decision trees. In: Proceedings of the International Conference on New Methods in Language Processing, pp. 44–49 (1994)

    Google Scholar 

  23. Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: LREC 2010, 7th International Conference on Language Resources and Evaluation, Malta, pp. 2200–2204 (2010)

    Google Scholar 

  24. Miller, G.: WordNet: an Electronic Lexical Database. Bradford Books, Cambridge (1998)

    MATH  Google Scholar 

  25. Clark, P.: Yelp’s Newest Weapon Against Fake Reviews: Lawsuits (2013). http://www.businessweek.com/articles/2013-09-09/yelps-newest-weapon-against-fake-reviews-lawsuits

  26. Jong, J.: Predicting Rating with Sentiment Analysis (2011). http://cs229.stanford.edu/proj2011/Jong-%20PredictingRatingwithSentimentAnalysis.pdf

  27. Mingming, F., Khademi, M.: Predicting a Business Star in Yelp from Its Reviews Text Alone. ArXiv e-prints: arXiv:1401.0864 (2014)

  28. Benamara, F., Cesarano, C., Picariello, A., Reforgiato, D., Subrahmanian, V.S.: Sentiment analysis: adjectives and adverbs are better than adjectives alone. In: Proceedings of ICWSM 2007, International Conference on Weblogs and Social Media, pp. 203–206 (2007)

    Google Scholar 

  29. Ding, X., Liu, B., Yu, P.S.: A holistic lexicon-based approach to opinion mining. In: International Conference on Web Search and Web Data Mining, ACM, NY, USA (2008)

    Google Scholar 

  30. Agerri, R., Garcia-Serrano, A.: Q-WordNet: extracting polarity from WordNet senses. In: 7th International Conference on Language Resources and Evaluation (LREC2010), Malta (2010)

    Google Scholar 

  31. Tuveri, F., Angioni, M.: A linguistic approach to feature extraction based on a lexical database of the properties of adjectives and adverbs. In: Global WordNet Conference (GWC 2012), Matsue, Japan (2012)

    Google Scholar 

  32. Owen, S., Anil, R., Dunning, T., Friedman, E.: Mahout in Action. Manning Publications Co., Shelter Island (2011). ISBN 9781935182689

    Google Scholar 

  33. Shelter, S., Owen, S.: Collaborative Filtering with Apache Mahout. In: RecSys Challenge (2012)

    Google Scholar 

  34. Paterek, A.: Improving regularized singular value decomposition for collaborative filtering. In: Proceedings of KDDCup and Workshop, pp. 39–42. ACM Press (2007)

    Google Scholar 

Download references

Acknowledgements

This study is part of a POR FESR 2007-2013 project co-funded by the Autonomous Region of Sardinia: Comunimatica (P.I.A. n. 205 co-funded according to the DGR 39/3 of 10/11/2012).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Manuela Angioni , Maria Laura Clemente or Franco Tuveri .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Angioni, M., Clemente, M.L., Tuveri, F. (2016). Improving Predictions with an Ensemble of Linguistic Approach and Matrix Factorization. In: Monfort, V., Krempels, KH., Majchrzak, T.A., Turk, Ž. (eds) Web Information Systems and Technologies. WEBIST 2015. Lecture Notes in Business Information Processing, vol 246. Springer, Cham. https://doi.org/10.1007/978-3-319-30996-5_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-30996-5_9

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-30995-8

  • Online ISBN: 978-3-319-30996-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics