Skip to main content
Log in

Weighted Similarity Schemes for High Scalability in User-Based Collaborative Filtering

  • Published:
Mobile Networks and Applications Aims and scope Submit manuscript

Abstract

Similarity-based algorithms, often referred to as memory-based collaborative filtering techniques, are one of the most successful methods in recommendation systems. When explicit ratings are available, similarity is usually defined using similarity functions, such as the Pearson correlation coefficient, cosine similarity or mean square difference. These metrics assume similarity is a symmetric criterion. Therefore, two users have equal impact on each other in recommending new items. In this paper, we introduce new weighting schemes that allow us to consider new features in finding similarities between users. These weighting schemes, first, transform symmetric similarity to asymmetric similarity by considering the number of ratings given by users on non-common items. Second, they take into account the habit effects of users are regarded on rating items by measuring the proximity of the number of repetitions for each rate on common rated items. Experiments on two datasets were implemented and compared to other similarity measures. The results show that adding weighted schemes to traditional similarity measures significantly improve the results obtained from traditional similarity measures.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. http://www.grouplens.org

  2. https://www.cse.cuhk.edu.hk/irwin.king/pub/data/Douban

References

  1. Liu H, Hu Z, Mian AU, Tian H, Zhu X (2014) A new user similarity model to improve the accuracy of collaborative filtering. Knowl-Based Syst 56:156–166

    Article  Google Scholar 

  2. Sanchez F, Barrilero M, Uribe S, Alvarez F, Tena A, Menendez JM (2012) Social and content hybrid image recommender system for mobile social networks. Mobile Netw Appl 17(6):782– 795

    Article  Google Scholar 

  3. Ahn HJ, Kang H, Lee J (2010) Selecting a small number of products for effective user profiling in collaborative filtering. Expert Syst Appl 37(4):3055–3062

    Article  Google Scholar 

  4. de Sousa EPM, Traina Caetano J, Traina AJM, Wu L, Faloutsos C (2007) A fast and effective method to find correlations among attributes in databases. Data Min Knowl Disc 14(3):367–407

    Article  Google Scholar 

  5. Pham XH, Jung JJ (2014) Recommendation system based on multilingual entity matching on linked open data. J Intell Fuzzy Syst 27(2):589–599

    Google Scholar 

  6. Jung JJ (2012) Attribute selection-based recommendation framework for short-head user group: An empirical study by MovieLens and IMDB. Expert Syst Appl 39(4):4049–4054

    Article  Google Scholar 

  7. Rodgers JL, Nicewander WA (1988) Thirteen ways to look at the correlation coefficient. Am Stat 42(1):59–66

    Article  Google Scholar 

  8. Adomavicius G, Tuzhilin A (2005) Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans Knowl Data Eng 17(6):734–749

    Article  Google Scholar 

  9. Herlocker JL, Konstan JA, Terveen LG, Riedl J (2004) Evaluating collaborative filtering recommender systems. IEEE Trans Knowl Data Eng 22(1):5–53

    Google Scholar 

  10. Jung JJ (2013) Cross-lingual query expansion in multilingual folksonomies: a case study on Flickr. Knowl-Based Syst 42:60–67

    Article  Google Scholar 

  11. Cai Z, Hu C, Kang Z, Liu Y (2010) An improved similarity algorithm based on hesitation degree for user-based collaborative filtering. In: Proceedings ISICA 2010, 5th international symposium on advances in computation and intelligence. Springer, pp 261–271

  12. Bobadilla J, Serradilla F, Bernal J (2010) A new collaborative filtering metric that improves the behavior of recommender systems. Knowl-Based Syst 23(6):520–528

    Article  Google Scholar 

  13. Choi K, Suh Y (2013) A new similarity function for selecting neighbors for each target item in collaborative filtering. Knowl-Based Syst 37:146–153

    Article  Google Scholar 

  14. Sarwar BM, Karypis G, Konstan J, Riedl J (2002) Incremental singular value decomposition algorithms for highly scalable recommender systems. In: Proceedings of the 5th international conference on computer and information science. Springer, pp 27–28

  15. Koren Y, Bell RM, Volinsky C (2009) Matrix factorization techniques for recommender systems. IEEE Comput 42(8):30–37

    Article  Google Scholar 

  16. Breese JS, Heckerman D, Kadie CM (1998) Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the 14th conference on Uncertainty in artificial intelligence. Morgan Kaufmann Publishers, pp 43–52

  17. Wang J, Pouwelse JA, Fokker J, de Vries AP, Reinders MJT (2008) Personalization on a peer-to-peer television system. Multimed Tools Appl 36(1–2):89–113

    Article  Google Scholar 

  18. Bobadilla J, Ortega F, Hernando A, Bernal J (2012) A collaborative filtering approach to mitigate the new user cold start problem. Knowl-Based Syst 26(1–2):225–238

    Article  Google Scholar 

  19. Jamali M, Ester M (2009) Trustwalker: a random walk model for combining trust-based and item-based recommendation. In: Proceedings ACM SIGKDD (2009), 15th international conference on knowledge discovery and data mining. ACM, pp 397–406

  20. Massa P, Avesani P (2009) Trust metrics in recommender systems. In: Computing with social trust. Springer, pp 259–285

  21. Papagelis M, Plexousakis D, Kutsuras T (2005) Alleviating the sparsity problem of collaborative filtering using trust inferences. In: Proceedings of trust management. Springer, Berlin Heidelberg, pp 224–239

  22. Neal Lathia Stephen Hailes LC (2008) Trust-based collaborative filtering. In: Proceedings of trust management II. Springer, Boston, pp 119–134

  23. Luis M, de Campos Juan M, Fernandez-Luna JFH (2010) Combining content-based and collaborative recommendations: A hybrid approach based on bayesian networks. Int J Approx Reason 51(7):785–799

    Article  Google Scholar 

  24. Shinde SK, Kulkarni UV (2012) Hybrid personalized recommender system using centering-bunching based clustering algorithm. Expert Syst Appl 39(1):1381–1387

    Article  Google Scholar 

  25. Maneeroj S, Takasu A (2009) Hybrid recommender system using latent features. In: Proceedings AINA 2009, 23rd international conference on advanced information networking and applications. IEEE Computer Society, pp 661–666

  26. Candillier L, Meyer F, Fessant F (2008) Designing specific weighted similarity measures to improve collaborative filtering systems. In: Proceedings ICDM (2008), 8th industrial conference on advances in data mining, medical applications, e-commerce, marketing, and theoretical aspects. Springer, pp 242–255

  27. Desrosiers C, Karypis G (2011) A comprehensive survey of neighborhood-based recommendation methods. In: Recommender systems handbook. Springer, pp 107–144

  28. Ricci F, Rokach L, Shapira B (2011) Introduction to recommender systems handbook. In: Recommender systems handbook. Springer, pp 1–35

  29. Schein AI, Popescul A, Ungar LH, Pennock DM (2002) Methods and metrics for cold-start recommendations. In: Proceedings SIGIR (2002) 25th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, pp 253–260

  30. Barragáns-Martínez AB, Costa-Montenegro E, Burguillo-Rial JC, Rey-López M, Mikic-Fonte FA, Peleteiro-Ramallo A (2010) A hybrid content-based and item-based collaborative filtering approach to recommend tv programs enhanced with singular value decomposition. Inf Sci 180(22):4290–4311

    Article  Google Scholar 

  31. Kannan R, Ishteva M, Park H (2014) Bounded matrix factorization for recommender system. Knowl Inf Syst 39(3):491–511

    Article  Google Scholar 

  32. YZhang YC, Blattner M, Yu YK (2007) Heat conduction process on community networks as a recommendation model. PRL 99(15):154–301

    Google Scholar 

  33. Huang Z, Zeng DD (2011) Why does collaborative filtering work? transaction-based recommendation model validation and selection by analyzing bipartite random graphs. Informs J Comput 23(1):138–152

    Article  MathSciNet  MATH  Google Scholar 

  34. Liu JG, Wang BH, Guo Q (2009) Improved collaborative filtering algorithm via information transformation. Int J Mod Phys C 20(02):285–293

    Article  Google Scholar 

  35. Liu JG, Zhou T, Xuan ZG, Che HA, Wang BH, Zhang YC (2010) Degree correlation effect of bipartite network on personalized recommendation. CoRR 21(01):137–147

    MATH  Google Scholar 

  36. Liu JG, Zhou T, Guo Q, Wang BH, Zhang YC (2009) Effect of user tastes on personalized recommendation. CoRR 20(12):1925–1932

    Google Scholar 

  37. Qiu T, Chen G, Zhang ZK, Zhou T (2011) An item-oriented recommendation algorithm on cold-start problem. Europhys Lett 95(5):580003

    Article  Google Scholar 

  38. Liu C, Zhou WX (2012) An improved heats+probs hybrid recommendation algorithm based on heterogeneous initial resource configurations. CoRR 391(22):5704–5711

    Google Scholar 

  39. Baselga A, Jiménez-Valverde A, Niccolini G (2007) A multiple-site similarity measure independent of richness. Biol Lett 3(6):642–645

    Article  Google Scholar 

  40. Massa P, Avesani P (2007) Trust metrics on controversial users: balancing between tyranny of the majority and echo chambers. IJSWIS 3(1):39–64

    Google Scholar 

  41. Sarwar BM, Karypis G, Konstan JA, Riedl J (2001) Item-based collaborative filtering recommendation algorithms. In: Proceedings WWW (2001) 10th international conference on World Wide Web. ACM, pp 285–295

  42. Zhang Bin SH (2005) A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol 4(1)

  43. Francois F, Pirotte A, Renders JM, Saerens M (2007) Random-walk computation of similarities between nodes of a graph with application to collaborative recommendation. IEEE Trans Knowl Data Eng 19(3):355–369

    Article  Google Scholar 

  44. Buckland MK, Gey FC (1994) The relationship between recall and precision. JASIS 45(1):12–19

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (No. NRF-2014R1A2A2A05007154).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jai E. Jung.

Additional information

These authors contributed equally to this work as the first author.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pirasteh, P., Hwang, D. & Jung, J.E. Weighted Similarity Schemes for High Scalability in User-Based Collaborative Filtering. Mobile Netw Appl 20, 497–507 (2015). https://doi.org/10.1007/s11036-014-0544-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11036-014-0544-5

Keywords

Navigation