Skip to main content
Log in

Noise–Robust Sampling for Collaborative Metric Learning

  • Article
  • Published:
The Review of Socionetwork Strategies Aims and scope Submit manuscript

Abstract

Abundant research has been conducted recently on recommendation systems. A recommendation system is a subfield of information retrieval and machine learning that aims to identify the value of objects and information for everyone. For example, recommendation systems on web services learn the latent preferences of users based on their behavioral history and display content that the system considers as s user favorite. Specifically, recommendation systems for web services have two main requirements: (1) use the embedding of users and items for predictions and (2) use implicit feedback data that does not require users’ active actions when learning. Recently, a method called collaborative metric learning (CML) has been developed to satisfy the first requirement. However, this method does not address noisy label issues caused by implicit feedback data in the second requirement. This study proposes a comprehensive and effective method to deal with noise in CML. The experimental results show that the proposed method significantly improves the performance of the two requirements compared with existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. https://colab.research.google.com/drive/1_4hegtR2BORSN8a9PjLTfTsHdiN55Z4p?usp=sharing.

  2. https://webscope.sandbox.yahoo.com/catalog.php?datatype=r.

  3. https://www.cs.cornell.edu/~schnabts/mnar/.

References

  1. Wieschollek, P., Wang, O., Sorkine-Hornung, A., & Lensch, H. (2016). Efficient large-scale approximate nearest neighbor search on the GPU. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, 2027–2035.

    Google Scholar 

  2. Huang, J.-T., Sharma, A., Sun, S., Xia, L., Zhang, D., Pronin, P., Padmanabhan, J., Ottaviano, G., Yang, L. (2020). Embedding-based retrieval in facebook search. In: KDD ’20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 2553–2561.

  3. Grbovic, M., & Cheng, H. (2018). Real-time personalization using embeddings for search ranking at airbnb. In: KDD ’18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 311–320.

  4. Guo, D., Xu, J., Zhang, J., Xu, M., Cui, Y., & He, X. (2017). User relationship strength modeling for friend recommendation on instagram. Neurocomputing, 239, 9–18.

    Article  Google Scholar 

  5. Brovman, Y. M., Jacob, M., Srinivasan, N., Neola, S., Galron, D., Snyder, R., & Wang, P. (2016). Optimizing similar item recommendations in a semi-structured marketplace to maximize conversion. In: RecSys ’16: Proceedings of the 10th ACM Conference on Recommender Systems, pp. 199–202.

  6. Hsieh, C.-K., Yang, L., Cui, Y., Lin, T.-Y., Belongie, S., & Estrin, D. (2017). Collaborative metric learning. In: WWW ’17 : Proceedings of the 26th International Conference on World Wide Web, pp. 193–201.

  7. Yu, L., Zhang, C., Pei, S., Sun, G., & Zhang, X. (2018). Walkranker: A unified pairwise ranking model with multiple relations for item recommendation. In: Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, (AAAI-18), pp. 2596–2603.

  8. Chen, C.-M., Wang, C.-J., Tsai, M.-F., & Yang, Y.-H.(2019). Collaborative similarity embedding for recommender systems. In: WWW ’19: The World Wide Web Conference, pp. 2637–2643.

  9. Campo, M., Espinoza, J., Rieger, J., & Taliyan, A. (2018). Collaborative metric learning recommendation system: Application to theatrical movie releases. arXiv preprint arXiv:1803.00202.

  10. Tay, Y., Tuan, L. A., & Hui, S. C. (2018). Latent relational metric learning via memory-based attention for collaborative ranking. In: WWW ’18: Proceedings of the 2018 World Wide Web Conference, pp. 729–739.

  11. Zhou, X., Liu, D., Lian, J., & Xie, X. (2019). Collaborative metric learning with memory network for multi-relational recommender systems. In: IJCAI’19: Proceedings of the 28th International Joint Conference on Artificial Intelligence, pp. 4454–4460.

  12. Park, C., Kim, D., Xie, X., & Yu, H. (2018). Collaborative translational metric learning. IEEE International Conference on Data Mining (ICDM), 2018, 367–376.

    Article  Google Scholar 

  13. Vinh T. L., Tay, Y., Zhang, S., Cong, G., & Li, X. (2020). Hyperml: A boosting metric learning approach in hyperbolic space for recommender systems. In: WSDM ’20: Proceedings of the 13th International Conference on Web Search and Data Mining, pp. 609–617.

  14. Tran, V.-A., Salha-Galvan, G., Hennequin, R., & Moussallam, M. (2021). Hierarchical latent relation modeling for collaborative metric learning. arXiv preprint arXiv:2108.04655.

  15. Saito, Y. (2020). Unbiased pairwise learning from biased implicit feedback. In: ICTIR’20 : Proceedings of the 2020 ACM SIGIR International Conference on Theory of Information Retrieval, pp. 5–12.

  16. He, X., Liao, L., Zhang, H., Nie, L., Hu, X., & Chua, T.-S. (2017). Neural collaborative filtering. In: WWW ’17 : Proceedings of the 26th International Conference on World Wide Web, pp. 173–182.

  17. Rendle, S., Freudenthaler, C., Gantner, Z., & Schmidt-Thieme, L. (2009). BPR: Bayesian personalized ranking from implicit feedback. In: UAI ’09: Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, pp. 452–461.

  18. Covington, P., Adams, J., & Sargin, E. (2016). Deep neural networks for youtube recommendations. In: RecSys ’16: Proceedings of the 10th ACM Conference on Recommender Systems, pp. 191–198.

  19. Chen, J., Zhang, H., He, X., Nie, L., Liu, W., & Chua, T.-S. (2017).Attentive collaborative filtering: Multimedia recommendation with item- and component-level attention. In: SIGIR ’17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 335–344.

  20. Saito, Y., Yaginuma, S., Nishino, Y., Sakata, H., & Nakata, K. (2020). Unbiased recommender learning from missing-not-at-random implicit feedback. In: WSDM ’20: Proceedings of the 13th International Conference on Web Search and Data Mining, pp. 501–509.

  21. Hu, Y., Koren, Y., & Volinsky, C. (2008). Collaborative filtering for implicit feedback datasets. In: ICDM ’08: Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, pp. 263–272.

  22. Lu, H., Zhang, M., & Ma, S. (2018). Between clicks and satisfaction: Study on multi-phase user preferences and satisfaction for online news reading. In: SIGIR ’18: The 41st International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 435–444.

  23. Wang, W., Feng, F., He, X., Nie, L., & Chua, T.-S. (2021). Denoising implicit feedback for recommendation. In: WSDM ’21: Proceedings of the 14th ACM International Conference on Web Search and Data Mining, pp. 373–381.

  24. Wang, W., Feng, F., He, X., Zhang, H., & Chua, T.-S. (2021). Clicks can be cheating: Counterfactual recommendation for mitigating clickbait issue. In: SIGIR ’21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

  25. Yu, W., & Qin, Z. (2020). Sampler design for implicit feedback data by noisy-label robust learning. In: SIGIR ’20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 861–870.

  26. Wang, N., Qin, Z., Wang, X., & Wang, H. (2021). Non-clicks mean irrelevant? Propensity ratio scoring as a correction. In: WSDM ’21: Proceedings of the 14th ACM International Conference on Web Search and Data Mining, pp. 481–489.

  27. Tran, V.-A., Hennequin, R., Royo-Letelier, J., & Moussallam, M. (2019). Improving collaborative metric learning with efficient negative sampling. In: SIGIR’19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1201–1204.

  28. Northcutt, C. G., Jiang, L., & Chuang, I. L. (2021). Confident learning: Estimating uncertainty in dataset labels. Journal of Artificial Intelligence Research, 70, 1373–1411.

    Article  Google Scholar 

  29. Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed representations of words and phrases and their compositionality. Proceedings of the 26th International Conference on Neural Information Processing Systems, 26, 3111–3119.

    Google Scholar 

  30. Barkan, O., & Koenigstein, N. (2016). Item2vec: Neural item embedding for collaborative filtering. IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP), 2016, 1–6.

    Google Scholar 

  31. Zhang, X., Yu, F. X., Kumar, S., & Chang, S.-F. (2017). Learning spread-out local feature descriptors. IEEE International Conference on Computer Vision (ICCV), 2017, 4605–4613.

    Article  Google Scholar 

  32. Johnson, C. C. (2014). Logistic matrix factorization for implicit feedback data. In: Advances in neural information processing systems 27 : 28th Annual Conference on Neural Information Processing Systems 2014 (NIPS), pp. 1–9.

  33. Rendle, S. (2010). Factorization machines. In: ICDM ’10: Proceedings of the 2010 IEEE International Conference on Data Mining, pp. 995–1000.

  34. Kingma, D. P., & Ba, J. L. (2015). Adam: A method for stochastic optimization. In: 3rd International Conference on Learning Representations, ICLR 2015, Conference Track Proceedings.

  35. Van der Maaten, L., & Hinton, G. (2008). Visualizing data using t-sne. Journal of Machine Learning Research, 9(11), 2579–2605.

    Google Scholar 

  36. Vapnik, V. N. (1999). An overview of statistical learning theory. IEEE Transactions on Neural Networks, 10(5), 988–999.

    Article  Google Scholar 

  37. Lee, J.-W., Park, S., & Lee, J. (2021). Dual unbiased recommender learning for implicit feedback. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1647–1651.

  38. Yu, J., Zhu, H., Chang, C.-Y., Feng, X., Yuan, B., He, X., & Dong, Z. (2020). Influence function for unbiased recommendation. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1929–1932.

  39. Wang, Y., Liang, D., Charlin, L., & Blei, D. M. (2020). Causal inference for recommender systems. In: Fourteenth ACM Conference on Recommender Systems, pp. 426–431.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ryo Matsui.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Matsui, R., Yaginuma, S., Naito, T. et al. Noise–Robust Sampling for Collaborative Metric Learning. Rev Socionetwork Strat 16, 307–332 (2022). https://doi.org/10.1007/s12626-022-00131-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12626-022-00131-x

Keywords

Mathematics Subject Classification

Navigation