Abstract
Collaborative filtering systems predict a user's interest in new items based on the recommendations of other people with similar interests. Instead of performing content indexing or content analysis, collaborative filtering systems rely entirely on interest ratings from members of a participating community. Since predictions are based on human ratings, collaborative filtering systems have the potential to provide filtering based on complex attributes, such as quality, taste, or aesthetics. Many implementations of collaborative filtering apply some variation of the neighborhood-based prediction algorithm. Many variations of similarity metrics, weighting approaches, combination measures, and rating normalization have appeared in each implementation. For these parameters and others, there is no consensus as to which choice of technique is most appropriate for what situations, nor how significant an effect on accuracy each parameter has. Consequently, every person implementing a collaborative filtering system must make hard design choices with little guidance. This article provides a set of recommendations to guide design of neighborhood-based prediction systems, based on the results of an empirical study. We apply an analysis framework that divides the neighborhood-based prediction approach into three components and then examines variants of the key parameters in each component. The three components identified are similarity computation, neighbor selection, and rating combination.
Article PDF
Similar content being viewed by others
References
Aggarwal CC, Wolf JL, Wu K-L and Yu PS (1999) Horting hatches an Egg: A new graph-theoretic approach to collaborative filtering. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery & Data Mining.
Basu C, Hirsh H and Cohen WW (1998) Recommendation as classification: Using social and content-based information in recommendation. In: Rich C and Mostow J, eds., Proceedings of the Fifteenth National Conference on Artificial Intelligence (AAAI-98). AAAI Press, Menlo Park, CA, pp. 714–720.
Billsus D and Pazzani MJ (1998) Learning collaborative information filters. In: Rich C and Mostow J, eds., Proceedings of the Fifteenth National Conference on Artificial Intelligence (AAAI-98). AAAI Press, Menlo Park, CA, pp. 46–53.
Breese JS, Heckerman D and Kadie C (1998) Empirical analysis of predictive algorithms for collaborative filtering.In: Cooper GF and Moral S, eds., Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98). Morgan Kaufmann, San Francisco, pp. 43–52.
Condliff MK, Lewis DD, Madigan D and Posse C (1999) Bayesian mixed-effect models for recommender systems.In: 1999 SIGIR Workshop on Recommender Systems.
Delgado J and Ishii N (1999) Memory-based weighted majority prediction for recommender systems. In: 1999 SIGIR Workshop on Recommender Systems.
Goldberg K, Roeder T, Guptra D and Perkins C(2001) Eigentaste:Aconstant-time collaborative filtering algorithm. Information Retrieval, 4(2):133–151.
Good N, Schafer JB, Konstan JA, Borchers A, Sarwar BM, Herlocker JL and Riedl J (1999) Combining collaborative filtering with personal agents for better recommendations. In: Hendler J and Subramanian D, eds., Proceedings of the Sixteenth National Conference on Artificial Intelligence (AAAI-99). AAAI Press, Menlo Park, pp. 439–446.
Herlocker JL (2000) Understanding and improving automated collaborative filtering systems. Ph.D. Dissertation.University of Minnesota, 9-30-2000.
Herlocker JL, Konstan JA, Borchers A and Riedl J (1999) An algorithmic framework for performing collaborative filtering. In: Hearst MA, Gey F and Tong R, eds., Proceedings of the 22nd International Conference on Research and Development in Information Retrieval (SIGIR ‘99). ACM Press, New York, pp. 230–237.
Hill W, Stead L, Rosenstein M and Furnas GW (1995) Recommending and evaluating choices in a virtual community of use. In: Proceedings of ACM CHI'95 Conference on Human Factors in Computing Systems. ACM Press, New York, pp. 194–201.
Hofmann T and Puzicha J (1999) Latent class models for collaborative filtering. In: Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, pp. 688–693.
Pennock DM, Horvitz E and Giles CL (2000a) Social choice theory and recommender systems: Analysis of the axiomatic foundations of collaborative filtering. In: Kautz H and Porter B, eds., Proceedings of the Seventeenth National Conference on Artificial Intelligence (AAAI-2000). AAAI Press, Menlo Park, CA, pp. 729–734.
Pennock DM, Horvitz E, Lawrence S and Giles CL (2000b) Collaborative filtering by personality diagnosis: A hybrid memory-and model-based approach. In: Proceedings of the Sixteenth Annual Conference on Uncertainty in Artificial Intelligence (UAI-2000). Morgan Kaufmann, San Francisco, pp. 473–480.
Press WH, Flannery BP, Teukolsky SA and Yan T (1986) Numerical Recipes: The Art of Scientific Computing.Cambridge University Press, New York, NY.
Resnick P, Iacovou N, Suchak M, Bergstrom P and Riedl J (1994) GroupLens:Anopen architecture for collaborative filtering of netnews. In: Furuta R and Neuwirth C, eds., Proceedings of the 1994 Conference on Computer Supported Collaborative Work. ACM Press, New York, pp. 175–186.
Sarwar BM, Karypis G, Konstan JA and Riedl J (2000a) Analysis of recommendation algorithms for e-Commerce.In: Proceedings of the 2nd ACM Conference on Electronic Commerce (EC'00). ACM Press, New York, pp. 285–295.
Sarwar BM, Karypis G, Konstan JA and Riedl J (2000b) Application of dimensionality reduction in recommender system-A case study. In: ACM WebKDD 2000 Web Mining for E-Commerce Workshop.
Sarwar BM, Konstan JA, Borchers A, Herlocker JL, Miller BN, and Riedl J (1998) Using filtering agents to improve prediction quality in the grouplens research collaborative filtering system. In: Proceedings of the ACM 1998 Conference on Computer Supported Cooperative Work (CSCW ‘98). ACM Press, New York.
Shardanand U and Maes P (1995) Social information filtering: Algorithms for automating "word of mouth." In: Proceedings of ACM CHI'95 Conference on Human Factors in Computing Systems. ACM, New York, pp. 210–217.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Herlocker, J., Konstan, J.A. & Riedl, J. An Empirical Analysis of Design Choices in Neighborhood-Based Collaborative Filtering Algorithms. Information Retrieval 5, 287–310 (2002). https://doi.org/10.1023/A:1020443909834
Issue Date:
DOI: https://doi.org/10.1023/A:1020443909834