Content-based collaborative information filtering: Actively learning to classify and recommend documents

  • Delgado Joaquin
  • Ishii Naohiro
  • Ura Tomoki
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1435)


Next generation of intelligent information systems will rely on cooperative agents for playing a fundamental role in actively searching and finding relevant information on behalf of their users in complex and open environments, such as the Internet. Whereas relevant can be defined solely for a specific user, and under the context of a particular domain or topic. On the other hand shared “social” information can be used to improve the task of retrieving relevant information, and for refining each agent's particular knowledge. In this paper, we combine both approaches developing a new content-based filtering technique for learning up-to-date users' profile that serves as basis for a novel collaborative information-filtering algorithm. We demonstrate our approach through a system called RAAP (Research Assistant Agent Project) devoted to support collaborative research by classifying domain specific information, retrieved from the Web, and recommending these “bookmarks” to other researcher with similar research interests.


Cooperative Information Systems Software Agents Collaborative Information Retrieval Social Filtering On-line machine learning algorithms 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Blum, A., “On-line Algorithms in Machine Learning” (a survey). Dagstuhl workshop on On-Line algorithms (June 1996).Google Scholar
  2. 2.
    Foner, L., “A Multi-Agent Referral System for Matchmaking”, in Proceedings of the First International Conference on the Practical Applications of Intelligent Agent Technology (PAAM'96), London (April 1996).Google Scholar
  3. 3.
    Buckley, C., Sallon,G., “The effect of adding relevance information in a relevance feedback environment”. In Proceedings of the 17th International ACM/SIGIR Conference on Research and Development in Information Retrieval (1994).Google Scholar
  4. 4.
    Sallon,G., Buckley, C., “Improving retrieval performance by relevance feedback”. Journal of the American Society for Information Science, 41, 288–297 (1990).CrossRefGoogle Scholar
  5. 5.
    Maes, P.:,“Agents that Reduce Work and Information Overload”, Comm ACM, 37, No7 (1994).Google Scholar
  6. 6.
    Tokunaga, T., Iwayama M.: “Text categorization based on weighted inverse document frequency”, Technical Report 94-TR0001, Department of Computer Science, Tokyo Institute of Technology (March 1994).Google Scholar
  7. 7.
    Lewis, D.: “Challenges in machine learning for text classification”, in Proceedings of the Ninth Annual Conference on Computational Learning Theory, P1. New York (1996). ACMGoogle Scholar
  8. 8.
    Yang, Y., Pederscn, J. “Feature selection in statistical learning of text categorization”, Proceedings of the Fourteenth International Conference on Machine Learning (ICML'97), (1997).Google Scholar
  9. 9.
    Quinlan, J.R., “Induction of decision trees”. Machine Learning, 1(1):81–106 (1986)Google Scholar
  10. 10.
    Mitchell T., “Machine Learning” McGraw Hill, 1996Google Scholar
  11. 11.
    Pazzani,M.,Muramatsu,J.,and Billsus, D., “Syskill & Wcbcrt: Identifying interesting websites”, In Proceedings of the American National Conference on Artificial Intelligence (AAAI'96), Portland, OR. (1996)Google Scholar
  12. 12.
    Frakes, W.,Baeza-Yates,R.: “Information Retrieval: Data Structure & Algorithms” Printice Hall, NJ (1992)Google Scholar
  13. 13.
    Blum, A.,Langley, P.: “Selection of Relevant Features and Examples in Machine Learning”, Artificial Intelligence, 97:245–277, (1997)MATHMathSciNetCrossRefGoogle Scholar
  14. 14.
    Blum, A.: “Empirical support for Winnow and Weighted-Majority based algorithm: results on a calendar scheduling domain”, Machine Learning 26:5–23. (1997).MathSciNetCrossRefGoogle Scholar
  15. 15.
    Armstrong, R., Frietag, D., Joachims, T. and T.M. Mitchell: “Web Watcher: a learning apprentice for the world wide web” In Proceedings of the 1995 AAAI Spring Symposium of Information Gathering from Heterogeneous, Distributed Environments, Stanford, CA, 1995. AAAI Press.Google Scholar
  16. 16.
    Shardanand, U. and Maes P.: “Social Information Filtering: Algorithms for Automation “Word of Mouth””: ACM/CHI'95. hltp:// Scholar
  17. 17.
    Resnick, P., Iacovou N., Sushak, M., Bergstrom, P., Riedl, J.: “GroupLens: An Open Architecture for Collaborative Filtering of Netnews”, in the Proceedings of the CSCW 1994 conference, October 1994.Google Scholar
  18. 18.
    Kautz, H., Selman, B. and Shah, M.: “The Hidden Web”, Al Magazine, Summer 1997. AAAI Press.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1998

Authors and Affiliations

  • Delgado Joaquin
    • 1
  • Ishii Naohiro
    • 1
  • Ura Tomoki
    • 1
  1. 1.Department of Intelligence & Computer ScienceNagoya Institute of TechnologyNagoyaJapan

Personalised recommendations