Information Retrieval

, Volume 10, Issue 2, pp 115–141

An approach for the capture of context-dependent document relationships extracted from Bayesian analysis of users' interactions with information


    • Department of Mechanical EngineeringUniversity of Bath
  • S. J. Culley
    • Department of Mechanical EngineeringUniversity of Bath
  • C. A. McMahon
    • Department of Mechanical EngineeringUniversity of Bath
  • F. Sellini
    • Airbus UK (Knowledge Management)

DOI: 10.1007/s10791-006-9016-2

Cite this article as:
Campbell, D.R., Culley, S.J., McMahon, C.A. et al. Inf Retrieval (2007) 10: 115. doi:10.1007/s10791-006-9016-2


A number of technologies exist which enable the unobtrusive capture of computer interface interactions in the background of a user's working environment. The resulting data can be used in a variety of ways to model aspects of search activity and the general use of electronic documents in normal working routines. In this paper we present an approach for using captured data to identify relationships between documents used by an individual or group, representing their value in a given context—that may relate to specific information need or activity. The approach employs the use of a naïve Bayesian classifier to evaluate possible relationships that are derived implicitly from the data. It is intended that the relationships established be stored within an information retrieval (IR) system to aid in the retrieval of related documents where future users arrive at a similar context. In the evaluation of the approach over 70 hours of data from computer users in industrial and academic settings are collected to assess its overall feasibility. The results indicate that the approach provides a useful method for the establishment of identifiable relationships between documents based on the context of their usage, rather than their content.


Implicit indicators Recommender systems Context Naïve Bayesian classification

Copyright information

© Springer Science+Business Media, LLC 2006