Abstract
This paper assesses the retrieval effectiveness of automatically constructed inter-document hypertext links in Information Retrieval (IR). The objectives of the experiments described are to obtain evidence concerning the usefulness of querying and browsing automatically constructed IR hypertexts. Links are built by using IR techniques, as these enable rapid, automatic production of hypertexts from a document collection for accessing the collection itself. These tests are carried out in a laboratory environment and through simulation of link browsing. Results of experiments show that browsing has little impact on the retrieval of relevant documents if used in place of querying or relevance feedback methods, though may be practical if used in combination with them.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Agosti M (1988) Is hypertext a new model of information retrieval?. In: Proceedings of the 12th International Online Information Meeting, Vol. 1. Oxford, pp. 57–62.
Agosti M and Allan J, eds. (1997) Special issue on methods and tools for the automatic construction of hypertexts. Information Processing & Management, 33(2).
Agosti M, Benfante L and Melucci M (1998) OFAHIR: On-the-fly automatic authoring of hypertexts for information retrieval. In: Spaccapietra S and Maryanski F, eds. Data Mining and Reverse Engineering: Searching for Semantics, IFIP. Chapman and Hall, pp. 269–300.
Agosti M, Colotti R and Gradenigo G (1991) A two-level hypertext retrieval model for legal data. In: Bookstein A, Chiaramella Y, Salton G and Raghavan V, eds. Proceedings of the ACM International Conference on Research and Development in Information Retrieval (SIGIR). Chicago, pp. 316–325.
Agosti M and Crestani F (1993) A methodology for the automatic construction of a hypertext for information retrieval. In: Proceedings of the ACM Symposium on Applied Computing. Indianapolis, USA, pp. 745–753.
Agosti M, Crestani F and Melucci M (1996) Design and implementation of a tool for the automatic construction of hypertexts for information retrieval. Information Processing & Management, 32(4):459–476.
Agosti M, Crestani F and Melucci M (1997) On the use of information retrieval techniques for the automatic construction of hypertexts. Information Processing & Management, 33(2):133–144.
Allan J (1997) Building hypertexts using information retrieval. Information Processing & Management, 33(2):145–159.
Blustein J, Webber R and Tague-Sutcliffe J (1997) Methods for evaluating the quality of hypertext links. Information Processing & Management, 33(2):255–271.
Botafogo R, Rivlin E and Shneiderman B (1992) Structural analysis of hypertext: Identifying hierarchies and useful metrics. ACM Transactions on Information Systems, 10(2):142–180.
Crestani F and Melucci M (1998) A case study of automatic authoring: From a textbook to a hyper-textbook. Data and Knowledge Engineering, 27(1):1–30.
Croft W and Harper D (1979) Using probabilistic models of document retrieval without relevance information. Journal of Documentation, 35:285–295.
Croft W and Turtle H (1993) Retrieval strategies for hypertext. Information Processing & Management, 29(3):313–324.
Efthimiadis E (1996) Query expansion. In: Williams M, ed. Annual Review of Information Science and Technology (ARIST), Vol. 31. Information Today for the American Society for Information Science, Medford, NJ, USA, chap. 4, pp. 121–185.
Fox E (1983) Characterization of two new experimental collections in computer and information science containing textual and bibliographic concepts. Technical Report TR83–561, Cornell University, Computer Science Department.
Furner J, Ellis D and Willett P (1996) The representation and comparison of hypertext structures using graphs. In: Agosti M and Smeaton A, eds. Information Retrieval and Hypertext. Kluwer Academic, chap. 4, pp. 75–96.
Griffitths A, Luckhurts H and Willett P (1986), Using inter-document similarity information in document retrieval systems. Journal of the American Society for Information Science, 37:3–11.
Harman D (1992) Relevance feedback and other query modification techniques. In: Frakes W and Baeza-Yates R, eds. Information Retrieval: Data Structures and Algorithms. Prentice Hall, Englewood Cliffs, NJ, USA, chap. 11.
Hersh W, Buckley C, Leone T and Hickam D (1994) OHSUMED: An interactive retrieval evaluation and new large test collection for research. In: Proceedings of the ACM International Conference on Research and Development in Information Retrieval (SIGIR). Dublin, Ireland, pp. 192–201.
Jardine N and van Rijsbergen C (1971) The use of hierarchical clustering in information retrieval. Information Storage and Retrieval, 7:217–240.
Kwok K (1996), A new method of weighting query terms for ad-hoc retrieval. In: Frei H, Harman D, Schäuble P and Wilkinson R, eds. Proceedings of the ACM International Conference on Research and Development in Information Retrieval (SIGIR). New York, pp. 187–196.
Robertson S and Sparck Jones K (1976) Relevance weighting of search terms. Journal of the American Society for Information Science, 27:129–146.
Rocchio JJ (1971) Relevance feedback in information retrieval. In: Salton G, ed. The SMART Retrieval System: Experiments in Automatic Document Processing. Prentice-Hall, Englewood Cliffs, NJ, chap. 14, pp. 313–323.
Salton G and Buckley C (1990) Improving retrieval performance by relevance feedback. Journal of the American Society for Information Science, 41(4):288–297.
Salton G and McGill M (1983) Introduction to Modern Information Retrieval. McGraw-Hill, New York.
Salton G, Singhal A, Mitra M and Buckley C (1997) Automatic text structuring and summarization. Information Processing & Management, 33(2):193–207.
Savoy J (1997) Ranking schemes in hybrid Boolean systems: A new approach. Journal of the American Society for Information Science, 48(3):235–253.
Smeaton A (1995) Building hypertext under the influence of topology metrics. In: Proceedings of IWHD Conference. Montpellier.
Smeaton A and Morrissey P (1995) Experiments on the automatic construction of hypertext from text. Technical Report, Dublin City University, School of Computer Applications, Ireland, Working Paper: CA-0295.
Sparck Jones K (1971) Automatic Keyword Classification. Butterworths.
Sparck Jones K (1981) Information Retrieval Experiments. Butterworths.
Sparck Jones K and Willett P (1997) Readings in Information Retrieval. Morgan Kaufmann, San Francisco, CA, USA.
Tague-Sutcliffe J (1992) The pragmatics of information retrieval experimentation, revisited. Information Processing & Management, 28(4):467–490.
Thistlewaite P (1995) Automatic construction of open webs using derived link patterns. In: Agosti M and Allan J, eds. Proceedings of the ACM International Conference on Research and Development in Information Retrieval (SIGIR). Seattle, WA.
van Rijsbergen C (1979) Information Retrieval, 2nd ed. Butterworths, London.
van Rijsbergen C and Croft W (1975) Document clustering: An evaluation of some experiments with the Cranfield 1400 collection. Information Processing & Management, 11(5/7):171–182.
Voorhees EM (1985) The cluster hypothesis revisited. Technical Report TR85–658, Computer Science Department, Cornell University.
Willett P (1988) Recent trends in hierarchic document clustering: A critical review. Information Processing & Management, 24(5):577–597.
Rights and permissions
About this article
Cite this article
Melucci, M. An Evaluation of Automatically Constructed Hypertexts for Information Retrieval. Information Retrieval 1, 91–114 (1999). https://doi.org/10.1023/A:1009934321199
Issue Date:
DOI: https://doi.org/10.1023/A:1009934321199