Predicting Indexer Performance in a Distributed Digital Library

  • Naomi Dushay
  • James C. French
  • Carl Lagoze
Conference paper

DOI: 10.1007/3-540-48155-9_11

Part of the Lecture Notes in Computer Science book series (LNCS, volume 1696)
Cite this paper as:
Dushay N., French J.C., Lagoze C. (1999) Predicting Indexer Performance in a Distributed Digital Library. In: Abiteboul S., Vercoustre AM. (eds) Research and Advanced Technology for Digital Libraries. ECDL 1999. Lecture Notes in Computer Science, vol 1696. Springer, Berlin, Heidelberg

Abstract

Resource discovery in a distributed digital library poses many challenges, one of which is how to choose search engines for query distribution, given a query and a set of search engines. This paper focuses on search engine performance as a criterion for search engine selection and defines two measurements of search engine performance: availability – will the search engine respond within a time limit, and response time – how quickly will the search engine respond, given that it responds at all. We predicted both of these performance characteristics with a variety of algorithms, all of which required little computation time and combined past performance data for each search engine into a succinct record. We used operational data from the NCSTRL distributed digital library to make and evaluate predictions, and we found that simple prediction methods performed as well as more complex methods and that prediction accuracy was closely related to data consistency.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 1999

Authors and Affiliations

  • Naomi Dushay
    • 1
  • James C. French
    • 2
  • Carl Lagoze
    • 1
  1. 1.Dept. of Computer ScienceCornell UniversityIthacaUSA
  2. 2.Dept. of Computer ScienceUniversity of VirginiaCharlottesvilleUSA

Personalised recommendations