Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Searching Digital Libraries

  • Panagiotis G. Ipeirotis
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_327

Synonyms

Federated search

Definition

Searching digital libraries refers to searching and retrieving information from remote databases of digitized or digital objects. These databases may hold either the metadata for an object of interest (e.g., author and title), or a complete object such as a book or a video.

Historical Background

The initial efforts to standardize and facilitate searching of digital libraries date back to the 1970s, when the development of the Z39.50 protocol started. The Z39.50 protocol is an ANSI standard and defines how to search and retrieve items from a remote database catalog. The Z39.50 protocol was widely deployed within library environments, allowing users to perform searches to remote libraries.

With the advent of the Web, libraries started digitizing and making contents available on the Web, and the Z39.50 protocol started losing its importance. Many libraries made their content “searchable” through standard Web forms, allowing users to search and retrieve...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Bergholz A, Chidlovskii B. Using query probing to identify query language features on the web. In: Proceedings of the Distributed Multimedia Information Retrieval, SIGIR 2003 Workshop on Distributed Information Retrieval; 2004. p. 21–30.CrossRefGoogle Scholar
  2. 2.
    Bergman MK. The deep web: surfacing hidden value. J Electron Pub. 2001;7(1).Google Scholar
  3. 3.
    Callan JP, Connell M. Query-based sampling of text databases. ACM Trans Inf Syst. 2001;19(2):97–30.CrossRefGoogle Scholar
  4. 4.
    Chakrabarti S, van den Berg M, Dom B. Focused crawling: a new approach to topic-specific web resource discovery. Comput Netw. 1999;31(11–16):1623–40.CrossRefGoogle Scholar
  5. 5.
    Cope J, Craswell N, Hawking D. Automated discovery of search interfaces on the web. In: Proceedings of the 14th Australasian Database Conference; 2003. p. 181–189.Google Scholar
  6. 6.
    Gravano L, Ipeirotis PG, Sahami M. QProber: a system for automatic classification of hidden-web databases. ACM Trans Inf Syst. 2003;21(1):1–41.CrossRefGoogle Scholar
  7. 7.
    Hawking D, Thomas P. Server selection methods in hybrid portal search. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2005. p. 75–82.Google Scholar
  8. 8.
    Ipeirotis PG, Gravano L. Distributed search over the hidden web: hierarchical database sampling and selection. In: Proceedings of the 28th International Conference on Very Large Data Bases; 2002. p. 394–405.CrossRefGoogle Scholar
  9. 9.
    Ipeirotis PG, Gravano L. When one sample is not enough: improving text database selection using shrinkage. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2004. p. 767–778.Google Scholar
  10. 10.
    Ipeirotis PG, Ntoulas A, Cho J, Gravano L. Modeling and managing content changes in text databases. In: Proceedings of the 21st International Conference on Data Engineering; 2005. p. 606–617.Google Scholar
  11. 11.
    Lynch CA. The Z39.50 information retrieval standard. D-Lib Mag. 1997;3(4).Google Scholar
  12. 12.
    Ntoulas A, Zerfos P, Cho J. Downloading textual hidden web content by keyword queries. In: Proceedings of the 5th ACM/IEEE Joint Conference on Digital Libraries; 2005.Google Scholar
  13. 13.
    Raghavan S, García-Molina H. Crawling the hidden web. In: Proceedings of the 27th International Conference on Very Large Data Bases; 2001. p. 129–138.Google Scholar
  14. 14.
    Si L, Callan J. A semisupervised learning method to merge search engine results. ACM Trans Inf Syst. 2003;21(4):457–91.CrossRefGoogle Scholar
  15. 15.
    Si L, Callano J. Modeling search engine effectiveness for federated search. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2005. p. 83–90.Google Scholar
  16. 16.
    Zhang Z, He B, Chang KC-C. Understanding web query interfaces: best-effort parsing with hidden syntax. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2004. p. 107–118.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.New York UniversityNew YorkUSA

Section editors and affiliations

  • Amr El Abbadi
    • 1
  1. 1.Dept. of Computer ScienceUC Santa BarbaraSanta BarbaraUSA