Chapter Overview
The field of information retrieval (IR) is generally concerned with the indexing and retrieval of knowledge-based information. Although the name implies the retrieval of any type of information, the field has traditionally focused on retrieval of text-based documents, reflecting the type of information that was initially available by this early application of computer use. However, with the growth of multimedia content, including images, video, and other types of information, IR has broadened considerably. The proliferation of IR systems and on-line content has also changed the notion of libraries, which have traditionally been viewed as buildings or organizations. However, the developments of the Internet and new models for publishing have challenged this notion as well, and new digital libraries have emerged.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Anonymous. (1999). Dublin Core Metadata Element Set, Version 1.1: Reference Description. Dublin Core Metadata Initiative, http://www.dublincore.org/documents/dces/.
Anonymous. (2000a). Cataloging Practices. National Library of Medicine, http://www.nlm.nih.gov/mesh/catpractices.html.
Anonymous. (2000b). Features of the MeSH Vocabulary. National Library of Medicine, http://www.nlm.nih.gov/mesh/features.html.
Anonymous. (2000c). Organization of National Library of Medicine Bibliographic Databases. National Library of Medicine, http://www.nlm.nih.gov/pubs/techbull/mjOO/mjOO_buckets.html.
Anonymous. (2001a). The Dublin Core Metadata Element Set. Dublin Core Metadata Initiative, http://www.niso.org/standards/resources/Z39-85.pdf.
Anonymous. (2001b). “The Future of the Electronic Scientific Literature,” Nature, 413: 1–3.
Anonymous. (2001c). PubMed Help. National Library of Medicine, http://www.ncbi.nlm.nih.gov/entrez/query/static/help/pmhelp.html. Accessed: July 1, 2002.
Anonymous. (2003). The Google Ranking Report. Sedona, AZ, Cyberdifference Corp., http://www.mseo.com/google_ranking_report.html.
Arms, W., Hillmann, D., et al. (2002). “A Spectrum of Interoperability: The Site for Science Prototype for the NSDL,” D-Lib Magazine, 8, http://www.dlib.Org/dlib/january02/arms/01arms.html.
Aronson, A., Bodenreider, O., et al. (2000). “The NLM Indexing Initiative,” in Proceedings of the AMIA 2000 Annual Symposium, Los Angeles, CA. Hanley & Belfus, 17–21.
Bahls, C, Weitzman, J., et al. (2003). “Biology’s Models,” The Scientist. June 2, 2003. 5, http://www.the-scientist.com/yr2003/jun/feature_030602.html.
Baxevanis, A. (2003). “The Molecular Biology Database Collection: 2003 update,” Nucleic Acids Research, 31: 1–12.
Beagrie, N. (2002). “An Update on the Digital Preservation Coalition,” D-Lib Magazine, 8, http://www.dlib.org/dlib/april02/beagrie/04beagrie.html.
Beckett, D., Miller, E., et al. (2000). Using Dublin Core in XML. Dublin Core Metadata Initiative, http://dublincore.org/documents/dcmes-xml/.
Besser, H. (2002). “The Next Stage: Moving from Isolated Digital Collections to Interoperable Digital Libraries,” First Monday, 7(6), http://www.firstmonday.dk/issues/issue7_6/besser/
Borgman, C. (1999). “What are Digital Libraries? Competing Visions,” Information Processing and Management, 35: 227–244.
Boyd, S. and Herkovic, A. (1999). Crisis in Scholarly Publishing: Executive Summary. Stanford Academic Council Committee on Libraries, http://www.stanford.edu/~boyd/schol_pub_crisis.html.
Brenner, S. and McKinin, E. (1989). “CINAHL and MEDLINE: A Comparison of Indexing Practices,” Bulletin of the Medical Library Association, 77: 366–371.
Brin, S. and Page, L. (1998). “The Anatomy of a Large-scale Hypertextual Web Search Engine,” Computer Networks, 30: 107–117.
Bult, C, Blake, J., et al. (2004). “The Mouse Genome Database (MGD): Integrating Biology with the Genome,” Nucleic Acids Research, 32: D476–481.
Callan, J. (1994). “Passage Level Evidence in Document Retrieval,” in Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland. Springer-Verlag. 302–310.
Caruso, D. (2000). “Digital Commerce; If the AOL-Time Warner Deal is about Proprietary Content, Where Does that Leave a Noncommercial Directory It Will Own?” New York Times. January 17, 2000.
Charen, T. (1976). MEDLARS Indexing Manual, Part I: Bibliographic Principles and Descriptive Indexing, 1977. Springfield, VA: National Technical Information Service.
Charen, T. (1983). MEDLARS Indexing Manual, Part II. Springfield, VA: National Technical Information Service.
Coletti, M. and Bleich, H. (2001). “Medical Subject Headings Used to Search the Biomedical Literature,” Journal of the American Medical Informatics Association, 8: 317–323.
DeAngelis, C. and Musacchio, R. (2004). “Access to JAMA,” Journal of the American Medical Association, 291: 370–371.
Diehn, M., Sherlock, G., et al. (2003). “SOURCE: A Unified Genomic Resource of Functional Annotations, Ontologies, and Gene Expression Data,” Nucleic Acids Research, 31:219–223.
Dolin, R., Alschuler, L., et al. (2001). “The HL7 Clinical Document Architecture,” Journal of the American Medical Informatics Association, 8: 552–569.
Egan, D., Remde, J., et al. (1989). “Formative Design-evaluation of Superbook,” ACM Transactions on Information Systems, 7: 30–57.
Fox, C. (1992). “Lexical Analysis and Stop Lists,” in Frakes, W. and Baeza-Yates, R., eds. Information Retrieval: Data Structures and Algorithms, Englewood Cliffs, NJ: Prentice-Hall, pp. 102–130
Frakes, W. (1992). “Stemming Algorithms,” in Frankes, W. and Baeza-Yates, R., eds. Information Retrieval: Data Structures and Algorithms, Englewood Cliffs, NJ: Prentice-Hall, pp. 131–160.
Friedlander, A. (2002). “The National Digital Information Infrastructure Preservation Program: Expectations, Realities, Choices, and Progress to Date,” D-Lib Magazine, 8, http://www.dlib.org/dlib/april02/friedlander/04friedlander.html.
Funk, M. and Reid, C. (1983). “Indexing Consistency in MEDLINE,” Bulletin of the Medical Library Association, 71: 176–183.
Harter, S. (1992). “Psychological Relevance and Information Science,” Journal of the American Society for Information Science, 43: 602–615.
Haynes, R., McKibbon, K., et al. (1990). “Online Access to MEDLINE in Clinical Settings,” Annals of Internal Medicine, 112: 78–84.
Haynes, R., Wilczynski, N., et al. (1994). “Developing Optimal Search Strategies for Detecting Clinically Sound Studies in MEDLINE,” Journal of the American Medical Informatics Association, 1: 447–458.
Hearst, M. and Karadi, C. (1997). “Cat-a-Cone: An Interactive Interface for Specifying Searches and Viewing Retrieval Results Using a Large Category Hierarchy,” in Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Philadelphia, PA. ACM Press. 246–255.
Hersh, W. (1994). “Relevance and Retrieval Evaluation: Perspectives from Medicine,” Journal of the American Society for Information Science, 45: 201–206.
Hersh, W. (2001). “Interactivity at the Text Retrieval Conference (TREC),” Information Processing and Management, 37: 365–366.
Hersh, W. (2003). Information Retrieval: A Health and Biomedical Perspective. Second Edition. New York: Springer-Verlag, http://www.irbook.org.
Hersh, W. and Bhupatiraju, R. (2003). “TREC Genomics track overview,” in The Twelfth Text Retrieval Conference: TREC 2003, Gaithersburg, MD. National Institute of Standards & Technology, http://trec.nist.gov/pubs/trecl2/papers/GENOMICS.OVERVIEW3.pdf.
Hersh, W., Crabtree, M., et al. (2002). “Factors Associated with Success for Searching MEDLINE and Applying Evidence to Answer Clinical Questions,” Journal of the American Medical Informatics Association, 9: 283–293.
Hersh, W., Elliot, D., et al. (1994). “Towards New Measures of Information Retrieval Evaluation,” in Proceedings of the 18th Annual Symposium on Computer Applications in Medical Care, Washington, DC. Hanley & Belfus. 895–899.
Hersh, W. and Hickam, D. (1998). “How Well Do Physicians Use Electronic Information Retrieval Systems? A Framework for Investigation and Review of the Literature,” Journal of the American Medical Association, 280: 1347–1352, http://jama.ama-assn.org/cgi/content/full/280/15/1347.
Hersh, W., Pentecost, J., et al. (1996). “A Task-oriented Approach to Information Retrieval Evaluation,” Journal of the American Society for Information Science, 47: 50–56.
Hersh, W. and Rindfleisch, T. (2000). “Electronic Publishing of Scholarly Communication in the Biomedical Sciences,” Journal of the American Medical Informatics Association, 7: 324–325.
Humphreys, B. (2000). “Electronic Health Record Meets Digital Library: A New Environment for Achieving an Old Goal,” Journal of the American Medical Informatics Association, 7: 444–452.
Lagoze, C. and VandeSompel, H. (2001). “The Open Archives Initiative: Building a Low-barrier Interoperability Framework,” in Proceedings of the First ACM/IEEE-CS Joint Conference on Digital Libraries, Roanoke, VA. ACM Press. 54–62.
Lassila, O., Hendler, J., et al. (2001). “The Semantic Web,” Scientific American, 284(5): 34–43, http://www.scientificamerican.com/article.cfm?articleID=00048144-10D2-lC70-84A9809EC588EF21&catID=2.
Lawrence, S. (2001). “Online or Invisible?” Nature, 411: 521.
Lawrence, S., Giles, C, et al. (1999). “Digital Libraries and Autonomous Citation Indexing,” Computer, 32: 67–71.
Lesk, M. (1997). Practical Digital Libraries-Books, Bytes, and Bucks. San Francisco: Morgan Kaufmann.
McCook, A. (2004). “Open Access to US Govt Work Urged,” The Scientist, http://www.biomedcentral.com/news/20040721/01.
McKibbon, K., Haynes, R., et al. (1990). “How Good Are Clinical MEDLINE Searches? A Comparative Study of Clinical End-user and Librarian Searches,” Computers and Biomedical Research, 23(6): 583–593.
McKiernan, G. (2003). “Open Archives Initiative Service Providers. Part I: Science and Technology,” Library Hi Tech News, 20(9):30–38, http://www.public.iastate.edu/~gerrymck/OAI-SP-I.pdf.
McLellan, F. (2003). “US Bill Says Government Funded Work Must Be Open Access,” Lancet, 362: 52.
Meek, J. (2001). “Science World in Revolt at Power of the Journal Owners,” The Guardian, http://www.guardian.co.uk/Archive/Article/0,4273,4193292,00.html.
Miller, E. (1998). “An Introduction to the Resource Description Framework,” D-Lib Magazine, 4, http://www.dlib.org/dlib/may98/miller/05miller.html.
Miller, N., Lacroix, E., et al. (2000). “MEDLINEplus: Building and Maintaining the National Library of Medicine’s Consumer Health Web Service,” Bulletin of the Medical Library Association, 88: 11–17.
Mynatt, B., Leventhal, L., et al. (1992). “Hypertext or Book: Which Is Better for Answering Questions?” in Proceedings of Computer-Human Interface 92. 19–25.
Paskin, N. (1999). “DOI: Current Status and Outlook,” D-Lib Magazine, 5, http://www.dlib.org/dlib/may99/05paskin.html.
Perkel, J. (2003). “Feeding the Info Junkies,” The Scientist. June 2, 2003. 39, http://www.the-scientist.com/yr2003/jun/feature14_030602.html.
Ponte, J. and Croft, W. (1998). “A Language Modeling Approach to Information Retrieval,” in Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia. ACM Press. 275–281.
Pratt, W., Hearst, M., et al. (1999). “A Knowledge-based Approach to Organizing Retrieved Documents,” in Proceedings of the 16th National Conference on Artificial Intelligence, Orlando, FL. AAAI. 80–85.
Redman, P., Kelly, J., et al. (1997). “Common Ground: The HealthWeb Project as a Model for Internet Collaboration,” Bulletin of the Medical Library Association, 85: 325–330.
Robertson, S. and Walker, S. (1994). “Some Simple Effective Aproximations to the 2-Poisson Model for Probabilistic Weighted Retrieval,” in Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland. Springer-Verlag. 232–241.
Salton, G. (1983). Introduction to Modern Information Retrieval. New York: McGraw-Hill.
Salton, G. (1991). “Developments in Automatic Text Retrieval,” Science, 253: 974–980.
Salton, G., Fox, E., et al. (1983). “Extended Boolean Information Retrieval,” Communications of the ACM, 26: 1022–1036.
Salton, G. and Lesk, M. (1965). “The SMART Automatic Document Retrieval System: An Illustration,” Communications of the ACM, 8: 391–398.
Singhal, A., Buckley, C, et al. (1996). “Pivoted Document Length Normalization,” in Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Zurich, Switzerland. ACM Press. 21–29.
Sollins, K. and Masinter, L. (1994). Functional Requirements for Uniform Resource Names. Internet Engineering Task Force, http://www.w3.org/Addressing/rfcl737.txt.
Spitzer, V., Ackerman, M., et al. (1996). “The Visible Human Male: A Technical Report.,” Journal of the American Medical Informatics Association, 3: 118–130.
Srinivasan, P. (1996). “Query Expansion and MEDLINE,” Information Processing and Management, 32: 431–444.
Swanson, D. (1988). “Historical Note: Information Retrieval and the Future of an Illusion,” Journal of the American Society for Information Science, 39: 92–98.
Tibbo, H. (2001). “Archival Perspectives on the Emerging Digital Library,” Communications of the ACM, 44(5): 69–70.
vanRijsbergen, C. (1979). Information Retrieval. London. Butterworth.
Voorhees, E. (1998). “Variations in Relevance Judgments and the Measurement of Retrieval Effectiveness,” in Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia. ACM Press. 315–323.
Voorhees, E. and Harman, D. (2000). “Overview of the Sixth Text REtrieval Conference (TREC),” Information Processing and Management, 36: 3–36.
Voorhees, E. and Harman, D. (2001). “Overview of TREC 2001,” in Proceedings of the Text Retrieval Conference 2001, Gaithersburg, MD. 1–15.
Weibel, S. (1996). “The Dublin Core: A Simple Content Description Model for Electronic Resources,” ASIS Bulletin, 24(1): 9–11, http://www.asis.org/Bulletin/Oct-97/weibel.htm.
Wildemuth, B., DeBliek, R., et al. (1995). “Medical Students’ Personal Knowledge, Searching Proficiency, and Database Use in Problem Solving,” Journal of the American Society for Information Science, 46: 590–607.
Xu, J. and Croft, W. (1996). “Query Expansion Using Local and Global Document Analysis,” in Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Zurich, Switzerland. ACM Press. 4–11.
Suggested Readings
Baeza-Yates, R. and Ribeiro-Neto, B., eds. 1999. Modern Information Retrieval. New York. McGraw-Hill. A book surveying most of the automated approaches to information retrieval.
Frakes, W.B., Baeza-Yates, R. Information Retrieval: Data Structures and Algorithms, Englewood Cliffs, NJ: Prentice-Hall, 1992. A textbook on implementation of information retrieval systems. Covers all of the major data structures and algorithms, including inverted files, ranking algorithms, stop word lists, and stemming. There are plentiful examples of code in the C programming language.
Hersh, W.R. Information Retrieval, A Health and Biomedical Perspective (Second Edition), New York: Springer-Verlag, 2003. A textbook on information retrieval systems in the health and biomedical domain that covers the state of the art as well as research systems.
Humphreys, B., Lindberg, D., et al. 1998. The Unified Medical Language System: an informatics research collaboration. Journal of the American Medical Informatics Association, 5: 1–11. A paper describing the motivation and implementation of the National Library of Medicine’s Unified Medical Language System.
Miles, W.D. A History of the National Library of Medicine, Bethesda, MD: U.S. Dept. of Health and Human Services, 1982. A comprehensive history of the National Library of Medicine and its forerunners, covering the story of Dr. John Shaw Billings and his founding of Index Medicus to the modern implementation of MEDLINE.
Salton, G. Developments in automatic text retrieval, Science, 253: 974–980, 1991. The last succinct exposition of word-statistical retrieval systems from the person who originated the approach.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer Science+Business Media, Inc.
About this chapter
Cite this chapter
Hersh, W.R. (2005). Information Retrieval and Digital Libraries. In: Chen, H., Fuller, S.S., Friedman, C., Hersh, W. (eds) Medical Informatics. Integrated Series in Information Systems, vol 8. Springer, Boston, MA. https://doi.org/10.1007/0-387-25739-X_9
Download citation
DOI: https://doi.org/10.1007/0-387-25739-X_9
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-24381-8
Online ISBN: 978-0-387-25739-6
eBook Packages: MedicineMedicine (R0)