The Trinity College Dublin 1872 Online Catalogue

  • John G. Byrne
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3163)

Abstract

The development of an online version of the Trinity College Dublin Printed Catalogue, which list books from the 14th C to 1872, is described. The principal benefit of the system is the ability to search on words and word stems in the title field. As the entries are in at least fourteen languages the language of each Roman script entry was determined, with a success rate of over 90%. The image of the entry from the catalogue is displayed. This hides the OCR errors.

Keywords

Recognition Rate Trinity College Optical Character Recognition Function Word Online Catalogue 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    Catalogus Librorum Impressorum qui in Bibliotheca Collegii Sacrosanctae et Individuae Trinitatis, Reginae Elizabethae, juxta Dublin. 9 vols (1864-1886)Google Scholar
  2. 2.
    Clarke, R.M.: OCROC Optical Character Recognition Output Corrector. Final year project Trinity College Dublin (May 1993)Google Scholar
  3. 3.
    Culligan, B.T.: Design of an On-Line Database Query System for the 1872 Printed Catalogue. Final year project Trinity College Dublin (May 1993)Google Scholar
  4. 4.
    Anderson, G.: Computerising a Library Catalogue using Optical Character Recognition. M.Sc. thesis, University of Dublin (1992)Google Scholar
  5. 5.
    Clarke, R.M.: User-Oriented Access to a Multilingual Database. M.Sc. thesis, University of Dublin (1995)Google Scholar
  6. 6.
    Kinane, V., Walsh, A. (eds.): Essays on the History of the Trinity College Library Dublin. Four Courts Press, Dublin (2000)Google Scholar
  7. 7.
    Bandinel, B.: Catalogus Librorum Impressorum in Bibliotheca Bodleiana, Oxford (1843)Google Scholar
  8. 8.
    Emmer, M.B., Quillen, E.K., Dewar, R.B.K.: MACRO SPITBOL The High-Performance SNOBOL Language. Catspaw Inc. (1991)Google Scholar
  9. 9.
    Zipf, G.K.: HumanBehaviour and the Principle of Least Effort. Addison-Wesley, Reading (1949)Google Scholar
  10. 10.
    Nic Gerailt, D., Byrne, J.G.: Error Detection in Several Languages for an OCR-Generated Multilingual Database. In: Proc. Third International Workshop on Applications of Natural Language to Information Systems, Simon Fraser University, Canada, June 26-27 (1997)Google Scholar
  11. 11.
    Smith, F.J., Devine, K.: BIRD, QUILL and MicroBIRD - A successful family of text retrieval systems. Literary and Linguistic Computing 4(2), 115–120 (1989)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • John G. Byrne
    • 1
  1. 1.Department of Computer ScienceO’Reilly Institute, Trinity CollegeDublin 2

Personalised recommendations