Language Technology for Cultural Heritage

Selected Papers from the LaTeCH Workshop Series

  • Caroline Sporleder
  • Antal van den Bosch
  • Kalliopi Zervanou
Conference proceedings

DOI: 10.1007/978-3-642-20227-8

Part of the Theory and Applications of Natural Language Processing book series (NLP)

Table of contents (12 papers)

  1. Front Matter
    Pages i-xxxi
  2. Pre-Processing

    1. Front Matter
      Pages 1-1
    2. Strategies for Reducing and Correcting OCR Errors
      Martin Volk, Lenz Furrer, Rico Sennrich
      Pages 3-22
    3. Alignment between Text Images and their Transcripts for Handwritten Documents
      Alejandro H. Toselli, Verónica Romero, Enrique Vidal
      Pages 23-37
  3. Adapting NLP Tools to Older Language Varieties

    1. Front Matter
      Pages 39-39
  4. Linguistic Resources for CH/SSH

    1. Front Matter
      Pages 77-77
    2. The Ancient Greek and Latin Dependency Treebanks
      David Bamman, Gregory Crane
      Pages 79-98
  5. Personalisation

    1. Front Matter
      Pages 113-113
    2. Authoring Semantic and Linguistic Knowledge for the Dynamic Generation of Personalized Descriptions
      Stasinos Konstantopoulos, Vangelis Karkaletsis, Dimitrios Vogiatzis, Dimitris Bilidas
      Pages 115-132
  6. Structural and Narrative Analysis

    1. Front Matter
      Pages 133-133
    2. Automatic Pragmatic Text Segmentation of Historical Letters
      Iris Hendrickx, Michel Généreux, Rita Marquilhas
      Pages 135-153
    3. Proppian Content Descriptors in an Integrated Annotation Schema for Fairy Tales
      Thierry Declerck, Antonia Scheidel, Piroska Lendvai
      Pages 155-170
    4. Adapting NLP Tools and Frame-Semantic Resources for the Semantic Analysis of Ritual Descriptions
      Nils Reiter, Oliver Hellwig, Anette Frank, Irina Gossmann, Borayin Maitreya Larios, Julio Rodrigues et al.
      Pages 171-193
  7. Data Management, Visualisation and Retrieval

    1. Front Matter
      Pages 195-195
    2. Information Retrieval and Visualization for the Historical Domain
      Yevgeni Berzak, Michal Richter, Carsten Ehrler, Todd Shore
      Pages 197-212

About these proceedings

Introduction

The digital age has had a profound effect on our cultural heritage and on the academic research that studies it. A staggering number of objects, many of them textual, are being digitised to make them more readily accessible to both experts and laypersons. Besides offering vast potential for more effective and efficient preservation, management, and presentation, digitisation creates opportunities to work with cultural heritage data in ways that were never before feasible or even imagined.

To explore and exploit these possibilities, an interdisciplinary approach is needed, bringing together experts from cultural heritage, the social sciences and the humanities on the one hand, and information technology on the other. Because textual data is prevalent in these domains, language technology has a crucial role to play in this endeavour. Language technology can break through the "Google barrier" by making it possible to analyse texts at advanced levels, extracting information and knowledge at the level of the humanities or social sciences researcher, who wants to know not only the who, what, where, and when, but also the how and the why. At the same time, cultural heritage data poses considerable challenges for existing language technology: technology aimed at "generic" language has to face such disparate problems as historical language variation, OCR digitisation errors, and near-extinct academic expertise.
 
This book is primarily intended for researchers in information technology and language processing who want a state-of-the-art overview of the full breadth of the new and vibrant field of language technology for cultural heritage and its associated academic research in the humanities and social sciences. Researchers working in the target domains of cultural heritage, the social sciences and the humanities will also find this book useful, as it provides an overview of how language technology can help them with their information needs. The book covers applications ranging from pre-processing and data cleaning, through the adaptation and compilation of linguistic resources, to personalisation, narrative analysis, visualisation and retrieval.

Keywords

cultural heritage, digital humanities, digital libraries, information retrieval, language technology

Editors and affiliations

  • Caroline Sporleder (1)
  • Antal van den Bosch (2)
  • Kalliopi Zervanou (3)
  1. Computational Linguistics / MMCI, Saarland University, Saarbrücken, Germany
  2. Fac. Humanities, Tilburg University, Tilburg, Netherlands
  3. Tilburg School for Humanities, Tilburg Center for Cognition and Communi, University of Tilburg, Tilburg, Netherlands

Bibliographic information

  • Copyright Information: Springer-Verlag Berlin Heidelberg 2011
  • Publisher Name: Springer, Berlin, Heidelberg
  • eBook Packages: Humanities, Social Sciences and Law
  • Print ISBN: 978-3-642-20226-1
  • Online ISBN: 978-3-642-20227-8