Skip to main content
Log in

Silfide: A System for Open Access and Distributed Delivery of TEI Encoded Documents

  • Published:
Computers and the Humanities Aims and scope Submit manuscript

Abstract

This paper presents some aspects of the Silfide server, a system dedicated to the delivery of linguistic resources on the web. After presenting the main issues behind the design of such a system, we focus on the editorial choices related to the use of the Text Encoding Initiative to represent our textual documents. In particular, we focus on the accommodations we have had to carry with regards to the TEI header and address the trade-off between extensive enrichment and genericity of the primary data when one wants to precisely mark-up a given document content. As a whole, we show how essential the TEI has proven to be for a project such as ours both from a practical and conceptual point of view.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Aarts, J., P. de Haan and N. Oostdijk (Eds). English Language Corpora: Design, Analysis and Exploitation. Rodopi, Amsterdam, 1993.

    Google Scholar 

  • Association for Computers and the Humanities (ACH), Association for Computational Linguistics (ACL), and Association for Literary and Linguistic Computing (ALLC). Guidelines for Electronic Text Encoding and Interchange (TEI P3), 2 vol.. Ed. C. M. Sperberg-McQueen and L. Burnard, Chicago, Oxford: Text Encoding Initiative.

  • Buneman, P. Semistructured Data, Tutorial PODS '97 (see http://www.cis.upenn.edu/~db/tutorials.html).

  • Dunlop, D. “Practical Considerations in the Use of TEI Headers in a Large Corpus”. In Text Encoding Initiative: Background and Context. Dordrecht: Kluwer Academic Publishers, 1995, pp. 85–98.

    Google Scholar 

  • Heid, U. and C. Oliver (1996) An Investigation into the Use of AFS for Distribution and Networking of Linguistic Ressources and Tools. Technical Report Universität Stuttgart, Institut für maschinelle Sprachverarbeitung.

  • Ide, N. and J. Véronis MULTEXT/EAGLES-Corpus Encoding Standard, document Version 0.1. CNRS, Aix-en-Provence, 1995.

    Google Scholar 

  • IJCL-96. International Journal of Corpus Linguistics, V. 1 N.1, John Benjamins, 1996.

  • Krol, E. The Whole Internet: User's Guide and Vatalog. O'Reilly & associates, Sebastopol, collection Nutshell Landbook, 1992.

    Google Scholar 

  • Lapeyre, D. A. and T. Usdin TEI and the American Memory Project at the Library of Congress. Workshop: The Text Encoding Initiative Guidelines and their Application to Building Digital Libraries (20–23 March 96), 1996.

  • Péry-Woodley, M-P. 《Quels corpus pour quels traitements automatiques?》, in TAL-95.

  • Pino M. Encoding Two Large Spanish Corpora with the TEI Scheme: Design and Technical Aspects of Textual Markup. Workshop: The Text Encoding Initiative Guidelines and their Application to Building Digital Librairies (20–23 Mars 96), 1996.

  • Romary, L., N. Mehl and D. Woolls. “The Lingua Parallel Concordancing Project: Managing Multilingual Texts for Educational Purpose”. Text Technology, 5(3) (1995), 206–220.

    Google Scholar 

  • TAL-95, Traitement probabilistes et corpus, Traitement Automatique des Langues journal, 36(1–2), 1995.

  • TEI-P3, Association for Computers and the Humanities (ACH), Association for Computational Linguistics (ACL), and Association for Literary and Linguistic Computing (ALLC). Guidelines for Electronic Text Encoding and Interchange (TEI P3), 2 vol.. Ed. C. M. Sperberg-McQueen and Lou Burnard, Chicago, Oxford: Text Encoding Initiative, 1994.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Romary, L., Bonhomme, P., Bruneseaux, F. et al. Silfide: A System for Open Access and Distributed Delivery of TEI Encoded Documents. Computers and the Humanities 33, 31–38 (1999). https://doi.org/10.1023/A:1001787430094

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1001787430094

Navigation