Abstract
This paper presents some aspects of the Silfide server, a system dedicated to the delivery of linguistic resources on the web. After presenting the main issues behind the design of such a system, we focus on the editorial choices related to the use of the Text Encoding Initiative to represent our textual documents. In particular, we focus on the accommodations we have had to carry with regards to the TEI header and address the trade-off between extensive enrichment and genericity of the primary data when one wants to precisely mark-up a given document content. As a whole, we show how essential the TEI has proven to be for a project such as ours both from a practical and conceptual point of view.
Similar content being viewed by others
References
Aarts, J., P. de Haan and N. Oostdijk (Eds). English Language Corpora: Design, Analysis and Exploitation. Rodopi, Amsterdam, 1993.
Association for Computers and the Humanities (ACH), Association for Computational Linguistics (ACL), and Association for Literary and Linguistic Computing (ALLC). Guidelines for Electronic Text Encoding and Interchange (TEI P3), 2 vol.. Ed. C. M. Sperberg-McQueen and L. Burnard, Chicago, Oxford: Text Encoding Initiative.
Buneman, P. Semistructured Data, Tutorial PODS '97 (see http://www.cis.upenn.edu/~db/tutorials.html).
Dunlop, D. “Practical Considerations in the Use of TEI Headers in a Large Corpus”. In Text Encoding Initiative: Background and Context. Dordrecht: Kluwer Academic Publishers, 1995, pp. 85–98.
Heid, U. and C. Oliver (1996) An Investigation into the Use of AFS for Distribution and Networking of Linguistic Ressources and Tools. Technical Report Universität Stuttgart, Institut für maschinelle Sprachverarbeitung.
Ide, N. and J. Véronis MULTEXT/EAGLES-Corpus Encoding Standard, document Version 0.1. CNRS, Aix-en-Provence, 1995.
IJCL-96. International Journal of Corpus Linguistics, V. 1 N.1, John Benjamins, 1996.
Krol, E. The Whole Internet: User's Guide and Vatalog. O'Reilly & associates, Sebastopol, collection Nutshell Landbook, 1992.
Lapeyre, D. A. and T. Usdin TEI and the American Memory Project at the Library of Congress. Workshop: The Text Encoding Initiative Guidelines and their Application to Building Digital Libraries (20–23 March 96), 1996.
Péry-Woodley, M-P. 《Quels corpus pour quels traitements automatiques?》, in TAL-95.
Pino M. Encoding Two Large Spanish Corpora with the TEI Scheme: Design and Technical Aspects of Textual Markup. Workshop: The Text Encoding Initiative Guidelines and their Application to Building Digital Librairies (20–23 Mars 96), 1996.
Romary, L., N. Mehl and D. Woolls. “The Lingua Parallel Concordancing Project: Managing Multilingual Texts for Educational Purpose”. Text Technology, 5(3) (1995), 206–220.
TAL-95, Traitement probabilistes et corpus, Traitement Automatique des Langues journal, 36(1–2), 1995.
TEI-P3, Association for Computers and the Humanities (ACH), Association for Computational Linguistics (ACL), and Association for Literary and Linguistic Computing (ALLC). Guidelines for Electronic Text Encoding and Interchange (TEI P3), 2 vol.. Ed. C. M. Sperberg-McQueen and Lou Burnard, Chicago, Oxford: Text Encoding Initiative, 1994.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Romary, L., Bonhomme, P., Bruneseaux, F. et al. Silfide: A System for Open Access and Distributed Delivery of TEI Encoded Documents. Computers and the Humanities 33, 31–38 (1999). https://doi.org/10.1023/A:1001787430094
Issue Date:
DOI: https://doi.org/10.1023/A:1001787430094