Abstract
Dictionary markup is one of the concerns of the Text Encoding Initiative (TEI), an international project for text encoding. In this paper, we investigate ways to use and extend the TEI encoding scheme for the markup of Korean dictionary entries. Since TEI suggestions for dictionary markup are mainly for western language dictionaries, we need to cope with problems to be encountered in encoding Korean dictionary entries. We try to extend and modify the TEI encoding scheme in the way suggested by the TEI. Also, we restrict the content model so that the encoded dictionary might be viewed as a database as well as a computerized, originally printed, dictionary.
Similar content being viewed by others
References
Alschuler, L. ABCD..SGML: A User's Guide to Structured Information. London: ITP, 1995.
Atkins, B.T.S. and A. Zampoli. Eds. Computational Approaches to the Lexicon. Oxford: Oxford University Press, 1994.
Boguraev, B. "Machine-Readable Dictionaries and Computational Linguistics Research". In Current Issues in Computational Linguistics: In Honour of Don Walker. Eds. A. Zampoli, N. Calzolari, and M. Palmer. Pisa: Giardini and Dordrecht: Kluwer, 1994, pp. 119–154.
Choe, B.,W. Lee, J. Lee and K. Choi. "The Logical Structure of Lexical Entries for the Construction of a Machine Readable Dictionary". In Korean Journal of Cognitive Science, 7-2 (1996), 75–94 [written in Korean].
Clark, J. NSGMLS-A Validating SGML Parser, software available at ftp://ftp.jclark.com/pub/sp/, 1995.
Fillmore, C.J. and B.T.S. Atkins. "Starting Where the Dictionaries Stop: The Challenge of Corpus Lexicography". In Computational Approaches to the Lexicon. Eds. B.T.S. Atkins and A. Zampoli. Oxford: Oxford University Press, 1994, pp. 349–393.
Ide, N. and M. Sperberg-McQueen. "The Text Encoding Initiative: Its History, Goals, and Future Development". Computers and the Humanities, 29(1) (1995), 5–15.
Ide, N. and J. Véronis. "Encoding Dictionaries". Computers and the Humanities, 29(2) (1995), 167–179.
ISO. ISO 8879: Information Processing-Text and Office Systems-Standard Generalized Markup Language (SGML). ISO, 1986.
Kim, H. and B. Kang. "KOREA-1 Corpus: Design and Composition". Korean Linguistics, 3 (1996), 233–258 [written in Korean].
Sinclair, J. Ed. Looking Up. London: Collins, 1987.
Sperberg-McQueen, C.M. and L. Burnard. Eds. Guidelines for Electronic Text Encoding and Interchange(TEI P3), Chicago and Oxford: TEI, 1994.
Zampoli, A., N. Calzolari and M. Palmer. Eds. Current Issues in Computational Linguistics: In Honour of Don Walker. Pisa: Giardini and Dordrecht: Kluwer, 1994.