Abstract
Optimizing the production, maintenance and extension of lexical resources is one the crucial aspects impacting natural language processing (NLP). A second aspect involves optimizing the process leading to their integration in applications. With this respect, we believe that a consensual specification on monolingual, bilingual and multilingual lexicons can be a useful aid for the various NLP actors. Within ISO, one purpose of Lexical Markup Framework (LMF, ISO-24613) is to define a standard for lexicons that covers multilingual lexical data.
Similar content being viewed by others
Notes
Two types of diagrams must be distinguished: class diagrams and instance diagrams. A model is specified by a UML class diagram within a UML package: in this case, the class name is not underlined in the diagram. The various examples of word description are represented by UML instance diagrams: in this case, the class name is underlined.
Packages for Morphology, Syntax and Semantics are described in (Francopoulo 2006).
To be more precise, Papillon uses the term “axie”. We decided to use the term “axis” in order to respect English orthography.
It should be noted that a W3C schema is not specified as an ISO standard but is specified as a W3C recommendation. For criticisms about W3C schemas and comparison with Relax NG, see (Van der Vlist 2004).
Please consult www.lexicalmarkupframework.org for updated information.
References
Antoni-Lay, M.-H., Francopoulo, G., & Zaysser, L. (1994). A generic model for reusable lexicons: The GENELEX project. Literary and linguistic computing, 9(1), 47–54.
Atkins, S., Bel, N., Bertagna, F., Bouillon, P., Calzolari, N., Fellbaum, C., et al. (2002). From resources to applications. Designing the multilingual ISLE lexical entry. Proceedings of LREC, Las Palmas, Canary Island, Spain (pp. 687–693).
Bertagna, F., Lenci, A., Monachini, M., & Calzolari, N. (2004). Content interoperability of lexical resources, open issues and MILE perspectives. Proceedings of LREC, Lisbon (pp. 131–134).
Calzolari, N., Mc Naught, J., & Zampolli, A. (1996) Eagles, editors introduction, from www.ilc.cnr.it/EAGLES96/edintro.html.
Francopoulo, G., George, M., Calzolari, N., Monachini, M., Bel, N., et al. (2006). Lexical markup framework (LMF). Proceedings of LREC, Genoa (pp. 223–236).
Ide, N., & Véronis, J. (1994). MULTEXT: Multilingual text tools and corpora. In Proceedings of the 15th International Conference on Computational Linguistics, COLING, Kyoto.
ISO-12620. (1999). Computer application in terminology—Data categories. Geneva: ISO.
Khemakhem, A., Gargouri, B., Abdelwahed, A., & Francopoulo, G. (2007). Modélisation des paradigmes de flexion des verbes arabes selon la norme LMF-ISO 24613. Toulouse: TALN.
Lenci, A., Bel, N., Busa, F., Calzolari, N., Gola, E., Monachini, M., et al. (2000). SIMPLE: A general framework for the development of multilingual lexicons. International Journal of Lexicography, 13(4), 249–263.
LMF. (2008). Lexical markup framework ISO-FDIS24613. Geneva: ISO.
Monachini, M., Quochi, V., Del Gratta, R., & Calzolari, N. (2007). Using LMF to shape a lexicon for the biomedical domain. LangTech Proceeding, Rome.
Rumbaugh, J., Jacobson, I., & Booch, G. (2004). The unified modeling language reference manual (2nd ed.). Boston, MA: Addison Wesley.
Sérasset, G., & Mangeot-Lerebours, M. (2001). Papillon Lexical Database project: Monolingual dictionaries & interlingual links. Tokyo: NLPRS.
Van der Vlist E. (2004). Relax NG, O’Reilly. Sebastopol, California: CRCS.
Zampolli, A. (1997). The PAROLE project in the general context of the European actions for language resources. In R. Marcinkeviciene & N. Volz (Eds.), TELRI Proceedings of the Second European Seminar: Language Applications for a Multilingual Europe. Manheim/Kaunas: IDS/VDU.
Acknowledgements
The work presented here was partially funded by the EU eContent-22236 LIRICS project (http://lirics.loria.fr ).
Author information
Authors and Affiliations
Corresponding author
Additional information
This paper is the description of the ongoing work within ISO committees and is not a position paper.
Rights and permissions
About this article
Cite this article
Francopoulo, G., Bel, N., George, M. et al. Multilingual resources for NLP in the lexical markup framework (LMF). Lang Resources & Evaluation 43, 57–70 (2009). https://doi.org/10.1007/s10579-008-9077-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10579-008-9077-5