Abstract
Semantic knowledge is often used in the framework of Natural Language Processing (NLP) applications. However, for some languages different from English, such knowledge is not always easily available. In fact, for example, French thesaurus are not numerous and are not enough developed. In this context, we present two modifications made on the French version of the EuroWordnet Thesaurus in order to improve it. Firstly, we present the French EuroWordNet thesaurus and its limits. Then we explain two improvements we have made. We add non-existing relationships by using the bilinguism capability of the EuroWordnet thesaurus, and definitions by using an external multilingual resource (Wikipedia [1]).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Wikipedia (2006), http://en.wikipedia.org/wiki/
Gonzalo, J., Verdejo, F., Chugur, I., Cigarran, J.: Indexing with wordnet synsets can improve text retrieval. In: Proceeding of the workshop on Usage of Wordnet for NLP, ACL-98, pp. 38–44. Sage, Thousand Oaks (1998)
Miller, G.: Wordnet: an online lexical database. International journal of lexicography 3, 235–312 (1990)
Vossen, P. (ed.): EuroWordnet - A multilingual database with lexical semantic networks. Kluwer Academic Publishers, Dordrecht (1998)
Loupy, C.D.: Managing synonymy and polysemy in a document retrieval system using wordnet. In: Proceedings of Creating and Using Semantic for Information Retrieval and Filtering Workshop, LREC (2002)
O’Hara, T., Mahesh, K., Niremburg, S.: Lexical acquisition with wordnet and the mikrokosmos ontology. In: Harabagui, S. (ed.) COLING-ACL conference: Use of WordNet in Natural Language Processing Systems, August 1998, pp. 94–101 (1998)
Hanks, P., Pustejovsky, J.: A pattern dictionnary for natural language processing. In: Revue Française de Linguistique Appliquée, numéro spécial sur les dictionaires, Dec. 2005, pp. 63–82 (2005)
PostGreSQL (2006), http://www.postgresqlfr.org/
Tufi, D., Cristea, D.: Methodological issues in building the romanian wordnet and consistency checks in balkanet. In: Proceedings of the LREC special Workshop on WordNets, pp. 35–41 (2002)
Morin, E., Jacquemin, C.: Automatic acquisition and expansion of hypernym links. In: Computers and the Humanities (CHUM), pp. 363–396. Kluwer Academic Publishers, Dordrecht (2005)
Moldovan, D., Pascal, M., Harabagiu, S., Surdeanu, M.: Performance issues and error analysis in an open-domain question answering system. In: transactions on information systems, A., ed.: Journal on Very Large Databases. 21, 133–154 (2003)
Saggion, H., Gaizauskas, R., Hepplz, M., Roberts, I., Greenwood, M.A: Exploring the performance of boolean retrieval strategies for open domain question answering. In: Proceeding of the Workshop on Information Retrieval for Question Answering, SIGIR (2004)
Butler, D., Hogan, J., Hopkin, M., Peplow, M., Simonite, T.: Online article from the nature review (2005), http://www.nature.com/news/2005/051212/full/438900a.html
Inex: http://inex.is.informatik.uni-duisburg.de (2005)
Wiqa: http://ilps.science.uva.nl/wiqa (2005)
Ruiz-Casado, M., Alfonseca, E., Castells, P.: Automatic assignment of wikipedia encyclopedic entries to wordnet synsets. In: Szczepaniak, P.S., Kacprzyk, J., Niewiadomski, A. (eds.) AWIC 2005. LNCS (LNAI), vol. 3528, pp. 380–386. Springer, Heidelberg (2005)
Wu, Z., Palmer, M.: Verb semantics and lexical selection. In: The 32nd annual meeting of the association for computational linguistics, pp. 133–138 (1994)
Halkidi, M., Nguyen, B., Varlamis, I., Vazirgiannis, M.: Thesus: Organising web document collections based on link semantics. In: transactions on information systems, A., ed.: Journal on Very Large Databases 12, 320–332 (2003)
Resnik, P.: Semantic similarity in a taxonomy: an information based measure and its application to problems of ambiguity in natural language. Journal of artificial intelligence research 11, 95–130 (1999)
Pearson, J. (ed.): Terms in context, 1998. John Benjamins Publishing Company (1998)
Clef06: Cross language evaluation campaign (2006), http://www.clef-campaign.org/
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jacquin, C., Desmontils, E., Monceaux, L. (2007). French EuroWordNet Lexical Database Improvements. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2007. Lecture Notes in Computer Science, vol 4394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70939-8_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-70939-8_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70938-1
Online ISBN: 978-3-540-70939-8
eBook Packages: Computer ScienceComputer Science (R0)