Skip to main content

Reusability of Dictionaries in the Compilation of NLP Lexicons

  • Conference paper
  • First Online:
  • 437 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2721))

Abstract

This paper discusses particular linguistic challenges in the task of reusing published dictionaries, conceived as structured sources of lexical information, in the compilation process of a machine-tractable thesaurus-like lexical database for Brazilian Portuguese. After delimiting the scope of the polysemous term thesaurus, the paper focuses on the improvement of the resulting object by a small team, in a form compatible with and inspired by WordNet guidelines, comments on the dictionary entries, addresses selected problems found in the process of extracting the relevant lexical information form the selected dictionaries, and provides some strategies to overcome them.

This research is sponsored by CNPq and FAPESP-São Paulo, Brazil.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Barbosa, O. Grande Dicionário de Sinônimos e Antônimos. Ediouro, Rio de Janeiro (1999)

    Google Scholar 

  2. Borba, F.S. (coord.) Dicionário Gramatical de Verbos do Português Contemporâneo do Brasil. Editora da Unesp, São Paulo (1990)

    Google Scholar 

  3. Chafe, W. Meaning and the Structure of Language. The University of Chicago Press, Chicago (1970)

    Google Scholar 

  4. Cruse, D.A. Lexical Semantics. Cambridge University Press, New York (1986)

    Google Scholar 

  5. Dias-da-silva, B.C. Bridging the gap between linguistic theory and natural language processing In: Caron, B. (ed.) 16th International Congress of Linguists. Pergamon-Elsevier Science, Oxford (1998) 10 p.

    Google Scholar 

  6. Dias-da-Silva, B.C., Oliveira, M.F., Hasegawa, R., Moraes, H.R., Amorim, D., Paschoalino, C. Nascimento, A.C. A construção de um thesaurus eletrônico para o português do Brasil. In: Proceedings of the 5th PROPOR — Encontro para o processamento computacional da língua portuguesa escrita e falada, Atibaia, Brazil, (2000) 01–10

    Google Scholar 

  7. Dias-da-Silva, B.C., Oliveira, M. F., Moraes, h. r. Groundwork for the Development of the Brazilian Portuguese Wordnet. In: Ranchhold, E.M.; Mamede, N.J. (eds.) Advances in natural language processing.: Springer-Verlag, Berlin (2002) 189–196

    Chapter  Google Scholar 

  8. Fellbaum, C. (ed.) WordNet: An Electronic Lexical Database. The MIT Press, Cambridge, Mass (1998)

    MATH  Google Scholar 

  9. Fernandes, F. Dicionário de Sinônimos e Antônimos da Língua Portuguesa. Globo, São Paulo (1997)

    Google Scholar 

  10. Ferreira, A.B.H. Dicionário Aurélio Eletrônico Século XXI (versão 3.0). Lexicon, São Paulo (1999)

    Google Scholar 

  11. Kilgarriff, a. Dictionary word sense distinctions: an inquiry into their nature. Computer and the Humanities, 26 (1993) 365–387

    Article  Google Scholar 

  12. Kilgarriff, A. I don’t Believe in Word Senses. Computer and the Humanities 31 (1997) 91–113

    Article  Google Scholar 

  13. Kilgariff, A., Yallop, C. “What’s in a Thesaurus?”. In: Proceedings of the 2nd Conference on Language Resources and Evaluation, Athens, Greece (2000) 8 p.

    Google Scholar 

  14. Summers, D. (ed.) Longman Dictionary of Contemporary English. Longman, Essex (1995)

    Google Scholar 

  15. Weiszflog, W. (ed) Michaelis português — moderno dicionário da língua portuguesa (versão 1.1). DTS Software Brasil Ltda, São Paulo (1998)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dias-da-Silva, B.C., de Oliveira, M.F., de Moraes, H.R. (2003). Reusability of Dictionaries in the Compilation of NLP Lexicons. In: Mamede, N.J., Trancoso, I., Baptista, J., das Graças Volpe Nunes, M. (eds) Computational Processing of the Portuguese Language. PROPOR 2003. Lecture Notes in Computer Science(), vol 2721. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45011-4_12

Download citation

  • DOI: https://doi.org/10.1007/3-540-45011-4_12

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40436-1

  • Online ISBN: 978-3-540-45011-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics