Advertisement

Technical Implementation of the Vocabulário Ortográfico Comum da Língua Portuguesa

  • Maarten JanssenEmail author
  • José Pedro Ferreira
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11122)

Abstract

The recent Portuguese language orthographic agreement (AOLP90) specifies that the new spelling rules are implemented in an official spelling dictionary (VOC). VOC, released in 2017, is the first common spelling dictionary valid in all Portuguese-speaking countries. AOLP90 allows for some national-level spelling variation, defined in a national spelling dictionary (VON) for each country, containing the nationally-representative words and national-level variants. This combination of a single official spelling with national variation cannot be handled in a traditional set-up for lexical data. This article describes how the lexicon is practically implemented in the VOC database. We start by presenting the nature of AOLP90, the requirements for VOC, and the lexical database. We then analyze the technical implications of orthographic variation in a pluricentric context and present the solutions and practical implementation adopted in VOC. We finish by presenting the pluricentric management system designed for this purpose, devised to cater for decentralized, but compatible management of the lexical database.

Keywords

Spelling dictionary Computational lexicography Portuguese as a pluricentric language 

References

  1. 1.
    Marquilhas, R.: The portuguese language spelling accord. Writ. Lang. Lit. 18(2), 275–286 (2015)CrossRefGoogle Scholar
  2. 2.
    Johnson, S.: Spelling Trouble? Language, Ideology and the Reform of German Orthography. Information and Interdisciplinary Subjects Series. Multilingual Matters Ltd, Bristol (2005)Google Scholar
  3. 3.
    Coulmas, F.: Writing reform: conditions and implications. In: Writing Systems: An Introduction to Their Linguistic Analysis, pp. 241–263. Cambridge University Press (2003)Google Scholar
  4. 4.
    Ferreira, J.P., Correia, M., de Almeida, G.B., eds.: Vocabulário Ortográfico Comum da Língua Portuguesa. In: Instituto Internacional da Língua Portuguesa/Comunidade dos Países de Língua Portuguesa, Praia, Cape Verde/Lisbon, Portugal (2017)Google Scholar
  5. 5.
    Buchmann, F.: Spelling dictionaries. In: Durkin, P. (ed.) The Oxford Handbook of Lexicography. Oxford University Press, Oxford (2016)Google Scholar
  6. 6.
    Ferreira, J.P., Janssen, M., de Almeida, G.B., Correia, M., de Oliveira, F.M.: The common orthographic vocabulary of the portuguese language: a set of open lexical resources for a pluricentric language. In: Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, pp. 1071–1075 (2012)Google Scholar
  7. 7.
    Janssen, M., Kuhn, T.Z., Ferreira, J.P., Correia, M.: The CPLP corpus, a corpus of portuguese as a pluricentric language. In: XVIII EURALEX International Congress, Ljubljana, Slovenia (2018)Google Scholar
  8. 8.
    Janssen, M.: Open source lexical information network. In: 3rd International Workshop on Generative Approaches to the Lexicon, Geneva, Switzerland (2005)Google Scholar
  9. 9.
    Janssen, M.: Affix selection and deadjectival nouns: a data-driven approach. In: Humphries, G. (ed.) Enlish Language, Literature, and Culture: New directions in research. Bielo-Bialska, Poland (2008)Google Scholar
  10. 10.
    Styles, T.: Place-name dictionaries. In: Durkin, P. (ed.) The Oxford Handbook of Lexicography. Oxford University Press, Oxford (2016)Google Scholar
  11. 11.
    Casteleiro, J.M. (ed.): Dicionário da língua portuguesa contemporânea: 1. A-F. Academia das Ciências de Lisboa, Verbo (2001)Google Scholar
  12. 12.
    Enciclopèdia.cat: Diccionari enciclopèdic (2018)Google Scholar
  13. 13.
    Institut d’Estudis Catalans: Diccionari de la llengua catalana. Institut d’estudis catalans (2007)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  1. 1.CELGA-ILTECUniversity of CoimbraCoimbraPortugal

Personalised recommendations