Skip to main content

Experiments on Enlarging a Lexical Ontology

  • Conference paper
  • First Online:
Languages, Applications and Technologies (SLATE 2015)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 563))

Included in the following conference series:

  • 367 Accesses

Abstract

This paper presents two simple experiments performed in order to enlarge the coverage of PULO, a Lexical Ontology, based and aligned with the Princeton WordNet. The first experiment explores the triangulation of the Galician, Catalan and Castillian wordnets, with translation dictionaries from the Apertium project. The second, explores Dicionário-Aberto entries, in order to extract synsets from its definitions. Although similar approaches were already applied for different languages, this document aims at documenting their results for the PULO case.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    This article will use the term variant to refer to one of the synonyms of a synset.

  2. 2.

    Orthography prior to the 1990 agreement, that was officiated in 2008 by the Portuguese Government, and still being, progressively, adopted in Portugal.

  3. 3.

    Text Encoding Initiative XML schema, that includes notation to encode different kind of resources from simple books to corpora or dictionaries.

  4. 4.

    This distinction is, of course, of the responsibility of the original lexicographer.

  5. 5.

    In these and next examples, the authors decided not to translate the variant itself, as a direct translation will lose part of the cultural/usage meaning.

  6. 6.

    Given the obtained accuracy and the lack of human resources for a through validation, the authors decided to include the obtained variants without further analysis.

  7. 7.

    Given the low accuracy and the small number of proposed variants, the authors decided to perform a manual validation prior to their incorporation into PULO.

References

  1. Almeida, J.J., Pinto, U.: Jspell - um módulo para análise léxica genérica de linguagem natural. In: Actas do X Encontro da Associação Portuguesa de Linguística. pp. 1–15. Évora 1994 (1995)

    Google Scholar 

  2. Forcada, M.L.: Apertium: traducció automàtica de codi obert per a les llengües romàniques. Linguamática 1(1), 13–23 (2009)

    MathSciNet  Google Scholar 

  3. Gómez Guinovart, X., Clemente, X.M.G., Pereira, A.G., Lorenzo, V.T.: Galnet: WordNet 3.0 do galego. Linguamática 3(1), 61–67 (2011)

    Google Scholar 

  4. Gonço Oliveira, H., de Paiva, V., Freitas, C., Rademaker, A., Real, L., Simões, A.: As wordnets do Português. In: Simões, A., Barreiro, A., Santos, D., Sousa-Silva, R., Tagnin, S. (eds.) Linguástica, Informática e Tradução: Mundos que se Cruzam, vol. 7, pp. 397–424, March 2015

    Google Scholar 

  5. Gonçalo Oliveira, H., Gomes, P.: ECO and Onto.PT: a flexible approach for creating a Portuguese wordnet automatically. Lang. Resour. Eval. J. 48(2), 373–393 (2014)

    Article  Google Scholar 

  6. Oliveira, H.G., Santos, D., Gomes, P., Seco, N.: PAPEL: a dictionary-based lexical ontology for Portuguese. In: Teixeira, A., de Lima, V.L.S., de Oliveira, L.C., Quaresma, P. (eds.) PROPOR 2008. LNCS (LNAI), vol. 5190, pp. 31–40. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  7. Gonzalez-Agirre, A., Laparra, E., Rigau, G.: Multilingual central repository version 3.0. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 2525–2529. ELRA (2012)

    Google Scholar 

  8. Maziero, E.G., Pardo, T.A.S., Felippo, A.D., Dias-da-Silva, B.C.: A base de Dados Lexical e a interface web do TeP 2.0. In: VI Workshop em Tecnologia da Informação e da Linguagem Humana, pp. 390–392 (2008)

    Google Scholar 

  9. Miller, G.A.: WordNet: a lexical database for English. Commun. ACM 38, 39–41 (1995)

    Article  Google Scholar 

  10. Rademaker, A., Paiva, V.D., de Melo, G., Coelho, L.M.R., Gatti, M.: OpenWordNet-PT: a project report. In: Proceedings of the 7th Global WordNet Conference, pp. 383–390 (2014)

    Google Scholar 

  11. Simões, A., Farinha, R.: Dicionário Aberto: um recurso para processamento de linguagem natural. Vice-Versa 16, 159–171 (2011)

    Google Scholar 

  12. Simões, A., Guinovart, X.G.: Bootstrapping a Portuguese WordNet from Galician, Spanish and English wordnets. In: Navarro Mesa, J.L., Ortega, A., Teixeira, A., Hernández Pérez, E., Quintana Morales, P., Ravelo García, A., Guerra Moreno, I., Toledano, D.T. (eds.) IberSPEECH 2014. LNCS, vol. 8854, pp. 239–248. Springer, Heidelberg (2014)

    Google Scholar 

Download references

Acknowledgements

Thanks to Nuno Carvalho for the proofreading. This work has been partially supported by FCT - Fundação para a Ciência e Tecnologia within the Project Scope UID/CEC/00319/2013.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alberto Simões .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Simões, A., Almeida, J.J. (2015). Experiments on Enlarging a Lexical Ontology. In: Sierra-Rodríguez, JL., Leal, JP., Simões, A. (eds) Languages, Applications and Technologies. SLATE 2015. Communications in Computer and Information Science, vol 563. Springer, Cham. https://doi.org/10.1007/978-3-319-27653-3_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27653-3_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27652-6

  • Online ISBN: 978-3-319-27653-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics