Skip to main content

Integrating the Lexicon-Grammar of Predicate Nouns with Support Verb fazer into Port4NooJ

  • Conference paper
  • First Online:
Formalizing Natural Languages with NooJ and Its Natural Language Processing Applications (NooJ 2017)

Abstract

This paper describes the ongoing process of integrating approximately 3,000 predicate nouns into Port4NooJ, the Portuguese module for NooJ. The integration of these resources enables us to further extend the paraphrastic capabilities of eSPERTo paraphrasing system developed in the scope of a project with the same name. The integrated predicate nouns co-occur with the support verb fazer (do or make) and their syntactic and distributional properties are formalized in lexicon-grammar tables. These lexicon-grammar tables resulted in a standalone dictionary of predicate noun constructions and a few new grammars that can be used in paraphrase analysis and generation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    Support verbs constructions with the verb fazer are used both in written texts and spoken language, occurring extensively in Portuguese sentences and noun phrases. Sentences with support verb constructions are more frequent than sentences with their equivalent verbs. In fact, Barreiro (2009) points out that from a search on all sentences of the COMPARA parallel corpus (Frankenberg-Garcia and Santos 2003; Santos and Inácio 2006) where the infinitive form of fazer occurs with a noun or with a left modifier and a noun, 47\(\%\) of the times the occurrence is in a support verb construction.

  2. 2.

    eSPERTo, which means ‘smart’ in Portuguese, stands for “System of Paraphrasing for Editing and Revision of Texts” (“Sistema de Parafraseamento para Edição e Revisão de Texto”).

  3. 3.

    Each entry in the lexicon-grammar can map to more than one existing Port4NooJ entry.

  4. 4.

    http://www.linguateca.pt/acesso/corpus.php?corpus=CETEMPUBLICO.

  5. 5.

    http://www.portaldalinguaportuguesa.org/main.html.

  6. 6.

    In order to properly look up for the nominal entries in Port4NooJ, we applied a simple rule that removes from words in Port4NooJ the first consonant in , ct and pt before comparing it to the nominal predicate in the lexicon-grammar. However, if there are derivation rules associated to the entry we still need to make sure that the rules do not introduce pre-agreement spelling.

  7. 7.

    For now, the table that formalizes medical technical terms was left out for a second stage.

References

  • Baptista, J.: Sintaxe dos predicados nominais com ‘ser de’. Fundação Calouste Gulbenkian, Fundação para a Ciência e a Tecnologia, Lisboa (2005)

    Google Scholar 

  • Barreiro, A.: Make it simple with paraphrases: automated paraphrasing for authoring aids and machine translation, Porto, Portugal (2009)

    Google Scholar 

  • Carvalho, P.: Análise e Representação de Construções Adjectivais para Processamento Automático de Texto. Adjectivos Intransitivos Humanos. Ph.D. thesis, Universidade de Lisboa, Lisboa, Portugal (2007)

    Google Scholar 

  • Casteleiro, J.M.: Sintaxe transformacional do adjetivo. INIC, Lisboa (1981)

    Google Scholar 

  • Chacoto, L.: O Verbo Fazer em Construções Nominais Predicativas. Ph.D. thesis, Universidade do Algarve, Faro, Portugal (2005)

    Google Scholar 

  • D’Agostino, E., Elia, A.: Il significato delle frasi: un continuum dalle frasi semplici alle forme polirematiche. In: AA. VV, Ai limiti del linguaggio, pp. 287–310. Laterza, Bari (1998)

    Google Scholar 

  • Frankenberg-Garcia, A., Santos, D.: Introducing COMPARA: the Portuguese - English parallel corpus. In: Zanettin, F., Bernardini, S., Stewart, D. (eds.) Corpora in Translator Education, pp. 71–87. St. Jerome, Manchester (2003)

    Google Scholar 

  • Gross, M.: Méthodes en syntaxe: régime des constructions complétives. Actualités scientifiques et industrielles. Hermann, Paris (1975)

    Google Scholar 

  • Laporte, E., Voyatzi, S.: An electronic dictionary of French multiword adverbs. In: Language Resources and Evaluation Conference, Workshop Towards a Shared Task for Multiword Expressions, pp. 31–34 (2008)

    Google Scholar 

  • Mota, C., Carvalho, P., Raposo, F., Barreiro, A.: Generating paraphrases of human intransitive adjective constructions with Port4NooJ. In: Okrut, T., Hetsevich, Y., Silberztein, M., Stanislavenka, H. (eds.) NooJ 2015. CCIS, vol. 607, pp. 107–122. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-42471-2_10

    Chapter  Google Scholar 

  • Ranchhod, E.: Sintaxe dos predicados nominais com estar. Linguística, INIC, Lisboa (1990)

    Google Scholar 

  • Salkoff, M.: Automatic translation of support verb constructions. In: Proceedings of 13th Conference on Computational Linguistics, COLING 1990, vol. 3, pp. 243–246. ACL, Helsinki (1990)

    Google Scholar 

  • Santos, D., Inácio, S.: Annotating COMPARA, a grammar-aware parallel corpus. In: Calzolari, N., et al. (eds.) Proceedings of 5th LREC, Genoa, Italy, pp. 1216–1221 (2006)

    Google Scholar 

  • Silberztein, M.: Les groupes nominaux productifs et les noms composés lexicalisés. Lingvisticæ Investigationes 17(2), 405–425 (1993)

    Article  Google Scholar 

  • Silberztein, M.: Formalizing Natural Languages: The NooJ Approach, p. 346. Wiley, Hoboken (2016)

    Google Scholar 

Download references

Acknowledgements

This research was supported by Fundação para a Ciência e Tecnologia (FCT), under exploratory project eSPERTo (Ref. EXPL/MHC-LIN/2260/2013). Anabela Barreiro was also funded by FCT through post-doctoral grant SFRH/BPD/91446 /2012. The authors would like to thank Max Silberztein for his prompt support and guidance with all matters related to NooJ.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anabela Barreiro .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mota, C., Chacoto, L., Barreiro, A. (2018). Integrating the Lexicon-Grammar of Predicate Nouns with Support Verb fazer into Port4NooJ. In: Mbarki, S., Mourchid, M., Silberztein, M. (eds) Formalizing Natural Languages with NooJ and Its Natural Language Processing Applications. NooJ 2017. Communications in Computer and Information Science, vol 811. Springer, Cham. https://doi.org/10.1007/978-3-319-73420-0_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-73420-0_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-73419-4

  • Online ISBN: 978-3-319-73420-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics