Skip to main content

GramCat and GramEsp: two grammars for chunking

  • Conference paper
Intelligent Information Processing and Web Mining

Part of the book series: Advances in Soft Computing ((AINSC,volume 31))

  • 859 Accesses

Abstract

In this article we present two grammars (GramCat and GramEsp) for chunking of unrestricted Catalan and Spanish texts. With these grammars we extend the classical notion of chunk as it is defined by Abney, taking advantage of Catalan and Spanish morphosyntactic features: Catalan and Spanish rich inflectional morphology and the high frequency of some prepositional patterns allow us to include both pre- and post-nominal modifiers in the noun phrase.

The work presented here was partially funded by the Xtract2 project (Platform of Linguistic Engineering resources BFF2002-04226-C03-03).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abney, S.: Parsing by Chunks. Principle-Based Parsing (1991)

    Google Scholar 

  2. Abney, S.: Partial Parsing via Finite-State Cascades. Proceedings of the ESSLI’96 Robust Parsing Workshop (1996)

    Google Scholar 

  3. Arévalo, M., Civit, M., Martí, M.A: MICE: A Module for Named Entity Recognition and Classification. International Journal of Corpus Linguistics Volume 9, Number 1, (2004). John Benjamins

    Google Scholar 

  4. Atserias, J., Rodrǵuez, H.: TACAT: TAgged Corpus Text Analyser. Technical Report, Software Department, UPC (1998)

    Google Scholar 

  5. Bosque, I. Demonte, V.: Gramática Descriptiva de la Lengua Española. Espasa-Calpe (1999)

    Google Scholar 

  6. Civit, M., Martí, M.A.: Design Principles for a Spanish Treebank. Proceedings of the First Workshop on Treebanks and Linguistics Theories (TLT2002). Sozopol, Bulgaria (2002), 61–77

    Google Scholar 

  7. Civit, M.: Criterios de etiquetación y desambiguación morfosintáactica de corpus en español. Sociedad Española para el Procesamiento del Lenguaje Natural. Colección monografías. 3. (2003)

    Google Scholar 

  8. Civit, M., Bufí, N., Valverde, M.P.: CAT3LB: a Treebank for Catalan with Word Sense Annotation. 3rd Workshop on Treebanks and Linguistic Theories. Tuebingen, Germany (2004)

    Google Scholar 

  9. Gala, N.: Using the Incremental Finite State Architecture to create a Spanish Shallow Parser. SEPLN, Proceedings of the 15th Conference of the SEPLN Lleida (1999), 75–82

    Google Scholar 

  10. Gelbukh, A., Sidorov, G. Galicia-Haro, S., Bolsharov, I.: Environment for Development of a Natural Language Syntactic Analyzer. Acta Academia (2002)

    Google Scholar 

  11. Kermes, H., Evert, S.: Text analysis meets corpus linguistics. Corpus Linguistics (2003) 402–411

    Google Scholar 

  12. Moreno, A., Grishman, R., López, S., Sánchez, F., Sekine, S.: A Treebank of Spanish and its Application to Parsing. Procedings of the Second Conference on Language Resources and Evaluation (LREC) (2000) 107–111

    Google Scholar 

  13. Sebastián, N., Martí, M.A., Carreiras, M.F., Cuetos, F.: LEXESP: Léxico Informatizado del Español. Edicions de la Universitat de Barcelona, (2000)

    Google Scholar 

  14. Solà, J., Lloret, M.R., Mascaró, J., Pérez, M.: Gramàtica del català contemporani. Empúries, (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Civit, M., Antònia Martí, M. (2005). GramCat and GramEsp: two grammars for chunking. In: Kłopotek, M.A., Wierzchoń, S.T., Trojanowski, K. (eds) Intelligent Information Processing and Web Mining. Advances in Soft Computing, vol 31. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-32392-9_17

Download citation

  • DOI: https://doi.org/10.1007/3-540-32392-9_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25056-2

  • Online ISBN: 978-3-540-32392-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics