Using Morphological, Syntactical, and Statistical Information for Automatic Term Acquisition

  • Conference paper
Advances in Natural Language Processing (PorTAL 2002)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2389)

Abstract

Terminologies are useful in every field that uses a specialized language. Building a terminology by hand is hard work, and tools can ease the task and improve its results. In this article, we present ATA, an automatic term extractor that uses both linguistic and statistical information.

This paper has been partially supported by the Fundação Para a Ciência e Tecnologia under project number PLUS/1999/LIN/15150.

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Paulo, J.L., Correia, M., Mamede, N.J., Hagège, C. (2002). Using Morphological, Syntactical, and Statistical Information for Automatic Term Acquisition. In: Ranchhod, E., Mamede, N.J. (eds) Advances in Natural Language Processing. PorTAL 2002. Lecture Notes in Computer Science, vol 2389. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45433-0_31

  • DOI: https://doi.org/10.1007/3-540-45433-0_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43829-8

  • Online ISBN: 978-3-540-45433-5

  • eBook Packages: Springer Book Archive
