Skip to main content

LEXIK. An Integrated System for Specialized Terminology

  • Conference paper
  • First Online:
Advances in Computational Intelligence (MICAI 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10061))

Included in the following conference series:

  • 1444 Accesses

Abstract

The paper presents LEXIK, an intelligent terminological architecture that is able to efficiently obtain specialized lexical resources for elaborating dictionaries and providing lexical support for different expert tasks. LEXIK is designed as a powerful tool to create a rich knowledge base for lexicography. It will process big amounts of data in a modular system, that combines several applications and techniques for terminology extraction, definition generation, example extraction and term banks, that have been partially developed so far. Such integration is a challenge for the area, which lacks an integrated system for extracting and defining terms from a non-preprocessed corpus.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.eurotermbank.com.

  2. 2.

    http://www.btb.termiumplus.gc.ca/.

  3. 3.

    http://www.uva.fi/en/sites/terminology/.

  4. 4.

    http://www.corpus.unam.mx/SABTEF/.

  5. 5.

    http://www.corpus.unam.mx/geco.

  6. 6.

    www.corpus.unam.mx/termext.

  7. 7.

    http://www.describe.com.mx.

  8. 8.

    http://www.corpus.unam.mx/SABTEF.

References

  1. Alarcón, R., Sierra, G., Bach, C.: ECODE: a definition extraction system. In: Vetulani, Z., Uszkoreit, H. (eds.) LTC 2007. LNCS, vol. 5603, pp. 382–391. Springer, Heidelberg (2009). doi:10.1007/978-3-642-04235-5_33

    Chapter  Google Scholar 

  2. Alarcón, R., Sierra, G., Bach, C.: Description and evaluation of a definition extraction system for Spanish language. In: 1st Workshop on Definition Extraction 2009, Borovets, Bulgaria, pp. 7–13 (2009)

    Google Scholar 

  3. Arroyo-Fernández, I.: Learning kernels for semantic clustering: a deep approach. In: NAACL-HLT 2015 Student Research Workshop (SRW), pp. 79–87 (2015)

    Google Scholar 

  4. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. ACM Press Addison-Wesley, New York (1999)

    Google Scholar 

  5. Barrón-Cedeño, A., Sierra, G., Drouin, P., Ananiadou, S.: An improved automatic term recognition method for Spanish. In: Gelbukh, A. (ed.) CICLing 2009. LNCS, vol. 5449, pp. 125–136. Springer, Heidelberg (2009). doi:10.1007/978-3-642-00382-0_10

    Chapter  Google Scholar 

  6. Bourigault, D.: LEXTER, un Logiciel d’EXtraction de Terminologie. Application à l’acquisition des connaissances à partir de textes. Ph.D. tesis, Ecole des Hautes Etudes en Sciences Sociales, Paris (1994)

    Google Scholar 

  7. Cabré, T.: La terminología: representación y comunicación. Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra, Elementos para una teoría de base comunicativa y otros artículos, Barcelona (1999)

    Google Scholar 

  8. Cabré, M.T., Estopá, R., Vivaldi, J.: Automatic Term Detection: A Review of Current Systems. Recent Advances in Computational Terminology, pp. 53–88. John Benjamins Publishing Company, Amsterdam/Philadelphia (2001)

    Google Scholar 

  9. Cabré, T.: El principio de poliedricidad: la articulación de lo discursivo, lo cognitivo y lo lingüístico en terminología (I), pp. 9–36, Ibérica (2008)

    Google Scholar 

  10. da Silva Conrado, M., Pardo, T.A.S., Rezende, S.O.A.: Machine learning approach to automatic term extraction using a rich feature set. In: HLT-NAACL, pp. 16–23 (2013)

    Google Scholar 

  11. David, S., Plante, P.: Le progiciel TERMINO: de la nécessité d’une analyse morphosyntaxique pour le dépouillement terminologique des textes. In: Proceedings of the Montreal Colloquium Les industries de la langue: perspectives des années 1990, vol. 1, pp. 71–88 (1991)

    Google Scholar 

  12. Drouin, P.: Acquisition automatique de termes: simuler le travail du terminologue. Ela. Etudes de linguistique appliquée 4, 417–427 (2015)

    Google Scholar 

  13. Drouin, P., Ladouceur, J.: Lidentification automatique de descripteurs complexes dans des textes de spcialité. In: Proceedings of the Workshop on Compound Nouns: Multilingual Aspects of Nominal Composition, pp. 8–28 (1994)

    Google Scholar 

  14. Enguehard, C., Pantera, L.: Automatic natural acquisition of a terminology. J. Quant. Linguist. 2(1), 27–32 (1994)

    Article  Google Scholar 

  15. Espinosa-Anke, L., Carlini, R., Saggion, H., Ronzano, F.: A semi supervised definition extraction tool (2016). arXiv:1606.02514

  16. Fernández, S., SanJuan, E., Torres-Moreno, J.M.: Textual energy of associative memories: performant applications of enertex algorithm in text summarization and topic segmentation. In: Gelbukh, A., Kuri Morales, Á.F. (eds.) MICAI 2007. LNCS, vol. 4827, pp. 861–871. Springer, Heidelberg (2007). doi:10.1007/978-3-540-76631-5_82

    Chapter  Google Scholar 

  17. Frantzi, K., Ananiadou, S.: Statistical measures for terminological extraction. Working paper of the Department of Computing of Manchester Metropolitan University (1995)

    Google Scholar 

  18. Kilgarriff, A., Husák, M., McAdam, K., Rundell, M., Rychly, P.: GDEX automatically finding good dictionary examples in a corpus. In: Proceedings of the XIII EURALEX International Congress (2008)

    Google Scholar 

  19. Klavans, J.L., Muresan, S.: Evaluation of the DEFINDER system for fully automatic glossary construction. In: Proceedings of the AMIA Symposium, pp. 324–328 (2001)

    Google Scholar 

  20. Kuguel, I.: La activación del significado especializado. In: Lorente, M., Estopá, R., Freixa, J., Martí, J., Tebé, C. (eds.) Estudis de lingüística aplicada en honor de M. Teresa Cabré Castellv. Series del Institut Universitari de Lingüística Aplicada, Barcelona (2007)

    Google Scholar 

  21. Lázaro, J.A.: El ejemplo en terminología, caracterización y extracción automática. Ph.D. thesis, Universitat Pompeu Fabra (2015)

    Google Scholar 

  22. Leu, F-Y., Ko, C.C.: An automated term definition extraction using the web corpus in Chinese language. In: International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2007. IEEE (2007)

    Google Scholar 

  23. Molina, A.: Agrupamiento automático de contextos definitorios. Master thesis, UNAM (2009)

    Google Scholar 

  24. Pantel, P., Lin, D.: A statistical corpus-based term extractor. In: En 14th Biennial Conference of the Canadian Society for Computational Studies of Intelligence, Ottawa, Canada (2001)

    Google Scholar 

  25. Sierra, G., Alarcón, R., Molina, A., Aldana, E.: Web exploitation for definition extraction. En Chávez, E., Morán, E.F.A. (eds.) 2009 Latin American Web Congress, pp. 217–223. IEEE, Los Alamitos (2009). ISBN 978-0-7695-3856-3

    Google Scholar 

  26. Sierra, G., Pozzi, M., Torres Moreno, J.M.: 1st International Workshop on Definition Extraction, Borovets, Bulgaria, 18 de septiembre (2009)

    Google Scholar 

  27. Sparck-Jones, K.: A statistical interpretation of term specificity and its application in retrieval. J. Doc. 28(1), 11–21 (1972)

    Article  Google Scholar 

  28. Sørensen, T.: A method of establishing groups of equal amplitude in plant sociology based on similarity of species content and its application to analyses of the vegetation on Danish commons. Biol. Skr. 5, 1–34 (1948). Denmark

    Google Scholar 

  29. Velardi, P., Navigli, R., D’Amadio, P.: Mining the web to create specialized glossaries. IEEE Intell. Syst. 5, 18–25 (2008)

    Article  Google Scholar 

  30. Vivaldi, J.: Extracción de candidatos a término mediante combinación de estrategias heterogéneas. Ph.D. thesis, Universidad Politécnica de Catalua, Barcelona (2001)

    Google Scholar 

  31. Vivaldi, J., Rodríguez, H.: Using Wikipedia for term extraction in the biomedical domain: first experiences. Procesamiento del Lenguaje Natural 45, 251–254 (2010)

    Google Scholar 

  32. Ward Church, K., Hanks, P.: Word association mutual information and lexicography. Comput. Linguist. 16(1), 22–29 (1990)

    Google Scholar 

Download references

Acknowledgments

LEXIK is the integration of different tools created at the Language Engineering Group, UNAM, by several undergraduate and postgraduate students. An acknowledgment to all of them. Also to DGAPA, UNAM by the funds through the projects PAPIIT IN403016 and PAPIIT AN400117, and to CONACYT by the funds through the project Ciencia Básica 178248.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jorge Lázaro .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Sierra, G., Lázaro, J., Bel-Enguix, G. (2017). LEXIK. An Integrated System for Specialized Terminology. In: Sidorov, G., Herrera-Alcántara, O. (eds) Advances in Computational Intelligence. MICAI 2016. Lecture Notes in Computer Science(), vol 10061. Springer, Cham. https://doi.org/10.1007/978-3-319-62434-1_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-62434-1_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-62433-4

  • Online ISBN: 978-3-319-62434-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics