Abstract
The paper presents an approach to knowledge discovery that uses query expansion for cross-lingual mathematical terminology extraction. It employs information retrieval and statistical techniques to extract and process keyword collocations from large comparable web electronic text corpora in the domain of mathematics in Bulgarian and Serbian. It also surveys, with examples, the techniques used for semantic search and clustering, in which keyword collocations are compared to build a cross-lingual thesaurus. The results of semantic keyword search over the two web electronic text corpora, obtained with the Sketch Engine software, are presented and analyzed with respect to the types of keyword collocation processing and the multilingual applicability of the approach.
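As a rough illustration of statistical collocation extraction, the sketch below ranks adjacent word pairs by logDice, the association score used in Sketch Engine's word sketches. The abstract does not say which statistic the authors applied, so the choice of logDice is an assumption. The function names (`logdice`, `score_bigrams`) and the toy token list are hypothetical and are not taken from the paper or its corpora.

```python
import math
from collections import Counter


def logdice(pair_count, left_count, right_count):
    """logDice association score: 14 + log2(2 * f_xy / (f_x + f_y))."""
    return 14 + math.log2(2 * pair_count / (left_count + right_count))


def score_bigrams(tokens, min_count=1):
    """Rank adjacent word pairs as collocation candidates by logDice."""
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    scores = {
        pair: logdice(count, unigrams[pair[0]], unigrams[pair[1]])
        for pair, count in bigrams.items()
        if count >= min_count  # drop pairs seen too rarely to be reliable
    }
    # Highest-scoring candidates first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Toy, made-up token stream for illustration only (not data from the paper).
tokens = "linear equation linear equation linear algebra quadratic equation".split()
ranked = score_bigrams(tokens, min_count=2)
for pair, score in ranked:
    print(" ".join(pair), round(score, 3))
```

In a cross-lingual setting, one such ranked list would be produced per language, and the lists would then be compared to propose candidate term pairs. That comparison step is not sketched here.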
Copyright information
© 2019 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Stoykova, V., Stankovic, R. (2019). Using Query Expansion for Cross-Lingual Mathematical Terminology Extraction. In: Silhavy, R. (eds) Artificial Intelligence and Algorithms in Intelligent Systems. CSOC2018 2018. Advances in Intelligent Systems and Computing, vol 764. Springer, Cham. https://doi.org/10.1007/978-3-319-91189-2_16
DOI: https://doi.org/10.1007/978-3-319-91189-2_16
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91188-5
Online ISBN: 978-3-319-91189-2
eBook Packages: Intelligent Technologies and Robotics (R0)