Skip to main content

Using Query Expansion for Cross-Lingual Mathematical Terminology Extraction

Part of the Advances in Intelligent Systems and Computing book series (AISC,volume 764)

Abstract

The paper presents approach to knowledge discovery by using query expansion to search for cross-lingual mathematical terminology extraction. It employs information retrieval and statistically-based techniques to extract and process keyword collocations in large comparable cross-lingual web electronic text corpora in the domain of mathematics in Bulgarian and in Serbian language. It, also, offers examples and survey of used techniques for semantic search and clustering by comparing keyword collocations to build a cross-lingual thesauri. The results of semantic keyword search for the two web electronic text corpora using Sketch Engine software are presented and analyzed with respect to the types of keyword collocations processing and to multilingual application of the approach.

Keywords

  • Serbian Language
  • Keywords Collocation
  • Semantic Keyword Search
  • Sketch Engine (SE)
  • Basic Statistical Techniques

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/978-3-319-91189-2_16
  • Chapter length: 11 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
eBook
USD   219.00
Price excludes VAT (USA)
  • ISBN: 978-3-319-91189-2
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book
USD   279.99
Price excludes VAT (USA)
Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.

References

  1. Azzopardi, J., et al.: Back to the sketch-board: integrating keyword search, semantics, and information retrieval. In: Cali, A., Gorgan, D., Ugarte, M. (eds.) Semantic Keyword-Based Search on Structured Data Sources, KEYSTONE 2016. LNCS, vol. 10151, pp. 49–61. Springer (2017)

    CrossRef  Google Scholar 

  2. Baroni, M., Lenci, A.: Distributional memory: a general framework for corpus-based semantics. Comput. Linguist. 36(4), 673–721 (2010)

    CrossRef  Google Scholar 

  3. Bova, V., et al.: The combined method of semantic similarity estimation of problem oriented knowledge on the basis of evolutionary procedures. In: Artificial Intelligence Trends in Intelligent Systems - Proceedings of the 6th Computer Science On-line Conference 2017 (CSOC 2017), vol. 1. Advances in Intelligent Systems and Computing (AISC), vol. 573, pp. 74–83. Springer (2017)

    Google Scholar 

  4. Killgarriff, A., et al.: The sketch engine: ten years on. Lexicography 1, 17–36 (2014)

    CrossRef  Google Scholar 

  5. Levy, O., Goldberg, Y.: Linguistic regularities in sparse and explicit word representations. In: Proceedings of the Eighteenth Conference on Computational Natural Language Learning, pp. 171–180, Baltimore (2014)

    Google Scholar 

  6. Novitskiy, V.: Automatic retrieval of parallel collocations for translation purposes. In: Kuznetsov, S.O., Mandal, D.P., Kundu, M.K., Pal, S.K. (eds.) Pattern Recognition and Machine Intelligence, PReMI 2011. LNCS, vol. 6744. Springer, pp. 261–267 (2011)

    Google Scholar 

  7. Orliac, B.: Extracting specialized collocations using lexical functions. In: Granger, S., Mennier, F. (eds.) Phreseology: An Interdisciplinary Perspective, pp. 377–390 (2008)

    Google Scholar 

  8. Nguyen, D., et al.: WikiTranslate: query translation for cross-lingual information retrieval using only Wikipedia. In: Peters, C., et al. (eds.) Evaluating Systems for Multilingual and Multimodal Information Access, CLEF 2008. LNCS, vol. 5706, pp. 58–65. Springer (2009)

    Google Scholar 

  9. Seretan, V., Wehrli, E.: Accurate collocation extraction using a multilingual parser. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, pp. 953–960 (2006)

    Google Scholar 

  10. Stankovic, R., et al.: Developing termbases for expert terminology under the TBX standards. In: Pavlovic-Lazetic, G., Krstev, C., Obradovic, I., Vitas, D. (eds.) Natural Language Processing for Serbian – Resources and Applications, pp. 12–26, University of Belgrade, Faculty of Mathematics (2014)

    Google Scholar 

  11. Stankovic, R., et al.: Keyword-based search on bilingual digital libraries. In: Cali, A., Gorgan, D., Ugarte, M. (eds.) Semantic Keyword-Based Search on Structured Data Sources, KEYSTONE 2016. LNCS, vol. 10151, pp. 112–123. Springer (2016)

    CrossRef  Google Scholar 

  12. Stoykova, V., Mitkova, M.: Conceptual semantic relationships for terms of Precalculus study. WSEAS Trans. Adv. Eng. Educ. 8(1), 13–22 (2011)

    Google Scholar 

  13. Stoykova, V.: Using statistical search to discover semantic relations of political Lexica - evidences from Bulgarian-Slovak EUROPARL 7 corpus. In: Kotsireas, I., Rump, S., Yap, C. (eds.), Mathematical Aspects of Computer and Information Sciences. LNCS, vol. 9582, pp. 335–339. Springer (2016)

    CrossRef  Google Scholar 

  14. Stoykova, V.: Discovering distributional Thesauri semantic relations. In: Proceedings of the 3rd International Workshop on Knowledge Discovery on the Web, CEUR-WS, Cagliari, Italy (2017). http://ceur-ws.org/Vol-1959/paper-11.pdf

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Velislava Stoykova .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2019 Springer International Publishing AG, part of Springer Nature

About this paper

Verify currency and authenticity via CrossMark

Cite this paper

Stoykova, V., Stankovic, R. (2019). Using Query Expansion for Cross-Lingual Mathematical Terminology Extraction. In: Silhavy, R. (eds) Artificial Intelligence and Algorithms in Intelligent Systems. CSOC2018 2018. Advances in Intelligent Systems and Computing, vol 764. Springer, Cham. https://doi.org/10.1007/978-3-319-91189-2_16

Download citation