A Multilingual Access Module to Legal Texts

  • Kiril SimovEmail author
  • Petya Osenova
  • Iliana Simova
  • Hristo KonstantinovEmail author
  • Tenyo Tyankov
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10791)


The paper introduces a Multilingual Access Module. This module translates the user’s legislation query from its source language into the target language, and retrieves the detected texts that match the query. The service is demonstrated in its potential for two languages – English and Bulgarian, in both directions (English-to-Bulgarian and Bulgarian-to-English). The module consists of two submodules: Ontology-based and Statistical Machine Translation. Since both proposed submodules have some drawbacks, they are used in an integrated architecture, thus profiting from each other.


Multilingual access Query translation Query expansion 


  1. 1.
    Agerri, R., Bermudez, J., Rigau, G.: IXA pipeline: efficient and ready to use multilingual NLP tools. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014) (2014)Google Scholar
  2. 2.
    Collins, M.: Discriminative training methods for hidden Markov models. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 1–8 (2002)Google Scholar
  3. 3.
    Koehn, P., Hoang, H.: Factored translation models. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 868–876 (2007)Google Scholar
  4. 4.
    Koehn, P., et al.: Moses: open source toolkit for statistical machine translation. In: Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, pp 177–180 (2007)Google Scholar
  5. 5.
    Och, F.J.: Minimum error rate training in statistical machine translation. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, vol. 1, pp. 160–167 (2003)Google Scholar
  6. 6.
    Simov, K., Osenova, P., Slavcheva, M.: BTB-TR03: BulTreeBank morphosyntactic tagset BTB-TS version 2.0 (2004)Google Scholar
  7. 7.
    Simov, K., Osenova, P.: Applying ontology-based lexicons to the semantic annotation of learning objects. In: Proceedings from the Workshop on NLP and Knowledge Representation for eLearning Environments, RANLP-2007, pp. 49–55 (2007)Google Scholar
  8. 8.
    Simov, K., Osenova, P.: Language resources and tools for ontology-based semantic annotation. In: Oltramari, A., Prévot, L., Huang, C.-R., Buitelaar, P., Vossen, P. (eds.) OntoLex 2008 Workshop at LREC 2008, pp. 9–13. Published by the European Language Resource Association ELRA (2008)Google Scholar
  9. 9.
    Simov, K., Peev, Z., Kouylekov, M., Simov, A., Dimitrov, M., Kiryakov, A.: CLaRK - an XML-based System for Corpora Development. In: Proceedings of the Corpus Linguistics 2001 Conference, 553–560 (2001)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  1. 1.Linguistic Modelling DepartmentIICT-BASSofiaBulgaria
  2. 2.APISSofiaBulgaria

Personalised recommendations