Named Entity Recognition in Russian Using Multi-Task LSTM-CRF

Mazitov, D.; Alimova, I.; Tutubalina, E.

doi:10.1007/s10958-023-06521-y

Published: 22 June 2023

Volume 273, pages 595–604, (2023)

Journal of Mathematical Sciences

Named entity recognition (NER) aims to extract important information from unstructured natural-language text. In this work, we investigate the effectiveness of modern multi-task NER approaches on Russian-language corpora, using several NER datasets together with a dataset of part-of-speech (POS) tags. We apply a state-of-the-art neural architecture based on bidirectional LSTMs and conditional random fields, with convolutional neural networks used to learn character-level features. We carry out an extensive experimental evaluation on three standard datasets of news articles written in Russian. The proposed multi-task model achieves state-of-the-art results, with an F1 score of 88.04% on Gareev’s dataset and an F1 score of 99.49% on the Person-1000 dataset.
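To make the role of the conditional random field layer more concrete, the following is a minimal sketch of Viterbi decoding for a linear-chain CRF, written in plain Python. It is not the authors' implementation: the function name `viterbi_decode`, the tag set, and all scores are illustrative assumptions. In an LSTM-CRF tagger the per-token emission scores would typically come from the bidirectional LSTM, and the transition scores would be learned by the CRF layer.

```python
from typing import Dict, List, Tuple


def viterbi_decode(
    emissions: List[Dict[str, float]],
    transitions: Dict[Tuple[str, str], float],
    tags: List[str],
) -> List[str]:
    """Return the highest-scoring tag sequence for one sentence.

    emissions[t][tag] is a per-token score (hypothetical values here);
    transitions[(prev, cur)] is a tag-to-tag score, defaulting to 0.0.
    """
    if not emissions:
        return []
    # best[t][tag] = best score of any path that ends in `tag` at position t
    best = [{tag: emissions[0][tag] for tag in tags}]
    back: List[Dict[str, str]] = []
    for t in range(1, len(emissions)):
        scores: Dict[str, float] = {}
        pointers: Dict[str, str] = {}
        for cur in tags:
            # Pick the previous tag that maximizes path score plus transition.
            prev = max(
                tags,
                key=lambda p: best[-1][p] + transitions.get((p, cur), 0.0),
            )
            scores[cur] = (
                best[-1][prev]
                + transitions.get((prev, cur), 0.0)
                + emissions[t][cur]
            )
            pointers[cur] = prev
        best.append(scores)
        back.append(pointers)
    # Follow back-pointers from the best final tag to recover the path.
    path = [max(tags, key=lambda tag: best[-1][tag])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]


# Illustrative scores only: the first token slightly prefers "O", but the
# penalty on the invalid O -> I-PER transition changes the decoded sequence.
tags = ["O", "B-PER", "I-PER"]
emissions = [
    {"O": 2.0, "B-PER": 1.0, "I-PER": 0.0},
    {"O": 0.0, "B-PER": 0.5, "I-PER": 1.8},
]
transitions = {("O", "I-PER"): -10.0}
print(viterbi_decode(emissions, transitions, tags))  # ['B-PER', 'I-PER']
```

The usage example shows why a CRF layer can help over independent per-token predictions: taking the best tag at each position separately would yield the invalid sequence O, I-PER, whereas joint decoding with transition scores returns B-PER, I-PER.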

References

M. Abadi, P. Barham, J. Chen, Z. Chen, A. Davis, J. Dean, M. Devin, S. Ghemawat, G. Irving, M. Isard, et al., “Tensorflow: a system for large-scale machine learning,” OSDI, 16, 265–283 (2016).
C. Adak, B. B. Chaudhuri, and M. Blumenstein, “Named entity recognition from unstructured handwritten document images,” in: 12th IAPR Workshop on Document Analysis Systems (DAS) (2016), pp. 375–380.
L. T. Anh, M. Y. Arkhipov, and M. S. Burtsev, “Application of a hybrid bi-LSTM-CRF model to the task of Russian named entity recognition,” in: Communications in Computer and Information Science Book Series – CCIS, Vol. 789 (2017).
L. T. Anh, M. Y. Arkhipov, and M. S. Burtsev, “Application of a hybrid bi-LSTM-CRF model to the task of Russian named entity recognition,” arXiv preprint arXiv:1709.09686 (2017).
A. Y. Antonova and A. N. Soloviev, “Conditional random field models for the processing of Russian,” Communications of the ACM, 56, No. 6 (2013).
M. Y. Arkhipov, M. S. Burtsev, and L. T. Anh, “Application of a hybrid bi-LSTM-CRF model to the task of Russian named entity recognition,” in: Conference on Artificial Intelligence and Natural Language, Springer, Cham (2017).
M. M. Brykina, A. V. Faynveyts, and S. Yu. Toldova, “Dictionary-based ambiguity resolution in Russian named entities recognition,” in: International Workshop on Computational Linguistics and its Applications (ed. A. Narin’yani), Vol. 1 (2013).
R. Chalapathy, E. Z. Borzeshi, and M. Piccardi, “Bidirectional LSTM-CRF for clinical concept extraction,” arXiv preprint arXiv:1611.08373 (2016).
J. P. C. Chiu and E. Nichols, “Named entity recognition with bidirectional LSTM-CNNs,” Transactions of the Association for Computational Linguistics, 4, 357–370 (2016).
L. G. Craidlin, “Program of allocation of Russian individualized nominal groups taglite,” Computational Linguistics and Intellectual Technologies Dialog (2005).
D. Kingma and J. Ba, “Adam: A method for stochastic optimization,” in: 3rd International Conference for Learning Representations, San Diego (2014).
C. Dong, J. Zhang, C. Zong, M. Hattori, and H. Di, “Character-based LSTM-CRF with radical-level features for Chinese named entity recognition,” in: Natural Language Understanding and Intelligent Applications, Springer (2016), pp. 239–250.
R. Gareev, M. Tkachenko, V. Solovyev, A. Simanovsky, and V. Ivanov, “Introducing baselines for Russian named entity recognition,” in: Computational Linguistics and Intelligent Text Processing (2013).
A. Graves, S. Fernández, and J. Schmidhuber, “Bidirectional LSTM networks for improved phoneme classification and recognition,” in: Artificial Neural Networks: Formal Models and Their Applications – ICANN (2005).
K. Greff, R. K. Srivastava, J. Koutnik, B. R. Steunebrink, and J. Schmidhuber, “LSTM: A search space odyssey,” IEEE Trans. Neural Netw. Learn. Syst., doi: https://doi.org/10.1109/TNNLS.2016.2582924 (2016).
Z. Huang, W. Xu, and K. Yu, “Bidirectional LSTM-CRF models for sequence tagging,” arXiv preprint arXiv:1508.01991 (2015).
Kaggle, Predict Russian Universal Dependencies POS Tags (2017).
G. Konoplich, E. Putin, A. Filchenkov, and R. Rybka, “Named entity recognition in Russian with word representation learned by a bidirectional language model,” AINL (2018).
G. Konoplich, E. Putin, A. Filchenkov, and R. Rybka, “Named entity recognition in Russian with word representation learned by a bidirectional language model,” in: Conference on Artificial Intelligence and Natural Language, Springer (2018), pp. 48–58.
J. Lafferty, A. McCallum, and F. Pereira, “Conditional random fields: Probabilistic models for segmenting and labeling sequence data,” in: Proc. 18th International Conference on Machine Learning (2001).
G. Lample, M. Ballesteros, S. Subramanian, K. Kawakami, and C. Dyer, “Neural architectures for named entity recognition,” in: Proc. 2016 NAACL (2016), pp. 260–270.
X. Ma and E. Hovy, “End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF,” arXiv preprint arXiv:1603.01354 (2016).
V. Malykh and A. Ozerin, “Reproducing Russian NER baseline quality without additional data,” CDUD@CLA (2016), pp. 54–59.
S. Misawa, M. Taniguchi, Y. Miura, and T. Ohkuma, “Character-based bidirectional LSTM-CRF with words and characters for Japanese named entity recognition,” in: Proc. 1st Workshop on Subword and Character Level Models in NLP (2017), pp. 97–102.
V. Mozharova and N. Loukachevitch, “Two-stage approach in Russian named entity recognition,” in: Proc. 2016 International FRUCT Conference on Intelligence, Social Media and Web (ISMW FRUCT), IEEE (2016), pp. 1–6.
A. V. Podobryaev, “Searching for person memories in news texts with the use of a model of conditional random fields,” RCDL (2013).
B. Popov, A. Kiryakov, D. Ognyanoff, D. Manov, and A. Kirilov, “Kim – a semantic platform for information extraction and retrieval,” J. Natural Language Engineering, 10 (2004).
R. M. Zavala, P. Martinez, and I. Segura-Bedmar, “A hybrid bi-LSTM-CRF model for knowledge recognition from eHealth documents,” TASS 2018: Workshop on Semantic Analysis at SEPLN (2018), pp. 65–70.
R. Ivanitskiy, A. Shipilo, and L. Kovriguina, “Russian named entities recognition and classification using distributed word and phrase representations,” SIMBig (2016).
A. V. Rubaylo and M. Y. Kosenko, “Software utilities for natural language information retrieval,” in: Almanac of Modern Science and Education, Vol. 12 (2016).
E. Sheng, S. Miller, J. S. Ambite, and P. Natarajan, “A neural named entity recognition approach to biological entity identification,” in: Proc. BioCreative VI Workshop (2017), pp. 24–27.
A. S. Starostin, V. V. Bocharov, S. V. Alexeeva, A. Bodrova, A. S. Chuchunkov, S. S. Dzhumaev, and M. A. Nikolaeva, “Evaluation of named entity recognition and fact extraction systems for Russian,” in: Annual International Conference Dialogue (2016).
A. A. Sysoev and I. A. Andrianov, “Named entity recognition in Russian: the power of wiki-based approach,” in: Proc. International Conference Dialogue (2016), pp. 746–755.
I. V. Trofimov, “Person name recognition in news articles based on the persons-1000/1111-f collections,” in: 16th All-Russian Scientific Conference Digital Libraries: Advanced Methods and Technologies, Digital Collections, RCDL (2014), pp. 217–221.
E. Tutubalina and S. Nikolenko, “Combination of deep recurrent neural networks and conditional random fields for extracting adverse drug reactions from user reviews,” J. Healthcare Engineering, 2017, Article ID 9451342 (2017).
N. A. Vlasova, E. A. Suleymanova, and I. V. Trofimov, “Report on Russian corpus for personal name retrieval,” in: Proceedings of Computational and Cognitive Linguistics TEL (2014).
Q. Wei, T. Chen, R. Xu, Y. He, and L. Gui, “Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks,” Database 2016 (2016).

Author information

Authors and Affiliations

Kazan Federal University, Kazan, Russia
D. Mazitov, I. Alimova & E. Tutubalina

Corresponding author

Correspondence to D. Mazitov.

Additional information

Published in Zapiski Nauchnykh Seminarov POMI, Vol. 499, 2021, pp. 222–235.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Mazitov, D., Alimova, I. & Tutubalina, E. Named Entity Recognition in Russian Using Multi-Task LSTM-CRF. J Math Sci 273, 595–604 (2023). https://doi.org/10.1007/s10958-023-06521-y

Received: 14 January 2019
Published: 22 June 2023
Issue Date: July 2023
DOI: https://doi.org/10.1007/s10958-023-06521-y
