Comparison of Different Part-of-Speech Tagging Techniques for Mongolian

Lkhagvasuren, Ganchimeg; Rentsendorj, Javkhlan; Bukhsuren, Enkhtuul; Namsrai, Oyun-Erdene

doi:10.1007/978-981-99-0605-5_9

Ganchimeg Lkhagvasuren⁶,
Javkhlan Rentsendorj⁶,
Enkhtuul Bukhsuren⁶ &
…
Oyun-Erdene Namsrai⁶

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 341))

Included in the following conference series:

International Conference on Intelligent Information Hiding and Multimedia Signal Processing

137 Accesses

Abstract

In this paper, we presented two POS taggers for Mongolian, namely Neural Networks—Multilayer Perceptron and Hidden Markov Model with Viterbi. The accuracy of the former tagger is 95.6%, whereas the latter is 85.6%. Also, we compared the performance of our taggers with the previous works. The Comparison shows that the Neural Network tagger performs better for Mongolian POS tagging than other approaches. Our dataset consists of about 5000 sentences and includes almost 100,000 words for training and testing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Hardcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Jaimai, P., Chimeddorj, O.: Part of speech tagging for mongolian corpus. 09, 103–106 (2009)
Google Scholar
Zoljargal Munkhjargal, P.J.: Mongolian Trigram Part-of-Speech Tagger, pp. 161–163 (2011)
Google Scholar
A.K.: Part of Speech Tagging Experiments on Mongolian Language. ICEIC 76 (2013)
Google Scholar
Lkhagvasuren, G., Rentsendorj, J. In: Open Information Extraction for Mongolian Language, pp. 299–304 (2020)
Google Scholar
Helmut, S.: In: Improvements in Part-of-Speech Tagging with an Application to German, pp. 13–25. Springer, Netherlands, Dordrecht (1999)
Google Scholar
Khreich, W., Granger, E., Miri, A., Sabourin, R.: A survey of techniques for incremental learning of hmm parameters. Inf. Sci. 197, 105–130 (2012)
Google Scholar
Kupiec, J.: Robust part-of-speech tagging using a hidden markov model. Comput. Speech Lang. 6(3), 225–242 (1992)
Google Scholar
Thede, S.M., Harper, M.: A second-order hidden markov model for part-of-speech tagging. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, pp. 175–182 (1999)
Google Scholar
Al Shamsi, F., Guessoum, A.: A hidden markov model-based pos tagger for arabic. In: Proceeding of the 8th International Conference on the Statistical Analysis of Textual Data, pp. 31–42. France (2006)
Google Scholar
Kumawat, D., Jain, V.: Pos tagging approaches: a comparison. Int. J. Comput. Appl. 118, 32–38 (2015)
Google Scholar
Meftah, S., Semmar, N.: A neural network model for part-of-speech tagging of social media texts. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). Miyazaki, Japan, European Language Resources Association (ELRA) (2018)
Google Scholar
Chiu, J.P., Nichols, E.: Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguist. 4, 357–370 (2016)
Google Scholar
Ma, X., Hovy, E.: End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1064–1074. Berlin, Germany, Association for Computational Linguistics (2016)
Google Scholar
Józefowicz, R., Vinyals, O., Schuster, M., Shazeer, N., Wu, Y.: Exploring the limits of language modeling. CoRR (2016). ArXiv:abs/1602.02410
Nyamdavaa, O.: Mongolian syntactic annotation for parser development. Master’s thesis, National University of Mongolia, Mongolia (2016)
Google Scholar
Marcus, M.P., Marcinkiewicz, M.A., Santorini, B.: Building a large annotated corpus of english: The penn treebank. Comput. Linguist. 19(2), 313–330 (1993)
Google Scholar

Download references

Author information

Authors and Affiliations

National University of Mongolia, Ulan Bator, Mongolia
Ganchimeg Lkhagvasuren, Javkhlan Rentsendorj, Enkhtuul Bukhsuren & Oyun-Erdene Namsrai

Authors

Ganchimeg Lkhagvasuren
View author publications
You can also search for this author in PubMed Google Scholar
Javkhlan Rentsendorj
View author publications
You can also search for this author in PubMed Google Scholar
Enkhtuul Bukhsuren
View author publications
You can also search for this author in PubMed Google Scholar
Oyun-Erdene Namsrai
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Oyun-Erdene Namsrai .

Editor information

Editors and Affiliations

Fujian University of Technology, Fuzhou, China
Shaowei Weng
Department of Electronic Engineering, National Kaohsiung University of Science and Technology, Kaohsiung, Taiwan
Chin-Shiuh Shieh
Department of Informatics, University of Piraeus, Pireas, Greece
George A. Tsihrintzis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lkhagvasuren, G., Rentsendorj, J., Bukhsuren, E., Namsrai, OE. (2023). Comparison of Different Part-of-Speech Tagging Techniques for Mongolian. In: Weng, S., Shieh, CS., Tsihrintzis, G.A. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. IIHMSP 2022. Smart Innovation, Systems and Technologies, vol 341. Springer, Singapore. https://doi.org/10.1007/978-981-99-0605-5_9

Download citation

DOI: https://doi.org/10.1007/978-981-99-0605-5_9
Published: 20 July 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-0604-8
Online ISBN: 978-981-99-0605-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics