A Study on the Printed Uyghur Script Recognition Technique Using Word Visual Features

Meimaiti, Halimulati

doi:10.1007/978-3-319-97909-0_76

Halimulati Meimaiti^21,22

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10996))

Included in the following conference series:

Chinese Conference on Biometric Recognition

3051 Accesses

Abstract

This paper proposes a recognition technique which applies a combination of image processing and pattern recognition to visual features of individual words. Uyghur script is naturally cursive, and its characters have uneven width. Therefore, in image format, precisely cutting Uyghur words into characters is difficult. To avoid such problem, we use word models instead of character models. Besides, this technique does not need a large amount of training samples: prepared text samples are converted to image samples which are used to construct individual word models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ding, X., et al.: Character Recognition: Principles Methods and Practice. Tsinhua University Press (2017)
Google Scholar
Wang, H., Ding, X.: Multi-font multi-typeface printing Uyghur character recognition. J. Tsinghua Univ. 44(7), 946–949 (2004)
Google Scholar
Jin, J., Wang, H., Ding, X., Peng, L.: Printed Arabic document recognition system. In: DDR2005, pp. 48–55 (2005)
Google Scholar
Arzigul, H.: Research and development of multi-font printing Uyghur character recognition system. Chin. J. Comput. 11,1480–1484 (2003)
Google Scholar
Kadier, N., Peng, L.: A method of Uyghur and Arabic recognition based on HMM and statistical language model. Comput. Appl. Softw. 32(1), 171–174 (2015)
Google Scholar
Naz, S., et al.: The optical character recognition of Urdu-like cursive scripts. Pattern Recognit. 47(3), 1229–1248 (2014)
Article Google Scholar
Al-Shatnawi, A.M., et al.: Skeleton extraction: comparison of five methods on the Arabic IFN/ENIT database. In: 2014 6th International Conference on Computer Science and Information Technology (CSIT), pp. 50–59 (2014)
Google Scholar
Maqqor, A., et al.: Using HMM toolkit (HTK) for recognition of Arabic manuscripts characters. In: 2014 International Conference on Multimedia Computing and Systems (ICMCS) (2014)
Google Scholar
Ahmad, I., Fink, G.A., Mahmoud, S.A.: Improvements in sub-character HMM model based Arabic text recognition. In: 2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR) (2014)
Google Scholar
Jiang, Z., Ding, X., Peng, L., Liu, C.: Modified bootstrap approach with state number optimization for hidden markov model estimation in small-size printed arabic text line recognition. In: Perner, P. (ed.) MLDM 2014. LNCS (LNAI), vol. 8556, pp. 437–441. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-08979-9_33
Chapter Google Scholar
Ait-Mohand, K., Paquet, T., Ragot, N.: Combining structure and parameter adaptation of HMMs for printed text recognition. IEEE Trans. Pattern Anal. Mach. Intell. 36(9), 1716–1732 (2014)
Article Google Scholar
Moysset, B., et al.: The A2iA multi-lingual text recognition system at the second maurdor evaluation. In: 2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR) (2014)
Google Scholar
Mamat, H., Xiaojiao, C.: A method for printed Uyghur character segmentation. In: Liu, C.-L., Zhang, C., Wang, L. (eds.) CCPR 2012. CCIS, vol. 321, pp. 539–547. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33506-8_66
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

College of Information Science and Engineering, Xinjiang University, Urumqi, 830046, China
Halimulati Meimaiti
Key Laboratory of Multilanguage Information Technology, Urumqi, Xinjiang, China
Halimulati Meimaiti

Authors

Halimulati Meimaiti
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Halimulati Meimaiti .

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Jie Zhou
Beihang University, Beijing, China
Yunhong Wang
Chinese Academy of Sciences, Beijing, China
Zhenan Sun
Xinjiang University, Urumqi, China
Zhenhong Jia
Tsinghua University, Beijing, China
Jianjiang Feng
Chinese Academy of Sciences, Beijing, China
Shiguang Shan
Xinjiang University, Urumqi, China
Kurban Ubul
Tsinghua University, Shenzhen, China
Zhenhua Guo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Meimaiti, H. (2018). A Study on the Printed Uyghur Script Recognition Technique Using Word Visual Features. In: Zhou, J., et al. Biometric Recognition. CCBR 2018. Lecture Notes in Computer Science(), vol 10996. Springer, Cham. https://doi.org/10.1007/978-3-319-97909-0_76

Download citation

DOI: https://doi.org/10.1007/978-3-319-97909-0_76
Published: 09 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97908-3
Online ISBN: 978-3-319-97909-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics