An approach to the script discrimination in the Slavic documents

Brodić, Darko; Milivojević, Zoran N.; Maluckov, Čedomir A.

doi:10.1007/s00500-014-1435-1

An approach to the script discrimination in the Slavic documents

Script discrimination

Methodologies and Application
Published: 28 August 2014

Volume 19, pages 2655–2665, (2015)
Cite this article

Soft Computing Aims and scope Submit manuscript

Darko Brodić¹,
Zoran N. Milivojević² &
Čedomir A. Maluckov¹

246 Accesses
10 Citations
Explore all metrics

Abstract

The paper deals with the problem of the script discrimination in old Slavic printed documents. Therefore, an algorithm for script classification and identification is proposed. It creates coded text from initial document. Then, the coded text is subjected to statistical analysis. As a result, the texture feature extraction is carried out. Obtained texture features are used as criteria for script classification and identification. The proposed method is tested on the samples of old Slavic printed documents written in Glagolitic, Cyrillic and Latin script.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Script Characterization in the Old Slavic Documents

Classification of the Scripts in Medieval Documents from Balkan Region by Run-Length Texture Analysis

Classification of German Scripts by Adjacent Local Binary Pattern Analysis of the Coded Text

References

Bharati MH, Liu JJ, MacGregor JF (2004) Image texture analysis: methods and comparisons. Chemom Intell Lab Systems 72(1):57–71
Article Google Scholar
Brodić D, Milivojević ZN, Maluckov Č (2013) Recognition of the script in Serbian documents using frequency occurrence and co-occurrence analysis. Sci World J 2013(896328):1–14
Article Google Scholar
Brodić D, Milivojević Z, Maluckov Č A (2014) Script characterization in the old Slavic documents. In: Elmoataz A, Lezoray O, Nouboud F, Mammass D (eds) Image and Signal Processing, LNCS 8509, pp 230–238. Springer, Berlin
Busch A, Boles WW, Sridharan S (2006) Texture for script identification. IEEE Trans Pattern Anal Mach Intell 27(11):1720–1732
Article Google Scholar
Chaudhuri BB, Pal U, Mitra M (2002) Automatic recognition of printed Oriya script. Sadhana 27(1):23–34
Article Google Scholar
Clausi DA (2002) An analysis of co-occurrence texture statistics as a function of grey level quantization. Can J Remote Sens 28(1):45–62
Article Google Scholar
Del Bimbo A (2001) Visual information retrieval. Morgan Kaufmann Publishers Inc, San Francisco
Eleyan A, Demirel H (2011) Co-occurrence matrix and its statistical features as a new approach for face recognition. Turkish J Electrical Eng Comput Sci 19(1):98–107
Google Scholar
Ghosh D, Dube T, Shivaprasad AP (2010) Script recognition—a review. IEEE Trans Pattern Anal Mach Intell 32(12):2142–2161
Article Google Scholar
Haralick R, Shanmugam K, Dinstein I (1973) Textural features for image classification. IEEE Trans Systems Man Cybern 3(6):610–621
Article Google Scholar
Haralick RM (1979) Statistical and structural approaches to texture. Proc IEEE 67(5):786–804
Article Google Scholar
Joshi GD, Garg S, Sivaswamy J (2007) A generalised framework for script identification. Int J Document Anal Recogn ( IJDAR) 10(2):55–68
Article Google Scholar
Pal U, Chaudhury BB (2002) Identification of different script lines from multi-script documents. Image Vis Comput 20(13–14):945–954
Silva C, Ribeiro B (2007) On text-based mining with active learning and background knowledge using SVM. Soft Comput 11(6):519–530
Article Google Scholar
Tolambiya A, Venkatraman S, Kalra PK (2010) Content-based image classification with wavelet relevance vector machines. Soft Comput 14(2):129–136
Article Google Scholar
Valkealahti K, Oja E (1998) Reduced multidimensional co-occurrence histograms in texture classification. IEEE Trans Pattern Anal Mach Intell 20(1):90–94
Article Google Scholar
Yang Z, Purves D (2004) The statistical structure of natural light patterns determines perceived light intensity. In: Proceedings of the National Academy of sciences of the United States of America 101(23):8745–8750
Zhang J, Tan T (2002) Brief review of invariant texture analysis methods. Pattern Recogn 35(3):735–747
Article MATH Google Scholar
Zramdini AW, Ingold R (1998) Optical font recognition using typographical features. IEEE Trans Pattern Anal Mach Intell 20(8):877–882
Article Google Scholar

Download references

Acknowledgments

This work was partially supported by the Grant of the Ministry of Education, Science and Technological Development of the Republic Serbia, as a part of the project TR33037 and III43011.

Author information

Authors and Affiliations

Technical Faculty in Bor, V.J. 12, University of Belgrade, 19210 , Bor, Serbia
Darko Brodić & Čedomir A. Maluckov
College of Applied Technical Sciences, Aleksandra Medvedeva 20, 18000 , Niš, Serbia
Zoran N. Milivojević

Authors

Darko Brodić
View author publications
You can also search for this author in PubMed Google Scholar
Zoran N. Milivojević
View author publications
You can also search for this author in PubMed Google Scholar
Čedomir A. Maluckov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Darko Brodić.

Additional information

Communicated by V. Loia.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Brodić, D., Milivojević, Z.N. & Maluckov, Č.A. An approach to the script discrimination in the Slavic documents. Soft Comput 19, 2655–2665 (2015). https://doi.org/10.1007/s00500-014-1435-1

Download citation

Published: 28 August 2014
Issue Date: September 2015
DOI: https://doi.org/10.1007/s00500-014-1435-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An approach to the script discrimination in the Slavic documents

Abstract

Access this article

Similar content being viewed by others

Script Characterization in the Old Slavic Documents

Classification of the Scripts in Medieval Documents from Balkan Region by Run-Length Texture Analysis

Classification of German Scripts by Adjacent Local Binary Pattern Analysis of the Coded Text

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An approach to the script discrimination in the Slavic documents

Abstract

Access this article

Similar content being viewed by others

Script Characterization in the Old Slavic Documents

Classification of the Scripts in Medieval Documents from Balkan Region by Run-Length Texture Analysis

Classification of German Scripts by Adjacent Local Binary Pattern Analysis of the Coded Text

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation