An Unsupervised Classification of Printed and Handwritten Telugu Words in Pre-printed Documents Using Text Discrimination Coefficient

Rani, N. Shobha; Vasudev, T

doi:10.1007/978-981-10-2471-9_67

N. Shobha Rani^19,20 &
T Vasudev²⁰

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 507))

1053 Accesses
2 Citations

Abstract

Classification of handwritten and printed text in pre-printed documents enhances the performance of optical character recognition technologies. The objective of work presented lies in devising an approach to perform automatic classification of printed and handwritten text at word level, which is inherently found in pre-printed documents. The proposed work consists of three stages to perform the classification of printed and handwritten words in Telugu pre-printed documents. The stage one encompasses the feature computation from the segmented words, stage two determines text discrimination coefficient, and finally, the classification of printed and handwritten text using a decision model is accomplished in stage three. The statistical and geometrical moment features are computed with respect to the text block under consideration, and furthermore, these features are employed for determination of text discrimination coefficient. The results of experimentation are proved to be promising and robust with an accuracy of around 98.2 %.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Shobha Rani N., Vasudev T.: A Generic Line Elimination Methodology using Circular Masks for Printed and Handwritten Document Images, Emerging research in computing, information, communication and applications, Elsevier Science and technology, vol. 3, (2014).
Google Scholar
Sadagopan Srinivasan., Li Zhao., Lin Sun., Zhen Fang, Peng Li.., Tao Wang., Ravishankar Iyer., Ramesh Illikkal., Dong Liu.,:Performance Characterization and Acceleration of Optical Character Recognition on Handheld Platforms, IEEE International Symposium on Workload Characterization (IISWC), (2010).
Google Scholar
Suman V Patgar., Vasudev T.,: An unsupervised intelligent system to detect fabrication in photocopy document using geometric moments and gray level co-occurrence matrix, International journal of computer applications, Vol. 74(12), 29–34, (2013).
Google Scholar
Mark A Walch., Donald T Gantz.,: Pictographic recognition technology applied to distinctive characteristics of handwritten Arabic text, Proceedings of Symposium on Document Image Understanding Technology, 173–186, (2005).
Google Scholar
Ranjeet Srivastava., Ravi Kumar Tewari., Shashi Kant.,: Separation of machine printed and handwritten text for Hindi documents, International Research Journal of Engineering and Technology (IRJET), Vol. 2(2), pp. 704–708, (2015).
Google Scholar
M.S. Shirdhonkar., Manish B Kokare.,: Discrimination between printed and handwritten text in documents, International journal of Computer Applications, Recent Trends in Image Processing and Pattern Recognition”, 131–134, (2010).
Google Scholar
Rajesh Pathak., Ravi Kumar Tewari.,: Distinction between machine printed text and handwritten Text in a document, International Journal of Scientific Engineering and Research (IJSER), Vol. 3(7), pp. 13–17, (2015).
Google Scholar
Lincoln Faria da Silva., Aura Conci., Angel Sanchez.,: Automatic discrimination between printed and handwritten text in documents, XXII Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI), (2009).
Google Scholar
Mallikarjun Hangarge., K.C. Santosh., Srikanth Doddamani., Rajmohan Pardeshi.,: Statistical Texture Features based Handwritten and Printed Text Classification in South Indian Documents, International Conference on Emerging Trends in Electrical, Communications and Information Technologies, Elsevier, vol. 1(32), 215–221, (2012).
Google Scholar
Samir Malakara., Rahul Kumar Dasa., Ram Sarkarb., Subhadip Basub., Mita Nasipuri.,: Handwritten and printed word identification using gray-scale feature vector and decision tree classifier, International Conference on Computational Intelligence: Modeling Techniques and Applications(CIMTA), Procedia Technology 10, 831–839, (2013).
Google Scholar
Simon Xinmeng Lia.,: Image analysis by moments, Thesis, Department of electrical and computer engineering, University of Manitoba, Winnipeg, Canada, (1993).
Google Scholar
Yan Qiu Chen., Mark X. Nixon., David W. Thomas.,: Statistical geometrical features for texture classification, Pattern recognition, Elsevier Science, vol. 28(4), 537–552, (1995).
Google Scholar
Mohammed Javed., P. Nagabhushan., B.B. Chaudhuri.,: Extraction of projection profile, run-histogram and entropy features straight from run-length compressed text documents, IAPR Asian conference on pattern recognition, IEEE proceedings, 813–817, (2013).
Google Scholar
Li. S., Moon chuan Lee., Chi Man Pun.,: Complex zernike moments features for shape based image retrieval, IEEE Transactions on Systems, Man and Cybernetics, Part A: Systems and Humans, Vol. 39(1), 227–237, (2008).
Google Scholar
Franz Faul., Edgar Erdfelder., Axel Buchner., Albert-Georg Lang.,: statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses, Behavior Research Methods, Springer, Vol. 41(4), 1149–1160, (2009).
Google Scholar

Download references

Author information

Authors and Affiliations

Maharaja Research Foundation, Maharaja Institute of Technology, University of Mysore, Mysuru, Karnataka, India
N. Shobha Rani
Department of Computer Science, Amrita Vishwa Vidyapeetham, Amrita University, Mysuru Campus, Mysuru, Karnataka, India
N. Shobha Rani & T Vasudev

Authors

N. Shobha Rani
View author publications
You can also search for this author in PubMed Google Scholar
T Vasudev
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to N. Shobha Rani .

Editor information

Editors and Affiliations

ANITS, Prof., Comp. Sci. & Engg. Dept. ANITS, Visakhapatnam, Andhra Pradesh, India
Suresh Chandra Satapathy
JNTUH College of Engg. HYD (Autonomous), Prof. & Head, Comp. Sci. & Engg. Dept. JNTUH College of Engg. HYD (Autonomous), Hyderabad, Telangana, India
V. Kamakshi Prasad
JNTUH College of Engg. HYD (Autonomous), Pro., Dept. Computer Science & Engg. JNTUH College of Engg. HYD (Autonomous), Hyderabad, Telangana, India
B. Padmaja Rani
SCIS, University of Hyderabad , Hyderabad, India
Siba K. Udgata
CMR Technical Campus , Hyderabad, India
K. Srujan Raju

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rani, N.S., Vasudev, T. (2017). An Unsupervised Classification of Printed and Handwritten Telugu Words in Pre-printed Documents Using Text Discrimination Coefficient. In: Satapathy, S., Prasad, V., Rani, B., Udgata, S., Raju, K. (eds) Proceedings of the First International Conference on Computational Intelligence and Informatics . Advances in Intelligent Systems and Computing, vol 507. Springer, Singapore. https://doi.org/10.1007/978-981-10-2471-9_67

Download citation

DOI: https://doi.org/10.1007/978-981-10-2471-9_67
Published: 01 December 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2470-2
Online ISBN: 978-981-10-2471-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics