Abstract
Palm leaf manuscripts were one of the earliest forms of written media and were used in Southeast Asia to store early written knowledge about subjects such as medicine, Buddhist doctrine and astrology. Therefore, historical handwritten palm leaf manuscripts are important for people who like to learn about historical documents, because we can learn more experience from them. This paper presents an image segmentation of historical handwriting from palm leaf manuscripts. The process is composed of three steps: 1) background elimination to separate text and background by Otsu’s algorithm 2) line segmentation and 3) character segmentation by histogram of image. The end result is the character’s image. The results from this research may be applied to optical character recognition (OCR) in the future.
Chapter PDF
Similar content being viewed by others
Keywords
References
Shi Z, Setlur S, Govindaraju V. Digital Enhancement of Palm Leaf Manuscript Images using Normalization Techniques. 5th International Conference On Knowledge Based Computer Systems; 2004 December 19–22, 2004 Hyderabad, India; 2004.
Shi Z, Govindaraju V. Historical Document Image Segmentation Using Background Light Intensity Normalization. 12th SPIE Document Recognition and Retrieval; 2005 January 16–20, 2005; California, USA; 2005.
Shi Z, Govindaraju V. Historical Document Image Enhancement Using Background Light Intensity Normalization. 17th International Conference on Pattern Recognition; 2004 23–26 August 2004; Cambridge, United Kingdom; 2004.
S.A. Shahab, Wasfi G. Al-Khatib, Sabri A. Mahmoud. Computer Aided Indexing of Historical Manuscripts The Inte6rnational Conference on Computer Graphics, Imaging and Visualisation (CGIV’06); 2006 April 7, 2006; Sydney, Australia; 2006.
Chamchong R, Surinta O. Text Line Segmentation from Palm Leaf Manuscripts. The 3rd National Conference on Computing and Information Technology (NCCIT2007). Bangkok, Thailand 2007.
Surinta O, Jareanpon C. Comparison of image analysis for Thai handwritten character recognition. 4th International Conference on Intelligent Information Processing (IIP2006); 2006 September 20–23, 2006; Adelaide, Australia: Springer; 2006.
Surinta O, Nitsuwat S. Handwritten Thai Charactr Recognition Using Fourier Descriptors and Robust C-Protytype. Information Technology Journal. 2006 January–June 2006;2(3):96.
Otsu N. A threshold selection method from gray-level histograms. IEEE Transactions on Systems, Man and Cybernetics. 1979 January 1979;9(1).
Otsu’s method. 2008 February 20, 2008 [cited 2008 February 20]; Available from: http://en.wikipedia.org/wiki/Otsus_method
Tacnet LL-SAZB. Text Line Segmentation of Historical Documents: a Survey. International Journal on Document Analysis and Recognition, Analysis of Historical Documents. 2006.
C. Weliwitage, A. L. Harvey, A. B. Jennings. Handwritten Document Offline Text Line Segmentation. In: Cairns Q, Australia 4870, editor. The Digital Imaging Computing: Techniques and Applications (DICTA2005); 2005; Queensland, Australia; 2005.
Ataer E, Duygulu P. Retrieval of Ottoman Documents. 8th ACM SIGMM International Workshop on Multimedia Information Retrieval; 2006 October 26–27, 2006; California, USA: MIR 2006. 8th ACM SIGMM International Workshop on Multimedia Information Retrieval; 2006.
A. Cheung, M. Bennamoun, N. W. Bergmann. An Arabic optical character recognition system using recognition-based segmentation Pattern Recognition Society. 2001 February;3(2).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 IFIP International Federation for Information Processing
About this paper
Cite this paper
Surinta, O., Chamchong, R. (2008). Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts. In: Shi, Z., Mercier-Laurent, E., Leake, D. (eds) Intelligent Information Processing IV. IIP 2008. IFIP – The International Federation for Information Processing, vol 288. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-87685-6_23
Download citation
DOI: https://doi.org/10.1007/978-0-387-87685-6_23
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-87684-9
Online ISBN: 978-0-387-87685-6
eBook Packages: Computer ScienceComputer Science (R0)