Advertisement

Artificial Intelligence Review

, Volume 42, Issue 4, pp 1047–1066 | Cite as

Methods and strategies on off-line cursive touched characters segmentation: a directional review

  • Tanzila Saba
  • Amjad Rehman
  • Mohamed Elarbi-Boudihir
Article

Abstract

Character segmentation is a challenging problem in the field of optical character recognition. Presence of touched characters make this dilemma more crucial. The goal of this paper is to provide major concepts and progress in domain of off-line cursive touched character segmentation. Accordingly, two broad classes of technique are identified. These include methods that perform explicit or implicit character segmentation. The basic methods used by each class of technique are presented and the contributions of individual algorithms within each class are discussed. It is the first survey that focuses on touched character segmentation and provides segmentation rates, descriptions of the test data for the approaches discussed. Finally, the main trends in the field of touched character segmentation are examined, important contributions are presented and future directions are also suggested.

Keywords

Optical character recognition Touched character Documents analysis Explicit segmentation Implicit segmentation 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alhajj R, Polat F, Elnagar A (2000) Employing multi-agents to identify touching of adjacent digits in handwritten Hindi numerals. In: Proceedings of the IEEE international conference on systems, man, and cybernetics, vol 4. pp 2725–2730Google Scholar
  2. Bae JH, Jung KC, Kim JW, Kim HJ (1998) Segmentation of touching characters using an MLP. Pattern Recognit Lett 19: 701–709CrossRefMATHGoogle Scholar
  3. Bansal V, Sinha RMK (2002) Segmentation of touching and fused Devanagari characters. Pattern Recognit 35(4): 875–893CrossRefMATHGoogle Scholar
  4. Bayer T, Kressel U (1993) Cut classification for segmentation. Paper presented at the proceedings of international conference on document analysis and recognition. Tsukuba Science City, Japan, pp 565–568Google Scholar
  5. Broumandnia A, Shanbehzadeh J, Nourani M (2007) Segmentation of printed Farsi/Arabic words. In: Proceedings of IEEE/ACS international conference on computer systems and applications. pp 761–766Google Scholar
  6. Casey RG, Lecolinet E (1996) A survey of methods and strategies in character segmentation. IEEE Trans Pattern Anal Mach Intell 17: 690–706CrossRefGoogle Scholar
  7. Casey RG, Nagy G (1982) Recursive segmentation and classification of composite character patterns. Paper presented at the proceedings of the 6th international conference on pattern recognition, Munich, GermanyGoogle Scholar
  8. Chunheng W, Hotta Y, Suwa M, Naoi N (2004) Handwritten Chinese address recognition. In: Proceedings of the ninth international workshop on frontiers in handwriting recognition. pp 539–544Google Scholar
  9. Dijkstra EW (1959) A note on two problems in connection with graphs. Numer Comput Math 1: 269–271CrossRefMATHMathSciNetGoogle Scholar
  10. Elnagar A, Alhajj R (2003) Segmentation of connected handwritten numeral strings. Pattern Recognit 36(3): 625–634CrossRefGoogle Scholar
  11. Fujisawa H (2007) A view on the past and future of character and document recognition. In: Ninth international conference on document analysis and recognition. pp 3–7Google Scholar
  12. Garain U, Chaudhuri BB (2002) Segmentation of touching characters in printed devnagari and bangla scripts using fuzzy multifactorial analysis. IEEE Trans Syst Man Cybern 32(4): 449–459CrossRefGoogle Scholar
  13. Han Z, Liu CP, Yin XC (2005) A two-stage handwritten character segmentation approach in mail address recognition. In: Proceedings of eighth international conference on document analysis and recognition, vol 1. pp 111–115Google Scholar
  14. Hilditch CJ (1969) Linear skelton from square cupboards. Mach Intell 4: 403–420Google Scholar
  15. Hoffman RL, Mccullough JW (1971) Segmentation methods for recognition of machine-printed characters. IBM J Res Dev 15, 153–165 CrossRefGoogle Scholar
  16. Hong-Gang Z, Gang L, Wei-Ran X, Jun G (2002) An algorithm of handwritten digits segmentation based on multi-mould. In: Proceedings of international conference on machine learning and cybernetics, vol 2. pp 1081–1084Google Scholar
  17. Jayarathna UKS, Bandara GEMDC (2006) New segmentation algorithm for off-line handwritten connected character segmentation. In: Proceedings of the first international conference on industrial and information systems. pp 540–546Google Scholar
  18. Jianming H, Donggang Y, Hong Y (1999) Construction of partitioning paths for touching handwritten characters. Non-Linear Anal 20(3): 293–303MATHGoogle Scholar
  19. Jin-Hak B, Kee-Chul J, Jin-Wook K, Hang-Joon K (1998) Segmentation of touching characters using an MLP. Pattern Recognit Lett 19(8): 701–709CrossRefMATHGoogle Scholar
  20. Jiqiang S, Zuo L, Lyu MR, Shijie C (2005) Recognition of merged characters based on forepart prediction, necessity-sufficiency matching, and character-adaptive masking. IEEE Trans Syst Man Cybern B 35(1): 2–11CrossRefGoogle Scholar
  21. Kahan S, Pavlidis T, Baird HS (1987) On the recognition of printed characters of any font and size. IEEE Trans Pattern Anal Mach Intell 9(2): 274–288CrossRefGoogle Scholar
  22. Kim KK, Kim JH, Suen CY (2002) Segmentation-based recognition of handwritten touching pairs of digits using structural features. Pattern Recognit Lett 23(1–3): 13–24MATHGoogle Scholar
  23. Kimura F, Shridhar M, Chen Z (1993) Improvements of a lexicon directed algorithm for recognition of unconstrained handwritten words. In: Proceedings of the second international conference on document analysis and recognition. pp 18–22Google Scholar
  24. Kurniawan F, Rehman A, Dzulkifli M, Mariyam S (2009) Self organizing features map with improved segmentation to identify touching of adjacent characters in handwritten words (2011). In: Ninth international conference on hybrid intelligent systems (HIS 2009), vol 1. Shenyang, LiaoNing, China, pp 475–480Google Scholar
  25. Kurniawan F, Rehman A, Dzulkifli M, Mariyam S, Sulong G (2011) Region-based touched character segmentation in handwritten words. Int J Innov Comput Inf Control (IJICIC) 7(6): 3107–3120Google Scholar
  26. Lee H-J, Lee M-C (1993) Understanding mathematical expressions in a printed document. In: Proceedings of second international conference on document analysis and recognition. pp 502–505Google Scholar
  27. Liang S, Shridhar M, Ahmadi M (1994) Segmentation of touching characters in printed document recognition. Pattern Recognit 27(6): 825–840CrossRefGoogle Scholar
  28. Lorenz O, Monagan G (1994) Retrieval of line drawings. In: Proceedings of the third annual symposium on document analysis and information retrieval. Las Vegas, NevadaGoogle Scholar
  29. Lorigo L, Govindaraju V (2005) Segmentation and pre-recognition of arabic handwriting. In: Proceedings of international conference on document analysis and recognition, vol 2. pp 605–609Google Scholar
  30. Lu Y (1995) Machine printed character segmentation: an overview. Pattern Recognit 28(1): 67–80CrossRefGoogle Scholar
  31. Masayuki O, Syougo S, Tadashi S (1999) Segmentation of touching characters in formulas. In: Proceedings of third IAPR workshop on document analysis systems theory and practiceGoogle Scholar
  32. Min-Chul J, Yong-Chul S, Srihari SN (1999) Machine printed character segmentation method using side profiles. In: Proceedings of IEEE SMC ’99 conference on systems, man, and cyberneticsGoogle Scholar
  33. Misako S, Satoshi N (2004) Segmentation of handwritten numerals by graph representation. In: Proceedings of the ninth international workshop on frontiers in handwriting recognitionGoogle Scholar
  34. Monagan G (1994) A procedure for segmenting touching numbers in cadastral maps. In: Proceedings of IAPR workshop on machine vision applications (MVA 1994). Kawasaki, JapanGoogle Scholar
  35. Okamoto M, Sakaguchi S, Suzuki T (1999) Segmentation of touching characters in formulas. Lect Note Comput Sci 1655:151–156CrossRefGoogle Scholar
  36. Oliveira LES, Lethelier E, Bortolozzi F, Sabourin R (2000) A new segmentation approach for handwritten digits. In: Proceedings of the 15th international conference on pattern recognition, vol 2. pp 323–326Google Scholar
  37. Ouchtati S, Bedda M, Lachouri A (2007) Segmentation and recognition of handwritten numeric chains. J Comput Sci 3(4): 242–248CrossRefGoogle Scholar
  38. Pal U, Belaid A, Choisy C (2001) Water reservoir based approach for touching numeral segmentation. In: Proceedings sixth international conference on document analysis and recognition. pp 892–896Google Scholar
  39. Pal U, Belaid A, Choisy C (2003) Touching numeral segmentation using water reservoir concept. Pattern Recognit Lett 24(1–3): 261–272CrossRefGoogle Scholar
  40. Rao AVS, Subbarao M, Rao NV, Sastry ASCS, Reddy LP (2009) Segmentation of touching handwritten numerals and alphabets. In: Second international conference on computer and electrical engineering. pp 304–307Google Scholar
  41. Rehman A, Saba T (2011) Performance analysis of segmentation approach for cursive handwritten word recognition on benchmark database. Digit Signal Process (Elsevier) 21: 486–490CrossRefGoogle Scholar
  42. Roy PP, Pal U, Llados J (2008) Recognition of multi-oriented touching characters in graphical documents. In: Proceedings of sixth Indian conference on computer vision, graphics & image processing. pp 297–304Google Scholar
  43. Rui MA, Yingnan Z, Yongquan X, Yunyang Y (2008) A touching pattern-oriented strategy for handwritten digits segmentation. In: Proceeding of international conference on computational intelligence and security. pp 174–179Google Scholar
  44. Saba T, Rehman A (2011) Off-line cursive script recognition: current advances, comparisons and remaining problems. Artif Intell Rev. doi: 10.1007/s10462-011-9229-7
  45. Saba T, Rehman A, Sulong G (2010a) Improved off-line connected script recognition based on hybrid strategy. Int J Eng Sci Technol 2(6): 1603–1611Google Scholar
  46. Saba T, Rehman A, Sulong G (2010b) Non-linear segmentation of touched roman characters based on genetic algorithm. Int J Comput Sci Eng 2(6): 2167–2172Google Scholar
  47. Saba T, Rehman A, Sulong G (2010c) Touched cursive script segmentation with neural confidence. Int J Innov Comput Inf Control 7(7): 1–10Google Scholar
  48. Saba T, Sulong G, Rahim S, Rehman A (2010d) On the segmentation of multiple touched characters: a heuristics approach. Inf Commun Technol Lect Note Comput Sci (LNCS) Springer Verlag 101(3): 540–542. doi: 10.1007/978-3-642-15766-0_91 Google Scholar
  49. Saba T, Sulong G, Rehman A (2011) Document image analysis: issues, comparison of methods and remaining problems. Artif Intell Rev 35(2): 101–118. doi: 10.1007/s10462-010-9186-6 CrossRefGoogle Scholar
  50. Seo W, Cho BJ (2004) Efficient segmentation path generation for unconstrained handwritten Hangul characters. Lect Note Artif Intell 3192: 438–446Google Scholar
  51. Shi Z, Govindaraju V (1997) Segmentation and recognition of connected handwritten numeral strings. Pattern Recognit 30(9): 1501–1504CrossRefGoogle Scholar
  52. Strathy NW, Suen CY, Krzyzak A (1993) Segmentation of handwritten digits using contour features. In: Proceedings of the second international conference on document analysis and recognition. pp 577–580Google Scholar
  53. Suwa M (2005) Segmentation of connected handwritten numerals by graph representation. In: Proceedings of eighth international conference on document analysis and recognition, vol 2. pp 750–754Google Scholar
  54. Teruyuki Y, Shinji T, Tomohiro Y, Tsuyoshi S, Eiji M, Hisao O (2002) A segmentation system for touching handwritten Japanese characters. In: Proceedings of the eighth international workshop on frontiers in handwriting recognition (IWFHR’02). pp 407–412Google Scholar
  55. Tian X, Zhang Y (2007) Segmentation of touching characters in mathematical expressions using contour feature technique. In: Eighth ACIS international conference on software engineering, artificial intelligence, networking, and parallel/distributed computing. pp 206–209Google Scholar
  56. Tripathy N, Pal U (2004) Handwriting segmentation of un-constrained Oriya text. In: Proceedings of the international workshop on frontiers in handwriting recognition. pp 306–311Google Scholar
  57. Tsujimoto S, Assada H (1991) Resolving ambiguity in segmenting touching characters. In: Proceedings of the first international conference on document analysis and recognitionGoogle Scholar
  58. Ventzislav A (2004) Using critical points in contours for segmentation of touching characters. In: Proceedings of the 5th international conference on computer systems and technologiesGoogle Scholar
  59. Wei X, Ma S (2005) Segmentation of touching Chinese character based on convex hull ratio feature. J Chin Inf Process 47: 91–96Google Scholar
  60. Wei X, Ma S, Jin Y (2005) Segmentation of connected Chinese characters based on genetic algorithm. In: Proceedings of the eighth international conference on document analysis and recognition (ICDAR’05), vol 2. pp 645–649Google Scholar
  61. Wshah S, Shi Z, Govindaraju V (2009a) Segmentation of Arabic handwriting based on both contour and skeleton segmentation. In: Proceedings of the tenth international conference on document analysis and recognition. pp 793–797Google Scholar
  62. Wshah S, Shi Z, Govindaraju V (2009b) Segmentation of Arabic handwriting based on both contour and skeleton segmentation In: Proceedings of the 10th international conference on document analysis and recognition. pp 793–797Google Scholar
  63. Xian W, Govindaraju V, Srihari S (1999) Multi-experts for touching digit string recognition. In: Proceedings of the fifth international conference on document analysis and recognition. pp 800–803Google Scholar
  64. Xianghui W, Shaoping M, Yijiang J (2005) Segmentation of connected Chinese characters based on genetic algorithm. Paper presented at the proceedings of the eighth international conference on document analysis and recognitionGoogle Scholar
  65. Yamaguchi T, Yoshikawa T, Shinogi T, Tsuruoka S, Teramoto M (2001) A segmentation method for touching Japanese handwritten characters based on connecting condition of lines. In: Proceedings of sixth international conference on document analysis and recognition. pp 837–841Google Scholar
  66. Yi-Kai C, Jhing-Fa W (2000) Segmentation of single or multiple-touching handwritten numeral string using background and foreground analysis. IEEE Trans Pattern Anal Mach Intell 22(11): 1304–1317CrossRefGoogle Scholar
  67. Yong G, Yan Z, Zhao H (2009a) Touching string segmentation using MRF. In: International conference on computational intelligence and security. pp 520–524Google Scholar
  68. Yong G, Yan Z, Zhao H (2009b) Touching string segmentation using MRF. In: International conference on computational intelligence and security. pp 520–524Google Scholar
  69. Yong G, Yan Z, Zhao H (2009c) Touching string segmentation using MRF. In: International conference on computational intelligence and security. pp 520–524Google Scholar
  70. Yoon JJ, Kim G (2000) An approach for active segmentation of unconstrained handwritten Korean strings using run-length code. In: Proceedings of the seventh international workshop on frontiers in handwriting recognition. AmsterdamGoogle Scholar
  71. Yu D, Yan H (2001) Separation of touching handwritten multi-numeral strings based on morphological structural features. Pattern Recognit 34(3): 587–599CrossRefMATHGoogle Scholar
  72. Yun L, Liu CS, Ding XQ, Qiang F (2004) A recognition based system for segmentation of touching handwritten numeral strings. In: Proceedings of the ninth international workshop on frontiers in handwriting recognition. pp 294–299Google Scholar
  73. Zhang JY, Ding XQ (2000) Multi-scale feature extraction and nested-subset classifier design for high accuracy handwritten character recognition. Paper presented at the, Barcelona, SpainGoogle Scholar
  74. Zhang S, Karim MA (2002) A new impulse detector for switching median filters. IEEE Signal Process Lett 9: 360–363CrossRefGoogle Scholar
  75. Zhang LQ, Suen CY (2002) Recognition of courtesy amounts on bank checks based on a segmentation approach. In: Proceedings of eighth international workshop on handwriting recognition. pp 298–302Google Scholar
  76. Zhang T, Wang X, Chen C, Liu J (2009) Connected numeral strings segmentation based on the combination of characteristic position and contour detecting. In: Proceedings of the first international conference on digital image processing. pp 81–84Google Scholar
  77. Zhongkang L, Zheru C, Wan-Chi S, Pengfei S (1999) A background-thinnig-based approach for separating and recognizing connected handwritten digit strings. Pattern Recognit 32: 921–933CrossRefGoogle Scholar
  78. Zhu X, Yin X (2002) A new textual/non-textual classifier for document skew correction. In: Proceedings of the 16th international conference on pattern recognition (ICPR). pp 480–482Google Scholar

Copyright information

© Springer Science+Business Media B.V. 2011

Authors and Affiliations

  • Tanzila Saba
    • 1
  • Amjad Rehman
    • 2
  • Mohamed Elarbi-Boudihir
    • 2
  1. 1.Graphics and Multimedia Department Faculty of Computer Science & Information SystemsUniversiti Teknologi MalaysiaJohor BahruMalaysia
  2. 2.A.I. and Robotics Research Group, College of Computer and Information SciencesAl-Imam M. Saud Islamic UniversityRiyadhKingdom of Saudi Arabia

Personalised recommendations