Online Handwriting Recognition for Indic Scripts

  • A. Bharath
  • Sriganesh Madhvanath
Chapter
Part of the Advances in Pattern Recognition book series (ACVPR)

Abstract

Online handwriting recognition refers to the problem of machine recognition of handwriting captured in the form of pen trajectories. The technology holds significant promise for Indic scripts: Indic languages are used by a sixth of the world’s population, and handwriting-based text input is easier to use for these scripts than keyboard-based methods. Even though the recognition of handwritten Devanagari, Bangla, and Tamil has received significant attention in recent times, research efforts directed at Indic script recognition in general are still in their early stages. The structure of the scripts and the variety of shapes and writing styles pose challenges that differ from those posed by other scripts and hence require customized techniques for feature representation and recognition. In this chapter, we describe the challenges in recognizing online handwriting in Indic scripts and provide an overview of the state of the art for isolated character and word recognition. We then briefly present some of the promising applications, starting with handwriting-based text input systems (IMEs) that have been built for entering Indic scripts. In the last section, we provide a few pointers to resources such as tools and data sets that are currently available for online Indic script recognition research.

Keywords

Online handwriting recognition; Indic scripts; Character recognition; Word recognition

Notes

Acknowledgments

We gratefully acknowledge the contributions and influence, over the years, of various present and former colleagues at HP Labs India, notably Deepu Vijayasenan, M. Dinesh, Ajay Bhaskarabhatla, Mudit Agrawal, Kalika Bali, and K.S.R. Anjaneyulu; of our collaborators Prof. A.G. Ramakrishnan at the Indian Institute of Science (IISc), Prof. Srinivasa Chakravarthy at the Indian Institute of Technology-Madras (IIT-M), and Prof. C.V. Jawahar at the International Institute of Information Technology-Hyderabad (IIIT-H), and their students; and of the various interns who have worked with us at one time or another.

Copyright information

© Springer-Verlag London Limited 2009

Authors and Affiliations

  1. Hewlett-Packard Laboratories, Bangalore, India
