Guide to OCR for Arabic Scripts pp 351-371 | Cite as
Toward Distributed Cursive Writing OCR Systems Based on a Combination of Complementary Approaches
Abstract
Large amounts of cursive writing documents are still waiting to be computerized for several and different purposes. These documents are in general of medium to low quality; hence they require a sophisticated recognition algorithm capable of properly extracting the correct text from low quality cursive documents. The Dynamic Time Warp (DTW) algorithm is among the most effective algorithms for cursive writing optical character recognition (OCR). However, the DTW is a rather complex task requiring extensive computational capabilities, which hinders its commercial deployment on nonspecialized stand alone machines. Volunteer grids, such as XtremWeb and BOINC, provide viable infrastructures to speed up the DTW execution time. Recent experiments conducted on the Scientific Research Tunisian Grid (SRTG), an XtremWeb volunteer grid, confirmed this claim and showed a very tangible speedup along very high recognition rates. Such infrastructures present several practical advantages, such as the possibility of noncondemnation of the involved computers and the possibility of their simultaneous use by different users and/or applications. Unfortunately, volunteer grid infrastructures are inherently unable to guarantee the continuous availability of the stored data and, more importantly, the engaged processing capacities. Any involved computer may renegade and depart from the system at will, which consequently affects the application performance. Agent technology can be exploited here to solve the problem. In this chapter, we propose a service-oriented grid architecture (SOGA) based on the integration of both grid and agent technologies. An analytical study is conducted to ascertain and evaluate the key performance parameters of our proposed SOGA. The results confirm that our proposal provides a solid and viable solution for the large scale recognition of printed cursive writing based on the DTW algorithm.
References
- 1.Abdennadher, N.: Vers un outil Peer-To-Peer orienté calcul intensif. In: Flash Informatique, EPFL, August 2005 Google Scholar
- 2.Abdennadher, N.: Using the volunteer computing platform XtremWeb-CH: lessons and perspectives. In: Workshop on Grid Computing: e-Infrastructure, Applications and Research, ESSTT, UTIC, Tunisia (2007) Google Scholar
- 3.Abdennadher, N.: XtremWeb-CH: une plateforme global computing pour les applications de haute performance. Internal report, August 2004, HES-SO/EIG Google Scholar
- 4.Al-Badr, A., Haralick, R.: A segmentation-free approach to text recognition with application to Arabic text. Int. J. Doc. Anal. Recognit. 1(3), 147–166 (1998) CrossRefGoogle Scholar
- 5.Alves, C.E.R., et al.: Parallel dynamic programming for solving the string editing problem on CGM/BSP. In: Proc. SPAA’02, Winnipeg, Manitoba, Canada, August 10–13, 2002 Google Scholar
- 6.Amin, A.: Off-line Arabic character recognition: the state of the art. Pattern Recognit. 31(5), 517–530 (1998) MathSciNetCrossRefGoogle Scholar
- 7.Available at http://www.nla.gov.au/
- 8.Available at http://www.ocrgrid.org/
- 9.Available at http://www.kirtas.com/
- 10.Available at http://code.google.com/p/ocropus/
- 11.Available at http://www.xtremwebch.net
- 12.Available at http://www.xtremweb.net
- 13.Available at http://boinc.berkeley.edu
- 14.Bahi, J.M., et al.: Synchronous and asynchronous solution of a 3D transport model in a grid computing environment. Appl. Math. Model. 30, 616–628 (2006). Available on www.sciencedirect.com MATHCrossRefGoogle Scholar
- 15.Bertis, V., Bolze, R., Desprez, F., Reed, K.: Large scale execution of a bioinformatic application on a volunteer grid. In: IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2008, 14–18 April 2008, pp. 1–8 (2008) CrossRefGoogle Scholar
- 16.Bradford, P.G.: Efficient parallel dynamic programming. In: Proc. 30th Annual Allerton Conference on Communication, Control and Computing, University of Illinois, Urbana, IL, USA, pp. 185–194 (1992) Google Scholar
- 17.Bridle, J.S., et al.: An algorithm for connected word recognition. In: Proc. IEEE, ICASSP, pp. 899–902, May 1982 Google Scholar
- 18.Buyya, R., et al.: A gentle introduction to grid computing and technologies. In: Proc. CSI, India, May 7–19, 2005 Google Scholar
- 19.Capello, F.: The evolution of GRID5000. In: Workshop on Grid Computing: e-infrastructure, Applications and Research, ESSTT, UTIC Tunisia (2007) Google Scholar
- 20.Cheng, H.D., et al.: VLSI architecture for pattern matching using space-time domain expansion approach. In: Proc. IEEE Int. Conf. Computer Design VLSI Comput, NY, Oct 7–10, 1985 Google Scholar
- 21.Cheng, H.D., et al.: VLSI architecture for dynamic time-warp recognition of handwritten symbols. In: IEEE ASSP, vol. 34, Jan 1986 Google Scholar
- 22.Cheng, H.-D., et al.: A VLSI architecture for dynamic time-warp recognition of handwritten symbols. IEEE Trans. Acoust. Speech Signal Process. 34(3), 603–613 (1986) CrossRefGoogle Scholar
- 23.Cheung, A., et al.: An Arabic optical character recognition system using recognition based segmentation. Pattern Recognit. 34 (2001) Google Scholar
- 24.Coté, M., et al.: Une architecture Multi-agents réutilisables pour les environnements riches en informations. In: Série Scientifique, Montréal, July 2002 Google Scholar
- 25.Fedak, G., He, H., Cappello, F.: BitDew: a programmable environment for large-scale data, management and distribution. Technical report N 6427, INRIA, January 2008 Google Scholar
- 26.Ferber, J.: Les Systèmes Multi-agents. Vers une Intelligence Collective. InterEditios, Paris (1995) MATHGoogle Scholar
- 27.Foster, I., Kesselman, C., Tuecke, S.: The anatomy of the grid. Int. J. Supercomput. Appl. (2002) Google Scholar
- 28.Foster, I., Jennings, N.R., Kesselman, C.: Brain meets brawn: why grid and agents need each other. In: Proc. AAMAS’04, Leeds, UK, March 2004 Google Scholar
- 29.Gonzalez, R.C., Wintz, P.: BitDew: Digital Image Processing. Addison-Wesley, Reading (1988) Google Scholar
- 30.Hasegawa, W.: Research on Mobile Agent Security (510080). School of Information Science, Japan Advanced Institute of Science and Technology, February 2007 Google Scholar
- 31.Hayes, C.C.: Agents in a nutshell—a very brief introduction. IEEE Trans. Knowl. Data Eng. 11(1), 127–132 (1999) MathSciNetCrossRefGoogle Scholar
- 32.
- 33.IBM: Introduction to grid computing with Globus. IBM RedBook. SG24-6895-01. ISBN 0738427969, September 2003 Google Scholar
- 34.Jarras, I., Chaib-draa, B.: Aperçu sur les systèmes Multi-agents. In: Série Scientifique, Montréal, July 2002 Google Scholar
- 35.Jonquet, C.: Dynamic service generation: agent interactions for service exchange on the grid. Ph.D. dissertation, University of Montpellier II, November 2006 Google Scholar
- 36.Kanoun, S., et al.: Reconnaissance d’images de textes Arabes par approche Affixale. In: Proc. MCSEAI’04, Sousse, Tunisia, May 9–12, 2004 Google Scholar
- 37.Khemakhem, M.: Reconnaisance globale de caractères imprimés Arabes et Latins par comparaison dynamique. In: Proc. Regional Conference on Computer Science and Arabization, Tunis, Tunisia, March 1988 Google Scholar
- 38.Khemakhem, M., Belghith, A.: A multipurpose multi-agent system based on a loosely coupled architecture to speedup the DTW algorithm for Arabic printed cursive OCR. In: Proc. AICCSA-2005, Cairo, Egypt, January 2005 Google Scholar
- 39.Khemakhem, M., Belghith, A.: A P2P grid architecture for distributed Arabic OCR based on the DTW algorithm. Int. J. Comput. Appl. 31(1) (2009) Google Scholar
- 40.Khemakhem, M., et al.: Reconnaissance de caractères imprimés par comparaison dynamique. In: Proc. AFCET, Antibes, Sept. 1987 Google Scholar
- 41.Khemakhem, M., et al.: Arabic type written character recognition using dynamic comparison. In: Proc. 1st Computer Conference, Kuwait, March 1989 Google Scholar
- 42.Khemakhem, M., Belghith, A., Ben Ahmed, M.: Etude et evaluation de deux méthodes de distribution de l’algorithme de comparaison dynamique pour la reconnaissance de caractères Arabes. In: Proc. First Maghrebin Symposium on Programming and Systems, Algeria, October 1991 Google Scholar
- 43.Khemakhem, M., Belghith, A., Ben Ahmed, M.: Modélisation architecturale de la comparaison dynamique distribuée. In: Proc. Second International Congress on Arabic and Advanced Computer Technology, Casablanca, Morocco, December 1993 Google Scholar
- 44.Lesser, V.R.: Cooperative multi-agent systems: a personal view of the state of the art. IEEE Trans. Knowl. Data Eng. 11(1), 133–142 (1999) CrossRefGoogle Scholar
- 45.Mella, P.: Complex systems vs. Simplex systems: the behaviour of collectivities following the combinatory system view. In: Proc. 6th International Conference on Complex Systems (CS02), Complexity with Agent Based Modeling, Chuo University, Tokyo, Japan, September 9–11, 2002 Google Scholar
- 46.Quénot, G.R., et al.: A dynamic programming processor for speech recognition. IEEE J. Solid-State Circuits 24(F9), 20 (1989) Google Scholar
- 47.Russel, S., et al.: Artificial Intelligence: A Modern Approach. Prentice Hall, Englewood Cliffs (1995) Google Scholar
- 48.Shi, Z., et al.: Agent based grid computing. Appl. Math. Model. 30, 629–640 (2006). Available on www.sciencedirect.com MATHCrossRefGoogle Scholar
- 49.Vinciarelli, A.: A survey on off-line cursive word recognition. Pattern Recognit. 35, 1433–1446 (2002) MATHCrossRefGoogle Scholar
- 50.Weis, G.: Multi-agent Systems, a Modern Approach to Distributed Artificial Intelligence. MIT Press, Cambridge (1999) Google Scholar
- 51.Wolski, R.: Dynamically forecasting network performance using the network weather service. Ucsd Technical Report tr-cs96-494, University of California, San Diego, La Jolla, CA, January 7, 1998 Google Scholar
- 52.Wooldridge, M., et al.: Intelligent agents: theory and practice. Knowl. Eng. Rev. 10(2), 115–152 (1995) MathSciNetCrossRefGoogle Scholar
- 53.Wooldridge, M., et al.: A methodologie for agent oriented analysis and design. In: Proc. Third International Conference on Autonomous Agents (Agents’99), pp. 69–76. ACM, Seattle (1999) CrossRefGoogle Scholar
- 54.Worldlanguage products. Available at http://www.worldlanguage.com/ProductsArabic/OCR/Page1.htm