Fuzzy Techniques for Text Localisation in Images

Górecki, Przemysław; Caponetti, Laura; Castiello, Ciro

doi:10.1007/978-3-540-76827-2_10

Fuzzy Techniques for Text Localisation in Images

Przemysław Górecki⁵,
Laura Caponetti⁶ &
Ciro Castiello⁶

Chapter

498 Accesses
2 Citations

Part of the book series: Studies in Computational Intelligence ((SCI,volume 96))

Text information extraction represents a fundamental issue in the context of digital image processing. Inside this wide area of research, a number of specific tasks can be identified ranging from text detection to text recognition. In this chapter, we deal with the particular problem of text localisation, which aims at determining the exact location where the text is situated inside a document image. The strict connection between text localisation and image segmentation is highlighted in the chapter and a review of methods for image segmentation is proposed. Particularly, the benefits coming from the employment of fuzzy and neuro-fuzzy techniques in this field is assessed, thus indicating a way to combine Computational Intelligence methods and document image analysis. Three peculiar methods based on image segmentation are presented to show different applications of fuzzy and neuro-fuzzy techniques in the context of text localisation.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 219.00; Price excludes VAT (USA)

Softcover Book: USD 329.00; Price excludes VAT (USA)

Hardcover Book: USD 279.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Colombo C, Del Bimbo A, Pala P (1999) IEEE Multimedia 6(3):38–53
Article Google Scholar
Long F, Zhang H, Feng D (2003) Fundamentals of content-based image retr- ieval, in: Feng D ZHE Siu WC (ed.) Multimedia information retrieval and management - technological fundamentals and applications. Springer, Berlin Heidelberg New York
Google Scholar
Yang M, Kriegman D, Ahuja N (2002) IEEE Trans Pattern Anal Mach Intell 24(1):34–58
Article Google Scholar
Dingli A, Ciravegna F, Wilks Y (2003) Automatic semantic annotation using unsupervised information extraction and integration, in: Proceedings of semAnnot workshop
Google Scholar
Djioua B, Flores JG, Blais A, Desclés JP, Guibert G, Jackiewicz A, Priol FL, Nait-Baha L, Sauzay B (2006) EXCOM: An automatic annotation Engine for semantic information, in: Proceedings of FLAIRS conference, pp. 285–290
Google Scholar
Orasan C (2005) Automatic annotation of corpora for text summarisation: A comparative study, in: Computational linguistics and intelligent text processing, volume 3406/2005, Springer, Berlin Heidelberg New York
Google Scholar
Karatzas D, Antonacopoulos A (2003) Two Approaches for Text Segmentation in Web Images, in: Proceedings of the 7th International Conference on Document Analsis and Recognition (ICDAR2003), IEEE Computer Society Press, Cambridge, UK pp. 131–136
Chapter Google Scholar
Jung K, Kim K, Jain A (2004) Pattern Recognit 37:977–997
Article Google Scholar
Chen D, Odobez J, Bourlard H (2002) Text segmentation and recognition in complex background based on Markov random field, in: Proceedings of International Conference on Pattern Recognition, pp. 227–230
Google Scholar
Li H, Doerman D, Kia O (2000) IEEE Trans Image Process 9(1):147–156
Article Google Scholar
Li H, Doermann D (2000) Superresolution-based enhancement of text in digital video, in: Proceedings of International Conference of Pattern Recognition, pp. 847–850
Google Scholar
Li H, Kia O, Doermann D (1999) Text enhancement in digital video, in: Proceedings of SPIE, Document Recognition IV, pp. 1–8
Google Scholar
Sato T, Kanade T, Hughes E, Smith M (1998) Video OCR for digital news archive, in: Proceedings of IEEE Workshop on Content based Access of Image and Video Databases, pp. 52–60
Google Scholar
Zhou J, Lopresti D, Lei Z (1997) OCR for world wide web images, in: Proceedings of SPIE on Document Recognition IV, pp. 58–66
Google Scholar
Zhou J, Lopresti D, Tasdizen T (1998) Finding text in color images, in: Proceedings of SPIE on Document Recognition V, pp. 130–140
Google Scholar
Ching-Yu Y, Tsai WH (2000) Signal Process.: Image Commun. 15(9):781–797
Article Google Scholar
Deng S, Lati S, Regentova E (2001) Document segmentation using polynomial spline wavelets, Pattern Recognition 34:2533–2545
Article MATH Google Scholar
Lu Y, Shridhar M (1996) Character segmentation in handwritten words, J. of, Pattern Recognit 29(1):77–96
Google Scholar
Mital D, Leng GW (1995) J Microcomput Appl 18(4):375–392
Article Google Scholar
Rossant F (2002) Pattern Recognit Lett 23(10):1129–1141
Article MATH Google Scholar
Xiao Y, Yan H (2003) Text extraction in document images based on Delaunay triangulation, Pattern Recognition 36(3):799–809
Article MATH MathSciNet Google Scholar
Pratt W (2001) Digital image processing (3rd edition). Wiley, New York, NY
Google Scholar
Haralick R (1979) Proc IEEE 67:786–804
Article Google Scholar
Haralick R, Shanmugam K, Dinstein I (1973) Textural features for image classification, IEEE Trans Syst Man Cybern 3:610–621
Article Google Scholar
Baird H, Jones S, Fortune S (1990) Image segmentation by shape-directed covers, in: Proceedings of International Conference on Pattern Recognition, pp. 820–825
Google Scholar
Nagy G, Seth S, Viswanathan M (1992) Method of searching and extracting text information from drawings, Computer 25:10–22
Article Google Scholar
O’Gorman L (1993) IEEE Trans Pattern Anal Mach Intell 15:1162–1173
Article Google Scholar
Kose K, Sato A, Iwata M (1998) Comput Vis Image Underst 70:370–382
Article Google Scholar
Wahl F, Wong K, Casey R (1982) Graph Models Image Process 20:375–390
Google Scholar
Jain A, Yu B (1998) IEEE Trans Pattern Anal Mach Intell 20:294–308
Article Google Scholar
Pavlidis T, Zhou J (1992) Graph Models Image Process 54:484–496
Article Google Scholar
Hadjar K, Hitz O, Ingold R (2001) Newspaper Page Decomposition Using a Split and Merge Approach, in: Proceedings of Sixth International Conference on Document Analysis and Recognition
Google Scholar
Jiming L, Tang Y, Suen C (1997) Pattern Recognit 30(8):1265–1278
Article Google Scholar
Rosenfeld A, la Torre PD (1983) IEEE Trans Syst Man Cybern SMC-13:231–235
Google Scholar
Sahasrabudhe S, Gupta K (1992) Comput Vis Image Underst 56:55–65
Google Scholar
Sezan M (1985) Graph Models Image Process 29:47–59
Article Google Scholar
Yanni M, Horne E (1994) A new approach to dynamic thresholding, in: Proceedings of EUSIPCO’94: 9th European Conference on Signal Processing 1, pp. 34–44
Google Scholar
Sezgin M, Sankur B (2004) J Electron Imaging 13(1):146–165
Article Google Scholar
Kamel M, Zhao A (1993) Graph Models Image Process 55(3):203–217
Article Google Scholar
Solihin Y, Leedham C (1999) Integral ratio: A new class of global thresholding techniques for handwriting images, in: IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI-21, pp. 761–768
Google Scholar
Trier O, Jain A (1995) Goal-directed evaluation of binarization methods, in: IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI-17, pp. 1191–1201
Google Scholar
Bow ST (2002) Pattern Recognition and Image Preprocessing 2nd edition. Dekker, New York, NY
Google Scholar
Jung K, Han J (2004) Pattern Recognit Lett 25(6):679–699
Article Google Scholar
Ohya J, Shio A, Akamatsu S (1994) IEEE Trans Pattern Anal Mach Intell 16(2):214–224
Article Google Scholar
Wu S, Amin A (2003) Proceedings of Seventh international conference on Document Analysis and Recognition, volume 1, pp. 493–497
Article Google Scholar
Canny J (1986) IEEE Trans Pattern Anal Mach Intell 8(6):679–698
Article Google Scholar
Chen D, Shearer K, Bourlard H (2001) Text enhancement with asymmetric filter for video OCR, in: Proceedings of International Conference on Image Analysis and Processing, pp. 192–197
Google Scholar
Hasan Y, Karam L (2000) IEEE Trans Image Process 9(11):1978–1983
Article Google Scholar
Lee SW, Lee DJ, Park HS (1996) IEEE Trans Pattern Recogn Mach Intell 18(10):1045–1050
Article Google Scholar
Grigorescu SE, Petkov N, Kruizinga P (2002) IEEE Trans Image Process 11(10):1160–1167
Article MathSciNet Google Scholar
Livens S, Scheunders P, van de Wouwer G, Van Dyck D (1997) Wavelets for texture analysis, an overview, in: Proceedings of the Sixth International Conference on Image Processing and Its Applications, pp. 581–585
Google Scholar
Tuceryan M, Jain AK (1998) Texture analysis, in: Chen CH, Pau LF, Wang PSP (eds.) The Handbook of Pattern Recognition and Computer Vision 2nd edition, World Scientific Publishing, River Edge, NJ pp. 207–248
Google Scholar
Jain A, Bhattacharjee S (1992) Mach Vision Appl 5:169–184
Article Google Scholar
Acharyya M, Kundu M (2002) IEEE Trans Circ Syst video Technol 12(12): 1117–1127
Article Google Scholar
Etemad K, Doermann D, Chellappa R (1997) IEEE Trans Pattern Anal Mach Intell 19(1):92–96
Article Google Scholar
Mao W, Chung F, Lanm K, Siu W (2002) Hybrid Chinese/English text detection in images and video frames, in: Proceedings of International Conference on Pattern recognition, volume 3, pp. 1015–1018
Google Scholar
Coifman R, Wickerhauser V (1992) IEEE Trans Inf Theory 38(2):713–718
Article MATH Google Scholar
Coifman RR (1990) Wavelet Analysis and Signal Processing, in: Auslander L, Kailath T, Mitter SK (eds.) Signal Processing, Part I: Signal Processing Theory, Springer, Berlin Heidelberg New York, pp. 59–68, URL {citeseer.is-t}.psu.edu/coifman92wavelet.html
Daubechies I (1992) Ten Lectures on Wavelets (CBMS - NSF Regional Conference Series in Applied Mathematics), Soc for Industrial & Applied Math
Google Scholar
Bruce A, Gao HY (1996) Applied Wavelet Analysis with S-Plus, Springer, Berlin Heidelberg New York
MATH Google Scholar
Mallat SG (1989) IEEE Trans Pattern Anal Mach Intell 11(7):674–693
Article MATH Google Scholar
Engelbrecht A (2003) Computational Intelligence: An Introduction, WileyNew York, NY
Google Scholar
Sincak P, Vascak J (eds.) (2000) Quo vadis computational intelligence?, Physica-Verlag
Google Scholar
Zadeh L (1965) Inform Control 8:338–353
Article MATH MathSciNet Google Scholar
Klir G, Yuan B (eds.) (1996) Fuzzy sets, fuzzy logic, and fuzzy systems: selected papers by Lotfi A. Zadeh, World Scientific Publishing, River Edge, NJ
Google Scholar
Pham T, Chen G (eds.) (2000) Introduction to Fuzzy Sets, Fuzzy Logic, and Fuzzy Control Systems, CRC , Boca Raton, FL
Google Scholar
Jawahar C, Ray A (1996) IEEE Signal Process Lett 3(8):225–227
Article Google Scholar
Jin Y (2003) Advanced Fuzzy Systems Design and Applications, Physica/ Springer, Heidelberg
MATH Google Scholar
Mamdani E, Assilian S (1975) Int J Man-Mach Studies 7(1):1–13
Article MATH Google Scholar
Sugeno M, Kang G (1988) Structure identification of fuzzy model, Fuzzy Sets Syst 28:15–33
Article MATH MathSciNet Google Scholar
Dubois D, Prade H (1996) Fuzzy Sets Syst 84:169–185
Article MATH MathSciNet Google Scholar
Leekwijck W, Kerre E (1999) Fuzzy Sets Syst 108(2):159–178
Article MATH Google Scholar
Dunn J (1974) J Cybern 3:32–57
Article MathSciNet Google Scholar
Bezdek J (1981) Pattern Recognition with Fuzzy Objective Function Algorithms (Advanced Applications in Pattern Recognition), Springer, Berlin Heidelberg New York URL http://www.amazon.co.uk/exec/obidos/ASIN/0306406713/citeulike-21
Macqueen J (1967) Some methods of classification and analysis of multivariate observations, in: Proceedings of the Fifth Berkeley Symposium on Mathemtical Statistics and Probability, pp. 281–297
Google Scholar
Pham D (2001) Comput Vis Image Underst 84:285–297
Article MATH MathSciNet Google Scholar
Bezdek J, Hall L, Clarke L (1993) Med Phys 20:1033–1048
Article Google Scholar
Rignot E, Chellappa R, Dubois P (1992) IEEE Trans Geosci Remote Sensing 30(4):697–705
Article Google Scholar
Jang JS, Sun C (1995) Proc of the IEEE 83:378–406
Article Google Scholar
Kosko B (1991) Neural networks and fuzzy systems: a dynamical systems approach to machinhe intelligence, Prentice Hall, Englewood Cliffs, NJ
Google Scholar
Lin C, Lee C (1996) Neural fuzzy systems: a neural fuzzy synergism to intelligent systems, Prentice-Hall, Englewood Cliffs, NJ
Google Scholar
Mitra S, Hayashi Y (2000) IEEE Trans Neural Netw 11(3):748–768
Article Google Scholar
Nauck D (1997) Neuro-Fuzzy Systems: Review and Prospects, in: Proc. Fifth European Congress on Intelligent Techniques and Soft Computing (EUFIT’97), pp. 1044–1053
Google Scholar
Fuller R (2000) Introduction to Neuro-Fuzzy Systems, Springer, Berlin Heidelberg New York
MATH Google Scholar
Castellano G, Castiello C, Fanelli A, Mencar C (2005) Fuzzy Sets Syst 149(1):187–207
Article MATH MathSciNet Google Scholar
Castiello C, Gorecki P, Caponetti L (2005) Neuro-Fuzzy Analysis of Document Images by the KERNEL System, Lecture Notes in Artificial Intelligence 3849:369–374
Google Scholar
Caponetti L, Castiello C, Gorecki P (2007) Document Page Segmentation using Neuro-Fuzzy Approach, to appear in Applied Soft Computing Journal
Google Scholar
Gorecki P, Caponetti L, Castiello C (2006) Multiscale Page Segmentation using Wavelet Packet Analysis, in: Abstracts of VII Congress Italian Society for Applied and Industrial Mathematics (SIMAI 2006), p. 210
Google Scholar
of Oulu Finland U, Document Image Database, http://www.ee.oulu.fi/research/imag/document/
Hinds S, Fisher J, D’Amato D (1990) A document skew detection method using run-length encoding and Hough transform, in: Proc. of the 10th Int. Conference on Pattern Recognition (ICPR), pp. 464–468
Google Scholar
Hough P (1959) Machine Analysis of Bubble Chamber Pictures, in: International Conference on High Energy Accelerators and Instrumentation, CERN
Google Scholar
Srihari S, Govindaraju V (1989) Mach Vision Appl 2:141–153
Article Google Scholar
Gonzalez R, Woods R (2007) Digital Image Processing 3rd edition, Prentice Hall
Google Scholar
Lindeberg T (1994) Scale-space theory in computer vision, Kluwer, Boston
Google Scholar
Watt A, Policarpo F (1998) The Computer Image, ACM, Addison-Wesley
Google Scholar
Sammon J (1970) IEEE Trans Comput C-19:826–829
Google Scholar
Holland J (1992) Adaptation in Natural and Artificial Systems reprint edition, MIT, Cambridge, MA,
Google Scholar
Mitchell M (1996) An Introduction to Genetic Algorithms, MIT, iSBN:0-262-13316-4
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics and Computer Science, University of Warmia and Mazury, ul. Oczapowskiego 2, 10-719, Olsztyn, Poland
Przemysław Górecki
Department of Computer Science, University of Bari, via E. Orabona, 4-70125, Bari, Italy
Laura Caponetti & Ciro Castiello

Authors

Przemysław Górecki
View author publications
You can also search for this author in PubMed Google Scholar
Laura Caponetti
View author publications
You can also search for this author in PubMed Google Scholar
Ciro Castiello
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Quantitative and Information System Department, Kuwait University College of Business Administration, 5486, 13055, Kuwait
Aboul-Ella Hassanien
Centre for Quantifiable Quality of Service in Communication Systems (Q2S) Centre of Excellence, Norwegian University of Science and Technology, O.S. Bragstads plass 2E, 7491, Trondheim, Norway
Ajith Abraham
Systems Research Institute, Polish Academy of Sciences, Newelska 6, 01-447, Warsaw
Janusz Kacprzyk

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Górecki, P., Caponetti, L., Castiello, C. (2008). Fuzzy Techniques for Text Localisation in Images. In: Hassanien, AE., Abraham, A., Kacprzyk, J. (eds) Computational Intelligence in Multimedia Processing: Recent Advances. Studies in Computational Intelligence, vol 96. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76827-2_10

Download citation

DOI: https://doi.org/10.1007/978-3-540-76827-2_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76826-5
Online ISBN: 978-3-540-76827-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Buying options