Skip to main content
Log in

RETRACTED ARTICLE: Document image analysis: issues, comparison of methods and remaining problems

  • Published:
Artificial Intelligence Review Aims and scope Submit manuscript

This article was retracted on 26 June 2013

Abstract

Image analysis is an interesting research area with a large variety of challenging applications. Researchers have worked from decades on this topic, as witnessed by the scientific literature. However, document image analysis is the special case in image analysis as their spatial properties are different from natural images. Therefore, the main focus of this paper is to describe image denoising issues in general and document image issues in particular. Since the field of document processing is relatively new, it is also dynamic, so current methods have room for improvement and innovations are still being made. Several algorithms proposed in the literature are described. Critical discussions are reported about the current status of the field and open problems are highlighted. It is also demonstrated that, there are rarely definitive techniques for all cases of a certain problem. We surveyed the state of art, analyzed recent trends and tried to identify challenges for future research in this field.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Abdel-Dayem AR, Hamou AK, El-Sakka MR (2004) Novel adaptive filtering for salt-and-pepper noise removal from binary document images. LNCS 3212: 191–199

    Google Scholar 

  • Abreu E, Lightstone M, Mitra S, Arakawa K (1996) A new efficient approach for the removal of impulse noise from highly corrupted images. IEEE Trans Image Process 5(6): 1012–1025

    Article  Google Scholar 

  • Akiyama T, Hagita N (1990) Automated entry system for printed documents. Pattern Recogn 23(11): 1141–1154

    Article  Google Scholar 

  • Ali MB (1996) Background noise detection and cleaning in document images. In: Proceedings of the 13th international conference on pattern recognition, vol 3, pp 758–762

  • Al-Khaffaf H, Talib AZ, Salam RA (2008) Removing salt-and-pepper noise from binary images of engineering drawings. In: 19th international conference on pattern recognition (ICPR 2008), pp 1–4

  • Amin A, Fischer S (2000) A document skew detection method using the hough transform. Pattern Anal Appl 3: 243–253

    Article  MATH  Google Scholar 

  • Aradhya VNM, Kumar GH, Shivakumara P (2007) An accurate and efficient skew estimation Technique for south indian documents: a new boundary growing and nearest neighbor clustering based approach. Int J Robot Autom 22(4): 272–280

    Google Scholar 

  • Arvind KR, Kumar J, Ramakrishnan AG (2007). Entropy based skew correction of document images. LNCS 2007, vol 4815, pp 495–502

  • Avila BT, Lins RD (2005) A fast orientation and skew detection algorithm for monochromatic document images. In: Proceedings of the 2005 ACM symposium on document engineering

  • Bagdanov A, Kanai J (1997) Projection profile based skew estimation algorithm for JBIG compressed images. In: Proceedings of the 4th international conference on document analysis and recognition, pp 401–405

  • Baird HS (1987) The skew angle of printed documents. In: Proceedings of SPSE 40th symposium hybrid imaging systems, Rochester, NY, pp 739–743

  • Baird HS (1992) Anatomy of a versatile page reader. Proc IEEE 80(7): 1059–1065

    Article  Google Scholar 

  • Bala E, Ertuzun A (2002) Applications of multiwavelet techniques to image denoising. Proc Int Conf Image Process 3: 581–584

    Article  Google Scholar 

  • Barni M, Buti F, Bartolini F, Cappellini V (2000) A quasi-euclidean norm to speed up vector median filtering. IEEE Trans Image Process 9(10): 1704–1709

    Article  MathSciNet  MATH  Google Scholar 

  • Bharath AA, Ng J (2005) A steerable complex wavelet construction and its application to image denoising. IEEE Trans Image Process 14(7): 948–959

    Article  MathSciNet  Google Scholar 

  • Billawala N, Hart PE, Peairs M (1993) Image continuation. In: Proceedings of the second document analysis and recognition, pp 53–57

  • Bloomberg DS, Kopec GE, Dasari L (1995) Measuring document image skew and orientation. Document Recognition II (SPIE vol 2422), San Jose, CA, 6–7, pp 302–316

  • Bovik A (1987) Streaking in median filtered images. IEEE Trans Acoust Speech Signal Process 35(4): 493–503

    Article  MATH  Google Scholar 

  • Buades A, Coll B, Morel JM (2005) A review of image denoising algorithms with a new one multiscale modeling and simulation. SIAM Interdiscip J 4(2): 490–530

    MathSciNet  MATH  Google Scholar 

  • Buccigrossi RW, Simoncelli EP (1999) Image compression via joint statistical characterization in the wavelet domain. IEEE Trans Image Process 8(12): 1688–1701

    Article  Google Scholar 

  • Chan RH, Ho C-W, Nikolova M (2005) Salt-and-pepper noise removal by median-type noise detectors and detail preserving regularization. IEEE Trans Image Progress 14: 1479–1485

    Article  Google Scholar 

  • Chang SG, Yu B, Vetterli M (2000) Spatially adaptive wavelet thresholding with context modeling for image denoising. IEEE Trans Image Process 9(9): 1522–1531

    Article  MathSciNet  MATH  Google Scholar 

  • Chaudhuri A, Chaudhuri S (1997) Robust detection of skew in document images. IEEE Trans Image Process 6(2): 344–349

    Article  Google Scholar 

  • Chinnasarn K, Rangsanseri Y, Thitimajshima P (1998) Removing salt-and-pepper noise in text/graphics images. In: IEEE Asia-Pacific conference on circuits and systems, pp 459–462

  • Choi H, Baraniuk RG (2004) Multiple wavelet basis image denoising using Besov ball projections. IEEE Signal Process Lett 11(9): 717–720

    Article  Google Scholar 

  • Chou CH, Chu SY, Chang F (2007) Estimation of document skew angles using piecewise linear approximation of line objects. Pattern Recogn 40: 443–455

    Article  MATH  Google Scholar 

  • Ciardiello G, Scafuro G, Degrandi MT, Spada MR, Roccotelli MP (1988) An experimental system for office document handling and text recognition. In: Proceedings of ninth international conference on pattern recognition, pp 739–743

  • Dalong L (2009) Support vector regression based image denoising. Image Vis Comput 27: 623–627

    Article  Google Scholar 

  • Dalong L, Simske S, Mersereau RM (2007) Image denoising through support vector regression. IEEE Int Conf Image Process 4: 425–428

    Google Scholar 

  • Dargherty E, Lotufo R (2003) Hands–on Morphological Image Processing. The Society of Photo-Optical Instrumentation Engineers, Bellingham

    Book  Google Scholar 

  • Dhandra BV, Malemath VS, Mallikarjun H, Hegadi R (2006) Skew detection in binary image documents based on image dilation and region labeling approach. In: Proceedings 18th international conference on pattern recognition, vol 2, pp 954–957

  • Dong Y-Q, Fang X-S (2006) An efficient salt-and-pepper noise removal, Acta Scientiarum Natrulium Universitatics Pekinensis, vol 5

  • Donoho DL, Johnstone IM (1995) Adapting to unknown smoothness via wavelet shrinkage. J Am Stat Assoc Natl Lab 90(432): 1200–1224

    Article  MathSciNet  MATH  Google Scholar 

  • Gatos B, Papamarkos N, Chamzas C (1997) Skew detection and text line position determination in digitized documents. Pattern Recogn 30(9): 1505–1519

    Article  Google Scholar 

  • Gonzalez R et al (2000) Digital image processing. Addision-Wesley, Reading

    Google Scholar 

  • Gorman L (1993) The document spectrum for page layout analysis. IEEE Trans Pattern Anal Mach Intell 15(11): 1162–1173

    Article  Google Scholar 

  • Hamza AB, Luque P, Martinez J, Roman R (1999) Removing noise and preserving details with relaxed median filters. J Math Image Vis 11(2): 161–177

    Article  Google Scholar 

  • Hashizume A, Yeh PS, Rosenfeld A (1986) Method of detecting the orientation of aligned components. Pattern Recogn 4(3): 125–132

    Google Scholar 

  • He K, Zhou J, Liu C, Liu R (2008) An efficient salt-and-pepper noise removal on local edge-preserving function. In: International conference on embedded software and systems symposia, (ICESS Symposia ‘08), pp 392–397

  • Hinds J, Fisher L, D’Amato DP (1990) A document skew detection method using run-length encoding and the Hough transform. In: Proceedings of the 10th international conference pattern recognition. IEEE CS Press, Los Alamitos, CA, pp 464–468

  • Hull JJ (1998) Document image skew detection: survey and annotated bibliography. World Scientific, Singapore, pp, pp 40–64

    Google Scholar 

  • Ishitani Y (1993) Document skew detection based on local region complexity. In: Proceedings of the 2nd international conference on document analysis and recognition, Tsukuba, Japan, pp 49–52

  • Jain AK (1989) Fundamentals of digital image processing. Prentice-Hall, Englewood Cliffs

    MATH  Google Scholar 

  • Jiang H, Han C, Fan K (1997) A fast approach to the detection and correction of skew documents. Pattern Recogn Lett 18: 675–686

    Article  Google Scholar 

  • Jiang X, Bunke H, Widmer-Kljajo D (1999) Skew detection of document images by focused nearest-neighbor clustering. In: Proceedings 5th international conference on document analysis and recognition, Bangalore, pp 629–632

  • Jung A (2001) An introduction to a new data analysis tool: independent component analysis. In: Proceedings of workshop GK “Nonlinearity”—Regensburg

  • Jung K, Kim KI, Jain AK (2004) Text information extraction in images and video: a survey. Pattern Recogn 37(5): 977–997

    Article  Google Scholar 

  • Ko S, Lee YH (1991) Center weighted median filters and their applications to image enhancement. IEEE Trans Circuits Syst 38(9): 984–993

    Article  Google Scholar 

  • Kumar J, Kasar T, Ramakrishna AG (2007) IEEE TENCON, 1–4

  • Kuo S, Johnston JD (2002) Spatial noise shaping based on human visual sensitivity and its application to image coding. IEEE Trans Image Process 11(5): 509–517

    Article  Google Scholar 

  • Le DS, Thoma GR, Wechsler H (1994) Automatic page orientation and skew angle detection for binary document images. Pattern Recogn 27(10): 1325–1344

    Article  Google Scholar 

  • Lins RD, Avila BT (2004) A new algorithm for skew detection in images of documents. In: Lecture Notes in Computer Science, vol 3212, Springer, Berlin, pp 234–240

  • Liolios N, Fakotakis N, Kokkinakis G (2001) Improved document skew detection based on text line connected component clustering. In: Proceedings of the international conference on image processing, Thessaloniki, Greece, vol 1, pp 1098–1101

  • Liying F, Lixin F, Chew LT (2001) Binarizing document image using coplanar prefilter. In: Proceedings of sixth international conference on document analysis and recognition, pp 34–38

  • Lu Y, Tan CL (2003) A nearest-neighbor chain based approach to skew estimation in document images. Pattern Recogn Lett 24: 2315–2323

    Article  Google Scholar 

  • Mahmoudi M, Sapiro G (2005) Fast image and video denoising via non-local means of similar neighborhoods. IEEE Signal Process Lett 12(12): 839–842

    Article  Google Scholar 

  • Makridis M, Nikolaou N, Papamarkos N (2007) A new technique for global and local skew correction in binary documents. In: 9th international conference on advanced concepts for intelligent vision systems, pp 877–887

  • Manjunath VN, Kumar GH, Shivakumara P (2006) Skew detection technique for binary document images based on Hough transform. Int J Inf Technol 3(3): 194–200

    Google Scholar 

  • Motwani MC, Gadiya MC, Motwani RC (2004) Survey of image denoising techniques. In: Proceedings of global signal processing expo and conference (GSPx), Santa Clara, CA

  • Nikolova M (2004) A variational approach to remove outliers and impulse noise. J Math Imaging Vis 20: 99–120

    Article  MathSciNet  Google Scholar 

  • Ozawa H, Nakagawa T (1993) A character image enhancement method from characters with various background images. In: Proceedings of second international conference on document analysis and recognition, pp 58–61

  • Pal U, Chaudhuri BB (1996) An improved document skew angle estimation technique. Pattern Recogn Lett 17(8): 899–904

    Article  Google Scholar 

  • Pavlidis T, Zhou J (1991) Page segmentation by white streams. In: Proceedings of first international conference on document analysis and recognition (ICDAR), pp 945–953

  • Peake GS, Tan TN (1997) A general algorithm for document skew angle estimation. IEEE Int Conf Image Process 2: 230–233

    Article  Google Scholar 

  • Ping Z, Lihui C, Alex KC (2000) Text document filters using morphological and geometrical features of characters. In: Proceedings of fifth international conference on signal processing, vol 1, pp 472–475

  • Portilla Zhou J, Krzyzak A, Suen CY (2002) Verification-a method of enhancing the recognizers of isolated and touching handwritten numerals. Pattern Recogn 35: 1179–1189

    Article  Google Scholar 

  • Postl W (1986) Detection of linear oblique structures and skew scan in digitized documents. In: Proceedings of the 8th international conference on pattern recognition, Paris, France, pp 687–689

  • Radu MV, Bilcu C (2007) Fast Non-local means for image de-noising. In: Proceedings of IS&T/SPIE symposium on electronic imaging, digital photography III conference, vol 6502, San Jose, California, USA

  • Romberg JK, Choi H, Baraniuk RG (2001) Bayesian tree-structured image modeling using wavelet-domain hidden Markov models. IEEE Image Process 10(7): 1056–1068

    Article  Google Scholar 

  • Sadri J, Cheriet M (2009) A new approach for skew correction of documents based on particle swarm optimization. In: Proceedings of 10th international conference on document analysis and recognition, ICDAR ‘09, pp 1066–1070

  • Saragiotis P, Papamarkos N (2008) Local skew correction in documents. Int J Pattern Recogn Artif Intell 22(4): 691–710

    Article  Google Scholar 

  • Sarfraz M, Rasheed Z (2008) Skew estimation and correction of text using bounding box. In: Proceedings of fifth international conference on computer graphics, imaging and visualization, (CGIV ‘08), pp 259–264

  • Sauvola J, Pietik Äainen M (1995) Skew angle detection using texture direction analysis. In: Proceedings of the 9th scandinavian conference on image analysis, pp 1099–1106

  • Seo H-J, Chatterjee P, Takeda H, Milanfar P (2007) A comparison of some state of the art image denoising methods. In: conference record of the forty-first asilomar conference on signals, systems and computers, pp 518–522

  • Sharif AE, Movahhedinia N (2008) On skew estimation of Persian/Arabic printed documents. J Appl Sci 8(12): 2265–2271

    Article  Google Scholar 

  • Shivakumara P Hemantha G, Kumar D (2002) Text-skew detection through contour following in document image. In: Proceedings of national workshop on computer vision, graphics and image processing, (WVGIP 2002), pp 39–44

  • Shivakumara P, Hemantha Kumar G, Guru DS, Nagabhushan P (2003) Skew estimation of binary document images using static and dynamic thresholds useful for document image mosaicing. In: Proceedings of national workshop on IT services and applications (WITSA2003) Feb 27–28, pp 1–5

  • Singh C, Bhatia N, Kaur A (2008) Hough transform based fast skew detection and accurate skew correction methods. Pattern Recogn 41: 3528–3546

    Article  MATH  Google Scholar 

  • Spitz AL (1997) Determination of the script and language content of document images. IEEE Trans Pattern Anal Mach Intell 19(3): 235–245

    Article  Google Scholar 

  • Srihari SN, Govindaraju V (1989) Analysis of textual images using the Hough transform. Mach Vis Appl 2: 141–153

    Article  Google Scholar 

  • Sun T, Neuvo Y (1994) Detail-preserving median based filters in image processing. Pattern Recogn Lett 15: 341–347

    Article  Google Scholar 

  • Sun C, Si D (1997) Skew and slant correction for document images using gradient direction. In: Proceedings of the 4th international conference on document analysis and recognition, Ulm, Germany, pp 142–146

  • Teboul S, Blanc-F’eraud L, Aubert G, Barlaud M (1998) Variational approach for edge-preserving regularization using coupled PDE’s. IEEE Trans Image Process 7: 387–397

    Article  Google Scholar 

  • Wang Z, Zhang D (1999) Progressive switching median filter for the removal of impulse noise from highly corrupted images. IEEE Trans Circ Syst II 46: 78–80

    Article  Google Scholar 

  • Wang J, Leung MKH, Hui SC (1997) Cursive word reference line detection. Pattern Recogn 30(3): 503–511

    Article  Google Scholar 

  • Xie J, Zhang D, Xu W (2004) Spatially adaptive wavelet denoising using the minimum description length principle. IEEE Trans Image Process 13(2): 179–187

    Article  Google Scholar 

  • Xing-mei L, Guo-ping Y, Xing-mei L, Liang C (2007) The Image denoise based on soft-threshold and edge enhancement. In: Second workshop on digital media and its application in museum & Heritage, pp 53–56

  • Yin PY (2001) Skew detection and block classification of printed documents. J Image Vis Comput 19(8): 567–579

    Article  Google Scholar 

  • Yoon M, Lee S, Kim J (1995) Faxed image restoration using kalman filtering. In: Third international conference on document analysis and recognition, vol 2, pp 677–680

  • Yu B, Jain AK (1996) A robust and fast skew detection algorithm for generic documents. Pattern Recogn 29(10): 1599–1629

    Article  Google Scholar 

  • Zhang S, Karim MA (2002) A new impulse detector for switching median filters. IEEE Signal Process Lett 9: 360–363

    Article  Google Scholar 

  • Zheng Q, Kanungo T (2001) Morphological degradation models and their use in document image restoration. Int Conf Image Process 1: 193–196

    Google Scholar 

  • Zhong J, Ning R (2005) Image denoising based on wavelets and multifractals for singularity detection. IEEE Trans Image Process 14(10): 1435–1447

    Article  Google Scholar 

  • Zhou X, Zhou C, Stewart BG (2006) Comparisons of discrete wavelet transform, wavelet packet transform and stationary wavelet transform in denoise PD measurement, data. In: IEEE, international symposium on electrical insulation, pp 237–240

  • Zhu X, Yin X (2002) A new textual/non-textual classifier for document skew correction. In: Proceedings of the 16th international conference on pattern recognition (ICPR), pp 480–482

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. Rehman.

Additional information

This article has been retracted at the request of the publishing editor due to plagiarism.

The retraction note to this article can be found online at http://dx.doi.org/10.1007/s10462-013-9407-x.

About this article

Cite this article

Saba, T., Sulong, G. & Rehman, A. RETRACTED ARTICLE: Document image analysis: issues, comparison of methods and remaining problems. Artif Intell Rev 35, 101–118 (2011). https://doi.org/10.1007/s10462-010-9186-6

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10462-010-9186-6

Keywords

Navigation