Abstract
Imaging techniques are widely used in document image analysis in order to process, enhance, analyze, and recognize document images. In this chapter, we present an overview of basic image processing algorithms used in document image analysis and focus on the techniques used for document image binarization, enhancement, and normalization.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Akiyama T, Hagita N (1990) Automated entry system for printed documents. Pattern Recognit 23(11):1141–1154
Amin A, Fischer S (2000) A document skew detection method using the Hough transform. Pattern Anal Appl 3(3):243–253
Aradhye HB (2005) A generic method for determining up/down orientation of text in roman and non-roman scripts. Pattern Recognit 38:2114–2131
Avila BT, Lins RD (2004) A new algorithm for removing noisy borders from monochromatic documents. In: ACM-SAC’2004, Cyprus, Mar 2004. ACM, pp 1219–1225
Avila BT, Lins RD (2004) Efficient removal of noisy borders from monochromatic documents. In: ICIAR 2004, Porto, Portugal. LNCS 3212, pp 249–256
Badekas E, Papamarkos N (2007) Optimal combination of document binarization techniques using a self-organizing map neural network. Eng Appl Artif Intell (Elsevier) 20:11–24
Badekas E, Nikolaou N, Papamarkos N (2006) Text binarization in color documents. Int J Imaging Syst Technol 16(6):262–274
Baird HS (1987) The skew angle of printed documents. In: SPSE 40th conference and symposium on hybrid imaging systems, Rochester, pp 21–24
Bozinovich A, Srihari A (1989) Off-line cursive script word recognition. Trans Pattern Anal Mach Intell II(1):69–82
Brown MS, Tsoi YC (2006) Geometric and shading correction for images of printed materials using boundary. IEEE Trans Image Process 15(6):1544–1554
Cao Y, Li H (2003) Skew detection and correction in document images based on straight-line fitting. Pattern Recognit Lett 24(12):1871—1879
Cao H, Ding X, Liu C (2003) Rectifying the bound document image captured by the camera: a model based approach. In: 7th international conference on document analysis and recognition, Edinburgh, pp 71–75
Caprari RS (2000) Algorithm for text page up/down orientation determination. Pattern Recognit Lett 21:311–317
Chen Y, Leedham G (2005) Decompose algorithm for thresholding degraded historical document images. IEE Vis Image Signal Process 152(6):702–714
Chen YK, Wang JF (2001) Skew detection and reconstruction of color-printed document images. IEICE Trans Inf Syst E84-D(8):1018–1024
Cheriet M, Said JN, Suen CY (1998) A recursive thresholding technique for image segmentation. IEEE Trans Image Process 7(6):918–921
Chou CH, Chu SY, Chang F (2007) Estimation of skew angles for scanned documents based on piecewise covering by parallelograms. Pattern Recognit 40:443–455
Chou C-H, Lin W-H, Chang F (2010) A binarization method with learning-built rules for document images produced by cameras. Pattern Recognit 43(4):1518–1530
Ding Y, Kimura F, Miyake Y (2000) Slant estimation for handwritten words by directionally refined chain code. In: 7th international workshop on Frontiers in handwriting recognition (IWFHR 2000), Amsterdam, pp 53–62
Ding Y, Ohyama W, Kimura F, Shridhar M (2004) Local Slant estimation for handwritten English words. In: 9th international workshop on Frontiers in handwriting recognition (IWFHR 2004), Tokyo, Japan, pp 328–333,
Drira F, LeBourgeois F, Emptoz H (2011) A new PDE-based approach for singularity-preserving regularization: application to degraded characters restoration. Int J Doc Anal Recognit. doi:10.1007/s10032-011-0165-5
Dubois E, Pathak A (2001) Reduction of bleed-through in scanned manuscript documents. In: Proceedings of the image processing, image quality, image capture systems conference, Apr 2001, pp 177–180
Fadoua D, Le Bourgeois F, Emptoz H (2006) Restoring ink bleed-through degraded document images using a recursive unsupervised classification technique. In: 7th international workshop on document analysis systems, (DAS 2006), Nelson, New Zealand, pp 38–49
Fan KC, Wang YK, Lay TR (2002) Marginal noise removal of document images. Pattern Recognit 35(11):2593–2611
Fan H, Zhu L, Tang Y (2010) Skew detection in document images based on rectangular active contour. Int J Doc Anal Recognit 13(4):261–269
Gatos B, Papamarkos N, Chamzas C (1997) Skew detection and text line position determination in digitized documents. Pattern Recognit 30(9):1505–1519
Gatos B, Pratikakis I, Perantonis SJ (2006) Adaptive degraded document image binarization. Pattern Recognit 39:317–327
Gatos B, Pratikakis I, Perantonis SJ (2008) Efficient binarization of historical and degraded document images. In: 8th international workshop on document analysis systems (DAS’08), Nara, Sept 2008, pp 447–454
Haji MM, Bui TD, Suen CY (2009) Simultaneous document margin removal and skew correction based on corner detection in projection profiles. In: ICIAP 2009, Solerno, Italy. LNCS 5716, pp 1025–1034
Huang S, Ahmadi M, Sid-Ahmed MA (2008) A hidden Markov model-based character extraction method. Pattern Recognit 41(9):2890–2900
Hull JJ (1998) Document image skew detection: survey and annotated bibliography. In: Hull JJ, Taylor SL (eds) Document analysis systems II. World Scientific, Singapore/River Edge, pp 40–64
Kamel M, Zhao A (1993) Extraction of binary character/graphics images from grayscale document images. Comput Vis Graph Image Process 55:203–217
Kavallieratou E, Fakotakis N, Kokkinakis G (2001) Slant estimation algorithm for OCR systems. Pattern Recognit 34:2515–2522
Kim G, Govindaraju V (1997) A Lexicon driven approach to handwritten word recognition for real-time applications. IEEE Trans Pattern Anal Mach Intell 19(4):366–379
Kim I-K, Jung D-W, Park R-H (2002) Document image binarization based on topographic analysis using a water flow model. Pattern Recognit 35:265–277
Kimura F, Shridhar M, Chen Z (1993) Improvements of a lexicon directed algorithm for recognition of unconstrained handwritten words. In: 2nd international conference on document analysis and recognition (ICDAR 1993), Tsukuba City, Japan. Oct 1993, pp 18–22
Lavialle O, Molines X, Angella F, Baylou P (2001) Active contours network to straighten distorted text lines. In: International conference on image processing, Thessaloniki, pp 748–751
Le DX, Thoma GR (1996) Automated borders detection and adaptive segmentation for binary document images. In: International conference on pattern recognition, (ICPR 1996), Vienna, Austria, p III: 737–741
Le DS, Thoma GR, Wechsler H (1994) Automated page orientation and skew angle detection for binary document images. Pattern Recognit 27(10):1325–1344
Leung CC, Chan KS, Chan HM, Tsui WK (2005) A new approach for image enhancement applied to low-contrast–low-illumination IC and document images. Pattern Recognit Lett 26:769–778
Li S, Shen Q, Sun J (2007) Skew detection using wavelet decomposition and projection profile analysis. Pattern Recognit Lett 28:555–562
Liang J, DeMenthon D, Doermann D (2008) Geometric rectification of camera-captured document images. IEEE Trans PAMI 30(4):591–605
Likforman-Sulem L, Darbon J, Barney Smith EH (2011) Enhancement of historical printed document images by combining total variation regularization and Non-local Means filtering. Image Vis Comput J 29(5):351–363
Liu H, Wua Q, Zha H, Liu X (2008) Skew detection for complex document images using robust borderlines in both text and non-text regions. Pattern Recognit Lett 29:1893–1900
Lu Y, Tan CL (2003) A nearest-neighbor chain based approach to skew estimation in document images. Pattern Recognit Lett 24:2315–2323
Lu SJ, Chen BM, Ko CC (2006) A partition approach for the restoration of camera images of planar and curled document. Image Vis Comput 24(8):837–848
Makridis M, Nikolaou N, Papamarkos N (2010) An adaptive technique for global and local skew correction in color documents. Expert Syst Appl 37(10):6832–6843
Masalovitch A, Mestetskiy L (2007) Usage of continuous skeletal image representation for document images de-warping. In: International workshop on camera-based document analysis and recognition, Curitiba, pp 45–53
Moghaddam RF, Cheriet M (2010) A variational approach to degraded document enhancement. IEEE Trans Pattern Anal Mach Intell 32(8):1347–1361
Niblack W (1986) An introduction to digital image processing. Prentice-Hall, Englewood Cliffs, pp 115–116
Nomura S, Yamanaka K, Shiose T, Kawakami H, Katai O (2009) Morphological preprocessing method to thresholding degraded word images. Pattern Recognit Lett 30(8):729–744
Obafemi-Ajayi T, Agam G, Frieder O (2010) Historical document enhancement using LUT classification. Int J Doc Anal Recognit 13:1–17
O’Gorman L (1993) The document spectrum for page layout analysis. IEEE Trans Pattern Anal Mach Intell 15(11):1162–1173
Otsu N (1979) A threshold selection method from Gray-level histograms. IEEE Trans Syst Man Cybern 9(1):377–393
Pratt WK (2012) Digital image processing. Wiley, New York
Sauvola J, Pietikainen M (2000) Adaptive document image binarization. Pattern Recognit 33:225–236
Sezgin M, Sankur B (2004) Survey over image thresholding techniques and quantitative performance evaluation. J Electron Imaging 13(1):146–165
Shafait F, Breuel TM (2009) A simple and effective approach for border noise removal from document images. In: 13th IEEE international multi-topic conference, Islamabad, Dec 2009, pp 1–5
Shafait F, van Beusekom J, Keysers D, Breuel TM (2008) Document cleanup using page frame detection. Int J Doc Anal Recognit 11:81–96
Singh C, Bhatia N, Kaur A (2008) Hough transform based fast skew detection and accurate skew correction methods. Pattern Recognit 41(12):3528–3546
Solihin Y, Leedham C (1999) Integral ratio: a new class of global thresholding techniques for handwriting images. IEEE Trans Pattern Anal Mach Intell 21:761–768
Stamatopoulos N, Gatos B, Kesidis A (2007) Automatic borders detection of camera document images. In: 2nd international workshop on camera-based document analysis and recognition (CBDAR’07), Curitiba, Sept 2007, pp 71–78
Stamatopoulos N, Gatos B, Pratikakis I, Perantonis SJ (2011) Goal-oriented rectification of camera-based document images. IEEE Trans Image Process 20(4):910–920
Su B, Lu S, Tan CL (2011) Combination of document image binarization techniques. In: 11th international conference on document analysis and recognition, Beijing, 18–21 Sept 2011
Tonazzini A (2010) Color space transformations for analysis and enhancement of ancient degraded manuscripts. Pattern Recognit Image Anal 20(3):404–417
Trier ØD, Taxt T (1995) Evaluation of binarization methods for document images for document images. IEEE Trans Pattern Anal Mach Intell 17(3):312–315
Tsai CM, Lee HJ (2002) Binarization of color document images via luminance and saturation color features. IEEE Trans Image Process 11(4):434–451
Tseng YH, Lee HJ (2008) Document image binarization by two-stage block extraction and background intensity determination. Pattern Anal Appl 11:33–44
Ulges A, Lampert CH, Breuel TM (2005) Document image dewarping using robust estimation of curled text lines. In: 8th international conference on document analysis and recognition, Seoul, pp 1001–1005
van Beusekom J, Shafait F, Breuel TM (2010) Combined orientation and skew detection using geometric text-line modeling. Int J Doc Anal Recognit 13(2):79–92
Vinciarelli A, Luettin J (2001) A new normalization technique for cursive handwritten words. Pattern Recognit Lett 22(9):1043–1050
Vonikakis V, Andreadis I, Papamarkos N (2011) Robust document binarization with OFF center-surround cells. Pattern Anal Appl 14:219–234
Wahl FM, Wong KY, Casey RG (1982) Block segmentation and text extraction in mixed text/image documents. Comput Graph Image Process 20:375–390
Wang B, Li XF, Liu F, Hu FQ (2005) Color text image binarization based on binary texture analysis. Pattern Recognit Lett 26:1650–1657
Wu C, Agam G (2002) Document image De-warping for text/graphics recognition. In: Joint IAPR international workshop on structural, syntactic and statistical pattern recognition, Windsor, pp 348–357
Yan H (1993) Skew correction of document images using interline cross-correlation. Graph Models Image Process 55(6):538–543
Yang Y, Yan H (2000) An adaptive logical method for binarization of degraded document images. Pattern Recognit 33:787–807
Yin PY (2001) Skew detection and block classification of printed documents. Image Vis Comput 19(8):567–579
Zhang L, Tan CL (2005) Warped image restoration with applications to digital libraries. In: 8th international conference on document analysis and recognition, Seoul, pp 192–196
Zhang L, Yip AM, Brown MS, Tan CL (2009) A unified framework for document restoration using in painting and shape-from-shading. Pattern Recognit J 42(11):2961–2978
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag London
About this entry
Cite this entry
Gatos, B.G. (2014). Imaging Techniques in Document Analysis Processes. In: Doermann, D., Tombre, K. (eds) Handbook of Document Image Processing and Recognition. Springer, London. https://doi.org/10.1007/978-0-85729-859-1_4
Download citation
DOI: https://doi.org/10.1007/978-0-85729-859-1_4
Published:
Publisher Name: Springer, London
Print ISBN: 978-0-85729-858-4
Online ISBN: 978-0-85729-859-1
eBook Packages: Computer ScienceReference Module Computer Science and Engineering