Abstract
It is presented herein a new thresholding algorithm for images of historical documents. The algorithm provides high quality binary images using entropy information of the images to define a primary threshold value which is adjusted with the use of ROC curves.
Chapter PDF
Similar content being viewed by others
Keywords
- Receiver Operating Characteristic Curve
- Document Image
- Historical Document
- Sample Document
- Produce Different Classifier With Different
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Chang, M.S., Kang, S.M., Rho, W.S., Kim, H.G., Kim, D.J.: Improved binarization algorithm for document image by histogram and edge detection. In: Proc. 3rd Intern. Conf.on Document Analysis and Recognition, Canada, pp. 636–639 (1995)
Glasbye, C.A.: An Analysis of Histogram-Based Thresholding Algorithm. In: CVGIP: Graphical Models and Image Processing (November 1993)
Huang, L.K.,, L.: Image Thresholding by Minimizing the Measures of Fuzziness. Pattern Recognition (1995)
Jawahar, C.V., Biswas, P.K., Ray, K.: Investigations On Fuzzy Thresholding Based On Fuzzy Clustering. Pattern Recognition (1997)
Johannsen, G., Bille, J.: A Threshold Selection Method using Information Measures. In: Proceedings, 6th Int. Conf. Pattern Recognition, Munich, Germany, pp. 140–143 (1982)
Kapur, J.N., Sahoo, P.K., Wong, A.K.C.: A New Method for Gray-Level Picture Thresholding using the Entropy of the Histogram. Computer Vision, Graphics and Image Processing 29(3) (1985)
Kapur, J.N.: Measures of Information and their Applications. John Wiley and Sons, Chichester (1994)
Katz, S.W., Brink, A.D.: Segmentation of Chromosome Images, pp. 85–90. IEEE, Los Alamitos (1993)
Kittler, J., Illingworth, J.: ‘Minimum Error Thresholding. Pattern Recognition 19(1), 41–47 (1986)
Kullback, S.: Information Theory and Statistics. Dover Publications, Inc., New York (1997)
Li, C.H., Lee, C.K.: Minimum Cross Entropy Thresholding. Pattern Recognition 26(4), 616–626 (1993)
Mello, C.A.B.: A New Entropy and Logarithmic Based Binarization Algorithm for Grayscale Images. In: IASTED VIIP 2004, Hawaii, USA (2004)
McMilan, N.A., Creelman, C.D.: Detection Theory. LEA Pub. (2005)
Otsu, N.: A threshold selection method from gray-level histogram. IEEE Trans.on Systems, Man, and Cybernetics 8, 62–66 (1978)
Parker, J.R.: Algorithms for Image Processing and Computer Vision. John Wiley and Sons, Chichester (1997)
Pun, T.: Entropic Thresholding, A New Approach. C.Graphics and Image Proc. (1981)
Ridler, T.W., Calvard, S.: Picture Thresholding Using an Iterative Selection Method. IEEE Trans. on Systems, Man and Cybernetics SMC-8(8), 630–632 (1978)
Sahoo, P., Wilkins, C., Yeager, J.: Threshold Selection using Renyi’s Entropy. Pattern recognition 30(1), 71–84 (1997)
Shannon, C.: A Mathematical Theory of Communication. Bell System Technology Journal 27, 370–423, 623-656 (1948)
Wu, L., Ma, S., Lu, H.: An Effective Entropic thresholding for Ultrasonic Images, pp. 1552–1554. IEEE, Los Alamitos (1998)
Yager, R.R.: On the Measures of Fuzziness and Negation.Part.1: Membership in the Unit Interval. Int. Journal of Gen. Sys. (1979)
Yan, H.: Unified Formulation of a Class of Image Thresholding Techniques. Pattern Recognition 29(12), 2025–2032 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mello, C.A.B., Costa, A.H.M. (2005). Image Thresholding of Historical Documents Using Entropy and ROC Curves. In: Sanfeliu, A., Cortés, M.L. (eds) Progress in Pattern Recognition, Image Analysis and Applications. CIARP 2005. Lecture Notes in Computer Science, vol 3773. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11578079_93
Download citation
DOI: https://doi.org/10.1007/11578079_93
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29850-2
Online ISBN: 978-3-540-32242-9
eBook Packages: Computer ScienceComputer Science (R0)