Image Document Categorization using Hidden Tree Markov Models and Structured Representations

Diligenti, Michelangelo; Frasconi, Paolo; Gori, Marco

doi:10.1007/3-540-44732-6_15

Michelangelo Diligenti⁷,
Paolo Frasconi⁸ &
Marco Gori⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2013))

Included in the following conference series:

International Conference on Advances in Pattern Recognition

672 Accesses
3 Citations

Abstract

Categorization is an important problem in image document processing and is often a preliminary step for solving subsequent tasks such as recognition, understanding, and information extraction. In this paper the problem is formulated in the framework of concept learning and each category corresponds to the set of image documents with similar physical structure. We propose a solution based on two algorithmic ideas. First, we transform the image document into a structured representation based on X-Y trees. Compared to “flat” or vector-based feature extraction techniques, structured representations allow us to preserve important relationships between image sub-constituents. Second, we introduce a novel probabilistic architecture that extends hidden Markov models for learning probability distributions defined on spaces of labeled trees.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Y. Bengio and P. Frasconi, “An input output HMM architecture,” in Advances in Neural Information Processing Systems 7, (G. Tesauro, D. Touretzky, and T. Leen, eds.), pp. 427–434, The MIT Press, 1995.
Google Scholar
R. Brugger, A. Zramdini, and R. Ingold, “Modeling documents for structure recognition using generalized n-grams,” in Proceedings of ICDAR, 1997.
Google Scholar
A. Dengel, “Initial learning of document structure,” in Proceedings of ICDAR, pp. 86–90, 1993.
Google Scholar
U. M. Fayyad and K. B. Irani, “Multi-interval discretization of continuous-valued attributes for classification learning,” in Proc. 13th Int. Joint Conf. on Artificial Intelligence, pp. 1022–1027, Morgan Kaufmann, 1993.
Google Scholar
P. Frasconi, M. Gori, and A. Sperduti, “A general framework for adaptive processing of data structures,” IEEE Trans. on Neural Networks, vol. 9, no. 5, pp. 768–786, 1998.
Article Google Scholar
R. C. Gonzalez and M. G. Thomason, Syntactic Pattern Recognition. Reading, Massachusettes: Addison Wesley, 1978.
MATH Google Scholar
D. Heckerman, “Bayesian networks dor data mining,” Data Mining and Knowledge Discovery, vol. 1, no. 1, pp. 79–120, 1997.
Article Google Scholar
F. V. Jensen, S. L. Lauritzen, and K. G. Olosen, “Bayesian updating in recursive graphical models by local computations,” Computational Statistical Quarterly vol. 4, pp. 269–282, 1990.
Google Scholar
G. Nagy and S. Seth, “Hierarchical representation of optically scanned documents,” in Proc. Int. Conf. on Pattern Recognition, pp. 347–349, 1984.
Google Scholar
J. Pearl, Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, 1988.
Google Scholar
L. R. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proc. of the IEEE, vol. 77, no. 2, pp. 257–286, 1989.
Google Scholar
P. Smyth, D. Heckerman, and M. I. Jordan, “Probabilistic independence networks for hidden markov probability models,” Neural Computation, vol. 9, no. 2, pp. 227–269, 1997.
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Information Engineering, Università di Siena, Italy
Michelangelo Diligenti & Marco Gori
Dept. of Systems and Computer Science, Università di Firenze, Italy
Paolo Frasconi

Authors

Michelangelo Diligenti
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Frasconi
View author publications
You can also search for this author in PubMed Google Scholar
Marco Gori
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of Exeter, EX4 4PT, Exeter, UK
Sameer Singh
Computational Intelligence Group, Tuiuti University of Parana, Curitiba, Brazil
Nabeel Murshed
Institute of Computer Aided Automation PRIP-Group 1832, Vienna University of Technology, Favoritenstr. 9/2/4, 1040, Wien, Austria
Walter Kropatsch

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Diligenti, M., Frasconi, P., Gori, M. (2001). Image Document Categorization using Hidden Tree Markov Models and Structured Representations. In: Singh, S., Murshed, N., Kropatsch, W. (eds) Advances in Pattern Recognition — ICAPR 2001. ICAPR 2001. Lecture Notes in Computer Science, vol 2013. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44732-6_15

Download citation

DOI: https://doi.org/10.1007/3-540-44732-6_15
Published: 09 May 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41767-5
Online ISBN: 978-3-540-44732-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics