Abstract
Camera-based document image retrieval is a task of searching document images from the database based on query images captured using digital cameras. For this task, it is required to solve the problem of “perspective distortion” of images,as well as to establish a way of matching document images efficiently. To solve these problems we have proposed a method called Locally Likely Arrangement Hashing (LLAH) which is characterized by both the use of a perspective invariant to cope with the distortion and the efficiency: LLAH only requires O(N) time where N is the number of feature points that describe the query image. In this paper, we introduce into LLAH an affine invariant instead of the perspective invariant so as to improve its adjustability. Experimental results show that the use of the affine invariant enables us to improve either the accuracy from 96.2% to 97.8%, or the retrieval time from 112 msec./query to 75 msec./query by selecting parameters of processing.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Doermann, D.: The Indexing and Retrieval of Document Images: A Survey. Computer Vision and Image Understanding 70(3), 287–298 (1998)
Hull, J.J.: Document image matching and retrieval with multiple distortion-invariant descriptors. In: Document Analysis Systems, pp. 379–396 (1995)
Doermann, D., Li, H., Kia, O.: The detection of duplicates in document image databases. In: Proc. ICDAR 1997, pp. 314–318 (1997)
Doermann, D., Liang, J., Li, H.: Progress in camera-based document image analysis. In: Proc. ICDAR 2003, pp. 606–616 (2003)
Clark, P., Mirmehdi, M.: Recognising text in real scenes. IJDAR 4, 243–257 (2002)
Pollard, S., Pilu, M.: Building cameras for capturing documents. IJDAR 7, 123–137 (2005)
Wolfson, H.J., Rigoutsos, I.: Geometric hashing: an overview. IEEE Computational Science & Engineering 4(4), 10–21 (1997)
Nakai, T., Kise, K., Iwamura, M.: Hashing with Local Combinations of Feature Points and Its Application to Camera-Based Document Image Retrieval. In: Proc. CBDAR 2005, pp. 87–94 (2005)
Suk, T., Flusser, J.: Point-based projective invariants. Pattern Recognition 33, 251–261 (2000)
Huet, B., Hancock, E.R.: Cartographic indexing into a database of remotely sensed images. In: WACV 1996, pp. 8–14 (1996)
Rothwell, C.A., Zisserman, A., Fosyth, D.A., Mundy, J.L.: Using projective invariants for constant time library indexing in model based vision. In: Proc. BMVC, pp. 62–70 (1991)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nakai, T., Kise, K., Iwamura, M. (2006). Use of Affine Invariants in Locally Likely Arrangement Hashing for Camera-Based Document Image Retrieval. In: Bunke, H., Spitz, A.L. (eds) Document Analysis Systems VII. DAS 2006. Lecture Notes in Computer Science, vol 3872. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11669487_48
Download citation
DOI: https://doi.org/10.1007/11669487_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32140-8
Online ISBN: 978-3-540-32157-6
eBook Packages: Computer ScienceComputer Science (R0)