Abstract
Digitization of documents recently has become an important technology. However, it is difficult for existing scanners to read books at high speed and at high resolution simultaneously. In order to realize a promising new book scanning system, we aimed to scan a book containing many pages by using multiple high-speed cameras to acquire images while continuously flipping through the pages, then integrating the images viewed by different cameras to digitize all of the pages. However, high-accuracy integration with the non-uniform rectification required for such input images is a challenging task because the sheets of the document are deformed and the image resolution is so high that misalignment can easily occur. This paper proposes a new multi-camera-array book scanning system and a method of achieving high-accuracy three-dimensional deformation estimation and high-resolution rectification of the distorted document images with a system configuration in which multiple high-speed cameras are arranged with small overlapping captured areas. Experiments using the developed system showed that high-accuracy document images were reconstructed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Nakashima, T., Watanabe, Y., Komuro, T., Ishikawa, M.: Book flipping scanning. In: Adjunct Proceedings of UIST, pp. 79–80 (2009)
Watanabe, Y., Nakashima, T., Komuro, T., Ishikawa, M.: Estimation of non-rigid surface deformation using developable surface model. In: Proceedings of ICPR, pp. 197–200 (2010)
Cao, H., Ding, X., Liu, C.: Rectifying the bound document image captured by the camera: A model based approach. In: Proceedings of ICDAR, pp. 71–75 (2003)
Koo, H.I., Cho, N.I.: State Estimation in a Document Image and Its Application in Text Block Identification and Text Line Extraction. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part II. LNCS, vol. 6312, pp. 421–434. Springer, Heidelberg (2010)
Liang, J., DeMenthon, D., Doermann, D.: Geometric rectification of camera-captured document images. IEEE Transactions on Pattern Analysis and Machine Intelligence 30, 591–605 (2008)
Tian, Y., Narasimhan, S.G.: Rectification and 3D Reconstruction of Curved Document Images. In: Proceedings of ICCV, pp. 377–384 (2011)
Yamashita, A., Kawarago, A., Kaneko, T., Miura, K.T.: Shape reconstruction and image restoration for non-flat surfaces of documents with a stereo vision system. In: Proceedings of ICPR, pp. 482–485 (2004)
Courteille, F., Crouzil, A., Durou, J.D., Gurdjos, P.: Shape from shading for the digitization of curved documents. Machine Vision and Applications 18, 301–316 (2007)
Brown, M.S., Pisula, C.J.: Conformal deskewing of non-planar documents. In: Proceedings of CVPR, pp. 998–1004 (2005)
Brown, M.S., Seales, W.B.: Image restoration of arbitrarily warped documents. IEEE Transactions on Pattern Analysis and Machine Intelligence 26, 129–1306 (2004)
Tanimoto, M.: Overview of free viewpoint television. Transactions on Signal Processing: Image Communication 6, 454–461 (2006)
Ng, R., Levoy, M., Bredif, M., Duval, G., Horowitz, M., Hanrahan, P., Design, D.: Light field photography with a hand-held plenoptic camera. Stanford Tech. Report CTSR 2005-02 (2005)
Wilburn, B., Joshi, N., Vaish, V., Talvala, E.V., Antunez, E., Barth, A., Adams, A., Horowitz, M., Levoy, M.: High performance imaging using large camera arrays. ACM Transactions on Graphics 24, 765–776 (2005)
Brown, M., Lowe, D.G.: Automatic panoramic image stitching using invariant features. International Journal of Computer Vision 74, 59–73 (2007)
Lowe, D.G.: Object recognition from local scale-invariant features. In: Proceedings of the International Conference on Computer Vision, pp. 1150–1157 (2007)
Carmo, M.P.D.: Differential Geometry of Curves and Surfaces. Prentice Hall (1976)
Liang, J., DeMenthon, D., Doermann, D.: Unwarping Images of Curved Documents Using Global Shape Optimization. In: Proceedings of CBDAR, pp. 25–29 (2005)
Gumerov, N.A., Zandifar, A., Duraiswami, R., Davis, L.S.: 3d structure recovery and unwarping of surfaces applicable to planes. International Journal of Computer Vision 66, 261–281 (2006)
Burt, P.J., Adelson, E.H.: A multiresolution spline with application to image mosaics. ACM Transactions on Graphics 2, 217–236 (1983)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Watanabe, Y., Itoyama, K., Yamada, M., Ishikawa, M. (2013). Digitization of Deformed Documents Using a High-Speed Multi-camera Array. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds) Computer Vision – ACCV 2012. ACCV 2012. Lecture Notes in Computer Science, vol 7725. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37444-9_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-37444-9_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37443-2
Online ISBN: 978-3-642-37444-9
eBook Packages: Computer ScienceComputer Science (R0)