Abstract
Mobile phone camera-based document video scanning is an interesting research problem which has entered into a new era with the emergence of widely used, processing capable and motion sensors equipped smartphones. We present our ongoing research on mobile phone camera-based document image mosaic reconstruction method for video scanning of paper documents. In this work, we have optimized the classic keypoint feature descriptor-based image registration method, by employing the accelerometer and gyroscope sensor data. Experimental results are evaluated using optical character recognition (OCR) on the reconstructed mosaic from mobile phone camera-based video scanning of paper documents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alahi, A., Ortiz, R., Vandergheynst, P.: FREAK: fast retina keypoint. In: International Conference on Computer Vision and Pattern Recognition, pp. 510–517 (2012)
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006)
Fischler, M., Bolles, R.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)
Hannuksela, J., Sangi, P., Heikkila, J., Liu, X., Doermann, D.: Document image mosaicing with mobile phones. In: International Conference on Image Analysis and Processing, pp. 575–582 (2007)
Jagannathan, L., Jawahar, C.: Perspective correction methods for camera based document analysis. In: International Workshop on Camera-Based Document Analysis and Recognition, pp. 148–154 (2005)
Levenshtein, V.: Binary codes capable of correcting deletions, insertions and reversals. Sov. Phys. Dokl. 10(8), 707–710 (1966)
Liang, J., DeMenthon, D., Doermann, D.: Mosaicing of camera-captured document images. Comput. Vis. Image Underst. 113(4), 572–579 (2009)
Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. Int. J. Doc. Anal. Recogn. 7(2–3), 84–104 (2005)
Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Muja, M., Lowe, D.: Fast approximate nearest neighbors with automatic algorithm configuration. In: International Conference on Computer Vision Theory and Applications, pp. 331–340 (2009)
Nakai, T., Kise, K., Iwamura, M.: Camera-based document image mosaicing using LLAH. In: Document Recognition and Retrieval XVI, pp. 1–10 (2009)
Rosten, E., Drummond, T.W.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006)
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: International Conference on Computer Vision, pp. 2564–2571 (2011)
Sawhney, H.S., Hsu, S., Kumar, R.: Robust video mosaicing through topology inference and local to global alignment. In: Burkhardt, H., Neumann, B. (eds.) ECCV 1998. LNCS, vol. 1407, p. 103. Springer, Heidelberg (1998)
Szeliski, R.: Image alignment and stitching. In: Handbook of Mathematical Models in Computer Vision, pp. 273–292. Springer (2006)
Woodman, O.J.: An introduction to inertial navigation. Technical report 696, University of Cambridge, Computer Laboratory, Cambridge (2007)
Yang, Q., Wang, C., Gao, Y., Qu, H., Chang, E.: Inertial sensors aided image alignment and stitching for panorama on mobile phones. In: International Workshop on Mobile Location-Based Service, pp. 21–30 (2011)
Acknowledgment
The piXL project is supported by the “Fonds national pour la Société Numérique” of the French State by means of the “Programme d’Investissements d’Avenir”, and referenced under PIA-FSN2-PIXL. For more details and resources, visit http://valconum.fr/index.php/les-projets/pixl.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Luqman, M.M., Gomez-Krämer, P., Ogier, JM. (2014). Mobile Phone Camera-Based Video Scanning of Paper Documents. In: Iwamura, M., Shafait, F. (eds) Camera-Based Document Analysis and Recognition. CBDAR 2013. Lecture Notes in Computer Science(), vol 8357. Springer, Cham. https://doi.org/10.1007/978-3-319-05167-3_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-05167-3_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05166-6
Online ISBN: 978-3-319-05167-3
eBook Packages: Computer ScienceComputer Science (R0)