Skip to main content

Mobile Phone Camera-Based Video Scanning of Paper Documents

  • Conference paper
  • First Online:
Book cover Camera-Based Document Analysis and Recognition (CBDAR 2013)

Abstract

Mobile phone camera-based document video scanning is an interesting research problem which has entered into a new era with the emergence of widely used, processing capable and motion sensors equipped smartphones. We present our ongoing research on mobile phone camera-based document image mosaic reconstruction method for video scanning of paper documents. In this work, we have optimized the classic keypoint feature descriptor-based image registration method, by employing the accelerometer and gyroscope sensor data. Experimental results are evaluated using optical character recognition (OCR) on the reconstructed mosaic from mobile phone camera-based video scanning of paper documents.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alahi, A., Ortiz, R., Vandergheynst, P.: FREAK: fast retina keypoint. In: International Conference on Computer Vision and Pattern Recognition, pp. 510–517 (2012)

    Google Scholar 

  2. Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006)

    Google Scholar 

  3. Fischler, M., Bolles, R.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)

    Article  MathSciNet  Google Scholar 

  4. Hannuksela, J., Sangi, P., Heikkila, J., Liu, X., Doermann, D.: Document image mosaicing with mobile phones. In: International Conference on Image Analysis and Processing, pp. 575–582 (2007)

    Google Scholar 

  5. Jagannathan, L., Jawahar, C.: Perspective correction methods for camera based document analysis. In: International Workshop on Camera-Based Document Analysis and Recognition, pp. 148–154 (2005)

    Google Scholar 

  6. Levenshtein, V.: Binary codes capable of correcting deletions, insertions and reversals. Sov. Phys. Dokl. 10(8), 707–710 (1966)

    MathSciNet  Google Scholar 

  7. Liang, J., DeMenthon, D., Doermann, D.: Mosaicing of camera-captured document images. Comput. Vis. Image Underst. 113(4), 572–579 (2009)

    Article  Google Scholar 

  8. Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. Int. J. Doc. Anal. Recogn. 7(2–3), 84–104 (2005)

    Article  Google Scholar 

  9. Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  10. Muja, M., Lowe, D.: Fast approximate nearest neighbors with automatic algorithm configuration. In: International Conference on Computer Vision Theory and Applications, pp. 331–340 (2009)

    Google Scholar 

  11. Nakai, T., Kise, K., Iwamura, M.: Camera-based document image mosaicing using LLAH. In: Document Recognition and Retrieval XVI, pp. 1–10 (2009)

    Google Scholar 

  12. Rosten, E., Drummond, T.W.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006)

    Google Scholar 

  13. Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: International Conference on Computer Vision, pp. 2564–2571 (2011)

    Google Scholar 

  14. Sawhney, H.S., Hsu, S., Kumar, R.: Robust video mosaicing through topology inference and local to global alignment. In: Burkhardt, H., Neumann, B. (eds.) ECCV 1998. LNCS, vol. 1407, p. 103. Springer, Heidelberg (1998)

    Google Scholar 

  15. Szeliski, R.: Image alignment and stitching. In: Handbook of Mathematical Models in Computer Vision, pp. 273–292. Springer (2006)

    Google Scholar 

  16. Woodman, O.J.: An introduction to inertial navigation. Technical report 696, University of Cambridge, Computer Laboratory, Cambridge (2007)

    Google Scholar 

  17. Yang, Q., Wang, C., Gao, Y., Qu, H., Chang, E.: Inertial sensors aided image alignment and stitching for panorama on mobile phones. In: International Workshop on Mobile Location-Based Service, pp. 21–30 (2011)

    Google Scholar 

Download references

Acknowledgment

The piXL project is supported by the “Fonds national pour la Société Numérique” of the French State by means of the “Programme d’Investissements d’Avenir”, and referenced under PIA-FSN2-PIXL. For more details and resources, visit http://valconum.fr/index.php/les-projets/pixl.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Muhammad Muzzamil Luqman .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Luqman, M.M., Gomez-Krämer, P., Ogier, JM. (2014). Mobile Phone Camera-Based Video Scanning of Paper Documents. In: Iwamura, M., Shafait, F. (eds) Camera-Based Document Analysis and Recognition. CBDAR 2013. Lecture Notes in Computer Science(), vol 8357. Springer, Cham. https://doi.org/10.1007/978-3-319-05167-3_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05167-3_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05166-6

  • Online ISBN: 978-3-319-05167-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics