Abstract
In this paper, we aim to implement augmented reality (AR) on distant text documents or books. For this purpose, we propose a new paper-based AR system that can detect text documents in real scenes, markerize and identify them, estimate their relative 3D poses to the camera, and augment them with virtual contents. Unlike the previous paper-based AR systems (applicable to only close documents), the proposed system not only requires no detection of words or characters, but allows partial occlusions like the previous systems. In our experiments, the proposed system worked at 24 fps and could consistently achieve high identification rates for both occluded and unoccluded pages.
Similar content being viewed by others
Notes
In preliminary experiments, we tested several approaches and the black pixel counting was extremely fast and accurate, robust to the occlusion, and thus the best for our purpose.
Since our marker identification method was not rotation-invariant, each document was in-plane rotated by 0, 90, 180, and 270∘.
References
http://www.marxentlabs.com/5-augmented-reality-books-that-delight-and-engage/. [Online; accessed 2-Sep-2014]
http://virtual.vtt.fi/virtual/proj2/multimedia/alvar/. ALVAR [Online; accessed 2-Sep-2014]
http://opencv.org. OpenCV [Online; accessed 2-Sep-2014]
http://www.opengl.org/. OpenGL [Online; accessed 2-Sep-2014]
https://code.google.com/p/glmetaseq/. GLMetaseq [Online; accessed 2-Sep-2014]
Alahi A, Ortiz R, Vandergheynst P (2012) FREAK: Fast retina keypoint. In: Proceedings of CVPR, pp. 510–517. IEEE
Back M, Cohen J, Gold R, Harrison SR, Minneman SL (2001) Listen Reader: an electronically augmented paper-based book. In: Proceedings of CHI, pp. 23–29. ACM
Bay H, Ess A, Tuytelaars T, Gool LV (2008) SURF: speeded up robust features. Comp Vision Image Underst 110:346–359
Billinghurst M, Dunser A (2012) Augmented reality in the classroom. Computer 45(7):56–63
Billinghurst M, Kato H, Poupyrev I (2001) The MagicBook-moving seamlessly between reality and virtuality. IEEE Comput Graph Appl 21 (3):6–8
Clark AJ, Dnser A (2012) An interactive augmented reality coloring book. In: Proceedings of 3DUI, pp. 7–10. IEEE
Clark P, Mirmehdi M (2002) Recognising text in real scenes. IJDAR 4(4):243–257
Coates A, Carpenter B, Case C, Satheesh S, Suresh B, Wang T, Wu DJ, Ng AY (2011) Text detection and character recognition in scene images with unsupervised feature learning. In: Proceedings of ICDAR, pp. 440–445. IEEE
Duda RO, Hart PE (1972) Use of the hough transformation to detect lines and curves in pictures. Commun ACM 15(1):11–15
Fiala M (2005) ARTag, a fiducial marker system using digital techniques. In: Proceedings of CVPR, pp. 590–596. IEEE Computer Society
Fragoso V, Gauglitz S, Zamora S, Kleban J, Turk M (2011) TranslatAR: a mobile augmented reality translator. In: Proceedings of WACV, pp. 497–502
Gauglitz S, Höllerer T, Turk M (2011) Evaluation of interest point detectors and feature descriptors for visual tracking. Int J Comput Vis 94(3):335–360
Gomez L, Karatzas D (2013) Multi-script text extraction from natural scenes. In: Proceedings of ICDAR, pp. 467–471. IEEE
Grasset R, Dnser A, Seichter H, Billinghurst M (2007) The mixed reality book: a new multimedia reading experience. In: CHI Extended Abstracts, pp. 1953–1958. ACM
Harris C, Stephens M (1988) A combined corner and edge detector. In: Proceedings of the 4th Alvey Vision Conference, pp. 147–151
Hartley R, Zisserman A (2003) Multiple view geometry in computer vision, 2 edn. Cambridge University Press, New York
Hull JJ, Erol B, Graham J, Ke Q, Kishi H, Moraleda J, Olst DGV (2007) Paper-based augmented reality. In: Proceedings of ICAT, pp. 205–209
Kato H, Billinghurst M (1999) Marker tracking and HMD calibration for a video-based augmented reality conferencing system. In: Proceedings of IWAR, pp. 85–94. IEEE Computer Society
Kato H, Billinghurst M, Poupyrev I, Imamoto K, Tachibana K (2000) Virtual object manipulation on a table-top ar environment. In: Proceedings of ISAR, pp. 111–119
Kato H, Tachibana K, Billinghurst M, Grafe M (2003) A registration method based on texture tracking using ARToolKit. Proceedings of ARToolkit Workshop, pp. 77–85. IEEE
Kim K, Lepetit V, Woo W (2010) Scalable real-time planar targets tracking for digilog books. Vis Comput 26(6–8):1145–1154
Leutenegger S, Chli M, Siegwart R (2011) BRISK: binary robust invariant scalable keypoints. In: ICCV, pp. 2548–2555. IEEE
Li N (1993) An implementation of ocr system based on skeleton matching. University of Kent at Canterbury, Computing Laboratory
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vision 60(2):91–110
Nakai T, Kise K, Iwamura M (2007) Camera based document image retrieval with more time and memory efficient LLAH. In: Proc CBDAR
Neumann L, Matas J (2012) Real-time scene text localization and recognition. In: Proceedings of CVPR, pp. 3538–3545
Norrie MC, Palinginis A, Signer B (2005) Content publishing framework for interactive paper documents. In: Proceedings of ACM Symposium on Document Engineering, pp. 187–196. ACM
Park H, Park JI (2004) Invisible marker tracking for AR. In: Proceedings of ISMAR, pp. 272–273. IEEE Computer Society
Park J, Woo W (2012) Hybrid document matching method for page identification of digilog books. T. Edutainment 8:24–34
Peng H, Long F, Chi Z (2003) Document image recognition based on template matching of component block projections. IEEE Trans Pattern Anal Mach Intell 25(9):1188–1192
Petter M, Fragoso V, Turk M, Baur C (2011) Automatic text detection for mobile augmented reality translation. In: Proceedings of ICCV Workshops, pp. 48–55. IEEE
Rekimoto J, Ayatsuka Y (2000) CyberCode: designing augmented reality environments with visual tags. In: Proceedings of DARE, pp. 1–10
Rublee E, Rabaud V, Konolige K, Bradski GR (2011) ORB: an efficient alternative to SIFT or SURF. In: Proceedings of ICCV, pp. 2564–2571
Saso TI, Iguchi K, Inakage M (2003) Little red: storytelling in mixed reality. In: Proceedings of SIGGRAPH. ACM
Scherrer C, Pilet J, Fua P, Lepetit V (2008) The haunted book. In: Proceedings of ISMAR, pp. 163–164. IEEE
Uchiyama H, Saito H (2009) Augmenting text document by on-line learning of local arrangement of keypoints. In: Proc. of ISMAR, pp. 95–98
Wagner D, Langlotz T, Schmalstieg D (2008) Robust and unobtrusive marker tracking on mobile phones. In: Proceedings of ISMAR, pp. 121–124. IEEE
Wagner D, Reitmayr G, Mulloni A, Drummond T, Schmalstieg D (2008) Pose tracking from natural features on mobile phones. In: Proceedings of ISMAR, pp. 125–134. IEEE
Willis KDD, Shiratori T, Mahler M (2013) HideOut: Mobile projector interaction with tangible objects and surfaces. In: Proceedings of TEI, pp. 331–338. ACM
Zhang S, Tian Q, Huang Q, Gao W, Rui Y (2014) USB: ultrashort binary descriptor for fast visual matching and retrieval. In: Proceedings of CVPR. IEEE
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ryu, HS., Park, H. A system for supporting paper-based augmented reality. Multimed Tools Appl 75, 3375–3390 (2016). https://doi.org/10.1007/s11042-014-2439-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-014-2439-7