An Interactive Image Rectification Method Using Quadrangle Hypothesis

  • Satoshi Yonemoto
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8157)


In this paper, we propose an interactive image rectification method for general planar objects. The method offers two interactive techniques that let a user choose the target region of interest: user-stroke based cropping and box based cropping. It can also be applied to non-rectangular objects. The idea is to use horizontal and vertical lines on the target object. We assume that such lines can be detected in abundance; in practice, at least two horizontal lines and two vertical lines must be observed. The method proceeds as follows. First, primitive line segments are detected, and horizontal and vertical line segments are selected using baselines. Next, a quadrangle hypothesis is formed as a combination of four line segments. Each hypothesis is then evaluated by checking whether the re-projected line segments become horizontal (or vertical). The quadrangle hypothesis with the maximum goodness is the final solution. In our experiments, the method produced promising cropping results for several images, and we demonstrated real-time marker-less tracking using the rectified reference image.
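
The hypothesize-and-evaluate loop described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes the horizontal and vertical line segments have already been detected and classified, and every name (`best_quadrangle`, `goodness`, the 100 x 100 target rectangle) is hypothetical. The goodness score used here, the mean alignment of re-projected segments with their expected axis, is an assumed stand-in, since the abstract does not define the actual measure.

```python
import math
from itertools import combinations

def line_through(seg):
    """Homogeneous line through a segment's two endpoints."""
    (x1, y1), (x2, y2) = seg
    return (y1 - y2, x2 - x1, x1 * y2 - x2 * y1)

def intersect(l1, l2):
    """Intersection of two homogeneous lines, or None if nearly parallel."""
    (a1, b1, c1), (a2, b2, c2) = l1, l2
    w = a1 * b2 - a2 * b1
    if abs(w) < 1e-12:
        return None
    return ((b1 * c2 - b2 * c1) / w, (c1 * a2 - c2 * a1) / w)

def solve(A, b):
    """Gauss-Jordan elimination with partial pivoting for a small system."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            return None
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [a - f * c for a, c in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]

def homography(src, dst):
    """3x3 homography (row-major list, last entry 1) from 4 point pairs."""
    A, b = [], []
    for (x, y), (u, v) in zip(src, dst):
        A.append([x, y, 1, 0, 0, 0, -u * x, -u * y]); b.append(u)
        A.append([0, 0, 0, x, y, 1, -v * x, -v * y]); b.append(v)
    h = solve(A, b)
    return None if h is None else h + [1.0]

def warp(H, p):
    """Apply homography H to a 2D point."""
    x, y = p
    w = H[6] * x + H[7] * y + H[8]
    return ((H[0] * x + H[1] * y + H[2]) / w, (H[3] * x + H[4] * y + H[5]) / w)

def goodness(H, horizontals, verticals):
    """Assumed score: mean |cos| between each warped segment and its axis."""
    score, n = 0.0, 0
    for segs, axis in ((horizontals, 0), (verticals, 1)):
        for a, b in segs:
            pa, pb = warp(H, a), warp(H, b)
            d = (pb[0] - pa[0], pb[1] - pa[1])
            length = math.hypot(d[0], d[1])
            if length > 0:
                score += abs(d[axis]) / length
                n += 1
    return score / n if n else 0.0

def best_quadrangle(horizontals, verticals, size=(100.0, 100.0)):
    """Try every 2-horizontal + 2-vertical combination; keep the best score."""
    w, h = size
    target = [(0.0, 0.0), (w, 0.0), (w, h), (0.0, h)]
    best = (-1.0, None, None)
    for h1, h2 in combinations(horizontals, 2):
        for v1, v2 in combinations(verticals, 2):
            top, bottom = sorted((h1, h2), key=lambda s: s[0][1] + s[1][1])
            left, right = sorted((v1, v2), key=lambda s: s[0][0] + s[1][0])
            lt, lb, ll, lr = map(line_through, (top, bottom, left, right))
            corners = [intersect(lt, ll), intersect(lt, lr),
                       intersect(lb, lr), intersect(lb, ll)]
            if any(c is None for c in corners):
                continue
            H = homography(corners, target)
            if H is None:
                continue
            try:
                g = goodness(H, horizontals, verticals)
            except ZeroDivisionError:  # a segment endpoint maps to infinity
                continue
            if g > best[0]:
                best = (g, corners, H)
    return best
```

Calling `best_quadrangle(horizontals, verticals)` returns the score, the four quadrangle corners in image coordinates, and the rectifying homography. The exhaustive search grows quickly with the number of segments, so a practical system would presumably prune candidates first, for example with the user-selected region.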


image rectification, marker-less tracking, text detection



Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Satoshi Yonemoto (1)
  1. Graduate School of Information Science, Kyushu Sangyo University, Japan
