Abstract
This paper presents a method for the rectification of planar objects. Based on the 2D Manhattan world assumption (i.e., the majority of line segments are aligned with two principal axes), we develop a cost function whose minimization yields a rectifying transform. We parameterize the homography with camera parameters and design a cost function that measures the axis-alignment of line segments. Since line segment detection inevitably produces outliers, we also develop an iterative optimization scheme for robust estimation. Experimental results on a range of images containing planar objects show that our method performs rectification robustly and accurately.
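The homography parameterization mentioned above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: it assumes the common pure-rotation model \(\mathrm {H} = \mathrm {K}\mathrm {R}\mathrm {K}^{-1}\) with \(\mathrm {K} = \mathrm {diag}(f, f, 1)\) (principal point at the origin) and a Z-Y-X Euler-angle convention for the three rotation angles \(\theta _1, \theta _2, \theta _3\); the paper's exact conventions may differ.

```python
import math

def matmul(a, b):
    """3x3 matrix product (row-major lists of lists)."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def rotation(t1, t2, t3):
    """R = Rz(t3) Ry(t2) Rx(t1); the angle convention is an assumption."""
    cx, sx = math.cos(t1), math.sin(t1)
    cy, sy = math.cos(t2), math.sin(t2)
    cz, sz = math.cos(t3), math.sin(t3)
    rx = [[1, 0, 0], [0, cx, -sx], [0, sx, cx]]
    ry = [[cy, 0, sy], [0, 1, 0], [-sy, 0, cy]]
    rz = [[cz, -sz, 0], [sz, cz, 0], [0, 0, 1]]
    return matmul(rz, matmul(ry, rx))

def homography(t1, t2, t3, f):
    """H = K R K^{-1} with K = diag(f, f, 1): the homography induced by
    a pure camera rotation at focal length f."""
    K = [[f, 0, 0], [0, f, 0], [0, 0, 1]]
    Kinv = [[1.0 / f, 0, 0], [0, 1.0 / f, 0], [0, 0, 1]]
    return matmul(K, matmul(rotation(t1, t2, t3), Kinv))
```

With all angles zero the rotation is the identity and the homography reduces to the identity, regardless of \(f\); the four scalars \((\theta _1, \theta _2, \theta _3, f)\) are exactly the parameters optimized in the appendix below.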
References
Buenaposada, J.M., Baumela, L.: Real-time tracking and estimation of plane pose. In: Proceedings of International Conference on Pattern Recognition, pp. 697–700 (2002)
Clark, P., Mirmehdi, M.: Estimating the orientation and recovery of text planes in a single image. In: Proceedings of the 12th British Machine Vision Conference, pp. 421–430 (2001)
Cobzas, D., Jagersand, M., Sturm, P.: 3D SSD tracking with estimated 3D planes. Image Vis. Comput. 27, 69–79 (2009)
Corral-Soto, E.R., Elder, J.H.: Automatic single-view calibration and rectification from parallel planar curves. In: European Conference on Computer Vision, pp. 813–827 (2014)
Grompone von Gioi, R., Jakubowicz, J., Morel, J.M., Randall, G.: LSD: a fast line segment detector with a false detection control. IEEE Trans. Pattern Anal. Mach. Intell. 32(4), 722–732 (2010)
Hanbury, A., Wildenauer, H.: Robust camera self-calibration from monocular images of Manhattan worlds. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2831–2838 (2012)
Hartl, A., Reitmayr, G.: Rectangular target extraction for mobile augmented reality applications. In: Proceedings of International Conference on Pattern Recognition, pp. 81–84 (2012)
Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge (2000)
Hua, G., Liu, Z., Zhang, Z., Wu, Y.: Automatic business card scanning with a camera. In: IEEE International Conference on Image Processing, pp. 373–376 (2006)
Hua, G., Liu, Z., Zhang, Z., Wu, Y.: Iterative local-global energy minimization for automatic extraction of objects of interest. IEEE Trans. Pattern Anal. Mach. Intell. 28(10), 1701–1706 (2006)
Korč, F., Förstner, W.: eTRIMS Image Database for interpreting images of man-made scenes. Tech. Rep. TR-IGG-P-2009-01, Department of Photogrammetry, University of Bonn (2009)
Lee, H., Shechtman, E., Wang, J., Lee, S.: Automatic upright adjustment of photographs. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 877–884 (2012)
Lee, H., Shechtman, E., Wang, J., Lee, S.: Automatic upright adjustment of photographs with robust camera calibration. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 833–844 (2014)
Lee, W., Park, Y., Lepetit, V.: Video-based in situ tagging on mobile phones. IEEE Trans. Circuits Syst. Video Technol. 21, 1487–1496 (2011)
Liebowitz, D., Zisserman, A.: Metric rectification for perspective images of planes. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 482–488 (1998)
Mirzaei, F., Roumeliotis, S.: Optimal estimation of vanishing points in a Manhattan world. In: IEEE International Conference on Computer Vision, pp. 2454–2461 (2011)
Monasse, P., Morel, J.M., Tang, Z.: Three-step image rectification. In: The British Machine Vision Conference, pp. 89.1–10 (2010)
Moré, J.J.: The Levenberg–Marquardt algorithm: implementation and theory. In: Watson, G.A. (ed.) Numerical Analysis. Lecture Notes in Mathematics, vol. 630, pp. 105–116. Springer, Berlin, Heidelberg (1978)
Pilu, M.: Extraction of illusory linear clues in perspectively skewed documents. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. I363–I368 (2001)
Pritts, J., Chum, O., Matas, J.: Detection, rectification and segmentation of coplanar repeated patterns. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2973–2980 (2014)
Tardif, J.P.: Non-iterative approach for fast and accurate vanishing point detection. In: IEEE International Conference on Computer Vision, pp. 1250–1257 (2009)
Tretyak, E., Barinova, O., Kohli, P., Lempitsky, V.: Geometric image parsing in man-made environments. Int. J. Comput. Vis. 97(3), 305–321 (2012)
Xu, C., Kuipers, B., Murarka, A.: 3D pose estimation for planes. In: Proceedings of International Conference on Computer Vision Workshops, pp. 673–680 (2009)
Xu, Y., Oh, S., Hoogs, A.: A minimum error vanishing point detection approach for uncalibrated monocular images of man-made environments. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1376–1383 (2013)
Zaheer, A., Rashid, M., Khan, S.: Shape from angle regularity. In: Proceedings of the 12th European Conference on Computer Vision, Part VI, pp. 1–14 (2012)
Zhang, Z., Ganesh, A., Liang, X., Ma, Y.: TILT: transform invariant low-rank textures. Int. J. Comput. Vis. 99(1), 1–24 (2012)
Zhang, Z., Matsushita, Y., Ma, Y.: Camera calibration with lens distortion from low-rank textures. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2321–2328 (2011)
Acknowledgments
This research was supported by the MSIP (Ministry of Science, ICT and Future Planning), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2015-H8501-15-1016) supervised by the IITP (Institute for Information & communications Technology Promotion).
Appendix: Jacobian matrix of the proposed cost function
The cost function (12) can be minimized via the Levenberg–Marquardt algorithm. For an efficient implementation of the algorithm, we need the derivatives of (24) and (25) with respect to the four parameters (i.e., \(t \in \{\theta _1, \theta _2, \theta _3, f\}\)). However, the \(\min (\cdot ,\cdot )\) function in (24) is not differentiable at some points; therefore, we use a simple approximation that differentiates the active branch:
\[ \frac{\partial }{\partial t}\min \left( f(\cdot ), g(\cdot )\right) \approx {\left\{ \begin{array}{ll} \dfrac{\partial f(\cdot )}{\partial t}, &{} \text {if } f(\cdot ) < g(\cdot ),\\ \dfrac{\partial g(\cdot )}{\partial t}, &{} \text {otherwise.} \end{array}\right. } \]
Although this approximation is ambiguous when \(f(\cdot ) = g(\cdot )\), this case seldom occurs and the approximation works well in practice. The \(\min (\cdot ,\cdot )\) and \(|\cdot |\) functions are handled in the same manner.
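The branch-selection approximation is easy to check numerically. The sketch below is illustrative (the functions \(f(t)=t^2\) and \(g(t)=1-t\) are arbitrary toy choices, not the paper's cost terms): away from the tie \(f = g\), the branch derivative matches a central finite difference of \(\min (f, g)\).

```python
def dmin_dt(f_val, g_val, df_dt, dg_dt):
    # d/dt min(f, g), approximated by the derivative of the active branch.
    # The tie f == g is the ambiguous case; we arbitrarily pick f there.
    return df_dt if f_val <= g_val else dg_dt

def m(t):
    # Toy example: min of f(t) = t^2 and g(t) = 1 - t.
    return min(t * t, 1.0 - t)
```

At \(t = 0.3\) the \(f\) branch is active (\(0.09 < 0.7\)), so the approximation returns \(2t = 0.6\), in agreement with the finite difference.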
1.1 Derivatives of (24)
Let us denote \(\hat{\varvec{p}}=\mathrm {H}^{-1}\mathbf {u} = \begin{bmatrix} \hat{p}_1&\hat{p}_2&\hat{p}_3 \end{bmatrix}^\top \) and \(\hat{\varvec{q}}=\mathrm {H}^{-1}\mathbf {v} = \begin{bmatrix} \hat{q}_1&\hat{q}_2&\hat{q}_3 \end{bmatrix}^\top \), and denote their inhomogeneous representations by \(\tilde{\varvec{p}} = \begin{bmatrix} x_1(\cdot )&y_1(\cdot ) \end{bmatrix}^\top \) and \(\tilde{\varvec{q}} = \begin{bmatrix} x_2(\cdot )&y_2(\cdot ) \end{bmatrix}^\top \), respectively. Then, the derivative of (24) with respect to \(t\) can be decomposed into four cases:
Since we can derive \(\frac{\partial \tilde{\varvec{p}}}{\partial t} = \begin{bmatrix} \frac{\partial {x_1(\cdot )}}{\partial t}&\frac{\partial {y_1(\cdot )}}{\partial t} \end{bmatrix}^\top \) using the chain rule:
\[ \frac{\partial \tilde{\varvec{p}}}{\partial t} = \begin{bmatrix} \frac{1}{\hat{p}_3} &{} 0 &{} -\frac{\hat{p}_1}{\hat{p}_3^2} \\ 0 &{} \frac{1}{\hat{p}_3} &{} -\frac{\hat{p}_2}{\hat{p}_3^2} \end{bmatrix} \frac{\partial \hat{\varvec{p}}}{\partial t}, \]
it is sufficient to compute \(\frac{\partial \hat{\varvec{p}}}{\partial t}\) and \(\frac{\partial \hat{\varvec{q}}}{\partial t}\) to obtain (27). Using (8), we obtain \(\frac{\partial \hat{\varvec{p}}}{\partial t}\) for each parameter:
The derivative \(\frac{\partial \hat{\varvec{q}}}{\partial t}\) can be obtained in a similar way.
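The inhomogenization step (from \(\hat{\varvec{p}}\) to \(\tilde{\varvec{p}}\)) reduces to the quotient rule, which can be verified with a small self-contained sketch (the curve \(\hat{\varvec{p}}(t) = (t, 2t, 1+t)^\top \) is an arbitrary toy example, not from the paper):

```python
def dinhomog_dt(p_hat, dp_hat_dt):
    # Quotient rule for x = p1/p3, y = p2/p3 given the elementwise
    # derivative of the homogeneous vector p_hat = (p1, p2, p3).
    p1, p2, p3 = p_hat
    d1, d2, d3 = dp_hat_dt
    return ((d1 * p3 - p1 * d3) / p3 ** 2,
            (d2 * p3 - p2 * d3) / p3 ** 2)
```

For \(\hat{\varvec{p}}(t) = (t, 2t, 1+t)^\top \) at \(t = 1\), the inhomogeneous point is \((t/(1+t),\, 2t/(1+t))\) with derivative \((1/(1+t)^2,\, 2/(1+t)^2) = (0.25, 0.5)\), matching the quotient-rule output.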
1.2 Derivatives of (25)
The derivative of (25) with respect to \(f\) can be obtained similarly:
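To see how these analytic derivatives feed the optimizer, here is a toy Levenberg–Marquardt update in pure Python. It is a sketch under simplifying assumptions (a single scalar parameter and linear residuals), not the paper's solver: the update solves \((\mathrm {J}^\top \mathrm {J} + \lambda )\,\delta = -\mathrm {J}^\top \mathbf {r}\) with the analytic Jacobian entries \(J_i = \partial r_i/\partial a\).

```python
def lm_step_scalar(xs, ys, a, lam):
    # One Levenberg-Marquardt update for the toy least-squares problem
    # r_i(a) = a*x_i - y_i, whose analytic Jacobian is J_i = x_i.
    r = [a * x - y for x, y in zip(xs, ys)]
    jtj = sum(x * x for x in xs)          # J^T J (a scalar here)
    jtr = sum(x * ri for x, ri in zip(xs, r))  # J^T r
    return a - jtr / (jtj + lam)
```

With \(\lambda = 0\) this is a pure Gauss–Newton step and solves a linear problem in one iteration; increasing \(\lambda \) damps the step, which is how the algorithm trades between gradient descent and Gauss–Newton when the cost is nonlinear, as in (12).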
Cite this article
An, J., Koo, H.I. & Cho, N.I. Rectification of planar targets using line segments. Machine Vision and Applications 28, 91–100 (2017). https://doi.org/10.1007/s00138-016-0807-1