Abstract
Models defined using higher-order potentials are becoming increasingly popular in computer vision. However, the exact representation of a general higher-order potential defined over many variables is computationally unfeasible. This has led prior works to adopt parametric potentials that can be compactly represented. This paper proposes a non-parametric higher-order model for image labeling problems that uses a patch-based representation of its potentials. We use the transformation scheme of [11, 25] to convert the higher-order potentials to a pair-wise form that can be handled using traditional inference algorithms. This representation is able to capture structure, geometrical and topological information of labels from training data and to provide more precise segmentations. Other tasks such as image denoising and reconstruction are also possible. We evaluate our method on denoising and segmentation problems with synthetic and real images.
Chapter PDF
Similar content being viewed by others
Keywords
References
Becker, C.J., Ali, K., Knott, G., Fua, P.: Learning Context Cues for Synapse Segmentation. IEEE Transactions on Medical Imaging 32(10), 1864–1877 (2013)
Boykov, Y., Kolmogorov, V.: An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 26 (2004)
Buades, A., Coll, B., Morel, J.M.: A non-local algorithm for image denoising. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 60–65 (2005)
Delong, A., Osokin, A., Isack, H.N., Boykov, Y.: Fast approximate energy minimization with label costs. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2173–2180 (2010)
Efros, A.A., Freeman, W.T.: Image quilting for texture synthesis and transfer. In: SIGGRAPH, pp. 341–346 (2001)
Fitzgibbon, A., Wexler, Y., Zisserman, A.: Image-based rendering using image-based priors. In: Proc. International Conference on Computer Vision (ICCV), pp. 1176–1183 (2003)
Glocker, B., Heibel, T.H., Navab, N., Kohli, P., Rother, C.: TriangleFlow: Optical flow with triangulation-based higher-order likelihoods. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part III. LNCS, vol. 6313, pp. 272–285. Springer, Heidelberg (2010)
Hinton, G.E.: Training products of experts by minimizing contrastive divergence. Neural Computation 14(8), 1771–1800 (2002)
Hoiem, D., Rother, C., Winn, J.M.: 3D layoutCRF for multi-view object class recognition and segmentation. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR) (2007)
Kohli, P., Kumar, M., Torr, P.: P 3 and beyond: Solving energies with higher order cliques. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR) (2007)
Kohli, P., Kumar, M.P.: Energy minimization for linear envelope MRFs. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1863–1870 (2010)
Kohli, P., Ladicky, L., Torr, P.H.S.: Robust higher order potentials for enforcing label consistency. International Journal of Computer Vision (IJCV) 82(3), 302–324 (2009)
Kolmogorov, V.: Convergent tree-reweighted message passing for energy minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 28(10), 1568–1583 (2006)
Komodakis, N., Paragios, N.: Beyond pairwise energies: Efficient optimization for higher-order MRFs. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2985–2992 (2009)
Kontschieder, P., Bulo, S., Bischof, H., Pelillo, M.: Structured class-labels in random forests for semantic image labelling. In: Proc. International Conference on Computer Vision (ICCV), pp. 2190–2197 (November 2011)
Krahenbuhl, P., Koltun, V.: Efficient inference in fully connected crfs with gaussian edge potentials. In: Shawe-Taylor, J., Zemel, R., Bartlett, P., Pereira, F., Weinberger, K. (eds.) Advances in Neural Information Processing Systems, vol. 24, pp. 109–117 (2011), http://papers.nips.cc/paper/4296-efficient-inference-in-fully-connected-crfs-with-gaussian-edge-potentials.pdf
Ladicky, L., Russell, C., Kohli, P., Torr, P.H.S.: Graph cut based inference with co-occurrence statistics. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 239–253. Springer, Heidelberg (2010)
Lan, X., Roth, S., Huttenlocher, D., Black, M.: Efficient belief propagation with learned higher-order markov random fields. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3952, pp. 269–282. Springer, Heidelberg (2006)
Liu, C., Yuen, J., Torralba, A.: Nonparametric scene parsing via label transfer. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 33(12), 2368–2382 (2011)
Lucchi, A., Li, Y., Fua, P.: Learning for Structured Prediction Using Approximate Subgradient Descent with Working Sets. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR) (2013)
Nowozin, S., Rother, C., Bagon, S., Sharp, T., Yao, B., Kohli, P.: Decision tree fields. In: Proc. International Conference on Computer Vision (ICCV), pp. 1668–1675 (2011)
Olsson, C., Ulen, J., Boykov, Y., Kolmogorov, V.: Partial enumeration and curvature regularization. In: Proc. International Conference on Computer Vision (ICCV), pp. 2936–2943 (December 2013)
Potetz, B.: Efficient belief propagation for vision using linear constraint nodes. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR) (2007)
Roth, S., Black, M.: Fields of experts: A framework for learning image priors. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 860–867 (2005)
Rother, C., Kohli, P., Feng, W., Jia, J.: Minimizing sparse higher order energy functions of discrete variables. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1382–1389 (2009)
Sharp, T., Rother, C., Nowozin, S., Jancsary, J.: Regression tree fields – an efficient, non-parametric approach to image labeling problems. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2376–2383 (2011)
Shotton, J., Winn, J., Rother, C., Criminisi, A.: textonBoost: Joint appearance, shape and context modeling for multi-class object recognition and segmentation. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 1–15. Springer, Heidelberg (2006)
Szeliski, R., Zabih, R., Scharstein, D., Veksler, O., Kolmogorov, V., Agarwala, A., Tappen, M., Rother, C.: A comparative study of energy minimization methods for Markov random fields. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3952, pp. 16–29. Springer, Heidelberg (2006)
Woodford, O., Torr, P., Reid, I., Fitzgibbon, A.: Global stereo reconstruction under second order smoothness priors. In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Márquez-Neila, P., Kohli, P., Rother, C., Baumela, L. (2014). Non-parametric Higher-Order Random Fields for Image Segmentation. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds) Computer Vision – ECCV 2014. ECCV 2014. Lecture Notes in Computer Science, vol 8694. Springer, Cham. https://doi.org/10.1007/978-3-319-10599-4_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-10599-4_18
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10598-7
Online ISBN: 978-3-319-10599-4
eBook Packages: Computer ScienceComputer Science (R0)