Ambiguity Modeling in Latent Spaces

Ek, Carl Henrik; Rihan, Jon; Torr, Philip H. S.; Rogez, Grégory; Lawrence, Neil D.

doi:10.1007/978-3-540-85853-9_6

Carl Henrik Ek¹,
Jon Rihan¹,
Philip H. S. Torr¹,
Grégory Rogez² &
…
Neil D. Lawrence³

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5237))

Included in the following conference series:

International Workshop on Machine Learning for Multimodal Interaction

933 Accesses
23 Citations

Abstract

We are interested in the situation where we have two or more representations of an underlying phenomenon. In particular we are interested in the scenario where the representation are complementary. This implies that a single individual representation is not sufficient to fully discriminate a specific instance of the underlying phenomenon, it also means that each representation is an ambiguous representation of the other complementary spaces. In this paper we present a latent variable model capable of consolidating multiple complementary representations. Our method extends canonical correlation analysis by introducing additional latent spaces that are specific to the different representations, thereby explaining the full variance of the observations. These additional spaces, explaining representation specific variance, separately model the variance in a representation ambiguous to the other. We develop a spectral algorithm for fast computation of the embeddings and a probabilistic model (based on Gaussian processes) for validation and inference. The proposed model has several potential application areas, we demonstrate its use for multi-modal regression on a benchmark human pose estimation data set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agarwal, A., Triggs, B.: Recovering 3 d human pose from monocular images. IEEE Transactions on Pattern Analysis and Machine Intelligence 28(1), 44–58 (2006)
Article Google Scholar
Ek, C.H., Torr, P.H.S., Lawrence, N.D.: Gaussian process latent variable models for human pose estimation. In: Popescu-Belis, A., Renals, S., Bourlard, H. (eds.) MLMI 2007. LNCS, vol. 4892, pp. 132–143. Springer, Heidelberg (2008)
Chapter Google Scholar
Harmeling, S.: Exploring model selection techniques for nonlinear dimensionality reduction. Technical Report EDI-INF-RR-0960, University of Edinburgh (2007)
Google Scholar
Kuss, M., Graepel, T.: The geometry of kernel canonical correlation analysis. Technical Report TR-108, Max Planck Institute for Biological Cybernetics, Tübingen, Germany (2003)
Google Scholar
Lawrence, N.D.: Probabilistic non-linear principal component analysis with Gaussian Process latent variable models. J. Mach. Learn. Res. 6, 1783–1816 (2005)
MathSciNet Google Scholar
Lawrence, N.D., Quionero-Candela, J.: Local distance preservation in the GP-LVM through back constraints. In: Greiner, R., Schuurmans, D. (eds.) ICML 2006, vol. 21, pp. 513–520. ACM, New York (2006)
Chapter Google Scholar
Navaratnam, R., Fitzgibbon, A., Cipolla, R.: The joint manifold model. In: IEEE International Conference on Computer Vision (ICCV) (2007)
Google Scholar
Rasmussen, C.E., Williams, C.K.I.: Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning). MIT Press, Cambridge (2005)
Google Scholar
Sanguinetti, G., Lawrence, N.D.: Missing data in kernel PCA. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212. Springer, Heidelberg (2006)
Chapter Google Scholar
Scholkopf, B., Smola, A.J.: Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, Cambridge (2001)
Google Scholar
Shon, A.P., Grochow, K., Hertzmann, A., Rao, R.P.N.: Learning shared latent structure for image synthesis and robotic imitation. In: Proc. NIPS, pp. 1233–1240 (2006)
Google Scholar
Sigal, L., Black, M.J.: Humaneva: Synchronized video and motion capture dataset for evaluation of articulated human motion. Brown Univertsity TR (2006)
Google Scholar
Sminchisescu, C., Kanaujia, A., Li, Z., Metaxas, D.: Discriminative density propagation for 3d human motion estimation. In: Proc. Conf. Computer Vision and Pattern Recognition, pp. 217–323 (2005)
Google Scholar
Weinberger, K.Q., Sha, F., Saul, L.K.: Learning a kernel matrix for nonlinear dimensionality reduction. In: ACM International Conference Proceeding Series (2004)
Google Scholar
Zhu, Q., Avidan, S., Yeh, M.C., Cheng, K.T.: Fast Human Detection Using a Cascade of Histograms of Oriented Gradients. CVPR 1(2), 4 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Oxford Brookes University, UK
Carl Henrik Ek, Jon Rihan & Philip H. S. Torr
University of Zaragoza, Spain
Grégory Rogez
University of Manchester, UK
Neil D. Lawrence

Authors

Carl Henrik Ek
View author publications
You can also search for this author in PubMed Google Scholar
Jon Rihan
View author publications
You can also search for this author in PubMed Google Scholar
Philip H. S. Torr
View author publications
You can also search for this author in PubMed Google Scholar
Grégory Rogez
View author publications
You can also search for this author in PubMed Google Scholar
Neil D. Lawrence
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Andrei Popescu-Belis Rainer Stiefelhagen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ek, C.H., Rihan, J., Torr, P.H.S., Rogez, G., Lawrence, N.D. (2008). Ambiguity Modeling in Latent Spaces. In: Popescu-Belis, A., Stiefelhagen, R. (eds) Machine Learning for Multimodal Interaction. MLMI 2008. Lecture Notes in Computer Science, vol 5237. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85853-9_6

Download citation

DOI: https://doi.org/10.1007/978-3-540-85853-9_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85852-2
Online ISBN: 978-3-540-85853-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics