Abstract
This paper presents a novel approach to represent spatio-temporal visual information. We introduce a surface-based shape model whose structure is invariant to surface variations over time to describe 3D dynamic surfaces (e.g., 3D video obtained from multiview video capture). The descriptor is defined as a graph lying on object surfaces and anchored to invariant local features (e.g., surface point extrema). Geodesic consistency-based priors are used as cues within a probabilistic framework to maintain the graph invariant, even though the surfaces undergo non-rigid deformations. Our contribution brings to 3D geometric data a temporally invariant structure that relies only on intrinsic surface properties, and is independent of surface parameterization (i.e., surface mesh connectivity). The proposed descriptor can therefore be used for efficient dynamic surface encoding, through transformation into 2D (geometry) images, as its structure can provide an invariant representation for dynamic 3D mesh models. Various experiments on challenging publicly available datasets are performed to assess invariant property and performance of the descriptor.
Similar content being viewed by others
Notes
A path on a surface is a set of points linked two-by-two by a line.
References
de Aguiar, E., Stoll, C., Theobalt, C., Ahmed, N., Seidel, H.P., Thrun, S.: Performance capture from sparse multi-view video. ACM Trans Graphics 27(3), 98:1–98:10 (2008)
Alexa, M., Müllen, W.: Representing animations by principal components. Computer Gr. Forum 19(3), 411–418 (2000)
Allard, J., Ménier, C., Raffin, B., Boyer, E., Faure, F.: Grimage: Markerless 3d Interactions. ACM SIGGRAPH—Emerging Technologies (2007)
Alliez, P., Gotsman, C.: Recent advances in compression of 3d meshes. Adv. Multiresolution Geometric Model. Springer-Verlag (2005)
Baran, I., Popovic, J.: Automatic rigging and animation of 3d characters. ACM Trans Gr. 26(3), 27 (2007)
Blum, H.: A Transformation for Extracting New Descriptors of Shape. Models for the Perception of Speech and Visual Form. MIT Press, pp. 362–380 (1967)
Briceno, H., Sandler, P., McMillian, L., Gortler, S., Hoppe, H.: Geometry videos: a new representation for 3d animations. In: Eurographics/SIGGRAPH Symp Computer, Animation, pp. 136–146 (2003)
Bronstein, A.M., Bronstein, M.M., Kimmel, R.: Calculus of non-rigid surfaces for geometry and texture manipulation. In: IEEE Trans Vis. Computer Gr., pp. 902–913 (2007)
Cagniart, C., Boyer, E., Ilic, S.: Probabilistic deformable surface tracking from multiple videos. In: Proceedings of European Conf Computer Vision (2010)
Carr, N., Hoberock, J., Crane, K., Hart, J.: Rectangular multi-chart geometry images. In: Proceedings of Eurographics Symp Geometry Processing, pp. 181–190 (2006)
Carranza, J., Theobalt, C., Magnor, M., Seidel, H.P.: Free-viewpoint video of human actors. ACM Trans Gr. 22(3), 569–577 (2003)
Cornea, N., Silver, D., Yuan, X., Balasubramanian, R.: Computing hierarchical curve skeletons of 3d objects. Vis. Computer J. 21(11), 945–955 (2005)
Edelsbrunner, H., Harer, J., Mascarenhas, A., Pascucci, V.: Time-varying reeb graphs for continuous space-time data. In: Proceedings of Symp Computational Geometry (2004)
Erickson, J., Har-Peled, S.: Optimally cutting a surface into a disk. Discrete Comput. Geometry 31(1), 37–59 (2004)
Floater, M.: Parametrization and smooth approximation of surface triangulations. Computer Aided Geometric Design 14(3), 231–250 (1997)
Forsyth, D.A., Mundy, J.L., Zisserman, A., Coelho, C., Heller, A., Rothwell, C.: Invariant descriptors for 3d object recognition and pose. IEEE Trans Pattern Anal. Mach. Intell. 13(10), 971–991 (1991)
Franco, J., Menier, C., Boyer, E., Raffin, B.: A distributed approach for real-time 3d modeling. In: Proceedings of IEEE Conf Computer Vision Pattern Recognition Workshop on real-time 3D sensors and their applications (2004)
Gu, X., Gortler, S., Hoppe, H.: Geometry images. ACM Trans. Gr. (SIGGRAPH) 21(3), 355–361 (2002)
Guo, Y.W., Wang, J., Cui, X.F., Peng, Q.S.: A New Constrained Texture Mapping Method. Entertainment Computing. ICEC Springer, Springer LNCS, Berlin Heidelberg (2005)
Habe, H., Katsura, Y., Matsuyama, T.: Skin-off: representation and compression scheme for 3d video. In: Proceedings of picture coding symposium (2004)
Hilaga, M., Shinagawa, Y., Kohmura, T., Kunii, T.L.: Topology matching for fully automatic similarity estimation of 3d shapes. ACM SIGGRAPH, pp. 203–212 (2001)
Huang, P., Tung, T., Nobuhara, S., Hilton, A., Matsuyama, T.: Comparison of skeleton and non-skeleton shape descriptors for 3d video. In: Proceedings of 3DPVT (2010)
Jiang, H., Liu, H., Tan, P., Zhang, G., Bao, H.: 3d reconstruction of dynamic scenes with multiple handheld cameras. In: Proceedings of European Conf computer vision (2012)
Kanade, T., Yoshida, A., Oda, K., Kano, H., Tanaka, M.: A stereo machine for video-rate dense depth mapping and its new applications. In: Proceedings of IEEE Conf computer vision pattern recognition (1996)
Karni, Z., Gotsman, C.: Compression of soft-body animation sequence. Computers Gr. 28, 25–34 (2004)
Kavan, L., Collins, S., Žára, J., O’Sullivan, C.: Skinning with dual quaternions. In: Proceedings of Symposium on interactive 3D graphics and games, pp. 39–46 (2007)
Klein, T., Ertl, T.: Scale-space tracking of critical points in 3d vector fields. In: Proceedings of topology-based methods in visualization (2005)
Mamou, K., Zaharia, T., Preteux, F., Stefanoski, N., Ostermann, J.: Frame-based compression of animated meshes in mpeg-4. In: Proceedings of IEEE Int’l Conf Multimedia and Expo (2008)
Matsuyama, T., Wu, X., Takai, T., Nobuhara, S.: Real-time 3d shape reconstruction, dynamic 3d mesh deformation, and high fidelity visualization for 3d video. Computer Vis. Image Understand. 96(3), 393–434 (2004)
Matsuyama, T., Nobuhara, S., Takai, T., Tung, T.: 3d Video and its Applications. Springer, London (2012)
Mémoli, F., Sapiro, G.: A theoretical and computational framework for isometry invariant recognition of point cloud data. Found Comput. Math. 5(3), 313–347 (2005)
Morse, M.: The calculus of variations in the large. American Mathematical Society, Colloquium Publication, New York, p 18 (1934)
Mortara, M., Patanè, G.: Affine-invariant skeleton of 3d shapes. In: Proceedings of Shape Modeling International (2002)
Mundy, J,, Zisserman, A.: Geometric invariance in computer vision. MIT Press, Cambridge (1992)
CR Cignoni, P., Scopigno, R.: Metro: measuring error on simplified surfaces. Computer Gr. Forum 17(2), 167–174 (1998)
Palagyi, K., Kuba, A.: A parallel 3d 12-subiteration thinning algorithm. Graph Models Image Proc 61(4), 199–221 (1999)
Pascucci, V., Scorzelli, G., Bremer, P.T., Mascarenhas, A.: Robust on-line computation of reeb graphs: simplicity and speed. ACM Trans Gr. 26 (3), 58:1–58:9 (2007)
Reeb, G.: On the singular points of a completely integrable pfaff form or of a numerical function. Comptes Rendus Acad. Sci. Paris 222, 847–849 (1946)
Rothganger, F., Lazebnik, S., Schmid, C., Ponce, J.: 3d object modeling and recognition using local affine-invariant image descriptors and multi-view spatial constraints. Int’l J. Computer Vis. 66(3), 231–259 (2006)
Saboret, L., Alliez, P., Lévy, B.: Planar parameterization of triangulated surface meshes, 40 edn. In: CGAL reference manual CGAL Editorial Board (2012)
Sander, P., Wood, Z., Gortler, S., Snyder, J., Hoppe, H.: Multi-chart geometry images. In: Proceedings of Eurographics Symp geometry processing, pp. 146–155 (2003)
Seitz, S., Curless, B., Diebel, J., Scharstein, D., Szeliski, R.: A comparison and evaluation of multi-view stereo reconstruction algorithms. In: Proceedings of IEEE Conf computer vision pattern recognition (2006)
Starck, J., Hilton, A.: Spherical matching for temporal correspondence of non-rigid surfaces. In: Proceedings of IEEE Int’l Conf computer vision (2005)
Starck, J., Hilton, A.: Surface capture for performance-based animation. IEEE Computer Gr. Appl. 27(3), 21–31 (2007)
Sumner, R.W., Popovic, J.: Deformation transfer for triangle meshes. ACM Trans Gr. 23(3), 399–405 (2004)
Taubin, G., Rossignac, J.: Geometric compression through topological surgery. ACM Trans Gr. 17(2), 84–115 (1998)
Tung, T., Matsuyama, T.: Dynamic surface matching by geodesic mapping for 3d animation transfer. In: Proceedings of IEEE Conf computer vision pattern recognition (2010)
Tung, T., Matsuyama, T.: Invariant surface-based shape descriptor for dynamic surface encoding. In: Proceedings of Asian Conf computer vision (2012a)
Tung, T., Matsuyama, T.: Topology dictionary for 3d video understanding. IEEE Trans Pattern Anal. Mach. Intell. 34(8), 1645–1657 (2012b)
Tung, T., Schmitt, F.: The augmented multiresolution reeb graph approach for content-based retrieval of 3d shapes (code on webpage). Int’l J. Shape Model. 11(1), 91–120 (2005)
Tung, T., Schmitt, F., Matsuyama, T.: Topology matching for 3d video compression. In: Proceedings of IEEE Conf computer vision pattern recognition (2007)
Vlasic, D., Baran, I., Matusik, W., Popovic, J.: Articulated mesh animation from multi-view silhouettes. ACM Trans Gr. 27(3), 97:1–97:3 (2008)
Acknowledgments
This work was supported in part by the JST-CREST project “Creation of Human-Harmonized Information Technology for Convivial Society”. The authors thank Dr. Lyndon Hill for his preliminary work on this project.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Tung, T., Matsuyama, T. Invariant shape descriptor for 3D video encoding. Vis Comput 31, 311–324 (2015). https://doi.org/10.1007/s00371-014-0925-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00371-014-0925-6