Abstract
This chapter provides an analysis of traditional and newly introduced methods for the segmentation of point clouds and for the three-dimensional detection and pose estimation of rigid, articulated, and flexible objects in the scene.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Akima, H., 1970. A new method of interpolation and smooth curve fitting based on local procedures. J. Assoc. Comput. Mach. 17(4), pp. 589–602.
Ando, S., Kusachi, Y., Suzuki, A., Arakawa, K., 2005. Appearance based pose estimation of 3D object using support vector regression. Proc. IEEE Int. Conf. on Image Processing, I, pp. 341–344.
Bachler, G., Berger, M., Röhrer, R., Scherer, S., Pinz, A., 1999. A vision driven automatic assembly unit. Proc. International Conference on Computer Analysis of Images and Patterns, Ljubljana, Slovenia, pp. 375–382.
Barrois, B., 2010. Analyse der Position, Orientierung und Bewegung von rigiden und artikulierten Objekten aus Stereobildsequenzen. Doctoral Dissertation, Technical Faculty, Bielefeld University, Germany.
Barrois, B., Wöhler, C., 2007. 3D pose estimation based on multiple monocular cues. ISPRS Workshop Towards Benchmarking Automated Calibration, Orientation and Surface Reconstruction from Images (BenCOS), held in conjunction with CVPR 2007, Minneapolis, USA.
Barrois, B., Wöhler, C., 2008. Spatio-temporal 3D pose estimation of objects in stereo images. In: Gasteratos, A., Vincze, M., Tsotsos, J. (eds.), Proc. Int. Conf. on Computer Vision Systems, Santorini, Greece. Lecture Notes in Computer Science 5008, pp. 507–516, Springer, Berlin.
Barrow, H., 1977. Parametric correspondence and chamfer matching: two new techniques for image matching. Proc. Int. Joint Conf. on Artificial Intelligence, pp. 659–663.
Besl, P. J., McKay, N. D., 1992. A method for registration of 3-D shapes. IEEE Trans. Pattern Anal. Mach. Intell. 14(2), pp. 239–256.
Blake, A., Isard, M., 1998. Active Contours. Springer, London.
Bock, H. H., 1974. Automatische Klassifikation. Vandenhoeck & Ruprecht, Göttingen.
Bouguet, J.-Y., 2007. Camera Calibration Toolbox for Matlab. http://www.vision.caltech.edu/bouguetj/calib_doc/ (accessed September 04, 2007).
Bradski, G. R., 1998. Real time face and object tracking as a component of a perceptual user interface. Proc. Workshop on Appl. of Computer Vision, pp. 214–219.
Brox, T., Rosenhahn, B., Cremers, D., 2008. Contours, optic flow, and prior knowledge: cues for capturing 3D human motion from videos. In: Rosenhahn, B., Klette, R., Metaxas, D. (eds.), Human Motion: Understanding, Modelling, Capture and Animation. Springer, Dordrecht.
Cañero, C., Radeva, P., Toledo, R., Villanueva, J. J., Mauri, J., 2000. 3D curve reconstruction by biplane snakes. Proc. Int. Conf. on Pattern Recognition, Barcelona, Spain, vol. 4, pp. 4563–4567.
Canny, J., 1986. A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 8, pp. 679–698.
Caselles, V., Kimmel, R., Sapiro, G., 1995. Geodesic active contours. Proc. Int. Conf. on Computer Vision, Boston, USA, pp. 694–699.
Caselles, V., Kimmel, R., Sapiro, G., 1997. Geodesic active contours. Int. J. Comput. Vis. 22, pp. 61–79.
Chang, J. Y., Raskar, R., Agrawal, A., 2009. 3D pose estimation and segmentation using specular cues. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1706–1713.
Cheng, Y., 1995. Mean shift, mode seeking, and clustering. IEEE Trans. Pattern Anal. Mach. Intell. 17, pp. 790–799.
Cohen, L. D., 1991. On active contour models and balloons. CVGIP, Image Underst. 53(2), pp. 211–218.
Cohen, L. D., Cohen, I., 1993. Finite element methods for active contour models and balloons for 2d and 3d images. IEEE Trans. Pattern Anal. Mach. Intell. 15(11), pp. 1131–1147.
Collet, A., Martinez, M., Srinivasa, S., 2011. The MOPED framework: object recognition and pose estimation for manipulation. Int. J. Robot. Res. 30(10), pp. 1284–1306.
Comaniciu, D., Meer, P., 2002. Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), pp. 603–619.
Comaniciu, D., Ramesh, V., Meer, P., 2000. Real-time tracking of non-rigid objects using mean shift. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 2, pp. 142–149.
d’Angelo, P., Wöhler, C., Krüger, L., 2004. Model based multi-view active contours for quality inspection. Proc. Int. Conf. on Computer Vision and Graphics, Warszaw, Poland.
Demant, C., 1999. Industrial Image Processing. Springer, Berlin.
Deutscher, J., Blake, A., Reid, I., 2001. Articulated body motion capture by annealed particle filtering. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Kauai, Hawaii, USA, vol. 2, pp. 2126–2133.
Duric, Z., Li, F., Sun, Y., Wechsler, H., 2002. Using normal flow for detection and tracking of limbs in color images. Proc. Int. Conf. on Pattern Recognition, Quebec City, Canada, vol. 4, pp. 268–271.
Ellenrieder, M. M., 2004. Shape reconstruction of flexible objects from monocular images for industrial applications. Proc. SPIE Electronic Imaging, Computational Imaging II, San Jose, USA, SPIE 5299, pp. 295–303.
Ellenrieder, M. M., 2005. Optimal Viewpoint Selection for Industrial Machine Vision and Inspection of Flexible Objects. Doctoral Dissertation, Technical Faculty, Bielefeld University, Germany. VDI Fortschritt-Berichte, Reihe 10, no. 763, VDI-Verlag, Düsseldorf.
Erol, A., Bebis, G., Nicolescu, M., Boyle, R. D., Twombly, X., 2007. Vision-based hand pose estimation: a review. Comput. Vis. Image Underst. 108(1–2), pp. 52–73.
Fermüller, C., Aloimonos, Y., 1994. On the geometry of visual correspondence. Int. J. Comput. Vis. 21(3), pp. 223–247.
Fowlkes, C., Belongie, S., Chung, F., Malik, J., 2004. Spectral grouping using the Nyström method. IEEE Trans. Pattern Anal. Mach. Intell. 26(2), pp. 214–225.
Franke, U., Joos, A., 2000. Real-time stereo vision for urban traffic scene understanding. Proc. IEEE Conf. on Intelligent Vehicles, Detroit, pp. 273–278.
Franke, U., Rabe, C., Badino, H., Gehrig, S. K., 2005. 6D-vision: fusion of stereo and motion for robust environment perception. In: Kropatsch, W., Sablatnig, R., Hanbury, A. (eds.). Pattern Recognition, Proc. 27th DAGM Symposium, Vienna, Austria. Lecture Notes in Computer Science 3663, pp. 216–223, Springer, Berlin.
Fua, P., Brechbühler, C., 1996. Imposing hard constraints on soft snakes. Proc. Europ. Conf. on Computer Vision, Cambridge, UK, vol. 2, pp. 495–506.
Fua, P., Leclerc, Y. G., 1990. Model driven edge detection. Mach. Vis. Appl. 3(1), pp. 45–56.
Fukunaga, K., Hostetler, L. D., 1975. The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Trans. Inf. Theory 21, pp. 32–40.
Gall, J., Rosenhahn, B., Brox, T., Seidel, H.-P., 2009. Optimization and filtering for human motion capture—a multi-layer framework. Int. J. Comput. Vis. 87(1–2), pp. 75–92.
Gambini, J., Mejail, M., Jacobo-Berlles, J., Delrieux, C., 2004. SAR image segmentation through B-spline deformable contours and fractal dimension. Proc. 20th ISPRS Congress, XXXV(3), pp. 1159–1163.
Gavrila, D. M., Davis, L. S., 1996. 3D model-based tracking of humans in action: a multi-view approach. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 73–80.
Gavrila, D. M., Philomin, V., 1999. Real-time object detection for “smart” vehicles. Proc. Int. Conf. on Computer Vision, Kerkyra, Greece, pp. 87–93.
Gonçalves, N., Araújo, H., 2002. Estimation of 3D motion from stereo images—differential and discrete formulations. Proc. Int. Conf. on Pattern Recognition, Quebec City, Canada, vol. 1, pp. 335–338.
Grebner, K., 1994. Wissensbasierte Entwicklungsumgebung für Bildanalysesysteme aus dem industriellen Bereich. Doctoral Dissertation, Technical Faculty, Bielefeld University, Germany.
Grest, D., Koch, R., 2008. Motion capture for interaction environments. In: Rosenhahn, B., Klette, R., Metaxas, D. (eds.), Human Motion: Understanding, Modelling, Capture and Animation, Springer, Dordrecht.
Hahn, M., 2011. Raum-zeitliche Objekt- und Aktionserkennung: Ein statistischer Ansatz für reale Umgebungen. Doctoral Dissertation, Technical Faculty, Bielefeld University, Germany.
Hahn, M., Barrois, B., Krüger, L., Wöhler, C., Sagerer, G., Kummert, F., 2010a. 3D pose estimation and motion analysis of the articulated human hand-forearm limb in an industrial production environment. 3D Research 03, 03.
Hahn, M., Krüger, L., Wöhler, C., Groß, H.-M., 2007. Tracking of human body parts using the multiocular contracting curve density algorithm. Proc. Int. Conf. on 3-D Digital Imaging and Modeling, Montréal, Canada.
Hahn, M., Krüger, L., Wöhler, C., 2008b. Spatio-temporal 3D pose estimation of human body parts using the shape flow algorithm. Proc. Int. Conf. on Pattern Recognition, Tampa, USA.
Hahn, M., Quronfuleh, F., Wöhler, C., Kummert, F., 2010b. 3D mean-shift tracking and recognition of working actions. In: Salah, A. A., Gevers, T., Sebe, N., Vinciarelli, A. (eds.), Proc. Int. Workshop on Human Behaviour Understanding, held in conjunction with ICPR 2010, Istanbul, Turkey. Lecture Notes on Computer Science 6219, pp. 101–112, Springer, Berlin.
Hanek, R., 2001. The contracting curve density algorithm and its application to model-based image segmentation. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 797–804.
Hanek, R., 2004. Fitting Parametric Curve Models to Images Using Local Self-adapting Separation Criteria. Doctoral Dissertation, Technical University of Munich.
Hanek, R., Beetz, M., 2004. Fitting Parametric Curve Models to Images Using Local Self-adapting Separation Criteria. Int. J. Comput. Vis. 59, pp. 233–258.
Haralick, R., Joo, H., Lee, C., Zhuang, X., Vaidya, V., Kim, M., 1989. Pose estimation from corresponding point data. IEEE Trans. Syst. Man Cybern. 19(6), pp. 1426–1446.
Hinz, M., Toennies, K. D., Grohmann, M., Pohle, R., 2001. Active double-contour for segmentation of vessels in digital subtraction angiography. Proc. SPIE, 4322, pp. 1554–1562.
Hofmann, M., Gavrila, D. M., 2009. Multi-view 3D human pose estimation combining single-frame recovery, temporal integration and model adaptation. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 2214–2221.
Horn, B. K. P., 1986. Robot Vision. MIT Press, Cambridge.
Isard, M., Blake, A., 1998. CONDENSATION—conditional density propagation for visual tracking. Int. J. Comput. Vis. 29, pp. 5–28.
Jiang, X., Bunke, H., 1997. Dreidimensionales Computersehen. Springer, Berlin.
Kass, M., Witkin, A., Terzopoulos, D., 1988. Snakes: active contour models. Int. J. Comput. Vis. 1(4), pp. 321–331.
Knoop, S., Vacek, S., Dillmann, R., 2005. Modeling joint constraints for an articulated 3D human body model with artificial correspondences. ICP. Proc. Int. Conf. on Humanoid Robots, Tsukuba, Japan.
Kölzow, T., Ellenrieder, M., 2003. A general approach for multi-feature, multi-sensor classification and localization of 3D objects in 2D image sequences. Proc. SPIE-IS&T Electronic Imaging, SPIE 5014, pp. 99–110.
Krüger, L., 2007. Model Based Object Classification and Localisation in Multiocular Images. Doctoral Dissertation, Technical Faculty, Bielefeld University, Germany.
Krüger, L., Wöhler, C., Würz-Wessel, A., Stein, F., 2004. In-factory calibration of multiocular camera systems. Proc. SPIE Photonics Europe (Optical Metrology in Production Engineering), Strasbourg, pp. 126–137.
Lagger, P., Salzmann M., Lepetit, V., Fua, P., 2008. 3D pose refinement from reflections. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Anchorage, USA.
Lamdan, Y., Wolfson, H., 1988. Geometric hashing: a general and efficient model-based recognition scheme. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 238–249.
Lange, C., Hermann, T., Ritter, H., 2004. Holistic body tracking for gestural interfaces. Gesture-Based Communication in Human-Computer Interaction, selected and revised papers of the 5th Int. Gesture Workshop, Genova, Italy. Lecture Notes in Computer Science 2915, pp. 132–139, Springer, Berlin.
Lowe, D. G., 1987. Three-dimensional object recognition from single two-dimensional images. Artif. Intell. 31(3), pp. 355–395.
Lowe, D. G., 1991. Fitting parameterized three-dimensional models to images. IEEE Trans. Pattern Anal. Mach. Intell. 13(5), pp. 441–450.
MacQueen, J. B., 1967. Some methods for classification and analysis of multivariate observations. Proc. 5th Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297, University of California Press, Berkeley.
Marsland, S., 2009. Machine Learning: An Algorithmic Perspective. Chapman & Hall/CRC Machine Learning & Pattern Recognition Series.
Moeslund, T. B., Hilton, A., Krüger, V., 2006. A survey of advances in vision-based human motion capture and analysis. Comput. Vis. Image Underst. 104(2), pp. 90–126.
Mongkolnam, P., Dechsakulthorn, T., Nukoolkit, C., 2006. Image shape representation using curve fitting. Proc. WSEAS Int. Conf. on Signal, Speech and Image Processing, Lisbon, Portugal.
Mündermann, L., Corazza, S., Andriacchi, T. P., 2008. Markerless motion capture for biomechanical applications. In: Rosenhahn, B., Klette, R., Metaxas, D. (eds.), Human Motion: Understanding, Modelling, Capture and Animation, Springer, Dordrecht.
Nayar, S. K., Bolle, R. M., 1996. Reflectance based object recognition. Int. J. Comput. Vis. 17(3), pp. 219–240.
Nelder, J. A., Mead, R., 1965. A simplex method for function minimization. Comput. J. 7, pp. 308–313.
Neuenschwander, W., Fua, P., Iverson, L., Szekely, G., Kubler, O., 1997. Ziplock snakes. Int. J. Comput. Vis. 25(3), pp. 191–201.
Niemann, H., Hornegger, J., 2001. A novel probabilistic model for object recognition and pose estimation. Int. J. Pattern Recognit. Artif. Intell. 15(2), pp. 241–253.
Nomura, Y., Zhang, D., Sakaida, Y., Fujii, S., 1996. 3-d object pose estimation based on iterative image matching: shading and edge data fusion. Proc. IEEE Int. Conf. on Computer Vision and Pattern Recognition, pp. 866–871.
Panin, G., Ladikos, A., Knoll, A., 2006. An efficient and robust real-time contour tracking system. Proc. IEEE Int. Conf. on Computer Vision Systems, New York, USA, pp. 44–51.
Phong, T. Q., Horaud, R., Yassine, A., Tao, P. D., 1996. Object pose from 2-D to 3-D point and line correspondences. Int. J. Comput. Vis. 15(3), pp. 225–243.
Plänkers, R., Fua, P., 2003. Articulated soft objects for multiview shape and motion capture. IEEE Trans. Pattern Anal. Mach. Intell. 25(9), pp. 1182–1187.
Pons-Moll, G., Baak, A., Gall, J., Leal-Taixé, L., Müller, M., Seidel, H.-P., Rosenhahn, B., 2011. Outdoor human motion capture using inverse kinematics and von Mises-Fisher sampling. Proc. IEEE Int. Conf. on Computer Vision, Barcelona, Spain.
Press, W. H., Teukolsky, S. A., Vetterling, W. T., Flannery, B. P., 2007. Numerical Recipes. The Art of Scientific Computing. 3rd Edition, Cambridge University Press, Cambridge.
Rey, W. J. J., 1983. Introduction to Robust and Quasi-robust Statistical Methods. Springer, Berlin.
Rosenhahn, B., Perwass, C., Sommer, G., 2003. Pose estimation of free-form surface models. In: Michaelis, B., Krell, G. (eds.), Pattern Recognition, Proc. 25th DAGM Symposium, Magdeburg, Germany. Lecture Notes in Computer Science 2781, pp. 574–581, Springer, Berlin.
Rosenhahn, B., Kersting, U., Smith, A., Gurney, J., Brox, T., Klette, R., 2005. A system for marker-less human motion estimation. In: Kropatsch, W., Sablatnig, R., Hanbury, A. (eds.), Pattern Recognition, Proc. 27th DAGM Symposium, Vienna, Austria. Lecture Notes in Computer Science 3663, pp. 230–237, Springer, Berlin.
Rosenhahn, B., Brox, T., Cremers, D., Seidel, H.-P., 2006. A comparison of shape matching methods for contour based pose estimation. Combinatorial Image Analysis, Lecture Notes in Computer Science 4040, pp. 263–276, Springer, Berlin.
Rosenhahn, B., Kersting, U. G., Powell, K., Brox, T., Seidel, H.-P., 2008a. Tracking Clothed People. In: Rosenhahn, B., Klette, R., Metaxas, D. (eds.), Human Motion: Understanding, Modelling, Capture and Animation, Springer, Dordrecht.
Rottensteiner, F., Trinder, J., Clode, S., Kubik, K., 2005. Automated delineation of roof planes in LIDAR data. Int. Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, XXXVI-3/W19, pp. 221–226.
Rottensteiner, F., 2006. Consistent estimation of building parameters considering geometric regularities by soft constraints. Int. Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences XXXVI-3, pp. 13–18.
Sagerer, G., 1985. Darstellung und Nutzung von Expertenwissen für ein Bildanalysesystem. Springer, Berlin.
Salzmann, M., Urtasun, R., 2010. Combining discriminative and generative methods for 3D deformable surface and articulated pose reconstruction. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, San Francisco, USA, pp. 647–654.
Schmidt, J., Fritsch, J., Kwolek, B., 2006. Kernel particle filter for real-time 3d body tracking in monocular color images. Proc. IEEE Int. Conf. on Automatic Face and Gesture Recognition, pp. 567–572.
Schmidt, J., Wöhler, C., Krüger, L., Gövert, T., Hermes, C., 2007. 3D scene segmentation and object tracking in multiocular image sequences. Proc. Int. Conf. on Computer Vision Systems, Bielefeld, Germany.
Schunck, B. G., 1989. Image flow segmentation and estimation by constraint line clustering. IEEE Trans. Pattern Anal. Mach. Intell. 11(10), pp. 1010–1027.
Sethian, J., 1999. Level Set Methods and Fast Marching Methods: Evolving Interfaces in Computational Geometry, Fluid Mechanics, Computer Vision, and Materials Science. Cambridge University Press, Cambridge.
Shi, J., Malik, J., 1997. Normalized cuts and image segmentation. IEEE Conf. on Computer Vision and Pattern Recognition, San Juan, Puerto Rico, pp. 731–737.
Shi, J., Malik, J., 1998. Motion segmentation and tracking using normalized cuts. Proc. Int. Conf. on Computer Vision, Bombay, India, pp. 1154–1160.
Shi, J., Malik, J., 2000. Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 22(8), pp. 888–905.
Sminchisescu, C., 2008. 3D human motion analysis in monocular video: techniques and challenges. In: Rosenhahn, B., Klette, R., Metaxas, D. (eds.), Human Motion: Understanding, Modelling, Capture and Animation, Springer, Dordrecht.
Stenger, B., Mendonça, P. R. S., Cipolla, R., 2001. Model-based hand tracking using an unscented kalman filter. Proc. British Machine Vision Conference, Manchester, UK, vol. I, pp. 63–72.
Stößel, D., 2007. Automated Visual Inspection of Assemblies from Monocular Images. Doctoral Dissertation, Technical Faculty, Bielefeld University, Germany.
von Bank, C., Gavrila, D. M., Wöhler, C., 2003. A visual quality inspection system based on a hierarchical 3D pose estimation algorithm. In: Michaelis, B., Krell, G. (eds.), Pattern Recognition, Proc. 25th DAGM Symposium, Magdeburg, Germany. Lecture Notes in Computer Science 2781, pp. 179–186, Springer, Berlin.
Wedel, A., Rabe, C., Vaudrey, T., Brox, T., Franke, U., Cremers, D., 2008a. Efficient dense scene flow from sparse or dense stereo data. Proc. Europ. Conf. on Computer Vision, pp. 739–751.
Wedel, A., Vaudrey, T., Meissner, A., Rabe, C., Brox, T., Franke, U., Cremers, D., 2008b. An evaluation approach for scene flow with decoupled motion and position. In: Revised Papers Int. Dagstuhl Seminar on Statistical and Geometrical Approaches to Visual Motion Analysis, pp. 46–69, Springer, Berlin.
Williams, D. J., Shah, M., 1992. A fast algorithm for active contours and curvature estimation. CVGIP, Image Underst. 55, pp. 14–26.
Xu, C., Prince, J. L., 1998. Snakes, shapes, and gradient vector flow. IEEE Trans. Image Process. 7(3), pp. 359–369.
Ye, Q. Z., Ong, S. H., Han, X., 2001. A stereo vision system for the inspection of IC bonding wires. Int. J. Imaging Syst. Technol. 11(4), pp. 254–262.
Yoon, Y., DeSouza, G. N., Kak, A. C., 2003. Real-time tracking and pose estimation for industrial objects using geometric features. Proc. Int. Conf. on Robotics and Automation, pp. 3473–3478.
Zhang, Z., 1992. Iterative Point Matching for Registration of Free-form Curves. Technical Report No. 1658, Institut National de Recherche en Informatique et en Automatique (INRIA), Sophia Antipolis, France.
Zhang, Z., 1999b. Iterative point matching for registration of free-form curves and surfaces. Int. J. Comput. Vis. 13(2), pp. 119–152.
Ziegler, J., Nickel, K., Stiefelhagen, R., 2006. Tracking of the articulated upper body on multi-view stereo image sequences. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 1, pp. 774–781.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag London
About this chapter
Cite this chapter
Wöhler, C. (2013). Three-Dimensional Pose Estimation and Segmentation Methods. In: 3D Computer Vision. X.media.publishing. Springer, London. https://doi.org/10.1007/978-1-4471-4150-1_2
Download citation
DOI: https://doi.org/10.1007/978-1-4471-4150-1_2
Publisher Name: Springer, London
Print ISBN: 978-1-4471-4149-5
Online ISBN: 978-1-4471-4150-1
eBook Packages: Computer ScienceComputer Science (R0)