Abstract
In this chapter, we present two methods for geometric reconstruction of players in standard sports broadcasts, designed specifically to let the broadcast director generate novel views from locations where there is no physical camera (novel-view synthesis). This capability significantly broadens the director's creative freedom and greatly enhances the viewing experience. First, we propose a data-driven method based on multiview body pose estimation. This method can operate in uncontrolled environments with loosely calibrated, low-resolution cameras and without restrictive assumptions on the family of possible poses or motions. Second, we propose a scalable top-down patch-based method that reconstructs the geometry of the players adaptively, based on the amount of detail available in the video streams. The two methods are complementary and together provide a more complete set of tools for novel-view synthesis for sports broadcasts.
Keywords
- Optical Flow
- Feature Match
- Calibration Error
- Temporal Coherence
- Visual Hull
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Acknowledgments
The image data is courtesy of Teleclub and LiberoVision.
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Popa, T., Germann, M., Ziegler, R., Keiser, R., Gross, M. (2014). Geometry Reconstruction of Players for Novel-View Synthesis of Sports Broadcasts. In: Moeslund, T., Thomas, G., Hilton, A. (eds) Computer Vision in Sports. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-319-09396-3_7
DOI: https://doi.org/10.1007/978-3-319-09396-3_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09395-6
Online ISBN: 978-3-319-09396-3
eBook Packages: Computer Science, Computer Science (R0)