Abstract
This article presents a method that removes outliers, reduces noise and fills holes in a point cloud using a learned shape prior. The shape prior is learned from a set of training objects using the Gaussian process latent variable model. All training objects are represented by a signed distance function. To improve the time performance, the signed distance functions are compressed using the discrete cosine transform. As input data the method uses the estimated object pose from an object detector and a segmented point cloud. It is shown that the estimated shape prior is capable of modelling fine details to a certain degree. The limitations and difficulties of this method are also investigated. It is shown that by applying this method the accuracy and completeness of the reconstructed models can be significantly increased.
Zusammenfassung
Bereinigung von 3D-Punktwolken mit Hilfe von a-priori-Formen, geschätzt mit dem Gaussian Process Latent Variable Model. Dieser Artikel präsentiert ein Verfahren, mit dem Ausreißer und Rauschen aus einer Punktwolke entfernt und Löcher gefüllt werden können. Das Verfahren verwendet dabei eine angelernte a-priori-Form. Diese Form wird aus mehreren Trainingsformen mit dem Gaussian Process Latent Variable Model angelernt. Alle Trainingsobjekte werden durch eine vorzeichenbehaftete Abstandsfunktion repräsentiert. Zur Erhöhung der Performanz werden die Abstandsfunktionen mit der Diskreten Kosinustransformation komprimiert. Als Eingabedaten verwendet das Verfahren die vom Objekt-Detektor geschätzte Pose und eine segmentierte Punktwolke. Es wird gezeigt, dass die geschätzte a-priori-Form in der Lage ist, Details bis zu einem bestimmten Grad zu modellieren. Die Grenzen dieses Verfahrens werden ebenfalls untersucht. Es wird gezeigt, dass durch Anwendung des Verfahrens die Genauigkeit und Vollständigkeit der rekonstruierten Objekte signifikant verbessert werden kann.
Similar content being viewed by others
References
Ahmed N, Natarajan T, Rao KR (1974) Discrete cosine transform. IEEE Trans Comput 23(1):90–93
Allen B, Curless B, Popović Z (2003) The space of human body shapes: reconstruction and parameterization from range scans. ACM Trans Graph (SIGGRAPH Conf Proc) 22(3):587–594
Anguelov D, Srinivasan P, Koller D, Thrun S, Rodgers J, Davis J (2005) Scape: shape completion and animation of people. ACM Trans Graph (SIGGRAPH Conf Proc) 24(3):408–416
Bao SYZ, Chandraker M, Lin Y, Savarese S (2013) Dense object reconstruction with semantic priors. In: IEEE Conference on computer vision and pattern recognition, pp 1264–1271
Berger M, Tagliasacchi A, Seversky LM, Alliez P, Levine JA, Sharf A, Silva C (2014) State of the art in surface reconstruction from point clouds. In: Eurographics 2014—State of the Art Reports, pp 161–185
BlendSwap (2016) http://www.blendswap.com/. 17 Nov 2016
Broyden CG (1970) The convergence of a class of double-rank minimization algorithms 1. General considerations. J Inst Math Appl 6(1):76–90
Dame A, Prisacariu VA, Ren CY, Reid ID (2013) Dense reconstruction using 3d object shape priors. In: IEEE conference on computer vision and pattern recognition, pp 1288–1295
Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
Furukawa Y, Ponce J (2010) Accurate, dense, and robust multi-view stereopsis. IEEE Trans Pattern Anal Mach Intell 32(8):1362–1376
Furukawa Y, Curless B, Seitz SM, Szeliski R (2010) Towards internet-scale multi-view stereo. In: IEEE conference on computer vision and pattern recognition, pp 1434–1441
Gal R, Shamir A, Hassner T, Pauly M, Cohen-Or D (2007) Surface reconstruction using local shape priors. In: Fifth Eurographics symposium on geometry processing, pp 253–262
GrabCAD (2016) https://grabcad.com/. 17 Nov 2016
Heimann T, Meinzer HP (2009) Statistical shape models for 3d medical image segmentation: a review. Med Image Anal 13(4):543–563
Hotelling H (1933) Analysis of a complex of statistical variables into principal components. J Educ Psychol 24:417–441
Huang J, You S (2014) Segmentation and matching: towards a robust object detection system. In: IEEE winter conference on applications of computer vision, pp 325–332
Jensen R, Dahl A, Vogiatzis G, Tola E, Aanæs H (2014) Large scale multi-view stereopsis evaluation. In: IEEE conference on computer vision and pattern recognition, pp 406–413
Jones MW, Baerentzen JA, Sramek M (2006) 3d distance fields: a survey of techniques and applications. IEEE Trans Vis Comput Graph 12(4):581–599
Kazhdan M, Hoppe H (2013) Screened poisson surface reconstruction. ACM Trans Graph (SIGGRAPH Conf Proc) 32(3):29:1–29:13
Kraevoy V, Sheffer A (2005) Template-based mesh completion. In: Proceedings of the third Eurographics symposium on geometry processing, Eurographics Association, SGP ’05, pp 13–22
Krenzin J, Hellwich O (2016) Reduction of point cloud artifacts using shape priors estimated with the gaussian process latent variable model. In: Pattern recognition. GCPR. Lecture notes in computer science. Springer International Publishing, Cham, pp 273–284 (2016)
Lawrence ND (2004) Gaussian process latent variable models for visualisation of high dimensional data. Adv Neural Inf Process Syst 16(3):329–336
Lawrence ND (2005) Probabilistic non-linear principal component analysis with Gaussian process latent variable models. J Mach Learn Res 6(Nov):1783–1816
Lévy B (2003) Dual domain extrapolation. ACM Trans Graph (SIGGRAPH Conf Proc) 22(3):364–369
Lorensen WE, Cline HE (1987) Marching cubes: a high resolution 3d surface construction algorithm. ACM Trans Graph (SIGGRAPH Conf Proc) 21(4):163–169
Matlab (2015) MATLAB and optimization toolbox release R2015a. The MathWorks Inc., Natick. https://www.mathworks.com/
Maurer CR Jr, Qi R, Raghavan V (2003) A linear time algorithm for computing exact Euclidean distance transforms of binary images in arbitrary dimensions. IEEE Trans Pattern Anal Mach Intell 25(2):265–270
Moons T, Van Gool L, Vergauwen M (2009) 3d reconstruction from multiple images: part 1—principles. Found Trends Comput Graph Vis 4(4):287–404
Nan L, Xie K, Sharf A (2012) A search-classify approach for cluttered indoor scene understanding. ACM Trans Graph (SIGGRAPH Conf Proc) 31(6):137:1–137:10
Pauly M, Mitra NJ, Giesen J, Gross MH, Guibas LJ (2005) Example-based 3d scan completion. In: Symposium on geometry processing, Eurographics Association, ACM transactions on graphics (SIGGRAPH conference proceedings), vol 255, pp 23–32
Pearson K (1901) On lines and planes of closest fit to systems of points in space. Philos Mag 2(6):559–572
Prisacariu VA, Reid ID (2011a) Nonlinear shape manifolds as shape priors in level set segmentation and tracking. In: IEEE conference on computer vision and pattern recognition, pp 2185–2192
Prisacariu VA, Reid ID (2011b) Shared shape spaces. In: International conference on computer vision. IEEE Computer Society, pp 2587–2594
Prisacariu VA, Segal AV, Reid ID (2013) Simultaneous monocular 2d segmentation, 3d pose recovery and 3d reconstruction. In: Asian conference on computer vision, pp 593–606
Quirion S, Duchesne C, Laurendeau D, Marchand M (2008) Comparing GPLVM approaches for dimensionality reduction in character animation. J Int Conf Central Eur Comput Graph Vis Comput Vis 16(1–3):41–48
Ren CY, Prisacariu V, Reid I (2013) Regressing local to global shape properties for online segmentation and tracking. Int J Comput Vis 106(3):269–281
Rusu RB, Cousins S (2011) 3D is here: Point Cloud Library (PCL). In: IEEE international conference on robotics and automation, pp 1–4
Schölkopf B, Smola A, Müller KR (1998) Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput 10(5):1299–1319
Seitz S, Curless B, Diebel J, Scharstein D, Szeliski R (2006) A comparison and evaluation of multi-view stereo reconstruction algorithms. In: IEEE conference on computer vision and pattern recognition, vol 1, pp 519–526
Shao T, Xu W, Zhou K, Wang J, Li D, Guo B (2012) An interactive approach to semantic modeling of indoor scenes with an RGBD camera. ACM Trans Graph (SIGGRAPH Conf Proc) 31(6):136:1–136:11
Sheffer A, Praun E, Rose K (2006) Mesh parameterization methods and their applications. Found Trends Comput Graph Vis 2(2):105–171
Shen CH, Fu H, Chen K, Hu SM (2012) Structure recovery by part assembly. ACM Trans Graph (SIGGRAPH Conf Proc) 31(6):180:1–180:11
Teutsch C, Trostmann E, Berndt D (2011) A parallel point cloud clustering algorithm for subset segmentation and outlier detection. Proc SPIE 8085(808):509
Wang Q, Wang F, Li D, Wang X (2014) Clustering-based latent variable models for monocular non-rigid 3d shape recovery. In: 10th International conference on intelligent computation 2014, pp 162–172
Wu C (2016) VisualSFM: a visual structure from motion system. http://ccwu.me/vsfm/. 17 Nov 2016
Zhang Z (1994) Iterative point matching for registration of free-form curves and surfaces. Int J Comput Vis 13(2):119–152
Zheng S, Prisacariu VA, Averkiou M, Cheng MM, Mitra NJ, Shotton J, Torr PHS, Rother C (2015) Object proposals estimation in depth image using compact 3d shape manifolds. In: Pattern recognition. GCPR, Lecture notes in computer science. Springer International Publishing, Cham, pp 196–208
Author information
Authors and Affiliations
Corresponding author
Additional information
This paper is an extended version of the paper Reduction of Point Cloud Artifacts Using Shape Priors Estimated with the Gaussian Process Latent Variable Model published at GCPR 2016.
Rights and permissions
About this article
Cite this article
Krenzin, J., Hellwich, O. 3D Reconstruction Supported by Gaussian Process Latent Variable Model Shape Priors. PFG 85, 97–112 (2017). https://doi.org/10.1007/s41064-017-0009-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41064-017-0009-0