Skip to main content
Log in

Object recognition in noisy RGB-D data using GNG

  • Theoretical Advances
  • Published:
Pattern Analysis and Applications Aims and scope Submit manuscript

Abstract

Object recognition in 3D scenes is a research field in which there is intense activity guided by the problems related to the use of 3D point clouds. Some of these problems are influenced by the presence of noise in the cloud that reduces the effectiveness of a recognition process. This work proposes a method for dealing with the noise present in point clouds by applying the growing neural gas (GNG) network filtering algorithm. This method is able to represent the input data with the desired number of neurons while preserving the topology of the input space. The GNG obtained results which were compared with a Voxel grid filter to determine the efficacy of our approach. Moreover, since a stage of the recognition process includes the detection of keypoints in a cloud, we evaluated different keypoint detectors to determine which one produces the best results in the selected pipeline. Experiments show how the GNG method yields better recognition results than other filtering algorithms when noise is present.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. http://www.vision.deis.unibo.it/research/80-shot.

References

  1. Aldoma A, Tombari F, Di Stefano L, Vincze M (2012) A global hypotheses verification method for 3d object recognition. In: Fitzgibbon A, Lazebnik S, Perona P, Sato Y, Schmid C (eds) Computer Vision?, vol 7574., ECCV 2012, Lecture Notes in Computer ScienceSpringer, Berlin Heidelberg, pp 511–524

    Google Scholar 

  2. Asari M, Sheikh U, Supriyanto E (2014) 3d shape descriptor for object recognition based on kinect-like depth image. Image Vis Comput 32(4):260–269

    Article  Google Scholar 

  3. Besl P, McKay N (1992) A method for registration of 3-d shapes. IEEE Trans Pattern Anal Mach Intell 14(2):239–256

  4. Chen H, Bhanu B (2007) 3d free-form object recognition in range images using local surface patches. Pattern Recognit Lett 28(10):1252–1262

    Article  Google Scholar 

  5. Chen Y, Medioni G (1991) Object modeling by registration of multiple range images. In: Medioni G (ed) 1991 Proceedings., IEEE International Conference on Robotics and Automation, vol 3. pp 2724–2729

  6. Computer Vision LAB: SHOT: Unique signatures of histograms for local surface description—computer vision LAB. http://www.vision.deis.unibo.it/research/80-shot

  7. Fritzke B (1995) A growing neural gas network learns topologies. In: Advances in neural information processing systems, vol 7. MIT Press, pp 625–632

  8. Guo Y, Bennamoun M, Sohel F, Lu M, Wan J (2014) 3d object recognition in cluttered scenes with local surface features: A survey. IEEE Trans Pattern Anal Mach Intell, IEEE Transactions on 36(11):2270–2287

    Article  Google Scholar 

  9. Hinterstoisser S, Holzer S, Cagniart C, Ilic S, Konolige K, Navab N, Lepetit V (2011) Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes. In: Computer vision (ICCV), 2011 IEEE International Conference on, pp 858–865

  10. Johnson A, Hebert M (1998) Surface matching for object recognition in complex three-dimensional scenes. Image Vis Comput 16(9–10):635–651

    Article  Google Scholar 

  11. Johnson A, Hebert M (1999) Using spin images for efficient object recognition in cluttered 3d scenes. IEEE Trans Pattern Anal Mach Intell 21(5):433–449. doi:10.1109/34.765655

    Article  Google Scholar 

  12. Kohonen T (1995) Self-organising maps. Springer-Verlag

  13. Muja M FLANN—fast library for approximate nearest neighbors: FLANN—FLANN browse. http://www.cs.ubc.ca/research/flann/

  14. Martinetz T (1993) Competitive Hebbian learning rule forms perfectly topology preserving maps. In: Gielen S, Kappen B (eds) Proc. ICANN’93, Int. Conf. on Artificial Neural Networks. Springer, London, pp 427–434

  15. Martinetz T, Schulten K (1994) Topology representing networks. Neural Netw 7(3):507–522

    Article  Google Scholar 

  16. Muja M, Lowe DG (2014) Scalable nearest neighbor algorithms for high dimensional data. IEEE Trans Pattern Anal Mach Intell 36(11):2227–2240

    Article  Google Scholar 

  17. Pang G, Neumann U (2013) Training-based object recognition in cluttered 3d point clouds. In: 2013 International Conference on 3D Vision—3DV 2013. doi:10.1109/3DV.2013.20, pp 87–94

  18. PCL: Documentation—point cloud library (PCL). http://pointclouds.org/documentation/tutorials/normal_estimation.php

  19. Radu Bogdan Rusu: Point cloud library (PCL): pcl::UniformSampling \(<\) PointInT \(>\) class template reference. http://docs.pointclouds.org/1.7.0/classpcl_1_1_uniform_sampling.html#details

  20. Rusu R, Blodow N, Beetz M (2009) Fast point feature histograms (fpfh) for 3d registration. In: IEEE International Conference on Robotics and automation. ICRA ’09, pp 3212–3217

  21. Sipiran I, Bustos B (2011) Harris 3d: a robust extension of the Harris operator for interest point detection on 3d meshes. Vis Comput 27(11):963–976. doi:10.1007/s00371-011-0610-y

    Article  Google Scholar 

  22. Tombari F, Di Stefano L (2010) Object recognition in 3d scenes with occlusions and clutter by hough voting. In: 2010 Fourth Pacific-Rim Symposium on Image and Video Technology (PSIVT), pp 349–355

  23. Tombari F, Gori F, Di Stefano L (2011) Evaluation of stereo algorithms for 3d object recognition. In: 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp 990–997

  24. Tombari F, Salti S (2011) A combined texture-shape descriptor for enhanced 3d feature matching. In: 2011 18th IEEE International Conference on Image Processing (ICIP), pp 809 –812

  25. Tombari F, Salti S, Di Stefano L (2010) Unique signatures of histograms for local surface description. In: Proceedings of the 11th European conference on computer vision conference on Computer vision: Part III., ECCV’10. Springer-Verlag, Berlin, Heidelberg, pp 356–369

  26. Tombari F, Salti S, Di Stefano L (2013) Performance evaluation of 3d keypoint detectors. Int J Comput Vis 102(1–3):198–220. doi:10.1007/s11263-012-0545-4

    Article  Google Scholar 

  27. Viejo D, Garcia J, Cazorla M, Gil D, Johnsson M (2012) Using GNG to improve 3d feature extraction–application to 6DoF egomotion. Neural Netw 32:138–146

    Article  Google Scholar 

  28. Xu G, Mourrain B, Duvigneau R, Galligo A (2013) Analysis-suitable volume parameterization of multi-block computational domain in isogeometric applications. Comput Aid Des 45(2): 395–404 (Solid and Physical Modeling 2012)

  29. Zhong Y (2009) Intrinsic shape signatures: a shape descriptor for 3d object recognition. In: 2009 IEEE 12th International Conference on Computer Vision Workshops (ICCV Workshops), pp 689–696

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Miguel Cazorla.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rangel, J.C., Morell, V., Cazorla, M. et al. Object recognition in noisy RGB-D data using GNG. Pattern Anal Applic 20, 1061–1076 (2017). https://doi.org/10.1007/s10044-016-0546-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10044-016-0546-y

Keywords

Navigation