Skip to main content
Log in

Recognition improvement through optimized spatial support methodology

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Spatial support is an effective method of improving object recognition that is widely used in the field of computer vision. Compared to various other spatial support methods, such as sliding windows, spatial support based on image segmentation is a classic technique with high-quality segmentation that can positively contribute to image recognition. However, over-segmentation and under-segmentation often occur in the segmentation process. It is difficult for object classifiers to recognize objects in low-quality, segmented images, thus low-quality segmentation will reduce recognition accuracy. In order to resolve this drawback, watershed segmentation, multi-scale decompositions based on WLS (Weighted-Least Squares) filters and multi-layer smoothing, have been used to process the images. This methodology was utilized to maintain the sharp regions’ boundaries, while smoothing regions which could contribute to accurately segmenting the images. After obtaining high-quality segmentation, the images could then be utilized in image recognition. The superiority of this paper’s methodology compared to that of previous methods has been demonstrated herein. Experiments using a large database have demonstrated that this methodology is capable of improving image recognition through optimizing segmentation results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  1. Arbelaez P et al (2011) “Contour detection and hierarchical image segmentation.” Pattern Anal Mach Intell IEEE Trans 33.5: 898–916

  2. Belongie S et al (1998) “Color-and texture-based image segmentation using EM and its application to content-based image retrieval.” Computer Vision, 1998. Sixth International Conference on. IEEE

  3. Beucher S, Lantuejoul C (1979) “Use of watersheds in contour detection”

  4. Black MJ et al (1998) Robust anisotropic diffusion. Image Processing IEEE Trans 7(3):421–432

    Article  Google Scholar 

  5. Burt PJ, Adelson EH (1983) The Laplacian pyramid as a compact image code. Commun IEEE Trans 31(4):532–540

    Article  Google Scholar 

  6. Chung FRK (1997) “Spectral Graph Theory American Mathematical Society.” Providence, RI

  7. Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. Pattern Anal Mach Intell IEEE Trans 24(5):603–619

    Article  Google Scholar 

  8. Cour T, Florence B, Shi J (2005) “Spectral segmentation with multiscale graph decomposition.” Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. Vol. 2. IEEE

  9. Cui C et al (2012) “Semantically coherent image annotation with a learning-based keyword propagation strategy.” Proceedings of the 21st ACM international conference on Information and knowledge management. ACM

  10. Dalal N, Triggs B (2005) “Histograms of oriented gradients for human detection.” Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. Vol. 1. IEEE

  11. Everingham M, van Gool L, Williams C, Winn J, Zisserman A (2008) “PASCAL 2008 Results,” http://www.pascal-network.org/challenges/VOC/voc2008/workshop/index.html

  12. Farbman Z, et al (2008) “Edge-preserving decompositions for multi-scale tone and detail manipulation.” ACM Transactions on Graphics (TOG). Vol. 27. No. 3. ACM

  13. Felzenszwalb PF, Huttenlocher DP (2004) Efficient graph-based image segmentation. Int J Comput Vis 59(2):167–181

    Article  Google Scholar 

  14. Frucci M (2006) Oversegmentation reduction by flooding regions and digging watershed lines. Int J Pattern Recognit Artif Intell 20(01):15–38

    Article  Google Scholar 

  15. Frucci M, Perner P, Sanniti di Baja G (2008) “Case-based reasoning for image segmentation by watershed transformation.” Case-Based Reasoning on Images and Signals. Springer Berlin Heidelberg, 319–353

  16. Fu Y, Hospedales T, Xiang T, Gong S (2013) “Learning multi-modal latent attributes.” Proc IEEE Conf Comput Vis Pattern Recognit

  17. Ge F, Song W, Liu T (2006) “Image-segmentation evaluation from the perspective of salient object extraction.” Computer Vision and Pattern Recognition, 2006 I.E. Computer Society Conference on. Vol. 1. IEEE

  18. Hoiem D, Efros A A, Hebert M (2005) “Geometric context from a single image.” Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on. Vol. 1. IEEE

  19. Kumar N et al (2009) “Attribute and simile classifiers for face verification.” Computer Vision, 2009 I.E. 12th International Conference on. IEEE

  20. Kunze K et al (2013) “The Wordometer--Estimating the Number of Words Read Using Document Image Retrieval and Mobile Eye Tracking.” Document Analysis and Recognition (ICDAR), 2013 12th International Conference on. IEEE

  21. Lagendijk RL, Biemond J, Boekee DE (1988) Regularized iterative image restoration with ringing reduction. Acoust Speech Signal Processing IEEE Trans 36(12):1874–1888

    Article  MATH  Google Scholar 

  22. Levner I, Zhang H (2007) Classification-driven watershed segmentation. Image Processing IEEE Trans 16(5):1437–1445

    Article  MathSciNet  Google Scholar 

  23. Lischinski D, et al (2006) “Interactive local adjustment of tonal values.” ACM Transactions on Graphics (TOG). Vol. 25. No. 3. ACM

  24. Lobo JM, Jiménez‐Valverde A, Real R (2008) AUC: a misleading measure of the performance of predictive distribution models. Glob Ecol Biogeogr 17(2):145–151

    Article  Google Scholar 

  25. Maji S, Berg A C (2009) “Max-margin additive classifiers for detection.” Computer Vision, 2009 I.E. 12th International Conference on. IEEE

  26. Malik J et al (2001) Contour and texture analysis for image segmentation. Int J Comput Vis 43(1):7–27

    Article  MATH  Google Scholar 

  27. Malisiewicz T, Efros AA (2007) “Improving spatial support for objects via multiple segmentations”

  28. Martin D, et al (2001) “A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics.” Computer Vision, 2001. ICCV 2001. Proceedings. Eighth IEEE International Conference on. Vol. 2. IEEE

  29. Maulik U (2009) Medical image segmentation using genetic algorithms. Inform Technol Biomed IEEE Trans 13(2):166–173

    Article  Google Scholar 

  30. Mumford D, Shah J (1989) Optimal approximations by piecewise smooth functions and associated variational problems. Commun Pure Appl Math 42(5):577–685

    Article  MathSciNet  MATH  Google Scholar 

  31. Osher S, Sethian JA (1988) Fronts propagating with curvature-dependent speed: algorithms based on Hamilton-Jacobi formulations. J Comput Phys 79(1):12–49

    Article  MathSciNet  MATH  Google Scholar 

  32. Palatucci M et al (2009) “Zero-shot Learning with Semantic Output Codes.” NIPS. 3(3)

  33. Perona P, Malik J (1990) Scale-space and edge detection using anisotropic diffusion. Patt Anal Mach Intell IEEE Trans 12(7):629–639

    Article  Google Scholar 

  34. Pock T et al (2009) “An algorithm for minimizing the Mumford-Shah functional.” Computer Vision, 2009 I.E. 12th International Conference on. IEEE

  35. Ren Z, Shakhnarovich G (2013) “Image segmentation by cascaded region agglomeration.” Computer Vision and Pattern Recognition (CVPR), 2013 I.E. Conference on. IEEE

  36. Roerdink JBTM, Meijster A (2000) The watershed transform: definitions, algorithms and parallelization strategies. Fundamenta Informaticae 41(1):187–228

    MathSciNet  MATH  Google Scholar 

  37. Russell BC et al (2006) “Using multiple segmentations to discover objects and their extent in image collections.” Computer Vision and Pattern Recognition, 2006 I.E. Computer Society Conference on. Vol. 2. IEEE

  38. Russell BC et al (2008) LabelMe: a database and web-based tool for image annotation. Int J Comput Vis 77(1–3):157–173

    Article  Google Scholar 

  39. Schneiderman H, Kanade T (2000) “A statistical method for 3D object detection applied to faces and cars.” Computer Vision and Pattern Recognition, 2000. Proceedings. IEEE Conference on. Vol. 1. IEEE

  40. Sethian JA (1999) Level Set Methods and Fast Marching Methods. Cambridge University Press

  41. Shi J, Malik J (2000) Normalized cuts and image segmentation. Pattern Anal Mach Intell IEEE Trans 22(8):888–905

    Article  Google Scholar 

  42. Soille P, Vogt P (2009) Morphological segmentation of binary patterns. Pattern Recogn Lett 30(4):456–459

    Article  Google Scholar 

  43. Tolliver D A, Miller G L (2006) “Graph partitioning by spectral rounding: Applications in image segmentation and clustering.” Computer Vision and Pattern Recognition, 2006 I.E. Computer Society Conference on. Vol. 1. IEEE

  44. Tomasi C, Manduchi R (1998) “Bilateral filtering for gray and color images.” Computer Vision, 1998. Sixth International Conference on. IEEE

  45. Unnikrishnan R, Caroline P, Hebert M (2005) “A measure for objective evaluation of image segmentation algorithms.” Computer Vision and Pattern Recognition-Workshops, 2005. CVPR Workshops. IEEE Computer Society Conference on. IEEE

  46. Vedaldi A, Zisserman A (2011) “Image Classification Practical”, http://www.robots.ox.ac.uk/~vgg/share/practical-image-classification.htm

  47. Vincent L, Soille P (1991) Watersheds in digital spaces: an efficient algorithm based on immersion simulations. IEEE Trans Pattern Anal Mach Intell 13(6):583–598

    Article  Google Scholar 

  48. Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154

    Article  Google Scholar 

  49. Wang G, Ye Z, Fei-Fei L (2006) “Using dependent regions for object categorization in a generative framework.” Computer Vision and Pattern Recognition, 2006 I.E. Computer Society Conference on. Vol. 2. IEEE

  50. Wu H et al (2014) “Optimized recognition with few instances based on semantic distance.” Vis Comput: 1–9

  51. Yang AY et al (2008) Unsupervised segmentation of natural images via lossy data compression. Comput Vis Image Underst 110(2):212–225

    Article  Google Scholar 

  52. Zha Z-J et al (2008) “Joint multi-label multi-instance learning for image classification.” Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE

Download references

Acknowledgments

This work is supported by the NSFC 61273274, 61370127, 973 Program 2011CB302203, National Key Technology R&D Program of China 2012BAH01F03, NSFB4123104, FRFCU 2014JBZ004, Z131110001913143 and Tsinghua-Tencent Joint Lab for IIT.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hao Wu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wu, H., Miao, Z., Wang, Y. et al. Recognition improvement through optimized spatial support methodology. Multimed Tools Appl 75, 5603–5618 (2016). https://doi.org/10.1007/s11042-015-2527-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-015-2527-3

Keywords

Navigation