Abstract
Spatial support is an effective method of improving object recognition that is widely used in the field of computer vision. Compared to various other spatial support methods, such as sliding windows, spatial support based on image segmentation is a classic technique with high-quality segmentation that can positively contribute to image recognition. However, over-segmentation and under-segmentation often occur in the segmentation process. It is difficult for object classifiers to recognize objects in low-quality, segmented images, thus low-quality segmentation will reduce recognition accuracy. In order to resolve this drawback, watershed segmentation, multi-scale decompositions based on WLS (Weighted-Least Squares) filters and multi-layer smoothing, have been used to process the images. This methodology was utilized to maintain the sharp regions’ boundaries, while smoothing regions which could contribute to accurately segmenting the images. After obtaining high-quality segmentation, the images could then be utilized in image recognition. The superiority of this paper’s methodology compared to that of previous methods has been demonstrated herein. Experiments using a large database have demonstrated that this methodology is capable of improving image recognition through optimizing segmentation results.
Similar content being viewed by others
References
Arbelaez P et al (2011) “Contour detection and hierarchical image segmentation.” Pattern Anal Mach Intell IEEE Trans 33.5: 898–916
Belongie S et al (1998) “Color-and texture-based image segmentation using EM and its application to content-based image retrieval.” Computer Vision, 1998. Sixth International Conference on. IEEE
Beucher S, Lantuejoul C (1979) “Use of watersheds in contour detection”
Black MJ et al (1998) Robust anisotropic diffusion. Image Processing IEEE Trans 7(3):421–432
Burt PJ, Adelson EH (1983) The Laplacian pyramid as a compact image code. Commun IEEE Trans 31(4):532–540
Chung FRK (1997) “Spectral Graph Theory American Mathematical Society.” Providence, RI
Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. Pattern Anal Mach Intell IEEE Trans 24(5):603–619
Cour T, Florence B, Shi J (2005) “Spectral segmentation with multiscale graph decomposition.” Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. Vol. 2. IEEE
Cui C et al (2012) “Semantically coherent image annotation with a learning-based keyword propagation strategy.” Proceedings of the 21st ACM international conference on Information and knowledge management. ACM
Dalal N, Triggs B (2005) “Histograms of oriented gradients for human detection.” Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. Vol. 1. IEEE
Everingham M, van Gool L, Williams C, Winn J, Zisserman A (2008) “PASCAL 2008 Results,” http://www.pascal-network.org/challenges/VOC/voc2008/workshop/index.html
Farbman Z, et al (2008) “Edge-preserving decompositions for multi-scale tone and detail manipulation.” ACM Transactions on Graphics (TOG). Vol. 27. No. 3. ACM
Felzenszwalb PF, Huttenlocher DP (2004) Efficient graph-based image segmentation. Int J Comput Vis 59(2):167–181
Frucci M (2006) Oversegmentation reduction by flooding regions and digging watershed lines. Int J Pattern Recognit Artif Intell 20(01):15–38
Frucci M, Perner P, Sanniti di Baja G (2008) “Case-based reasoning for image segmentation by watershed transformation.” Case-Based Reasoning on Images and Signals. Springer Berlin Heidelberg, 319–353
Fu Y, Hospedales T, Xiang T, Gong S (2013) “Learning multi-modal latent attributes.” Proc IEEE Conf Comput Vis Pattern Recognit
Ge F, Song W, Liu T (2006) “Image-segmentation evaluation from the perspective of salient object extraction.” Computer Vision and Pattern Recognition, 2006 I.E. Computer Society Conference on. Vol. 1. IEEE
Hoiem D, Efros A A, Hebert M (2005) “Geometric context from a single image.” Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on. Vol. 1. IEEE
Kumar N et al (2009) “Attribute and simile classifiers for face verification.” Computer Vision, 2009 I.E. 12th International Conference on. IEEE
Kunze K et al (2013) “The Wordometer--Estimating the Number of Words Read Using Document Image Retrieval and Mobile Eye Tracking.” Document Analysis and Recognition (ICDAR), 2013 12th International Conference on. IEEE
Lagendijk RL, Biemond J, Boekee DE (1988) Regularized iterative image restoration with ringing reduction. Acoust Speech Signal Processing IEEE Trans 36(12):1874–1888
Levner I, Zhang H (2007) Classification-driven watershed segmentation. Image Processing IEEE Trans 16(5):1437–1445
Lischinski D, et al (2006) “Interactive local adjustment of tonal values.” ACM Transactions on Graphics (TOG). Vol. 25. No. 3. ACM
Lobo JM, Jiménez‐Valverde A, Real R (2008) AUC: a misleading measure of the performance of predictive distribution models. Glob Ecol Biogeogr 17(2):145–151
Maji S, Berg A C (2009) “Max-margin additive classifiers for detection.” Computer Vision, 2009 I.E. 12th International Conference on. IEEE
Malik J et al (2001) Contour and texture analysis for image segmentation. Int J Comput Vis 43(1):7–27
Malisiewicz T, Efros AA (2007) “Improving spatial support for objects via multiple segmentations”
Martin D, et al (2001) “A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics.” Computer Vision, 2001. ICCV 2001. Proceedings. Eighth IEEE International Conference on. Vol. 2. IEEE
Maulik U (2009) Medical image segmentation using genetic algorithms. Inform Technol Biomed IEEE Trans 13(2):166–173
Mumford D, Shah J (1989) Optimal approximations by piecewise smooth functions and associated variational problems. Commun Pure Appl Math 42(5):577–685
Osher S, Sethian JA (1988) Fronts propagating with curvature-dependent speed: algorithms based on Hamilton-Jacobi formulations. J Comput Phys 79(1):12–49
Palatucci M et al (2009) “Zero-shot Learning with Semantic Output Codes.” NIPS. 3(3)
Perona P, Malik J (1990) Scale-space and edge detection using anisotropic diffusion. Patt Anal Mach Intell IEEE Trans 12(7):629–639
Pock T et al (2009) “An algorithm for minimizing the Mumford-Shah functional.” Computer Vision, 2009 I.E. 12th International Conference on. IEEE
Ren Z, Shakhnarovich G (2013) “Image segmentation by cascaded region agglomeration.” Computer Vision and Pattern Recognition (CVPR), 2013 I.E. Conference on. IEEE
Roerdink JBTM, Meijster A (2000) The watershed transform: definitions, algorithms and parallelization strategies. Fundamenta Informaticae 41(1):187–228
Russell BC et al (2006) “Using multiple segmentations to discover objects and their extent in image collections.” Computer Vision and Pattern Recognition, 2006 I.E. Computer Society Conference on. Vol. 2. IEEE
Russell BC et al (2008) LabelMe: a database and web-based tool for image annotation. Int J Comput Vis 77(1–3):157–173
Schneiderman H, Kanade T (2000) “A statistical method for 3D object detection applied to faces and cars.” Computer Vision and Pattern Recognition, 2000. Proceedings. IEEE Conference on. Vol. 1. IEEE
Sethian JA (1999) Level Set Methods and Fast Marching Methods. Cambridge University Press
Shi J, Malik J (2000) Normalized cuts and image segmentation. Pattern Anal Mach Intell IEEE Trans 22(8):888–905
Soille P, Vogt P (2009) Morphological segmentation of binary patterns. Pattern Recogn Lett 30(4):456–459
Tolliver D A, Miller G L (2006) “Graph partitioning by spectral rounding: Applications in image segmentation and clustering.” Computer Vision and Pattern Recognition, 2006 I.E. Computer Society Conference on. Vol. 1. IEEE
Tomasi C, Manduchi R (1998) “Bilateral filtering for gray and color images.” Computer Vision, 1998. Sixth International Conference on. IEEE
Unnikrishnan R, Caroline P, Hebert M (2005) “A measure for objective evaluation of image segmentation algorithms.” Computer Vision and Pattern Recognition-Workshops, 2005. CVPR Workshops. IEEE Computer Society Conference on. IEEE
Vedaldi A, Zisserman A (2011) “Image Classification Practical”, http://www.robots.ox.ac.uk/~vgg/share/practical-image-classification.htm
Vincent L, Soille P (1991) Watersheds in digital spaces: an efficient algorithm based on immersion simulations. IEEE Trans Pattern Anal Mach Intell 13(6):583–598
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Wang G, Ye Z, Fei-Fei L (2006) “Using dependent regions for object categorization in a generative framework.” Computer Vision and Pattern Recognition, 2006 I.E. Computer Society Conference on. Vol. 2. IEEE
Wu H et al (2014) “Optimized recognition with few instances based on semantic distance.” Vis Comput: 1–9
Yang AY et al (2008) Unsupervised segmentation of natural images via lossy data compression. Comput Vis Image Underst 110(2):212–225
Zha Z-J et al (2008) “Joint multi-label multi-instance learning for image classification.” Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE
Acknowledgments
This work is supported by the NSFC 61273274, 61370127, 973 Program 2011CB302203, National Key Technology R&D Program of China 2012BAH01F03, NSFB4123104, FRFCU 2014JBZ004, Z131110001913143 and Tsinghua-Tencent Joint Lab for IIT.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Wu, H., Miao, Z., Wang, Y. et al. Recognition improvement through optimized spatial support methodology. Multimed Tools Appl 75, 5603–5618 (2016). https://doi.org/10.1007/s11042-015-2527-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-015-2527-3