Abstract
This paper presents a novel distributed object segmentation framework that allows one to extract potentially large coherent objects from digital images. The proposed approach requires minimum user supervision and permits to segment the objects accurately. It works in three steps starting with the user input in form of few mouse clicks on the target object. First, based on user input, the statistical characteristics of the target distributed object are modeled with Gaussian mixture model. This model serves as the primary segmentation of the object. In the second step, the segmentation result is refined by performing connected component analysis to reduce false positives. In the final step the resulting segmentation map is dilated to select the neighboring pixels that are potentially incorrectly classified; this allows us to recast the segmentation as a graph partitioning problem that can be solved using the well-known graph cut technique. Extensive experiments have been carried out on heterogeneous images to test the accuracy of the proposed method for the segmentation of various types of distributed objects. Examples of application of proposed technique in remote sensing to segment roads and rivers from aerial images are also presented. The visual and objective evaluation and comparison with the existing techniques show that the proposed tool can deliver optimal performance when applied to tough object segmentation tasks.
Similar content being viewed by others
References
Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Süsstrunk S (2012) Slic superpixels compared to state-of-the-art superpixel methods. IEEE Trans Pattern Anal Mach Intell 34(11):2274–2282
Belongie S, Carson C, Greenspan H, Malik J (1998) Color- and texture-based image segmentation using em and its application to content-based image retrieval. In: Proceedings of IEEE International Conference Computer Vision (ICCV), pp 675–682
Berman A, Dadourian A, Vlahos P (2000) Method for removing from an image the background surrounding a selected object, October 17 US Patent 6,134,346
Berman A, Vlahos P, Dadourian A (2001) Method for removing from an image the background surrounding a selected subject by generating candidate mattes, September 11 US Patent 6,288,703
Beucher S, Meyer F (1993) The morphological approach to segmentation: the watershed transformation. Mathematical morphology in image processing. Opt Eng 34:433–481
Bo P, Zhang L, Zhang D (2013) A survey of graph theoretical approaches to image segmentation. Pattern Recognit 46(3):1020–1038
Bosamiya JH, Agrawal P, Roy PP, Balasubramanian R (2015) Script independent scene text segmentation using fast stroke width transform and grabcut. In: 3Rd IAPR Asian Conference on Pattern Recognition (ACPR), pp 151–155
Bouman CA (1997) Cluster: an unsupervised algorithm for modeling Gaussian mixtures. Available from http://engineering.purdue.edu/bouman
Boykov Y, Jolly M-P (2001) Interactive graph cuts for optimal boundary amp; region segmentation of objects in n-d images. In: Proceedings of IEEE International Conference on Computer Vision (ICCV), vol 1, pp 105–112
Boykov Y, Funka-Lea G (2006) Graph cuts and efficient N-D image segmentation. Int J Comput Vis 70(2):109–131
Boykov Y, Veksler O (2006) Graph cuts in vision and graphics: Theories and applications. In: Paragios N, Chen Y, Faugeras O (eds) Handbook of mathematical models in computer vision. Springer, US, pp 79–96
Boykov Y, Veksler O, Zabih R (2001) Fast approximate energy minimization via graph cuts. IEEE Trans Pattern Anal Mach Intell 23(11):1222–1239
Cagnazzo M, Parrilli S, Poggi G, Verdoliva L (2007) Costs and advantages of object-based image coding with shape-adaptive wavelet transform. EURASIP J Image Video Process 2007(1):19–19
Chen D, Li G, Sun Y, Kong J, Jiang G, Tang H, Ju Z, Yu H, Liu H (2017) An interactive image segmentation method in hand gesture recognition. Sensors 17(2):1–17
Cheng MM, Prisacariu VA, Zheng S, Torr PHS, Rother C (2015) Densecut: densely connected crfs for realtime grabcut. Comput Graph Forum 34(7):193–201
Chuang Y-Y, Curless B, Salesin DH, Szeliski R (2001) A bayesian approach to digital matting. In: Proceedings of IEEE Conference on Computer Vision Pattern Recognition (CVPR), vol 2, pp 264–271
Delong A, Boykov Y (2008) A scalable graph-cut algorithm for N-D grids. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 1–8
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the em algorithm. J R Stat Soc Ser B-Stat Methodol 39(1):1–38
Farid MS, Mahmood A, Grangetto M (2016) Image de-fencing framework with hybrid inpainting algorithm. Signal Image Video Process 10(7):1193–1201
Fawcett T (2006) An introduction to ROC analysis. Pattern Recognit Lett 27(8):861–874
Grünwald P (2005) A Tutorial Introduction to the Minimum Description Length Principle
Guan Q, Hua M, Hu HG (2017) A modified grabcut approach for image segmentation based on local prior distribution. In: 2017 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR), pp 122–126
Hansen MH, Yu B (2001) Model selection and the principle of minimum description length. J Am Stat Assoc 96(454):746–774
Heimowitz A, Keller Y (2016) Image segmentation via probabilistic graph matching. IEEE Trans Image Process 25(10):4743–4752
Hernandez-Lopez FJ, Rivera M (2014) Change detection by probabilistic segmentation from monocular view. Mach Vis Appl 25(5):1175–1195
Im J, Jensen JR, Tullis JA (2008) Object-based change detection using correlation image analysis and image segmentation. Int J Remote Sens 29(2):399–423
Jian M, Jung C (2016) Interactive image segmentation using adaptive constraint propagation. IEEE Trans Image Process 25(3):1301–1311
Johnson J, Krishna R, Stark M, Li L-J, Shamma D, Bernstein M, Li Fei-F (2015) Image retrieval using scene graphs. In: Proceedings of IEEE Conference on Computer Vision Pattern Recognition (CVPR)
Juan O, Boykov Y (2006) Active graph cuts. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol 1, pp 1023–1029
Khan MH, Shirahama K, Farid MS, Grzegorzek M (2016) Multiple human detection in depth images. In: Proceedings of International Workshop on Multimedia Signal Processing (MMSP), pp 1–6
Kolmogorov V, Zabin R (2004) What energy functions can be minimized via graph cuts? IEEE Trans Pattern Anal Mach Intell 26(2):147–159
Kontkanen P, Myllymäki P, Buntine W, Rissanen J, Tirrii H (2003) An MDL framework for data clustering
Kuntimad G, Ranganath HS (1999) Perfect image segmentation using pulse coupled neural networks. IEEE Trans Neural Netw 10(3):591–598
Li Y, Sun J, Tang C-K, Shum H-Y (2004) Lazy snapping. Trans Graph 23(3):303–308
Lin D, Dai J, Jia J, He K, Sun J (2016) ScribbleSup: scribble-supervised convolutional networks for semantic segmentation. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 3159–3167
Little RJA, Rubin DB (1987) Statistical analysis with missing data. Wiley Series in Probability and Statistics, 1st edition. Wiley, New York
Liu J, Sun J (2010) Parallel graph-cuts by adaptive bottom-up merging. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 2181–2188
Mahalanobis PC (1936) On the generalised distance in statistics. Proc Natl Inst Sci, India 2(1):49–55
Mnih V (2013) Massachusetts roads dataset. Available from http://www.cs.toronto.edu/vmnih/data/
Mortensen E, Barrett W (1995) Intelligent scissors for image composition. In: Proceedings of the 22nd annual conference on computer graphics interaction technology SIGGRAPH ’95. ACM, pp 191–198
Mortensen E, Barrett W (1998) Interactive segmentation with intelligent scissors. Graph Model Im Proc 60(5):349–384
Mortensen E, Barrett W (1999) Toboggan-based intelligent scissors with a four-parameter edge model. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol 2, pp 452–458
Mortensen E, Morse B, Barrett W, Udupa J (1992) Adaptive boundary detection using ‘live-wire’ two-dimensional dynamic programming. In: Proceedings of Computer Cardiology, pp 635–638
Osher S, Sethian JA (1988) Fronts propagating with curvature-dependent speed: algorithms based on hamilton-jacobi formulations. J Comput Phys 79(1):12–49
Pham DL, Chenyang X, Prince JL (2000) Current methods in medical image segmentation. Annu Rev Biomed Eng 2(1):315–337
Redner RA, Walker HF (1984) Mixture densities, maximum likelihood and the em algorithm. SIAM Rev 26(2):195–239
Ren D, Jia Z, Yang J, Kasabov NK (2017) A practical grabcut color image segmentation based on bayes classification and simple linear iterative clustering. IEEE Access 5:18480–18487
Rissanen J (1983) A universal prior for integers and estimation by minimum description length. Ann Stat 11(2):416–431, 06
Rissanen J (2001) Strong optimality of the normalized ml models as universal codes and information in data. IEEE Trans Inf Theory 47(5):1712–1717
Rother C, Kolmogorov V, Blake A (2004) GrabCut: interactive foreground extraction using iterated graph cuts. ACM Trans Graph 23(3):309–314
Ruzon MA, Tomasi C (2000) Alpha estimation in natural images. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol 1, pp 18–25
Shen H, Pan WD, Wu D (2017) Predictive lossless compression of regions of interest in hyperspectral images with no-data regions. IEEE Trans Geosci Remote Sens 55(1):173–182
Stockman G, Shapiro LG (2001) Computer Vision, 1st edition. Prentice Hall PTR, Upper Saddle River
Strandmark P, Kahl F (2010) Parallel and distributed graph cuts by dual decomposition. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 2085–2092
Tappen MF, Freeman WT (2003) Comparison of graph cuts with belief propagation for stereo, using identical MRF parameters. In: Proceedings of IEEE International Conference on Computer Vision (ICCV), pp 900–906
Tazeem H, Farid MS, Mahmood A (2017) Improving security surveillance by hidden cameras. Multimed Tool Appl 76(2):2713–2732
Udupa J, LaBlanc VR, Schmidt H, Imielinska C, Saha PK, Grevera GJ, Zhuge Y, Currie LM, Molholt P, Jin Y (2002) Methodology for evaluating image-segmentation algorithms. In: Medical imaging, pp 266–277
Udupa JK, LeBlanc VR, Zhuge Y, Imielinska C, Schmidt H, Currie LM, Hirsch BE, Woodburn J (2006) A framework for evaluating image segmentation algorithms. Comput Med Imaging Graph 30(2):75–87
Vezhnevets V, Growcut VK (2005) Interactive multi-label nd image segmentation by cellular automata. In: Proceedings of graphicon, pp 150–156
Vineet V, Narayanan PJ (2008) CUDA cuts: fast Graph cuts on the GPU. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), pp 1–8
von Neumann J (1951) The general and logical theory of automata. Cerebral Mechanisms in Behavior – The Hixon Symposium, 1–31
Wallace RS, Kanade T (1990) Finding natural clusters having minimum description length. In: Proceedings of International Conference on Pattern Recognition (ICPR), vol 1, pp 438–442
Wang J, Cohen MF (2005) An iterative optimization approach for unified image segmentation and matting. In: Proceedings of IEEE International Conference on Computer Vision (ICCV), vol 2, pp 936–943
Wang J, Cohen MF (2008) Image and video matting: a survey. Now Publishers Inc.
Wang G, Lu J, Pan Z, Miao Q (2016) Color texture segmentation based on active contour model with multichannel nonlocal and tikhonov regularization. Multimed Tool Appl:1–12
Wang R, Lv J, Ma S (2017) A mri image segmentation method based on medical semaphore calculating in medical multimedia big data environment. Multimed Tool Appl:1–21
Wu Y, Peng X, Ruan K, Hu Z (2016) Improved image segmentation method based on morphological reconstruction. Multimed Tool Appl 76(19):1–13
Xu X, Geng W, Ju R, Yang Y, Ren T, Wu G (2014) OBSIR: object-based stereo image retrieval. In: Proceeding of IEEE International Conference on Multimedia and Expo (ICME), pp 1–6
Yan C, Zhang Y, Xu J, Dai F, Li L, Dai Q, Wu F (2014) A highly parallel framework for HEVC coding unit partitioning tree decision on many-core processors. IEEE Signal Process Lett 21(5):573–576
Yan C, Zhang Y, Xu J, Dai F, Zhang J, Dai Q, Wu F (2014) Efficient parallel framework for HEVC motion estimation on many-core processors. IEEE Trans Circuits Syst Video Technol 24(12):2077–2089
Yan C, Xie H, Yang D, Yin J, Zhang Y, Dai Q (2017) Supervised hash coding with deep neural network for environment perception of intelligent vehicles. IEEE Trans Intell Transp Syst PP(99):1–12
Yan C, Xie H, Liu S, Yin J, Zhang Y, Dai Q (2017) Effective uyghur language text detection in complex background images for traffic prompt identification. IEEE Trans Intell Transp Syst PP(99):1–10
Yang Q, Tang X, Wang C, Ye Z, Mo C (2007) Progressive cut an image cutout algorithm that models user intentions. IEEE Trans Multimed 14(3):56–66
Yi P, Li C, Ou-Yang F-X, Chen W, Yong J-H (2015) JF-Cut: a parallel graph cut approach for large-scale image and video. IEEE Trans Image Process 24(2):655–666
Zemene E, Pelillo M (2016) Interactive image segmentation using constrained dominant sets. In: Proceedings 14th European Conference Computer Vision (ECCV). Springer International Publishing, Cham, pp 278–294
Zhao W, Chellappa R, Phillips PJ, Rosenfeld A (2003) Face recognition: a literature survey. ACM Comput Surv 35(4):399–458
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Farid, M.S., Lucenteforte, M. & Grangetto, M. DOST: a distributed object segmentation tool. Multimed Tools Appl 77, 20839–20862 (2018). https://doi.org/10.1007/s11042-017-5546-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-017-5546-4