Exploiting Low-Level Image Segmentation for Object Recognition
A method for exploiting the information in low-level image segmentations for the purpose of object recognition is presented. The key idea is to use a whole ensemble of segmentations per image, computed on different random samples of image sites. Along the boundaries of those segmentations that are stable under the sampling process we extract strings of vectors that contain local image descriptors like shape, texture and intensities. Pairs of such strings are aligned, and based on the alignment scores a mixture model is trained which divides the segments in an image into fore- and background. Given such candidate foreground segments, we show that it is possible to build a state-of-the-art object recognition system that exhibits excellent performance on a standard benchmark database. This result shows that despite the inherent problems of low-level image segmentation in poor data conditions, segmentation can indeed be a valuable tool for object recognition in real-world images.
KeywordsObject Recognition Gaussian Mixture Model Training Image Category Label Segment Boundary
Unable to display preview. Download preview PDF.
- 1.Agarwal, S., Awan, A., Roth, D.: Learning to detect objects in images via a sparse, part-based representation. IEEE Trans. Pattern Anal. Machine Intell. 26(11) (2004)Google Scholar
- 5.Berg, A.C., Berg, T.L., Malik, J.: Shape matching and object recognition using low distortion correspondence. In: CVPR 2005, pp. 26–33 (2005)Google Scholar
- 6.Yu, S.X., Gross, R., Shi, J.: Concurrent object recognition and segmentation by graph partitioning. In: NIPS, pp. 1383–1390. MIT Press, Cambridge (2002)Google Scholar
- 7.Geman, S., Potter, D.F., Chi, Z.: Composition Systems. Technical report, Division of Applied Mathematics, Brown University, Providence, RI (1998)Google Scholar
- 8.Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories. In: CVPR Workshop GMBV (2004)Google Scholar
- 11.Belongie, S., Malik, J., Puzicha, J.: Matching shapes. In: ICCV 2001, pp. 454–463 (2001)Google Scholar
- 13.Roth, V., Steinhage, V.: Nonlinear discriminant analysis using kernel functions. In: Solla, S., Leen, T., Müller, K.R. (eds.) NIPS 12, pp. 568–574. MIT Press, Cambridge (1999)Google Scholar
- 15.Roth, V., Tsuda, K.: Pairwise coupling for machine recognition of hand-printed japanese characters. In: CVPR, pp. 1120–1125 (2001)Google Scholar