Abstract
A new class of image-level detectors that can be adapted by machine learning techniques to detect parts of objects from a given category is proposed. A classifier (e.g. neural network or adaboost trained classifier) within the detector selects a relevant subset of extremal regions, i.e. regions that are connected components of a thresholded image. Properties of extremal regions render the detector very robust to illumination change. Robustness to viewpoint change is achieved by using invariant descriptors and/or by modeling shape variations by the classifier.
The approach is brought to bear on three problems: text detection, face segmentation and leopard skin detection. High detection rates were obtained for unconstrained (i.e. brightness, affine and font invariant) text detection (92%) with a reasonable false positive rate.
The time-complexity of the detection is approximately linear in the number of pixels and a non-optimized implementation runs at about 1 frame per second for a 640× 480 image on a high-end PC.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Fei-Fei, L., Fergus, R., Perona, P.: A bayesian approach to unsupervised one-shot learning of object categories. In: ICCV 2003, pp. 1134–1141 (2003)
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: CVPR 2003, vol. II, pp. 264–271 (2003)
Ferrari, V., Tuytelaars, T., Van Gool, L.: Real-time affine region tracking and coplanar grouping. In: CVPR 2001, II, 226–233 (2001)
Ferrari, V., Tuytelaars, T., Van Gool, L.: Wide-baseline multiple-view correspondences. In: CVPR (2003)
Kadir, T., Brady, M.: Saliency, scale and image description. In: IJCV 2001, vol. 45(2), pp. 83–105 (2001)
Lazebnik, S., Schmid, C., Ponce, J.: Affine-invariant local descriptors and neighborhood statistics for texture recognition. In: ICCV 2003, pp. 649–655 (2003)
Leibe, B., Schiele, B.: Analyzing appearance and contour based methods for object categorization. In: CVPR 2003, vol. II, pp. 409–415 (2003)
Lowe, D.G.: Object recognition from local scale-invariant features. In: ICCV 1999, pp. 1150–1157 (1999)
Lucas, S.: Icdar03 text detection competition datasets (2003), http://algoval.essex.ac.uk/icdar/Datasets.html
Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. In: BMVC 2002, London, UK, vol. 1, pp. 384–393 (2002)
Mikolajczyk, K., Schmid, C.: Indexing based on scale invariant interest points. In: ICCV 2001, pp. 525–531 (2001)
Mori, G., Belongie, S., Malik, J.: Shape contexts enable efficient retrieval of similar shapes. In: CVPR 2001, vol. I, pp. 723–730 (2001)
Obdrzalek, S., Matas, J.: Object recognition using local affine frames on distinguished regions. In: BMVC, London, UK, vol. 1, pp. 113–122 (2002)
Pritchett, P., Zisserman, A.: Matching and reconstruction from widely separated views. In: Koch, R., Van Gool, L. (eds.) SMILE 1998. LNCS, vol. 1506, p. 78. Springer, Heidelberg (1998)
Schmid, C., Mohr, R.: Local grayvalue invariants for image retrieval. PAMI 19(5), 530–535 (1997)
Sivic, J., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: ICCV 2003, pp. 1470–1477 (2003)
Tuytelaars, T., van Gool, L.: Content-based image retrieval based on local affinely invariant regions. In: VIIS, pp. 493–500 (1999)
Weber, M., Welling, M., Perona, P.: Unsupervised learning of models for recognition. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1842, pp. 18–32. Springer, Heidelberg (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Matas, J., Zimmermann, K. (2005). A New Class of Learnable Detectors for Categorisation. In: Kalviainen, H., Parkkinen, J., Kaarna, A. (eds) Image Analysis. SCIA 2005. Lecture Notes in Computer Science, vol 3540. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11499145_55
Download citation
DOI: https://doi.org/10.1007/11499145_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26320-3
Online ISBN: 978-3-540-31566-7
eBook Packages: Computer ScienceComputer Science (R0)