A New Class of Learnable Detectors for Categorisation

Matas, Jiri; Zimmermann, Karel

doi:10.1007/11499145_55

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 3540)

Included in the following conference series:

Scandinavian Conference on Image Analysis

Abstract

A new class of image-level detectors that can be adapted by machine learning techniques to detect parts of objects from a given category is proposed. A classifier (e.g. a neural network or an AdaBoost-trained classifier) within the detector selects a relevant subset of extremal regions, i.e. regions that are connected components of a thresholded image. Properties of extremal regions render the detector very robust to illumination change. Robustness to viewpoint change is achieved by using invariant descriptors and/or by modeling shape variations by the classifier.
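The definition above can be made concrete with a minimal sketch. It assumes dark-on-bright regions only (pixels with intensity at or below the threshold), 4-connectivity, and a hypothetical hand-written rule standing in for the trained classifier. The function names, the toy descriptor and the toy image are illustrative assumptions, not the authors' implementation.

```python
from collections import deque


def extremal_regions(image, threshold):
    """Connected components (4-neighbourhood) of pixels with intensity <= threshold.

    Each region is returned as a frozenset of (row, col) coordinates.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    regions = []
    for r in range(rows):
        for c in range(cols):
            if seen[r][c] or image[r][c] > threshold:
                continue
            # Breadth-first flood fill from an unvisited below-threshold pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            region = []
            while queue:
                y, x = queue.popleft()
                region.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < rows and 0 <= nx < cols
                    if inside and not seen[ny][nx] and image[ny][nx] <= threshold:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            regions.append(frozenset(region))
    return regions


def all_extremal_regions(image):
    """Distinct regions over every threshold that occurs in the image (brute force)."""
    levels = sorted({v for row in image for v in row})
    found = set()
    for t in levels:
        found.update(extremal_regions(image, t))
    return found


def describe(region):
    """Hypothetical toy descriptor: (area, bounding-box width / height)."""
    ys = [y for y, _ in region]
    xs = [x for _, x in region]
    height = max(ys) - min(ys) + 1
    width = max(xs) - min(xs) + 1
    return len(region), width / height


def select_regions(regions, classifier):
    """Keep the regions whose descriptor the classifier accepts."""
    return [r for r in regions if classifier(describe(r))]


# Toy image: two dark vertical strokes on a bright background.
image = [
    [9, 9, 9, 9, 9],
    [9, 1, 9, 2, 9],
    [9, 1, 9, 2, 9],
    [9, 9, 9, 9, 9],
]
# A monotonic brightness change keeps the ordering of intensities.
brighter = [[3 * v + 10 for v in row] for row in image]

regions = all_extremal_regions(image)
regions_brighter = all_extremal_regions(brighter)

# Stand-in for a trained classifier: accept small, tall regions.
selected = select_regions(regions, lambda d: d[0] <= 4 and d[1] < 1.0)
```

On this toy image the set of regions is identical before and after the brightness change, because thresholding depends only on the ordering of intensities. This is one way to see why detectors built on extremal regions can be insensitive to monotonic illumination changes.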

The approach is brought to bear on three problems: text detection, face segmentation and leopard skin detection. A high detection rate (92%) was obtained for unconstrained (i.e. brightness-, affine- and font-invariant) text detection, with a reasonable false positive rate.

The time complexity of the detection is approximately linear in the number of pixels and a non-optimized implementation runs at about 1 frame per second for a 640 × 480 image on a high-end PC.
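The abstract does not say how the regions are enumerated. One common way to visit the connected components at every threshold in a single pass is to add pixels in order of increasing intensity and merge neighbours with a union-find structure. The sketch below is an assumption about how near-linear behaviour could be obtained, not the authors' implementation; the function name and the toy image are hypothetical.

```python
def count_regions_per_threshold(image):
    """Number of connected components of {pixel <= t} for each intensity t in the image."""
    rows, cols = len(image), len(image[0])
    parent = {}

    def find(p):
        # Follow parent links to the root, halving the path on the way.
        while parent[p] != p:
            parent[p] = parent[parent[p]]
            p = parent[p]
        return p

    # Comparison sort for brevity; a counting sort over 8-bit intensities
    # would avoid the log factor.
    pixels = sorted((image[r][c], r, c) for r in range(rows) for c in range(cols))

    counts = {}
    components = 0
    for value, r, c in pixels:
        parent[(r, c)] = (r, c)
        components += 1
        for n in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            if n in parent:  # neighbour already added, i.e. not brighter
                a, b = find((r, c)), find(n)
                if a != b:
                    parent[a] = b
                    components -= 1
        # Overwritten until the last pixel of this intensity level is added.
        counts[value] = components
    return counts


toy = [
    [9, 9, 9, 9, 9],
    [9, 1, 9, 2, 9],
    [9, 1, 9, 2, 9],
    [9, 9, 9, 9, 9],
]
region_counts = count_regions_per_threshold(toy)  # {1: 1, 2: 2, 9: 1}
```

Each pixel is inserted once and examines at most four neighbours, so the work after sorting grows almost linearly with the number of pixels.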

Author information

Authors and Affiliations

Center for Machine Perception, Faculty of Electrotechnical Engineering, Czech Technical University in Prague
Jiri Matas & Karel Zimmermann

Editor information

Editors and Affiliations

Department of Information Technology, Lappeenranta University of Technology, P.O.Box 20, FIN-53851, Lappeenranta, Finland
Heikki Kalviainen
Dept. of Computer Science, University of Joensuu, Finland
Jussi Parkkinen
Department of Information and Computer Sciences, Toyohashi University of Technology, 1-1 Hibarigaoka, Tenpaku-cho, 441-8580, Toyohashi, Japan
Arto Kaarna

Cite this paper

Matas, J., Zimmermann, K. (2005). A New Class of Learnable Detectors for Categorisation. In: Kalviainen, H., Parkkinen, J., Kaarna, A. (eds) Image Analysis. SCIA 2005. Lecture Notes in Computer Science, vol 3540. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11499145_55

DOI: https://doi.org/10.1007/11499145_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26320-3
Online ISBN: 978-3-540-31566-7
eBook Packages: Computer Science, Computer Science (R0)
