Abstract
Several types of interest point detectors have been proposed for spatial images. This paper investigates how this notion can be generalised to the detection of interesting events in space-time data. Moreover, we develop a mechanism for spatio-temporal scale selection and detect events at scales corresponding to their extent in both space and time. To detect spatio-temporal events, we build on the idea of the Harris and Förstner interest point operators and detect regions in space-time where the image structures have significant local variations in both space and time. In this way, events that correspond to curved space-time structures are emphasised, while structures with locally constant motion are disregarded.
To construct this operator, we start from a multi-scale windowed second moment matrix in space-time, and combine the determinant and the trace in a similar way as for the spatial Harris operator. All spacetime maxima of this operator are then adapted to characteristic scales by maximising a scale-normalised space-time Laplacian operator over both spatial scales and temporal scales. The motivation for performing temporal scale selection as a complement to previous approaches of spatial scale selection is to be able to robustly capture spatio-temporal events of different temporal extent. It is shown that the resulting approach is truly scale invariant with respect to both spatial scales and temporal scales. The proposed concept is tested on synthetic and real image sequences. It is shown that the operator responds to distinct and stable points in space-time that often correspond to interesting events. The potential applications of the method are discussed.
The support from the Swedish Research Council and from the Royal Swedish Academy of Sciences as well as the Knut and Alice Wallenberg Foundation is gratefully acknowledged.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Almansa, A. and Lindeberg, T. (2000). Fingerprint enhancement by shape adaptation of scale-space operators with automatic scale-selection, IEEE Transactions on Image Processing 9(12): 2027–2042.
Barron, J., Fleet, D. and Beauchemin, S. (1994). Performance of optical flow techniques, International Journal of Computer Vision 12(1): 43–77.
Bigün, J., Granlund, G. and Wiklund, J. (1991). Multidimensional orientation estimation with applications to texture analysis and optical flow, IEEE Transactions on Pattern Analysis and Machine Intelligence 13(8): 775–790.
Black, M. and Jepson, A. (1998). Eigen tracking: Robust matching and tracking of articulated objects using view-based representation, International Journal of Computer Vision 26(1): 63–84.
Blake, A. and Isard, M. (1998). Condensation — conditional density propagation for visual tracking, IJCV 29(1): 5–28.
Chomat, O., de Verdiere, V., Hall, D. and Crowley, J. (2000a). Local scale selection for Gaussian based description techniques, Proc. Sixth European Conference on Computer Vision, Vol. 1842 of Lecture Notes in Computer Science, Springer Verlag, Berlin, Dublin, Ireland, pp. 117–133.
Chomat, O., Martin, J. and Crowley, J. (2000b). A probabilistic sensor for the perception and recognition of activities, Proc. Sixth European Conference on Computer Vision, Dublin, Ireland, pp. I:487–503.
Fleet, D., Black, M. and Jepson, A. (1998). Motion feature detection using steerable flow fields, Proc. Computer Vision and Pattern Recognition, Santa Barbara, CA, pp. 274–281.
Florack, L. M. J. (1997). Image Structure, Kluwer Academic Publishers, Dordrecht, Netherlands.
Förstner, W. A. and Gülch, E. (1987). A fast operator for detection and precise location of distinct points, corners and centers of circular features, Proc. Intercommission Workshop of the Int. Soc. for Photogrammetry and Remote Sensing, Interlaken, Switzerland.
Garding, J. and Lindeberg, T. (1996). Direct computation of shape cues using scale-adapted spatial derivative operators, International Journal of Computer Vision 17(2): 163–191.
Hall, D., de Verdiere, V. and Crowley, J. (2000). Object recognition using coloured receptive fields, Proc. Sixth European Conference on Computer Vision, Vol. 1842 of Lecture Notes in Computer Science, Springer Verlag, Berlin, Dublin, Ireland, pp. 164–177.
Harris, C. and Stephens, M. (1988). A combined corner and edge detector, Alvey Vision Conference, pp. 147–152.
Koenderink, J. J. (1988). Scale-time, Biological Cybernetics 58: 159–162.
Koenderink, J. J. and van Doorn, A. J. (1992). Generic neighborhood operators, IEEE Transactions on Pattern Analysis and Machine Intelligence 14(6): 597–605.
Laptev, I. and Lindeberg, T. (2002). Velocity-adaptation of spatio-temporal receptive fields for direct recognition of activities: An experimental study, in D. Suter (ed.), Proc. ECCV’02 workshop on Statistical Methods in Video Processing, Copenhagen, Denmark, pp. 61–66.
Lindeberg, T. (1994). Scale-Space Theory in Computer Vision, Kluwer Academic Publishers, Boston.
Lindeberg, T. (1997). On automatic selection of temporal scales in time-causal scale-space, AFPAC’97: Algebraic Frames for the Perception-Action Cycle, Vol. 1315 of Lecture Notes in Computer Science, Springer Verlag, Berlin, pp. 94–113.
Lindeberg, T. (1998). Feature detection with automatic scale selection, International Journal of Computer Vision 30(2): 77–116.
Lindeberg, T. and Bretzner, L. (2003). Real-time scale selection in hybrid multi-scale representations, Proc. Scale-Space’03, LNCS, Springer Verlag, these proceedings.
Lindeberg, T. and Fagerström, D. (1996). Scale-space with causal time direction, Proc. Fourth European Conference on Computer Vision, Vol. 1064 of Lecture Notes in Computer Science, Springer Verlag, Berlin, Cambridge, UK, pp. I:229–240.
Lowe, D. (1999). Object recognition from local scale-invariant features, Proc. Seventh International Conference on Computer Vision, Corfu, Greece, pp. 1150–1157.
Mikolajczyk, K. and Schmid, C. (2001). Indexing based on scale invariant interest points, Proc. Eighth International Conference on Computer Vision, Vancouver, Canada, pp. I:525–531.
Mikolajczyk, K. and Schmid, C. (2002). An affine invariant interest point detector, Proc. Seventh European Conference on Computer Vision, Vol. 2350 of Lecture Notes in Computer Science, Springer Verlag, Berlin, Copenhagen, Denmark, pp. I:128–142.
Niyogi, S. A. (1995). Detecting kinetic occlusion, Proc. Fifth International Conference on Computer Vision, Cambridge, MA, pp. 1044–1049.
Schmid, C. and Mohr, R. (1997). Local grayvalue invariants for image retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence 19(5): 530–535.
Schmid, C., Mohr, R. and Bauckhage, C. (2000). Evaluation of interest point detectors, International Journal of Computer Vision 37(2): 151–172.
Smith, S. and Brady, J. (1995). ASSET-2: Real-time motion segmentation and shape tracking, IEEE Transactions on Pattern Analysis and Machine Intelligence 17(8): 814–820.
Tell, D. and Carlsson, S. (2002). Combining topology and appearance for wide baseline matching, Proc. Seventh European Conference on Computer Vision, Vol. 2350 of Lecture Notes in Computer Science, Springer Verlag, Berlin, Copenhagen, Denmark, pp. I:68–83.
Tuytelaars, T. and Van Gool, L. (2000). Wide baseline stereo matching based on local, affinely invariant regions, British Machine Vision Conference, pp. 412–425.
Witkin, A. P. (1983). Scale-space filtering, Proc. 8th Int. Joint Conf. Art. Intell., Karlsruhe, Germany, pp. 1019–1022.
Zelnik-Manor, L. and Irani, M. (2001). Event-based analysis of video, Proc. Computer Vision and Pattern Recognition, Kauai Marriott, Hawaii, pp. II:123–130.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Laptev, I., Lindeberg, T. (2003). Interest Point Detection and Scale Selection in Space-Time. In: Griffin, L.D., Lillholm, M. (eds) Scale Space Methods in Computer Vision. Scale-Space 2003. Lecture Notes in Computer Science, vol 2695. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44935-3_26
Download citation
DOI: https://doi.org/10.1007/3-540-44935-3_26
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40368-5
Online ISBN: 978-3-540-44935-5
eBook Packages: Springer Book Archive