Video Spatio-temporal Signatures Using Polynomial Transforms
In this paper we integrate spatial and temporal information, extracted separately from a video sequence, for indexing and retrieval purposes. We focus on two filter families that are suitable models of the human visual system for encoding spatial and temporal information. Both are special cases of polynomial transforms, which perform local decompositions of a signal. Spatial primitives are extracted using Hermite filters, which agree with the Gaussian derivative model of receptive field profiles. Temporal events are characterized by Laguerre filters, which preserve the causality constraint in the temporal domain. Integrating both models yields a spatio-temporal feature extractor based on early vision. The results support the use of our model for video indexing and retrieval.
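As a rough illustration of the idea, the following sketch builds Gaussian-derivative (Hermite) kernels for the spatial axes and causal Laguerre kernels for the time axis, then summarizes the filter responses as a small feature vector. It is a minimal sketch under stated assumptions, not the filters or signature defined in the paper: the names `hermite_kernel`, `laguerre_kernel`, `filter_axis`, and `signature`, the parameters `sigma`, `p`, `radius`, and `length`, the use of the standard orthonormal Laguerre functions, and the mean-absolute-response summary are all hypothetical choices made for illustration.

```python
import numpy as np
from numpy.polynomial.hermite import hermval
from numpy.polynomial.laguerre import lagval


def hermite_kernel(order, sigma=1.5, radius=6):
    """Sampled n-th derivative of a Gaussian (assumed spatial filter).

    Uses d^n/dx^n G(x) = (-1)^n (sigma*sqrt(2))^-n H_n(u) G(x), u = x/(sigma*sqrt(2)),
    where H_n is the physicists' Hermite polynomial.
    """
    x = np.arange(-radius, radius + 1, dtype=float)
    u = x / (sigma * np.sqrt(2.0))
    gauss = np.exp(-u ** 2) / (np.sqrt(2.0 * np.pi) * sigma)
    coeffs = np.zeros(order + 1)
    coeffs[order] = 1.0  # select H_order only
    scale = (sigma * np.sqrt(2.0)) ** order
    return (-1.0) ** order * hermval(u, coeffs) * gauss / scale


def laguerre_kernel(order, p=0.5, length=16):
    """Sampled Laguerre function sqrt(2p) exp(-p t) L_n(2 p t), defined for t >= 0 only."""
    t = np.arange(length, dtype=float)
    coeffs = np.zeros(order + 1)
    coeffs[order] = 1.0  # select L_order only
    return np.sqrt(2.0 * p) * np.exp(-p * t) * lagval(2.0 * p * t, coeffs)


def filter_axis(data, kernel, axis, causal=False):
    """Convolve along one axis; causal mode uses only current and past samples."""
    def conv(v):
        full = np.convolve(v, kernel, mode="full")
        if causal:
            return full[: v.size]          # output[t] depends on v[t], v[t-1], ...
        start = (kernel.size - 1) // 2     # centered, same-size output
        return full[start : start + v.size]
    return np.apply_along_axis(conv, axis, data)


def signature(video, spatial_orders=(0, 1, 2), temporal_orders=(0, 1)):
    """Mean absolute response per (temporal, vertical, horizontal) filter order.

    `video` is an array of shape (frames, height, width); this summary is an
    assumption for illustration, not the paper's signature.
    """
    feats = []
    for n in temporal_orders:
        vt = filter_axis(video, laguerre_kernel(n), axis=0, causal=True)
        for i in spatial_orders:
            rows = filter_axis(vt, hermite_kernel(i), axis=1)
            for j in spatial_orders:
                resp = filter_axis(rows, hermite_kernel(j), axis=2)
                feats.append(np.mean(np.abs(resp)))
    return np.array(feats)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clip = rng.random((20, 16, 16))   # synthetic stand-in for a video clip
    print(signature(clip).shape)
```

Two clips could then be compared by any distance between their signature vectors. The spatial kernels are symmetric around each pixel, while the temporal kernels look only backwards in time, which mirrors the causality constraint mentioned in the abstract.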