Video Segmentation and Shot Boundary Detection Using Self-Organizing Maps
We present a video shot boundary detection (SBD) algorithm that spots discontinuities in the visual stream by monitoring video frame trajectories on Self-Organizing Maps (SOMs). The SOM mapping compensates for the probability density differences in the feature space, and consequently distances between SOM coordinates are more informative than distances between plain feature vectors.
The proposed method compares two sliding windows of best-matching units (BMUs) instead of just measuring the distance between two trajectory points, which increases the robustness of the detector. This can be seen as a variant of the adaptive threshold SBD methods. Robustness is increased further by using a committee machine of multiple SOM-based detectors. The evaluation carried out by NIST as part of TRECVID confirms that the SOM-based SBD method works comparatively well in news video segmentation, especially in gradual transition detection.
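The window comparison and the committee described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each frame has already been mapped to the 2-D grid coordinates of its best-matching unit, it takes the distance between two windows to be the smallest pairwise Euclidean distance between their units, and it uses a fixed threshold and a simple vote count for the committee. The names `window_distance`, `detect_boundaries`, `committee_vote`, and the parameters `window`, `threshold`, and `min_votes` are hypothetical.

```python
from math import hypot


def window_distance(before, after):
    """Smallest Euclidean distance between any BMU in `before` and any in `after`.

    Assumption: the abstract does not say how two windows are compared;
    the minimum pairwise distance is used here purely for illustration.
    """
    return min(hypot(a[0] - b[0], a[1] - b[1]) for a in before for b in after)


def detect_boundaries(bmus, window=3, threshold=2.0):
    """Return frame indices t where the preceding and following windows are far apart.

    `bmus` is a list of (row, col) SOM coordinates, one per frame.
    The windows compared at index t are bmus[t-window:t] and bmus[t:t+window].
    """
    hits = []
    for t in range(window, len(bmus) - window + 1):
        if window_distance(bmus[t - window:t], bmus[t:t + window]) > threshold:
            hits.append(t)
    return hits


def committee_vote(detections, min_votes):
    """Keep the frame indices flagged by at least `min_votes` detectors."""
    counts = {}
    for hits in detections:
        for t in set(hits):
            counts[t] = counts.get(t, 0) + 1
    return sorted(t for t, c in counts.items() if c >= min_votes)


# Toy trajectory: four frames near one corner of the map, then four near another.
trajectory = [(0, 0), (0, 1), (1, 0), (0, 0), (8, 8), (8, 9), (9, 8), (8, 8)]
boundaries = detect_boundaries(trajectory, window=3, threshold=2.0)
```

Comparing windows rather than single trajectory points means that one outlying frame does not trigger a detection by itself, since the other frames in its window still lie close to the opposite window. In the committee, each detector would run on a SOM trained with a different feature, and `committee_vote` would combine their outputs.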
Keywords: self-organizing map, video shot boundary detection