Combining Short and Long Term Audio Features for TV Sports Highlight Detection
As bearer of high-level semantics, audio signal is being more and more used in content-based multimedia retrieval. In this paper, we investigate TV tennis game highlight detection based on the use of both short and long term audio features and propose two approaches, decision fusion and hierarchical classifier, in order to combine these two kinds of audio features. As more information is included in decision making, the overall performance of the system is enhanced.
KeywordsAudio Signal Audio Feature Decision Fusion Term Feature Spectrum Envelope
Unable to display preview. Download preview PDF.
- 1.Rui, Y., Gupta, A., Acero, A.: Automatically extracting highlights for TV baseball programs. In: Proc. of the 8th ACM MULTIM, pp. 105–115 (2000)Google Scholar
- 2.Harb, H., Chen, L.: Highlights detection in sports videos based on audio analysis. In: Proc. of CBMI 2003 (2003)Google Scholar
- 6.Klein, L.A.: Sensor and Data Fusion. SPIE Press, San Jose (2004)Google Scholar