Abstract
In this article the effective method of a single commercial extracting from a advertising block and its recognition using only the audio signal is presented. Proposed algorithm uses a multidimensional orthogonal audio signal representation for a track parametrization. Simulation results for poor commercial audio signal recording conditions and comparison with the known methods are presented. The proposed solution gives a recognition at the level of 98%. This is the result better than the popular methods based on spectral analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kay, S.M.: Modern Spectral Estimation. Prentice Hall, Englewood Cliffs (1988)
Rabiner, L.: Fundamentals of Speech Recognition. Prentice Hall PTR, Englewood Cliffs (1993)
Lienhart, R., Kuhmunch, C., Euelsberg, W.: On the detection and recognition of television commercials. In: Proc. IEEE Conf. on Multimedia Computing and Systems, Ottawa, Canada, pp. 509–516 (June 1997)
Zabih, R., Miller, J., Mai, K.: A feature-based algorithm for detecting and classifying scene breaks. In: ACM Conference on Multimedia, San Francisco, California (November 1995)
Biernacki, P., Zarzycki, J.: Multidimensional Nonlinear Noise-Cancelling Filters of the Volterra-Wiener Class. In: Proc. 2-Nd Int. Workshop on Multidimensional (nD) Systems (NDS-2000), pp. 255–261. Inst. of Control and Comp. Eng. TU of Zielona Gora Press, Czocha Castle (2000)
Biernacki, P., Zarzycki, J.: Orthogonal Schur-Type Solution of the Nonlinear Noise-Cancelling Problem. In: Proc. Int. Conf. On Signals and Electronic Systems (ICSES 2000), Ustron, pp. 337–342 (2000)
Lee, D.T.L., Morf, M., Friedlander, B.: Recursive Least-Squares Ladder Estimation Algorithms. IEEE Trans. on CAS 28, 467–481 (1981)
Schetzen, S.: The Voltera & Wiener Theories of nonlinear systems. John Wiley & Sons, New York (1980)
Haitsma, J., Kalker, T., Oostveen, J.: Robust audio hashing for content identification. In: Proc. of the Content-Based Multimedia Indexing, Firenze, Italy (September 2001)
Morgan, N., Bourlard, H., Hermansky, H.: Automatic Speech Recognition: An Auditory Perspective. In: Greenberg, S., Ainsworth, W.A. (eds.) Speech Processing in the Auditory System, p. 315. Springer, Heidelberg (2004) ISBN 9780387005904
Paliwal, K.K.: Spectral subband centroid features for speech recognition. In: Proc. IEEE ICASSP, pp. 617–620 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Biernacki, P. (2010). Intelligent System for Commercial Block Recognition Using Audio Signal Only. In: Setchi, R., Jordanov, I., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2010. Lecture Notes in Computer Science(), vol 6276. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15387-7_40
Download citation
DOI: https://doi.org/10.1007/978-3-642-15387-7_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15386-0
Online ISBN: 978-3-642-15387-7
eBook Packages: Computer ScienceComputer Science (R0)