Abstract
In this paper, we consider the issue of structuring large TV streams. More precisely, we focus on the labeling problem: once segments have been extracted from the stream, the problem is to automatically label them according to their type (eg. programs vs. commercial breaks). In the literature, several machine learning techniques have been proposed to solve this problem: Inductive Logic Programming, numeric classifiers like SVM or decision trees... In this paper, we assimilate the problem of labeling segments to the problem of labeling a sequence of data. We propose to use a very effective approach based on another classifier: the Conditional Random Fields (CRF), a tool which has proved useful to handle sequential data in other domains. We report different experiments, conducted on some manually and automatically segmented data, with different label granularities and different features to describe segments. We demonstrate that this approach is more robust than other classification methods, in particular when it uses the neighbouring context of a segment to find its type. Moreover, we highlight that the segmentation and the choice of features to describe segments are two crucial points in the labeling process.
Chapter PDF
References
Ibrahim, Z.A.A., Gros, P.: Tv stream structuring. ISRN Signal Processing (2011)
Jousse, F.: Transformation d’arbres XML avec des modèles probabilistes pour l’annotation. PhD thesis, University of Lille III, France (2007)
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proc. of the Int. Conf. on Machine Learning (ICML), pp. 282–289 (July 2001)
Manson, G., Berrani, S.-A.: An inductive logic programming-based approach for TV stream segment classification. In: Proc. of the IEEE Int. Symp. on Multimedia (December 2008)
Balvet, A., Laurence, G., Rozenknop, A., Moreau, E., Tellier, I., Poibeau, T.: Annotation fonctionnelle de corpus arborés avec des champs aléatoires conditionnels. In: TALN
Naturel, X.: Structuration automatique de flux vidéos de télévision. PhD thesis, University of Rennes 1, France (2007)
Naturel, X., Gravier, G., Gros, P.: Fast Structuring of Large Television Streams Using Program Guides. In: Marchand-Maillet, S., Bruno, E., Nürnberger, A., Detyniecki, M. (eds.) AMR 2006. LNCS, vol. 4398, pp. 222–231. Springer, Heidelberg (2007)
Pinto, D., McCallum, A., Wei, X., Croft, W.: Table extraction using Conditional Random Fields. In: Proc. of the ACM SIGIR, pp. 235–242 (July 2003)
Poli, J.-P.: An automatic television stream structuring system for television archives holders. Multimedia systems 14(5), 255–275 (2008)
Quattoni, A., Collins, M., Darrell, T.: Conditional random fields for object recognition. In: Neural Information Processing Systems (NIPS) (December 2004)
Sha, F., Pereira, F.: Shallow parsing with conditional random fields. In: Proc. of Human Language Technology - North American Chapter of the Association for Computational Linguistics (May 2003)
Shi, T., Horvath, S.: Unsupervised learning with random forest predictors. Journal of Computational and Graphical Statistics (2006)
Tabei, Y., Asai, K.: A local multiple alignment method for detection of non coding RNA sequences. Bioinformatics 25, 1498–1505 (2009)
Weinman, J., Hanson, A., McCallum, A.: Sign detection in natural images with Conditional Random Fields. In: Proc. of the IEEE International Workshop on Machine Learning for Signal Processing (September 2004)
Yuan, J., Li, J., Zhang, B.: Gradual transition detection with conditional random fields. In: Proc. of the 15th International Conference on Multimedia, pp. 277–280. ACM (September 2007)
Zeng, Z., Zhang, S., Zheng, H., Yang, W.: Program segmentation in a television stream using acoustic cues. In: Proc. of the International Conference on Audio, Language and Image Processing, pp. 748–752 (July 2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Martienne, E., Claveau, V., Gros, P. (2012). Labeling TV Stream Segments with Conditional Random Fields. In: Salerno, E., Çetin, A.E., Salvetti, O. (eds) Computational Intelligence for Multimedia Understanding. MUSCLE 2011. Lecture Notes in Computer Science, vol 7252. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32436-9_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-32436-9_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32435-2
Online ISBN: 978-3-642-32436-9
eBook Packages: Computer ScienceComputer Science (R0)