Abstract
We present several novel techniques to summarize the high-level behavior in surveillance video. Our proposed methods can employ either optical flow or trajectories as input, and incorporate spatial and temporal information together, which improve upon existing approaches for summarization. To begin, we extract common pathway regions by performing graph-based clustering on similarity matrices describing the relationships between location/orientation states. We then employ the activities along the pathway regions to extract the aggregate behavioral patterns throughout scenes. We show how our summarization methods can be applied to detect anomalies, retrieve video clips of interest, and generate adaptive-speed summary videos. We examine our approaches on multiple complex urban scenes and present experimental results.
Similar content being viewed by others
References
Cheung, V., Frey, B.J., Jojic, N.: Video epitomes. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2005)
Grauman, K., Darrell, T.: The pyramid match kernel: discriminative classification with sets of image features. In: Proceedings of IEEE International Conferenc on Computer Vision (2005)
Hanjalic, A., Zhang, H.: An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis. IEEE TCSVT 9(8), 1280–1289 (1999)
Hferlin, B., Hferlin, M., Weiskopf, D., Heidemann, G.: (2010) Information-based adaptive fast-forward for visual surveillance. Multimedia Tools Appl. 55(1), 1–24
Hospedales, T., Gong, S., Xiang, T.: A markov clustering topic model for mining behaviour in video. In: Proceedings of the IEEE International Conference on Computer Vision (2009)
Jojic, N., Frey, B.J., Kannan, A.: Epitomic analysis of appearance and shape. In: Proceedings of IEEE International Conference on Computer Vision (2003)
Kang, H.W., Chen, X.Q., Matsushita, Y., Tang, X.: Space-time video montage. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2006)
Kasamwattanarote, S., Cooharojananone, N., Satoh, S., Lipikorn, R.: Real time tunnel based video summarization using direct shift collision detection. In: Advances in Multimedia Information Processing—PCM 2010, vol 6297, pp 136–147 (2010)
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2006)
Leung, Y., Zhang, J.S., Xu, Z.B.: Clustering by scale-space filtering. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1396–1410 (2000)
Li, J., Gong, S., Xiang, T.: Scene segmentation for behaviour correlation. In: Proceedings of the European Conference on Computer Vision (2008)
Li, Z., Ishwar, P., Konrad, J.: Video condensation by ribbon carving. IEEE Trans. Image Proc. 18, 2572–2583 (2009)
Luxburg, U.: A tutorial on spectral clustering. Stat. Comput. 17, 395–416 (2007)
Makris, D., Ellis, T.: Path detection in video surveillance. Image Vis. Comput. 20, 895–903 (2002)
Petrovic, N., Jojic, N.: Adaptive video fast forward. Multimedia Tools Appl. 26(2), 327–344 (2005)
Pop, I., Scuturici, M., Miguet, S.: Common motion map based on codebooks. In: 5th International Symposium, ISVC 2009, pp. 1181–1190. Las Vegas, NV, USA (2009)
Pritch, Y., Rav-Acha, A.: Nonchronological video synopsis and indexing. IEEE Trans. Pattern Anal. Mach. Intell. 30, 1971–1984 (2008)
Pritch, Y., Ratovich, S., Hendel, A., Peleg, S.: Clustered synopsis of surveillance video. Advanced Video and Signal Based Surveillance (2009)
Rav-Acha, A., Pritch, Y., Peleg, S.: Making a long video short: Dynamic video synopsis. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2006)
Ren, X., Malik, J.: Learning a classification model for segmentation. In: Proceedings of the IEEE International Conference on Computer Vision (2003)
Rodriguez, M.: CRAM: compact representation of actions in movies. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2010)
Saleemi, I., Shafique, K., Shah, M.: Probabilistic modeling of scene dynamics for applications in visual surveillance. IEEE Trans. Pattern Anal. Mach. Intell. 31(8), 1472–1484 (2009)
Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Trans. Pattern. Anal. Mach. Intell. 22(8), 888–905 (2000)
Shi, J., Tomasi, C.: Good features to track. In: Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition (1994)
Simakov, D., Caspi, Y., Shechtman, E., Irani, M.: Summarizing visual data using bidirectional similarity. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2008)
Stauffer, C., Grimson, W.E.L.: Learning patterns of activity using real-time tracking. IEEE Trans. Pattern Anal. Mach. Intell. 22(8), 747–767 (2000)
Streib, K., Davis, J.W.: Improving graph-based clustering via Ripley’s K-function and local connection merging. In: Review in process, technical report pending (2012)
Streib, K., Davis, J.W.: Extracting pathlets from weak tracking data. Advanced Video and Signal Based Surveillance (2010)
Wang, X., Tieu, K., Grimson, W.E.L.: Learning semantic scene models by trajectory analysis. In: Proceedings of the European Conference on Computer Vision (2006)
Wang, X., Ma, X., Grimson, E.: Unsupervised activity perception in crowded and complicated scenes using hierarchical Bayesian models. IEEE Trans. Pattern. Anal. Mach. Intell. 31(3), 539–555 (2009)
Wilson, R., Spann, M.: A new approach to clustering. Pattern Recognit. 23(12), 1413–1425 (1990)
Wang, X., Ma, K.T., Ng, W.G., Grimson, W.E.L.: Trajectory analysis and semantic region modeling using a nonparametric Bayesian model. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2008)
Yang, Y., Liu, J., Shah, M.: Video scene understanding using multi-scale analysis. In: Proceedings of the IEEE International Conference on Computer Vision (2009)
Zhu, X., Wu, X., Fan, J.: Exploring video content structure for hierarchical summarization. Multimedia Syst. 10(3), 98–115 (2004)
Acknowledgments
This research was supported in part by the Air Force Research Laboratories under contract No. FA8650-07-D-1220.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Streib, K., Davis, J.W. Summarizing high-level scene behavior. Machine Vision and Applications 25, 229–244 (2014). https://doi.org/10.1007/s00138-013-0573-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00138-013-0573-2