Journal of Computer Science and Technology

, Volume 28, Issue 5, pp 788–796 | Cite as

A Novel Web Video Event Mining Framework with the Integration of Correlation and Co-Occurrence Information

  • Cheng-De Zhang
  • Xiao Wu
  • Mei-Ling Shyu
  • Qiang Peng
Regular Paper


The massive web videos prompt an imperative demand on efficiently grasping the major events. However, the distinct characteristics of web videos, such as the limited number of features, the noisy text information, and the unavoidable error in near-duplicate keyframes (NDKs) detection, make web video event mining a challenging task. In this paper, we propose a novel four-stage framework to improve the performance of web video event mining. Data preprocessing is the first stage. Multiple Correspondence Analysis (MCA) is then applied to explore the correlation between terms and classes, targeting for bridging the gap between NDKs and high-level semantic concepts. Next, co-occurrence information is used to detect the similarity between NDKs and classes using the NDK-within-video information. Finally, both of them are integrated for web video event mining through negative NDK pruning and positive NDK enhancement. Moreover, both NDKs and terms with relatively low frequencies are treated as useful information in our experiments. Experimental results on large-scale web videos from YouTube demonstrate that the proposed framework outperforms several existing mining methods and obtains good results for web video event mining.


web video event mining multiple correspondence analysis co-occurrence near-duplicate keyframe 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Supplementary material

11390_2013_1377_MOESM1_ESM.docx (15 kb)
ESM 1 (DOCX 14 kb)


  1. 1.
    Zhang J, Fan X, Wang J et al. Keyword-propagation-based information enriching and noise removal for web news videos. In Proc. the 18th ACM International Conference on Knowledge Discovery and Data Mining, Aug. 2012, pp.561-569.Google Scholar
  2. 2.
    Chen K Y, Luesukprasert L, Chou S et al. Hot topic extraction based on timeline analysis and multidimensional sentence modeling. IEEE Transactions on Knowledge and Data Engineering, 2007, 19(8): 1016–1025.CrossRefGoogle Scholar
  3. 3.
    Fung G P C , Yu J X, Liu H et al. Time-dependent event hierarchy construction. In Proc. the 13th Int. Conf. Knowledge Discovery and Data Mining, Aug. 2007, pp.300-309.Google Scholar
  4. 4.
    Fung G P C, Yu J X, Yu P S et al. Parameter free bursty events detection in text streams. In Proc. the 31st Int. Conf. Very Large Data Bases, Aug. 2005, pp.181-192.Google Scholar
  5. 5.
    He Q, Chang K, Lim E P. Analyzing feature trajectories for event detection. In Proc. the 30th ACM Int. Conf. Research and Develop. in Inform. Retrieval, Aug. 2007, pp.207-214.Google Scholar
  6. 6.
    Wang X, Zhai C, Hu X et al. Mining correlated bursty topic patterns from coordinated text streams. In Proc. the 13th ACM International Conference on Knowledge Discovery and Data Mining, Aug. 2007, pp.784-793.Google Scholar
  7. 7.
    Yao J, Cui B, Huang Y et al. Bursty event detection from collaborative tags. World Wide Web, 2012, 15(2): 171–195.CrossRefGoogle Scholar
  8. 8.
    Tan S, Tan H K, Ngo C W. Topical summarization of web videos by visual-text time-dependent alignment. In Proc. the ACM Int. Conf. Multimedia, Oct. 2010, pp.1095-1098.Google Scholar
  9. 9.
    Wu X, Zhao W L, Ngo C W. Near-duplicate keyframe retrieval with visual keywords and semantic context. In Proc. the 6th ACM International Conference on Image and Video Retrieval, July 2007, pp.162-169.Google Scholar
  10. 10.
    Ke Y, Sukthankar R, Huston L. Efficient near-duplicate detection and sub-image retrieval. In Proc. the ACM Int. Conf. Multimedia, 2004, Vol.4, pp.869-876.Google Scholar
  11. 11.
    Ngo C W, Zhao W L, Jiang Y G. Fast tracking of nearduplicate keyframes in broadcast domain with transitivity propagation. In Proc. the 14th ACM International Conference on Multimedia, Oct. 2006, pp.845-854.Google Scholar
  12. 12.
    Zhang D Q, Chang S F. Detecting image near-duplicate by stochastic attributed relational graph matching with learning. In Proc. the 12th ACM International Conference on Multimedia, Oct. 2004, pp.877-884.Google Scholar
  13. 13.
    Wu X, Ngo C W, Hauptmann A G. Multimodal news story clustering with pairwise visual near-duplicate constraint. IEEE Transactions on Multimedia, 2008, 10(2): 188–199.CrossRefGoogle Scholar
  14. 14.
    Wu X, Ngo C W, Li Q. Threading and autodocumenting news videos: A promising solution to rapidly browse news topics. IEEE Signal Processing Magazine, 2006, 23(2): 59–68.CrossRefGoogle Scholar
  15. 15.
    Martinez-Gil J, Aldana-Montes J. KnoE: A web mining tool to validate previously discovered semantic correspondences. Journal of Computer Science and Technology, 2012, 27(6): 1222–1232.CrossRefGoogle Scholar
  16. 16.
    Lu B, Wang G R, Yuan Y. A novel approach towards large scale cross-media retrieval. Journal of Computer Science and Technology, 2012, 27(6): 1140–1149.CrossRefGoogle Scholar
  17. 17.
    Feng B L, Cao J, Bao X G et al. Graph-based multi-space semantic correlation propagation for video retrieval. The Visual Computer, 2011, 27(1): 21–34.CrossRefGoogle Scholar
  18. 18.
    Hsu W H, Chang S F. Topic tracking across broadcast news videos with visual duplicates and semantic concepts. In Proc. the 2006 IEEE International Conference on Image Processing, Oct. 2006, pp.141-144.Google Scholar
  19. 19.
    Liu D T, Shyu M L, Chen C et al. Within and between shot information utilisation in video key frame extraction. Journal of Information & Knowledge Management, 2011, 10(3): 247–259.CrossRefGoogle Scholar
  20. 20.
    Meng T, Shyu M L. Leveraging concept association network for multimedia rare concept mining and retrieval. In Proc. the 2012 IEEE International Conference on Multimedia & Expo, July 2012, pp.860-865.Google Scholar
  21. 21.
    Cao J, Ngo C W, Zhang Y D et al. Tracking web video topics: Discovery, visualization, and monitoring. IEEE Trans. Circuits and Systems for Video Technology, 2011, 21(12): 1835–1846.CrossRefGoogle Scholar
  22. 22.
    Duygulu P, Pan J Y, Forsyth D A. Towards autodocumentary: Tracking the evolution of news stories. In Proc. the 12th ACM Int. Conf. Multimedia, Oct. 2004, pp.820-827.Google Scholar
  23. 23.
    Zhai Y, Shah M. Tracking news stories across different sources. In Proc. the 13th ACM International Conference on Multimedia, Nov. 2005, pp.2-10.Google Scholar
  24. 24.
    Liu L, Sun L, Rui Y et al. Web video topic discovery and tracking via bipartite graph reinforcement model. In Proc. of the 17th ACM International Conference on World Wide Web, Apr. 2008, pp.1009-1018.Google Scholar
  25. 25.
    Wu X, Lu Y J, Peng Q et al. Mining event structures from web videos. IEEE Multimedia, 2011, 18(1): 38–51.CrossRefGoogle Scholar
  26. 26.
    Hu S M, Chen T, Xu K et al. Internet visual media processing: A survey with graphics and vision applications. The Visual Computer, 2013, 29(5): 393–405.CrossRefGoogle Scholar
  27. 27.
    Parry M L, Legg P A, Chung D H et al. Hierarchical event selection for video storyboards with a case study on snooker video visualization. IEEE Transactions on Visualization and Computer Graphics, 2011, 17(12): 1747–1756.CrossRefGoogle Scholar
  28. 28.
    Lin L, Ravitz G, Shyu M L et al. Correlation-based video semantic concept detection using multiple correspondence analysis. In Proc. the 10th IEEE International Symposium on Multimedia, Dec. 2008, pp.316-321.Google Scholar
  29. 29.
    Salkind N J. Encyclopedia of Measurement and Statistics. SAGA Publications, Inc., 2006.Google Scholar
  30. 30.
    Kennedy L S, Naaman M. Generating diverse and representative image search results for landmarks. In Proc. the 17th ACM International Conference on World Wide Web, Apr. 2008, pp.297-306.Google Scholar
  31. 31.
    Zhu Q S, Lin L, Shyu M L et al. Utilizing context information to enhance content-based image classification. International Journal of Multimedia Data Engineering and Management, 2011, 2(3): 34–51.CrossRefGoogle Scholar
  32. 32.
    Lin L, Chen C, Shyu M L et al. Weighted subspace filtering and ranking algorithms for video concept retrieval. IEEE Multimedia, 2011, 18(3): 32–43.CrossRefGoogle Scholar
  33. 33.
    Lowe D G. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004, 60(2): 91–110.CrossRefGoogle Scholar
  34. 34.
    Zhao W L, Wu X, Ngo C W. On the annotation of web videos by efficient near-duplicate search. IEEE Transactions on Multimedia, 2010, 12(5): 448–461.CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media New York & Science Press, China 2013

Authors and Affiliations

  • Cheng-De Zhang
    • 1
  • Xiao Wu
    • 1
  • Mei-Ling Shyu
    • 2
  • Qiang Peng
    • 1
  1. 1.School of Information Science and TechnologySouthwest Jiaotong UniversityChengduChina
  2. 2.Department of Electrical and Computer EngineeringUniversity of MiamiCoral GablesUSA

Personalised recommendations