NII-UIT Browser: A Multimodal Video Search System

  • Thanh Duc Ngo
  • Vinh-Tiep Nguyen
  • Vu Hoang Nguyen
  • Duy-Dinh Le
  • Duc Anh Duong
  • Shin’ichi Satoh
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8936)

Abstract

We introduce an interactive system for searching a known scene in a video database. The key idea is to enable multimodal search. As the retrieved database is getting larger, using individual modals may not be powerful enough to discriminate a scene with other near duplicates. In our system, a known scene can be described and searched by its visual cues or audio genres. Templates are given for users to rapidly and exactly describe the scene. Moreover, search results are updated instantly as users change the description. As a result, users can generate a large number of possible queries to find the matched scene in a short time.

Keywords

Multimodal Approach Know Item Search Interactive Tool 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Lokoč, J., Blažek, A., Skopal, T.: Signatured-based Video Browser. In: Gurrin, C., Hopfgartner, F., Hurst, W., Johansen, H., Lee, H., O’Connor, N. (eds.) MMM 2014, Part II. LNCS, vol. 8326, pp. 415–418. Springer, Heidelberg (2014)CrossRefGoogle Scholar
  2. 2.
    Ngo, T.D., Nguyen, V.H., Lam, V., Phan, S., Le, D.-D., Duong, D.A., Satoh, S.: NII-UIT: A Tool for Known Item Search by Sequential Pattern Filtering. In: Gurrin, C., Hopfgartner, F., Hurst, W., Johansen, H., Lee, H., O’Connor, N. (eds.) MMM 2014, Part II. LNCS, vol. 8326, pp. 419–422. Springer, Heidelberg (2014)CrossRefGoogle Scholar
  3. 3.
    Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object Detection with Discriminatively Trained Part Based Models. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 1627–1645 (2010)Google Scholar
  4. 4.
    Zhu, X., Ramanan, D.: Face detection, pose estimation and landmark localization in the wild. In: Computer Vision and Pattern Recognition (CVPR), pp. 2879–2886 (2012)Google Scholar
  5. 5.
    Lee, C.H., Soong, F., Juang, B.H.: A segment model based approach to speech recognition. In: International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 501–541 (1988)Google Scholar
  6. 6.
    Lowe, D.G.: Distinctive image features from scale invariant keypoints. International Journal Computer Vision (IJCV), 91–110 (2004)Google Scholar
  7. 7.
    Thomas, D., Daniel, K., Herman, N.: Features for Image Retrieval: An Experimental Comparison. Journal Information Retrieval, 77–107 (2008)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Thanh Duc Ngo
    • 1
  • Vinh-Tiep Nguyen
    • 2
  • Vu Hoang Nguyen
    • 1
  • Duy-Dinh Le
    • 3
  • Duc Anh Duong
    • 1
  • Shin’ichi Satoh
    • 3
  1. 1.University of Information Technology - VNUHCMVietnam
  2. 2.University of Science - VNUHCMVietnam
  3. 3.National Institute of InformaticsJapan

Personalised recommendations