Associating Cooking Video Segments with Preparation Steps

  • Koichi Miura
  • Reiko Hamada
  • Ichiro Ide
  • Shuichi Sakai
  • Hidehiko Tanaka
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2728)

Abstract

We aim to integrate television cooking videos with their corresponding cookbooks. A cookbook makes it easy to browse through a cooking procedure, but actual cooking operations are difficult to understand from written explanations alone. A video, on the other hand, conveys visual information that text cannot express sufficiently, but it does not allow easy random access to the steps of the procedure. We expect that integrating the two, by linking the preparation steps (text) in a cookbook to video segments, will compensate for the drawbacks of each medium. In this work, we propose a method that associates video segments with the preparation steps in a supplementary cookbook by combining video structure analysis with text-based keyword matching. An experiment showed high accuracy in the association evaluated per video segment, i.e., in annotating the video.
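
The text-based keyword matching mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not describe the actual matching procedure, so everything below is an assumption made for illustration: each video segment is assumed to carry some text (for example a caption or transcript), the function names `keywords` and `associate` are hypothetical, and the score is a plain keyword-overlap count. It omits video structure analysis entirely.

```python
import re

# Assumed, illustrative stopword list; not taken from the paper.
STOPWORDS = {"the", "and"}

def keywords(text):
    """Return lowercase word tokens longer than two characters, minus stopwords."""
    words = re.findall(r"[a-z]+", text.lower())
    return {w for w in words if len(w) > 2 and w not in STOPWORDS}

def associate(segments, steps):
    """For each segment text, return the index of the step sharing the most keywords.

    A segment that shares no keyword with any step maps to None.
    """
    step_keys = [keywords(s) for s in steps]
    result = []
    for seg in segments:
        seg_keys = keywords(seg)
        scores = [len(seg_keys & sk) for sk in step_keys]
        best = max(range(len(scores)), key=scores.__getitem__) if scores else None
        result.append(best if best is not None and scores[best] > 0 else None)
    return result

# Hypothetical cookbook steps and segment texts.
steps = [
    "Chop the onion and carrot.",
    "Fry the onion in butter.",
    "Add water and simmer the carrot.",
]
segments = [
    "first we chop the onion finely",
    "now melt the butter and fry",
    "thank you for watching",
]
print(associate(segments, steps))  # [0, 1, None]
```

In this sketch a segment with no matching keyword is left unassociated rather than forced onto a step, which is one possible design choice and not necessarily the one used in the paper.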

Keywords

Face Region · Preparation Step · Video Segment · Cooking Procedure · Conceptual Dictionary
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Koichi Miura (1)
  • Reiko Hamada (1)
  • Ichiro Ide (2)
  • Shuichi Sakai (1)
  • Hidehiko Tanaka (1)

  1. The University of Tokyo, Tokyo, Japan
  2. National Institute of Informatics, Tokyo, Japan
