Associating Cooking Video Segments with Preparation Steps
We aim to integrate television cooking videos with their corresponding cookbooks. A cookbook makes it easy to browse through a cooking procedure, but actual cooking operations are difficult to understand from a written explanation alone. A video, on the other hand, conveys visual information that text cannot express sufficiently, but it does not allow the procedure to be browsed freely. We expect that integrating the two, by linking preparation steps (text) in a cookbook to video segments, will compensate for the drawbacks of each medium. In this work, we propose a method that associates video segments with preparation steps in a supplementary cookbook by combining video structure analysis and text-based keyword matching. An experiment showed high accuracy in the association measured per video segment, i.e., in annotating the video.
Keywords: Face Region, Preparation Step, Video Segment, Cooking Procedure, Conceptual Dictionary
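The abstract does not describe the matching procedure in detail, so the following is only a minimal sketch of what text-based keyword matching between video segments and preparation steps might look like. It assumes that each video segment comes with some text (for example, a closed-caption transcript) and scores segments against steps by plain word overlap. The function names `tokenize` and `associate`, the stopword list, and the tie-breaking rule are all hypothetical and are not taken from the paper; the video structure analysis part of the method is not modeled here.

```python
import re

# Hypothetical stopword list; a real system would use a fuller one.
STOPWORDS = frozenset({"the", "a", "an", "and", "to", "in", "of", "with", "is"})


def tokenize(text):
    """Lowercase a string and return its set of alphabetic tokens minus stopwords."""
    return set(re.findall(r"[a-z]+", text.lower())) - STOPWORDS


def associate(segment_texts, step_texts):
    """Map each segment index to the index of the step sharing the most keywords.

    A segment that shares no keyword with any step is mapped to None.
    Ties are resolved in favor of the earliest step (an arbitrary choice).
    """
    step_keywords = [tokenize(step) for step in step_texts]
    result = {}
    for i, segment in enumerate(segment_texts):
        if not step_keywords:
            result[i] = None
            continue
        segment_keywords = tokenize(segment)
        scores = [len(segment_keywords & keywords) for keywords in step_keywords]
        best = max(range(len(scores)), key=lambda j: scores[j])
        result[i] = best if scores[best] > 0 else None
    return result


if __name__ == "__main__":
    steps = [
        "Chop the onion and carrot.",
        "Fry the onion in butter.",
        "Add water and simmer.",
    ]
    segments = [
        "now we chop the onion finely",
        "melt butter and fry",
        "here is the host smiling",
        "pour in water simmer gently",
    ]
    print(associate(segments, steps))
```

In this toy input, the third segment shares no keyword with any step and is therefore left unassociated, which is one simple way to handle segments such as shots of the host that do not correspond to a preparation step.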