Multimedia Processing for Advanced Communications Services
The advent of digital multimedia communications has generated a growing need for powerful multimedia processing techniques to enable the generation of useful and intelligent communications services. Multimedia processing techniques play a significant role in creating communications services by; 1) enabling efficient transmission and storage of multimedia information through media compression techniques, 2) creating effective user interfaces through media conversion, understanding, and dialogue systems, and 3) providing intelligent information searching and browsing mechanisms based on media processing and understanding techniques. In this paper we present a brief overview of media compression techniques and standards, touch upon several media processing techniques. Then we give a brief overview of three prototype services based on these techniques.
KeywordsSpeech Recognition Communication Service Video Program Optical Character Recognition Joint Photographic Expert Group
Unable to display preview. Download preview PDF.
- 1.W. B. Kleijn, and K. K. Paliwal, Ed., Speech Coding and Synthesis. Amsterdam, The Netherlands: Elsevier, 1995.Google Scholar
- 2.A. N. Netravali, and B. G. Haskell, Digital Pictures - Representation, Compression, and Standards. 2nd ed. New York, Plenum, 1995.Google Scholar
- 3.L. R. Rabiner, and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.Google Scholar
- 4.L. R. Rabiner, B. H. Juang, and C. H. Lee, ‘An overview of automatic speech recognition’ in Automatic Speech and Speaker Recognition, Advanced Topics, C. H. Lee, F. K. Soong, and K. K. Paliwal Eds. Norwell, MA: Kluwer, 1996.Google Scholar
- 5.M. Mohri, and M. Riley, ‘Weighted determinization and minimization for large vocabulary speech recognition’ Proc. Eurospeech, pp. 131–134, 1997.Google Scholar
- 7.B. Shahraray, ‘Scene change detection and content-based sampling of video sequences’ Proc. SPIE 2419, Digital Video Compression: Algorithms and Technologies, pp. 2–13, February 1995.Google Scholar
- 8.S. F. Chang, W. Chen, H. Meng, H. Sundaram, and D. Zhong, ‘An automated content-based video search system using visual cues’ ACM Multimedia Conference, November 1997, Seattle, WA. also Columbia University/CTR Technical Report, CTR-TR #478–97–12.Google Scholar
- 10.L. Bottou, P. G. Howard, and Y. Bengio, ‘The Z-coder Adaptive Bilinear Coder’ Proc. IEEE Data Compression Conference, pp. 13–22, Snowbird, UT, 1998.Google Scholar
- 11.B. Shahraray, and D. C. Gibbon, ‘Automatic generation of pictorial transcripts of video programs’ Proc. SPIE 2417, Multimedia Computing and Networking, pp. 512–518, February 1995.Google Scholar
- 12.M. R. Civanlar, G. L. Cash, R. Kollarits, B. Paul, C. Swain, B. G. Haskell, and D. Kapilow, ‘VideoTalks—A Comprehensive Multimedia Conferencing System’ to be published in IEEE Trans. on Multimedia.Google Scholar