Multimedia Processing for Advanced Communications Services

  • B. Shahraray
  • R. Cox
  • B. Haskell
  • Y. LeCun
  • L. Rabiner
Conference paper

Abstract

The advent of digital multimedia communications has generated a growing need for powerful multimedia processing techniques to enable the generation of useful and intelligent communications services. Multimedia processing techniques play a significant role in creating communications services by: 1) enabling efficient transmission and storage of multimedia information through media compression techniques, 2) creating effective user interfaces through media conversion, understanding, and dialogue systems, and 3) providing intelligent information searching and browsing mechanisms based on media processing and understanding techniques. In this paper, we present a brief overview of media compression techniques and standards, touch upon several media processing techniques, and then give a brief overview of three prototype services based on these techniques.

Keywords

Speech Recognition; Communication Service; Video Program; Optical Character Recognition; Joint Photographic Expert Group
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag London Limited 1999

Authors and Affiliations

  • B. Shahraray (1)
  • R. Cox (1)
  • B. Haskell (1)
  • Y. LeCun (1)
  • L. Rabiner (1)

  1. AT&T Labs - Research, USA
