Automated Lecture Services

Zhang, Cha; Rui, Yong

doi:10.1007/978-3-540-78502-6_14

Cha Zhang⁴ &
Yong Rui⁴

Part of the book series: Studies in Computational Intelligence ((SCI,volume 120))

501 Accesses
1 Citations

Summary

Increasingly popular, lectures are given before a live audience, while simultaneously being viewed remotely and recorded for subsequent on-demand viewing over the Internet. Traditionally, it is very expensive to offer such services due to the high labor cost involved. In this chapter, we survey existing approaches for providing automated lecture services. In particular, we examine two major challenges in providing such services, namely, how to capture, analyze and render the lecture content automatically, and how to provide live/on-demand lecture viewing/browsing experience with an automated end-to-end system. The chapter is concluded by a list of future research directions, hoping to inspire even more work on this interesting and highly useful topic.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

He, L., Grudin, J., Gupta, A.: Designing presentations for on-demand viewing. ACM Conference on Computer Supported Cooperative Work (CSCW) (2001)
Google Scholar
Steinmetz, A., Kienzle, M.: The e-seminar lecture recording and distribution system. Proceedings of the SPIE Multimedia Computing and Networking 4312 (2001)
Google Scholar
Allen, I.E., Seaman, J.: Making the grade: Online education in the united states, 2006. Sloan Consortium Surveys (2006)
Google Scholar
Bianchi, M.: Autoauditorium: A fully automatic, multi-camera system to televise auditorium presentations. Proceedings of the Joint DARPA/NIST Smart Spaces Technology Workshop (1998)
Google Scholar
Bianchi, M.: Automatic video production of lectures using an intelligent and aware environment. Proceedings of the 3rd International Conference on Mobile and Ubiquitous Multimedia (2004) 117-123
Google Scholar
Abowd, G.: Classroom 2000: An experiment with the instrumentation of a living educational environment. IBM Systems Journal 38 (4) (1999) 508-530
Article Google Scholar
Mukhopadhyay, S., Smith, B.: Passive capture and structuring of lectures. Proceedings of the ACM Multimedia (1999) 477-487
Google Scholar
Rowe, L.A., Pletcher, P., Harley, D., Lawrence, S.: BIBS: A lecture webcasting system. Technical report, Berkeley Multimedia Research Center, U.C. Berkeley (2001)
Google Scholar
Rui, Y., He, L., Gupta, A., Liu, Q.: Building an intelligent camera management system. Proceedings of the ACM Multimedia (2001) 2-11
Google Scholar
Rui, Y., Gupta, A., Grudin, J., He, L.: Automating lecture capture and broadcast: Technology and videography. ACM Multimedia Systems Journal 10 (1) (2004) 3-15
Article Google Scholar
Zhang, C., Rui, Y., Crawford, J., He, L.: An automated end-to-end lecture capture and broadcasting system. Technical report MSR-TR-2005-128, Microsoft Research (2005)
Google Scholar
Baecker, R.: A principled design for scalable internet visual communications with rich media, interactivity and structured archives. Proceedings of the Centre for Advanced Studies on Collaborative Research (2003)
Google Scholar
Knapp, C., Carter, G.: The generalized correlation method for estimation of time delay. IEEE Transactions on Acoustics, Speech, and Signal Processing ASSP- 24 (4) (1976) 320-327
Article Google Scholar
Brandstein, M., Silverman, H.: A practical methodology for speech localization with microphone arrays. Computer, Speech, and Language 11 (2) (1997) 91-126
Article Google Scholar
Cox, H., Zeskind, R.M., Owen, M.M.: Robust adaptive beamforming. IEEE Transactions on Acoustics, Speech and Signal Processing ASSP-35 (10) (1987) 1365-1376
Article Google Scholar
Veen, B.D.V., Buckley, K.M.: Beamforming: a versatile approach to spatial filtering. IEEE Signal Processing Magazine (1988) 4-24
Google Scholar
Strobel, N., Spors, S., Rabenstein, R.: Joint audio-video object localization and tracking. IEEE Signal Processing Magazine 18 (2001) 22-31
Article Google Scholar
Mungamuru, B., Aarabi, P.: Enhanced sound localization. IEEE Transactions on Systems, Man and Cybernetics - Part B: Cybernetics 34 (13) (2004) 1526-1540
Article Google Scholar
Segura, C., Canton-Ferrer, C., Abad, A., Casas, J.R., Hernando, J.: Multimodal head orientation towards attention tracking in smartrooms. Proceedings of ICASSP (2007)
Google Scholar
Wang, H., Chu, P.: Voice source localization for automatic camera pointing system in videoconferencing. Proceedings of IEEE ICASSP (1997)
Google Scholar
Weng, J., Guentchev, K.Y.: Three-dimensional sound localization from a compact non-coplanar array of microphones using tree-based learning. Journal of the Acoustical Society of America 110 (1) (2001) 310-323
Article Google Scholar
Zhang, C., Zhang, Z., Florêncio, D.: Maximum likelihood sound source localization for multiple directional microphones. Proceedings of ICASSP (2007)
Google Scholar
Griffiths, L.J., Jim, C.W.: An alternative approach to linearly constrained adaptive beamforming. IEEE Transactions on Anttenas and Propagation AP-30 (1) (1982) 27-34
Article Google Scholar
Hoshuyama, O., Sugiyama, A., Hirano, A.: A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters. IEEE Transactions on Signal Processing 47 (10) (1999) 2677-2684
Article Google Scholar
El-Keyi, A., Kirubarajan, T., Gershman, A.: Robust adaptive beamforming based on the kalman filter. IEEE Transactions on Signal Processing 53 (8) (2005) 3032-3041
Article MathSciNet Google Scholar
Anderson, R., Anderson, R., Chung, O., Davis, K.M., Davis, P., Prince, C., Razmov, V., Simon, B.: Classroom presenter - A classroom interaction system for active and collaborative learning. Proceedings of WIPTE (2006)
Google Scholar
Amir, A., Ashour, G., Srinivasan, S.: Automatic generation of conference video proceedings. Journal of Visual Communication and Image Representation 15 (2004) 467-488
Article Google Scholar
Rowe, L.A., Casalaina, V.: Capturing conference presentations. IEEE Multi- media 13 (4) (2006)
Google Scholar
Tang, J., Issacs, E.: Why do users like video? Studies of multimedia-supported collaboration. Computer Supported Cooperative Work: An International Journal 1 (3) (1993) 163-196
Article Google Scholar
Kariya, S.: Online education expands and evolves. IEEE Spectrum (2003)
Google Scholar
Liu, T., Kender, J.R.: Lecture videos for e-learning: Current research and challenges. Proceedings of IEEE International Workshop on Multimedia Contentbased Analysis and Retrieval (2004)
Google Scholar
Yokoi, T., Fujiyoshi, H.: Virtual camerawork for generating lecture video from high resolution images. Proceedings of ICME (2004)
Google Scholar
Zhang, C., Rui, Y., He, L., Wallick, M.: Hybrid speaker tracking in an automated lecture room. Proceedings of ICME (2005)
Google Scholar
Comaniciu, D., Ramesh, V., Meer, P.: Kernel-based object tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence 25 (5) (2003) 564-577
Article Google Scholar
Isard, M., Blake, A.: Condensation - Conditional density propagation for visual tracking. International Journal of Computer Vision 29 (1) (1998) 5-28
Article Google Scholar
Cruz, G., Hill, R.: Capturing and playing multimedia events with streams. Proceedings of the ACM Multimedia (1994) 193-200
Google Scholar
Mantey, P., Richards, J.: Using large shared displays to create a collaborative classroom learning environment. Workshop on Advanced Collaborative Environments (2005)
Google Scholar
Matsuo, Y., Amano, M., Uehara, K.: Mining video editing rules in video streams. Proceedings of the ACM Multimedia (2002) 255-258
Google Scholar
Kelly, P.H., Katkere, A., Kuramura, D.Y., Moezzi, S., Chatterjee, S., Jain, R.: An architecture for mulitple perspective interactive video. Proceedings of the ACM Multimedia (1995)
Google Scholar
Vazirgiannis, M., Boll, S.: Events in interactive multimedia applications: Modeling and implementation design. IEEE International Conference on Multimedia Computing and Systems (1997)
Google Scholar
Nahrstedt, K., Yu, B., Liang, J., Cui, Y.: Hourglass multimedia content and service composition framework for smart room environments. Pervasive and Mobile Computing 1 (1) (2005) 43-75
Article Google Scholar
Syeda-Mahmood, T.F., Ponceleon, D.: Learning video browsing behavior and its application in the generation of video previews. Proceedings of the ACM Multimedia (2001)
Google Scholar
Liu, Q., Kimber, D., Foote, J., Wilcox, L., Boreczky, J.: FlySPEC: A multi-user video camera system with hybrid human and automatic control. Proceedings of the ACM Multimedia (2002)
Google Scholar
Machnicki, E.: Virtual director: Automating a webcast. Proceedings of SPIE Multimedia Computing and Networking (2002)
Google Scholar
Yu, B., Nahrstedt, K.: AVPUC: Automatic video production with user customization. Multimecia Computing and Networking Conference (2005)
Google Scholar
Yu, B., Zhang, C., Rui, Y., Nahrstedt, K.: A three-layer virtual director model for supporting automated multi-site distributed education. Proceedings of ICME (2006)
Google Scholar
Onishi, M., Fukunaga, K.: Shooting the lecture scene using computer controlled cameras based on situation understanding and evaluation of video images. International Conference on Pattern Recognition (ICPR) (2004)
Google Scholar
Wang, F., Ngo, C.W., Pong, T.C.: Exploiting self-adaptive posture-based focus esitmation for lecture video editing. Proceedings of the ACM Multimedia (2005)
Google Scholar
Wang, F., Ngo, C.W., Pong, T.C.: Gesture tracking and recognition for video editing. International Conference on Pattern Recognition (2004)
Google Scholar
Canton-Ferrer, C., Casas, J.R., Pardàs, M.: Fusion of multiple viewpoint information towards 3d face robust orientation detection. Proceedings of ICIP (2005)
Google Scholar
Liu, T., Hjelsvold, R., Kender, J.R.: Analysis and enhancement of videos of electronic slide presentation. Proceedings of ICME (2002)
Google Scholar
Gleicher, M., Masanz, J.: Towards virtual videography. Proceedings of the ACM Multimedia (2000) 375-378
Google Scholar
Heng, W.J., Tian, Q.: Content enhancement for e-learning lecture video using foreground/background separation. IEEE Workshop on Multimedia Signal Processing (2002)
Google Scholar
He, L.W., Zhang, Z.: Real-time whiteboard capture and processing using a video camera for teleconferencing. Microsoft Research Technical report, MSRTR-2004-91 (2004)
Google Scholar
He, L., Sanocki, E., Gupta, A., Grudin, J.: Auto-summarization of audio-video presentations. Proceeding of the ACM Multimedia (1999)
Google Scholar
Liu, T., Kender, J.R.: Rule-based semantic summarization of instructional videos. Proceedings of ICIP (2002)
Google Scholar
Liu, T., Kender, J.R.: Semantic mosaic for indexing and compressing instruc- tional videos. Proceedings of ICIP (2002)
Google Scholar
Vinciarelli, A., Odobez, J.M.: Application of information retrieval technologies to presentation slides. IEEE Transactions on Multimedia 8 (5) (2006) 981-995
Article Google Scholar
Hain, T., Burget, L., Dines, J., Garau, G., Karafiat, M., Lincoln, M., McCowan, I., Moore, D., Wan, V., Ordelman, R., Renals, S.: The 2005 ami system for the transcription of speech in meetings. Proceedings of the Rich Transcription 2005 Spring (RT05s) Meeting Recognition Evaluation (2005)
Google Scholar
Vinciarelli, A., Bourlard, H.: Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations. Technical report, IDIAP-RR 06-56 (2006)
Google Scholar
Phung, D.Q., Venkatesh, S., Dorai, C.: High level segmentation of instructional videos based on content density. Proceedings of the ACM Multimedia (2002)
Google Scholar
Brotherton, J.A., Abowd, G.D.: Lessons learned from eclass: Assessing auto-mated capture and access in the classroom. ACM Transactions on Computer-Human Interaction 11 (2) (2004) 121-155
Article Google Scholar

Download references

Author information

Authors and Affiliations

Microsoft Research One Microsoft Way, Redmond, WA, 98052, USA
Cha Zhang & Yong Rui

Authors

Cha Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yong Rui
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics, University of Piraeus, Karaoli-Dimitriou Str. 80, 185 34, Piraeus, Greece
George A. Tsihrintzis
School of Electrical & Information Engineering, University of South Australia KES Centre, Mawson Lakes Campus, Adelaide, SA, 5095, Australia
Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Zhang, C., Rui, Y. (2008). Automated Lecture Services. In: Tsihrintzis, G.A., Jain, L.C. (eds) Multimedia Services in Intelligent Environments. Studies in Computational Intelligence, vol 120. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78502-6_14

Download citation

DOI: https://doi.org/10.1007/978-3-540-78502-6_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78491-3
Online ISBN: 978-3-540-78502-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics