Summary
Increasingly popular, lectures are given before a live audience, while simultaneously being viewed remotely and recorded for subsequent on-demand viewing over the Internet. Traditionally, it is very expensive to offer such services due to the high labor cost involved. In this chapter, we survey existing approaches for providing automated lecture services. In particular, we examine two major challenges in providing such services, namely, how to capture, analyze and render the lecture content automatically, and how to provide live/on-demand lecture viewing/browsing experience with an automated end-to-end system. The chapter is concluded by a list of future research directions, hoping to inspire even more work on this interesting and highly useful topic.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
 References
He, L., Grudin, J., Gupta, A.: Designing presentations for on-demand viewing. ACM Conference on Computer Supported Cooperative Work (CSCW) (2001)
Steinmetz, A., Kienzle, M.: The e-seminar lecture recording and distribution system. Proceedings of the SPIE Multimedia Computing and Networking 4312 (2001)
Allen, I.E., Seaman, J.: Making the grade: Online education in the united states, 2006. Sloan Consortium Surveys (2006)
Bianchi, M.: Autoauditorium: A fully automatic, multi-camera system to televise auditorium presentations. Proceedings of the Joint DARPA/NIST Smart Spaces Technology Workshop (1998)
Bianchi, M.: Automatic video production of lectures using an intelligent and aware environment. Proceedings of the 3rd International Conference on Mobile and Ubiquitous Multimedia (2004) 117-123
Abowd, G.: Classroom 2000: An experiment with the instrumentation of a living educational environment. IBM Systems Journal 38 (4) (1999) 508-530
Mukhopadhyay, S., Smith, B.: Passive capture and structuring of lectures. Proceedings of the ACM Multimedia (1999) 477-487
Rowe, L.A., Pletcher, P., Harley, D., Lawrence, S.: BIBS: A lecture webcasting system. Technical report, Berkeley Multimedia Research Center, U.C. Berkeley (2001)
Rui, Y., He, L., Gupta, A., Liu, Q.: Building an intelligent camera management system. Proceedings of the ACM Multimedia (2001) 2-11
Rui, Y., Gupta, A., Grudin, J., He, L.: Automating lecture capture and broadcast: Technology and videography. ACM Multimedia Systems Journal 10 (1) (2004) 3-15
Zhang, C., Rui, Y., Crawford, J., He, L.: An automated end-to-end lecture capture and broadcasting system. Technical report MSR-TR-2005-128, Microsoft Research (2005)
Baecker, R.: A principled design for scalable internet visual communications with rich media, interactivity and structured archives. Proceedings of the Centre for Advanced Studies on Collaborative Research (2003)
Knapp, C., Carter, G.: The generalized correlation method for estimation of time delay. IEEE Transactions on Acoustics, Speech, and Signal Processing ASSP- 24 (4) (1976) 320-327
Brandstein, M., Silverman, H.: A practical methodology for speech localization with microphone arrays. Computer, Speech, and Language 11 (2) (1997) 91-126
Cox, H., Zeskind, R.M., Owen, M.M.: Robust adaptive beamforming. IEEE Transactions on Acoustics, Speech and Signal Processing ASSP-35 (10) (1987) 1365-1376
Veen, B.D.V., Buckley, K.M.: Beamforming: a versatile approach to spatial filtering. IEEE Signal Processing Magazine (1988) 4-24
Strobel, N., Spors, S., Rabenstein, R.: Joint audio-video object localization and tracking. IEEE Signal Processing Magazine 18 (2001) 22-31
Mungamuru, B., Aarabi, P.: Enhanced sound localization. IEEE Transactions on Systems, Man and Cybernetics - Part B: Cybernetics 34 (13) (2004) 1526-1540
Segura, C., Canton-Ferrer, C., Abad, A., Casas, J.R., Hernando, J.: Multimodal head orientation towards attention tracking in smartrooms. Proceedings of ICASSP (2007)
Wang, H., Chu, P.: Voice source localization for automatic camera pointing system in videoconferencing. Proceedings of IEEE ICASSP (1997)
Weng, J., Guentchev, K.Y.: Three-dimensional sound localization from a compact non-coplanar array of microphones using tree-based learning. Journal of the Acoustical Society of America 110 (1) (2001) 310-323
Zhang, C., Zhang, Z., Florêncio, D.: Maximum likelihood sound source localization for multiple directional microphones. Proceedings of ICASSP (2007)
Griffiths, L.J., Jim, C.W.: An alternative approach to linearly constrained adaptive beamforming. IEEE Transactions on Anttenas and Propagation AP-30 (1) (1982) 27-34
Hoshuyama, O., Sugiyama, A., Hirano, A.: A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters. IEEE Transactions on Signal Processing 47 (10) (1999) 2677-2684
El-Keyi, A., Kirubarajan, T., Gershman, A.: Robust adaptive beamforming based on the kalman filter. IEEE Transactions on Signal Processing 53 (8) (2005) 3032-3041
Anderson, R., Anderson, R., Chung, O., Davis, K.M., Davis, P., Prince, C., Razmov, V., Simon, B.: Classroom presenter - A classroom interaction system for active and collaborative learning. Proceedings of WIPTE (2006)
Amir, A., Ashour, G., Srinivasan, S.: Automatic generation of conference video proceedings. Journal of Visual Communication and Image Representation 15 (2004) 467-488
Rowe, L.A., Casalaina, V.: Capturing conference presentations. IEEE Multi- media 13 (4) (2006)
Tang, J., Issacs, E.: Why do users like video? Studies of multimedia-supported collaboration. Computer Supported Cooperative Work: An International Journal 1 (3) (1993) 163-196
Kariya, S.: Online education expands and evolves. IEEE Spectrum (2003)
Liu, T., Kender, J.R.: Lecture videos for e-learning: Current research and challenges. Proceedings of IEEE International Workshop on Multimedia Contentbased Analysis and Retrieval (2004)
Yokoi, T., Fujiyoshi, H.: Virtual camerawork for generating lecture video from high resolution images. Proceedings of ICME (2004)
Zhang, C., Rui, Y., He, L., Wallick, M.: Hybrid speaker tracking in an automated lecture room. Proceedings of ICME (2005)
Comaniciu, D., Ramesh, V., Meer, P.: Kernel-based object tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence 25 (5) (2003) 564-577
Isard, M., Blake, A.: Condensation - Conditional density propagation for visual tracking. International Journal of Computer Vision 29 (1) (1998) 5-28
Cruz, G., Hill, R.: Capturing and playing multimedia events with streams. Proceedings of the ACM Multimedia (1994) 193-200
Mantey, P., Richards, J.: Using large shared displays to create a collaborative classroom learning environment. Workshop on Advanced Collaborative Environments (2005)
Matsuo, Y., Amano, M., Uehara, K.: Mining video editing rules in video streams. Proceedings of the ACM Multimedia (2002) 255-258
Kelly, P.H., Katkere, A., Kuramura, D.Y., Moezzi, S., Chatterjee, S., Jain, R.: An architecture for mulitple perspective interactive video. Proceedings of the ACM Multimedia (1995)
Vazirgiannis, M., Boll, S.: Events in interactive multimedia applications: Modeling and implementation design. IEEE International Conference on Multimedia Computing and Systems (1997)
Nahrstedt, K., Yu, B., Liang, J., Cui, Y.: Hourglass multimedia content and service composition framework for smart room environments. Pervasive and Mobile Computing 1 (1) (2005) 43-75
Syeda-Mahmood, T.F., Ponceleon, D.: Learning video browsing behavior and its application in the generation of video previews. Proceedings of the ACM Multimedia (2001)
Liu, Q., Kimber, D., Foote, J., Wilcox, L., Boreczky, J.: FlySPEC: A multi-user video camera system with hybrid human and automatic control. Proceedings of the ACM Multimedia (2002)
Machnicki, E.: Virtual director: Automating a webcast. Proceedings of SPIE Multimedia Computing and Networking (2002)
Yu, B., Nahrstedt, K.: AVPUC: Automatic video production with user customization. Multimecia Computing and Networking Conference (2005)
Yu, B., Zhang, C., Rui, Y., Nahrstedt, K.: A three-layer virtual director model for supporting automated multi-site distributed education. Proceedings of ICME (2006)
Onishi, M., Fukunaga, K.: Shooting the lecture scene using computer controlled cameras based on situation understanding and evaluation of video images. International Conference on Pattern Recognition (ICPR) (2004)
Wang, F., Ngo, C.W., Pong, T.C.: Exploiting self-adaptive posture-based focus esitmation for lecture video editing. Proceedings of the ACM Multimedia (2005)
Wang, F., Ngo, C.W., Pong, T.C.: Gesture tracking and recognition for video editing. International Conference on Pattern Recognition (2004)
Canton-Ferrer, C., Casas, J.R., Pardà s, M.: Fusion of multiple viewpoint information towards 3d face robust orientation detection. Proceedings of ICIP (2005)
Liu, T., Hjelsvold, R., Kender, J.R.: Analysis and enhancement of videos of electronic slide presentation. Proceedings of ICME (2002)
Gleicher, M., Masanz, J.: Towards virtual videography. Proceedings of the ACM Multimedia (2000) 375-378
Heng, W.J., Tian, Q.: Content enhancement for e-learning lecture video using foreground/background separation. IEEE Workshop on Multimedia Signal Processing (2002)
He, L.W., Zhang, Z.: Real-time whiteboard capture and processing using a video camera for teleconferencing. Microsoft Research Technical report, MSRTR-2004-91 (2004)
He, L., Sanocki, E., Gupta, A., Grudin, J.: Auto-summarization of audio-video presentations. Proceeding of the ACM Multimedia (1999)
Liu, T., Kender, J.R.: Rule-based semantic summarization of instructional videos. Proceedings of ICIP (2002)
Liu, T., Kender, J.R.: Semantic mosaic for indexing and compressing instruc- tional videos. Proceedings of ICIP (2002)
Vinciarelli, A., Odobez, J.M.: Application of information retrieval technologies to presentation slides. IEEE Transactions on Multimedia 8 (5) (2006) 981-995
Hain, T., Burget, L., Dines, J., Garau, G., Karafiat, M., Lincoln, M., McCowan, I., Moore, D., Wan, V., Ordelman, R., Renals, S.: The 2005 ami system for the transcription of speech in meetings. Proceedings of the Rich Transcription 2005 Spring (RT05s) Meeting Recognition Evaluation (2005)
Vinciarelli, A., Bourlard, H.: Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations. Technical report, IDIAP-RR 06-56 (2006)
Phung, D.Q., Venkatesh, S., Dorai, C.: High level segmentation of instructional videos based on content density. Proceedings of the ACM Multimedia (2002)
Brotherton, J.A., Abowd, G.D.: Lessons learned from eclass: Assessing auto-mated capture and access in the classroom. ACM Transactions on Computer-Human Interaction 11 (2) (2004) 121-155
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Zhang, C., Rui, Y. (2008). Automated Lecture Services. In: Tsihrintzis, G.A., Jain, L.C. (eds) Multimedia Services in Intelligent Environments. Studies in Computational Intelligence, vol 120. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78502-6_14
Download citation
DOI: https://doi.org/10.1007/978-3-540-78502-6_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78491-3
Online ISBN: 978-3-540-78502-6
eBook Packages: EngineeringEngineering (R0)