Abstract
The focus of much of the research on providing user-centered control of multimedia has been on the definition of models and (meta-data) descriptions that assist in locating or recommending media objects. While this can provide a more efficient means of selecting content, it provides little extra control for users once that content is rendered. In this article, we consider various means for supporting user-centered control of media within a collection of objects that are structured into a multimedia presentation. We begin with an examination of the constraints of user-centered control based on the characteristics of multimedia applications and the media processing pipeline. We then define four classes of control that can enable a more user-centric manipulation within media content. Each of these control classes is illustrated in terms of a common news viewing system. We continue with reflections on the impact of these control classes on the development of multimedia languages, rendering infrastructures and authoring systems. We conclude with a discussion of our plans for infrastructure support for user-centered multimedia control.
Cite this article
Bulterman, D.C.A. User-centered control within multimedia presentations. Multimedia Systems 12, 423–438 (2007). https://doi.org/10.1007/s00530-006-0065-6