Abstract
Ever since the invention of the world’s first telephone in the nineteenth century, the evolution of multimedia applications has drastically changed human life and behaviors, and has introduced new demands for multimedia synchronization. In this chapter, we present a historical view of temporal synchronization efforts with a focus on continuous multimedia (i.e., sequences of time-correlated multimedia data). We demonstrate how the development of multimedia systems has advanced the research on synchronization, and what additional challenges have been imposed by next-generation multimedia technologies. We conclude with a new application-dependent multilocation multi-demand synchronization framework to address these new challenges.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
BELLLABS: The picture of the future. Bell Labs Rec. 47(5), 134–186 (1969)
RFC-958: Network Time Protocol (NTP). http://www.ntp.org/. Accessed 28 Apr 2017
Cornell University: The CU-SeeMe Project. http://ftp.icm.edu.pl/packages/cu-seeme/html/Welcome.html. Accessed 28 Apr 2017
The Cambridge iTV Trial. http://koo.corpus.cam.ac.uk/projects/itv/. Accessed 28 Apr 2017
Caltech/CERN Project. http://pcbunn.cithep.caltech.edu/. Accessed 28 Apr 2017
IEEE-1588 standard: Precise time synchronization as the basis for real time applications in automation. https://standards.ieee.org/findstds/standard/1588-2008.html. Accessed 28 Apr 2017
RFC-5905: Network Time Protocol version 4: Protocol and algorithms specification. http://www.ntp.org/. Accessed 28 Apr 2017
Akyildiz, I.F., Yen, W.: Multimedia group synchronization protocols for integrated services networks. IEEE J. Sel. Areas Commun. 14(1), 162–173 (1996)
Anderson, D.P., Homsy, G.: A continuous media I/O server and its synchronization mechanism. IEEE Comput. 24(10), 51–57 (1991)
Arefin, A., Huang, Z., Nahrstedt, K., Agarwal, P.: 4D Telecast: Towards large scale multi-site and multi-view dissemination of 3DTI contents. In: Proceedings of IEEE 32nd International Conference on Distributed Computer Systems (ICDCS), Macau, China, pp. 82–91 (2012)
Basilio, C.: Antonio meucci inventore del telefono. Notiziario Tec. Telecommun. Ital. 12(1), 114 (2003)
Baxter, B., Scheib, V., Lin, M.C., Manocha, D.: DAB: interactive haptic painting with 3D virtual brushes. In: Proceedings of ACM Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), Los Angeles, CA, pp. 461–468 (2001)
Blakowski, G., Steinmetz, R.: A media synchronization survey: reference model, specification, and case studies. IEEE J. Sel. Areas Commun. 1, 5–35 (1996)
Blesser, B.: Digitization of audio: a comprehensive examination of theory, implementation, and current practice. AES J. Audio Eng. Soc. 26(10), 739–771 (1978)
Boronat, F., Cebollada, J.C.G., Mauri, J.L.: An RTP/RTCP based approach for multimedia group and inter-stream synchronization. Springer J. Multimedia Tools Appl. 40(2), 285–319 (2008)
Boronat, F., Lloret, J., Garcia, M.: Multimedia group and inter-stream synchronization techniques: a comparative study. Elsevier Inf. Syst. 34(1), 108–131 (2009)
Bulterman, D.: Specification and support of adaptable networked multimedia. Springer Multimedia Syst. 1(2), 68–76 (1993)
Campbell, A., Coulson, G., Garcła, F., Hutchison, D.: Orchestration services for distributed multimedia synchronisation. In: Proceedings of IFIP International Conference on High Performance Networking (HPN), Liegel, Belgium (1992)
Chung, S.M., Pereira, A.L.: Timed petri net representation of SMIL. IEEE Multimedia 12(1), 64–72 (2005)
Courtiat, J., de Oliveira, R.C.: Proving temporal consistency in a new multimedia synchronization model. In: Proceedings of ACM International Conference on Multimedia (MM), Boston, USA, pp. 141–152 (1996)
Curcio, I., Lundan, M.: Human perception of lip synchronization in mobile environment. In: Proceedings of IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM), Espoo, Finland, pp. 1–7 (2007)
Damer, B.: Avatars: Exploring and Building Virtual Worlds on the Internet, pp. 383–386 . Peachpit Press (1998)
Dannenberg, R., Stern, R.: Experiments concerning the allowable skew of two audio channels operating in the stereo mode. Pers. Commun. (1993)
Deventer, M., Stokking, H., Hammond, M., Cesar, P.: Standards for multi-stream and multi-device media synchronization. IEEE Commun. Mag. 54(3), 16–21 (2016)
Ehley, L., Furth, B., Ilyas, M.: Evaluation of multimedia synchronization techniques. In: Proceedings of IEEE International Conference on Multimedia Computing and Systems (ICMCS), Boston, USA, pp. 110–119 (1994)
Fujimoto, T., Ishibashi, Y., Sugawara, S.: Influences of inter-stream synchronization error on collaborative work in haptic and visual environments. In: Proceedings of IEEE Symposium on Haptic Interfaces for Virtual Environment and Teleoperator System (HAPTICS), Reno, USA, pp. 113–119 (2008)
Gardner, B.: A realtime multichannel room simulator. In: Proceedings of 124th Meeting of the Acoustical Society of America, New Orleans, USA (1992)
Geerts, D., Vaishnavi, I., Mekuria, R., van Deventer, O., Cesar, P.: (2011) Are we in sync? Synchronization requirements for watching online video together. In: Proceedings of the 29th ACM Conference on Human Factors in Computing Systems (SIGCHI), Vancouver, Canada, pp. 311–314
Ghinea, G., Ademoye, O.A.: Perceived synchronization of olfactory multimedia. IEEE Trans. Syst. Man Cybern. 40(4), 657–663 (2010)
Goldmann, L., Lee, J.S., Ebrahimi, T.: Temporal synchronization in stereoscopic video: Influence on quality of experience and automatic asynchrony detection, hong kong, china. In: Proceedings of IEEE International Conference on Image Processing (ICIP), pp. 3241–3244 (2010)
Hodges, M., Sasnett, R., Ackerman, M.: Athena Muse: a construction set for multimedia applications. IEEE Softw. 6(1), 37–43 (1989)
Hoshino, S., Ishibashi, Y., Fukushima, N., Sugawara, S.: Qoe assessment in olfactory and haptic media transmission: Influence of inter-stream synchronization error. In: Proceedings of IEEE International Workshop on Communications Quality and Reliability (CQR), Naples, FL, USA, pp. 1–6 (2011)
Hsu, P., Chen, Y., Chang, Y.: STRPN: a petri-net approach for modeling spatial-temporal relations between moving multimedia objects. IEEE Trans. Softw. Eng. 29(1), 63–76 (2003)
Hu, N., Steenkiste, P.: Estimating available bandwidth using packet pair probing. Carnegie Mellon University Techical Report, CMU-CS-02-166 (2002)
Huang, Z., Nahrstedt, K.: Perception-based media packet scheduling for high-quality tele-immersion. In: Proceedings of IEEE International Conference on Computer Communications (INFOCOM), Orlando, USA, pp. 29–34 (2012)
Huang, Z., Wu, W., Nahrstedt, K., Arefin, A., Rivas, R.: TSync: A new synchronization framework for multi-site 3D tele-immersion. In: Proceedings of ACM Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV), Amsterdam, the Netherlands, pp. 39–44 (2010)
Huang, Z., Mei, C., Li, L., Woo, T.: CloudStream: delivering high-quality streaming video through a cloud-based H.264/SVC proxy. In: Proceedsings of IEEE International Conference on Computer Communications (INFOCOM), Shanghai, China, pp. 201–205 (2011)
Huang, Z., Wu, W., Nahrstedt, K., Rivas, R., Arefin, A.: Synccast: synchronized dissemination in multi-site interactive 3D tele-immersion. In: Proceedings of ACM Multimedia Systems Conference (MMSYS), San Jose, USA, pp. 69–80 (2011)
Huang, Z., Nahrstedt, K., Liang, K.: Human-centric multi-layer synchronization scheme with inter-sender synchronization skew control. In: Proceedings of IEEE International Workshop on Quality of Multimedia Experience (QoMEX), Singapore, pp. 25–30 (2014)
Iimura, T.: Zoned federation of game servers: A peer-to-peer approach to scalable multi-player online games. In: Proceedings of ACM Proceedings of 3rd ACM SIGCOMM workshop on Network and system support for games (NetGames), Portland, Oregon, pp. 116–120 (2004)
Ishibashi, Y., Tasaka, S.: A distributed control scheme for group synchronization in multicast communications. In: Proceedings of International Symposium Communications (ISCOM), Japan, pp. 317–323 (1999)
Ishibashi, Y., Tasaka, S.: A comparative survey of synchronization algorithms for continuous media in network environments. In: Proceedings of IEEE Conference on Local Computer Networks (LCN), Tampa, FL, USA, pp. 337–348 (2000)
Ishibashi, Y., Tsuji, A., Tasaka, S.: A group synchronization mechanism for stored media in multicast communications. In: Proceedings of Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM), Kobe, Japan, pp. 692–700 (1997)
ISO International Standard: Information technology hypermedia/time-based structuring language (HyTime). https://www.iso.org/standard/18834.html (1992). Accessed 28 Apr 2017
ITU-H263: Video coding for low bit rate communication. http://www.itu.int/rec/T-REC-H.263/en/ (2005). Accessed 28 Apr 2017
ITU-H323: Packet-based multimedia communications systems. http://www.itu.int/rec/T-REC-H.323/en/ (2009). Accessed 28 Apr 2017
King, P.: Towards a temporal logic based formalism for expressing temporal constraints in multimedia documents. Technical Report, 942, LRI, Universite de Paris-Sud, Orsay, France (1994)
Leroux, P., Verstraete, V., De Turck, F., Demeester, P.: Synchronized interactive services for mobile devices over IPDC/DVB-H and UMTS. In: Proceedings of IEEE/IFIP International Workshop on Broadband Convergence Networks (BCN), Munich, Germany, pp. 1–12 (2007)
Little, T.: A framework for synchronous delivery of time-depdent multimedia data. Springer Multimedia Syst. 1(2), 87–94 (1993)
Little, T., Ghafoor, A.: Synchronization and storage models for multimedia objects. IEEE J. Sel. Areas Commun. 8(3), 413–427 (1990)
Little, T., Ghafoor, A.: Spatio-temporal composition of distributed multimedia objects for value-added networks. IEEE Comput. 24(10), 42–50 (1991)
Lo, B., Thiemjarus, S., King, R., Yang, G.: Body sensor network - a wireless sensor platform for pervasive healthcare monitoring. In: Proceedings of IEEE International Conference on Pervasive Computing (PERCOM), pp. 77–80 (2005)
Marcheschi, S., Portillo ,O., Raspolli, M., Avizzano, C., Bergamasco, M.: The haptic desktop: a novel 2D multimodal device. In: Proceedings of IEEE International Conference on Robot and Human Interactive Communication (ROMAN), Kurashiki, Okayama, Japan, pp. 521–526 (2004)
Mayagoitia, R.E., Nene, A.V., Veltink, P.H.: Accelerometer and rate gyroscope measurement of kinematics: an inexpensive alternative to optical motion analysis systems. Elsevier J. Biomech. 35(4), 537–542 (2002)
Meyer, T., Effelsberg, W., Steinmetz, R.: A taxonomy on multimedia synchronization. In: Proceedings of IEEE Workshop on Future Trends of Distributed Computing Systems, Lisbon, Portugal, pp. 97–103 (1994)
Michel, U.: History of acoustic beamforming. In: Proceedings of Berlin Beamforming Conference (BeBeC), Berlin, Germany (2006)
Montagud, M., Boronat, F.: On the use of adaptive media playout for inter-destination synchronization. IEEE Commun. Lett. 15(8), 863–865 (2011)
Montagud, M., Boronat, F., Stokking, H., van Brandenburg, R.: Inter-destination multimedia synchronization: schemes, use cases and standardization. Springer Multimedia Syst. 18(6), 459–482 (2012)
Montagud, M., Boronat, F., Stokking, H., César, P.: Design, development and assessment of control schemes for IDMS in a standardized RTCP-based solution. Elsevier Comput. Netw. 70(1), 240–259 (2014)
Montagud, M., Boronat, F., Roig, B., Sapena, A.: How to perform AMP? Cubic adjustments for improving the QoE. Elsevier Comput. Commun. 103, 61–73 (2017)
Montagud Climent, M.A., Jansen, AJ., Cesar Garcia, PS., Boronat, F.: Review of media sync reference models: Advances and open issues. Media Synchronization Workshop (MediaSync), Brussels, Belgium (2015)
Murray, N., Lee, B., Qiao, Y., Muntean, G.:The influence of human factors on olfaction based mulsemedia quality of experience. In: Proceedings of IEEE International Conference on Quality of Multimedia Experience (QoMEX), Lisbon, Portugal, pp. 1–6 (2016)
PictureTel: Picturetel In Project With I.B.M., New York Times. http://www.nytimes.com/1991/10/22/business/company-news-picturetel-in-project-with-ibm.html (1991). Accessed 28 Apr 2017
Rainer, B., Timmerer, C.: A quality of experience model for adaptive media playout. In: Proceedings of IEEE International Workshop on Quality of Multimedia Experience (QoMEX), Singapore, pp. 1–4 (2014)
Rainer, B., Petscharnig, S., Timmerer, C.: Is one second enough? Evaluating QoE for inter-destination multimedia schronization using human computation and crowdsourcing. In: Seventh International Workshop on Quality of Multimedia Experience, pp. 1–6 (2015)
Ramanathan, S., Rangan, P.: Feedback techniques for intra-media continuity and inter-media synchronization in distributed media systems. Comput. J. Oxford Univ. Press 36(1), 19–31 (1993)
Ramanathan, S., Rangan, P.V.: Continuous media synchronization in distributed multimedia systems. In: Proceedings of ACM International Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV), La Jolla, CA, USA, pp. 289–296 (1992)
Ravindran, K., Bansal, V.: Delay compensation protocols for synchronization of multimedia data streams. IEEE Trans. Knowl. Data Eng. 4(5), 574–589 (1993)
RFC-1889: Obsolete version—RTP: a transport protocol for real-time applications. http://tools.ietf.org/html/rfc1889/ (1996). Accessed 28 Apr 2017
RFC-3550: RTP: a transport protocol for real-time applications. http://tools.ietf.org/html/rfc3550/ (2003). Accessed 28 Apr 2017
RFC-3611: RTP control protocol extended reports (RTCP XR). http://tools.ietf.org/html/rfc3611/ (2003). Accessed 28 Apr 2017
RFC-7272: Inter-destination media synchronization (IDMS) using the RTP control protocol (RTCP). http://tools.ietf.org/html/rfc7272 (2014). Accessed 28 Apr 2017
Rothermel, K., Helbig, T.: An adaptive stream synchronization protocol. In: Proceedings of ACM International Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV), Durham, NH, USA, pp. 189–202 (1995)
Rouskas, G.N., Baldine, I.: Multicast routing with end-to-end delay and delay variation constraints. IEEE J. Sel. Areas in Commun. 15(3), 346–356 (1997)
Shi, S.Y., Turner, J.S., Waldvogel, M.: Dimensioning server access bandwidth and multicast routing in overlay networks. In: Proceedings of ACM International Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV), Danfords on the Sound, NY, USA, pp. 83–92 (2001)
Steinmetz, R.: Analyse von synchronisation mechanismen mit anwendung im multimedia-bereich. In: Proceedings of GI ITG Workshop Sprachen und System zur Parallelverarbeitung, Arnoldshain, Germany, pp. 39–47 (1990)
Steinmetz, R.: Human perception of jitter and media synchronation. IEEE J. Sel. Areas Commun. 14(1), 61–72 (1996)
Steinmetz, R., Nahrstedt, K.: Multimedia Computing, Communications and Applications. Prentice Hall (2015)
Stockham, T.: A/D and D/A converters: their effect on digital audio fidelity. IEEE Digital Signal Process. 55–66 (1972)
Tov, S.Y.: Happy 10th birthday, VoIP, The Marker. http://archive.li/TqIrI (2005). Accessed 28 Apr 2017
Vaishnavi, I., Cesar, P., Bulterman, D., Friedrich, O., Gunkel, S., Geerts, D.: From IPTV to synchronous shared experiences: challenges in design: distributed media synchronization. Signal Process. Image Commun. 26, 370–377 (2011)
Wahl, T., Rothernel, K.: Representing time in multimedia systems. In: Proceedings of IEEE International Conference on Multimedia Computing and Systems (ICMCS), Boston, USA, pp. 538–543 (1994)
Woo, M., Qazi, N., Ghafoor, A.: A synchronization framework for communication of pre-orchestrated multimedia information. IEEE Netw. 1(8), 52–61 (1994)
Yavatkar, R.: MCP: A protocol for coordination and temporal synchronization in collaborative applications. In: Proceedings of the IEEE International Conference Distributed Computing Systems (ICDCS), Yokohama, Japan, pp. 606–613 (1992)
Zhang, X., Liu, J., Li, B., shing Peter Yum T.: CoolStreaming/DONet: A data-driven overlay network for peer-to-peer live media streaming. In: Proceedings of IEEE International Conference on Computer Communications (INFOCOM), Miami, USA, pp. 2102–2111 (2005)
Zimmermann R, Liang K.: Spatialized audio streaming for networked virtual environments. In: Proceedings of ACM International Conference on Multimedia (MM), Vancouver, Canada, pp. 299–308 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
Appendix I: Mathematical Symbols and Denotations
Table 2.1 summarizes the mathematical symbols and denotations in this chapter.
Appendix II: Comparison Summary of Synchronization Studies
We summarize two comparison tables for the synchronization studies we have discussed in Sect. 2.3. Table 2.2 is for discussing the synchronization specification models in Sect. 2.3.2.3. Compared to interval-based and Petri-net-based specification models, Table 2.2 shows that both axis-based and control-based specification models are easy to implement and add/remove media objects, but still they require additional information and mechanisms during synchronization specifications.
Table 2.3 is for evaluating the inter-receiver/group synchronization control algorithms in Sect. 2.3.3.2. In general, centralized approaches have lower communication overhead, and adaptive responsiveness is much faster when compared to distributed approaches.
Appendix III: Synchronization Reference Selection in Tele-immersive (TI) System
In this section, we present an example of synchronization reference selection methodology in our current TI implementation. Note that the selection rule is policy-based, meaning that it can vary depending on specific end user interests in different multimedia applications.
Intra-stream Synchronization
The reference frame or the intra-stream synchronization reference is usually selected as the first media frame within a sensory stream at each system control update. Hence, other media frames behind it can be played at the output devices by consulting their original captured inter-frame periods at the media sensor.
Intra-media Synchronization
The intra-media synchronization reference is selected as the reference stream which has the largest contribution to end user interests within a media modality. The media contribution can vary depending on the characteristics of each modality. Here, we discuss four commonly deployed media modalities which we have used.
Multi-view videos. Multi-view video streams capture the same physical object at the same time, but from different viewpoints. The importance of each video stream is decided by their contributions of 3D image pixels to the end user viewpoint [36], which can be computed using the orientation difference between the sender camera and the receiver view. Given the sender \(n^x\)’s camera orientation of a video stream \(s^x_{\text {V},i}\) (denoted as \(\mathbf {O}(s^x_{\text {V},i})\)), and the desired view orientation of \(n^x\)’s videos for receiver site \(n^y\) (denoted as \(\mathbf {O}^{x,y}\)), the visual contribution or the contribution factor (CF) of \(s^x_{\text {V},i}\) to the receiver site \(n^y\) is defined by 2.9 as
Hence, the video reference stream is elected as the video stream with the largest CF within the video modality for each receiver.
Spatial audios. Multiple omnidirectional microphones concurrently record the same physical ambient environment. The contribution of each audio stream is decided by its signal-to-noise ratio (SNR), a metric indicating the intelligibility of the speaker’s utterances. SNR can be computed online by estimating the noises during silence periods. We prescribe that the audio reference stream is the audio stream with the largest SNR within the audio modality.
Haptics or Body sensory streams. Multiple haptic or body sensory streams may record different parts of a physical object. In the TI systems, we decide the haptic/body reference stream as the one with the largest data rate within the haptic/body sensory modality, because a larger data rate for these sensory streams usually means higher precision information.
Intra-bundle Synchronization
The importance of media modalities can vary at different applications, and the intra-bundle synchronization reference is defined as the most important reference modality. Empirically, for TI systems, we can classify different applications based on real user perceptual feedback. (1) Users attach more importance to the intelligibility of audio signals in a conversation-oriented application (e.g., conferencing or remote education), so the reference modality is the audio. (2) The clarity of video signals is of the greatest significance in a collaborative task with fine motor skills (e.g., rock–paper–scissor gaming or cyber-archeology), so the video is selected as the reference modality. (3) The body sensory streams can have the largest contribution in the telehealth or the remote rehabilitation application, because the doctors need to evaluate the patient’s health status by consistent body sensory feedback. Thus, we choose the body sensory modality as the reference.
Intra-session Synchronization
In multisite interactive multimedia systems, the most active site usually demands higher quality streaming bundles in order to guarantee uninterrupted collaborations in a session. The intra-session synchronization reference of inter-sender or inter-receiver synchronization is, thus, selected as the media bundle corresponding to the most active user among all senders or receivers. In the TI systems, for example, this user usually takes the lead in the multimedia applications (e.g., a trainer in the remote education, a director in the conferencing, or a doctor in the telehealth). The selection of the lead person is context-dependent, so it must be specified explicitly by the media applications.
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Huang, Z., Nahrstedt, K., Steinmetz, R. (2018). Evolution of Temporal Multimedia Synchronization Principles. In: Montagud, M., Cesar, P., Boronat, F., Jansen, J. (eds) MediaSync. Springer, Cham. https://doi.org/10.1007/978-3-319-65840-7_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-65840-7_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65839-1
Online ISBN: 978-3-319-65840-7
eBook Packages: Computer ScienceComputer Science (R0)