Skip to main content
Log in

Object-based audio for interactive football broadcast

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

An end-to-end AV broadcast system providing an immersive, interactive experience for live events is the development aim for the EU FP7 funded project, FascinatE. The project has developed real time audio object event detection and localisation, scene modelling and processing methods for multimedia data, which will allow users to navigate the event by creating their own unique user-defined scene. As part of the first implementation of the system a test shoot was carried out capturing a live Premier League football game and methods have been developed to detect, analyse, extract and localise salient audio events from a range of sensors and represent them within an audio scene in order to allow free navigation within the scene. Within this context, this paper describes a procedure for the detection, extraction and localisation of ball-kicks and whistle-blows from the pitch-side microphones used in the broadcast of football and describes a potential audio streaming format for an object-based broadcast.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

References

  1. Andrews P (2011) The sound of sport: what is real? In: IoA reproduced sound

  2. Barsanti RJ, Tummala M (2003) Wavelet-based time delay estimates for transient signals. In: IEEE Conference on Signals, Systems and Computers

  3. Benesty J (2000) Adaptive eigenvalue decomposition for passive acoustic source localization. J Acoust Soc Am 107(1):384–391

    Article  Google Scholar 

  4. Berkhout A, de Vries D, Vogel P (1993) Acoustic control by wave field synthesis. J Acoust Soc Am 93(5):2764–2778

    Article  Google Scholar 

  5. Bove VM Jr (1995) Object oriented television. SMPTE J 104(12):803–807

    Article  Google Scholar 

  6. Bove VM Jr (1996) Multimedia based on object models: some whys and hows. IBM Syst J 35(3.4):337–348

    Article  Google Scholar 

  7. Carey R, Bell G (1997) The annotated VRML 97 reference manual. Addison-Wesley Professional

  8. Carter GC (1993) Coherence and time delay estimation: an applied tutorial for research, development, test, and evaluation engineers. IEEE Press

  9. Cengarle G, Mateos T, Olaiz N, Arum P (2001) A new technology for the assisted mixing of sport events: application to live football broadcasting. In: 128th Conv. Audio Eng. Soc

  10. Chen SC, Shyu ML, Zhang C, Luo L, Chen M (2003) Detection of soccer goal shots using joint multimedia features and classification rules. In: 4th International Workshop on Multimedia Data Mining (MDM/KDD2003), pp. 36–44

  11. Cheng Z, Tjhung TT (2003) A new time delay estimator based on ETDE. IEEE Trans Signal Process 51(7):1859–1869

    Article  Google Scholar 

  12. Choi S, Cichocki A, Park H-M, Lee S-Y (2005) Blind source separation and independent component analysis: a review. Neural Inf Process Lett Rev 6(1):1–57

    Google Scholar 

  13. Do H, Silverman HF, Yu Y (2007) A real-time SRP-PHAT source location implementation using stochastic region contraction (SRC) on a large-aperture microphone array. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2007

  14. Geier M, Ahrens J, Spors S (2010) Object-based audio reproduction and the audio scene description format. Organised Sound 15(3):219–227

    Article  Google Scholar 

  15. Gerzon MA (1973) Periphony: with-height sound reproduction. J Audio Eng Soc 21(1):2–10

    Google Scholar 

  16. Gerzon MA (1985) Ambisonics in multichannel broadcasting and video. J Audio Eng Soc 33(11):859–871

    Google Scholar 

  17. Grennberg A, Sandell M (1994) Estimation of subsample time delay differences in narrowband ultrasonic echoes using the Hilbert transform correlation. IEEE Trans Ultrason Ferroelectr Freq Control 41(5):588–595

    Article  Google Scholar 

  18. Hoffmann H, Dachselt R, Meissner K (2003) An independent declarative 3D audio format on the Basis of XML. In: 2003 int. conf. on Auditory Display, ICAD

  19. Jakobsson A, Swindlehurst AL, Stoica P (1998) Subspace-based estimation of time delays and Doppler shifts. IEEE Trans Signal Process 46(9):2472–2483

    Article  Google Scholar 

  20. Kim H-G, Moreau N, Sikora T (2006) MPEG-7 audio and beyond: audio content indexing and retrieval. Wiley

  21. Knapp C, Carter G (1976) The generalized correlation method for estimation of time delay. IEEE Trans Acoust Speech Signal Process 24(4):320–327

    Article  Google Scholar 

  22. Kyriakakis C (1998) Fundamental and technological limitations of immersive audio systems. Proc IEEE 86(5):941–951

    Article  Google Scholar 

  23. Lindsay A, Herre J (2001) MPEG-7 and MPEG-7 Audio - an overview. J Audio Eng Soc 49(7/8):589–594

    Google Scholar 

  24. Metz CE (1978) Basic principles of ROC analysis. Semin Nucl Med 8(4):283–298

    Article  Google Scholar 

  25. MPEG (1998) ISO 14496-3 (MPEG-4 Audio) Final Committee Draft. MPEG Document W2203

  26. Peters N (2008) SpatDIF – the spatial sound description interchange format. In: 2008 International Computer Music Conference, ICMC

  27. Pihkala K, Lokki T (2003) Extending SMIL with 3D Audio. In: 2003 int. conf. on Auditory Display, ICAD

  28. Scheirer E, Vaananen R, Huopaniemi J (1999) AudioBIFS: describing audio scenes with the MPEG-4 multimedia standard. IEEE Trans Multimed 1(3):237–250

    Article  Google Scholar 

  29. Schreer O, Feldmann I, Weissig C, Kauff P, Schäfer R (2013) Ultrahigh-resolution panoramic imaging for format-agnostic video production. Proc IEEE 101(1):99–114

    Article  Google Scholar 

  30. Shirley BG, Kendrick P (2006) The clean audio project: digital tv as assistive technology. J Technol Disabil 18(1):31–41

    Google Scholar 

  31. Silverman HF, Yu Y, Sachar JM, Patterson WR III (2005) Performance of real-time source-location estimators for a large-aperture microphone array. IEEE Trans Speech Audio Process 13(4):593–606

    Article  Google Scholar 

  32. Torkkola K (1999) Blind separation for audio signals–are we there yet? In: Workshop on independent component analysis and blind signal separation

  33. Vincent E, Jafari MG, Abdallah SA, Plumbley MD, Davies ME (2005) Blind audio source separation, Tech Report C4DM-TR-05-01

  34. Wang J, Xu C, Chng E, Tian Q (2004) Sports highlight detection from keyword sequences using hmm. In: IEEE International Conference on Multimedia and Expo (ICME’04), pp. 599–602

  35. Watlington JA, Bove VM Jr (1997) A system for parallel media processing. Parallel Comput 23(12):1793–1809

    Article  MATH  Google Scholar 

  36. Westner AG (1998) Object-based audio capture: separating acoustically-mixed sounds. Massachusetts Institute of Technology (MIT), US

    Google Scholar 

  37. Zongchuang L, Xingzhao L, Yongtan L (2002) A modified time delay estimation algorithm based on higher order statistics for signal detection problems. 6th IEEE Int. Conf. on Signal Processing

Download references

Acknowledgments

This research project work is part of the FascinatE project which has received funding from the European Union’s Seventh Framework Programme (FP7/2007- 2013) under grant agreement no: 248138.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Robert Oldfield.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Oldfield, R., Shirley, B. & Spille, J. Object-based audio for interactive football broadcast. Multimed Tools Appl 74, 2717–2741 (2015). https://doi.org/10.1007/s11042-013-1472-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-013-1472-2

Keywords

Navigation