Interactive Music with Active Audio CDs

Marchand, Sylvain; Mansencal, Boris; Girin, Laurent

doi:10.1007/978-3-642-23126-1_3

Sylvain Marchand²⁰,
Boris Mansencal²⁰ &
Laurent Girin²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6684))

Included in the following conference series:

International Symposium on Computer Music Modeling and Retrieval

1163 Accesses
2 Citations
3 Altmetric

Abstract

With a standard compact disc (CD) audio player, the only possibility for the user is to listen to the recorded track, passively: the interaction is limited to changing the global volume or the track. Imagine now that the listener can turn into a musician, playing with the sound sources present in the stereo mix, changing their respective volumes and locations in space. For example, a given instrument or voice can be either muted, amplified, or more generally moved in the acoustic space. This will be a kind of generalized karaoke, useful for disc jockeys and also for music pedagogy (when practicing an instrument). Our system shows that this dream has come true, with active CDs fully backward compatible while enabling interactive music. The magic is that “the music is in the sound”: the structure of the mix is embedded in the sound signal itself, using audio watermarking techniques, and the embedded information is exploited by the player to perform the separation of the sources (patent pending) used in turn by a spatializer.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Algazi, V.R., Duda, R.O., Thompson, D.M., Avendano, C.: The CIPIC HRTF database. In: Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, pp. 99–102 (2001)
Google Scholar
Araki, S., Sawada, H., Makino, S.: K-means based underdetermined blind speech separation. In: Makino, S., et al. (eds.) Blind Source Separation, pp. 243–270. Springer, Heidelberg (2007)
Chapter Google Scholar
Araki, S., Sawada, H., Mukai, R., Makino, S.: Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors. Signal Processing 87(8), 1833–1847 (2007)
Article MATH Google Scholar
Bass, H., Sutherland, L., Zuckerwar, A., Blackstock, D., Hester, D.: Atmospheric absorption of sound: Further developments. Journal of the Acoustical Society of America 97(1), 680–683 (1995)
Article Google Scholar
Berg, R.E., Stork, D.G.: The Physics of Sound, 2nd edn. Prentice Hall, Englewood Cliffs (1994)
Google Scholar
Blauert, J.: Spatial Hearing. revised edn. MIT Press, Cambridge (1997); Translation by J.S. Allen
Google Scholar
Bofill, P., Zibulevski, M.: Underdetermined blind source separation using sparse representations. Signal Processing 81(11), 2353–2362 (2001)
Article MATH Google Scholar
Chen, B., Wornell, G.: Quantization index modulation: A class of provably good methods for digital watermarking and information embedding. IEEE Transactions on Information Theory 47(4), 1423–1443 (2001)
Article MathSciNet MATH Google Scholar
Chowning, J.M.: The simulation of moving sound sources. Journal of the Acoustical Society of America 19(1), 2–6 (1971)
Google Scholar
International Organization for Standardization, Geneva, Switzerland: ISO 9613-1:1993: Acoustics – Attenuation of Sound During Propagation Outdoors – Part 1: Calculation of the Absorption of Sound by the Atmosphere (1993)
Google Scholar
ISO/IEC JTC1/SC29/WG11 MPEG: Information technology Generic coding of moving pictures and associated audio information Part 7: Advanced Audio Coding (AAC) IS13818-7(E) (2004)
Google Scholar
ITU-R: Method for objective measurements of perceived audio quality (PEAQ) Recommendation BS1387-1 (2001)
Google Scholar
Kuhn, G.F.: Model for the interaural time differences in the azimuthal plane. Journal of the Acoustical Society of America 62(1), 157–167 (1977)
Article Google Scholar
Mouba, J., Marchand, S., Mansencal, B., Rivet, J.M.: RetroSpat: a perception-based system for semi-automatic diffusion of acousmatic music. In: Proceedings of the Sound and Music Computing (SMC) Conference, Berlin, pp. 33–40 (2008)
Google Scholar
O’Grady, P., Pearlmutter, B.A., Rickard, S.: Survey of sparse and non-sparse methods in source separation. International Journal of Imaging Systems and Technology 15(1), 18–33 (2005)
Article Google Scholar
Parvaix, M., Girin, L.: Informed source separation of underdetermined instantaneous stereo mixtures using source index embedding. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, Texas (2010)
Google Scholar
Parvaix, M., Girin, L.: Informed source separation of linear instantaneous under-determined audio mixtures by source index embedding. IEEE Transactions on Audio, Speech, and Language Processing (accepted, pending publication, 2011)
Google Scholar
Pinel, J., Girin, L., Baras, C.: A high-rate data hiding technique for uncompressed audio signals. IEEE Transactions on Audio, Speech, and Language Processing (submitted)
Google Scholar
Pinel, J., Girin, L., Baras, C., Parvaix, M.: A high-capacity watermarking technique for audio signals based on MDCT-domain quantization. In: International Congress on Acoustics (ICA), Sydney, Australia (2010)
Google Scholar
Plumbley, M.D., Blumensath, T., Daudet, L., Gribonval, R., Davies, M.E.: Sparse representations in audio and music: From coding to source separation. Proceedings of the IEEE 98(6), 995–1005 (2010)
Article Google Scholar
Princen, J., Bradley, A.: Analysis/synthesis filter bank design based on time domain aliasing cancellation. IEEE Transactions on Acoustics, Speech, and Signal Processing 64(5), 1153–1161 (1986)
Article Google Scholar
Strutt (Lord Rayleigh), J.W.: Acoustical observations i. Philosophical Magazine 3, 456–457 (1877)
Article Google Scholar
Strutt (Lord Rayleigh), J.W.: On the acoustic shadow of a sphere. Philosophical Transactions of the Royal Society of London 203A, 87–97 (1904)
Article MATH Google Scholar
Thiede, T., Treurniet, W., Bitto, R., Schmidmer, C., Sporer, T., Beerends, J., Colomes, C.: PEAQ - the ITU standard for objective measurement of perceived audio quality. Journal of the Audio Engineering Society 48(1), 3–29 (2000)
Google Scholar
Tournery, C., Faller, C.: Improved time delay analysis/synthesis for parametric stereo audio coding. Journal of the Audio Engineering Society 29(5), 490–498 (2006)
Google Scholar
Vincent, E., Gribonval, R., Plumbley, M.D.: Oracle estimators for the benchmarking of source separation algorithms. Signal Processing 87, 1933–1950 (2007)
Article MATH Google Scholar
Viste, H.: Binaural Localization and Separation Techniques. Ph.D. thesis, École Polytechnique Fédérale de Lausanne, Switzerland (2004)
Google Scholar
Woodworth, R.S.: Experimental Psychology. Holt, New York (1954)
Google Scholar
Yılmaz, O., Rickard, S.: Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing 52(7), 1830–1847 (2004)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

LaBRI – CNRS, University of Bordeaux, France
Sylvain Marchand & Boris Mansencal
GIPSA-lab – CNRS, Grenoble Institute of Technology, France
Laurent Girin

Authors

Sylvain Marchand
View author publications
You can also search for this author in PubMed Google Scholar
Boris Mansencal
View author publications
You can also search for this author in PubMed Google Scholar
Laurent Girin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

CNRS - LMA, 31 Chemin Joseph Aiguier, 13402, Marseille Cedex 20, France
Sølvi Ystad
CNRS-INCM, 31 Chemin Joseph Aiguier, 13402, Marseille Cedex 20, France
Mitsuko Aramaki
CNRS-LMA, 31 Chemin Joseph Aiguier, 13402, Marseille Cedex 20, France
Richard Kronland-Martinet
Aalborg University Esbjerg, Niels Bohr Vej 8, 6700, Esbjerg, Denmark
Kristoffer Jensen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Marchand, S., Mansencal, B., Girin, L. (2011). Interactive Music with Active Audio CDs. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K. (eds) Exploring Music Contents. CMMR 2010. Lecture Notes in Computer Science, vol 6684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23126-1_3

Download citation

DOI: https://doi.org/10.1007/978-3-642-23126-1_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23125-4
Online ISBN: 978-3-642-23126-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics