Computational Analysis of Sound Scenes and Events

  • Tuomas Virtanen
  • Mark D. Plumbley
  • Dan Ellis

Table of contents

  1. Front Matter
    Pages i-x
  2. Foundations

    1. Front Matter
      Pages 1-1
    2. Tuomas Virtanen, Mark D. Plumbley, Dan Ellis
      Pages 3-12
    3. Toni Heittola, Emre Çakır, Tuomas Virtanen
      Pages 13-40
    4. Guillaume Lemaitre, Nicolas Grimault, Clara Suied
      Pages 41-67
  3. Core Methods

    1. Front Matter
      Pages 69-69
    2. Romain Serizel, Victor Bisot, Slim Essid, Gaël Richard
      Pages 71-101
    3. Annamaria Mesaros, Toni Heittola, Dan Ellis
      Pages 147-179
  4. Advanced Methods

    1. Front Matter
      Pages 181-181
    2. Catherine Guastavino
      Pages 183-213
    3. Emmanouil Benetos, Dan Stowell, Mark D. Plumbley
      Pages 215-242
    4. Slim Essid, Sanjeel Parekh, Ngoc Q. K. Duong, Romain Serizel, Alexey Ozerov, Fabio Antonacci et al.
      Pages 243-276
  5. Applications

    1. Front Matter
      Pages 277-277
    2. Frederic Font, Gerard Roma, Xavier Serra
      Pages 279-301
    3. Dan Stowell
      Pages 303-333
    4. Sacha Krstulović
      Pages 335-371
    5. Juan Pablo Bello, Charlie Mydlarz, Justin Salamon
      Pages 373-397
  6. Perspectives

    1. Front Matter
      Pages 399-399
    2. Dan Ellis, Tuomas Virtanen, Mark D. Plumbley, Bhiksha Raj
      Pages 401-415
  7. Back Matter
    Pages 417-422

About this book


This book presents computational methods for extracting useful information from audio signals, collecting the state of the art in the field of sound event and scene analysis. The authors cover the entire procedure for developing such methods, ranging from data acquisition and labeling, through the design of taxonomies used in the systems, to signal processing methods for feature extraction and machine learning methods for sound recognition. The book also covers advanced techniques for dealing with environmental variation and multiple overlapping sound sources, and for taking advantage of multiple microphones or other modalities. The book gives examples of usage scenarios in large media databases, acoustic monitoring, bioacoustics, and context-aware devices. Graphical illustrations of sound signals and their spectrographic representations are presented, as well as block diagrams and pseudocode of algorithms.

  • Gives an overview of methods for computational analysis of sound scenes and events, allowing those new to the field to become fully informed;
  • Covers all aspects of the machine learning approach to computational analysis of sound scenes and events, ranging from data capture and labeling to the development of algorithms;
  • Includes descriptions of algorithms accompanied by a website from which software implementations can be downloaded, facilitating practical interaction with the techniques.


Audio signal processing; Computational auditory scene analysis; Acoustic pattern recognition; Sound event detection; Sound scene analysis

Editors and affiliations

  • Tuomas Virtanen (1)
  • Mark D. Plumbley (2)
  • Dan Ellis (3)

  1. Laboratory of Signal Processing, Tampere University of Technology, Tampere, Finland
  2. Centre for Vision, Speech and Signal Processing, University of Surrey, Surrey, United Kingdom
  3. Google Inc., New York, USA

Bibliographic information

  • DOI
  • Copyright Information: Springer International Publishing AG 2018
  • Publisher Name: Springer, Cham
  • eBook Packages: Engineering
  • Print ISBN: 978-3-319-63449-4
  • Online ISBN: 978-3-319-63450-0