Speech/Music Discrimination in Audio Podcast Using Structural Segmentation and Timbre Recognition

  • Mathieu Barthet
  • Steven Hargreaves
  • Mark Sandler
Conference paper

DOI: 10.1007/978-3-642-23126-1_10

Volume 6684 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Barthet M., Hargreaves S., Sandler M. (2011) Speech/Music Discrimination in Audio Podcast Using Structural Segmentation and Timbre Recognition. In: Ystad S., Aramaki M., Kronland-Martinet R., Jensen K. (eds) Exploring Music Contents. CMMR 2010. Lecture Notes in Computer Science, vol 6684. Springer, Berlin, Heidelberg

Abstract

We propose two speech/music discrimination methods based on timbre models and measure their performance on a 3-hour database of radio podcasts from the BBC. In the first method, the machine-estimated classifications obtained with an automatic timbre recognition (ATR) model are post-processed using median filtering. The classification system (LSF/K-means) was trained at two different taxonomic levels: a high-level one (speech, music) and a lower-level one (male and female speech, classical, jazz, rock & pop). The second method combines automatic structural segmentation and timbre recognition (ASS/ATR). The ASS evaluates the similarity between feature distributions (MFCC, RMS) using HMM and soft K-means algorithms. Both methods were evaluated at a semantic level (relative correct overlap, RCO) and at a temporal level (boundary retrieval F-measure). The ASS/ATR method obtained the best results (average RCO of 94.5% and boundary F-measure of 50.1%). These results compared favourably with those obtained by an SVM-based technique that provides a good benchmark of the state of the art.
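The post-processing and semantic evaluation steps mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes frame-level binary labels (0 = speech, 1 = music), a hypothetical window width, and edge padding at the sequence boundaries. RCO is taken here to be the fraction of frames whose estimated label matches the reference, which may differ in detail from the definition used in the paper. The function names are illustrative only.

```python
def median_filter_labels(labels, width=5):
    """Smooth a sequence of binary labels (0 = speech, 1 = music).

    Assumption: `width` is odd and the sequence is padded by repeating
    its first and last labels, so every window has exactly `width` items.
    For binary labels the median equals the majority vote in the window.
    """
    if width < 1 or width % 2 == 0:
        raise ValueError("width must be a positive odd integer")
    if not labels:
        return []
    half = width // 2
    padded = [labels[0]] * half + list(labels) + [labels[-1]] * half
    # The window starting at padded[i] is centred on labels[i].
    return [sorted(padded[i:i + width])[half] for i in range(len(labels))]


def relative_correct_overlap(estimated, reference):
    """Fraction of frames where the estimated label equals the reference.

    Assumption: both sequences use the same frame rate and length.
    """
    if len(estimated) != len(reference):
        raise ValueError("sequences must have the same length")
    if not reference:
        raise ValueError("sequences must not be empty")
    matches = sum(e == r for e, r in zip(estimated, reference))
    return matches / len(reference)


# Toy example: isolated misclassifications are removed by the filter.
reference = [0, 0, 0, 0, 1, 1, 1, 1]
raw = [0, 1, 0, 0, 1, 0, 1, 1]
smoothed = median_filter_labels(raw, width=3)

rco_raw = relative_correct_overlap(raw, reference)            # 0.75
rco_smoothed = relative_correct_overlap(smoothed, reference)  # 0.875
```

In this toy case the filter removes two isolated errors but shifts the speech/music boundary by one frame. This shows why a frame-overlap score and a boundary-retrieval score can behave differently.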

Keywords

  • Speech/Music Discrimination
  • Audio Podcast
  • Timbre Recognition
  • Structural Segmentation
  • Line Spectral Frequencies
  • K-means clustering
  • Mel-Frequency Cepstral Coefficients
  • Hidden Markov Models

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Mathieu Barthet (1)
  • Steven Hargreaves (1)
  • Mark Sandler (1)

  1. Centre for Digital Music, Queen Mary University of London, London, United Kingdom