A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models

  • Kyogu Lee
Conference paper

DOI: 10.1007/978-3-540-79860-6_11

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4918)
Cite this paper as:
Lee K. (2008) A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models. In: Boujemaa N., Detyniecki M., Nürnberger A. (eds) Adaptive Multimedia Retrieval: Retrieval, User, and Semantics. AMR 2007. Lecture Notes in Computer Science, vol 4918. Springer, Berlin, Heidelberg

Abstract

We describe a system for automatic chord transcription from the raw audio using genre-specific hidden Markov models trained on audio-from-symbolic data. In order to avoid enormous amount of human labor required to manually annotate the chord labels for ground-truth, we use symbolic data such as MIDI files to automate the labeling process. In parallel, we synthesize the same symbolic files to provide the models with the sufficient amount of observation feature vectors along with the automatically generated annotations for training. In doing so, we build different models for various musical genres, whose model parameters reveal characteristics specific to their corresponding genre. The experimental results show that the HMMs trained on synthesized data perform very well on real acoustic recordings. It is also shown that when the correct genre is chosen, simpler, genre-specific model yields performance better than or comparable to that of more complex model that is genre-independent. Furthermore, we also demonstrate the potential application of the proposed model to the genre classification task.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Kyogu Lee
    • 1
  1. 1.Center for Computer Research in Music and AcousticsStanford UniversityStanfordUSA

Personalised recommendations