Effects of Audio Compression on Chord Recognition

  • Aiko Uemura
  • Kazumasa Ishikura
  • Jiro Katto
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8326)


Feature analysis of audio compression is necessary to achieve high accuracy in musical content recognition and content-based music information retrieval (MIR). Bit rate differences are expected to adversely affect musical content analysis and content-based MIR results because the frequency response might be changed by the encoding. In this paper, we specifically examine its effect on the chroma vector, which is a commonly used feature vector for music signal processing. We analyze sound qualities extracted from encoded music files with different bit rates and compare them with the chroma features of original songs obtained using datasets for chord recognition.


chroma vector audio compression chord recognition 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Hamawaki, S., Funasawa, S., Katto, J., Ishizaki, H., Hoashi, K., Takishima, Y.: Feature Analysis and Normalization Approach for Robust Content-Based Music Retrieval to Encoded Audio with Different Bit Rates. In: Huet, B., Smeaton, A.F., Mayer-Patel, K., Avrithis, Y. (eds.) MMM 2009. LNCS, vol. 5371, pp. 298–309. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  2. 2.
    Fujishima, T.: Realtime Chord Recognition of Musical Sound: a System using Common Lisp Music. In: Proceedings of the International Computer Music Association, pp. 464–467 (1999)Google Scholar
  3. 3.
    Harte, C., Sandler, M.: Automatic Chord Identification using a Quantised Chromagram. In: Proceedings of the Audio Engineering Society (2005)Google Scholar
  4. 4.
    Ellis, D., Poliner, G.: Identifying Cover Songs with Chroma Features and Dynamic Programming Beat Tracking. In: Proceedings of ICASSP, pp. 1429–1432 (2007)Google Scholar
  5. 5.
    Mauch, M., Dixon, S.: Approximate Note Transcription for the Improved Identification of Difficult Chords. In: Proceedings of the International Society for Music Information Retrieval Conference (2010)Google Scholar
  6. 6.
    Müller, M., Ewert, S.: Towards Timbre-invariant Audio Features for Harmony-based Music. IEEE Trans. on Audio, Speech, and Language Processing 18(3), 649–662 (2010)CrossRefGoogle Scholar
  7. 7.
    Thiede, T., Treurniet, W.C., Bitto, R., Schmidmer, C., Sporer, T., Beerends, J.G., Colomes, C.: PEAQ-The ITU Standard for Objective Measurement of Perceived Audio Quality. Journal of Audio Engineering Society 48(1/2), 3–29 (2000)Google Scholar
  8. 8.
    LAME MP3 Encoder,
  9. 9.
  10. 10.
  11. 11.
    Intelligent Sound Processing,
  12. 12.
    Kabal, P.: An Examination and Interpretation of ITU-R BS.1387: Perceptual Evaluation of Audio Quality. TSP Lab Technical Report, Dept. ECE, McGill University (2002)Google Scholar
  13. 13.
    Supervised Chord Recognition for Music Audio in Matlab,
  14. 14.
    Joachims, T.: Sequence Tagging with Structural Support Vector Machines (2008),
  15. 15.
  16. 16.
    Goto, M.: AIST Annotation for the RWC Music Database. In: Proceedings of the International Conference on Music Information Retrieval, pp. 359–360 (2006)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Aiko Uemura
    • 1
  • Kazumasa Ishikura
    • 1
  • Jiro Katto
    • 1
  1. 1.Waseda UniversityShinjuku-kuJapan

Personalised recommendations