Skip to main content

Feature Analysis and Normalization Approach for Robust Content-Based Music Retrieval to Encoded Audio with Different Bit Rates

  • Conference paper
Book cover Advances in Multimedia Modeling (MMM 2009)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5371))

Included in the following conference series:

Abstract

In order to achieve highly accurate content-based music information retrieval (MIR), it is necessary to compensate the various bit rates of encoded songs which are stored in the music collection, since the bit rate differences are expected to apply a negative effect to content-based MIR results. In this paper, we examine how the bit rate differences affect MIR results, propose methods to normalize MFCC features extracted from encoded files with various bit rates, and show their effects to stabilize MIR results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Sigurdsson, S., Petersen, K.B., Lehn-Schiøler, T.: Mel Frequency Cepstral Coefficients: An Evaluation of Robustness of MP3 Encoded Music. In: Proceedings of the International Conference on Music Information Retrieval (2006)

    Google Scholar 

  2. Hoashi, K., Matsumoto, K., Sugaya, F., Ishizaki, H., Katto, J.: Feature space modification for content-based music retrieval based on user preferences. In: Proceedings of ICASSP, pp. 517–520 (2006)

    Google Scholar 

  3. Foote, J.: Content-based retrieval of music and audio. In: Proceedings of SPIE, vol. 3229, pp. 138–147 (1997)

    Google Scholar 

  4. Spevak, C., Favreau, E.: SOUNDSPOTTER-A prototype system for content-based audio retrieval. In: Proceedings of the International Conference on Digital Audio Effects, pp. 27–32 (2002)

    Google Scholar 

  5. Deshpande, H., Singh, R., Nam, U.: Classification of musical signals in the visual domain. In: Proceedings of the International Conference on Digital Audio Effects (2001)

    Google Scholar 

  6. Tzanetakis, G., Cook, P.: MARSYAS: A framework for audio analysis. Organized Sound 4(3), 169–175 (2000)

    Article  Google Scholar 

  7. Mermelstein, P.: Distance measures for speech recognition. Psychological and instrumental. Pattern Recognition and Artificial Intelligence, 374–388 (1976)

    Google Scholar 

  8. Logan, B.: Mel frequency cepstral coefficients for music modeling. In: Proceedings of the International Symposium on Music Information Retrieval (2000)

    Google Scholar 

  9. Slaney, M.: Auditory toolbox, version 2. Technical Report #1998-010, Interval Research Corporation (1998)

    Google Scholar 

  10. Goto, M., et al.: RWC Music Database: Music GenreDatabase and Musical Instrument Sound Database. In: Proceedings of the International Conference on Music Information Retrieval, pp. 229–230 (2003)

    Google Scholar 

  11. Viikki, O., Laurila, k.: Cepstral domain segmental feature vector normalization for noise robust speech recognition. Speech Communication 25, 133–147 (1998)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hamawaki, S., Funasawa, S., Katto, J., Ishizaki, H., Hoashi, K., Takishima, Y. (2009). Feature Analysis and Normalization Approach for Robust Content-Based Music Retrieval to Encoded Audio with Different Bit Rates. In: Huet, B., Smeaton, A., Mayer-Patel, K., Avrithis, Y. (eds) Advances in Multimedia Modeling . MMM 2009. Lecture Notes in Computer Science, vol 5371. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92892-8_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-92892-8_32

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-92891-1

  • Online ISBN: 978-3-540-92892-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics