A Matlab Toolbox for Music Information Retrieval
We present MIRToolbox, an integrated set of functions written in Matlab, dedicated to the extraction from audio files of musical features related, among others, to timbre, tonality, rhythm or form. The objective is to offer a state of the art of computational approaches in the area of Music Information Retrieval (MIR). The design is based on a modular framework: the different algorithms are decomposed into stages, formalized using a minimal set of elementary mechanisms, and integrating different variants proposed by alternative approaches — including new strategies we have developed —, that users can select and parametrize. These functions can adapt to a large area of objects as input.
This paper offers an overview of the set of features that can be extracted with MIRToolbox, illustrated with the description of three particular musical features. The toolbox also includes functions for statistical analysis, segmentation and clustering.
One of our main motivations for the development of the toolbox is to facilitate investigation of the relation between musical features and music-induced emotion. Preliminary results show that the variance in emotion ratings can be explained by a small set of acoustic features.
KeywordsAudio Signal Acoustic Feature Emotion Rating Musical Piece Musical Genre
Unable to display preview. Download preview PDF.
- EEROLA, T. and TOIVIAINEN, P. (2004): MIR in Matlab: The MIDI Toolbox. Proceedings of 5th International Conference on Music Information Retrieval, 22-27, Barcelona.Google Scholar
- FOOTE, J. and COOPER, M. (2003): Media segmentation using self-similarity decomposi-tion. In Proceedings of SPIE Storage and Retrieval for Multimedia Databases, 5021, 167-75.Google Scholar
- GOMEZ, E. (2006): Tonal description of polyphonic audio for music content processing. IN-FORMS Journal on Computing, 18-3, 294-304.Google Scholar
- JUSLIN, P. N. (1997): Emotional communication in music performance: A functionalist per-spective and some data. Music Perception, 14, 383-418.Google Scholar
- KRUMHANSL, C. (1990): Cognitive Foundations of Musical Pitch. Oxford University Press, New York.Google Scholar
- NABNEY, I. (2002): NETLAB: Algorithms for Pattern Recognition. Springer Advances In Pattern Recognition Series, Springer-Verlag, New-York.Google Scholar
- RABINER, L. and JUANG, B. H. (1993): Fundamentals of Speech Recognition. Prentice-Hall. SCHERER, K. R. and OSHINSKY J. S. (1977): Cue utilization in emotion attribution from auditory stimuli. Motivation and Emotion, 1-4, 331-346.Google Scholar
- SLANEY, M. (1998): Auditory Toolbox Version 2. Technical Report 1998-010, Interval Re-search Corporation.Google Scholar
- TZANETAKIS, G and COOK, P. (1999): Multifeature audio segmentation for browsing and annotation. Proceedings of the 1999 IEEE Workshop on Applications of Signal Process-ing to Audio and Acoustics. New-York.Google Scholar
- VESANTO, J. (1999): Self-organizing map in Matlab: the SOM Toolbox. Proceedings of the Matlab DSP Conference 1999. Espoo, Finland,35-40.Google Scholar