Pitch-Dependent Identification of Musical Instrument Sounds
- 113 Downloads
This paper describes a musical instrument identification method that takes into consideration the pitch dependency of timbres of musical instruments. The difficulty in musical instrument identification resides in the pitch dependency of musical instrument sounds, that is, acoustic features of most musical instruments vary according to the pitch (fundamental frequency, F0). To cope with this difficulty, we propose an F0-dependent multivariate normal distribution, where each element of the mean vector is represented by a function of F0. Our method first extracts 129 features (e.g., the spectral centroid, the gradient of the straight line approximating the power envelope) from a musical instrument sound and then reduces the dimensionality of the feature space into 18 dimension. In the 18-dimensional feature space, it calculates an F0-dependent mean function and an F0-normalized covariance, and finally applies the Bayes decision rule. Experimental results of identifying 6,247 solo tones of 19 musical instruments shows that the proposed method improved the recognition rate from 75.73% to 79.73%.
Keywordsmusical instrument identification the pitch dependency fundamental frequency automatic music transcription computational auditory scene analysis
Unable to display preview. Download preview PDF.
- J.C. Brown, “Computer identification of musical instruments using pattern recognition with cepstral coefficients as features,” Journal of Acoustic Society of America vol. 103, no. 3, pp. 1933–1941, 1999.Google Scholar
- A. Eronen and A. Klapuri, “Musical instrument recognition using cepstral coefficients and temporal features,” in Proceedings of International Conference on Acoustics, Speech and Signal Processing, IEEE, 2000, pp. 753–756.Google Scholar
- I. Fujinaga and K. MacMillan, “Realtime recognition of orchestral instruments,” in Proceedings of International Computer Music Conference, 2000, pp. 141–143.Google Scholar
- K. Kashino, K. Nakadai, T. Kinoshita, and H. Tanaka, “Application of the bayesian probability network to music scene analysis,” in Computational Auditory Scene Analysis, edited by D. Rosenthal and H.~G. Okuno, Eds., Lawrence Erlbaum Associates, 1998, pp. 115–137.Google Scholar
- K.D. Martin, “Sound-Source Recognition: A Theory and Computational Model,” Ph.D. Thesis, MIT, 1999.Google Scholar
- K. Kashino and H. Murase, “A sound source identification system for ensemble music based on template adaptation and music stream extraction,” Speech Communication, vol. 27, nos. 3–4, pp. 337–349, 1999.Google Scholar
- M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, “RWC music database: Music genre database and musical instrument sound database,” in Proceedings of International Conference on Music Information Retrieval, 2003, pp. 229–230.Google Scholar
- D. Rosenthal and H.G. Okuno, eds. Computational Auditory Scene Analysis, Lawrence Erlbaum Associates, Mahwah, New Jersey, 1998.Google Scholar