Pattern Induction and Matching in Music Signals

Klapuri, Anssi

doi:10.1007/978-3-642-23126-1_13

Anssi Klapuri²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6684))

Included in the following conference series:

International Symposium on Computer Music Modeling and Retrieval

1225 Accesses
2 Citations

Abstract

This paper discusses techniques for pattern induction and matching in musical audio. At all levels of music - harmony, melody, rhythm, and instrumentation - the temporal sequence of events can be subdivided into shorter patterns that are sometimes repeated and transformed. Methods are described for extracting such patterns from musical audio signals (pattern induction) and computationally feasible methods for retrieving similar patterns from a large database of songs (pattern matching).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Abesser, J., Lukashevich, H., Dittmar, C., Schuller, G.: Genre classification using bass-related high-level features and playing styles. In: Intl. Society on Music Information Retrieval Conference, Kobe, Japan (2009)
Google Scholar
Badeau, R., Emiya, V., David, B.: Expectation-maximization algorithm for multi-pitch estimation and separation of overlapping harmonic spectra. In: Proc. IEEE ICASSP, Taipei, Taiwan, pp. 3073–3076 (2009)
Google Scholar
Barbour, J.: Analytic listening: A case study of radio production. In: International Conference on Auditory Display, Sydney, Australia (July 2004)
Google Scholar
Barry, D., Lawlor, B., Coyle, E.: Sound source separation: Azimuth discrimination and resynthesis. In: 7th International Conference on Digital Audio Effects, Naples, Italy, pp. 240–244 (October 2004)
Google Scholar
Bartsch, M.A., Wakefield, G.H.: To catch a chorus: Using chroma-based representations for audio thumbnailing. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, USA, pp. 15–18 (2001)
Google Scholar
Begleiter, R., El-Yaniv, R., Yona, G.: On prediction using variable order Markov models. J. of Artificial Intelligence Research 22, 385–421 (2004)
MathSciNet MATH Google Scholar
Bertin-Mahieux, T., Weiss, R.J., Ellis, D.P.W.: Clustering beat-chroma patterns in a large music database. In: Proc. of the Int. Society for Music Information Retrieval Conference, Utrecht, Netherlands (2010)
Google Scholar
Bever, T.G., Chiarello, R.J.: Cerebral dominance in musicians and nonmusicians. The Journal of Neuropsychiatry and Clinical Neurosciences 21(1), 94–97 (2009)
Article Google Scholar
Brown, J.C.: Calculation of a constant Q spectral transform. J. Acoust. Soc. Am. 89(1), 425–434 (1991)
Article Google Scholar
Burred, J., Röbel, A., Sikora, T.: Dynamic spectral envelope modeling for the analysis of musical instrument sounds. IEEE Trans. Audio, Speech, and Language Processing (2009)
Google Scholar
de Cheveigné, A.: Multiple F0 estimation. In: Wang, D., Brown, G.J. (eds.) Computational Auditory Scene Analysis: Principles, Algorithms and Applications. Wiley–IEEE Press (2006)
Google Scholar
Dannenberg, R.B., Goto, M.: Music structure analysis from acoustic signals. In: Havelock, D., Kuwano, S., Vorländer, M. (eds.) Handbook of Signal Processing in Acoustics, pp. 305–331. Springer, Heidelberg (2009)
Google Scholar
Dannenberg, R.B., Hu, N.: Pattern discovery techniques for music audio. Journal of New Music Research 32(2), 153–163 (2003)
Article Google Scholar
Datar, M., Immorlica, N., Indyk, P., Mirrokni, V.: Locality-sensitive hashing scheme based on p-stable distributions. In: ACM Symposium on Computational Geometry, pp. 253–262 (2004)
Google Scholar
Dixon, S., Pampalk, E., Widmer, G.: Classification of dance music by periodicity patterns. In: 4th International Conference on Music Information Retrieval, Baltimore, MD, pp. 159–165 (2003)
Google Scholar
Downie, J.S.: The music information retrieval evaluation exchange (2005–2007): A window into music information retrieval research. Acoustical Science and Technology 29(4), 247–255 (2008)
Article Google Scholar
Dressler, K.: An auditory streaming approach on melody extraction. In: Intl. Conf. on Music Information Retrieval, Victoria, Canada (2006); MIREX evaluation
Google Scholar
Duda, A., Nürnberger, A., Stober, S.: Towards query by humming/singing on audio databases. In: International Conference on Music Information Retrieval, Vienna, Austria, pp. 331–334 (2007)
Google Scholar
Durrieu, J.L., Ozerov, A., Févotte, C., Richard, G., David, B.: Main instrument separation from stereophonic audio signals using a source/filter model. In: Proc. EUSIPCO, Glasgow, Scotland (August 2009)
Google Scholar
Durrieu, J.L., Richard, G., David, B., Fevotte, C.: Source/filter model for unsupervised main melody extraction from polyphonic audio signals. IEEE Trans. on Audio, Speech, and Language Processing 18(3), 564–575 (2010)
Article Google Scholar
Ellis, D., Arroyo, J.: Eigenrhythms: Drum pattern basis sets for classification and generation. In: International Conference on Music Information Retrieval, Barcelona, Spain
Google Scholar
Ellis, D.P.W., Poliner, G.: Classification-based melody transcription. Machine Learning 65(2-3), 439–456 (2006)
Article Google Scholar
FitzGerald, D., Cranitch, M., Coyle, E.: Extended nonnegative tenson factorisation models for musical source separation. Computational Intelligence and Neuroscience (2008)
Google Scholar
Fujihara, H., Goto, M.: A music information retrieval system based on singing voice timbre. In: Intl. Conf. on Music Information Retrieval, Vienna, Austria (2007)
Google Scholar
Gersho, A., Gray, R.: Vector Quantization and Signal Compression. Kluwer Academic Publishers, Dordrecht (1991)
MATH Google Scholar
Ghias, A., Logan, J., Chamberlin, D.: Query by humming: Musical information retrieval in an audio database. In: ACM Multimedia Conference 1995. Cornell University, San Fransisco (1995)
Google Scholar
Goto, M.: A chorus-section detecting method for musical audio signals. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, Hong Kong, China, vol. 5, pp. 437–440 (April 2003)
Google Scholar
Goto, M.: A real-time music scene description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals. Speech Communication 43(4), 311–329 (2004)
Article Google Scholar
Guo, L., He, X., Zhang, Y., Lu, Y.: Content-based retrieval of polyphonic music objects using pitch contour. In: IEEE International Conference on Audio, Speech and Signal Processing, Las Vegas, USA, pp. 2205–2208 (2008)
Google Scholar
Hainsworth, S.W., Macleod, M.D.: Automatic bass line transcription from polyphonic music. In: International Computer Music Conference, Havana, Cuba, pp. 431–434 (2001)
Google Scholar
Helén, M., Virtanen, T.: Separation of drums from polyphonic music using non-negtive matrix factorization and support vector machine. In: European Signal Processing Conference, Antalya, Turkey (2005)
Google Scholar
Jang, J.S.R., Gao, M.Y.: A query-by-singing system based on dynamic programming. In: International Workshop on Intelligent Systems Resolutions (2000)
Google Scholar
Jang, J.S.R., Hsu, C.L., Lee, H.R.: Continuous HMM and its enhancement for singing/humming query retrieval. In: 6th International Conference on Music Information Retrieval, London, UK (2005)
Google Scholar
Jensen, K.: Multiple scale music segmentation using rhythm, timbre, and harmony. EURASIP Journal on Advances in Signal Processing (2007)
Google Scholar
Jurafsky, D., Martin, J.H.: Speech and language processing. Prentice Hall, New Jersey (2000)
Google Scholar
Kitahara, T., Goto, M., Komatani, K., Ogata, T., Okuno, H.G.: Instrogram: Probabilistic representation of instrument existence for polyphonic music. IPSJ Journal 48(1), 214–226 (2007)
Google Scholar
Klapuri, A.: A method for visualizing the pitch content of polyphonic music signals. In: Intl. Society on Music Information Retrieval Conference, Kobe, Japan (2009)
Google Scholar
Klapuri, A., Davy, M. (eds.): Signal Processing Methods for Music Transcription. Springer, New York (2006)
Google Scholar
Klapuri, A., Eronen, A., Astola, J.: Analysis of the meter of acoustic musical signals. IEEE Trans. Speech and Audio Processing 14(1) (2006)
Google Scholar
Lartillot, O., Dubnov, S., Assayag, G., Bejerano, G.: Automatic modeling of musical style. In: International Computer Music Conference (2001)
Google Scholar
Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401, 788–791 (1999)
Article MATH Google Scholar
Lemström, K.: String Matching Techniques for Music Retrieval. Ph.D. thesis, University of Helsinki (2000)
Google Scholar
Lerdahl, F., Jackendoff, R.: A Generative Theory of Tonal Music. MIT Press, Cambridge (1983)
Google Scholar
Leveau, P., Vincent, E., Richard, G., Daudet, L.: Instrument-specific harmonic atoms for mid-level music representation. IEEE Trans. Audio, Speech, and Language Processing 16(1), 116–128 (2008)
Article Google Scholar
Li, Y., Wang, D.L.: Separation of singing voice from music accompaniment for monaural recordings. IEEE Trans. on Audio, Speech, and Language Processing 15(4), 1475–1487 (2007)
Article Google Scholar
Marolt, M.: Audio melody extraction based on timbral similarity of melodic fragments. In: EUROCON (November 2005)
Google Scholar
Mauch, M., Noland, K., Dixon, S.: Using musical structure to enhance automatic chord transcription. In: Proc. 10th Intl. Society for Music Information Retrieval Conference, Kobe, Japan (2009)
Google Scholar
McNab, R., Smith, L., Witten, I., Henderson, C., Cunningham, S.: Towards the digital music library: Tune retrieval from acoustic input. In: First ACM International Conference on Digital Libraries, pp. 11–18 (1996)
Google Scholar
Meek, C., Birmingham, W.: Applications of binary classification and adaptive boosting to the query-by-humming problem. In: Intl. Conf. on Music Information Retrieval, Paris, France (2002)
Google Scholar
Müller, M., Ewert, S., Kreuzer, S.: Making chroma features more robust to timbre changes. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, pp. 1869–1872 (April 2009)
Google Scholar
Nishimura, T., Hashiguchi, H., Takita, J., Zhang, J.X., Goto, M., Oka, R.: Music signal spotting retrieval by a humming query using start frame feature dependent continuous dynamic programming. In: 2nd Annual International Symposium on Music Information Retrieval, Bloomington, Indiana, USA, pp. 211–218 (October 2001)
Google Scholar
Ono, N., Miyamoto, K., Roux, J.L., Kameoka, H., Sagayama, S.: Separation of a monaural audio signal into harmonic/percussive components by complementary diffucion on spectrogram. In: European Signal Processing Conference, Lausanne, Switzerland, pp. 240–244 (August 2008)
Google Scholar
Ozerov, A., Philippe, P., Bimbot, F., Gribonval, R.: Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs. IEEE Trans. on Audio, Speech, and Language Processing 15(5), 1564–1578 (2007)
Article Google Scholar
Paiva, R.P., Mendes, T., Cardoso, A.: On the detection of melody notes in polyphonic audio. In: 6th International Conference on Music Information Retrieval, London, UK, pp. 175–182
Google Scholar
Paulus, J.: Signal Processing Methods for Drum Transcription and Music Structure Analysis. Ph.D. thesis, Tampere University of Technology (2009)
Google Scholar
Paulus, J., Klapuri, A.: Measuring the similarity of rhythmic patterns. In: Intl. Conf. on Music Information Retrieval, Paris, France (2002)
Google Scholar
Paulus, J., Müller, M., Klapuri, A.: Audio-based music structure analysis. In: Proc. of the Int. Society for Music Information Retrieval Conference, Utrecht, Netherlands (2010)
Google Scholar
Paulus, J., Virtanen, T.: Drum transcription with non-negative spectrogram factorisation. In: European Signal Processing Conference, Antalya, Turkey (September 2005)
Google Scholar
Peeters, G.: Sequence representations of music structure using higher-order similarity matrix and maximum-likelihood approach. In: Intl. Conf. on Music Information Retrieval, Vienna, Austria, pp. 35–40 (2007)
Google Scholar
Peeters, G.: A large set of audio features for sound description (similarity and classification) in the CUIDADO project. Tech. rep., IRCAM, Paris, France (April 2004)
Google Scholar
Poliner, G., Ellis, D., Ehmann, A., Gómez, E., Streich, S., Ong, B.: Melody transcription from music audio: Approaches and evaluation. IEEE Trans. on Audio, Speech, and Language Processing 15(4), 1247–1256 (2007)
Article Google Scholar
Purwins, H.: Profiles of Pitch Classes – Circularity of Relative Pitch and Key: Experiments, Models, Music Analysis, and Perspectives. Ph.D. thesis, Berlin University of Technology (2005)
Google Scholar
Rowe, R.: Machine musicianship. MIT Press, Cambridge (2001)
Google Scholar
Ryynänen, M., Klapuri, A.: Query by humming of MIDI and audio using locality sensitive hashing. In: IEEE International Conference on Audio, Speech and Signal Processing, Las Vegas, USA, pp. 2249–2252
Google Scholar
Ryynänen, M., Klapuri, A.: Transcription of the singing melody in polyphonic music. In: Intl. Conf. on Music Information Retrieval, Victoria, Canada, pp. 222–227 (2006)
Google Scholar
Ryynänen, M., Klapuri, A.: Automatic bass line transcription from streaming polyphonic audio. In: IEEE International Conference on Audio, Speech and Signal Processing, pp. 1437–1440 (2007)
Google Scholar
Ryynänen, M., Klapuri, A.: Automatic transcription of melody, bass line, and chords in polyphonic music. Computer Music Journal 32(3), 72–86 (2008)
Article Google Scholar
Schörkhuber, C., Klapuri, A.: Constant-Q transform toolbox for music processing. In: 7th Sound and Music Computing Conference, Barcelona, Spain (2010)
Google Scholar
Selfridge-Field, E.: Conceptual and representational issues in melodic comparison. Computing in Musicology 11, 3–64 (1998)
Google Scholar
Serra, J., Gomez, E., Herrera, P., Serra, X.: Chroma binary similarity and local alignment applied to cover song identification. IEEE Trans. on Audio, Speech, and Language Processing 16, 1138–1152 (2007)
Article Google Scholar
Serra, X.: Musical sound modeling with sinusoids plus noise. In: Roads, C., Pope, S., Picialli, A., Poli, G.D. (eds.) Musical Signal Processing, Swets & Zeitlinger (1997)
Google Scholar
Song, J., Bae, S.Y., Yoon, K.: Mid-level music melody representation of polyphonic audio for query-by-humming system. In: Intl. Conf. on Music Information Retrieval, Paris, France, pp. 133–139 (October 2002)
Google Scholar
Tokuda, K., Kobayashi, T., Masuko, T., Imai, S.: Mel-generalized cepstral analysis – a unified approach to speech spectral estimation. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, Adelaide, Australia (1994)
Google Scholar
Typke, R.: Music Retrieval based on Melodic Similarity. Ph.D. thesis, Universiteit Utrecht (2007)
Google Scholar
Vincent, E., Bertin, N., Badeau, R.: Harmonic and inharmonic nonnegative matrix factorization for polyphonic pitch transcription. In: IEEE ICASSP, Las Vegas, USA (2008)
Google Scholar
Virtanen, T.: Unsupervised learning methods for source separation in monaural music signals. In: Klapuri, A., Davy, M. (eds.) Signal Processing Methods for Music Transcription, pp. 267–296. Springer, Heidelberg (2006)
Chapter Google Scholar
Virtanen, T.: Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria. IEEE Trans. Audio, Speech, and Language Processing 15(3), 1066–1074 (2007)
Article Google Scholar
Virtanen, T., Mesaros, A., Ryynänen, M.: Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music. In: ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, Brisbane, Australia (September 2008)
Google Scholar
Wang, L., Huang, S., Hu, S., Liang, J., Xu, B.: An effective and efficient method for query by humming system based on multi-similarity measurement fusion. In: International Conference on Audio, Language and Image Processing, pp. 471–475 (July 2008)
Google Scholar
Welch, T.A.: A technique for high-performance data compression. Computer 17(6), 8–19 (1984)
Article Google Scholar
Wu, X., Li, M., Yang, J., Yan, Y.: A top-down approach to melody match in pitch countour for query by humming. In: International Conference of Chinese Spoken Language Processing (2006)
Google Scholar
Yeh, C.: Multiple fundamental frequency estimation of polyphonic recordings. Ph.D. thesis, University of Paris VI (2008)
Google Scholar
Yilmaz, O., Richard, S.: Blind separation of speech mixtures via time-frequency masking. IEEE Trans. on Signal Processing 52(7), 1830–1847 (2004)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Centre for Digital Music, Queen Mary University of London, Mile End Road, E1 4NS, London, United Kingdom
Anssi Klapuri

Authors

Anssi Klapuri
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

CNRS - LMA, 31 Chemin Joseph Aiguier, 13402, Marseille Cedex 20, France
Sølvi Ystad
CNRS-INCM, 31 Chemin Joseph Aiguier, 13402, Marseille Cedex 20, France
Mitsuko Aramaki
CNRS-LMA, 31 Chemin Joseph Aiguier, 13402, Marseille Cedex 20, France
Richard Kronland-Martinet
Aalborg University Esbjerg, Niels Bohr Vej 8, 6700, Esbjerg, Denmark
Kristoffer Jensen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Klapuri, A. (2011). Pattern Induction and Matching in Music Signals. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K. (eds) Exploring Music Contents. CMMR 2010. Lecture Notes in Computer Science, vol 6684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23126-1_13

Download citation

DOI: https://doi.org/10.1007/978-3-642-23126-1_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23125-4
Online ISBN: 978-3-642-23126-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics