Abstract
This paper describes a method for labelling structural parts of a musical piece. Existing methods for the analysis of piece structure often name the parts with musically meaningless tags, e.g., “p1”, “p2”, “p3”. Given a sequence of these tags as an input, the proposed system assigns musically more meaningful labels to these; e.g., given the input “p1, p2, p3, p2, p3” the system might produce “intro, verse, chorus, verse, chorus”. The label assignment is chosen by scoring the resulting label sequences with Markov models. Both traditional and variable-order Markov models are evaluated for the sequence modelling. Search over the label permutations is done with N-best variant of token passing algorithm. The proposed method is evaluated with leave-one-out cross-validations on two large manually annotated data sets of popular music. The results show that Markov models perform well in the desired task.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Peeters, G.: Deriving musical structure from signal analysis for music audio summary generation: ”sequence” and ”state” approach. In: Wiil, U.K. (ed.) CMMR 2003. LNCS, vol. 2771, pp. 143–166. Springer, Heidelberg (2004)
Ong, B.S.: Structural analysis and segmentation of musical signals. Ph.D thesis, Universitat Pompeu Fabra, Barcelona (2006)
Shiu, Y., Jeong, H., Kuo, C.C.J.: Musical structure analysis using similarity matrix and dynamic programming. In: Proc. of SPIE. Multimedia Systems and Applications VIII, vol. 6015 (2005)
Maddage, N.C.: Automatic structure detection for popular music. IEEE Multimedia 13(1), 65–77 (2006)
Goto, M.: A chorus-section detecting method for musical audio signals. In: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing, Hong Kong, pp. 437–440 (2003)
Boutard, G., Goldszmidt, S., Peeters, G.: Browsing inside a music track, the experimentation case study. In: Proc. of 1st Workshop on Learning the Semantics of Audio Signals, Athens, pp. 87–94 (December 2006)
Pollack, A.W.: ‘Notes on...’ series. The Official rec.music.beatles Home Page (1989-2001), http://www.recmusicbeatles.com
Jurafsky, D., Martin, J.H.: Speech and language processing. Prentice-Hall, New Jersey (2000)
Young, S.J., Russell, N.H., Thornton, J.H.S.: Token passing: a simple conceptual model for connected speech recognition systems. Technical Report CUED/F-INFENG/TR38, Cambridge University Engineering Department, Cambridge, UK (July 1989)
Ron, D., Singer, Y., Tishby, N.: The power of amnesia: Learning probabilistic automata with variable memory length. Machine Learning 25(2–3), 117–149 (1996)
Witten, I.H., Bell, T.C.: The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression. IEEE Transcations on Information Theory 37(4), 1085–1094 (1991)
Begleiter, R., El-Yaniv, R., Yona, G.: On prediction using variable order Markov models. Journal of Artificial Intelligence Research 22, 385–421 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Paulus, J., Klapuri, A. (2009). Labelling the Structural Parts of a Music Piece with Markov Models. In: Ystad, S., Kronland-Martinet, R., Jensen, K. (eds) Computer Music Modeling and Retrieval. Genesis of Meaning in Sound and Music. CMMR 2008. Lecture Notes in Computer Science, vol 5493. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02518-1_11
Download citation
DOI: https://doi.org/10.1007/978-3-642-02518-1_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02517-4
Online ISBN: 978-3-642-02518-1
eBook Packages: Computer ScienceComputer Science (R0)