Music Summary Detection with State Space Embedding and Recurrence Plot

Gao, Yongwei; Shen, Yichun; Zhang, Xulong; Yu, Shuai; Li, Wei

doi:10.1007/978-981-13-8707-4_4

Yongwei Gao³⁸,
Yichun Shen³⁸,
Xulong Zhang³⁸,
Shuai Yu³⁸ &
…
Wei Li^38,39

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 568))

364 Accesses

Abstract

Automatic music summary detection is a task that identifies the most representative part of a song, facilitating users to retrieve the desired songs. In this paper, we propose a novel method based on state space embedding and recurrence plot. Firstly, an extended audio feature with state space embedding is extracted to construct a similarity matrix. Compared with the raw audio features, this extended feature is more robust against noise. Then recurrence plot based on global strategy is adopted to detect similar segment pairs within a song. Finally, we proposed to extract the most repeated part as a summary by selecting and merging the stripes containing the lowest distance in the similarity matrix under the constraints of slope and duration. Experimental results show that the performance of the proposed algorithm is more powerful than the other two competitive baseline methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://en.wikipedia.org/wiki/Song_structure.

References

Gao S, Li H (2015) Popular song summarization using chorus section detection from audio signal. In: Proceedings of the 17th international workshop on multimedia signal processing (MMSP), pp 1–6. IEEE, Xiamen, China
Google Scholar
Maddage NC, Xu C, Kankanhalli MS et al (2004) Content-based music structure analysis with applications to music semantics understanding. In: Proceedings of the 12th ACM international conference on multimedia (MM), pp 112–119. ACM, New York, USA
Google Scholar
Matthew C, Jonathan F (2002) Automatic music summarization via similarity analysis. In: Proceedings of the 3rd international society for music information retrieval (ISMIR), pp 122–127. Paris, France
Google Scholar
Bartsch MA, Wakefield GH (2005) Audio thumbnailing of popular music using chroma-based representations. IEEE Trans Multimedia (MM) 7(1):96–104
Article Google Scholar
Lu L, Zhang HJ (2003) Automated extraction of music snippets. In: Proceedings of the 11th ACM international conference on multimedia (MM), pp 140–147. ACM, CA, USA
Google Scholar
Chai W (2006) Semantic segmentation and summarization of music: methods based on tonality and recurrent structure. IEEE Signal Process Mag 23(2):124–132
Article MathSciNet Google Scholar
Nieto O, Humphrey EJ, Bello JP (2012) Compressing music recordings into audio summaries. In: Proceedings of 13th international society for music information retrieval (ISMIR), pp 313–318, Porto, Portugal (2012)
Google Scholar
Xu C, Maddage MC, Shao X (2005) Automatic music classification and summarization. IEEE Trans Speech Audio Process (TASLP) 13(3):441–450
Article Google Scholar
Xu C, Zhu Y, Tian Q (20025) Automatic music summarization based on temporal, spectral and cepstral features. In: Proceedings of international conference on multimedia and expo, pp 117–120, Lausanne, Switzerland
Google Scholar
Zlatintsi A, Maragos P, Potamianos A (2012) A saliency-based approach to audio event detection and summarization. In: Proceedings of the 20th European signal processing conference (EUSIPCO), pp 1294–1298, Bucharest, Romania
Google Scholar
Logan B, Chu S (2000) Music summarization using key phrases. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP), pp 749–752. Istanbul, Turkey
Google Scholar
Müller M, Ewert S (2010) Towards timbre-invariant audio features for harmony-based music. IEEE Trans Audio Speech Lang Process (TASLP) 18(3):649–662
Article Google Scholar
Müller M, Ewert S (2011) Chroma Toolbox: MATLAB implementations for extracting variants of chroma-based audio features. In: Proceedings of the 12th international conference on music information retrieval (ISMIR), pp 215–220, Miami, Florida
Google Scholar
Kantz H, Schreiber T (2004) Nonlinear time series analysis. Cambridge University Press, Cambridge, United Kingdom
Google Scholar
Bello JP (2011) Measuring structural similarity in music. IEEE Trans Audio Speech Lang Process (TASLP) 19(7):2013–2025
Article Google Scholar
Serrà J, Serra X, Andrzejak RG (2009) Cross recurrence quantification for cover song identification. New J Phys 11(9):093017
Article Google Scholar
Cho T, Bello JP (2011) A feature smoothing method for chord recognition using recurrence plots. In: Proceedings of the 12th international society for music information retrieval (ISMIR), pp 651–656, Miami, Florida
Google Scholar
Bertin-Mahieux T, Ellis DPW (2011) Large-scale cover song recognition using hashed chroma landmarks. In: Proceedings of IEEE workshop on applications of signal processing to audio and acoustics (WASPAA), pp 117–120, New York, USA
Google Scholar
Egorov A, Linetsky G (2008) Cover song identification with IF-F0 pitch class profiles. MIREX extended abstract
Google Scholar
Matthew C, Jonathan F (2003) Summarizing popular music via structural similarity analysis. In: Proceedings of IEEE workshop on applications of signal processing to audio and acoustics (WASPAA), pp 1159–1170, New York, USA (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Technology, Fudan University, Shanghai, 201203, China
Yongwei Gao, Yichun Shen, Xulong Zhang, Shuai Yu & Wei Li
Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, 201203, China
Wei Li

Authors

Yongwei Gao
View author publications
You can also search for this author in PubMed Google Scholar
Yichun Shen
View author publications
You can also search for this author in PubMed Google Scholar
Xulong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Shuai Yu
View author publications
You can also search for this author in PubMed Google Scholar
Wei Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wei Li .

Editor information

Editors and Affiliations

School of Computer Science and Technology, Fudan University, Shanghai, China
Wei Li
Beijing University of Posts and Telecommunications, Beijing, China
Shengchen Li
Institute of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, China
Xi Shao
China Conservatory of Music, Beijing, China
Zijin Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gao, Y., Shen, Y., Zhang, X., Yu, S., Li, W. (2019). Music Summary Detection with State Space Embedding and Recurrence Plot. In: Li, W., Li, S., Shao, X., Li, Z. (eds) Proceedings of the 6th Conference on Sound and Music Technology (CSMT). Lecture Notes in Electrical Engineering, vol 568. Springer, Singapore. https://doi.org/10.1007/978-981-13-8707-4_4

Download citation

DOI: https://doi.org/10.1007/978-981-13-8707-4_4
Published: 03 July 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-8706-7
Online ISBN: 978-981-13-8707-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics