Skip to main content

Music Summary Detection with State Space Embedding and Recurrence Plot

  • Conference paper
  • First Online:
Proceedings of the 6th Conference on Sound and Music Technology (CSMT)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 568))

  • 364 Accesses

Abstract

Automatic music summary detection is a task that identifies the most representative part of a song, facilitating users to retrieve the desired songs. In this paper, we propose a novel method based on state space embedding and recurrence plot. Firstly, an extended audio feature with state space embedding is extracted to construct a similarity matrix. Compared with the raw audio features, this extended feature is more robust against noise. Then recurrence plot based on global strategy is adopted to detect similar segment pairs within a song. Finally, we proposed to extract the most repeated part as a summary by selecting and merging the stripes containing the lowest distance in the similarity matrix under the constraints of slope and duration. Experimental results show that the performance of the proposed algorithm is more powerful than the other two competitive baseline methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://en.wikipedia.org/wiki/Song_structure.

References

  1. Gao S, Li H (2015) Popular song summarization using chorus section detection from audio signal. In: Proceedings of the 17th international workshop on multimedia signal processing (MMSP), pp 1–6. IEEE, Xiamen, China

    Google Scholar 

  2. Maddage NC, Xu C, Kankanhalli MS et al (2004) Content-based music structure analysis with applications to music semantics understanding. In: Proceedings of the 12th ACM international conference on multimedia (MM), pp 112–119. ACM, New York, USA

    Google Scholar 

  3. Matthew C, Jonathan F (2002) Automatic music summarization via similarity analysis. In: Proceedings of the 3rd international society for music information retrieval (ISMIR), pp 122–127. Paris, France

    Google Scholar 

  4. Bartsch MA, Wakefield GH (2005) Audio thumbnailing of popular music using chroma-based representations. IEEE Trans Multimedia (MM) 7(1):96–104

    Article  Google Scholar 

  5. Lu L, Zhang HJ (2003) Automated extraction of music snippets. In: Proceedings of the 11th ACM international conference on multimedia (MM), pp 140–147. ACM, CA, USA

    Google Scholar 

  6. Chai W (2006) Semantic segmentation and summarization of music: methods based on tonality and recurrent structure. IEEE Signal Process Mag 23(2):124–132

    Article  MathSciNet  Google Scholar 

  7. Nieto O, Humphrey EJ, Bello JP (2012) Compressing music recordings into audio summaries. In: Proceedings of 13th international society for music information retrieval (ISMIR), pp 313–318, Porto, Portugal (2012)

    Google Scholar 

  8. Xu C, Maddage MC, Shao X (2005) Automatic music classification and summarization. IEEE Trans Speech Audio Process (TASLP) 13(3):441–450

    Article  Google Scholar 

  9. Xu C, Zhu Y, Tian Q (20025) Automatic music summarization based on temporal, spectral and cepstral features. In: Proceedings of international conference on multimedia and expo, pp 117–120, Lausanne, Switzerland

    Google Scholar 

  10. Zlatintsi A, Maragos P, Potamianos A (2012) A saliency-based approach to audio event detection and summarization. In: Proceedings of the 20th European signal processing conference (EUSIPCO), pp 1294–1298, Bucharest, Romania

    Google Scholar 

  11. Logan B, Chu S (2000) Music summarization using key phrases. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP), pp 749–752. Istanbul, Turkey

    Google Scholar 

  12. Müller M, Ewert S (2010) Towards timbre-invariant audio features for harmony-based music. IEEE Trans Audio Speech Lang Process (TASLP) 18(3):649–662

    Article  Google Scholar 

  13. Müller M, Ewert S (2011) Chroma Toolbox: MATLAB implementations for extracting variants of chroma-based audio features. In: Proceedings of the 12th international conference on music information retrieval (ISMIR), pp 215–220, Miami, Florida

    Google Scholar 

  14. Kantz H, Schreiber T (2004) Nonlinear time series analysis. Cambridge University Press, Cambridge, United Kingdom

    Google Scholar 

  15. Bello JP (2011) Measuring structural similarity in music. IEEE Trans Audio Speech Lang Process (TASLP) 19(7):2013–2025

    Article  Google Scholar 

  16. Serrà J, Serra X, Andrzejak RG (2009) Cross recurrence quantification for cover song identification. New J Phys 11(9):093017

    Article  Google Scholar 

  17. Cho T, Bello JP (2011) A feature smoothing method for chord recognition using recurrence plots. In: Proceedings of the 12th international society for music information retrieval (ISMIR), pp 651–656, Miami, Florida

    Google Scholar 

  18. Bertin-Mahieux T, Ellis DPW (2011) Large-scale cover song recognition using hashed chroma landmarks. In: Proceedings of IEEE workshop on applications of signal processing to audio and acoustics (WASPAA), pp 117–120, New York, USA

    Google Scholar 

  19. Egorov A, Linetsky G (2008) Cover song identification with IF-F0 pitch class profiles. MIREX extended abstract

    Google Scholar 

  20. Matthew C, Jonathan F (2003) Summarizing popular music via structural similarity analysis. In: Proceedings of IEEE workshop on applications of signal processing to audio and acoustics (WASPAA), pp 1159–1170, New York, USA (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wei Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gao, Y., Shen, Y., Zhang, X., Yu, S., Li, W. (2019). Music Summary Detection with State Space Embedding and Recurrence Plot. In: Li, W., Li, S., Shao, X., Li, Z. (eds) Proceedings of the 6th Conference on Sound and Music Technology (CSMT). Lecture Notes in Electrical Engineering, vol 568. Springer, Singapore. https://doi.org/10.1007/978-981-13-8707-4_4

Download citation

Publish with us

Policies and ethics