Applications of Duplicate Detection in Music Archives: From Metadata Comparison to Storage Optimisation

The Case of the Belgian Royal Museum for Central Africa
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 806)


This work focuses on applications of duplicate detection for managing digital music archives. It aims to make this mature music information retrieval (MIR) technology better known to archivists and provide clear suggestions on how this technology can be used in practice. More specifically applications are discussed to complement meta-data, to link or merge digital music archives, to improve listening experiences and to re-use segmentation data. To illustrate the effectiveness of the technology a case study is explored. The case study identifies duplicates in the archive of the Royal Museum for Central Africa, which mainly contains field recordings of Central Africa. Duplicate detection is done with an existing Open Source acoustic fingerprinter system. In the set, 2.5% of the recordings are duplicates. It is found that meta-data differs dramatically between original and duplicate showing that merging meta-data could improve the quality of descriptions. The case study also shows that duplicates can be identified even if recording speed is not the same for original and duplicate.


MIR applications Documentation Collaboration Digital music archives 



This work was partially supported by the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 703937 and partly supported by an FWO Methusalem project titled Expressive Music Interaction.

Supplementary material (20.5 mb)
Supplementary material 1 (zip 21022 KB)


  1. 1.
    Orio, N.: Searching and classifying affinities in a web music collection. In: Agosti, M., Bertini, M., Ferilli, S., Marinai, S., Orio, N. (eds.) IRCDL 2016. CCIS, vol. 701, pp. 59–70. Springer, Cham (2017). CrossRefGoogle Scholar
  2. 2.
    Cano, P., Batlle, E., Kalker, T., Haitsma, J.: A review of audio fingerprinting. J. VLSI Signal Process. 41, 271–284 (2005)CrossRefGoogle Scholar
  3. 3.
    IFLA - Audiovisual and Multimedia Section: Guidelines for digitization projects: for collections and holdings in the public domain, particularly those held by libraries and archives. Technical report, International Federation of Library Associations and Institutions (IFLA), Paris, France, March 2002Google Scholar
  4. 4.
    IASA-TC 2004: Guidelines on the Production and Preservation of Digital Objects. IASA Technical Committee (2004)Google Scholar
  5. 5.
    Boston, G.: Safeguarding the Documentary Heritage. A guide to Standards, Recommended Practices and Reference Literature Related to the Preservation of Documents of all kinds. UNESCO (1998)Google Scholar
  6. 6.
    Bressan, F., Canazza, S., Vets, T., Leman, M.: Hermeneutic implications of cultural encoding: a reflection on audio recordings and interactive installation art. In: Agosti, M., Bertini, M., Ferilli, S., Marinai, S., Orio, N. (eds.) IRCDL 2016. CCIS, vol. 701, pp. 47–58. Springer, Cham (2017). CrossRefGoogle Scholar
  7. 7.
    Wang, A.L.C.: An industrial-strength audio search algorithm. In: Proceedings of the 4th International Symposium on Music Information Retrieval (ISMIR 2003), pp. 7–13 (2003)Google Scholar
  8. 8.
    Haitsma, J., Kalker, T.: A highly robust audio fingerprinting system. In: Proceedings of the 3th International Symposium on Music Information Retrieval (ISMIR 2002) (2002)Google Scholar
  9. 9.
    Ellis, D., Whitman, B., Porter, A.: Echoprint - an open music identification service. In: Proceedings of the 12th International Symposium on Music Information Retrieval (ISMIR 2011) (2011)Google Scholar
  10. 10.
    Fenet, S., Richard, G., Grenier, Y.: A scalable audio fingerprint method with robustness to pitch-shifting. In: Proceedings of the 12th International Symposium on Music Information Retrieval (ISMIR 2011), pp. 121–126 (2011)Google Scholar
  11. 11.
    Bellettini, C., Mazzini, G.: Reliable automatic recognition for pitch-shifted audio. In: Proceedings of 17th International Conference on Computer Communications and Networks (ICCCN 2008), pp. 838–843. IEEE (2008)Google Scholar
  12. 12.
    Ramona, M., Peeters, G.: AudioPrint: an efficient audio fingerprint system based on a novel cost-less synchronization scheme. In: Proceedings of the 2013 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP 2013), pp. 818–822 (2013)Google Scholar
  13. 13.
    Zhu, B., Li, W., Wang, Z., Xue, X.: A novel audio fingerprinting method robust to time scale modification and pitch shifting. In: Proceedings of the international conference on Multimedia (MM 2010), pp. 987–990. ACM (2010)Google Scholar
  14. 14.
    Malekesmaeili, M., Ward, R.K.: A local fingerprinting approach for audio copy detection. Computing Research Repository (CoRR) abs/1304.0793 (2013)Google Scholar
  15. 15.
    Six, J., Leman, M.: Panako - a scalable acoustic fingerprinting system handling time-scale and pitch modification. In: Proceedings of the 15th ISMIR Conference (ISMIR 2014), pp. 1–6 (2014)Google Scholar
  16. 16.
    Sonnleitner, R., Widmer, G.: Quad-based audio fingerprinting robust to time and frequency scaling. In: Proceedings of the 17th International Conference on Digital Audio Effects (DAFx-2014) (2014)Google Scholar
  17. 17.
    Cornelis, O., De Caluwe, R., Detré, G., Hallez, A., Leman, M., Matthé, T., Moelants, D., Gansemans, J.: Digitisation of the ethnomusicological sound archive of the RMCA. IASA J. 26, 35–44 (2005)Google Scholar
  18. 18.
    Cornelis, O., Lesaffre, M., Moelants, D., Leman, M.: Access to ethnic music: advances and perspectives in content-based music information retrieval. Sig. Process. 90(4), 1008–1031 (2010). Special Section: Ethnic Music Audio Documents: From the Preservation to the FruitionCrossRefzbMATHGoogle Scholar
  19. 19.
    Six, J., Cornelis, O., Leman, M.: TarsosDSP, a real-time audio processing framework in Java. In: Proceedings of the 53rd AES Conference (AES 53rd), The Audio Engineering Society (2014)Google Scholar

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  1. 1.IPEMGhent UniversityGhentBelgium

Personalised recommendations