Melody Recognition with Learned Edit Distances

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5342)


In a music recognition task, the classification of a new melody is often achieved by looking for the closest piece in a set of already known prototypes. The definition of a relevant similarity measure becomes then a crucial point. So far, the edit distance approach with a-priori fixed operation costs has been one of the most used to accomplish the task. In this paper, the application of a probabilistic learning model to both string and tree edit distances is proposed and is compared to a genetic algorithm cost fitting approach. The results show that both learning models outperform fixed-costs systems, and that the probabilistic approach is able to describe consistently the underlying melodic similarity model.


Edit distance learning music similarity genetic algorithms probabilistic models 


  1. 1.
    Bouchard, G., Triggs, B.: The trade-off between generative and discriminative classifiers. In: COMPSTAT 2004. Springer, Heidelberg (2004)Google Scholar
  2. 2.
    Mongeau, M., Sankoff, D.: Comparison of musical sequences. Computers and the Humanities 24, 161–175 (1990)CrossRefGoogle Scholar
  3. 3.
    Gómez, C., Abad-Mota, S., Ruckhaus, E.: An Analysis of the Mongeau-Sankoff Algorithm for Music Information Retrieval. In: Proc. of the 8th International Conference on Music Information Retrieval (ISMIR), Vienna, Austria, pp. 109–110 (2007)Google Scholar
  4. 4.
    Grachten, M., Arcos, J., López de Mántaras, R.: Melody Retrieval using the Implication/Realization Model. In: Proceedings of the 6th International Conference on Music Information Retrieval, ISMIR 2005 (2005)Google Scholar
  5. 5.
    Rotstan, N., Meffert, K.: JGAP: Java Genetic Algorithms Package (accessed June 7, 2008),
  6. 6.
    Doraisamy, S., Rüger, M.: Robust Polyphonic Music Retrieval with N-grams. J. Intell. Inf. Syst. 21, 53–70 (2003)CrossRefGoogle Scholar
  7. 7.
    Rizo, D., Moreno-Seco, F., Iñesta, J.M.: Tree-structured representation of musical information. In: Perales, F.J., Campilho, A.C., Pérez, N., Sanfeliu, A. (eds.) IbPRIA 2003. Lecture Notes in Computer Science (LNAI), vol. 2652, pp. 838–846. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  8. 8.
    Illescas, P.R., Rizo, D., Iñesta, J.M.: Harmonic, melodic, and functional automatic analysis. In: Proc. International Computer Music Conference, pp. 165–168 (2007)Google Scholar
  9. 9.
    Aloupis, G., Fevens, T., Langerman, S., Matsui, T., Mesa, A., Nuñez, Y., Rappaport, D., Toussaint, G.: Algorithms for Computing Geometric Measures of Melodic Similarity. Computer Music Journal, Fall 2006 30(3), 67–76 (2006)Google Scholar
  10. 10.
    Bernard, M., Habrard, A., Sebban, M.: Learning stochastic tree edit distance. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS, vol. 4212, pp. 42–53. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  11. 11.
    Bernard, M., Janodet, J.-C., Sebban, M.: A discriminative model of stochastic edit distance in the form of a conditional transducer. In: Sakakibara, Y., Kobayashi, S., Sato, K., Nishino, T., Tomita, E. (eds.) ICGI 2006. LNCS, vol. 4201, pp. 240–252. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  12. 12.
    Dempster, A., Laird, M., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. B(39), 1–38 (1977)MathSciNetzbMATHGoogle Scholar
  13. 13.
    Durbin, R., Eddy, S., Krogh, A., Mitchison, G.: Biological sequence analysis. Cambridge University Press, Cambridge (1998)CrossRefzbMATHGoogle Scholar
  14. 14.
    Boyer, L., Habrard, A., Sebban, M.: Learning metrics between tree structured data: Application to image recognition. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS, vol. 4701, pp. 54–66. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  15. 15.
    McCallum, A., Bellare, K., Pereira, P.: A conditional random field for disciminatively-trained finite-state string edit distance. In: Proc. of UAI 2005, pp. 388–400 (2005)Google Scholar
  16. 16.
    Oncina, J., Sebban, M.: Learning stochastic edit distance: application in handwritten character recognition. J. of Pattern Recognition 39(9), 1575–1587 (2006)CrossRefzbMATHGoogle Scholar
  17. 17.
    Ristad, S.V., Yianilos, P.N.: Learning string-edit distance. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(5), 522–532 (1998)CrossRefGoogle Scholar
  18. 18.
    Selkow, S.M.: The tree-to-tree editing problem. Information Processing Letters 6(6), 184–186 (1977)MathSciNetCrossRefzbMATHGoogle Scholar
  19. 19.
    Zhang, K., Shasha, D.: Simple fast algorithms for the editing distance between trees and related problems. SIAM Journal of Computing, 1245–1262 (1989)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  1. 1.Laboratoire d’Informatique FondamentaleUniversité de ProvenceMarseille cedex 13France
  2. 2.Dept. Lenguajes y Sistemas InformáticosUniversidad de AlicanteAlicanteSpain
  3. 3.Laboratoire Hubert CurienUniversité de Saint-EtienneSaint-EtienneFrance

Personalised recommendations