Stochastic Text Models for Music Categorization

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5342)


Music genre meta-data is of paramount importance for the organization of music repositories. People use genre in a natural way when entering a music store or looking into music collections. Automatic genre classification has become a popular topic in music information retrieval research. This work brings to symbolic music recognition some technologies, like the stochastic language models, already successfully applied to text categorization. In this work we model chord progressions and melodies as n-grams and strings and then apply perplexity and naïve Bayes classifiers, respectively, in order to assess how often those structures are found in the target genres. Also a combination of the different techniques as an ensemble of classifiers is proposed. Some genres and sub-genres among popular, jazz, and academic music have been considered. The results show that the ensemble is a good trade-off approach able to perform well without the risk of choosing the wrong classifier.


  1. 1.
    Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Transactions on Speech and Audio Processing 10, 205–210 (2002)CrossRefGoogle Scholar
  2. 2.
    McKay, C., Fujinaga, I.: Automatic genre classification using large high-level musical feature sets. In: Proceedings of the ISMIR, pp. 525–530 (2004)Google Scholar
  3. 3.
    Jelinek, F.: Statistical Methods for Speech Recognition. MIT Press, Cambridge (1998)Google Scholar
  4. 4.
    Cavnar, W.B., Trenkle, J.M.: N-gram-based text categorization. In: Proceedings of SDAIR 1994, 3rd Annual Symposium on Document Analysis and Information Retrieval, Las Vegas, US, pp. 161–175 (1994)Google Scholar
  5. 5.
    Pérez-Sancho, C., Iñesta, J.M., Calera-Rubio, J.: Style recognition through statistical event models. Journal of New Music Research 34, 331–340 (2005)CrossRefGoogle Scholar
  6. 6.
    Tzanetakis, G., Ermolinskyi, A., Cook, P.: Pitch histograms in audio and symbolic music information retrieval. In: Proceedings of the ISMIR, Paris, France (2002)Google Scholar
  7. 7.
    Cruz-Alcázar, P.P., Vidal, E., Pérez-Cortes, J.C.: Musical style identification using grammatical inference: The encoding problem. In: Sanfeliu, A., Ruiz-Shulcloper, J. (eds.) CIARP 2003. LNCS, vol. 2905, pp. 375–382. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  8. 8.
    Mccallum, A., Nigam, K.: A comparison of event models for naive bayes text classification (1998)Google Scholar
  9. 9.
    Domingos, P., Pazzani, M.: Beyond independence: conditions for the optimality of simple bayesian classifier. Machine Learning 29, 103–130 (1997)CrossRefzbMATHGoogle Scholar
  10. 10.
    Cover, T.M., Thomas, J.A.: Elements of Information Theory. John Wiley, Chichester (1991)CrossRefzbMATHGoogle Scholar
  11. 11.
    Witten, I., Bell, T.: The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression. IEEE Transactions on Information Theory 37, 1085–1094 (1991)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  1. 1.Department of Software and Computing SystemsUniversity of AlicanteAlicanteSpain

Personalised recommendations