Some Experiments in Humour Recognition Using the Italian Wikiquote Collection

  • Davide Buscaldi
  • Paolo Rosso
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4578)


In this paper we present some results obtained in humour classification over a corpus of Italian quotations manually extracted and tagged from the Wikiquote project. The experiments were carried out using both a multinomial Naïve Bayes classifier and a Support Vector Machine (SVM). The features considered range from single words to n-grams and sentence length. The results show that it is possible to identify the funny quotes even with the simplest features (bag of words); the Bayesian classifier performed better than the SVM. However, the corpus is too small to support definitive assertions.
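To make the simplest setting concrete, the following is a minimal sketch of a multinomial Naïve Bayes classifier over bag-of-words features. It is not the authors' implementation: the tokenizer, the add-one smoothing, the function names (`tokenize`, `train`, `predict`), the label names, and the four English toy quotes are all assumptions made for illustration and are not taken from the Wikiquote corpus.

```python
import math
from collections import Counter, defaultdict

PUNCT = ".,;:!?\"'()"

def tokenize(text):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    tokens = (w.strip(PUNCT).lower() for w in text.split())
    return [t for t in tokens if t]

def train(docs, labels, alpha=1.0):
    """Fit a multinomial Naive Bayes model with add-alpha smoothing (assumed)."""
    word_counts = defaultdict(Counter)
    for doc, label in zip(docs, labels):
        word_counts[label].update(tokenize(doc))
    vocab = {w for counts in word_counts.values() for w in counts}
    class_counts = Counter(labels)
    model = {"vocab": vocab, "log_prior": {}, "log_lik": {}}
    for label, counts in word_counts.items():
        # Denominator: total tokens in the class plus smoothing mass.
        total = sum(counts.values()) + alpha * len(vocab)
        model["log_prior"][label] = math.log(class_counts[label] / len(labels))
        model["log_lik"][label] = {
            w: math.log((counts[w] + alpha) / total) for w in vocab
        }
    return model

def predict(model, text):
    """Return the label with the highest posterior log-score."""
    # Words never seen in training are ignored.
    tokens = [t for t in tokenize(text) if t in model["vocab"]]
    scores = {
        label: model["log_prior"][label]
        + sum(model["log_lik"][label][t] for t in tokens)
        for label in model["log_prior"]
    }
    return max(scores, key=scores.get)

# Hypothetical toy data, for illustration only.
docs = [
    "I can resist everything except temptation.",
    "I never forget a face, but in your case I will make an exception.",
    "Knowledge is power.",
    "The only true wisdom is in knowing you know nothing.",
]
labels = ["funny", "funny", "serious", "serious"]

model = train(docs, labels)
print(predict(model, "I can resist temptation"))
print(predict(model, "Wisdom is power"))
```

Extending `tokenize` to emit word n-grams, or adding a sentence-length bucket as an extra token, would be one way to approximate the richer feature sets mentioned in the abstract.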


Support Vector Machine, Natural Language Processing, Machine Translation, Polynomial Kernel, Sentence Length
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Davide Buscaldi (1)
  • Paolo Rosso (1)
  1. Dpto. de Sistemas Informáticos y Computación (DSIC), Universidad Politécnica de Valencia, Spain
