Some Experiments in Humour Recognition Using the Italian Wikiquote Collection

  • Davide Buscaldi
  • Paolo Rosso
Conference paper

DOI: 10.1007/978-3-540-73400-0_58

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4578)
Cite this paper as:
Buscaldi D., Rosso P. (2007) Some Experiments in Humour Recognition Using the Italian Wikiquote Collection. In: Masulli F., Mitra S., Pasi G. (eds) Applications of Fuzzy Sets Theory. WILF 2007. Lecture Notes in Computer Science, vol 4578. Springer, Berlin, Heidelberg

Abstract

In this paper we present some results obtained in humour classification over a corpus of Italian quotations manually extracted and tagged from the Wikiquote project. The experiments were carried out using both a multinomial Naïve Bayes classifier and a Support Vector Machine (SVM). The features considered range from single words to n-grams and sentence length. The results show that it is possible to identify the funny quotes even with the simplest features (bag of words); the Bayesian classifier performed better than the SVM. However, the corpus is too small to support definitive assertions.
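
To make the simplest setting mentioned in the abstract more concrete, the following is a minimal sketch of a multinomial Naïve Bayes classifier over bag-of-words counts. It is not the authors' implementation: the whitespace tokenizer, the add-one smoothing, the class name `MultinomialNB`, the labels `funny` and `serious`, and the tiny English training sentences are all assumptions made for illustration, and none of the data comes from the Wikiquote corpus.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


class MultinomialNB:
    """Multinomial Naive Bayes over bag-of-words counts, add-one smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # word counts per class
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def log_scores(self, text):
        """Return the unnormalised log posterior of each class for `text`."""
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        scores = {}
        for label, n_docs in self.class_counts.items():
            total_words = sum(self.word_counts[label].values())
            score = math.log(n_docs / total_docs)  # log prior
            for tok in tokenize(text):
                if tok not in self.vocab:
                    continue  # skip words never seen during training
                count = self.word_counts[label][tok]
                score += math.log((count + 1) / (total_words + vocab_size))
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Hypothetical toy data, invented for this sketch only.
train_texts = [
    "i told a joke about my cat",
    "my cat laughs at every joke",
    "the treaty was signed in spring",
    "the council signed the law",
]
train_labels = ["funny", "funny", "serious", "serious"]

model = MultinomialNB().fit(train_texts, train_labels)
print(model.predict("a joke about the cat"))
```

Extending the sketch to n-grams would only require `tokenize` to emit word sequences instead of single words; sentence length would need to be added as a separate feature.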

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Davide Buscaldi (1)
  • Paolo Rosso (1)
  1. Dpto. de Sistemas Informáticos y Computación (DSIC), Universidad Politécnica de Valencia, Spain
