Some Experiments in Humour Recognition Using the Italian Wikiquote Collection
In this paper we present some results on humour classification over a corpus of Italian quotations manually extracted from the Wikiquote project and tagged. The experiments were carried out using both a multinomial Naïve Bayes classifier and a Support Vector Machine (SVM). The features considered range from single words to n-grams and sentence length. The results show that it is possible to identify the funny quotes even with the simplest features (bag of words); the Bayesian classifier performed better than the SVM. However, the corpus is too small to support definitive assertions.
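As an illustration of the simplest setting mentioned above (bag-of-words features with a multinomial Naïve Bayes classifier), the following is a minimal sketch and not the authors' implementation. The whitespace tokenizer, the Laplace smoothing constant `alpha`, the helper names `tokenize` and `features`, and the four English training sentences are all assumptions introduced for illustration; they are invented placeholders, not items from the Wikiquote corpus. Sentence length and the SVM are not covered here.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def features(text, n=1):
    """Return all word n-grams of order 1..n as a list (a bag of features)."""
    tokens = tokenize(text)
    feats = []
    for k in range(1, n + 1):
        feats.extend(" ".join(tokens[i:i + k]) for i in range(len(tokens) - k + 1))
    return feats


class MultinomialNB:
    """Multinomial Naive Bayes with add-alpha (Laplace) smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, texts, labels, n=1):
        self.n = n
        self.class_counts = Counter(labels)      # documents per class
        self.feat_counts = defaultdict(Counter)  # feature counts per class
        self.vocab = set()
        for text, label in zip(texts, labels):
            feats = features(text, n)
            self.feat_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def log_scores(self, text):
        """Return the unnormalised log posterior of each class for one text."""
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        scores = {}
        for label, doc_count in self.class_counts.items():
            counts = self.feat_counts[label]
            denom = sum(counts.values()) + self.alpha * vocab_size
            score = math.log(doc_count / total_docs)  # class prior
            for feat in features(text, self.n):
                if feat in self.vocab:  # features unseen in training are skipped
                    score += math.log((counts[feat] + self.alpha) / denom)
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Invented toy data, for illustration only (not from the Wikiquote corpus).
train_texts = [
    "i told my computer a joke but it did not laugh",
    "my dog ate my homework and then asked for dessert",
    "the treaty was signed in the spring of that year",
    "knowledge is the result of patient study",
]
train_labels = ["funny", "funny", "serious", "serious"]

model = MultinomialNB().fit(train_texts, train_labels)
print(model.predict("my computer ate my homework"))
```

Passing `n=2` to `fit` adds word bigrams to the feature set, which corresponds to the n-gram features mentioned in the abstract. With so few training sentences the predictions carry no evidential weight; the sketch only shows the mechanics.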