Detecting Large Concept Extensions for Conceptual Analysis
When performing a conceptual analysis of a concept, philosophers are interested in all forms of expression of that concept in a text, whether direct or indirect, explicit or implicit. In this paper, we experiment with topic-based methods of automating the detection of concept expressions in order to facilitate philosophical conceptual analysis. We propose six methods based on latent Dirichlet allocation (LDA) and evaluate them on a new corpus of court decisions that we had annotated by experts and non-experts. Our results indicate that these methods can yield important improvements over the keyword heuristic, which is often used for concept detection in many contexts. While more work remains to be done, this indicates that detecting concepts through topics can serve as a general-purpose method for at least some forms of concept expression that are not captured by naive keyword approaches.
Keywords: Concept mining, Topic models, Conceptual analysis
This work is supported by research grants from the Natural Sciences and Engineering Research Council of Canada (NSERC) and from the Social Sciences and Humanities Research Council of Canada (SSHRC).