Abstract
Researchers commonly use co-occurrence counts to assess the similarity of objects. This paper illustrates how traditional association measures can lead to misguided significance tests of co-occurrence in settings where the usual multinomial sampling assumptions do not hold. I propose a Monte Carlo permutation test that preserves the original distributions of the co-occurrence data. I illustrate the test on a dataset of organizational categorization, in which I investigate the relations between organizational categories (such as “Argentine restaurants” and “Steakhouses”).
Similar content being viewed by others
Notes
Obviously, the number of permutation needed depends on the sample size. Given that there are about 5,000 organizations with two or more categories, and given that most organizations are in two or three categories, 10,000 permutations are likely enough to arrive at random category associations.
References
Agresti, A.: A survey of exact inference for contingency tables. Stat. Sci. 7, 131–177 (1992)
Breiger, R.L.: The duality of persons and groups. Soc. Forces 53, 181–190 (1974)
Dean, J., Henzinger, M.R.: Finding related pages in the World Wide Web. Comput. Netw. 31, 1467–1479 (1999)
Garfield, E.: Citation analysis as a tool in journal evaluation. Science 178, 471–479 (1972)
Good, P.I.: Permutation, Parametric and Bootstrap Tests of Hypotheses. Springer, New York (2005)
Hubert, L.J.: Combinatorial data analysis. Psychometrika 50, 449–467 (1985)
Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, MA (1999)
Pearson, K.: On a criterion that a given system of deviations from the probable in the case of correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Philos. Mag. 5, 157–175 (1900)
Wickens, T.D.: Multiway Contingency Tables Analysis for the Social Sciences. Lawrence Erlbaum Associates, Hillsdale (1989)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kovács, B. A Monte Carlo permutation test for co-occurrence data. Qual Quant 48, 955–960 (2014). https://doi.org/10.1007/s11135-012-9817-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11135-012-9817-x