Use of Poset Theory with Big Datasets: A New Proposal Applied to the Analysis of Life Satisfaction in Italy
- 191 Downloads
The aim of this work is to propose a tool for measuring a complex concept, and to apply it to big sets of data measured on ordinal and/or dichotomous scales. An important field of application are subjective data, that are often based on opinions or personal evaluations. Many national and international surveys employ this kind of data, measured among thousand of individuals. Thanks to the use of the “average rank” as a synthetic measure of a complex concept, we believe that poset theory could be a very useful approach for dealing with ordinal data avoiding the use of scaling procedures. Because classic poset approaches are at their best when applied to few data at a time, our idea is based on a procedure for sampling units from a big population using a simple criterion to summarize the resulting values appropriately. Applying the central limit theorem enables a comparison of the results obtained from different groups using statistical tests on the means. We used our Height of Groups by Sampling (HOGS) method to compare the average rank among groups that are defined by one or more socio-demographic variables influencing the level of the complex concept we wish to measure. The application of the HOGS procedure to life satisfaction in Italy generated convincing results, revealing significant differences between regions, genders and levels of formal education. We compared the results given by HOGS with the non linear principal component analysis and obtain an easy readable output with convincing precision and accuracy; we are confident that the HOGS procedure can be applied to many other concepts investigated in the social sciences.
KeywordsSynthetic measure Partially ordered sets Quality of life Ordinal data
- Brüggemann, R., & Carlsen, L. (2011). An improved estimation of averaged ranks of partial orders. MATCH Communications in Mathematical and in Computer Chemistry, 65, 383–414.Google Scholar
- Brüggemann, R., & Voigt, K. (2009). Analysis of partial orders in environmental systems applying the new software pyhasse. In J. Wittmann, M. Flechsig (Eds.), Simulation in Umwelt-und Geowissenschaften, Workshop Potsdam (pp. 43–55), ShakerVerlag.Google Scholar
- Caperna, G. (2016). Partial order theory for synthetic indicators. Ph.D. thesis, University of Padova.Google Scholar
- De Loof, K. (2009). Efficient computation of rank probabilities in posets. Ph.D. thesis, University of Ghent.Google Scholar
- De Loof, K., De Baets, B., & De Meyer, H. (2011). Approximation of average ranks in posets. MATCH Communications in Mathematical and in Computer Chemistry, 66, 219–229.Google Scholar
- Fattore, M. (2015). Partially ordered sets and the measurement of multidimensional ordinal deprivation. Social Indicators Research, 1–24. doi: 10.1007/s11205-015-1059-6.
- Fattore, M., & Arcagni, A. (2014). Parsec: an r package for poset-based evaluation of multidimensional poverty. In R. Brüggeman, L. Carlsen & J. Wittmann (Eds.), Multi-indicator systems and modelling in partial order (pp. 317–330). New York: Springer.Google Scholar
- Gifi, A. (1990). Nonlinear multivariate analysis. Chicester: Wiley.Google Scholar
- Maggino, F., & Fattore, M. (2011). New tools for the construction of ranking and evaluation indicators in multidimensional systems of ordinal variables. In Proceedings of the: New techniques and technologies for statistics, Brussels. https://ec.europa.eu/eurostat/cros/sites/crosportal/files/S16P4.pdf-20/02/2016.
- Maggino, F., & Nuvolati, G. (2012). Quality of life in Italy: Research and reflections (Vol. 48). Dordrecht: Springer Science & Business Media.Google Scholar
- Polya, G. (1920). Über den zentralen Grenzwertsatz der Wahrscheinlichkeitsrechnung und das Momentenproblem. Mathematische Zeitschrift, 8(3), 171–181.Google Scholar
- Schröder, B. (2012). Ordered sets: An introduction. New York: Springer Science & Business Media.Google Scholar