Abstract
In this paper we analyze 26 communities across the United States with the objective to understand what attaches people to their community and how this attachment differs among communities. How different are attached people from unattached? What attaches people to their community? How different are the communities? What are key drivers behind emotional attachment? To address these questions, graphical, supervised and unsupervised learning tools were used and information from the Census Bureau and the Knight Foundation were combined. Using the same pre-processed variables as Knight (Soul of the community, Technical report, 2010) most likely will drive the results towards the same conclusions than the Knight foundation, so this paper does not use those variables.
Similar content being viewed by others
References
Atkinson AB, Bourguignon F (2000) Handbook of income distribution, vol 1. Elsevier, Amsterdam
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Dahl DB (2016) xtable: export tables to LaTeX or HTML. R package version 1.8-2
de Vries A, Ripley BD (2016) ggdendro: create dendrograms and tree diagrams using ‘ggplot2’. R package version 0.1-20
Fox J, Weisberg S (2011) An R companion to applied regression, 2nd edn. Sage, Thousand Oaks
Friendly M (1994) Mosaic displays for multi-way contingency tables. J Am Stat Assoc 89(425):190–200
Handcock MS (2016) Relative distribution methods. Los Angeles, CA. Version 1.6-6. Project home page at http://www.stat.ucla.edu/~handcock/RelDist
Hartigan JA, Kleiner B (1981) Mosaics for contingency tables. In: Computer science and statistics: proceedings of the 13th symposium on the interface. Springer, pp 268–273
Hofmann H, Wickham H, Cook D (2019) The 2013 data expo of the American Statistical Association. Comput Stat 25:551–554
Hummel J (1996) Linked bar charts: analysing categorical data graphically. Comput Stat 11(1):23–33
Inselberg A (2009) Parallel coordinates: visual multidimensional geometry and its applications. Springer, Berlin, Heidelberg
Izenman A (2008) Modern multivariate statistical techniques: regression, classification, and manifold learning, 1st edn. Springer. ISBN:0387781889
Knight F (2010) Soul of the community. Technical report. https://knightfoundation.org/sotc/overall-findings/
Liaw A, Wiener M (2002) Classification and regression by randomforest. R News 2(3):18–22
Lumley T (2018) Survey: analysis of complex survey samples. R package version 3.34
MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol 1. Oakland, CA, USA, pp 281–297
R Core Team (2018) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna
Rushton M (2008) A note on the use and misuse of the racial diversity index. Policy Stud J 36(3):445–459
Schloerke B, Crowley J, Cook D, Briatte F, Marbach M, Thoen E, Elberg A, Larmarange J (2018) GGally: extension to ’ggplot2’. R package version 1.4.0
U.S. Census Bureau (2011) 2011 American community survey 5-year estimates. Tables B01001, B01003, B02001, B05005, B08202, B19001, B19301, B23006, B25026. http://factfinder2.census.gov
Wickham H (2017) tidyverse: easily install and load the ‘Tidyverse’. R package version 1.2.1
Wickham H, Hofmann H (2011) Product plots. IEEE Trans Vis Comput Graph 17(12):2223–2230
Wickham H, Hofmann H (2016) productplots: product plots for R. R package version 0.1.1
Xie Y (2018) knitr: a general-purpose package for dynamic report generation in R. R package version 1.20
Acknowledgements
Professor Di Cook helped with the analysis and reviewing the paper. Xiaoyue Cheng’s mergeGui was used to create more amenable initial data. Israel Almodovar, Vianey Leos and Ricardo Martinez helped us with proofreading our paper for final corrections.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Description of the variables created
Description of the variables created
See Table 5.
Rights and permissions
About this article
Cite this article
da Silva, N., Alvarez-Castro, I. Clicks and cliques: exploring the soul of the community. Comput Stat 34, 1537–1563 (2019). https://doi.org/10.1007/s00180-019-00881-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-019-00881-3