Skip to main content
Log in

Clicks and cliques: exploring the soul of the community

  • Original paper
  • Published:
Computational Statistics Aims and scope Submit manuscript

Abstract

In this paper we analyze 26 communities across the United States with the objective to understand what attaches people to their community and how this attachment differs among communities. How different are attached people from unattached? What attaches people to their community? How different are the communities? What are key drivers behind emotional attachment? To address these questions, graphical, supervised and unsupervised learning tools were used and information from the Census Bureau and the Knight Foundation were combined. Using the same pre-processed variables as Knight (Soul of the community, Technical report, 2010) most likely will drive the results towards the same conclusions than the Knight foundation, so this paper does not use those variables.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

References

  • Atkinson AB, Bourguignon F (2000) Handbook of income distribution, vol 1. Elsevier, Amsterdam

    Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Article  Google Scholar 

  • Dahl DB (2016) xtable: export tables to LaTeX or HTML. R package version 1.8-2

  • de Vries A, Ripley BD (2016) ggdendro: create dendrograms and tree diagrams using ‘ggplot2’. R package version 0.1-20

  • Fox J, Weisberg S (2011) An R companion to applied regression, 2nd edn. Sage, Thousand Oaks

    Google Scholar 

  • Friendly M (1994) Mosaic displays for multi-way contingency tables. J Am Stat Assoc 89(425):190–200

    Article  Google Scholar 

  • Handcock MS (2016) Relative distribution methods. Los Angeles, CA. Version 1.6-6. Project home page at http://www.stat.ucla.edu/~handcock/RelDist

  • Hartigan JA, Kleiner B (1981) Mosaics for contingency tables. In: Computer science and statistics: proceedings of the 13th symposium on the interface. Springer, pp 268–273

  • Hofmann H, Wickham H, Cook D (2019) The 2013 data expo of the American Statistical Association. Comput Stat 25:551–554

    Google Scholar 

  • Hummel J (1996) Linked bar charts: analysing categorical data graphically. Comput Stat 11(1):23–33

    MathSciNet  MATH  Google Scholar 

  • Inselberg A (2009) Parallel coordinates: visual multidimensional geometry and its applications. Springer, Berlin, Heidelberg

    Book  Google Scholar 

  • Izenman A (2008) Modern multivariate statistical techniques: regression, classification, and manifold learning, 1st edn. Springer. ISBN:0387781889

  • Knight F (2010) Soul of the community. Technical report. https://knightfoundation.org/sotc/overall-findings/

  • Liaw A, Wiener M (2002) Classification and regression by randomforest. R News 2(3):18–22

    Google Scholar 

  • Lumley T (2018) Survey: analysis of complex survey samples. R package version 3.34

  • MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol 1. Oakland, CA, USA, pp 281–297

  • R Core Team (2018) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna

  • Rushton M (2008) A note on the use and misuse of the racial diversity index. Policy Stud J 36(3):445–459

    Article  Google Scholar 

  • Schloerke B, Crowley J, Cook D, Briatte F, Marbach M, Thoen E, Elberg A, Larmarange J (2018) GGally: extension to ’ggplot2’. R package version 1.4.0

  • U.S. Census Bureau (2011) 2011 American community survey 5-year estimates. Tables B01001, B01003, B02001, B05005, B08202, B19001, B19301, B23006, B25026. http://factfinder2.census.gov

  • Wickham H (2017) tidyverse: easily install and load the ‘Tidyverse’. R package version 1.2.1

  • Wickham H, Hofmann H (2011) Product plots. IEEE Trans Vis Comput Graph 17(12):2223–2230

    Article  Google Scholar 

  • Wickham H, Hofmann H (2016) productplots: product plots for R. R package version 0.1.1

  • Xie Y (2018) knitr: a general-purpose package for dynamic report generation in R. R package version 1.20

Download references

Acknowledgements

Professor Di Cook helped with the analysis and reviewing the paper. Xiaoyue Cheng’s mergeGui was used to create more amenable initial data. Israel Almodovar, Vianey Leos and Ricardo Martinez helped us with proofreading our paper for final corrections.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Natalia da Silva.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Description of the variables created

Description of the variables created

See Table 5.

Table 5 Variables created in the paper

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

da Silva, N., Alvarez-Castro, I. Clicks and cliques: exploring the soul of the community. Comput Stat 34, 1537–1563 (2019). https://doi.org/10.1007/s00180-019-00881-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00180-019-00881-3

Keywords

Navigation