Skip to main content

Personal Activity Centres and Geosocial Data Analysis: Combining Big Data with Small Data

Part of the Lecture Notes in Geoinformation and Cartography book series (LNGC)

Abstract

Understanding how people move and interact within urban settings has been greatly facilitated by the expansion of personal computing and mobile studies. Geosocial data derived from social media applications have the potential to both document how large segments of urban populations move about and use space, as well as how they interact with their environments. In this paper we examine spatial and temporal clustering of individuals’ geosocial messages as a way to derive personal activity centres for a subset of Twitter users in the City of Toronto. We compare the two types of clustering, and for a subset of users, compare to actual self-reported activity centres. Our analysis reveals that home locations were detected within 500 m for up to 53% of users using simple spatial clustering methods based on a sample of 16 users. Work locations were detected within 500 m for 33% of users. Additionally, we find that the broader pattern of geosocial footprints indicated that 35% of users have only one activity centre, 30% have two activity centres, and 14% have three activity centres. Tweets about environment were more likely sent from locations other than work and home, and when not directed to another user. These findings indicate activity centres defined from Twitter do relate to general spatial activities, but the limited degree of spatial variability on an individual level limits the applications of geosocial footprints for more detailed analyses of movement patterns in the city.

Keywords

  • Geosocial
  • Personal activity centres
  • Clustering
  • Spatial analysis

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/978-3-319-56759-4_9
  • Chapter length: 17 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
eBook
USD   219.00
Price excludes VAT (USA)
  • ISBN: 978-3-319-56759-4
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book
USD   279.99
Price excludes VAT (USA)
Hardcover Book
USD   279.99
Price excludes VAT (USA)
Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

References

  • Batty M, Axhausen KW, Giannotti F, Pozdnoukhov A, Bazzani A, Wachowicz M et al (2012) Smart cities of the future. Eur Phys J Spec Top 214(1):481–518

    CrossRef  Google Scholar 

  • Boyd D (2014) It’s complicated: the social lives of networked teens. Yale University Press, New Haven

    Google Scholar 

  • Crampton JW, Graham M, Poorthuis A, Shelton T, Stephens M, Wilson MW, Zook M (2013) Beyond the geotag: situating “big data” and leveraging the potential of the geoweb. Cartogr Geogr Inf Sci 40(2):130–139

    CrossRef  Google Scholar 

  • Ester M, Kriegel HP, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Kdd, vol 96, no 34, pp 226–231

    Google Scholar 

  • Golledge RG, Stimson RJ (1997) Spatial behavior: a geographic perspective. Guilford Press

    Google Scholar 

  • Goodchild MF (2007) Citizens as sensors: the world of volunteered geography. GeoJournal 69:211–221

    CrossRef  Google Scholar 

  • Haklay M (2010) How good is volunteered geographical information? A comparative study of OpenStreetMap and Ordnance Survey datasets. Environ Plan Des B 37:682–703

    CrossRef  Google Scholar 

  • Hickman P (2013) “Third places” and social interaction in deprived neighbourhoods in Great Britain. J Hous Built Environ 28(2):221–236

    CrossRef  Google Scholar 

  • Hollenstein L, Purves R (2013) Exploring place through user-generated content: using Flickr tags to describe city cores. J Spat Inf Sci 1(January):21–48

    Google Scholar 

  • Huang Q, Cao G, Wang C (2014) From where do tweets originate?: a GIS approach for user location inference. In: Proceedings of the 7th ACM SIGSPATIAL International Workshop on Location-Based Social Networks, ACM, pp 1–8

    Google Scholar 

  • Huang Q, Wong DWS (2016) Activity patterns, socioeconomic status and urban spatial structure: what can social media data tell us? Int J Geogr Inf Sci 30:1873–1898

    CrossRef  Google Scholar 

  • Kitchin R (2014) The real-time city? Big data and smart urbanism. GeoJournal 79(1):1–14

    CrossRef  Google Scholar 

  • Li L, Goodchild MF, Xu B (2013) Spatial, temporal, and socioeconomic patterns in the use of Twitter and Flickr. Cartogr Geogr Inf Sci 40:61–77

    CrossRef  Google Scholar 

  • Meilă M (2007) Comparing clusterings—an information based distance. J Multivar Anal 98(5):873–895

    CrossRef  Google Scholar 

  • Miller HJ (2010) The data avalanche is here. Shouldn’t we be digging? J Reg Sci 50(1):181–201

    CrossRef  Google Scholar 

  • Miller HJ, Goodchild MF (2015) Data-driven geography. GeoJournal 80(4):449–461

    CrossRef  Google Scholar 

  • Mitchell L, Frank MR, Harris KD, Dodds PS, Danforth CM (2013) The geography of happiness: connecting twitter sentiment and expression, demographics, and objective characteristics of place. PLoS ONE 8(5):e64417

    CrossRef  Google Scholar 

  • Morstatter F, Pfeffer J, Liu H, Carley KM (2013) Is the sample good enough? Comparing data from Twitter’s streaming API with Twitter’s firehose. arXiv preprint arXiv:1306.5204

  • Oldenburg R, Brissett D (1982) The third place. Qual Soc 5(4):265–284

    CrossRef  Google Scholar 

  • Poorthuis A, Zook M, Shelton T, Graham M, Stephens M (2016) Using geotagged digital social data in geographic research. In: Clifford N, French S, Cope M, Gillespie T (eds) Key methods in geography. Sage, London, pp 248–269

    Google Scholar 

  • Robertson C, Feick R (2015) Bumps and bruises in the digital skins of cities: unevenly distributed user-generated content across US urban areas. Cartogr Geogr Inf Sci 1–18

    Google Scholar 

  • Soukup C (2006) Computer-mediated communication as a virtual third place: building Oldenburg’s great good places on the world wide web. New Media Soc 8(3):421–440

    CrossRef  Google Scholar 

  • Steinkuehler CA, Williams D (2006) Where everybody knows your (screen) name: online games as “third places”. J Comput-Mediat Commun 11(4):885–909

    CrossRef  Google Scholar 

  • Sykora MD, Robertson C, Shankardass K, Feick R, Shaughnessy K, Coates B, Lawrence H, Jackson T (2015) Stresscapes: validating linkages between place and stress expression on social media. Published by CEUR Workshop Proceedings

    Google Scholar 

Download references

Acknowledgements

The authors gratefully acknowledge our study participants as well as the Social Sciences and Humanities Research Council of Canada for funding this research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Colin Robertson .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Robertson, C., Feick, R., Sykora, M., Shankardass, K., Shaughnessy, K. (2017). Personal Activity Centres and Geosocial Data Analysis: Combining Big Data with Small Data. In: Bregt, A., Sarjakoski, T., van Lammeren, R., Rip, F. (eds) Societal Geo-innovation. AGILE 2017. Lecture Notes in Geoinformation and Cartography. Springer, Cham. https://doi.org/10.1007/978-3-319-56759-4_9

Download citation