Inferring Thematic Places from Spatially Referenced Natural Language Descriptions
Places are more than just a location and spatial footprint. A sense of place is the result of subjective experience that a person has from being in a place or from interacting with information about a place. Although it is difficult to directly model a person’s conceptualization of sense of place in a computational representation, there exist many natural language data online that describe people’s experiences with places and which can be used to learn computational representations. In this paper we evaluate the usage of topic modeling on a set of travel blog entries to identify the themes that are most closely associated with places around the world. Using these representations we can calculate the similarity of places. In addition, by focusing on individual or sets of topics we identify new regions where topics are most salient. Finally we discuss how temporal changes in sense of place can be evaluated using these methods.
KeywordsTopic Modeling Latent Dirichlet Allocation Volunteer Geographic Information Topic Distribution Latent Dirichlet Allocation Model
The authors wish to thank Mike Goodchild and two anonymous reviewers for their valuable comments.
- Agnew, J. (1987). The United States in the world economy. Cambridge, MA: Cambridge University Press.Google Scholar
- Bishop, C. (2006). Pattern recognition and machine learning (Vol. 4). New York: Springer.Google Scholar
- Blei, D. M., & Lafferty, J. D. (2006). Correlated topic models. In Y. Weiss, B. Schölkopf, & J. Platt (Eds.), Advances in neural information processing systems (NIPS) 18 (pp. 147–154). Cambridge, MA: MIT Press.Google Scholar
- Blei, D. M., & Lafferty, J. D. (2009). Topic models. In A. N. Srivastava & M. Sahami (Eds.), Text mining: Classification, clustering, and applications (pp. 71–94). Boca Raton: CRC Press.Google Scholar
- Blei, D., & McAuliffe, J. (2008). Supervised topic models. In J. C. Platt, D. Koller, Y. Singer, & S. Roweis (Eds.), Advances in neural information processing systems (NIPS) 20 (pp. 121–128). Cambridge, MA: MIT Press.Google Scholar
- Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.Google Scholar
- Chang, J., & Blei, D. (2009). Relational topic models for document networks. In D. van Dyk & M. Welling (Eds.), Proceedings of the 12th international conference on artificial intelligence and statistics (AISTATS) (pp. 81–88). Clearwater Beach: Journal of Machine Learning Research.Google Scholar
- Cresswell, T. (2004). Place: A short introduction. Malden: Blackwell Publishing Ltd.Google Scholar
- de Smith, M., Goodchild, M., & Longley, P. (2007). Geospatial analysis: A comprehensive guide to principles, techniques and software tools (2nd ed.). Leicester: Winchelsea Press.Google Scholar
- Entrikin, N. (1991). The betweenness of place: Toward a geography of modernity. Baltimore: Johns Hopkins University Press.Google Scholar
- Hao, Q., Cai, R., Wang, C., Xiao, R., Yang, J. M., Pang, Y., & Zhang, L. (2010). Equip tourists with knowledge mined from travelogues. In M. Rappa, P. Jones, J. Freire, & S. Chakrabarti (Eds.), Proceedings of the 19th international conference on world wide web (WWW’10) (pp. 401–410). New York: ACM Press.CrossRefGoogle Scholar
- McCallum, A. (2002). MALLET: A machine learning for language toolkit. http://mallet.cs.umass.edu. Accessed October 8, 2011.
- Mei, Q., Liu, C., Su, H., & Zhai, C. (2006). A probabilistic approach to spatiotemporal theme pattern mining on weblogs. In L. Carr, D. D. Roure, A. Iyengar, C. A. Goble, & M. Dahlin (Eds.), Proceedings of the 15th international conference on world wide web (pp. 533–542). New York: ACM Press.CrossRefGoogle Scholar
- Serdyukov, P., Murdock, V., & van Zwol, R. (2009). Placing flickr photos on a map. In J. Allan, J. A. Aslam, M. Sanderson, C. Zhai, & J. Zobel (Eds.), Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval (pp. 484–491). New York: ACM Press.Google Scholar
- Steyvers, M., & Griffiths, T. (2007). Probabilistic topic models. In T. Landauer, D. Mcnamara, S. Dennis, & W. Kintsch (Eds.), Handbook of latent semantic analysis (pp. 424–440). Hillsdale: Lawrence Erlbaum Associates.Google Scholar
- Steyvers, M., Smyth, P., Rosen-Zvi, M., & Griffiths, T. (2004). Probabilistic author-topic models for information discovery. In KDD’04: Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining (pp. 306–315). New York: ACM Press.Google Scholar
- Tuan, Y. F. (1977). Space and place: The perspective of experience. Minneapolis: The Regents of the University of Minnesota.Google Scholar