Embedding Geographic Locations for Modelling the Natural Environment Using Flickr Tags and Structured Data

Jeawak, Shelan S.; Jones, Christopher B.; Schockaert, Steven

doi:10.1007/978-3-030-15712-8_4

Shelan S. Jeawak^20,21,
Christopher B. Jones²⁰ &
Steven Schockaert²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11437))

Included in the following conference series:

European Conference on Information Retrieval

2637 Accesses
1 Citations

Abstract

Meta-data from photo-sharing websites such as Flickr can be used to obtain rich bag-of-words descriptions of geographic locations, which have proven valuable, among others, for modelling and predicting ecological features. One important insight from previous work is that the descriptions obtained from Flickr tend to be complementary to the structured information that is available from traditional scientific resources. To better integrate these two diverse sources of information, in this paper we consider a method for learning vector space embeddings of geographic locations. We show experimentally that this method improves on existing approaches, especially in cases where structured information is available.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Neural Embeddings for Populated Geonames Locations

Extract Knowledge About Geo-Location Using Context and Content Information of Geo-Tagged Social Media

Identifying Locally- and Globally-Distinctive Urban Place Descriptors from Heterogeneous User-Generated Content

Notes

1.
http://www.flickr.com.
2.
One exception is perhaps when we want to predict the scenicness of a given location, where e.g. terms that are related to professional landscape photography might be a strong indicator of scenicness.
3.
http://www.eea.europa.eu/data-and-maps/data/eu-dem.
4.
http://data.europa.eu/89h/jrc-luisa-europopmap06.
5.
http://worldclim.org.
6.
http://www.eea.europa.eu/data-and-maps/data/corine-land-cover-2006-raster-2.
7.
https://www.soilgrids.org.
8.
http://ec.europa.eu/environment/nature/natura2000/index_en.htm.
9.
http://scenic.mysociety.org/.
10.
http://www.cs.cornell.edu/people/tj/svm_light/.
11.
The EGEL source code is available online at https://github.com/shsabah84/EGEL-Model.git.

References

Barve, V.V.: Discovering and developing primary biodiversity data from social networking sites. Ph.D. thesis, University of Kansas (2015)
Google Scholar
Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: Advances in Neural Information Processing Systems, pp. 2787–2795 (2013)
Google Scholar
Church, K.W., Hanks, P.: Word association norms, mutual information, and lexicography. Comput. Linguist. 16(1), 22–29 (1990)
Google Scholar
Cocos, A., Callison-Burch, C.: The language of place: semantic value from geospatial context. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, vol. 2, pp. 99–104 (2017)
Google Scholar
Cunha, E., Martins, B.: Using one-class classifiers and multiple kernel learning for defining imprecise geographic regions. Int. J. Geogr. Inf. Sci. 28(11), 2220–2241 (2014)
Article Google Scholar
Daume, S.: Mining Twitter to monitor invasive alien species - an analytical framework and sample information topologies. Ecol. Inform. 31, 70–82 (2016)
Article Google Scholar
De Choudhury, M., Feldman, M., Amer-Yahia, S., Golbandi, N., Lempel, R., Yu, C.: Constructing travel itineraries from tagged geo-temporal breadcrumbs. In: Proceedings of the 19th International Conference on World Wide Web, pp. 1083–1084 (2010)
Google Scholar
Derrac, J., Schockaert, S.: Inducing semantic relations from conceptual spaces: a data-driven approach to plausible reasoning. Artif. Intell. 228, 74–105 (2015)
Article MathSciNet Google Scholar
Eisenstein, J., O’Connor, B., Smith, N.A., Xing, E.P.: A latent variable model for geographic lexical variation. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 1277–1287 (2010)
Google Scholar
ElQadi, M.M., Dorin, A., Dyer, A., Burd, M., Bukovac, Z., Shrestha, M.: Mapping species distributions with social media geo-tagged images: case studies of bees and flowering plants in Australia. Ecol. Inform. 39, 23–31 (2017)
Article Google Scholar
Feng, S., Cong, G., An, B., Chee, Y.M.: Poi2vec: geographical latent representation for predicting future visitors. In: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, pp. 102–108 (2017)
Google Scholar
Ge, L., Moh, T.S.: Improving text classification with word embedding. In: IEEE International Conference on Big Data, pp. 1796–1805 (2017)
Google Scholar
Grave, E., Mikolov, T., Joulin, A., Bojanowski, P.: Bag of tricks for efficient text classification. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pp. 427–431 (2017)
Google Scholar
Grothe, C., Schaab, J.: Automated footprint generation from geotags with kernel density estimation and support vector machines. Spat. Cogn. Comput. 9(3), 195–211 (2009)
Article Google Scholar
Grover, A., Leskovec, J.: node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 855–864 (2016)
Google Scholar
Guo, S., Wang, Q., Wang, B., Wang, L., Guo, L.: Semantically smooth knowledge graph embedding. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics, pp. 84–94 (2015)
Google Scholar
Gupta, A., Boleda, G., Baroni, M., Padó, S.: Distributional vectors encode referential attributes. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 12–21 (2015)
Google Scholar
Hasegawa, M., Kobayashi, T., Hayashi, Y.: Social image tags as a source of word embeddings: a task-oriented evaluation. In: LREC, pp. 969–973 (2018)
Google Scholar
Hollenstein, L., Purves, R.: Exploring place through user-generated content: using Flickr tags to describe city cores. J. Spat. Inf. Sci. 1, 21–48 (2010)
Google Scholar
Jameel, S., Schockaert, S.: D-glove: a feasible least squares model for estimating word embedding densities. In: Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers, pp. 1849–1860 (2016)
Google Scholar
Jeawak, S., Jones, C., Schockaert, S.: Using Flickr for characterizing the environment: an exploratory analysis. In: 13th International Conference on Spatial Information Theory, vol. 86, pp. 21:1–21:13 (2017)
Google Scholar
Jeawak, S., Jones, C., Schockaert, S.: Mapping wildlife species distribution with social media: augmenting text classification with species names. In: Proceedings of the 10th International Conference on Geographic Information Science, pp. 34:1–34:6 (2018)
Google Scholar
Joachims, T.: Making large-scale SVM learning practical. Technical report, SFB 475: Komplexitätsreduktion in Multivariaten Datenstrukturen, Universität Dortmund (1998)
Google Scholar
Joulin, A., Grave, E., Bojanowski, P., Nickel, M., Mikolov, T.: Fast linear model for knowledge graph embeddings. arXiv preprint arXiv:1710.10881 (2017)
Kuang, S., Davison, B.D.: Learning word embeddings with chi-square weights for healthcare tweet classification. Appl.Sci. 7(8), 846 (2017)
Article Google Scholar
Lilleberg, J., Zhu, Y., Zhang, Y.: Support vector machines and word2vec for text classification with semantic features. In: IEEE 14th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC), pp. 136–140 (2015)
Google Scholar
Liu, Q., Jiang, H., Wei, S., Ling, Z.H., Hu, Y.: Learning semantic word embeddings based on ordinal knowledge constraints. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics, pp. 1501–1511 (2015)
Google Scholar
Liu, Q., Ling, Z.H., Jiang, H., Hu, Y.: Part-of-speech relevance weights for learning word embeddings. arXiv preprint arXiv:1603.07695 (2016)
Liu, X., Liu, Y., Li, X.: Exploring the context of locations for personalized location recommendations. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pp. 1188–1194 (2016)
Google Scholar
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Google Scholar
Nickel, M., Kiela, D.: Poincaré embeddings for learning hierarchical representations. In: Advances in Neural Information Processing Systems, pp. 6341–6350 (2017)
Google Scholar
Niwa, Y., Nitta, Y.: Co-occurrence vectors from corpora vs. distance vectors from dictionaries. In: Proceedings of the 15th Conference on Computational Linguistics-Volume 1, pp. 304–309 (1994)
Google Scholar
Ono, M., Miwa, M., Sasaki, Y.: Word embedding-based antonym detection using thesauri and distributional information. In: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 984–989 (2015)
Google Scholar
Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, pp. 1532–1543 (2014)
Google Scholar
Qiu, L., Cao, Y., Nie, Z., Yu, Y., Rui, Y.: Learning word representation considering proximity and ambiguity. In: AAAI, pp. 1572–1578 (2014)
Google Scholar
Quercia, D., Schifanella, R., Aiello, L.M.: The shortest path to happiness: recommending beautiful, quiet, and happy routes in the city. In: Proceedings of the 25th ACM Conference on Hypertext and Social Media, pp. 116–125 (2014)
Google Scholar
Rattenbury, T., Good, N., Naaman, M.: Towards automatic extraction of event and place semantics from Flickr tags. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 103–110 (2007)
Google Scholar
Rattenbury, T., Naaman, M.: Methods for extracting place semantics from Flickr tags. ACM Trans. Web 3(1), 1 (2009)
Article Google Scholar
Richards, D.R., Friess, D.A.: A rapid indicator of cultural ecosystem service usage at a fine spatial scale: content analysis of social media photographs. Ecol. Ind. 53, 187–195 (2015)
Article Google Scholar
Rothe, S., Schütze, H.: Word embedding calculus in meaningful ultradense subspaces. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 512–517 (2016)
Google Scholar
Saeidi, M., Riedel, S., Capra, L.: Lower dimensional representations of city neighbourhoods. In: AAAI Workshop: AI for Cities (2015)
Google Scholar
Speer, R., Chin, J., Havasi, C.: Conceptnet 5.5: an open multilingual graph of general knowledge. In: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, pp. 4444–4451 (2017)
Google Scholar
Tang, D., Wei, F., Yang, N., Zhou, M., Liu, T., Qin, B.: Learning sentiment-specific word embedding for twitter sentiment classification. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), vol. 1, pp. 1555–1565 (2014)
Google Scholar
Trouillon, T., Welbl, J., Riedel, S., Gaussier, É., Bouchard, G.: Complex embeddings for simple link prediction. In: International Conference on Machine Learning, pp. 2071–2080 (2016)
Google Scholar
Van Canneyt, S., Schockaert, S., Dhoedt, B.: Discovering and characterizing places of interest using Flickr and Twitter. Int. J. Semant. Web Inf. Syst. (IJSWIS) 9(3), 77–104 (2013)
Article Google Scholar
Van Laere, O., Quinn, J.A., Schockaert, S., Dhoedt, B.: Spatially aware term selection for geotagging. IEEE Trans. Knowl. Data Eng. 26, 221–234 (2014)
Article Google Scholar
Vendrov, I., Kiros, R., Fidler, S., Urtasun, R.: Order-embeddings of images and language. arXiv preprint arXiv:1511.06361 (2015)
Wang, X., Cui, P., Wang, J., Pei, J., Zhu, W., Yang, S.: Community preserving network embedding. In: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, pp. 203–209 (2017)
Google Scholar
Weston, J., Bengio, S., Usunier, N.: Large scale image annotation: learning to rank with joint word-image embeddings. Mach. Learn. 81(1), 21–35 (2010)
Article MathSciNet Google Scholar
Xu, C., et al.: RC-NET: a general framework for incorporating knowledge into word representations. In: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, pp. 1219–1228 (2014)
Google Scholar
Yan, B., Janowicz, K., Mai, G., Gao, S.: From ITDL to Place2Vec: reasoning about place type similarity and relatedness by learning embeddings from augmented spatial contexts. In: Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 35:1–35:10 (2017)
Google Scholar
Yang, B., Yih, W., He, X., Gao, J., Deng, L.: Embedding entities and relations for learning and inference in knowledge bases. In: Proceeding of ICLR 2015 (2015)
Google Scholar
Yao, Y., et al.: Sensing spatial distribution of urban land use by integrating points-of-interest and Google word2vec model. Int. J. Geogr. Inf. Sci. 31(4), 825–848 (2017)
Article Google Scholar
Zhang, C., et al.: Regions, periods, activities: uncovering urban dynamics via cross-modal representation learning. In: Proceedings of the 26th International Conference on World Wide Web, pp. 361–370 (2017)
Google Scholar
Zhao, S., Zhao, T., King, I., Lyu, M.R.: Geo-teaser: geo-temporal sequential embedding rank for point-of-interest recommendation. In: Proceedings of the 26th International Conference on World Wide Web Companion, pp. 153–162 (2017)
Google Scholar

Download references

Acknowledgments

Shelan Jeawak has been sponsored by HCED Iraq. Steven Schockaert has been supported by ERC Starting Grant 637277.

Author information

Authors and Affiliations

School of Computer Science and Informatics, Cardiff University, Cardiff, UK
Shelan S. Jeawak, Christopher B. Jones & Steven Schockaert
Department of Computer Science, Al-Nahrain University, Baghdad, Iraq
Shelan S. Jeawak

Authors

Shelan S. Jeawak
View author publications
You can also search for this author in PubMed Google Scholar
Christopher B. Jones
View author publications
You can also search for this author in PubMed Google Scholar
Steven Schockaert
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shelan S. Jeawak .

Editor information

Editors and Affiliations

University of Strathclyde, Glasgow, UK
Leif Azzopardi
Bauhaus Universität Weimar, Weimar, Germany
Benno Stein
Universität Duisburg-Essen, Duisburg, Germany
Norbert Fuhr
GESIS - Leibniz Institute for the Social Sciences, Cologne, Germany
Philipp Mayr
Delft University of Technology, Delft, The Netherlands
Claudia Hauff
University of Twente, Enschede, The Netherlands
Djoerd Hiemstra

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jeawak, S.S., Jones, C.B., Schockaert, S. (2019). Embedding Geographic Locations for Modelling the Natural Environment Using Flickr Tags and Structured Data. In: Azzopardi, L., Stein, B., Fuhr, N., Mayr, P., Hauff, C., Hiemstra, D. (eds) Advances in Information Retrieval. ECIR 2019. Lecture Notes in Computer Science(), vol 11437. Springer, Cham. https://doi.org/10.1007/978-3-030-15712-8_4

Download citation

DOI: https://doi.org/10.1007/978-3-030-15712-8_4
Published: 07 April 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-15711-1
Online ISBN: 978-3-030-15712-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Embedding Geographic Locations for Modelling the Natural Environment Using Flickr Tags and Structured Data

Abstract

Access this chapter

Similar content being viewed by others

Neural Embeddings for Populated Geonames Locations

Extract Knowledge About Geo-Location Using Context and Content Information of Geo-Tagged Social Media

Identifying Locally- and Globally-Distinctive Urban Place Descriptors from Heterogeneous User-Generated Content

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Embedding Geographic Locations for Modelling the Natural Environment Using Flickr Tags and Structured Data

Abstract

Access this chapter

Similar content being viewed by others

Neural Embeddings for Populated Geonames Locations

Extract Knowledge About Geo-Location Using Context and Content Information of Geo-Tagged Social Media

Identifying Locally- and Globally-Distinctive Urban Place Descriptors from Heterogeneous User-Generated Content

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation