Skip to main content
Log in

Location- and keyword-based querying of geo-textual data: a survey

  • Regular Paper
  • Published:
The VLDB Journal Aims and scope Submit manuscript

Abstract

With the broad adoption of mobile devices, notably smartphones, keyword-based search for content has seen increasing use by mobile users, who are often interested in content related to their geographical location. We have also witnessed a proliferation of geo-textual content that encompasses both textual and geographical information. Examples include geo-tagged microblog posts, yellow pages, and web pages related to entities with physical locations. Over the past decade, substantial research has been conducted on integrating location into keyword-based querying of geo-textual content in settings where the underlying data is assumed to be either relatively static or is assumed to stream into a system that maintains a set of continuous queries. This paper offers a survey of both the research problems studied and the solutions proposed in these two settings. As such, it aims to offer the reader a first understanding of key concepts and techniques, and it serves as an “index” for researchers who are interested in exploring the concepts and techniques underlying proposed solutions to the querying of geo-textual data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. https://twitter.com/

  2. https://www.flickr.com/

  3. https://www.instagram.com/

  4. https://foursquare.com/

  5. https://www.yelp.com/

  6. https://www.tripadvisor.com/

References

  1. 30 foursquare statistics to help you optimize the platform in 2019. https://99firms.com/blog/foursquare-statistics/#gref. Accessed 05 Nov 2019

  2. Apache lucene. https://lucene.apache.org. Accessed 05 Nov 2019

  3. An introduction to yelp metrics as of 30 june, 2019. https://www.yelp.com/factsheet. Accessed 05 Nov 2019

  4. Abdelhamid, A.S., Tang, M., Aly, A.M., Mahmood, A.R., Qadah, T., Aref, W.G., Basalamah, S.: Cruncher: distributed in-memory processing for location-based services. In: ICDE, pp. 1406–1409 (2016)

  5. Abdelhaq, H., Sengstock, C., Gertz, M.: Eventweet: Online localized event detection from twitter. PVLDB 6(12), 1326–1329 (2013)

    Google Scholar 

  6. Ahmed, P., Hasan, M., Kashyap, A., Hristidis, V., Tsotras, V.J.: Efficient computation of top-k frequent terms over spatio-temporal ranges. In: SIGMOD, pp. 1227–1241 (2017)

  7. Ahuja, R., Armenatzoglou, N., Papadias, D., Fakas, G.J.: Geo-social keyword search. In: SSTD, pp. 431–450 (2015)

  8. Almaslukh, A., Magdy, A.: Evaluating spatial-keyword queries on streaming data. In: SIGSPATIAL, pp. 209–218 (2018)

  9. de Almeida, J.P.D., Rocha-Junior, J.B.: Top-k spatial keyword preference query. JIDM 6(3), 162–177 (2015)

    Google Scholar 

  10. Alsubaiee, S., Behm, A., Li, C.: Supporting location-based approximate-keyword queries. In: SIGSPATIAL, pp. 61–70 (2010)

  11. Bar-Yossef, Z., Jayram, T.S., Kumar, R., Sivakumar, D., Trevisan, L.: Counting distinct elements in a data stream. In: RANDOM, pp. 1–10. Springer (2002)

  12. Börzsönyi, S., Kossmann, D., Stocker, K.: The skyline operator. In: ICDE, pp. 421–430 (2001)

  13. Bouros, P., Ge, S., Mamoulis, N.: Spatio-textual similarity joins. PVLDB 6(1), 1–12 (2012)

    Google Scholar 

  14. Cao, X., Chen, L., Cong, G., Jensen, C.S., Qu, Q., Skovsgaard, A., Wu, D., Yiu, M.L.: Spatial keyword querying. In: ER, pp. 16–29. Springer (2012)

  15. Cao, X., Chen, L., Cong, G., Xiao, X.: Keyword-aware optimal route search. PVLDB 5(11), 1136–1147 (2012)

    Google Scholar 

  16. Cao, X., Cong, G., Guo, T., Jensen, C.S., Ooi, B.C.: Efficient processing of spatial group keyword queries. TODS 40(2), 1–48 (2015)

    Article  MathSciNet  Google Scholar 

  17. Cao, X., Cong, G., Jensen, C.S.: Retrieving top-k prestige-based relevant spatial web objects. PVLDB 3(1), 373–384 (2010)

    Google Scholar 

  18. Cao, X., Cong, G., Jensen, C.S., Ooi, B.C.: Collective spatial keyword querying. In: SIGMOD, pp. 373–384 (2011)

  19. Cary, A., Wolfson, O., Rishe, N.: Efficient and scalable method for processing top-k spatial Boolean queries. In: SSDBM, pp. 87–95 (2010)

  20. Chan, H.K., Long, C., Wong, R.C.: Inherent-cost aware collective spatial keyword queries. In: SSTD, pp. 357–375 (2017)

  21. Chan, H.K.H., Long, C., Wong, R.C.W.: On generalizing collective spatial keyword queries. TKDE 30(9), 1712–1726 (2018)

    Google Scholar 

  22. Chen, G., Zhao, J., Gao, Y., Chen, L., Chen, R.: Time-aware Boolean spatial keyword queries. TKDE 29(11), 2601–2614 (2017)

    Google Scholar 

  23. Chen, J., Xu, J., Liu, C., Li, Z., Liu, A., Ding, Z.: Multi-objective spatial keyword query with semantics. In: DASFAA, pp. 34–48 (2017)

  24. Chen, L., Cong, G., Cao, X.: An efficient query indexing mechanism for filtering geo-textual data. In: SIGMOD, pp. 749–760 (2013)

  25. Chen, L., Cong, G., Cao, X., Tan, K.: Temporal spatial-keyword top-k publish/subscribe. In: ICDE, pp. 255–266 (2015)

  26. Chen, L., Cong, G., Jensen, C.S., Wu, D.: Spatial keyword query processing: an experimental evaluation. PVLDB 6(3), 217–228 (2013)

    Google Scholar 

  27. Chen, L., Li, Y., Xu, J., Jensen, C.S.: Direction-aware why-not spatial keyword top-k queries. In: ICDE, pp. 107–110 (2017)

  28. Chen, L., Lin, X., Hu, H., Jensen, C.S., Xu, J.: Answering why-not questions on spatial keyword top-k queries. In: ICDE, pp. 279–290 (2015)

  29. Chen, L., Roy, A.: Event detection from flickr data through wavelet-based spatial analysis. In: CIKM, pp. 523–532 (2009)

  30. Chen, L., Shang, S., Yang, C., Li, J.: Spatial keyword search: a survey. GeoInformatica 24(1), 85–106 (2020)

    Article  Google Scholar 

  31. Chen, L., Shang, S., Zheng, K., Kalnis, P.: Cluster-based subscription matching for geo-textual data streams. In: ICDE, pp. 890–901 (2019)

  32. Chen, L., Xu, J., Jensen, C.S., Li, Y.: YASK: a why-not question answering engine for spatial keyword query services. PVLDB 9(13), 1501–1504 (2016)

    Google Scholar 

  33. Chen, L., Xu, J., Lin, X., Jensen, C.S., Hu, H.: Answering why-not spatial keyword top-k queries via keyword adaption. In: ICDE, pp. 697–708 (2016)

  34. Chen, Y., Chen, Z., Cong, G., Mahmood, A.R., Aref, W.G.: Sstd: a distributed system on streaming spatio-textual data. PVLDB 13(12), 2284–2296 (2020)

    Google Scholar 

  35. Chen, Y., Suel, T., Markowetz, A.: Efficient query processing in geographic web search engines. In: SIGMOD, pp. 277–288 (2006)

  36. Chen, Z., Cong, G., Zhang, Z., Fuz, T.Z., Chen, L.: Distributed publish/subscribe query processing on the spatio-textual data stream. In: ICDE, pp. 1095–1106 (2017)

  37. Choi, D.W., Pei, J., Lin, X.: Finding the minimum spatial keyword cover. In: ICDE, pp. 685–696 (2016)

  38. Choudhury, F.M., Culpepper, J.S., Sellis, T.K., Cao, X.: Maximizing bichromatic reverse spatial and textual k nearest neighbor queries. PVLDB 9(6), 456–467 (2016)

    Google Scholar 

  39. Christoforaki, M., He, J., Dimopoulos, C., Markowetz, A., Suel, T.: Text vs. space: efficient geo-search query processing. In: CIKM, pp. 423–432 (2011)

  40. Cong, G., Feng, K., Zhao, K.: Querying and mining geo-textual data for exploration: Challenges and opportunities. In: ICDE Workshops, pp. 165–168 (2016)

  41. Cong, G., Jensen, C.S.: Querying geo-textual data: Spatial keyword queries and beyond. In: SIGMOD, pp. 2207–2212. ACM (2016)

  42. Cong, G., Jensen, C.S., Wu, D.: Efficient retrieval of the top-k most relevant spatial web objects. In: PVLDB, pp. 337–348 (2009)

  43. Cui, N., Li, J., Yang, X., Wang, B., Reynolds, M., Xiang, Y.: When geo-text meets security: Privacy-preserving Boolean spatial keyword queries. In: ICDE (2019)

  44. Deng, K., Li, X., Lu, J., Zhou, X.: Best keyword cover search. In: ICDE, pp. 61–73 (2014)

  45. Dijkstra, E.W.: A note on two problems in connexion with graphs. Numerische mathematik 1(1), 269–271 (1959)

    Article  MathSciNet  MATH  Google Scholar 

  46. Fagin, R.: Combining fuzzy information from multiple systems. In: PODS, pp. 216–226 (1996)

  47. Faloutsos, C., Christodoulakis, S.: Signature files: an access method for documents and its analytical performance evaluation. TOIS 2(4), 267–288 (1984)

    Article  Google Scholar 

  48. Fan, J., Li, G., Zhou, L., Chen, S., Hu, J.: SEAL: spatio-textual similarity search. PVLDB 5(9), 824–835 (2012)

    Google Scholar 

  49. Fang, H., Zhao, P., Sheng, V.S., Li, Z., Xu, J., Wu, J., Cui, Z.: Ranked reverse Boolean spatial keyword nearest neighbors search. In: WISE, pp. 92–107 (2015)

  50. Fang, Y., Cheng, R., Cong, G., Mamoulis, N., Li, Y.: On spatial pattern matching. In: ICDE, pp. 293–304 (2018)

  51. Feng, W., Zhang, C., Zhang, W., Han, J., Wang, J., Aggarwal, C., Huang, J.: STREAMCUBE: hierarchical spatio-temporal hashtag clustering for event exploration over the twitter stream. In: ICDE, pp. 1561–1572 (2015)

  52. Felipe, I.D., Hristidis, V., Rishe, N.: Keyword search on spatial databases. In: ICDE, pp. 656–665 (2008)

  53. Google s2 geometry library. https://code.google.com/archive/p/s2-geometry-library/. Accessed 10 Nov 2020

  54. Gao, Y., Qin, X., Zheng, B., Chen, G.: Efficient reverse top-k Boolean spatial keyword queries on road networks. TKDE 27(5), 1205–1218 (2015)

    Google Scholar 

  55. Gao, Y., Wang, Y., Yi, S.: Preference-aware top-k spatio-textual queries. In: WAIM, pp. 186–197 (2016)

  56. Gao, Y., Zhao, J., Zheng, B., Chen, G.: Efficient collective spatial keyword query processing on road networks. TITS 17(2), 469–480 (2016)

    Google Scholar 

  57. Göbel, R., Henrich, A., Niemann, R., Blank, D.: A hybrid index structure for geo-textual searches. In: CIKM, pp. 1625–1628 (2009)

  58. Guo, L., Shao, J., Aung, H.H., Tan, K.L.: Efficient continuous top-k spatial keyword queries on road networks. In: GeoInformatica, pp. 29–60 (2015)

  59. Guo, L., Zhang, D., Li, G., Tan, K., Bao, Z.: Location-aware pub/sub system: When continuous moving queries meet dynamic event streams. In: SIGMOD, pp. 843–857 (2015)

  60. Guo, T., Cao, X., Cong, G.: Efficient algorithms for answering the m-closest keywords query. In: SIGMOD, pp. 405–418 (2015)

  61. Güting, R.H., Valdés, F., Damiani, M.L.: Symbolic trajectories. TSAS 1(2), 1–51 (2015)

    Article  Google Scholar 

  62. Guttman, A.: R-trees: A dynamic index structure for spatial searching. In: SIGMOD, pp. 47–57 (1984)

  63. Hdfs architecture guide. https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html. Accessed 10 Nov 2020

  64. Haghani, P., Michel, S., Aberer, K.: The gist of everything new: Personalized top-k processing over web 2.0 streams. In: CIKM, pp. 489–498 (2010)

  65. Hariharan, R., Hore, B., Li, C., Mehrotra, S.: Processing spatial-keyword (sk) queries in geographic information retrieval (gir) systems. In: SSDBM, p. 16 (2007)

  66. He, P., Xu, H., Zhao, X., Shen, Z.: Scalable collective spatial keyword query. In: ICDE Workshops, pp. 182–189 (2015)

  67. Hershberger, J., Shrivastava, N., Suri, S., Tóth, C.D.: Adaptive spatial partitioning for multidimensional data streams. In: ISAAC, pp. 522–533 (2005)

  68. Hoang-Vu, T.A., Vo, H.T., Freire, J.: A unified index for spatio-temporal keyword queries. In: CIKM, pp. 135–144 (2016)

  69. Hu, H., Liu, Y., Li, G., Feng, J., Tan, K.: A location-aware publish/subscribe framework for parameterized spatio-textual subscriptions. In: ICDE, pp. 711–722 (2015)

  70. Hu, J., Cheng, R., Wu, D., Jin, B.: Efficient top-k subscription matching for location-aware publish/subscribe. In: SSTD, pp. 333–351 (2015)

  71. Huang, W., Li, G., Tan, K., Feng, J.: Efficient safe-region construction for moving top-k spatial keyword queries. In: CIKM, pp. 932–941 (2012)

  72. Huiqi, H., Guoliang, L., Zhifeng, B., Jianhua, F., Yongwei, W., Zhiguo, G., Yaoqing, X.: Top-k spatio-textual similarity join. TKDE 28(2), 551–565 (2015)

    Google Scholar 

  73. Introduction to postgis. https://postgis.net/workshops/postgis-intro/index.html. Accessed 10 Nov 2020

  74. Jiang, J., Lu, H., Yang, B., Cui, B.: Finding top-k local users in geo-tagged social media data. In: ICDE, pp. 267–278 (2015)

  75. Kanza, Y., Levin, R., Safra, E., Sagiv, Y.: An interactive approach to route search. In: SIGSPATIAL, pp. 408–411 (2009)

  76. Kanza, Y., Safra, E., Sagiv, Y.: Route search over probabilistic geospatial data. In: SSTD, pp. 153–170 (2009)

  77. Kanza, Y., Safra, E., Sagiv, Y., Doytsher, Y.: Heuristic algorithms for route-search queries over geographical data. In: SIGSPATIAL, pp. 1–10 (2008)

  78. Khodaei, A., Shahabi, C., Li, C.: Hybrid indexing and seamless ranking of spatial and textual features of web documents. DEXA 1, 450–466 (2010)

    Google Scholar 

  79. Krumm, J., Horvitz, E.: Eyewitness: identifying local events via space-time signals in twitter feeds. In: SIGSPATIAL, pp. 20:1–20:10 (2015)

  80. Lee, T., Park, J.W., Lee, S., Hwang, S.W., Elnikety, S., He, Y.: Processing and optimizing main memory spatial-keyword queries. PVLDB 9(3), 132–143 (2015)

    Google Scholar 

  81. Li, G., Feng, J., Xu, J.: DESKS: direction-aware spatial keyword search. In: ICDE, pp. 474–485 (2012)

  82. Li, G., Wang, Y., Wang, T., Feng, J.: Location-aware publish/subscribe. In: KDD, pp. 802–810 (2013)

  83. Li, J., Wang, H., Li, J., Gao, H.: Skyline for geo-textual data. In: GeoInformatica, pp. 453–469 (2016)

  84. Li, J., Yang, Y.D., Mamoulis, N.: Optimal route queries with arbitrary order constraints. TKDE 25(5), 1097–1110 (2013)

    Google Scholar 

  85. Li, M., Chen, L., Cong, G., Gu, Y., Yu, G.: Efficient processing of location-aware group preference queries. In: CIKM, pp. 559–568 (2016)

  86. Li, R., Lei, K.H., Khadiwala, R., Chang, K.C.: TEDAS: A twitter-based event detection and analysis system. In: ICDE, pp. 1273–1276 (2012)

  87. Li, Y., Yang, W., Dan, W., Xie, Z.: Keyword-aware dominant route search for various user preferences. In: DASFAA, pp. 207–222 (2015)

  88. Li, Z., Lee, K.C.K., Zheng, B., Lee, W.C., Lee, D.L., Wang, X.: Ir-tree: an efficient index for geographic document search. TKDE 23(4), 585–599 (2011)

    Google Scholar 

  89. Liu, J., Deng, K., Sun, H., Yu, G., Zhou, X., Jensen, C.S.: Clue-based spatio-textual query. PVLDB 10(5), 529–540 (2017)

    Google Scholar 

  90. Liu, S., Li, G., Feng, J.: A prefix-filter based method for spatio-textual similarity join. TKDE 26(10), 2354–2367 (2014)

    Google Scholar 

  91. Long, C., Wong, R.C., Wang, K., Fu, A.W.: Collective spatial keyword queries: a distance owner-driven approach. In: SIGMOD, pp. 689–700 (2013)

  92. Lu, J., Lu, Y., Cong, G.: Reverse spatial and textual k nearest neighbor search. In: SIGMOD, pp. 349–360 (2011)

  93. Lu, Y., Cong, G., Lu, J., Shahabi, C.: Efficient algorithms for answering reverse spatial-keyword nearest neighbor queries. In: SIGSPATIAL, pp. 82:1–82:4 (2015)

  94. Lu, Y., Lu, J., Cong, G., Wu, W., Shahabi, C.: Efficient algorithms and cost models for reverse spatial-keyword k-nearest neighbor search. TODS 39(2), 13 (2014)

    Article  MathSciNet  MATH  Google Scholar 

  95. Lu, Y., Zhang, M., Witherspoon, S., Yesha, Y., Yesha, Y., Rishe, N.: Sksopen: efficient indexing, querying, and visualization of geo-spatial big data. In: ICMLA, pp. 495–500 (2013)

  96. Luo, C., Junlin, L., Li, G., Wei, W., Li, Y., Li, J.: Efficient reverse spatial and textual k nearest neighbor queries on road networks. Knowl.-Based Syst. 93, 121–134 (2016)

    Article  Google Scholar 

  97. Luo, S., Luo, Y., Zhou, S., Cong, G., Guan, J.: Disks: a system for distributed spatial group keyword search on road networks. PVLDB 5(12), 1966–1969 (2012)

    Google Scholar 

  98. Luo, S., Luo, Y., Zhou, S., Cong, G., Guan, J., Yong, Z.: Distributed spatial keyword querying on road networks. In: EDBT, pp. 235–246 (2014)

  99. Mysql 8.0 reference manual. https://dev.mysql.com/doc/refman/8.0/en/. Accessed 10 Nov 2020

  100. Ma, Y., Zhang, Y., Meng, X.: St-hbase: a scalable data management system for massive geo-tagged objects. In: WAIM, pp. 155–166 (2013)

  101. Machanavajjhala, A., Vee, E., Garofalakis, M., Shanmugasundaram, J.: Scalable ranked publish/subscribe. PVLDB 1(1), 451–462 (2008)

    Google Scholar 

  102. Magdy, A., Aly, A.M., Mokbel, M.F., Elnikety, S., He, Y., Nath, S., Aref, W.G.: Geotrend: Spatial trending queries on real-time microblogs. In: SIGSPATIAL, pp. 7:1–7:10 (2016)

  103. Mahmood, A., Daghistani, A., Aly, A., Tang, M., Basalamah, S., Prabhakar, S., Aref, W.: Adaptive processing of spatial-keyword data over a distributed streaming cluster. In: SIGSPATIAL, pp. 219–228 (2018)

  104. Mahmood, A.R., Aly, A.M., Aref, W.G.: FAST: frequency-aware indexing for spatio-textual data streams. In: ICDE, pp. 305–316 (2018)

  105. Mahmood, A.R., Aly, A.M., Qadah, T., Rezig, E.K., Daghistani, A., Madkour, A., Abdelhamid, A.S., Hassan, M.S., Aref, W.G., Basalamah, S.: Tornado: a distributed spatio-textual stream processing system. PVLDB 8(12), 2020–2023 (2015)

    Google Scholar 

  106. Mahmood, A.R., Aref, W.G.: Scalable processing of spatial-keyword queries. Synth. Lect. Data Manag. 11(1), 1–116 (2019)

    Article  Google Scholar 

  107. Mahmood, A.R., Aref, W.G., Aly, A.M., Tang, M.: Atlas: on the expression of spatial-keyword group queries using extended relational constructs. In: SIGSPATIAL, pp. 45:1–45:10 (2016)

  108. Mahmood, A.R., Punni, S., Aref, W.G.: Spatio-temporal access methods: a survey (2010–2017). GeoInformatica 23(1), 1–36 (2019)

    Article  Google Scholar 

  109. Meagher, D.: Geometric modeling using octree encoding. Comput. Graphics Image Process. 19(2), 129–147 (1982)

    Article  Google Scholar 

  110. Mehta, P., Skoutas, D., Voisard, A.: Spatio-temporal keyword queries for moving objects. In: SIGSPATIAL, pp. 55:1–55:4 (2015)

  111. Memon, I., Chen, L., Majid, A., Lv, M., Hussain, I., Chen, G.: Travel recommendation using geo-tagged photos in social media for tourist. Wirel. Pers. Commun. 80(4), 1347–1362 (2015)

    Article  Google Scholar 

  112. Nepal, S., Ramakrishna, M.: Query processing issues in image (multimedia) databases. In: ICDE, pp. 22–29. IEEE (1999)

  113. Nepomnyachiy, S., Gelley, B., Jiang, W., Minkus, T.: What, where, and when: keyword search with spatio-temporal ranges. In: GIR, pp. 2:1–2:8 (2014)

  114. Oracle help center. https://docs.oracle.com/en/. Accessed 10 Nov 2020

  115. Pat, B., Kanza, Y.: Wheres waldo? geosocial search over myriad geotagged posts. In: SIGSPATIAL, pp. 1–10 (2017)

  116. Quezada, M., Araya, V.P., Poblete, B.: Location-aware model for news events in social media. In: SIGIR, pp. 935–938 (2015)

  117. Ram, A., Sunita, J., Jalal, A., Manoj, K.: A density based algorithm for discovering density varied clusters in large spatial databases. IJCA 3(6), 1–4 (2010)

    Article  Google Scholar 

  118. Rao, J., Lin, J., Samet, H.: Partitioning strategies for spatio-textual similarity join. In: SIGSPATIAL, pp. 40–49 (2014)

  119. Regalado, A., Goncalves, M., Abad-Mota, S.: Evaluating skyline queries on spatial web objects. In: DEXA, pp. 416–423 (2012)

  120. Rocha-Junior, J.B., Gkorgkas, O., Jonassen, S., Nørvåg, K.: Efficient processing of top-k spatial keyword queries. In: SSTD, pp. 205–222 (2011)

  121. Rocha-Junior, J.B., Nørvåg, K.: Top-k spatial keyword queries on road networks. In: EDBT, pp. 168–179 (2012)

  122. Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes twitter users: real-time event detection by social sensors. In: WWW, pp. 851–860 (2010)

  123. Samet, H.: Foundations of multidimensional and metric data structures. Morgan Kaufmann series in data management systems. Academic Press (2006)

  124. Sankaranarayanan, J., Samet, H., Teitler, B.E., Lieberman, M.D., Sperling, J.: Twitterstand: news in tweets. In: SIGSPATIAL, pp. 42–51 (2009)

  125. Sharifzadeh, M., Kolahdouzan, M., Shahabi, C.: The optimal sequenced route query. VLDB J. 17(4), 765–787 (2008)

    Article  Google Scholar 

  126. Shi, J., Wu, D., Mamoulis, N.: Textually relevant spatial skylines. TKDE 28(1), 224–237 (2016)

    Google Scholar 

  127. Shraer, A., Gurevich, M., Fontoura, M., Josifovski, V.: Top-k publish-subscribe for social annotation of news. PVLDB 6(6), 385–396 (2013)

    Google Scholar 

  128. Skovsgaard, A., Jensen, C.S.: Finding top-k relevant groups of spatial web objects. PVLDB 24(4), 537–555 (2015)

    Google Scholar 

  129. Skovsgaard, A., Sidlauskas, D., Jensen, C.S.: Scalable top-k spatio-temporal term querying. In: ICDE, pp. 148–159 (2014)

  130. Su, S., Zhao, S., Cheng, X., Bi, R., Cao, X., Wang, J.: Group-based collective keyword querying in road networks. Inf. Process. Lett. 118, 83–90 (2017)

    Article  MathSciNet  MATH  Google Scholar 

  131. The mongodb4.4 manual mongodb manual. https://docs.mongodb.com/manual/. Accessed 10 Nov 2020

  132. Tao, Y., Sheng, C.: Fast nearest neighbor search with keywords. TKDE 26(4), 878–888 (2014)

    Google Scholar 

  133. Tsatsanifos, G., Vlachou, A.: On processing top-k spatio-textual preference queries. In: EDBT, pp. 433–444 (2015)

  134. Vaid, S., Jones, C.B., Joho, H., Sanderson, M.: Spatio-textual indexing for geographical search on the web. In: SSTD, pp. 218–235 (2005)

  135. Wang, B., Zhu, R., Yang, X., Wang, G.: Top-k representative documents query over geo-textual data stream. WWW pp. 537–555 (2018)

  136. Wang, X., Zhang, Y., Zhang, W., Lin, X.: Efficient identification of local keyword patterns in microblogging platforms. TKDE 28(10), 2621–2634 (2016)

    Google Scholar 

  137. Wang, X., Zhang, Y., Zhang, W., Lin, X., Huang, Z.: SKYPE: top-k spatial-keyword publish/subscribe over sliding window. PVLDB 9(7), 588–599 (2016)

    Google Scholar 

  138. Wang, X., Zhang, Y., Zhang, W., Lin, X., Wang, W.: Selectivity estimation on streaming spatio-textual data using local correlations. PVLDB 8(2), 101–112 (2014)

    Google Scholar 

  139. Wang, X., Zhang, Y., Zhang, W., Lin, X., Wang, W.: Ap-tree: Efficiently support continuous spatial-keyword queries over stream. In: ICDE, pp. 1107–1118 (2015)

  140. Watanabe, K., Ochi, M., Okabe, M., Onai, R.: Jasmine: a real-time local-event detection system based on geolocation information propagated to microblogs. In: CIKM, pp. 2541–2544 (2011)

  141. Wu, D., Choi, B., Xu, J., Jensen, C.S.: Authentication of moving top-k spatial keyword queries. TKDE 27(4), 922–935 (2015)

    Google Scholar 

  142. Wu, D., Cong, G., Jensen, C.S.: A framework for efficient spatial web object retrieval. VLDB J. 21(6), 797–822 (2012)

    Article  Google Scholar 

  143. Wu, D., Jensen, C.S.: A density-based approach to the retrieval of top-k spatial textual clusters. In: CIKM, pp. 2095–2100 (2016)

  144. Wu, D., Li, Y., Choi, B., Xu, J.: Social-aware top-k spatial keyword search. In: MDM, pp. 235–244 (2014)

  145. Wu, D., Yiu, M.L., Cong, G., Jensen, C.S.: Joint top-k spatial keyword query processing. TKDE 24(10), 1889–1903 (2012)

    Google Scholar 

  146. Wu, D., Yiu, M.L., Jensen, C.S.: Moving spatial keyword queries: formulation, methods, and analysis. TODS 38(1), 7 (2013)

    Article  MathSciNet  MATH  Google Scholar 

  147. Wu, D., Yiu, M.L., Jensen, C.S., Cong, G.: Efficient continuously moving top-k spatial keyword query processing. In: ICDE, pp. 541–552 (2011)

  148. Xie, X., Lin, X., Xu, J., Jensen, C.S.: Reverse keyword-based location search. In: ICDE, pp. 375–386 (2017)

  149. Xu, W., Chow, C., Yiu, M.L., Li, Q., Poon, C.K.: Mobifeed: a location-aware news feed framework for moving users. GeoInformatica 19(3), 633–669 (2015)

    Article  Google Scholar 

  150. Xu, Y., Chen, L., Yao, B., Shang, S., Zhu, S., Zheng, K., Li, F.: Location-based top-k term querying over sliding window. In: WISE, pp. 299–314 (2017)

  151. Yang, J., Zhang, Y., Zhou, X., Wang, J., Hu, H., Xing, C.: A hierarchical framework for top-k location-aware error-tolerant keyword search. In: ICDE, pp. 986–997 (2019)

  152. Yang, M., Zheng, L., Lu, Y., Guo, M., Li, J.: Cloud-assisted spatio-textual k nearest neighbor joins in sensor networks. In: INISCom, pp. 12–17 (2015)

  153. Yao, B., Tang, M., Li, F.: Multi-approximate-keyword routing in GIS data. In: SIGSPATIAL, pp. 201–210 (2011)

  154. Yu, M., Li, G., Feng, J.: A cost-based method for location-aware publish/subscribe services. In: CIKM, pp. 693–702 (2015)

  155. Yu, M., Li, G., Wang, T., Feng, J., Gong, Z.: Efficient filtering algorithms for location-aware publish/subscribe. TKDE 27(4), 950–963 (2015)

    Google Scholar 

  156. Zhang, C., Liu, L., Lei, D., Yuan, Q., Zhuang, H., Hanratty, T., Han, J.: Triovecevent: Embedding-based online local event detection in geo-tagged tweet streams. In: KDD, pp. 595–604 (2017)

  157. Zhang, C., Zhang, Y., Zhang, W., Lin, X.: Inverted linear quadtree: Efficient top k spatial keyword search. In: ICDE, pp. 901–912 (2013)

  158. Zhang, C., Zhang, Y., Zhang, W., Lin, X., Cheema, M.A., Wang, X.: Diversified spatial keyword search on road networks. In: EDBT, pp. 367–378 (2014)

  159. Zhang, C., Zhou, G., Yuan, Q., Zhuang, H., Zheng, Y., Kaplan, L.M., Wang, S., Han, J.: Geoburst: Real-time local event detection in geo-tagged tweet streams. In: SIGIR, pp. 513–522 (2016)

  160. Zhang, D., Chee, Y.M., Mondal, A., Tung, A.K.H., Kitsuregawa, M.: Keyword search in spatial databases: Towards searching by document. In: ICDE, pp. 688–699 (2009)

  161. Zhang, D., Ooi, B.C., Tung, A.K.H.: Locating mapped resources in web 2.0. In: ICDE, pp. 521–532 (2010)

  162. Zhang, D., Tan, K.L., Tung, A.K.H.: Scalable top-k spatial keyword search. In: EDBT, pp. 359–370 (2013)

  163. Zhang, P., Lin, H., Yao, B., Lu, D.: Level-aware collective spatial keyword queries. Inf. Sci. 378, 194–214 (2017)

    Article  MathSciNet  MATH  Google Scholar 

  164. Zhang, S., Cheng, Y., Ke, D.: Event-radar: Real-time local event detection system for geo-tagged tweet streams. arXiv (2017)

  165. Zhang, Y., Ma, Y., Meng, X.: Efficient spatio-textual similarity join using mapreduce. In: IAT, pp. 52–59 (2014)

  166. Zhao, J., Gao, Y., Chen, G., Chen, R.: Why-not questions on top-k geo-social keyword queries in road networks. In: ICDE (2018)

  167. Zhao, J., Gao, Y., Chen, G., Jensen, C.S., Chen, R., Cai, D.: Reverse top-k geo-social keyword queries in road networks. In: ICDE, pp. 387–398 (2017)

  168. Zhao, K., Chen, L., Cong, G.: Topic exploration in spatio-temporal document collections. In: SIGMOD, pp. 985–998 (2016)

  169. Zhao, K., Cong, G., Chin, J.Y., Wen, R.: Exploring market competition over topics in spatio-temporal document collections. VLDB J. 28(1), 123–145 (2019)

    Article  Google Scholar 

  170. Zhao, P., Fang, H., Sheng, V.S., Li, Z., Xu, J., Wu, J., Cui, Z.: Monochromatic and bichromatic ranked reverse Boolean spatial keyword nearest neighbors search. WWW 20(1), 39–59 (2017)

    Google Scholar 

  171. Zhao, S., Cheng, X., Su, S., Shuang, K.: Popularity-aware collective keyword queries in road networks. GeoInformatica 21(3), 485–518 (2017)

    Article  Google Scholar 

  172. Zheng, B., Su, H., Hua, W., Zheng, K., Zhou, X., Li, G.: Efficient clue-based route search on road networks. TKDE 29(9), 1846–1859 (2017)

    Google Scholar 

  173. Zheng, B., Zheng, K., Jensen, C.S., Nguyen, Q.V.H., Su, H., Li, G., Zhou, X.: Answering why-not group spatial keyword queries. TKDE 32(1), 26–39 (2018)

    Google Scholar 

  174. Zheng, B., Zheng, K., Xiao, X., Su, H., Yin, H., Zhou, X., Li, G.: Keyword-aware continuous knn query on road networks. In: ICDE, pp. 871–882 (2016)

  175. Zheng, K., Su, H., Zheng, B., Shang, S., Xu, J., Liu, J., Zhou, X.: Interactive top-k spatial keyword queries. In: ICDE, pp. 423–434 (2015)

  176. Zhou, Y., Xie, X., Wang, C., Gong, Y., Ma, W.Y.: Hybrid index structures for location-based web search. In: CIKM, pp. 155–162 (2005)

Download references

Acknowledgements

This research was supported in part by MOE Tier-2 Grant MOE2019-T2-2-181, MOE Tier-1 Grant RG114/19, an NTU ACE Grant, and the Singtel Cognitive and Artificial Intelligence Lab for Enterprises (SCALE@NTU), which is a collaboration between Singapore Telecommunications Limited (Singtel) and Nanyang Technological University (NTU) that is funded by the Singapore Government through the Industry Alignment Fund Industry Collaboration Projects Grant, and by the Innovation Fund Denmark centre, DIREC. The authors would like to thank the anonymous reviewers for providing constructive feedback and valuable suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gao Cong.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, Z., Chen, L., Cong, G. et al. Location- and keyword-based querying of geo-textual data: a survey. The VLDB Journal 30, 603–640 (2021). https://doi.org/10.1007/s00778-021-00661-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00778-021-00661-w

Keywords

Navigation