
Analysis and evaluation of the top-\(k\) most influential location selection query


In this paper, we propose a new type of query that retrieves the top-\(k\) most influential locations from a candidate set \(C\), given a set of customers \(M\) and a set of existing facilities \(F\). The influence models the popularity of a facility. Such queries have wide applications in decision support systems. A naive solution, sequential scan (SS), scans all data sets sequentially, which is expensive. We therefore investigate two branch-and-bound algorithms for the query, namely Estimate Expanding Pruning (EEP) and Bounding Influence Pruning (BIP). Both algorithms follow a best-first traversal. To determine the traversal order, EEP leverages distance metrics between nodes, whereas BIP relies on half-plane pruning, which avoids the repetitive estimations in EEP. As our experiments show, BIP is much faster than SS, which in turn outperforms EEP, even though the worst-case complexity of EEP and BIP is worse than that of SS. To improve the efficiency, we further propose a Nearest Facility Circle Join (NFCJ) algorithm. NFCJ builds an influence R-tree on the influence relationship between customers and existing facilities, and joins the candidate R-tree with the influence R-tree to obtain the results. We compare all algorithms and conclude that NFCJ is the best solution, outperforming SS, EEP, and BIP by orders of magnitude.
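To make the naive baseline concrete, the following is a minimal sketch of a sequential-scan style computation. It is not the authors' implementation. It assumes, since the abstract does not define influence formally, that the influence of a candidate is the number of customers that are strictly closer to it than to their nearest existing facility (a reverse-nearest-neighbor style count, in line with the article's keywords). The function names `influence_counts` and `top_k_influential`, the 2D Euclidean distance, and the tie-breaking by input order are all illustrative assumptions.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def influence_counts(candidates: Sequence[Point],
                     customers: Sequence[Point],
                     facilities: Sequence[Point]) -> List[int]:
    """Count, for each candidate, the customers it would attract.

    Assumption (not stated in the abstract): a customer is attracted by a
    candidate if the candidate is strictly closer than the customer's
    nearest existing facility.
    """
    # Distance from each customer to its nearest existing facility.
    # With no facilities, every customer is attracted by any candidate.
    nearest_facility_dist = [
        min((math.dist(m, f) for f in facilities), default=math.inf)
        for m in customers
    ]
    counts = []
    for c in candidates:
        # Full scan over all customers for every candidate.
        count = sum(
            1
            for m, d in zip(customers, nearest_facility_dist)
            if math.dist(m, c) < d
        )
        counts.append(count)
    return counts


def top_k_influential(candidates: Sequence[Point],
                      customers: Sequence[Point],
                      facilities: Sequence[Point],
                      k: int) -> List[Tuple[Point, int]]:
    """Return up to k (candidate, influence) pairs, highest influence first.

    Ties are broken by the candidate's position in the input (an
    illustrative choice, not taken from the paper).
    """
    counts = influence_counts(candidates, customers, facilities)
    order = sorted(range(len(candidates)), key=lambda i: (-counts[i], i))
    return [(candidates[i], counts[i]) for i in order[:k]]
```

Every candidate is compared against every customer, and every customer against every facility, so the cost grows with \(|M| \cdot (|C| + |F|)\) distance computations. This exhaustive scanning is the expense that the pruning and join-based algorithms described above aim to avoid.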





This work was supported in part by the National Natural Science Foundation of China (No. 61272065), the Natural Science Foundation of Guangdong Province, China (No. S2012010009311), the Fundamental Research Funds for the Central Universities, SCUT (Grant No. 2012ZZ0088), and the Australian Research Council (ARC) Discovery Project DP130104587. Dr. Rui Zhang was supported by the ARC Future Fellowships Project FT120100832. Zeyi Wen was supported by the Commonwealth Scientific and Industrial Research Organisation (CSIRO).


Corresponding author

Correspondence to Rui Zhang.


Cite this article

Chen, J., Huang, J., Wen, Z. et al. Analysis and evaluation of the top-\(k\) most influential location selection query. Knowl Inf Syst 43, 181–217 (2015).


  • Reverse nearest neighbor
  • R-tree
  • Efficiency
  • Location selection