Skip to main content
Log in

A survey of skyline processing in highly distributed environments

The VLDB Journal Aims and scope Submit manuscript

Abstract

During the last decades, data management and storage have become increasingly distributed. Advanced query operators, such as skyline queries, are necessary in order to help users to handle the huge amount of available data by identifying a set of interesting data objects. Skyline query processing in highly distributed environments poses inherent challenges and demands and requires non-traditional techniques due to the distribution of content and the lack of global knowledge. This paper surveys this interesting and still evolving research area, so that readers can easily obtain an overview of the state-of-the-art. We outline the objectives and the main principles that any distributed skyline approach has to fulfill, leading to useful guidelines for developing algorithms for distributed skyline processing. We review in detail existing approaches that are applicable for highly distributed environments, clarify the assumptions of each approach, and provide a comparative performance analysis. Moreover, we study the skyline variants each approach supports. Our analysis leads to a taxonomy of existing approaches. Finally, we present interesting research topics on distributed skyline computation that have not yet been explored.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Androutsellis-Theotokis S., Spinellis D.: A survey of peer-to-peer content distribution technologies. ACM Comput. Surv. 36(4), 335–371 (2004)

    Article  Google Scholar 

  2. Balke, W., Güntzer, U., Zheng, J.X.: Efficient distributed skylining for web information systems. In: Proceedings of International Conference on Extending Database Technology (EDBT), pp. 256–273 (2004)

  3. Börzsönyi, S., Kossmann, D., Stocker, K.: The skyline operator. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 421–432 (2001)

  4. Chan, C., Eng, P., Tan, K.: Stratified computation of skylines with partially-ordered domains. In: Proceedings of International Conference on Management of Data (SIGMOD), pp. 203–214 (2005)

  5. Chan, C., Jagadish, H., Tan, K., Tung, A., Zhang, Z.: Finding k-dominant skylines in high dimensional space. In: Proceedings of International Conference on Management of Data (SIGMOD), pp. 503–514 (2006a)

  6. Chan, C.Y., Jagadish, H.V., Tan, K.L., Tung, A.K.H., Zhang, Z.: On high dimensional skylines. In: Proceedings of International Conference on Extending Database Technology (EDBT), pp. 478–495 (2006b)

  7. Chen, B., Liang, W.: Progressive skyline query processing in wireless sensor networks. In: International Conference on Mobile Ad-hoc and Sensor Networks(MSN), pp. 17–24 (2009)

  8. Chen L., Cui B., Lu H.: Constrained skyline query processing against distributed data sites. IEEE Trans. Knowl. Data Eng. (TKDE) 23(2), 204–217 (2011)

    Article  Google Scholar 

  9. Chen, L., Cui, B., Lu, H., Xu, L., Xu, Q.: iSky: efficient and progressive skyline computing in a structured P2P network. In: Proceedings of the International Conference on Distributed Computing Systems (ICDCS), pp. 160–167 (2008)

  10. Chomicki, J., Godfrey, P., Gryz, J., Liang, D.: Skyline with presorting. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 717–816 (2003)

  11. Crespo, A., Garcia-Molina, H.: Routing indices for peer-to-peer systems. In: Proceedings of the International Conference on Distributed Computing Systems (ICDCS), pp. 23–30 (2002)

  12. Cui B., Chen L., Xu L., Lu H., Song G., Xu Q.: Efficient skyline computation in structured peer-to-peer systems. IEEE Trans. Knowl. Data Eng. (TKDE) 21(7), 1059–1072 (2009)

    Article  Google Scholar 

  13. Cui, B., Lu, H., Xu, Q., Chen, L., Dai, Y., Zhou, Y.: Parallel distributed processing of constrained skyline queries by filtering. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 546–555 (2008)

  14. Dellis, E., Seeger, B.: Efficient computation of reverse skyline queries. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 291–302 (2007)

  15. Ding, X., Jin, H.: Efficient and progressive algorithms for distributed skyline queries over uncertain data. IEEE Trans. Knowl. Data Eng. (TKDE) 99 (PrePrints) (2011). (To appear)

  16. Fotiadou, K., Pitoura, E.: BITPEER: continuous subspace skyline computation with distributed bitmap indexes. In: Proceedings of International Workshop on Data Management in Peer-to-Peer Systems (DaMaP), pp. 35–42 (2008)

  17. Godfrey, P., Shipley, R., Gryz, J.: Maximal vector computation in large data sets. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 229–240 (2005)

  18. Hose, K., Lemke, C., Sattler, K.: Processing relaxed skylines in PDMS using distributed data summaries. In: Proceedings of International Conference on Information and Knowledge Management (CIKM), pp. 425–434 (2006)

  19. Hose, K., Lemke, C., Sattler, K., Zinn, D.: A relaxed but not necessarily constrained way from the top to the sky. In: Proceedings of International Conference on Cooperative Information Systems (CoopIS), pp. 339–407 (2007)

  20. Huang, Z., Jensen, C.S., Lu, H., Ooi, B.C.: Skyline queries against mobile lightweight devices in manets. In: Proceedings of International Conference on Data Engineering (ICDE), p. 66 (2006)

  21. Jagadish, H., Ooi, B., Vu, Q.: BATON: a balanced tree structure for peer-to-peer networks. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 661–672 (2005)

  22. Khalefa, M., Mokbel, M., Levandoski, J.: Skyline query processing for incomplete data. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 556–565 (2008)

  23. Kossmann, D., Ramsak, F., Rost, S.: Shooting stars in the sky: an online algorithm for skyline queries. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 275–286 (2002)

  24. Kung H.T., Luccio F., Preparata F.P.: On finding the maxima of a set of vectors. J. ACM 22(4), 469–476 (1975)

    Article  MathSciNet  MATH  Google Scholar 

  25. Li, H., Tan, Q., Lee, W.: Efficient progressive processing of skyline queries in peer-to-peer systems. In: Proceedings of the International Conference on Scalable Information Systems (Infoscale), p. 26 (2006)

  26. Lin, X., Yuan, Y., Wang, W., Lu, H.: Stabbing the sky: efficient skyline computation over sliding windows. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 502–513 (2005)

  27. Lin, X., Yuan, Y., Zhang, Q., Zhang, Y.: Selecting stars: the k most representative skyline operator. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 86–95 (2007)

  28. Lo E., Yip K.Y., Lin K.I., Cheung D.W.: Progressive skylining over web-accessible databases. Data Knowl. Eng. (DKE) 57(2), 122–147 (2006)

    Article  Google Scholar 

  29. Papadias, D., Tao, Y., Fu, G., Seeger, B.: An optimal and progressive algorithm for skyline queries. In: Proceedings of International Conference on Management of Data (SIGMOD), pp. 467–478 (2003)

  30. Pei, J., Jiang, B., Lin, X., Yuan, Y.: Probabilistic skylines on uncertain data. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 15–26 (2007)

  31. Pei, J., Jin, W., Ester, M., Tao, Y.: Catching the best views of skyline: a semantic approach based on decisive subspaces. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 253–264 (2005)

  32. Preparata F.P., Shamos M.I.: Computational Geometry—An Introduction. Springer, Berlin (1985)

    Google Scholar 

  33. Ratnasamy, S., Francis, P., Handley, M., Karp, R., Schenker, S.: A scalable content-addressable network. In: Proceedings of Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications (SIGCOMM), pp. 161–172 (2001)

  34. Risson J., Moors T.: Survey of research towards robust peer-to-peer networks: search methods. Comput. Netw. 50(17), 3485–3521 (2006)

    Article  MATH  Google Scholar 

  35. Rocha-Junior, J.B., Vlachou, A., Doulkeridis, C., Nørvåg, K.: AGiDS: a grid-based strategy for distributed skyline query processing. In: Proceedings of International Conference on Data Management in Grid and Peer-to-Peer Systems (Globe), pp. 12–23 (2009)

  36. Rocha-Junior, J.B., Vlachou, A., Doulkeridis, C., Nørvåg, K.: Efficient execution plans for distributed skyline query processing. In: Proceedings of International Conference on Extending Database Technology (EDBT), pp. 271–282 (2011)

  37. Stoica, I., Morris, R., Karger, D., Kaashoek, M.F., Balakrishnan, H.: Chord: a scalable peer-to-peer lookup service for internet applications. In: Proceedings of Conference on Applications, technologies, architectures, and protocols for computer communications (SIGCOMM), pp. 149–160 (2001)

  38. Sun S., Huang Z., Zhong H., Dai D., Liu H., Li J.: Efficient monitoring of skyline queries over distributed data streams. Knowl. Inf. Syst. 25, 575–606 (2010)

    Article  Google Scholar 

  39. Tan, K.L., Eng, P.K., Ooi, B.C.: Efficient progressive skyline computation. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 301–310 (2001)

  40. Tao Y., Papadias D.: Maintaining sliding window skylines on data streams. IEEE Trans. Knowl. Data Eng. (TKDE) 18(3), 377–391 (2006)

    Article  Google Scholar 

  41. Tao, Y., Ding, L., Lin, X., Pei, J.: Distance-based representative skyline. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 892–903 (2009)

  42. Tao, Y., Xiao, X., Pei, J.: Subsky: efficient computation of skylines in subspaces. In: Proceedings of International Conference on Data Engineering (ICDE), p. 65 (2006)

  43. Valkanas, G., Papadopoulos, A.: Efficient and adaptive distributed skyline computation. In: International Conference on Scientific and Statistical Database Management (SSDBM), pp. 24–41 (2010)

  44. Vlachou, A., Doulkeridis, C., Kotidis, Y., Vazirgiannis, M.: SKYPEER: efficient subspace skyline computation over distributed data. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 416–425 (2007)

  45. Vlachou, A., Doulkeridis, C., Kotidis, Y.: Angle-based space partitioning for efficient parallel skyline computation. In: Proceedings of International Conference on Management of Data (SIGMOD), pp. 227–238 (2008)

  46. Vlachou A., Doulkeridis C., Kotidis Y., Vazirgiannis M.: Efficient routing of subspace skyline queries over highly distributed data. IEEE Trans. Knowl. Data Eng. (TKDE) 22(12), 1694–1708 (2010)

    Article  Google Scholar 

  47. Vlachou, A., Nørvåg, K.: Bandwidth-constrained distributed skyline computation. In: Proceedings of the International Workshop on Data Engineering for Wireless and Mobile Access (MobiDE), pp. 17–24 (2009)

  48. Wang, S., Ooi, B., Tung, A., Xu, L.: Efficient skyline query processing on peer-to-peer networks. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 1126–1135 (2007)

  49. Wang S., Vu Q.H., Ooi B.C., Tung A.K., Xu L.: Skyframe: a framework for skyline query processing in peer-to-peer systems. VLDB J. 18(1), 345–362 (2009)

    Article  Google Scholar 

  50. Wang, J., Wu, S., Gao, H., Li, J., Ooi, B.C.: Indexing multi-dimensional data in a cloud system. In: Proceedings of International Conference on Management of Data (SIGMOD), pp. 591–602 (2010)

  51. Wu, P., Zhang, C., Feng, Y., Zhao, B., Agrawal, D., Abbadi, A.: Parallelizing skyline queries for scalable distribution. In: Proceedings of International Conference on Extending Database Technology (EDBT), pp. 112–130 (2006)

  52. Xin, J., Wang, G., Chen, L., Zhang, X., Wang, Z.: Continuously maintaining sliding window skylines in a sensor network. In: Advances in Databases: Concepts, Systems and Applications (DASFAA), pp. 509–521 (2007)

  53. Yang, B., Garcia-Molina, H.: Designing a super-peer network. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 49–60 (2003)

  54. Yuan, Y., Lin, X., Liu, Q., Wang, W., Yu, J.X., Zhang, Q.: Efficient computation of the skyline cube. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 241–252 (2005)

  55. Zhang, Z., Yang, Y., Cai, R., Papadias, D., Tung, A.: Kernel-based skyline cardinality estimation. In: Proceedings of International Conference on Management of Data (SIGMOD), pp. 509–522 (2009)

  56. Zhu L., Tao Y., Zhou S.: Distributed skyline retrieval with low bandwidth consumption. IEEE Trans. Knowl. Data Eng. (TKDE) 21(3), 384–400 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Katja Hose.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hose, K., Vlachou, A. A survey of skyline processing in highly distributed environments. The VLDB Journal 21, 359–384 (2012). https://doi.org/10.1007/s00778-011-0246-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00778-011-0246-6

Keywords

Navigation