Abstract
As an important type of multidimensional preference query, the skyline query can find a superset of optimal results when there is no given linear function to combine values for all attributes of interest. Its processing has been extensively investigated in the past. While most skyline query processing algorithms are designed based on the assumption that query processing is done for all attributes in a static dataset with deterministic attribute values, some advanced work has been done recently to remove part of such a strong assumption in order to process skyline queries for real-life applications, namely, to deal with data with multi-valued attributes (known as data uncertainty), to support skyline queries in a subspace which is a subset of attributes selected by the user, and to support continuous queries on streaming data. Naturally, there are many application scenarios where these three complex issues must be considered together. In this paper, we tackle the problem of probabilistic subspace skyline query processing over sliding windows on uncertain data streams. That is, to retrieve all objects from the most recent window of streaming data in a user-selected subspace with a skyline probability no smaller than a given threshold. Based on the subtle relationship between the full space and an arbitrary subspace, a novel approach using a regular grid indexing structure is developed for this problem. An extensive empirical study under various settings is conducted to show the effectiveness and efficiency of our PSS algorithm.
Similar content being viewed by others
References
Börzsönyi S, Kossmann D, Stocker K. The skyline operator. In Proc. the 17th ICDE, Apr. 2001, pp.421-430.
Kossmann D, Ramsak F, Rost S. Shooting stars in the sky: An online algorithm for skyline queries. In Proc. the 28th VLDB, Aug. 2002, pp.275-286.
Chomicki J, Godfrey P, Gryz J, Liang D. Skyline with presorting. In Proc. the 19th ICDE, Mar. 2003, pp.717-719.
Tao Y, Papadias D. Maintaining sliding window skylines on data streams. IEEE Trans. Knowl. Data Eng., 2006, 18(2): 377-391.
Zhang W, Lin X, Zhang Y, Wang W, Yu J X. Probabilistic skyline operator over sliding windows. In Proc. the 25th ICDE, Mar. 29-Apr. 2, 2009, pp.1060-1071.
Tan K L, Eng P K, Ooi B C. Efficient progressive skyline computation. In Proc. the 27th VLDB, Sept. 2001, pp.301-310.
Godfrey P, Shipley R, Gryz J. Maximal vector computation in large datasets. In Proc. the 31st VLDB, Aug. 30-Sept. 2, 2005, pp.229-240.
Papadias D, Tao Y, Fu G et al. An optimal and progressive algorithm for skyline queries. In Proc. the 22nd SIGMOD, Jun. 2003, pp.467-478.
Lee K C K, Zheng B, Li H, Lee W C. Approaching the skyline in Z order. In Proc. the 33rd VLDB, Sept. 2007, pp.279-290.
Lee J, Hwang S. BSkyTree: Scalable skyline computation using a balanced pivot selection. In Proc. the 13th EDBT, Mar. 2010, pp.195-206.
Zhang S, Mamoulis N, Cheung D W. Scalable skyline computation using object-based space partitioning. In Proc. the 28th SIGMOD, Jun. 29-July 2, 2009, pp.483-494.
Lin X, Zhang Y, Zhang W, Cheema M A. Stochastic skyline operator. In Proc. the 27th ICDE, Apr. 2011, pp.721-732.
Köhler H, Yang J, Zhou X. Efficient parallel skyline processing using hyperplane projections. In Proc. the 30th SIGMOD, Jun. 2011, pp.85-96.
Cho S R, Lee J, Hwang S W. Vskyline: Vectorization for efficient skyline computation. In Proc. the 29th SIGMOD, Jun. 2010, pp.19-26.
Kontaki M, Papadopoulos A N, Manolopoulos Y. Continuous top-k dominating queries in subspaces. In Proc. the 2008 Panhellenic Conference on Informatics, Aug. 2008, pp.31-35.
Yiu M L, Mamoulis N. Multi-dimensional top-k dominating queries. VLDB J., 2009, 18(3): 695-718.
Lian X, Chen L. Monochromatic and bichromatic reverse skyline search over uncertain databases. In Proc. the 2008 SIG-MOD, Jun. 2008, pp.213-226.
Zhang W, Lin X, Zhang Y, Pei J, Wang W. Threshold-based probabilistic top-k dominating queries. VLDB J., 2010, 19(2): 283-305.
Lin X, Yuan Y, Wang W, Lu H. Stabbing the sky: Efficient skyline computation over sliding windows. In Proc. the 21st ICDE, Apr. 2005, pp.502-513.
Morse M D, Patel J M, Grosky W I. Efficient continuous skyline computation. In Proc. the 22nd ICDE, Apr. 2006, p.108.
Pei J, Jin W, Ester M, Tao Y. Catching the best views of skyline: A semantic approach based on decisive subspaces. In Proc. the 31st VLDB, Aug. 30-Sept. 2, 2005, pp.253-264.
Yuan Y, Lin X, Liu Q, Wang W, Yu J X, Zhang Q. Efficient computation of the skyline cube. In Proc. the 31st VLDB, Oct. 2005, pp.241-252.
Tao Y, Xiao X, Pei J. SUBSKY: Efficient computation of skylines in subspaces. In Proc. the 22nd ICDE, Apr. 2006, p.65.
Pei J, Fu A W, Lin X, Wang H. Computing compressed multi-dimensional skyline cubes efficiently. In Proc. the 23rd ICDE, Apr. 2007, pp.96-105.
Raїssi C, Pei J, Kister T. Computing closed skycubes. PVLDB, 2010, 3(1): 838-847.
Pei J, Jiang B, Lin X, Yuan Y. Probabilistic skylines on uncertain data. In Proc. the 33rd VLDB, Sept. 2007, pp.15-26.
Atallah M J, Qi Y. Computing all skyline probabilities for uncertain data. In Proc. the 28th PODS, Jun. 29-July 2, 2009, pp.279-287.
Mouratidis K, Bakiras S, Papadias D. Continuous monitoring of top-k queries over sliding windows. In Proc. the 2006 SIGMOD, Jun. 2006, pp.635-646.
Kontaki M, Papadopoulos A N, Manolopoulos Y. Continuous top-k dominating queries. IEEE Trans. Knowl. Data Eng., 2012, 24(5): 840-853.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is supported by the National Natural Science Foundation of China under Grant Nos. 61073061, 61003044, 61303019, and the Natural Science Foundation of Colleges and Universities of Jiangsu Province of China under Grant No. 12KJB520017.
Electronic supplementary material
Below is the link to the electronic supplementary material.
ESM 1
(DOC 27 kb)
Rights and permissions
About this article
Cite this article
Zhao, L., Yang, YY. & Zhou, X. Continuous Probabilistic Subspace Skyline Query Processing Using Grid Projections. J. Comput. Sci. Technol. 29, 332–344 (2014). https://doi.org/10.1007/s11390-014-1434-9
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-014-1434-9