Advertisement

Non-index Based Skyline Analysis on High Dimensional Data with Uncertain Dimensions

  • Nurul Husna Mohd SaadEmail author
  • Hamidah Ibrahim
  • Fatimah Sidi
  • Razali Yaakob
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 838)

Abstract

The notion of skyline query is to find a set of objects that is not dominated by any other objects. Regrettably, existing works lack on how to conduct skyline queries on high dimensional uncertain data with objects represented as continuous ranges and exact values, which in this paper is referred to as uncertain dimensions. Hence, in this paper we define skyline queries over data with uncertain dimensions and propose an algorithm, SkyQUD, to efficiently answer skyline queries. The SkyQUD algorithm determines skyline objects through three methods that guaranteed the probability of each object being in the final skyline results: exact domination, range domination, and uncertain domination. The algorithm has been validated through extensive experiments employing real and synthetic datasets. Results exhibit our proposed algorithm is efficient and scalable in answering skyline query on high dimensional and large datasets with uncertain dimensions.

Keywords

Skyline query Uncertain data Uncertain dimensions 

Notes

Acknowledgements

This research was supported by Ministry of Science, Technology, and Innovation under the Fundamental Research Grant Scheme (Grant no. 08-01-16-1853FR). All opinions, findings, conclusions and recommendations in this paper are those of the authors and do not necessarily reflect the views of the funding agencies. We thank the anonymous reviewers for their comments.

References

  1. 1.
    Atallah, M., Qi, Y.: Computing all skyline probabilities for uncertain data. In: Proceedings of the ACM SIGMOD-SIGACT-SIGART Symposium of the Principles of Database Systems (PODS), pp. 279–287 (2009)Google Scholar
  2. 2.
    Berchtold, S., Keim, D.A., Kriegel, H.P.: The X-tree: an index structure for high-dimensional data. In: Proceedings of the 22nd International Conference on Very Large Data Bases (VLDB), pp. 28–39 (1996)Google Scholar
  3. 3.
    Böhm, C., Fiedler, F., Oswald, A., Plant, C., Wackersreuther, B.: Probabilistic skyline queries. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM), pp. 651–660 (2009)Google Scholar
  4. 4.
    Börzsönyi, S., Kossmann, D., Stocker, K.: The skyline operator. In: Proceedings of the 17th International Conference on Data Engineering (ICDE), pp. 421–430 (2001)Google Scholar
  5. 5.
    Chan, C.-Y., Jagadish, H.V., Tan, K.-L., Tung, A.K.H., Zhang, Z.: On high dimensional skylines. In: Ioannidis, Y., et al. (eds.) EDBT 2006. LNCS, vol. 3896, pp. 478–495. Springer, Heidelberg (2006).  https://doi.org/10.1007/11687238_30CrossRefGoogle Scholar
  6. 6.
    Chan, C.-Y., Jagadish, H.V., Tan, K.-L., Tung, A.K.H., Zhang, Z.: Finding k-dominant skylines in high dimensional space. In: Proceedings of International Conference on Management of Data (SIGMOD), pp. 503–514 (2006)Google Scholar
  7. 7.
    Chomicki, J., Godfrey, P., Gryz, J., Liang, D.: Skyline with presorting. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 717–816 (2003)Google Scholar
  8. 8.
    Godfrey, P., Shipley, R., Gryz, J.: Maximal vector computation in large data sets. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 229–240 (2005)Google Scholar
  9. 9.
    Khalefa, M.E., Mokbel, M.F., Levandoski, J.J.: Skyline query processing for uncertain data. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management (CIKM), pp. 1293–1296 (2010)Google Scholar
  10. 10.
    Kossmann, D., Ramsak, F., Rost, S.: Shooting stars in the sky: an online algorithm for skyline queries. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 275–286 (2002)Google Scholar
  11. 11.
    Li, X., Wang, Y., Li, X., Wang, G.: Skyline query processing on interval uncertain data. In: IEEE 15th International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing Workshops, pp. 87–92 (2012)Google Scholar
  12. 12.
    Mokbel, M.F., Levandoski, J.J.: Toward context and preference-aware location-based services. In: Proceedings of the International Workshop on Data Engineering for Wireless and Mobile Access, pp. 25–35 (2009)Google Scholar
  13. 13.
    Papadias, D., Tao, Y., Fu, G., Seeger, B.: Progressive skyline computation in database systems. ACM Trans. Database Syst. 30(1), 41–82 (2005)CrossRefGoogle Scholar
  14. 14.
    Pei, J., Jiang, B., Lin, X., Yuan, Y.: Probabilistic skylines on uncertain data. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 15–26 (2007)Google Scholar
  15. 15.
    Ross, S.M.: Introduction to Probability Models, 8th edn. American Press, San Diego (2003)zbMATHGoogle Scholar
  16. 16.
    Tan, K.L., Eng, P.K., Ooi, B.C.: Efficient progressive skyline computation. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 301–310 (2001)Google Scholar
  17. 17.
    Yong, H., Kim, J.-H., Hwang. S.-W.: Skyline ranking for uncertain data with maybe confidence. In: Proceedings of the 2008 IEEE 24th International Conference on Data Engineering Workshop (ICDEW), pp. 572–579 (2008)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  • Nurul Husna Mohd Saad
    • 1
    Email author
  • Hamidah Ibrahim
    • 1
  • Fatimah Sidi
    • 1
  • Razali Yaakob
    • 1
  1. 1.Faculty of Computer Science and Information TechnologyUniversiti Putra MalaysiaSelangorMalaysia

Personalised recommendations