Abstract
The Skyline is one of the most important operator in multi-criteria decision making and can be useful for many applications such as customer information services, decision support and decision making systems. It selects the best tuples from a multi-dimensional database. The main problem of using Skyline is that the queries that compute it can be computationally expensive, so the best solution is to do it using a parallelized approach. In this paper we propose a parallel algorithm based on nearest neighbor search to compute the skyline using the Map Reduce framework. This method consist on computing in a considered region the nearest neighbor point to the origin and partitions the region where each new region is obtained by adding the constraint that the coordinate with respect to a dimension is upper bounded by that of the computed point in the same dimension. The algorithm applies the same method recursively through these regions of the partition computed at each step. We provide a parallel approach for this method based on the Map Reduce framework to come up with a solution to this problem by handling the independent regions in parallel using mappers and reducers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Borzsonyi, S., Kossmann, D., Stocker, K.: The skyline operator. In: IEEE International Conference on Data Engineering (ICDE), pp. 421–430, Heidelberg, Germany, 2–6 April 2001
Chen L., Hwang, K., Wu, J.: Mapreduce skyline query processing with a new angular partitioning approach. In: IPDPS Workshops, pp. 2262–2270 (2012)
Hjaltason, G., Samet, H.: Distance browsing in spatial databases. ACM Trans. Database Syst. 24(2), 265–318 (1999)
Kossmann, D., Ramsak, F., Rost, S.: Shooting stars in the sky: an online algorithm for skyline queries. In: VLDB (2002)
Park, Y., Min, J.-K., Shim, K.: Efficient processing of skyline queries using MapReduce. IEEE Trans. Knowl. Data Eng. 29(5), 1031–1044 (2017)
Roussopoulos, N., Kelly, S., Vincent, F.: Nearest neighbor queries. In: ACM Conference on the Management of Data SIGMOD, San Jose, CA, 22–25 May 1995, pp. 71–79 (1995)
Tan, K.-L., Eng, P.-K., Ooi, B.C.: Efficient progressive skyline computation. In: VLDB, pp. 301–310 (2001)
Zhang, J., Jiang, X., Ku, W.-S., Qin, X.: Efficient parallel skyline evaluation using MapReduce. IEEE Trans. Parallel Distrib. Syst. 27(7), 1996–2009 (2016)
Zhang, B., Zhou, S., Guan, J.: Adapting skyline computation to the mapreduce framework: algorithms and experiments. In: DASFAA Workshops, pp. 403–414 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Bouderar, B., Alaoui, L., Hadi, M.Y. (2019). A Parallel Nearest Neighbor Algorithm for Skyline Computation Using Map Reduce. In: Farhaoui, Y., Moussaid, L. (eds) Big Data and Smart Digital Environment. ICBDSDE 2018. Studies in Big Data, vol 53. Springer, Cham. https://doi.org/10.1007/978-3-030-12048-1_22
Download citation
DOI: https://doi.org/10.1007/978-3-030-12048-1_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-12047-4
Online ISBN: 978-3-030-12048-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)