Abstract
A geofence is a virtual perimeter for a real-world geographic area. Geofencing is a technique used to monitor a geographical area by dividing it into smaller subareas demarcated by geofences. It can be used to create triggers whenever a device moves across a geofence to provide useful location-based services. Since real-world objects tend to move continuously, it is essential to provide these services in real-time to be effective. Towards this objective, this paper introduces a scalable data pipeline for geofencing that can reliably handle and process data streams with high velocity using Apache Pulsar - an open-source Publish/Subscribe messaging system that has both stream processing and light-weight computational capabilities. Further, an implementation of the proposed data pipeline for a specific real-world case study is presented to demonstrate the envisaged advantages of the same.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., Tzoumas, K.: Apache flink: stream and batch processing in a single engine. Bull. IEEE Comput. Soc. Tech. Comm. Data Eng. 36(4), 28–38 (2015)
Guttman, A.: R-trees: a dynamic index structure for spatial searching. ACM SIGMOD Rec. 14(2), 1–11 (1984). https://doi.org/10.1145/971697.602266
Hunt, P., Konar, M., Junqueira, F.P., Reed, B.: ZooKeeper: wait-free coordination for internet-scale systems. In: Proceedings of the USENIX Annual Technical Conference, June 2010
Junqueira, F.P., Kelly, I., Reed, B.: Durability with BookKeeper. ACM SIGOPS Oper. Syst. Rev. 47(1), 9–15 (2013)
Kreps, J., Narkhede, N., Rao, J.: Kafka: a distributed messaging system for log processing. In: Proceedings of the NetDB, pp. 1–7 (2011)
Lakshman, A., Malik, P.: Cassandra: a decentralized structured storage system. ACM SIGOPS Oper. Syst. Rev. 44(2), 35–40 (2010)
Târnaucă, B., Puiu, D., Nechifor, S., Comnac, V.: Using complex event processing for implementing a geofencing service. In: IEEE 11th International Symposium on Intelligent Systems and Informatics, Subotica, Serbia, pp. 1–6, September 2013
Wang, Y., et al.: An open-source infrastructure for real-time automatic agricultural machine data processing. In: 2017 ASABE Annual International Meeting, Spokane, Washington, pp. 1–13, July 2017. https://doi.org/10.13031/aim.201701022
Wang, J., Wang, W., Chen, R.: Distributed data streams processing based on Flume/Kafka/Spark. In: 2015 3rd International Conference on Mechatronics and Industrial Informatics, Zhuhai, China, pp. 948–952, October 2015. https://doi.org/10.2991/icmii-15.2015.167
Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud 2010, Berkeley, CA, USA, p. 10 (2010)
Building a Scalable Geolocation Telemetry System Using the Maps API. https://cloud.google.com/solutions/scalable-geolocation-telemetry-system-using-maps-api. Accessed 9 Aug 2020
Kafkaesque. https://kafkaesque.io/performance-comparison-between-apache-pulsar-and-kafka-latency/. Accessed 4 Jan 2021
Pulsar Architecture. https://pulsar.apache.org/docs/en/concepts-architecture-overview/. Accessed 4 Jan 2021
OpenMessaging Benchmark. http://openmessaging.cloud/. Accessed 4 Jan 2021
RabbitMQ. https://www.rabbitmq.com/documentation.html. Accessed 4 Jan 2021
Apache Pulsar. https://pulsar.apache.org/. Accessed 4 Jan 2021
Apache Flume. https://flume.apache.org/. Accessed 4 Jan 2021
Redis. https://redis.io/. Accessed 4 Jan 2021
Memcached. https://memcached.org/. Accessed 4 Jan 2021
Aerospike. https://www.aerospike.com/. Accessed 4 Jan 2021
Google BigQuery. https://cloud.google.com/bigquery. Accessed 4 Jan 2021
MongoDB. https://www.mongodb.com/. Accessed 4 Jan 2021
Amazon Redshift. https://aws.amazon.com/redshift/. Accessed 4 Jan 2021
LucidDB. https://dbdb.io/db/luciddb. Accessed 4 Jan 2021
PostgreSQL. https://www.postgresql.org/. Accessed 4 Jan 2021
Chicago Taxi Trips. https://www.kaggle.com/chicago/chicago-taxi-trips-bq/. Accessed 4 Jan 2021
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 IFIP International Federation for Information Processing
About this paper
Cite this paper
Sundar Rajan, K., Vishal, A., Babu, C. (2021). A Scalable Data Pipeline for Realtime Geofencing Using Apache Pulsar. In: Krishnamurthy, V., Jaganathan, S., Rajaram, K., Shunmuganathan, S. (eds) Computational Intelligence in Data Science. ICCIDS 2021. IFIP Advances in Information and Communication Technology, vol 611. Springer, Cham. https://doi.org/10.1007/978-3-030-92600-7_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-92600-7_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92599-4
Online ISBN: 978-3-030-92600-7
eBook Packages: Computer ScienceComputer Science (R0)