Ubiq: A Scalable and Fault-Tolerant Log Processing Infrastructure

Basker, Venkatesh; Bhatia, Manish; Ganeshan, Vinny; Gupta, Ashish; He, Shan; Holzer, Scott; Jiang, Haifeng; Lenart, Monica Chawathe; Melville, Navin; Qiu, Tianhao; Sikka, Namit; Singh, Manpreet; Smolyanov, Alexander; Vasilevski, Yuri; Venkataraman, Shivakumar; Agrawal, Divyakant

doi:10.1007/978-3-030-24124-7_10

Venkatesh Basker⁹,
Manish Bhatia⁹,
Vinny Ganeshan⁹,
Ashish Gupta⁹,
Shan He⁹,
Scott Holzer⁹,
Haifeng Jiang⁹,
Monica Chawathe Lenart⁹,
Navin Melville⁹,
Tianhao Qiu⁹,
Namit Sikka⁹,
Manpreet Singh⁹,
Alexander Smolyanov⁹,
Yuri Vasilevski⁹,
Shivakumar Venkataraman⁹ &
…
Divyakant Agrawal⁹

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 337))

Included in the following conference series:

362 Accesses

Abstract

Most of today’s Internet applications generate vast amounts of data (typically, in the form of event logs) that needs to be processed and analyzed for detailed reporting, enhancing user experience and increasing monetization. In this paper, we describe the architecture of Ubiq, a geographically distributed framework for processing continuously growing log files in real time with high scalability, high availability and low latency. The Ubiq framework fully tolerates infrastructure degradation and data center-level outages without any manual intervention. It also guarantees exactly-once semantics for application pipelines to process logs as a collection of multiple events. Ubiq has been in production for Google’s advertising system for many years and has served as a critical log processing framework for several dozen pipelines. Our production deployment demonstrates linear scalability with machine resources, extremely high availability even with underlying infrastructure failures, and an end-to-end latency of under a minute.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Abadi, D.J., et al.: Aurora: a new model and architecture for data stream management. VLDB J. 12(2), 120–139 (2003)
Article Google Scholar
Abadi, D.J., et al.: The design of the borealis stream processing engine. In: CIDR, pp. 277–289 (2005)
Google Scholar
Ananthanarayanan, R., et al.: Photon: fault-tolerant and scalable joining of continuous data streams. In: SIGMOD, pp. 577–588 (2013)
Google Scholar
Apache Flink (2014). http://flink.apache.org
Apache Samza (2014). http://samza.apache.org
Apache Storm (2013). http://storm.apache.org
Arasu, A., et al.: STREAM: the Stanford stream data manager. In: SIGMOD, p. 665 (2003)
Google Scholar
Chandra, T.D., et al.: Paxos made live - an engineering perspective. In: PODC, pp. 398–407 (2007)
Google Scholar
Chandrasekaran, S., et al.: TelegraphCQ: continuous dataflow processing. In: SIGMOD, p. 668 (2003)
Google Scholar
Chen, J., et al.: NiagaraCQ: a scalable continuous query system for internet databases. In: SIGMOD, pp. 379–390 (2000)
Google Scholar
Corbett, J.C., et al.: Spanner: Google’s globally distributed database. ACM Trans. Comput. Syst. 31(3), 8 (2013)
Article Google Scholar
Gupta, A., et al.: Mesa: geo-replicated, near real-time, scalable data warehousing. PVLDB 7(12), 1259–1270 (2014)
Google Scholar
Gupta, A., Shute, J.: High-availability at massive scale: building Google’s data infrastructure for ads. In: BIRTE (2015)
Google Scholar
Kulkarni, S., et al.: Twitter Heron: stream processing at scale. In: SIGMOD, SIGMOD 2015, pp. 239–250 (2015)
Google Scholar
Lamport, L.: The part-time parliament. ACM Trans. Comput. Syst. 16(2), 133–169 (1998)
Article Google Scholar
Verma, A., et al.: Large-scale cluster management at Google with Borg. In: EuroSys, pp. 18:1–18:17 (2015)
Google Scholar
Zaharia, M., et al.: Discretized streams: fault-tolerant streaming computation at scale. In: SOSP, pp. 423–438 (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Google Inc., Mountain View, USA
Venkatesh Basker, Manish Bhatia, Vinny Ganeshan, Ashish Gupta, Shan He, Scott Holzer, Haifeng Jiang, Monica Chawathe Lenart, Navin Melville, Tianhao Qiu, Namit Sikka, Manpreet Singh, Alexander Smolyanov, Yuri Vasilevski, Shivakumar Venkataraman & Divyakant Agrawal

Authors

Venkatesh Basker
View author publications
You can also search for this author in PubMed Google Scholar
Manish Bhatia
View author publications
You can also search for this author in PubMed Google Scholar
Vinny Ganeshan
View author publications
You can also search for this author in PubMed Google Scholar
Ashish Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Shan He
View author publications
You can also search for this author in PubMed Google Scholar
Scott Holzer
View author publications
You can also search for this author in PubMed Google Scholar
Haifeng Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Monica Chawathe Lenart
View author publications
You can also search for this author in PubMed Google Scholar
Navin Melville
View author publications
You can also search for this author in PubMed Google Scholar
Tianhao Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Namit Sikka
View author publications
You can also search for this author in PubMed Google Scholar
Manpreet Singh
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Smolyanov
View author publications
You can also search for this author in PubMed Google Scholar
Yuri Vasilevski
View author publications
You can also search for this author in PubMed Google Scholar
Shivakumar Venkataraman
View author publications
You can also search for this author in PubMed Google Scholar
Divyakant Agrawal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Manpreet Singh .

Editor information

Editors and Affiliations

Teradata, Santa Clara, CA, USA
Malu Castellanos
University of Pittsburgh, Pittsburgh, PA, USA
Panos K. Chrysanthis
University of Pittsburgh, Pittsburgh, PA, USA
Konstantinos Pelechrinis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Basker, V. et al. (2019). Ubiq: A Scalable and Fault-Tolerant Log Processing Infrastructure. In: Castellanos, M., Chrysanthis, P., Pelechrinis, K. (eds) Real-Time Business Intelligence and Analytics. BIRTE BIRTE BIRTE 2015 2016 2017. Lecture Notes in Business Information Processing, vol 337. Springer, Cham. https://doi.org/10.1007/978-3-030-24124-7_10

Download citation

DOI: https://doi.org/10.1007/978-3-030-24124-7_10
Published: 11 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-24123-0
Online ISBN: 978-3-030-24124-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics