Advertisement

The VLDB Journal

, Volume 13, Issue 4, pp 370–383 | Cite as

Retrospective on Aurora

  • Hari BalakrishnanEmail author
  • Magdalena Balazinska
  • Don Carney
  • Uğur Çetintemel
  • Mitch Cherniack
  • Christian Convey
  • Eddie Galvez
  • Jon Salz
  • Michael Stonebraker
  • Nesime Tatbul
  • Richard Tibbetts
  • Stan Zdonik
Article

Abstract.

This experience paper summarizes the key lessons we learned throughout the design and implementation of the Aurora stream-processing engine. For the past 2 years, we have built five stream-based applications using Aurora. We first describe in detail these applications and their implementation in Aurora. We then reflect on the design of Aurora based on this experience. Finally, we discuss our initial ideas on a follow-on project, called Borealis, whose goal is to eliminate the limitations of Aurora as well as to address new key challenges and applications in the stream-processing domain.

Keywords:

Data stream management Stream-processing engines Monitoring applications Distributed stream processing Quality-of-service 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    A guide for hot lane development: A U.S. Department of Transportation Federal Highway Administration. http://www.itsdocs.fhwa.dot.gov/JPODOCS/REPTS\_TE/13668.htmlGoogle Scholar
  2. 2.
    Abadi D, Carney D, Çetintemel U, Cherniack M, Convey C, Erwin C, Galvez E, Hatoun M, Hwang J, Maskey A, Rasin A, Singer A, Stonebraker M, Tatbul N, Xing Y, Yan R, Zdonik S (2003) Aurora: A data stream management system (demo description). In: ACM SIGMODGoogle Scholar
  3. 3.
    Abadi D, Carney D, Çetintemel U, Cherniack M, Convey C, Lee S, Stonebraker M, Tatbul N, Zdonik S (2003) Aurora: A new model and architecture for data stream management. VLDB J 12(2):120-139CrossRefGoogle Scholar
  4. 4.
    Arasu A, Cherniack M, Galvez E, Maier D, Maskey A, Ryvkina E, Stonebraker M, Tibbetts R (2004) Linear Road: A benchmark for stream data management systems. In: VLDB conference, Toronto (in press)Google Scholar
  5. 5.
    Balazinska M, Balakrishnan H, Stonebraker M (2004) Contract-based load management in federated distributed systems. In: NSDI symposiumGoogle Scholar
  6. 6.
    Barlett J, Gray J, Horst B (1986) Fault tolerance in tandem computer systems. Technical Report TR-86.2, Tandem ComputersGoogle Scholar
  7. 7.
    Carney D, Çetintemel U, Cherniack M, Convey C, Lee S, Seidman G, Stonebraker M, Tatbul N, Zdonik S (2002) Monitoring streams - a new class of data management applications. In: VLDB conference, Hong KongGoogle Scholar
  8. 8.
    Carney D, Çetintemel U, Rasin A, Zdonik S, Cherniack M, Stonebraker M (2003) Operator scheduling in a data stream manager. In: VLDB conference, Berlin, GermanyGoogle Scholar
  9. 9.
    Chandrasekaran S, Deshpande A, Franklin M, Hellerstein J, Hong W, Krishnamurthy S, Madden S, Raman V, Reiss F, Shah M (2003) TelegraphCQ: Continuous dataflow processing for an uncertain world. In: CIDR conferenceGoogle Scholar
  10. 10.
    Cherniack M, Balakrishnan H, Balazinska M, Carney D, Çetintemel U, Xing Y, Zdonik S (2003) Scalable distributed stream processing. In: CIDR conference, Asilomar, CAGoogle Scholar
  11. 11.
    Congestion pricing: a report from intelligent transportation systems (ITS). http://www.path.berkeley.eduGoogle Scholar
  12. 12.
    DeWitt D, Naughton J, Schneider D (1991) An evaluation of non-equijoin algorithms. In: VLDB conference, Barcelona, Catalonia, SpainGoogle Scholar
  13. 13.
    Hwang J, Balazinska M, Rasin A, Çetintemel U, Stonebraker M, Zdonik S (2003) A comparison of stream-oriented high-availability algorithms. Technical Report CS-03-17, Department of Computer Science, Brown University, Providence, RIGoogle Scholar
  14. 14.
    Lerner A, Shasha D (2003) AQuery: Query language for ordered data, optimization techniques, and experiments. In: VLDB conference, Berlin, GermanyGoogle Scholar
  15. 15.
    Motwani R, Widom J, Arasu A, Babcock B, Babu S, Datar M, Manku G, Olston C, Rosenstein J, Varma R (2003) Query processing, approximation, and resource management in a data stream management system. In: CIDR conferenceGoogle Scholar
  16. 16.
    Poole RW (2002) Hot lanes prompted by federal program. http://www.rppi.org/federalhotlanes.htmlGoogle Scholar
  17. 17.
    Seshadri P, Livny M, Ramakrishnan R (1995) SEQ: A model for sequence databases. In: IEEE ICDE conference, Taipei, TaiwanGoogle Scholar
  18. 18.
    Tatbul N, Çetintemel U, Zdonik S, Cherniack M, Stonebraker M (2003) Load shedding in a data stream manager. In: VLDB conference, Berlin, GermanyGoogle Scholar
  19. 19.
    The MITRE Corporation. http://www.mitre.org/Google Scholar
  20. 20.
    US Army Medical Research and Materiel Command. https://mrmc-www.army.mil/Google Scholar

Copyright information

© Springer-Verlag Berlin/Heidelberg 2004

Authors and Affiliations

  • Hari Balakrishnan
    • 3
    Email author
  • Magdalena Balazinska
    • 3
  • Don Carney
    • 2
  • Uğur Çetintemel
    • 2
  • Mitch Cherniack
    • 1
  • Christian Convey
    • 2
  • Eddie Galvez
    • 1
  • Jon Salz
    • 3
  • Michael Stonebraker
    • 3
  • Nesime Tatbul
    • 2
  • Richard Tibbetts
    • 3
  • Stan Zdonik
    • 2
  1. 1.Department of Computer ScienceBrandeis UniversityWalthamUSA
  2. 2.Department of Computer ScienceBrown UniversityProvidenceUSA
  3. 3.Department of EECS and Laboratory of Computer ScienceMassachussetts Institute of TechnologyCambridgeUSA

Personalised recommendations