Incorporating Recovery from Failures into a Data Integration Benchmark

  • Len Wyatt
  • Brian Caufield
  • Marco Vieira
  • Meikel Poess
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7755)

Abstract

The proposed TPC-DI benchmark measures the performance of Data Integration systems (a.k.a. ETL systems) given the task of integrating data from an OLTP system and other data sources to create a data warehouse. This paper describes the scenario, structure and timing principles used in TPC-DI. Although failure recovery is very important in real deployments of Data Integration systems, certain complexities made it difficult to specify in the benchmark. Hence, failure recovery aspects have been scoped out of the current version of TPC-DI. The issues around failure recovery are discussed in detail and some options are described. Finally, the audience is invited to offer additional suggestions.

Keywords

Industry Standard Benchmarks, Data Integration, ETL, ACID properties, Durability, Dependability, Reliability, Recovery, Data Warehouse, Decision Support

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Len Wyatt
    • 1
  • Brian Caufield
    • 2
  • Marco Vieira
    • 3
  • Meikel Poess
    • 4
  1. Microsoft Corporation, USA
  2. IBM, USA
  3. CISUC - Department of Informatics Engineering, University of Coimbra, Portugal
  4. Oracle Corporation, Redwood Shores, USA