Journal on Data Semantics, Volume 4, Issue 4, pp 231–267

Impact Analysis and Policy-Conforming Rewriting of Evolving Data-Intensive Ecosystems

  • Petros Manousis
  • Panos Vassiliadis
  • George Papastefanatos
Original Article

Abstract

Data-intensive ecosystems are conglomerations of data repositories surrounded by applications that depend on them for their operation. In this paper, we address the problem of performing what-if analysis for the evolution of the database part of a data-intensive ecosystem: identifying which other parts of the ecosystem are affected by a potential change in the database schema and what the ecosystem will look like once the change has been performed, while retaining the ability to regulate the flow of events. We model the ecosystem as a graph, uniformly covering relations, views, and queries as nodes and their internal structure and interdependencies as the edges of the graph. We provide a simple language to annotate the modules of the graph with policies for their response to evolutionary events. These policies regulate the flow of events and their impact by (i) vetoing (“blocking”) the change in parts that the developers want to retain unaffected and (ii) allowing (“propagating”) the change in parts that need to adapt to the new schema. Our method for the automatic adaptation of ecosystems is based on three algorithms that (i) assess the impact of a change, (ii) compute the need for different variants of an ecosystem’s components, depending on policy conflicts, and (iii) rewrite the modules to adapt to the change. We theoretically prove the coverage of the language, as well as the termination, consistency, and confluence of our algorithms, and experimentally verify our method’s effectiveness and efficiency.
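
As a rough illustration of the graph-and-policy idea described above, the following Python sketch propagates a schema-change event from a relation to the views and queries that depend on it, stopping wherever a module's policy is "block". It is a minimal sketch under stated assumptions, not the algorithms of the paper: the function `assess_impact`, the dictionaries `consumers` and `policies`, and the module names are all hypothetical, and the sketch covers only a simplified form of impact assessment, with no handling of variants or rewriting.

```python
from collections import deque


def assess_impact(consumers, policies, changed):
    """Breadth-first propagation of a change event (illustrative only).

    consumers: hypothetical map from a module to the modules that depend on it
    policies:  hypothetical map from a module to "propagate" or "block"
    changed:   the module where the change originates
    Returns (affected, blocked) as two sets of module names.
    """
    affected, blocked = set(), set()
    seen = {changed}
    queue = deque([changed])
    while queue:
        node = queue.popleft()
        for dep in consumers.get(node, []):
            if dep in seen:
                continue
            seen.add(dep)
            # Assumption: modules without an explicit policy propagate.
            if policies.get(dep, "propagate") == "block":
                blocked.add(dep)      # the module vetoes the change; stop here
            else:
                affected.add(dep)     # the module adapts and forwards the event
                queue.append(dep)
    return affected, blocked


# Hypothetical ecosystem: relation R feeds views V1 and V2,
# which in turn feed queries Q1 and Q2.
consumers = {"R": ["V1", "V2"], "V1": ["Q1"], "V2": ["Q2"]}
policies = {"V2": "block"}

affected, blocked = assess_impact(consumers, policies, "R")
print(sorted(affected), sorted(blocked))
```

In this toy setting, the event reaches `V1` and `Q1`, is vetoed at `V2`, and therefore never arrives at `Q2`.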

Keywords

Evolution, Data-intensive ecosystems, Adaptation

Copyright information

© Springer-Verlag Berlin Heidelberg 2015

Authors and Affiliations

  1. Department of Computer Science and Engineering, University of Ioannina, Ioannina, Greece
  2. Institute for the Management of Information Systems, Research Center “Athena”, Athens, Greece
