Supporting Complex Changes in Evolving Interrelated Web Databanks

  • Yannis Stavrakas
  • George Papastefanatos
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6426)

Abstract

In this paper we deal with problems occurring in evolving interrelated Web databanks. Examples of such databanks are networks of interlinked scientific repositories on the Web, managed independently by cooperating research groups. We argue that changes should not be treated solely as transforming operations, but rather as first class citizens retaining structural, semantic and temporal characteristics. We propose a graph model called evo-graph for capturing in a coherent way the inherent relationship between evolving data and changes applied on them. Evo-graph represents changes as arbitrarily complex objects, similarly to data objects. We discuss the temporal characteristics of the evo-graph, and show how the evo-graph can provide past snapshots of the data. To uniformly express temporal and provenance queries we introduce evo-path, a path expression language based on XPath. Evo-path takes advantage of complex changes in the evo-graph in order to answer queries that interpret and elucidate data evolution.

Keywords

Evolution of semistructured data change-centric management 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Amagasa, T., Yoshikawa, M., Uemura, S.: A Data Model for Temporal XML Documents. In: Ibrahim, M., Küng, J., Revell, N. (eds.) DEXA 2000. LNCS, vol. 1873, p. 334. Springer, Heidelberg (2000)CrossRefGoogle Scholar
  2. 2.
    Bairoch, A., et al.: The Universal Protein Resource (UniProt). Nucleic Acids Research 33, D154–D159 (2005) Database issue, http://www.uniprot.org/
  3. 3.
    Buneman, P., Khanna, S., Tajima, K., Tan, W.C.: Archiving Scientific Data. ACM Transactions on Database Systems 20, 1–39 (2004)Google Scholar
  4. 4.
    Buneman, P., Chapman, A.P., Cheney, J.: Provenance Management in Curated Databases. In: SIGMOD 2006 (2006)Google Scholar
  5. 5.
    Chawathe, S., Abiteboul, S., Widom, J.: Managing Historical Semistructured Data. Journal of Theory and Practice of Object Systems 24(4), 1–20 (1999)Google Scholar
  6. 6.
    Chawathe, S., Rajaraman, A., Garcia-Molina, H., Widom, J.: Change Detection in Hierarchically Structured Information. In: SIGMOD 1996 (1996)Google Scholar
  7. 7.
    Chien, S.-Y., Tsotras, V.J., Zaniolo, C., Zhang, D.: Storing and Querying Multiversion XML Documents using Durable Node Numbers. In: WISE 2001 (2001)Google Scholar
  8. 8.
    Chien, S.-Y., Tsotras, V.J., Zaniolo, C.: Efficient Management of Multiversion Documents by Object Referencing. In: VLDB 2001, pp. 291–300 (2001)Google Scholar
  9. 9.
    Dyreson, C.: Observing Transaction-Time Semantics with TTXPath. In: WISE 2001 (2001)Google Scholar
  10. 10.
    Gao, D., Snodgrass, R.T.: Temporal Slicing in the Evaluation of XML Queries. In: VLDB 2003 (2003)Google Scholar
  11. 11.
    Gergatsoulis, M., Stavrakas, Y.: Representing Changes in XML Documents using Dimensions. In: Bellahsène, Z., Chaudhri, A.B., Rahm, E., Rys, M., Unland, R. (eds.) XSym 2003. LNCS, vol. 2824, pp. 208–222. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  12. 12.
    Grandi, F.: Introducing an Annotated Bibliography on Temporal and Evolution Aspects in the World Wide Web. SIGMOD Record 33(2), 84–86 (2004)CrossRefGoogle Scholar
  13. 13.
    Harris, M.A., et al.: The Gene Ontology (GO) database and informatics. Nucleic Acids Research, Database issue D258–D261 32(1) (2004), http://www.geneontology.org/
  14. 14.
    Marian, A., Abiteboul, S., Cobena, G., Mignet, L.: Change-Centric Management of Versions in an XML Warehouse. In: VLDB 2001 (2001)Google Scholar
  15. 15.
    Moon, H.J., Curino, C., Deutsch, A., Hou, C.Y., Zaniolo, C.: Managing and querying transaction-time databases under schema evolution. In: VLDB 2008, pp. 882–895 (2008)Google Scholar
  16. 16.
    Papavassiliou, V., Flouris, G., Fundulaki, I., Kotzinos, D., Christophides, V.: On Detecting High-Level Changes in RDF/S KBs. In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 473–488. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  17. 17.
    Rizzolo, F., Vaisman, A.A.: Temporal XML: modeling, indexing, and query processing. VLDB J. 17(5), 1179–1212 (2008)CrossRefGoogle Scholar
  18. 18.
    Wang, F., Zaniolo, C.: Temporal Queries in XML Document Archives and Web Warehouses. In: TIME 2003, pp. 47–55 (2003)Google Scholar
  19. 19.
    Wang, Y., DeWitt, D.J., Cai, J.: X-Diff: An Effective Change Detection Algorithm for XML Documents. In: ICDE 2003 (2003)Google Scholar
  20. 20.
    W3C. XML Path Language (XPath) 2.0 (January 2007), http://www.w3.org/TR/xpath20/
  21. 21.
    W3C. The XML data model (August 2005), http://www.w3.org/XML/Datamodel.html

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Yannis Stavrakas
    • 1
  • George Papastefanatos
    • 1
  1. 1.Institute for the Management of Information SystemsAthensGreece

Personalised recommendations