XML Restructuring and Integration for Tabular Data

  • Wei Yu
  • Z. Meral Ozsoyoglu
  • Gultekin Ozsoyoglu
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2736)


We study the data integration and restructuring issues of tabular data. We consider the case where the same set of data is collected from independent sites, stored in different DBMSs or other repositories, organized in different tabular or equivalent semi-structured formats, and published on the web. These sites transform tabular data into XML data with possible syntactic discrepancies in their original tabular structures. Data integration refers to the task of creating an integrated XML view with a pre-specified format by restructuring and integrating different XML documents. Existing XML query algebras are not sufficient for tabular conversions. We propose the XML-T model and the restructuring and integration operators for tabular data. We show with examples the uses of our operators to create “views".


Integrate View Repetition Structure Tabular Data Column Element Document Type Definition 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Christophides, V., et al.: On Wrapping Query Languages and Efficient XML Integration. ACM SIGMOD 2000 (2000)Google Scholar
  2. 2.
    Fankhauser, P. (ed.): XQuery 1.0 Formal Semantics. W3C XML Query Working Gr. Note (2002) Google Scholar
  3. 3.
    Naughton, J. (ed.): The Niagara Internet Query System. IEEE Data Eng. Bulletin 24(2) Google Scholar
  4. 4.
    Gyssens, M. et al: Tables As a Paradigm for Querying and Restructuring. ACM PODS 1996 (1996) Google Scholar
  5. 5.
    Jagadish, H., Lakshmanan, L., Srivastava, D., Thompson, K.: TAX: a Tree Algebra for XML. In: Proc. DBPL Conf., Rome, Italy (2001)Google Scholar
  6. 6.
    Krishnamurthy, L., Nadeau, J., Ozsoyoglu, G., Ozsoyoglu, M. (ed.): Pathways database system: an integrated system for biological pathways. Bioinformatics (2003)Google Scholar
  7. 7.
    Lin, J., Ozsoyoglu, M.: Processing OODB Queries by O-algebra. In: CIKM 1996 (1996)Google Scholar
  8. 8.
    Madhavan, J., Bernstein, P., Rahm, E.: Generic Schema Matching with Cupid. In: Proc. Of VLDB Conf., Rome, Italy (2001)Google Scholar
  9. 9.
    Ross, M., Korth, H., Silberschatz, A.: Extended Algebra and Calculus for –1NF Relational Databases. ACM TODS 13(4) (1988)Google Scholar
  10. 10.
    Shaw, G., Zdonik, S.: A Query Algebra for Object Oriented Databases. In: IEEE ICDE Conf. (1990)Google Scholar
  11. 11.
    Yu, W., Ozsoyoglu, M., Ozsoyoglu, G.: XML-T Algebra Operators and Implementation Issues. Tech. Report Case Western Reserve University, Cleveland, U.S.A (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Wei Yu
    • 1
  • Z. Meral Ozsoyoglu
    • 1
  • Gultekin Ozsoyoglu
    • 1
  1. 1.Electrical Engineering and Computer Science DepartmentCase Western Reserve UniversityClevelandU.S.A.

Personalised recommendations