Abstract
This paper presents a detailed integration process for XML schemata called BInXS. BInXS adopts a global-as-view integration approach that builds a global schema from a set of heterogeneous XML schemata related to the same application domain. This bottom-up approach maps all element and attribute definitions in the XML schemata to corresponding concepts in the global schema, allowing access to all data available at the XML sources. The integration process is performed semi-automatically over conceptual representations of the XML schemata, which provide a better understanding of the semantics of the XML data to be unified. A conceptual schema is generated by a set of conversion rules applied to a schema definition for XML data. Because this conceptual schema results from a meticulous analysis of the XML logical model, it abstracts the particularities of semistructured and XML data, such as elements with mixed content and elements with alternative representations. Therefore, the subsequent unification of such conceptual schemata implicitly deals with the structural conflicts inherent to semistructured and XML data. In addition, BInXS supports a mapping strategy based on XPath expressions to maintain correspondences between global concepts and data at the XML sources.
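The XPath-based mapping idea described above can be illustrated with a minimal sketch. It is not the BInXS implementation: the two sample sources, the concept names (`Publication.title`, `Publication.author`), the `MAPPINGS` table layout, and the `resolve` helper are all hypothetical, and the paths use only the limited XPath subset supported by Python's standard `xml.etree.ElementTree`.

```python
import xml.etree.ElementTree as ET

# Two hypothetical XML sources from the same domain with different structures.
SOURCE_A = (
    "<library><book><title>XML Basics</title>"
    "<author>Silva</author></book></library>"
)
SOURCE_B = (
    "<catalog><item name='Schema Design'>"
    "<writer><last>Souza</last></writer></item></catalog>"
)

SOURCES = {"A": ET.fromstring(SOURCE_A), "B": ET.fromstring(SOURCE_B)}

# Hypothetical mapping table: each global concept points to one
# (path, attribute) pair per source. The attribute is None when the
# value is the element's text content.
MAPPINGS = {
    "Publication.title": {
        "A": ("./book/title", None),
        "B": ("./item", "name"),
    },
    "Publication.author": {
        "A": ("./book/author", None),
        "B": ("./item/writer/last", None),
    },
}


def resolve(concept):
    """Collect the values of a global concept from every mapped source."""
    values = []
    for source_id, (path, attr) in MAPPINGS[concept].items():
        for node in SOURCES[source_id].findall(path):
            if attr is not None:
                values.append(node.get(attr))
            else:
                values.append((node.text or "").strip())
    return values


if __name__ == "__main__":
    print(resolve("Publication.title"))
    print(resolve("Publication.author"))
```

In this sketch, a query against the global concept `Publication.title` reads an element in one source and an attribute in the other, which is the kind of structural difference a global-as-view mapping is meant to hide from the user.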
This work was partially supported by CAPES Foundation.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
dos Santos Mello, R., Heuser, C.A. (2005). BInXS: A Process for Integration of XML Schemata. In: Pastor, O., Falcão e Cunha, J. (eds) Advanced Information Systems Engineering. CAiSE 2005. Lecture Notes in Computer Science, vol 3520. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11431855_12
DOI: https://doi.org/10.1007/11431855_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26095-0
Online ISBN: 978-3-540-32127-9
eBook Packages: Computer Science (R0)