Abstract
Schema matching plays an important role in various fields of enterprise system modeling and integration, such as in databases, business intelligence, knowledge management, interoperability, and others. The matching problem relates to finding the semantic correspondences between two or more schemas. The focus of the most of the research done in schema and ontology matching is pairwise matching, where 2 schemas are compared at the time. While few semi-automatic approaches have been recently proposed in pairwise matching to involve user, current multi-schema approaches mainly rely on the use of statistical information in order to avoid user interaction, which is largely limited to parameter tuning. In this study, we propose a user-guided iterative approach for large-scale multi-schema integration. Given n schemas, the goal is to match schema elements iteratively and demonstrate that the learning approach results in improved accuracy during iterations. The research is conducted in SAP Research Karlsruhe, followed by an evaluation using large e-business schemas. The evaluation results demonstrated an improvement in accuracy of matching proposals based on user’s involvement, as well as an easier accomplishment of a unified data model.
Chapter PDF
Similar content being viewed by others
References
SAP Netweaver. Adaptive Technology for the Networked Enterprise (August 15, 2011), http://www.sap.com/platform/netweaver/index.epx
Microsoft BizTalk Server, Microsoft BizTalk Server website (August 15, 2011), http://www.microsoft.com/biztalk/en/us/default.aspx
IBM InfoSphere, InfoSphere Platform (August 15, 2011), http://www-01.ibm.com/software/data/infosphere/
Madhavan, J., Bernstein, P.A., Rahm, E.: Generic Schema Matching with Cupid. In: Proceedings of the 27th International Conference on Very Large Data Bases (VLDB 2001), pp. 49–58. Morgan Kaufmann Publishers Inc., San Francisco (2001)
Rahm, E., Bernstein, P.A.: A Survey of Approaches to Automatic Schema Matching. VLDB Journal 10(4), 334–350 (2001)
Berlin, J., Motro, A.: Autoplex: Automated Discovery of Content for Virtual Databases. In: Batini, C., Giunchiglia, F., Giorgini, P., Mecella, M. (eds.) CoopIS 2001. LNCS, vol. 2172, pp. 108–122. Springer, Heidelberg (2001)
Doan, A.H., Domingos, P., Halevy, A.Y.: Reconciling Schemas of Disparate Data Sources: A Machine-Learning Approach. In: Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data (SIGMOD 2001), vol. 30(2), pp. 509–520. ACM, New York (2001)
Rahm, E., Do, H.H., Maßmann, S.: Matching large XML schemas. ACM SIGMOD Record 33(4) (2004)
Bernstein, P.A., et al.: Industrial-strength Schema Matching. ACM SIGMOD Record 33(4) (2004)
Euzenat, J., Shvaiko, P.: A survey of schema-based matching approaches. Technical report, Informatica e Telecomunicazioni, University of Trento (2007)
Rahm, E.: Towards Large-Scale Schema and Ontology Matching. In: Schema Matching and Mapping. Data-Centric Systems and Applications, pp. 3–28. Springer (2011)
Uno, T., et al.: LCM ver. 2: Efficient mining algorithms for frequent/closed/maximal itemsets. In: IEEE International Conference on Data Mining, Workshop on Frequent Itemset Mining Implementations (FIMI), Brighton, UK (2004)
Bellahsene, Z., Bonifati, A., Rahm, E.: Schema Matching and Mapping. Data-Centric Systems and Applications. Springer (2011)
Do, H.-H., Melnik, S., Rahm, E.: Comparison of Schema Matching Evaluations. In: Chaudhri, A.B., Jeckle, M., Rahm, E., Unland, R. (eds.) Web Database System and Web-Services 2002. LNCS, vol. 2593, pp. 221–237. Springer, Heidelberg (2003)
Shvaiko, P., Euzenat, J.: A Survey of Schema-Based Matching Approaches. In: Spaccapietra, S. (ed.) Journal on Data Semantics IV. LNCS, vol. 3730, pp. 146–171. Springer, Heidelberg (2005)
Noy, N.F.: Semantic Integration: A Survey of Ontology-Based Approaches. SIGMOD Record 33(4), 65–70 (2004)
Bernstein, P.A., Melnik, S., Churchill, J.E.: Incremental Schema Matching. In: Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB 2006), pp. 1167–1170 (2006)
Chen, D., et al.: A User Guided Iterative Alignment Approach for Ontology Mapping. In: Semantic Web Enabled Web Service (SWWS), pp. 51–56 (2008)
Falconer, S.M., Noy, N.F.: Interactive Techniques to Support Ontology Matching. In: Bellahsene, Z., Bonifati, A., Rahm, E. (eds.) Schema Matching and Mapping, pp. 29–52. Springer (2011)
Rech, J., et al.: Intelligent assistance for collaborative schema governance in the German agricultural eBusiness sector. In: Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services. ACM, New York (2010)
Zhdanova, A.V., Shvaiko, P.: Community-Driven Ontology Matching. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 34–49. Springer, Heidelberg (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 IFIP International Federation for Information Processing
About this paper
Cite this paper
Khan, M.W., Zdravkovic, J. (2012). A User-Guided Approach for Large-Scale Multi-schema Integration. In: Sandkuhl, K., Seigerroth, U., Stirna, J. (eds) The Practice of Enterprise Modeling. PoEM 2012. Lecture Notes in Business Information Processing, vol 134. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34549-4_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-34549-4_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34548-7
Online ISBN: 978-3-642-34549-4
eBook Packages: Computer ScienceComputer Science (R0)