Matching and Alignment: What Is the Cost of User Post-Match Effort?

Duchateau, Fabien; Bellahsene, Zohra; Coletta, Remi

doi:10.1007/978-3-642-25109-2_28

Matching and Alignment: What Is the Cost of User Post-Match Effort?

(Short Paper)

Fabien Duchateau²⁹,
Zohra Bellahsene³⁰ &
Remi Coletta³⁰

Conference paper

594 Accesses
5 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7044))

Abstract

Generating new knowledge from scientific databases, fusioning products information of business companies or computing an overlap between various data collections are a few examples of applications that require data integration. A crucial step during this integration process is the discovery of correspondences between the data sources, and the evaluation of their quality. For this purpose, the overall metric has been designed to compute the post-match effort, but it suffers from major drawbacks. Thus, we present in this paper two related metrics to compute this effort. The former is called post-match effort, i.e., the amount of work that the user must provide to correct the correspondences that have been discovered by the tool. The latter enables the measurement of human-spared resources, i.e., the rate of automation that has been gained by using a matching tool.

Supported by ANR DataRing ANR-08-VERSO-007-04. The first author carried out this work during an ERCIM “Alain Bensoussan” Fellowship Programme.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Smith, K., Morse, M., Mork, P., Li, M., Rosenthal, A., Allen, D., Seligman, L.: The role of schema matching in large enterprises. In: CIDR (2009)
Google Scholar
Rahm, E., Bernstein, P.A.: A survey of approaches to automatic schema matching. VLDB Journal 10(4), 334–350 (2001)
Article MATH Google Scholar
Yatskevich, M.: Preliminary evaluation of schema matching systems. Technical Report DIT-03-028, Informatica e Telecomunicazioni, University of Trento (2003)
Google Scholar
Shvaiko, P., Euzenat, J.: A survey of schema-based matching approaches. In: Spaccapietra, S. (ed.) Journal on Data Semantics IV. LNCS, vol. 3730, pp. 146–171. Springer, Heidelberg (2005)
Chapter Google Scholar
Noy, N.F., Doan, A., Halevy, A.Y.: Semantic integration. AI Magazine 26(1), 7–10 (2005)
Google Scholar
Euzenat, J., Shvaiko, P.: Ontology matching. Springer, Heidelberg (2007)
MATH Google Scholar
Bellahsene, Z., Bonifati, A., Rahm, E.: Schema Matching and Mapping. Springer, Heidelberg (2011)
Book MATH Google Scholar
Melnik, S., Garcia-Molina, H., Rahm, E.: Similarity flooding: A versatile graph matching algorithm and its application to schema matching. In: ICDE, pp. 117–128 (2002)
Google Scholar
Alexe, B., Tan, W.C., Velegrakis, Y.: STBenchmark: towards a benchmark for mapping systems. Proceedings of the VLDB 1(1), 230–244 (2008)
Article Google Scholar
Do, H.-H., Melnik, S., Rahm, E.: Comparison of Schema Matching Evaluations. In: Chaudhri, A.B., Jeckle, M., Rahm, E., Unland, R. (eds.) NODe-WS 2002. LNCS, vol. 2593, pp. 221–237. Springer, Heidelberg (2003)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Norwegian University of Science and Technology, NO-7491, Trondheim, Norway
Fabien Duchateau
LIRMM - Université Montpellier 2, 161 rue Ada, 34392, Montpellier, France
Zohra Bellahsene & Remi Coletta

Authors

Fabien Duchateau
View author publications
You can also search for this author in PubMed Google Scholar
Zohra Bellahsene
View author publications
You can also search for this author in PubMed Google Scholar
Remi Coletta
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

STAR Lab, Vrije Universiteit Brussel (VUB), Bldg G/10, Pleinlaan 2, 1050, Brussel, Belgium
Robert Meersman
DEBII, Curtin University of Technology, Technology Park, De Laeter Way, 6102, Bentley, WA, Australia
Tharam Dillon
Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo S/N, 28660, Boadilla del Monte, Madrid, Spain
Pilar Herrero
Smeal College of Business, Pennsylvania State University, University Park, PA 16802, U.S.A.
Akhil Kumar
Institute of Databases and Information Systems, Ulm University, Germany
Manfred Reichert
City University of Hong Kong, Hong Kong
Li Qing
National University of Singapore (NUS), Singapore
Beng-Chin Ooi
Dipartemento Tecnologie dell’Informazione, Universitá degli Studi di Milano, Via Bramante 65, 26013, Crema, Italy
Ernesto Damiani
VU Station B #1829, Vanderbilt University, 2015 Terrace Place, TN 37203, Nashville, USA
Douglas C. Schmidt
Virginia Tech, 24060, Blacksburg, VA, USA
Jules White
Digital Enterprise Research Institute (DERI), National University of Ireland, IDA Business Park, Lower Dangan, Galway, Ireland
Manfred Hauswirth
Kno.e.sis Center, Wright State University, Dayton, Ohio, USA
Pascal Hitzler
IBM India Research Lab, 4, Block C, Institutional Area, Vasant Kunj, 110 070, New Delhi, India
Mukesh Mohania

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Duchateau, F., Bellahsene, Z., Coletta, R. (2011). Matching and Alignment: What Is the Cost of User Post-Match Effort?. In: Meersman, R., et al. On the Move to Meaningful Internet Systems: OTM 2011. OTM 2011. Lecture Notes in Computer Science, vol 7044. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25109-2_28

Download citation

DOI: https://doi.org/10.1007/978-3-642-25109-2_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25108-5
Online ISBN: 978-3-642-25109-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics