Ontology and Instance Matching

  • Silvana Castano
  • Alfio Ferrara
  • Stefano Montanelli
  • Gaia Varese
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6050)

Abstract

The growing need to share data and digital resources within and across organizations has drawn new attention to issues of ontology and instance matching. After an introductory classification of the main techniques and tools for ontology matching, the chapter focuses on instance matching: it provides an accurate classification of the matching techniques proposed in the literature and compares recent instance matching tools according to the results they achieved in the OAEI 2009 contest. Finally, it presents the ontology and instance matching solutions developed in the BOEMIE project for multimedia resource management and ontology evolution.

Keywords

Record Linkage, Match Technique, String Match, Ontology Match, Record Pair
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Silvana Castano (1)
  • Alfio Ferrara (1)
  • Stefano Montanelli (1)
  • Gaia Varese (1)

  1. Università degli Studi di Milano, DICo, Milano, Italy