The Fundamentals of iSPARQL: A Virtual Triple Approach for Similarity-Based Semantic Web Tasks

  • Christoph Kiefer
  • Abraham Bernstein
  • Markus Stocker
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4825)

Abstract

This research explores three SPARQL-based techniques to solve Semantic Web tasks that often require similarity measures, such as semantic data integration, ontology mapping, and Semantic Web service matchmaking. Our aim is to see how far it is possible to integrate customized similarity functions (CSF) into SPARQL to achieve good results for these tasks. Our first approach exploits virtual triples calling property functions to establish virtual relations among resources under comparison; the second approach uses extension functions to filter out resources that do not meet the requested similarity criteria; finally, our third technique applies new solution modifiers to post-process a SPARQL solution sequence. The semantics of the three approaches are formally elaborated and discussed. We close the paper with a demonstration of the usefulness of our iSPARQL framework in the context of a data integration and an ontology mapping experiment.

References

  1. 1.
    Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval (1999)Google Scholar
  2. 2.
    Batagelj, V., Bren, M.: Comparing Resemblance Measures. J. of Classification 12(1), 73–90 (1995)MATHCrossRefMathSciNetGoogle Scholar
  3. 3.
    Bernstein, A., Kaufmann, E., Bürki, C., Klein, M.: How Similar Is It? Towards Personalized Similarity Measures in Ontologies. In: 7. Int. Tagung WI, pp. 1347–1366 (2005)Google Scholar
  4. 4.
    Bernstein, A., Kiefer, C.: Imprecise RDQL: Towards Generic Retrieval in Ontologies Using Similarity Joins. In: Proc. of the ACM Symp. on Applied Computing, pp. 1684–1689. ACM Press, New York (2006)Google Scholar
  5. 5.
    Bernstein, A., Kiefer, C., Stocker, M.: OptARQ: A SPARQL Optimization Approach based on Triple Pattern Selectivity Estimation. Technical Report IFI-2007.02, Department of Informatics, University of Zurich (2007)Google Scholar
  6. 6.
    Cohen, W.W.: Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity. In: Proc. of the ACM SIGMOD Int. Conf. on Management of Data, pp. 201–212. ACM Press, New York (1998)Google Scholar
  7. 7.
    Cohen, W.W., Ravikumar, P., Fienberg, S.: A Comparison of String Distance Metrics for Name-Matching Tasks. In: Proc. of the Ws. on Information Integration on the Web, Acapulco, Mexico, pp. 73–78 (2003)Google Scholar
  8. 8.
    Cyganiak, R.: A relational algebra for SPARQL. Technical Report HPL-2005-170, HP Labs (2005)Google Scholar
  9. 9.
    Ehrig, M., Haase, P., Stojanovic, N., Hefke, M.: Similarity for Ontologies - A Comprehensive Framework. In: Proc. of the 13th Europ. Conf. on Information Systems (2005)Google Scholar
  10. 10.
    Euzenat, J., Loup, D., Touzani, M., Valtchev, P.: Ontology Alignment with OLA. In: Proc. of the 3rd Int. Ws. on Evaluation of Ontology-based Tools, pp. 59–68 (2004)Google Scholar
  11. 11.
    Gravano, L., Ipeirotis, P.G., Koudas, N., Srivastava, D.: Text Joins in an RDBMS for Web Data Integration. In: Proc. of the 12th Int. World Wide Web Conf., pp. 90–101 (2003)Google Scholar
  12. 12.
    Kiefer, C., Bernstein, A., Lee, H.J., Klein, M., Stocker, M.: Semantic Process Retrieval with iSPARQL. In: Proc. of the 4th Europ. Semantic Web Conf., pp. 609–623 (2007)Google Scholar
  13. 13.
    Kiefer, C., Bernstein, A., Tappolet, J.: Analyzing Software with iSPARQL. In: Proc. of the 3rd Int. Ws. on Semantic Web Enabled Software Engineering (2007)Google Scholar
  14. 14.
    Klusch, M., Fries, B., Sycara, K.: Automated Semantic Web Service Discovery with OWLS-MX. In: Proc. of the 5th Int. Joint Conf. on Autonomous Agents and Multiagent Systems, pp. 915–922 (2006)Google Scholar
  15. 15.
    Lam, H.Y.K., Marenco, L., Clark, T., Gao, Y., Kinoshita, J., Shepherd, G., Miller, P., Wu, E., Wong, G., Liu, N., Crasto, C., Morse, T., Stephens, S., Cheung, K.-H.: AlzPharm: integration of neurodegeneration data using RDF. BMC Bioinformatics 8(3) (2007)Google Scholar
  16. 16.
    Levenshtein, V.I.: Binary Codes Capable of Correcting Deletions, Insertions and Reversals. Soviet Physics Doklady 10, 707–710 (1966)MathSciNetGoogle Scholar
  17. 17.
    Meštrović, A., Ćubrillo, M.: Semantic Web Data Integration Using F-Logic. In: Proc. of the 10th Int. Conf. on Intelligent Engineering Systems (2006)Google Scholar
  18. 18.
    Noy, N.F.: What do we need for ontology integration on the Semantic Web, Position statement. In: Proc. of the 1st Semantic Integration Ws., pp. 175–176 (2003)Google Scholar
  19. 19.
    Noy, N.F., Musen, M.A.: The PROMPT Suite: Interactive Tools For Ontology Merging And Mapping. Int. J. of Human-Computer Studies 59(6), 983–1024 (2003)CrossRefGoogle Scholar
  20. 20.
    Orozco, J., Belanche, L.: On Aggregation Operators of Transitive Similarity and Dissimilarity Relations. In: Proc. of the IEEE Int. Conf. on Fuzzy Systems, pp. 1373–1377. IEEE Computer Society Press, Los Alamitos (2004)Google Scholar
  21. 21.
    Pérez, J., Arenas, M., Gutierrez, C.: Semantics and Complexity of SPARQL. In: Proc. of the 5th Int. Semantic Web Conf., pp. 30–43 (2006)Google Scholar
  22. 22.
    Prud’hommeaux, E., Seaborne, A.: SPARQL Query Language for RDF. Technical report. W3C Candidate Recommendation 14 June (2007)Google Scholar
  23. 23.
    Rorvig, M.: Images of Similarity: A Visual Exploration of Optimal Similarity Metrics and Scaling Properties of TREC Topic-Document Sets. J. of the Am. Soc. for Inf. Sci. 50(8), 639–651 (1999)CrossRefGoogle Scholar
  24. 24.
    Shasha, D., Zhang, K.: Approximate Tree Pattern Matching. In: Pattern Matching in Strings, Trees, and Arrays, pp. 341–371 (1997)Google Scholar
  25. 25.
    Siberski, W., Pan, J.Z., Thaden, U.: Querying the Semantic Web with Preferences. In: Proc. of the 5th Int. Semantic Web Conf. (2006)Google Scholar
  26. 26.
    Tversky, A.: Features of Similarity. Psychological Review 84(2), 327–353 (1977)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Christoph Kiefer
    • 1
  • Abraham Bernstein
    • 1
  • Markus Stocker
    • 1
  1. 1.Department of Informatics, University of ZurichSwitzerland

Personalised recommendations