Abstract
This research explores three SPARQL-based techniques to solve Semantic Web tasks that often require similarity measures, such as semantic data integration, ontology mapping, and Semantic Web service matchmaking. Our aim is to see how far it is possible to integrate customized similarity functions (CSF) into SPARQL to achieve good results for these tasks. Our first approach exploits virtual triples calling property functions to establish virtual relations among resources under comparison; the second approach uses extension functions to filter out resources that do not meet the requested similarity criteria; finally, our third technique applies new solution modifiers to post-process a SPARQL solution sequence. The semantics of the three approaches are formally elaborated and discussed. We close the paper with a demonstration of the usefulness of our iSPARQL framework in the context of a data integration and an ontology mapping experiment.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval (1999)
Batagelj, V., Bren, M.: Comparing Resemblance Measures. J. of Classification 12(1), 73–90 (1995)
Bernstein, A., Kaufmann, E., Bürki, C., Klein, M.: How Similar Is It? Towards Personalized Similarity Measures in Ontologies. In: 7. Int. Tagung WI, pp. 1347–1366 (2005)
Bernstein, A., Kiefer, C.: Imprecise RDQL: Towards Generic Retrieval in Ontologies Using Similarity Joins. In: Proc. of the ACM Symp. on Applied Computing, pp. 1684–1689. ACM Press, New York (2006)
Bernstein, A., Kiefer, C., Stocker, M.: OptARQ: A SPARQL Optimization Approach based on Triple Pattern Selectivity Estimation. Technical Report IFI-2007.02, Department of Informatics, University of Zurich (2007)
Cohen, W.W.: Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity. In: Proc. of the ACM SIGMOD Int. Conf. on Management of Data, pp. 201–212. ACM Press, New York (1998)
Cohen, W.W., Ravikumar, P., Fienberg, S.: A Comparison of String Distance Metrics for Name-Matching Tasks. In: Proc. of the Ws. on Information Integration on the Web, Acapulco, Mexico, pp. 73–78 (2003)
Cyganiak, R.: A relational algebra for SPARQL. Technical Report HPL-2005-170, HP Labs (2005)
Ehrig, M., Haase, P., Stojanovic, N., Hefke, M.: Similarity for Ontologies - A Comprehensive Framework. In: Proc. of the 13th Europ. Conf. on Information Systems (2005)
Euzenat, J., Loup, D., Touzani, M., Valtchev, P.: Ontology Alignment with OLA. In: Proc. of the 3rd Int. Ws. on Evaluation of Ontology-based Tools, pp. 59–68 (2004)
Gravano, L., Ipeirotis, P.G., Koudas, N., Srivastava, D.: Text Joins in an RDBMS for Web Data Integration. In: Proc. of the 12th Int. World Wide Web Conf., pp. 90–101 (2003)
Kiefer, C., Bernstein, A., Lee, H.J., Klein, M., Stocker, M.: Semantic Process Retrieval with iSPARQL. In: Proc. of the 4th Europ. Semantic Web Conf., pp. 609–623 (2007)
Kiefer, C., Bernstein, A., Tappolet, J.: Analyzing Software with iSPARQL. In: Proc. of the 3rd Int. Ws. on Semantic Web Enabled Software Engineering (2007)
Klusch, M., Fries, B., Sycara, K.: Automated Semantic Web Service Discovery with OWLS-MX. In: Proc. of the 5th Int. Joint Conf. on Autonomous Agents and Multiagent Systems, pp. 915–922 (2006)
Lam, H.Y.K., Marenco, L., Clark, T., Gao, Y., Kinoshita, J., Shepherd, G., Miller, P., Wu, E., Wong, G., Liu, N., Crasto, C., Morse, T., Stephens, S., Cheung, K.-H.: AlzPharm: integration of neurodegeneration data using RDF. BMC Bioinformatics 8(3) (2007)
Levenshtein, V.I.: Binary Codes Capable of Correcting Deletions, Insertions and Reversals. Soviet Physics Doklady 10, 707–710 (1966)
Meštrović, A., Ćubrillo, M.: Semantic Web Data Integration Using F-Logic. In: Proc. of the 10th Int. Conf. on Intelligent Engineering Systems (2006)
Noy, N.F.: What do we need for ontology integration on the Semantic Web, Position statement. In: Proc. of the 1st Semantic Integration Ws., pp. 175–176 (2003)
Noy, N.F., Musen, M.A.: The PROMPT Suite: Interactive Tools For Ontology Merging And Mapping. Int. J. of Human-Computer Studies 59(6), 983–1024 (2003)
Orozco, J., Belanche, L.: On Aggregation Operators of Transitive Similarity and Dissimilarity Relations. In: Proc. of the IEEE Int. Conf. on Fuzzy Systems, pp. 1373–1377. IEEE Computer Society Press, Los Alamitos (2004)
Pérez, J., Arenas, M., Gutierrez, C.: Semantics and Complexity of SPARQL. In: Proc. of the 5th Int. Semantic Web Conf., pp. 30–43 (2006)
Prud’hommeaux, E., Seaborne, A.: SPARQL Query Language for RDF. Technical report. W3C Candidate Recommendation 14 June (2007)
Rorvig, M.: Images of Similarity: A Visual Exploration of Optimal Similarity Metrics and Scaling Properties of TREC Topic-Document Sets. J. of the Am. Soc. for Inf. Sci. 50(8), 639–651 (1999)
Shasha, D., Zhang, K.: Approximate Tree Pattern Matching. In: Pattern Matching in Strings, Trees, and Arrays, pp. 341–371 (1997)
Siberski, W., Pan, J.Z., Thaden, U.: Querying the Semantic Web with Preferences. In: Proc. of the 5th Int. Semantic Web Conf. (2006)
Tversky, A.: Features of Similarity. Psychological Review 84(2), 327–353 (1977)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kiefer, C., Bernstein, A., Stocker, M. (2007). The Fundamentals of iSPARQL: A Virtual Triple Approach for Similarity-Based Semantic Web Tasks. In: Aberer, K., et al. The Semantic Web. ISWC ASWC 2007 2007. Lecture Notes in Computer Science, vol 4825. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76298-0_22
Download citation
DOI: https://doi.org/10.1007/978-3-540-76298-0_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76297-3
Online ISBN: 978-3-540-76298-0
eBook Packages: Computer ScienceComputer Science (R0)