What Should I Link to? Identifying Relevant Sources and Classes for Data Linking

Nikolov, Andriy; d’Aquin, Mathieu; Motta, Enrico

doi:10.1007/978-3-642-29923-0_19

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7185)

Included in the following conference series:

Joint International Semantic Technology Conference

Abstract

With more data repositories constantly being published on the Web, choosing appropriate data sources to interlink with newly published datasets becomes a non-trivial problem. It is necessary to choose both the repositories to link to and the relevant subsets of these repositories that contain potentially matching individuals. Doing this often requires detailed information about the content and structure of semantic repositories. However, retrieving and processing such information for a potentially large number of datasets is practically infeasible. In this paper, we propose an approach that uses an existing semantic web index to identify and rank potentially relevant datasets for interlinking. Furthermore, we adapt instance-based ontology schema matching to extract relevant subsets of the selected data sources and, in this way, pre-configure data linking tools.
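The two steps named in the abstract can be illustrated with a small sketch. The abstract does not give the ranking function or the matching measure used in the paper, so what follows is only an assumption-laden stand-in: candidate sources are ranked by how many local instances an index lookup associates with them, and classes are compared by Jaccard overlap of their instance sets, a common baseline in instance-based matching. The function names and all sample data are hypothetical.

```python
from collections import Counter


def rank_sources(hits_per_instance):
    """Rank candidate sources by the number of local instances they cover.

    hits_per_instance: one list of source names per local instance, as a
    (hypothetical) index lookup might return them.
    """
    counts = Counter()
    for sources in hits_per_instance:
        counts.update(set(sources))  # count each source once per instance
    return counts.most_common()


def jaccard(a, b):
    """Jaccard overlap of two collections of instance keys."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def best_class(local_instances, classes):
    """Pick the external class whose instances overlap most with ours.

    classes: mapping from class name to its instance keys (assumed input).
    """
    scores = {name: jaccard(local_instances, members)
              for name, members in classes.items()}
    return max(scores, key=scores.get), scores


# Hypothetical sample data, for illustration only.
hits = [["dbpedia", "geonames"], ["dbpedia"],
        ["dbpedia", "geonames", "musicbrainz"]]
ranking = rank_sources(hits)

local = {"paris", "berlin", "rome"}
external = {"City": {"paris", "berlin", "madrid"}, "Person": {"paris hilton"}}
chosen, scores = best_class(local, external)
```

In this toy setting the top-ranked source and the best-overlapping class would then be passed to a data linking tool as its target dataset and class restriction, which is the kind of pre-configuration the abstract refers to.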

Author information

Authors and Affiliations

Knowledge Media Institute, The Open University, Milton Keynes, UK
Andriy Nikolov, Mathieu d’Aquin & Enrico Motta

Editor information

Editors and Affiliations

Department of Computing Science, University of Aberdeen, AB24 3UE, Aberdeen, UK
Jeff Z. Pan
College of Computer Science, Zhejiang University, 310027, Hangzhou, China
Huajun Chen & Zhaohui Wu
BIKE Lab, Seoul National University, Yeongun-Dong, Jongro-Gu, 110-749, Seoul, Korea
Hong-Gee Kim
Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, China
Juanzi Li
Oracle Corporation, 500 Oracle Parkway, 94065, Redwood Shores, CA, USA
Zhe Wu
Department of Computer Science, University of Oxford, Wolfson Building, Parks Road, OX1 3QD, Oxford, UK
Ian Horrocks
Institute of Scientific and Industrial Research (ISIR), Osaka University, 8-1 Mihogaoka, 567-0047, Ibaraki, Osaka, Japan
Riichiro Mizoguchi

About this paper

Cite this paper

Nikolov, A., d’Aquin, M., Motta, E. (2012). What Should I Link to? Identifying Relevant Sources and Classes for Data Linking. In: Pan, J.Z., et al. The Semantic Web. JIST 2011. Lecture Notes in Computer Science, vol 7185. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29923-0_19

DOI: https://doi.org/10.1007/978-3-642-29923-0_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29922-3
Online ISBN: 978-3-642-29923-0
eBook Packages: Computer Science, Computer Science (R0)