What Should I Link to? Identifying Relevant Sources and Classes for Data Linking

  • Andriy Nikolov
  • Mathieu d’Aquin
  • Enrico Motta
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7185)


With more data repositories constantly being published on the Web, choosing appropriate data sources to interlink with newly published datasets becomes a non-trivial problem. It is necessary to choose both the repositories to link to and the relevant subsets of these repositories, which contain potentially matching individuals. In order to do this, detailed information about the content and structure of semantic repositories is often required. However, retrieving and processing such information for a potentially large number of datasets is practically unfeasible. In this paper, we propose an approach which utilises an existing semantic web index in order to identify potentially relevant datasets for interlinking and rank them. Furthermore, we adapt instance-based ontology schema matching to extract relevant subsets of selected data source and, in this way, pre-configure data linking tools.


Jaccard Index Relevant Source Search Service Relevant Classis Ontology Match 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Euzenat, J., Shvaiko, P.: Ontology matching. Springer, Heidelberg (2007)zbMATHGoogle Scholar
  2. 2.
    Fernandez, M., Zhang, Z., Lopez, V., Uren, V., Motta, E.: Ontology augmentation: combining semantic web and text resources. In: 6th International Conference on Knowledge Capture, K-CAP 2011 (2011)Google Scholar
  3. 3.
    Gracia, J., Mena, E.: Matching with CIDER: Evaluation report for the OAEI 2008. In: 3rd Ontology Matching Workshop (OM 2008) at the 7th International Semantic Web Conference (ISWC 2008), Karlsruhe, Germany (2008)Google Scholar
  4. 4.
    Halpin, H., Hayes, P.J., McCusker, J.P., McGuinness, D.L., Thompson, H.S.: When owl:sameAs isn’t the same: An analysis of identity in linked data. In: Patel-Schneider, P.F., Pan, Y., Hitzler, P., Mika, P., Zhang, L., Pan, J.Z., Horrocks, I., Glimm, B. (eds.) ISWC 2010, Part I. LNCS, vol. 6496, pp. 305–320. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  5. 5.
    Isaac, A., van der Meij, L., Schlobach, S., Wang, S.: An Empirical Study of Instance-Based Ontology Matching. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 253–266. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  6. 6.
    Li, J., Tang, J., Li, Y., Luo, Q.: RiMOM: A dynamic multistrategy ontology alignment framework. IEEE Transactions on Knowledge and Data Engineering 21(8), 1218–1232 (2009)CrossRefGoogle Scholar
  7. 7.
    Maali, F., Cyganiak, R., Peristeras, V.: Re-using cool URIs: Entity reconciliation against LOD hubs. In: Workshop on Linked Data on the Web (LDOW 2011), WWW 2011, Hyderabad, India (2011)Google Scholar
  8. 8.
    Nikolov, A., d’Aquin, M.: Identifying relevant sources for data linking using a semantic web index. In: Workshop on Linked Data on the Web (LDOW 2011), WWW 2011, Hyderabad, India (2011)Google Scholar
  9. 9.
    Nikolov, A., Motta, E.: Capturing emerging relations between schema ontologies on the web of data. In: Workshop on Consuming Linked Data (COLD 2010), ISWC 2010, Shanghai, China (2010)Google Scholar
  10. 10.
    Nikolov, A., Uren, V.S., Motta, E., De Roeck, A.: Integration of Semantically Annotated Data by the KnoFuss Architecture. In: Gangemi, A., Euzenat, J. (eds.) EKAW 2008. LNCS (LNAI), vol. 5268, pp. 265–274. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  11. 11.
    Nikolov, A., Uren, V., Motta, E., de Roeck, A.: Overcoming Schema Heterogeneity between Linked Semantic Repositories to Improve Coreference Resolution. In: Gómez-Pérez, A., Yu, Y., Ding, Y. (eds.) ASWC 2009. LNCS, vol. 5926, pp. 332–346. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  12. 12.
    Tummarello, G., Cyganiak, R., Catasta, M., Danielczyk, S., Delbru, R., Decker, S.: Live views on the Web of Data. Journal of Web Semantics 8(4), 355–364 (2010)CrossRefGoogle Scholar
  13. 13.
    Udrea, O., Getoor, L., Miller, R.J.: Leveraging data and structure in ontology integration. In: SIGMOD 2007, Beijing, China, pp. 449–460 (2007)Google Scholar
  14. 14.
    Volz, J., Bizer, C., Gaedke, M., Kobilarov, G.: Discovering and Maintaining Links on the Web of Data. In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 650–665. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  15. 15.
    Wang, S., Englebienne, G., Schlobach, S.: Learning Concept Mappings from Instance Similarity. In: Sheth, A.P., Staab, S., Dean, M., Paolucci, M., Maynard, D., Finin, T., Thirunarayan, K. (eds.) ISWC 2008. LNCS, vol. 5318, pp. 339–355. Springer, Heidelberg (2008)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Andriy Nikolov
    • 1
  • Mathieu d’Aquin
    • 1
  • Enrico Motta
    • 1
  1. 1.Knowledge Media InstituteThe Open UniversityMilton KeynesUK

Personalised recommendations