Distributed Human Computation Framework for Linked Data Co-reference Resolution

  • Yang Yang
  • Priyanka Singh
  • Jiadi Yao
  • Ching-man Au Yeung
  • Amir Zareian
  • Xiaowei Wang
  • Zhonglun Cai
  • Manuel Salvadores
  • Nicholas Gibbins
  • Wendy Hall
  • Nigel Shadbolt
Conference paper

DOI: 10.1007/978-3-642-21034-1_3

Part of the Lecture Notes in Computer Science book series (LNCS, volume 6643)
Cite this paper as:
Yang Y. et al. (2011) Distributed Human Computation Framework for Linked Data Co-reference Resolution. In: Antoniou G. et al. (eds) The Semantic Web: Research and Applications. ESWC 2011. Lecture Notes in Computer Science, vol 6643. Springer, Berlin, Heidelberg

Abstract

Distributed Human Computation (DHC) is used to solve computational problems by incorporating the collaborative effort of a large number of humans. It is also a solution to AI-complete problems such as natural language processing. The Semantic Web with its root in AI has many research problems that are considered as AI-complete. E.g. co-reference resolution, which involves determining whether different URIs refer to the same entity, is a significant hurdle to overcome in the realisation of large-scale Semantic Web applications. In this paper, we propose a framework for building a DHC system on top of the Linked Data Cloud to solve various computational problems. To demonstrate the concept, we are focusing on handling the co-reference resolution when integrating distributed datasets. Traditionally machine-learning algorithms are used as a solution for this but they are often computationally expensive, error-prone and do not scale. We designed a DHC system named iamResearcher, which solves the scientific publication author identity co-reference problem when integrating distributed bibliographic datasets. In our system, we aggregated 6 million bibliographic data from various publication repositories. Users can sign up to the system to audit and align their own publications, thus solving the co-reference problem in a distributed manner. The aggregated results are dereferenceable in the Open Linked Data Cloud.

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Yang Yang
    • 1
  • Priyanka Singh
    • 1
  • Jiadi Yao
    • 1
  • Ching-man Au Yeung
    • 2
  • Amir Zareian
    • 1
  • Xiaowei Wang
    • 1
  • Zhonglun Cai
    • 1
  • Manuel Salvadores
    • 1
  • Nicholas Gibbins
    • 1
  • Wendy Hall
    • 1
  • Nigel Shadbolt
    • 1
  1. 1.Intelligence, Agents, Multimedia (IAM) Group, School of Electronics and Computer ScienceUniversity of SouthamptonUK
  2. 2.NTT Communication Science LaboratoriesKyotoJapan

Personalised recommendations