Utilization of DBpedia Mapping in Cross Lingual Wikipedia Infobox Completion

Conference paper

DOI: 10.1007/978-3-319-50127-7_25

Part of the Lecture Notes in Computer Science book series (LNCS, volume 9992)
Cite this paper as:
Megawati, Jang S., Yi M.Y. (2016) Utilization of DBpedia Mapping in Cross Lingual Wikipedia Infobox Completion. In: Kang B., Bai Q. (eds) AI 2016: Advances in Artificial Intelligence. AI 2016. Lecture Notes in Computer Science, vol 9992. Springer, Cham

Abstract

Wikipedia plays a central role on the web as one of the largest knowledge sources, owing to its broad coverage of information from many domains. However, because the number of pages is enormous and the number of contributors available to maintain them is limited, information is often missing from Wikipedia articles, especially across the different language versions of the same article. Several approaches have been studied to close the information gap between cross-language Wikipedia articles, but they can only be applied to languages that share the same root. In this paper, we propose an approach that generates new information for Wikipedia infoboxes written in languages with different roots by utilizing the existing DBpedia mappings. We combine mapping information from DBpedia with an instance-based method to align the existing Korean-English infobox attribute-value pairs and to generate new pairs from the Korean version that fill in missing information in the English version. The results show that we could expand up to 38% of the existing English Wikipedia attribute-value pairs in our datasets, with 61% accuracy.
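The mapping-based part of this idea can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `complete_infobox`, the dictionary layout, and the toy data are all assumptions made for illustration, and the instance-based alignment and value translation described in the abstract are omitted. The sketch assumes that each language edition has a table mapping infobox attribute names to a shared DBpedia ontology property, and it proposes English pairs for properties that only the Korean infobox covers.

```python
def complete_infobox(ko_infobox, en_infobox, ko_mapping, en_mapping):
    """Propose English attribute-value pairs found only in the Korean infobox.

    All arguments are plain dicts (a hypothetical layout, not a DBpedia API):
      ko_infobox / en_infobox: attribute name -> value
      ko_mapping / en_mapping: attribute name -> ontology property
    """
    # Ontology properties the English infobox already covers.
    covered = {en_mapping[attr] for attr in en_infobox if attr in en_mapping}

    # Reverse lookup: ontology property -> first English attribute name seen.
    prop_to_en = {}
    for attr, prop in en_mapping.items():
        prop_to_en.setdefault(prop, attr)

    proposals = {}
    for ko_attr, value in ko_infobox.items():
        prop = ko_mapping.get(ko_attr)
        # Skip unmapped attributes, covered properties, and properties
        # that have no English attribute name to map back to.
        if prop is None or prop in covered or prop not in prop_to_en:
            continue
        proposals[prop_to_en[prop]] = value  # value is copied untranslated
    return proposals


# Toy data, invented for illustration only.
ko_mapping = {"출생일": "dbo:birthDate", "출생지": "dbo:birthPlace"}
en_mapping = {"birth_date": "dbo:birthDate", "birth_place": "dbo:birthPlace"}
ko_infobox = {"출생일": "1990-01-01", "출생지": "Seoul"}
en_infobox = {"birth_date": "1990-01-01"}

print(complete_infobox(ko_infobox, en_infobox, ko_mapping, en_mapping))
```

With this toy input, only the birth place is proposed, because the birth date is already present on the English side. A real pipeline would still need to translate or normalize the copied values, which the sketch does not attempt.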

Keywords

Infobox alignment; Infobox completion; DBpedia; Cross language Wikipedia

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  1. Department of Industrial and Systems Engineering, Graduate School of Knowledge Service Engineering, KAIST, Daejeon, South Korea
