Advertisement

A Model for Enriching Multilingual Wikipedias Using Infobox and Wikidata Property Alignment

  • Thang Hoang Ta
  • Chutiporn Anutariya
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8943)

Abstract

Wikipedia supports a large converged data with millions of contributions in more than 287 languages currently. Its content changes rapidly and continuously every hour with thousands of edits which trigger many challenges for Wikipedia in controlling, associating and balancing article content among language editions. This paper provides some processes to enrich Wikipedia content, which will retrieve semantic relations based on alignment between infobox properties and Wikidata properties in various languages. Then, the outcomes mainly contribute these semantic relations back to Wikidata and Wikipedias, especially ones are based on the Latin alphabet. The case study will offer a specific case about aligning biological infoboxes and detecting missing interwiki links of biological species at Vietnamese Wikipedia and English Wikipedia.

Keywords

Multilingual wikipedias Wikidata Semantic relations 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Mendes, P.N., Jakob, M., Bizer, C.: In: DBPedia: A Multilingual Cross-Domain Knowledge Base. LREC, pp. 1813–1817. European Language Resources Association ELRA (2012)Google Scholar
  2. 2.
    Cabrio, E., Cojan, J., Gandon, F., Hallili, A.: Querying multilingual DBpedia with QAKiS. In: Cimiano, P., Fernández, M., Lopez, V., Schlobach, S., Völker, J. (eds.) ESWC 2013. LNCS, vol. 7955, pp. 194–198. Springer, Heidelberg (2013)CrossRefGoogle Scholar
  3. 3.
    Morsey, M., Lehmann, J., Auer, S., Stadler, C., Hellman, S.: DBPedia and the Live Extraction of Structured Data from Wikipedia. Electronic Library and Information Systems 46(2), 157–181 (2012)CrossRefGoogle Scholar
  4. 4.
    Kim, E.-K., Weild, M., Choi, K.-S.: Metadata synchronization between bilingual resources: case study in wikipedia. In: International Workshop on the Multilingual Semantic Web (2010)Google Scholar
  5. 5.
    Vrandečić, D., Krötzsch, M.: Wikidata: A Free Collaborative Knowledge Base. Communications of the ACM (2014, to appear)Google Scholar
  6. 6.
    Nastase, V., Strube, M.: Decoding Wikipedia Categories for Knowledge Acquisition. AAAI (2008)Google Scholar
  7. 7.
    Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P.N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S., Bizer, C.: DBpedia - A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia. IOS Press (2012)Google Scholar
  8. 8.
    Nguyen, T., Moreira, V., Nguyen, H., Nguyen, H., Freire, J.: Multilingual schema matching for Wikipedia infoboxes. Proceedings of the VLDB Endowment 5(2), 133–144 (2011)CrossRefGoogle Scholar
  9. 9.
    Palmero Aprosio, A., Giuliano, C., Lavelli, A.: Towards an automatic creation of localized versions of DBpedia. In: Alani, H., Kagal, L., Fokoue, A., Groth, P., Biemann, C., Parreira, J.X., Aroyo, L., Noy, N., Welty, C., Janowicz, K. (eds.) ISWC 2013, Part I. LNCS, vol. 8218, pp. 494–509. Springer, Heidelberg (2013)CrossRefGoogle Scholar
  10. 10.
    Ponzetto, S.P., Navigli, R.: Large scale taxonomy mapping for restructuring and integrating wikipedia. In: IJCAT 2009 (2009)Google Scholar
  11. 11.
    Chernov, S., Iofciu, T., Nejdl, W., Zhou, X.: Extracting semantic relationships between wikipedia categories. In: ESWC, June 2006Google Scholar
  12. 12.
    Adar, E., Skinner, M., Weld, D.S.: Information arbitrage across multi-lingual wikipedia. In: WSDM 2009 Proceedings of the Second ACM International Conference on Web Search and Data Mining, pp. 94–103 (2009)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. 1.Shinawatra UniversityBangkokThailand
  2. 2.Asian UniversityChonburiThailand

Personalised recommendations