Skip to main content

Open Knowledge Extraction Challenge

  • Conference paper
  • First Online:
Semantic Web Evaluation Challenges (SemWebEval 2015)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 548))

Included in the following conference series:

Abstract

The Open Knowledge Extraction (OKE) challenge is aimed at promoting research in the automatic extraction of structured content from textual data and its representation and publication as Linked Data. We designed two extraction tasks: (1) Entity Recognition, Linking and Typing and (2) Class Induction and entity typing. The challenge saw the participations of four systems: CETUS-FOX and FRED participating to both tasks, Adel participating to Task 1 and OAK@Sheffield participating to Task 2. In this paper we describe the OKE challenge, the tasks, the datasets used for training and evaluating the systems, the evaluation method, and obtained results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.itl.nist.gov/iad/mig/publications/proceedings/darpa99/html/ie5/ie5.htm.

  2. 2.

    http://www.itl.nist.gov/iaui/894.02/related_projects/muc/proceedings/muc_7_proceedings/overview.html.

  3. 3.

    https://www.ldc.upenn.edu/collaborations/past-projects/ace/annotation-tasks-and-specifications.

  4. 4.

    http://www.nist.gov/tac/tracks/index.html.

  5. 5.

    http://www.nist.gov/tac/2015/KBP.

  6. 6.

    http://trec-kba.org/.

  7. 7.

    http://stlab.istc.cnr.it/stlab/WikipediaOntology/.

  8. 8.

    The prefix dul: stands for the namespace http://www.ontologydesignpatterns.org/ont/dul/DUL.owl.

  9. 9.

    http://persistence.uni-leipzig.org/nlp2rdf/.

  10. 10.

    The prefixes nif:, itsrdf:, dul:, and dbpedia: identify the namespaces http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core, http://www.w3.org/2005/11/its/rdf, http://www.ontologydesignpatterns.org/ont/dul/DUL.owl, and http://dbpedia.org/resource/ respectively.

  11. 11.

    Prefixes d0: and dul: stand for namespaces http://ontologydesignpatterns.org/ont/wikipedia/d0.owl and http://www.ontologydesignpatterns.org/ont/dul/DUL.owl respectively.

  12. 12.

    The training dataset is available at https://github.com/anuzzolese/oke-challenge/blob/master/GoldStandard_sampleData/task1/dataset_task_1.ttl. Similarly, the evaluation dataset is available at https://github.com/anuzzolese/oke-challenge/blob/master/evaluation-data/task1/evaluation-dataset-task1.ttl.

  13. 13.

    The training dataset is available at https://github.com/anuzzolese/oke-challenge/blob/master/GoldStandard_sampleData/task2/dataset_task_2.ttl. Similarly, the evaluation dataset is available at https://github.com/anuzzolese/oke-challenge/blob/master/evaluation-data/task2/evaluation-dataset-task2.ttl.

  14. 14.

    https://github.com/anuzzolese/oke-challenge.

References

  1. Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Semantic Web Inf. Syst. 5(3), 1–22 (2009)

    Article  Google Scholar 

  2. Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: DBpedia - a crystallization point for the web of data. J. Web Sem. 7(3), 154–165 (2009)

    Article  Google Scholar 

  3. Consoli, S., Reforgiato, D.: Using fred for named entity resolution, linking and typing for knowledge base population. In: Gandon, F., Sabou, M., Sack, H., Cabrio, E., Stankovic, M., Zimmermann, A. (eds.) ESWC 2015 Challenges, CCIS, pp. 40–50. Springer International Publishing, Switzerland (2015)

    Google Scholar 

  4. Doddington, G.R., Mitchell, A., Przybocki, M.A., Ramshaw, L.A., Strassel, S., Weischedel, R.M.: The automatic content extraction (ace) program-tasks, data, and evaluation. In: LREC (2004)

    Google Scholar 

  5. Gangemi, A., Guarino, N., Masolo, C., Oltramari, A., Schneider, L.: Sweetening ontologies with DOLCE. In: GĂ³mez-PĂ©rez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, pp. 166–181. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  6. Gao, J., Mazumdar, S.: Exploiting linked open data to uncover entity types. In: Gandon, F., Sabou, M., Sack, H., Cabrio, E., Stankovic, M., Zimmermann, A. (eds.) ESWC 2015 Challenges, CCIS, pp. 51–62. Springer International Publishing, Switzerland (2015)

    Google Scholar 

  7. Grishman, R., Sundheim, B.: Message understanding conference-6: a brief history. In: Proceedings of the 16th Conference on Computational Linguistics, COLING 1996, vol. 1, pp. 466–471. Association for Computational Linguistics, Stroudsburg, PA, USA (1996)

    Google Scholar 

  8. Hellmann, S., Lehmann, J., Auer, S., BrĂ¼mmer, M.: Integrating NLP using linked data. In: Alani, H., Kagal, L., Fokoue, A., Groth, P., Biemann, C., Parreira, J.X., Aroyo, L., Noy, N., Welty, C., Janowicz, K. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 98–113. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  9. Petasis, G., Karkaletsis, V., Paliouras, G., Krithara, A., Zavitsanos, E.: Ontology population and enrichment: state of the art. In: Paliouras, G., Spyropoulos, C.D., Tsatsaronis, G. (eds.) Multimedia Information Extraction. LNCS, vol. 6050, pp. 134–166. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  10. Plu, J., Rizzo, G., Troncy, R.: A hybrid approach for entity recognition and linking. In: Gandon, F., Sabou, M., Sack, H., Cabrio, E., Stankovic, M., Zimmermann, A. (eds.) ESWC 2015 Challenges, CCIS, pp. 28–39. Springer International Publishing, Switzerland (2015)

    Google Scholar 

  11. Röder, M., Usbeck, R., Ngonga Ngomo, A.-C.: Cetus - a baseline approach to type extraction. In: Gandon, F., Sabou, M., Sack, H., Cabrio, E., Stankovic, M., Zimmermann, A. (eds.) ESWC 2015 Challenges, CCIS, pp. 16–27. Springer International Publishing, Switzerland (2015)

    Google Scholar 

  12. Tjong Kim Sang, E.F.: Introduction to the conll-2002 shared task: language-independent named entity recognition. In: Proceedings of the 6th Conference on Natural Language Learning, COLING-02, vol. 20, pp. 1–4. Association for Computational Linguistics, Stroudsburg, PA, USA (2002)

    Google Scholar 

  13. Iordache, O.: Introduction. In: Iordache, O. (ed.) Polystochastic Models for Complexity. UCS, vol. 4, pp. 1–16. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  14. Usbeck, R., Röder, M., Ngomo, A.N., Baron, C., Both, A., BrĂ¼mmer, M., Ceccarelli, D., Cornolti, M., Cherix, D., Eickmann, B., Ferragina, P., Lemke, C., Moro, A., Navigli, R., Piccinno, F., Rizzo, G., Sack, H., Speck, R., Troncy, R., Waitelonis, J., Wesemann, L.: GERBIL: general entity annotator benchmarking framework. In Gangemi, A., Leonardi, S., Panconesi, A. (eds.) Proceedings of the 24th International Conference on World Wide Web, WWW 2015, pp. 1133–1143. ACM (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Andrea Giovanni Nuzzolese .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Nuzzolese, A.G., Gentile, A.L., Presutti, V., Gangemi, A., Garigliotti, D., Navigli, R. (2015). Open Knowledge Extraction Challenge. In: Gandon, F., Cabrio, E., Stankovic, M., Zimmermann, A. (eds) Semantic Web Evaluation Challenges. SemWebEval 2015. Communications in Computer and Information Science, vol 548. Springer, Cham. https://doi.org/10.1007/978-3-319-25518-7_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25518-7_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25517-0

  • Online ISBN: 978-3-319-25518-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics