Skip to main content

SAPOP: Semiautomatic Framework for Practical Ontology Population from Structured Knowledge Bases

  • Conference paper
  • First Online:
Semantic Web and Web Science

Part of the book series: Springer Proceedings in Complexity ((SPCOM))

  • 1778 Accesses

Abstract

The Semantic Web is evolving very quickly. There are already many theories and tools to model various kinds of semantics using ontologies. However after organizations completed modeling the ontology structure, the ontologies must also be filled with instances and relationships to make them practical. This process of ontology population could be hard because we are facing a cold start problem. On the other hand, the potential instances could already exist in LOD, online encyclopedia, or corporate databases in the form of structured data. We think these instances along with their related features could remedy the cold start problem a lot. We present a practical framework to verify this hypothesis. In this framework, first a semiautomated seed discovery method is used to bootstrap the population. Then, we use semi-supervised learning methods to refine and expand the seed instances. Finally the population quality is verified using an effective evaluation process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://linkeddata.org/.

References

  1. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data the semantic web. In: Proceedings of 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference (ISWC+ASWC 2007), pp. 722–735, 2007

    Google Scholar 

  2. Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Semantic Web Inf. Syst. 5, 1–22 (2009)

    Google Scholar 

  3. Liu, B., Lee, W.S., Yu, P.S., Li, X.: Partially supervised classification of text documents. In: ICML, p. 387–394

    Google Scholar 

  4. Pasca, M.: Acquisition of categorized named entities for web search. In: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, pp. 137–145, CIKM ’04

    Google Scholar 

  5. Petasis, G., Karkaletsis, V., Paliouras, G., Krithara, A., Zavitsanos, E.: Ontology population and enrichment: state of the art. In: Paliouras, G., Spyropoulos, C., Tsatsaronis, G. (eds.) Knowledge-Driven Multimedia Information Extraction and Ontology Evolution, Lecture Notes in Computer Science, vol. 6050, pp. 134–166. Springer, Berlin/Heidelberg (2011)

    Chapter  Google Scholar 

  6. Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., Demirbas, M.: Short text classification in twitter to improve information filtering, pp. 841–842. SIGIR ’10

    Google Scholar 

  7. Stevenson, M., Gaizauskas, R.: Using corpus-derived name lists for named entity recognition. In: Proceedings of the Sixth Conference on Applied Natural Language Processing, pp. 290–295. ANLC ’00

    Google Scholar 

  8. Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic knowledge. In: Proceedings of the 16th International Conference on World Wide Web, pp. 697–706. WWW ’07. ACM, New York, NY

    Google Scholar 

  9. Toral, A., Munoz, R.: A proposal to automatically build and maintain gazetteers for Named Entity Recognition by using Wikipedia. EACL (2006)

    Google Scholar 

  10. Wang, R.C., Schlaefer, N., Cohen, W.W., Nyberg, E.: Automatic set expansion for list question answering. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 947–954. EMNLP ’08

    Google Scholar 

  11. Zhang, B., Zuo, W.: Learning from positive and unlabeled examples: A survey, pp. 650–654. IEEE (May)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xinruo Sun .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer Science+Business Media New York

About this paper

Cite this paper

Sun, X., Wang, H., Yu, Y. (2013). SAPOP: Semiautomatic Framework for Practical Ontology Population from Structured Knowledge Bases. In: Li, J., Qi, G., Zhao, D., Nejdl, W., Zheng, HT. (eds) Semantic Web and Web Science. Springer Proceedings in Complexity. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6880-6_16

Download citation

  • DOI: https://doi.org/10.1007/978-1-4614-6880-6_16

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-6879-0

  • Online ISBN: 978-1-4614-6880-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics