SAPOP: Semiautomatic Framework for Practical Ontology Population from Structured Knowledge Bases

Sun, Xinruo; Wang, Haofen; Yu, Yong

doi:10.1007/978-1-4614-6880-6_16

Xinruo Sun⁶,
Haofen Wang⁶ &
Yong Yu⁶

Part of the book series: Springer Proceedings in Complexity ((SPCOM))

1778 Accesses

Abstract

The Semantic Web is evolving very quickly. There are already many theories and tools to model various kinds of semantics using ontologies. However after organizations completed modeling the ontology structure, the ontologies must also be filled with instances and relationships to make them practical. This process of ontology population could be hard because we are facing a cold start problem. On the other hand, the potential instances could already exist in LOD, online encyclopedia, or corporate databases in the form of structured data. We think these instances along with their related features could remedy the cold start problem a lot. We present a practical framework to verify this hypothesis. In this framework, first a semiautomated seed discovery method is used to bootstrap the population. Then, we use semi-supervised learning methods to refine and expand the seed instances. Finally the population quality is verified using an effective evaluation process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://linkeddata.org/.

References

Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data the semantic web. In: Proceedings of 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference (ISWC+ASWC 2007), pp. 722–735, 2007
Google Scholar
Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Semantic Web Inf. Syst. 5, 1–22 (2009)
Google Scholar
Liu, B., Lee, W.S., Yu, P.S., Li, X.: Partially supervised classification of text documents. In: ICML, p. 387–394
Google Scholar
Pasca, M.: Acquisition of categorized named entities for web search. In: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, pp. 137–145, CIKM ’04
Google Scholar
Petasis, G., Karkaletsis, V., Paliouras, G., Krithara, A., Zavitsanos, E.: Ontology population and enrichment: state of the art. In: Paliouras, G., Spyropoulos, C., Tsatsaronis, G. (eds.) Knowledge-Driven Multimedia Information Extraction and Ontology Evolution, Lecture Notes in Computer Science, vol. 6050, pp. 134–166. Springer, Berlin/Heidelberg (2011)
Chapter Google Scholar
Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., Demirbas, M.: Short text classification in twitter to improve information filtering, pp. 841–842. SIGIR ’10
Google Scholar
Stevenson, M., Gaizauskas, R.: Using corpus-derived name lists for named entity recognition. In: Proceedings of the Sixth Conference on Applied Natural Language Processing, pp. 290–295. ANLC ’00
Google Scholar
Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic knowledge. In: Proceedings of the 16th International Conference on World Wide Web, pp. 697–706. WWW ’07. ACM, New York, NY
Google Scholar
Toral, A., Munoz, R.: A proposal to automatically build and maintain gazetteers for Named Entity Recognition by using Wikipedia. EACL (2006)
Google Scholar
Wang, R.C., Schlaefer, N., Cohen, W.W., Nyberg, E.: Automatic set expansion for list question answering. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 947–954. EMNLP ’08
Google Scholar
Zhang, B., Zuo, W.: Learning from positive and unlabeled examples: A survey, pp. 650–654. IEEE (May)
Google Scholar

Download references

Author information

Authors and Affiliations

APEX Data & Knowledge Management Lab, Shanghai Jiao Tong University, Shanghai, China
Xinruo Sun, Haofen Wang & Yong Yu

Authors

Xinruo Sun
View author publications
You can also search for this author in PubMed Google Scholar
Haofen Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yong Yu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xinruo Sun .

Editor information

Editors and Affiliations

, Dept. of Computer Science and Technology, Tsinghua University, Room 10-206, East main building, Beijing, 100084, China, People's Republic
Juanzi Li
, School of Comp. Sci. & Eng., Southeast University, Dongda Road 2, Nanjing, 211189, Jiangsu, China, People's Republic
Guilin Qi
Peking University, Inst. of Computer Science & Tech., North Zhongguancun Street 128, Beijing, 100871, China, People's Republic
Dongyan Zhao
L3S Research Center, Leibniz University Hannover, Appelstr. 4, Hannover, 30167, Germany
Wolfgang Nejdl
Tsinghua Campus H202B, Shenzhen City, 518055, China, People's Republic
Hai-Tao Zheng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sun, X., Wang, H., Yu, Y. (2013). SAPOP: Semiautomatic Framework for Practical Ontology Population from Structured Knowledge Bases. In: Li, J., Qi, G., Zhao, D., Nejdl, W., Zheng, HT. (eds) Semantic Web and Web Science. Springer Proceedings in Complexity. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6880-6_16

Download citation

DOI: https://doi.org/10.1007/978-1-4614-6880-6_16
Published: 02 May 2013
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-6879-0
Online ISBN: 978-1-4614-6880-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics