Constructing Domain Ontology Using Structural and Semantic Characteristics of Web-Table Head

Jung, Sung-won; Kang, Mi-young; Kwon, Hyuk-chul

doi:10.1007/978-3-540-73325-6_66

Sung-won Jung¹,², Mi-young Kang¹ & Hyuk-chul Kwon¹

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4570)

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

Abstract

This study concerns the construction of a domain ontology from web tables in a specific domain. An ontology defines the common terms and their meanings (concepts) within a context, so only meaningful tables are of concern here. A meaningful table is composed of a head and a body, which are formatted in rows and columns, and the head abstracts the meaning expressed in the body. As prerequisite work toward a table-information-extraction framework, this study therefore extracts from the head the structural semantics, that is, the domain ontology that frames web-table information. We propose a method for automatically extracting a domain ontology using the structural and semantic characteristics of the web-table head. Construction proceeds in two steps: (a) extracting a table schema, as a pseudo-ontology, from each table in the same domain and (b) constructing the domain ontology by combining the extracted table schemata. The schemata are combined through splitting and clustering, using (a) statistical information and (b) heuristics based on the structural and semantic characteristics of the web-table head.
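The two steps can be pictured with a small Python sketch. It is an illustration under stated assumptions, not the authors' method: the abstract does not specify the schema representation, the statistics, or the heuristics, so the function names `head_to_schema` and `merge_schemata`, the representation of a head as root-to-leaf label paths, and the `min_support` frequency threshold (a simple stand-in for the paper's splitting and clustering) are all hypothetical.

```python
from collections import Counter


def head_to_schema(head_rows):
    """Step (a), simplified: read each column of a table head top-down.

    head_rows is a list of head rows, each a list with one label per
    column; a spanning cell is assumed to be repeated in every column
    (or row) it covers. Returns one root-to-leaf label path per column.
    """
    paths = []
    for column in zip(*head_rows):
        path = []
        for label in column:
            # A label repeated from the cell above marks a row-spanning
            # cell, so it is recorded only once.
            if not path or path[-1] != label:
                path.append(label)
        paths.append(tuple(path))
    return paths


def merge_schemata(schemata, min_support=2):
    """Step (b), simplified: keep paths seen in enough tables.

    This frequency filter is an assumed placeholder for the statistical
    information and heuristics mentioned in the abstract.
    """
    counts = Counter()
    for schema in schemata:
        counts.update(set(schema))  # count each path once per table
    return sorted(p for p, n in counts.items() if n >= min_support)


# Two made-up table heads from the same (hypothetical) product domain.
table_a = [["Product", "Price", "Price"],
           ["Product", "Retail", "Sale"]]
table_b = [["Product", "Price", "Price", "Stock"],
           ["Product", "Retail", "Sale", "Stock"]]

schemata = [head_to_schema(table_a), head_to_schema(table_b)]
domain_paths = merge_schemata(schemata)
print(domain_paths)
```

With these two heads, the paths shared by both tables survive the merge, while the `Stock` column, which appears in only one table, is dropped. A real system would need to handle synonymous labels and differing nesting depths, which this sketch ignores.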

Author information

Authors and Affiliations

Pusan National University, Korean Language Processing Laboratory, Department of Computer Science Engineering
Sung-won Jung, Mi-young Kang & Hyuk-chul Kwon
Pusan National University, Center for U-Port IT Research and Education, Jangjeon-dong, Geumjeong-gu, 609-735, Busan, Korea
Sung-won Jung

Editor information

Hiroshi G. Okuno, Moonis Ali

About this paper

Cite this paper

Jung, Sw., Kang, My., Kwon, Hc. (2007). Constructing Domain Ontology Using Structural and Semantic Characteristics of Web-Table Head. In: Okuno, H.G., Ali, M. (eds) New Trends in Applied Artificial Intelligence. IEA/AIE 2007. Lecture Notes in Computer Science, vol 4570. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73325-6_66

DOI: https://doi.org/10.1007/978-3-540-73325-6_66
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73322-5
Online ISBN: 978-3-540-73325-6
eBook Packages: Computer Science, Computer Science (R0)
