Semantic Annotation Using Horizontal and Vertical Contexts
This paper addresses the issue of semantic annotation using horizontal and vertical contexts. Semantic annotation is a task of annotating web pages with ontological information. As information on a web page is usually two-dimensionally laid out, previous semantic annotation methods that view a web page as an ‘object’ sequence have limitations. In this paper, to better incorporate the two-dimensional contexts, semantic annotation is formalized as a problem of block detection and text annotation. Block detection is aimed at detecting the text block by making use of context in one dimension and text annotation is aimed at detecting the ‘targeted instance’ in the identified blocks using the other dimensional context. A two-stage method for semantic annotation using machine learning has been proposed. Experimental results indicate that the proposed method can significantly outperform the baseline method as well as the sequence-based method for semantic annotation.
Unable to display preview. Download preview PDF.
- 1.Benjamins, R., Contreras, J.: Six Challenges for the Semantic Web. Intelligent Software Components. In: Intelligent software for the networked economy (isoco) (April 2002)Google Scholar
- 2.Kushmerick, N., Weld, D.S., Doorenbos, R.B.: Wrapper Induction for Information Extraction. In: Proc. of IJCAI, Nagoya, Japan, pp. 729–737 (1997)Google Scholar
- 3.Ciravegna, F. (LP)2, an Adaptive Algorithm for Information Extraction from Web-related Texts. In: Proc. of the IJCAI 2001 Workshop on Adaptive Text Extraction and Mining, Seattle, USA (August 2001)Google Scholar
- 5.Tang, J., Li, J., Lu, H., Liang, B., Wang, K.: iASA: Learning to Annotate the Semantic Web. Journal on Data Semantic 4, 110–145 (2005)Google Scholar
- 7.Reeve, L.: Integrating Hidden Markov Models into Semantic Web Annotation Platforms. Technique Report (2004)Google Scholar