Abstract
Semantic annotation for text is a well-studied topic. However, little contribution has been engaged in the application of short text annotation. In this article, an automatic annotation approach is proposed for such purpose, which annotates short text with semantic labels for question answering systems. In the first step, keywords are extracted from a question and then a semantic label selection module is used to select semantic labels to tag keywords. If there is no appropriate label, WordNet is employed to obtain candidate labels to annotate those keywords by calculating the similarity between each keyword in the question and the concept list in our predefined Tagger Ontology. To improve the accuracy of annotation, we also design a naïve Bayesian based method to distinguish multi-senses and assign best semantic labels by referring to historically annotated questions. Preliminary experiments on 6 categories show our approach achieves the precision of 76% in average.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Cheng, P.J., Chiao, H.C., Pan, Y.C., Chien, L.F.: Annotating text segments in documents for search. In: Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 317–320 (2005)
Hao, T.Y., Hu, D.W., Liu, W.Y., Zeng, Q.T.: Semantic patterns for user-interactive question answering. Journal of Concurrency and Computation: Practice and Experience 20(1) (2007)
Lin, D.: Dependency-based evaluation of MINIPAR. Treebanks: Building and Using Parsed Corpora (2003)
Prager, J., Brown, E., Coden, A.: Question-answering by predictive annotation. In: Proceedings of the 23rd Annual International ACM SIGIR Conference, Athens (2000)
Sfihari, R., Li, W.: Question answering supported by information extraction. In: Proceedings of the Eighth Text REtrieval Conference (TREC8), Gaithersburg, Md (1999)
Carr, L., Bechhofer, S., Goble, C., Hall, W.: Conceptual linking: ontology-based open hypermedia. In: Proceedings of the 10th International World Wide Web Conference, Hong Kong, pp. 334–342 (2001)
Handschuh, S., Staab, S., Ciravegna, F.: S-CREAM – semi-automatic cREAtion of metadata. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, p. 358. Springer, Heidelberg (2002)
Vargas-Vera, M., Motta, E., Domingue, J., Lanzoni, M., Stutt, A., Ciravegna, F.: MnM: Ontology driven semi-automatic and automatic support for semantic markup. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, p. 379. Springer, Heidelberg (2002)
Kiryakov, A., Popov, B., Ognyanoff, D., Manov, D., Goranov, K.M.: Semantic annotation, indexing, and retrieval. Journal of Web Semantics, 49–79 (2004)
Reeve, L., Han, H.: Survey of semantic annotation platforms. In: Proceedings of the 2005 ACM Symposium on Applied Computing, Santa Fe, New, Mexico, March 13 -17 (2005)
Veale, T.: Meta-knowledge annotation for efficient natural-language question-answering. In: O’Neill, M., Sutcliffe, R.F.E., Ryan, C., Eaton, M., Griffith, N.J.L. (eds.) AICS 2002. LNCS (LNAI), vol. 2464, pp. 127–128. Springer, Heidelberg (2002)
Prager, J., Radev, D., Czuba, K.: Answering what-is questions by virtual annotation. In: Proceedings of the first International Conference on Human Language Technology Research 2001, San Diego, March 18 - 21 (2001)
Hays, D.: Dependency theory: a formalism and some observations. Language, Linguistic Society of America 40(4), 511–525 (1964)
Miller, G.A.: WordNet: a lexical database for English. Communications of the ACM 38(11) (1995)
Li, Y.H., Bandar, Z.A., McLean, D.: An approach for measuring semantic similarity between words using multiple information sources. IEEE Transactions on Knowledge and Data Engineering 15(4) (July/August 2003)
Cowie, J., Ludovik, E., Molina-Salgado, H., Nirenburg, S., Sheremetyeva, S.: Automatic question answering. In: Proceedings of the Rubin Institute for Advanced Orthopedics Conference, Paris (2000)
Álvez, J., Atserias, J., Carrera, J., Climent, S., Laparra, E., Oliver, A., Rigau, G.: Complete and consistent annotation of wordNet using the top concept ontology. In: Proceedings of Sixth International Language Resources and Evaluation (LREC 2008), European Language Resources Association, ELRA (2008)
Hao, T.Y., Ni, X.L., Quan, X.J., Liu, W.Y.: Automatic Construction of Semantic Dictionary for Question Categorization. In: Proceedings of The 13th World Multi-Conference on Systemics, Cybernetics and Informatics: WMSCI 2009, Orlando, July 10-13, pp. 220–225 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, G., Lu, Z., Hao, T., Liu, W. (2011). Automatic Short Text Annotation for Question Answering System. In: Filipe, J., Cordeiro, J. (eds) Web Information Systems and Technologies. WEBIST 2010. Lecture Notes in Business Information Processing, vol 75. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22810-0_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-22810-0_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22809-4
Online ISBN: 978-3-642-22810-0
eBook Packages: Computer ScienceComputer Science (R0)