Abstract
Traditional information retrieval systems represent documents and queries by keyword sets. However, the content of a document or a query is mainly defined by both keywords and named entities occurring in it. Named entities have ontological features, namely, their aliases, classes, and identifiers, which are hidden from their textual appearance. Besides, the meaning of a query may imply latent named entities that are related to the apparent ones in the query. We propose an ontology-based generalized vector space model to semantic text search. It exploits ontological features of named entities and their latently related ones to reveal the semantics of documents and queries. We also propose a framework to combine different ontologies to take their complementary advantages for semantic annotation and searching. Experiments on a benchmark dataset show better search quality of our model to other ones.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. ACM Press, New York (1999)
Bast, H., Chitea, A., Suchanek, F., Weber, I.: ESTER: Efficient Search on Text, Entities, and Relations. In: Proceedings 30th Annual International ACM SIGIR Conference (SIGIR-2007), pp. 671–678. ACM, New York (2007)
Cao, T.H., Cao, T.D., Tran, T.L.: A Robust Ontology-Based Method for Translating Natural Language Queries to Conceptual Graphs. In: Domingue, J., Anutariya, C. (eds.) ASWC 2008. LNCS, vol. 5367, pp. 479–492. Springer, Heidelberg (2008)
Cao, T.H., Le, K.C., Ngo, V.M.: Exploring Combinations of Ontological Features and Keywords for Text Retrieval. In: Ho, T.-B., Zhou, Z.-H. (eds.) PRICAI 2008. LNCS (LNAI), vol. 5351, pp. 603–613. Springer, Heidelberg (2008)
Castells, P., Vallet, D., Fernández, M.: An Adaptation of the Vector Space Model for Ontology-Based Information Retrieval. IEEE Transactions of Knowledge and Data Engineering 19(2), 261–272 (2007)
Cheng, T., Yan, X., Chen, K., Chang, C.: EntityRank: Searching Entities Directly and Holistically. In: Proceedings of the 33rd Very Large Data Bases Conference (VLDB-2007), pp. 387–398 (2007)
Choi, N., Song, I.Y., Han, H.: A Survey on Ontology Mapping. ACM SIGMOD Record 35(3), 34–41 (2006)
Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: Developing Language Processing Components with GATE Version 4, User Guide (2006), http://gate.ac.uk/sale/tao
Elsas, J.L., Arguello, J., Callan, J., Carbonell, J.G.: Retrieval and Feedback Models for Blog Feed Search. In: Proceedings of the 31st annual international ACM SIGIR conference (SIGIR-2008), pp. 347–354. ACM, New York (2008)
Fensel, D., Harmelen, V.F., Horrocks, I.: OIL: An Ontology Infrastructure for the Semantic Web. IEEE Intelligent System 16(2), 38–45 (2001)
Fernández, M., et al.: Semantic Search Meets the Web. In: Proceedings of the 2nd IEEE International Conference on Semantic Computing (ICSC-2008), pp. 253–260 (2008)
Goncalves, A., Zhu, J., Song, D., Uren, V., Pacheco, R.: Latent Relation Dicovery for Vector Space Expansion and Information Retrieval. In: Yu, J.X., Kitsuregawa, M., Leong, H.-V. (eds.) WAIM 2006. LNCS, vol. 4016, pp. 122–133. Springer, Heidelberg (2006)
Gruber, T.R.: Toward Principles for the Design of Ontologies Used for Knowledge Sharing. International Journal Human-Computer Studies 43(4), 907–928 (1995)
Guha, R., McCool, R., Miller, E.: Semantic Search. In: Proceedings of the 12th International Conference on World Wide Web, pp. 700–709 (2003)
Kasneci, G., Ramanath, M., Suchanek, F., Weikum, G.: The YAGO-NAGA Approach to Knowledge Discovery. In: Proceedings of 28th ACM SIGMOD International Conference on Management of Data (ACM SIGMOD-2008), pp. 41–47. ACM, New York (2008)
Khalid, M.A., Jijkoun, A., Rijke, M.: The Impact of Named Entity Normalization on Information Retrieval for Question Answering. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 705–710. Springer, Heidelberg (2008)
Lopez, V., Sabou, M., Motta, E.: PowerMap: Mapping the Real Semantic Web on the Fly. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 414–427. Springer, Heidelberg (2006)
Kiryakov, A., Popov, B., Terziev, I., Manov, D., Ognyanoff, D.: Semantic Annotation, Indexing, and Retrieval. Elsevier’s Journal of Web Semantics 2(1) (2005)
Marsh, E., Perzanowski, D.: MUC-7 Evaluation of IE Technology: Overview of Results. In: Proceedings of the Seventh Message Understanding Conference, MUC-7 (1998)
Mihalcea, R., Moldovan, D.: Document Indexing using Named Entities. Studies in Informatics and Control 10(1) (2001)
Ngo, V.M., Cao, T.H.: A Generalized Vector Space Model for Ontology-Based Information Retrieval. Vietnamese Journal on Information Technologies and Communications 22 (2009) (To appear)
Sekine, S.: Named Entity: History and Future. Proteus Project Report (2004)
Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO - A Core of Semantic Knowledge. Unifying WordNet and Wikipedia. In: Proceeding of the 16th international conference on World Wide Web (WWW-2007), pp. 697–706. ACM, New York (2007)
Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO - A Large Ontology from Wikipedia and Wordnet. Journal of Semantic Web 6(3), 203–217 (2008)
Tran, T., Cimiano, P., Rudolph, S., Studer, R.: Ontology-Based Interpretation of Keywords for Semantic Search. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 523–536. Springer, Heidelberg (2007)
Varelas, G., Voutsakis, E., Paraskevi, R., Petrakis, G.M.E., Evagelos, E.M.: Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web. In: Proceedings of the 7th annual ACM international workshop on web information and data management, pp. 10–16 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Ngo, V.M., Cao, T.H. (2010). Ontology-Based Query Expansion with Latently Related Named Entities for Semantic Text Search. In: Nguyen, N.T., Katarzyniak, R., Chen, SM. (eds) Advances in Intelligent Information and Database Systems. Studies in Computational Intelligence, vol 283. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12090-9_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-12090-9_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12089-3
Online ISBN: 978-3-642-12090-9
eBook Packages: EngineeringEngineering (R0)