Skip to main content

Ontology-Based Query Expansion with Latently Related Named Entities for Semantic Text Search

  • Chapter
Advances in Intelligent Information and Database Systems

Part of the book series: Studies in Computational Intelligence ((SCI,volume 283))

Abstract

Traditional information retrieval systems represent documents and queries by keyword sets. However, the content of a document or a query is mainly defined by both keywords and named entities occurring in it. Named entities have ontological features, namely, their aliases, classes, and identifiers, which are hidden from their textual appearance. Besides, the meaning of a query may imply latent named entities that are related to the apparent ones in the query. We propose an ontology-based generalized vector space model to semantic text search. It exploits ontological features of named entities and their latently related ones to reveal the semantics of documents and queries. We also propose a framework to combine different ontologies to take their complementary advantages for semantic annotation and searching. Experiments on a benchmark dataset show better search quality of our model to other ones.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. ACM Press, New York (1999)

    Google Scholar 

  2. Bast, H., Chitea, A., Suchanek, F., Weber, I.: ESTER: Efficient Search on Text, Entities, and Relations. In: Proceedings 30th Annual International ACM SIGIR Conference (SIGIR-2007), pp. 671–678. ACM, New York (2007)

    Chapter  Google Scholar 

  3. Cao, T.H., Cao, T.D., Tran, T.L.: A Robust Ontology-Based Method for Translating Natural Language Queries to Conceptual Graphs. In: Domingue, J., Anutariya, C. (eds.) ASWC 2008. LNCS, vol. 5367, pp. 479–492. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  4. Cao, T.H., Le, K.C., Ngo, V.M.: Exploring Combinations of Ontological Features and Keywords for Text Retrieval. In: Ho, T.-B., Zhou, Z.-H. (eds.) PRICAI 2008. LNCS (LNAI), vol. 5351, pp. 603–613. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  5. Castells, P., Vallet, D., Fernández, M.: An Adaptation of the Vector Space Model for Ontology-Based Information Retrieval. IEEE Transactions of Knowledge and Data Engineering 19(2), 261–272 (2007)

    Article  Google Scholar 

  6. Cheng, T., Yan, X., Chen, K., Chang, C.: EntityRank: Searching Entities Directly and Holistically. In: Proceedings of the 33rd Very Large Data Bases Conference (VLDB-2007), pp. 387–398 (2007)

    Google Scholar 

  7. Choi, N., Song, I.Y., Han, H.: A Survey on Ontology Mapping. ACM SIGMOD Record 35(3), 34–41 (2006)

    Article  Google Scholar 

  8. Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: Developing Language Processing Components with GATE Version 4, User Guide (2006), http://gate.ac.uk/sale/tao

  9. Elsas, J.L., Arguello, J., Callan, J., Carbonell, J.G.: Retrieval and Feedback Models for Blog Feed Search. In: Proceedings of the 31st annual international ACM SIGIR conference (SIGIR-2008), pp. 347–354. ACM, New York (2008)

    Chapter  Google Scholar 

  10. Fensel, D., Harmelen, V.F., Horrocks, I.: OIL: An Ontology Infrastructure for the Semantic Web. IEEE Intelligent System 16(2), 38–45 (2001)

    Article  Google Scholar 

  11. Fernández, M., et al.: Semantic Search Meets the Web. In: Proceedings of the 2nd IEEE International Conference on Semantic Computing (ICSC-2008), pp. 253–260 (2008)

    Google Scholar 

  12. Goncalves, A., Zhu, J., Song, D., Uren, V., Pacheco, R.: Latent Relation Dicovery for Vector Space Expansion and Information Retrieval. In: Yu, J.X., Kitsuregawa, M., Leong, H.-V. (eds.) WAIM 2006. LNCS, vol. 4016, pp. 122–133. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  13. Gruber, T.R.: Toward Principles for the Design of Ontologies Used for Knowledge Sharing. International Journal Human-Computer Studies 43(4), 907–928 (1995)

    Article  Google Scholar 

  14. Guha, R., McCool, R., Miller, E.: Semantic Search. In: Proceedings of the 12th International Conference on World Wide Web, pp. 700–709 (2003)

    Google Scholar 

  15. Kasneci, G., Ramanath, M., Suchanek, F., Weikum, G.: The YAGO-NAGA Approach to Knowledge Discovery. In: Proceedings of 28th ACM SIGMOD International Conference on Management of Data (ACM SIGMOD-2008), pp. 41–47. ACM, New York (2008)

    Google Scholar 

  16. Khalid, M.A., Jijkoun, A., Rijke, M.: The Impact of Named Entity Normalization on Information Retrieval for Question Answering. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 705–710. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  17. Lopez, V., Sabou, M., Motta, E.: PowerMap: Mapping the Real Semantic Web on the Fly. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 414–427. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  18. Kiryakov, A., Popov, B., Terziev, I., Manov, D., Ognyanoff, D.: Semantic Annotation, Indexing, and Retrieval. Elsevier’s Journal of Web Semantics 2(1) (2005)

    Google Scholar 

  19. Marsh, E., Perzanowski, D.: MUC-7 Evaluation of IE Technology: Overview of Results. In: Proceedings of the Seventh Message Understanding Conference, MUC-7 (1998)

    Google Scholar 

  20. Mihalcea, R., Moldovan, D.: Document Indexing using Named Entities. Studies in Informatics and Control 10(1) (2001)

    Google Scholar 

  21. Ngo, V.M., Cao, T.H.: A Generalized Vector Space Model for Ontology-Based Information Retrieval. Vietnamese Journal on Information Technologies and Communications 22 (2009) (To appear)

    Google Scholar 

  22. Sekine, S.: Named Entity: History and Future. Proteus Project Report (2004)

    Google Scholar 

  23. Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO - A Core of Semantic Knowledge. Unifying WordNet and Wikipedia. In: Proceeding of the 16th international conference on World Wide Web (WWW-2007), pp. 697–706. ACM, New York (2007)

    Chapter  Google Scholar 

  24. Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO - A Large Ontology from Wikipedia and Wordnet. Journal of Semantic Web 6(3), 203–217 (2008)

    Google Scholar 

  25. Tran, T., Cimiano, P., Rudolph, S., Studer, R.: Ontology-Based Interpretation of Keywords for Semantic Search. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 523–536. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  26. Varelas, G., Voutsakis, E., Paraskevi, R., Petrakis, G.M.E., Evagelos, E.M.: Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web. In: Proceedings of the 7th annual ACM international workshop on web information and data management, pp. 10–16 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Ngo, V.M., Cao, T.H. (2010). Ontology-Based Query Expansion with Latently Related Named Entities for Semantic Text Search. In: Nguyen, N.T., Katarzyniak, R., Chen, SM. (eds) Advances in Intelligent Information and Database Systems. Studies in Computational Intelligence, vol 283. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12090-9_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12090-9_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12089-3

  • Online ISBN: 978-3-642-12090-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics