Thesaurus Based Web Searching

Conference paper
Part of the Advances in Intelligent and Soft Computing book series (AINSC, volume 166)

Abstract

Search engine technology has become quite popular to help users seek information available on the web. The success of a searching system is determined by the quality and efficiency of the search results. There may be very good items on the search topic in other languages, but, search engine will generally retrieve items of only one language. Most of these search engines use pattern search which is not efficient. In this paper we present a tool that addresses this problem. Here we discuss the work carried out in developing an efficient tool that retrieves all the items of the database relevant to search term, not just the term matching. This tool retrieves all the synonym matches from both Telugu and English languages.

Keywords

Telugu dictionary Foreign key EngTelMap context resolution reverse mapping 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Chau, M., et al.: SpidersRUs: Creating specialized search engines in multiple languages. Science Direct, Decision Support Systems 45, 621–640 (2008)CrossRefGoogle Scholar
  2. 2.
    Uma Maheshwara Rao, G.: Morphological complexity of Telugu. In: ICOSAL-2 (2000)Google Scholar
  3. 3.
    Bhattacharya, P., et al.: A multi Lingual Meaning Based Search EngineGoogle Scholar
  4. 4.
    Brin, S., Page, L.: The anatomy of a Large-Scale Hypertextual Web Search Engine. In: Proceedings of the7th WWW Conference, Brisbane, Australia (April 1998)Google Scholar
  5. 5.
    Chen, H., Chau, M., Zeng, D.: CI Spider: a tool for competitive intelligence on the web. Decision Support Systems 34(1), 1–17 (2002)CrossRefGoogle Scholar
  6. 6.
    Lovins, J.: Development of stemming algorithm. Journal of Mechanical Translation and Computational Linguistics 11, 22–31 (1968)Google Scholar
  7. 7.
    Sunitha, K.V.N., Sharada, A.: Building an Efficient Language Model based on Morphology for Telugu ASR. In: KSE-1, CIIL, Mysore (March 2010)Google Scholar
  8. 8.
    Sunitha, K.V.N., Sharada, A.: Telugu Text Corpora Analysis for Creating Speech Database. IJEIT 1(2) (December 2009) ISSN 0975-5292Google Scholar
  9. 9.
    Paice, C., Husk, G.: Another Stemmer. ACM SIGIR Forum 24(3), 566 (1990)CrossRefGoogle Scholar
  10. 10.
    Porter, M.F.: An algorithm for suffix stripping. In: Readings in Information Retrieval, pp. 313–316. Morgan Kaufmann Publishers Inc., San Francisco (1997)Google Scholar
  11. 11.
    Xu, J., Bruce Croft, W.: Corpus based stemming using co-occurrence of word variants. ACM Trans. Inf. Syst. 16(1), 61–81 (1998)CrossRefGoogle Scholar
  12. 12.
    Dawson, J.L.: Suffix removal for word conflation. Bulletin of the Association for Literary and Linguistic Computing 2(3), 33–46 (1974)Google Scholar

Copyright information

© Springer-Verlag GmbH Berlin Heidelberg 2012

Authors and Affiliations

  1. 1.CSE DeptG. Narayanamma Institute of Technology and ScienceHyderabadIndia

Personalised recommendations