Skip to main content

Ensemble Approach for Cross Language Information Retrieval

  • Conference paper
Book cover Computational Linguistics and Intelligent Text Processing (CICLing 2012)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7182))

Abstract

Cross language information retrieval (CLIR) is a sub field of information retrieval (IR) which deals with retrieval of content from one language (source language) for a search query expressed in another language (target language) in the Web. Cross Language Information Retrieval evolved as a field due to the fact that majority of the content in the web is in English. Hence there is a need for dynamic translation of web content for a query expressed in the native language. The biggest problem is that of ambiguity of the query expressed in the native language. The ambiguity of languages is typically not a problem for human beings who can infer the appropriate word sense or meaning based on context, but search engines cannot usually overcome these limitations. Hence, methods and mechanisms to provide native languages access to information from the web are needed. There is a need, to not only retrieve the relevant results but also, present the content behind the results in a user understandable manner. The research in the domain has so far focused in terms of techniques that make use support vector machines, suffix tree approach, Boolean models, and iterative results clustering. This research work focuses on a methodology of personalized context based cross language information retrieval using ensemble-learning approach. The source language for this research is taken, as English and the target language is Telugu. The methodology has tested for various queries and the results are shown in this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Makin., R., Pandey., N., Pingali, P., Varma, V.: Experiments in Cross lingual IR among Indian Languages. In: International Workshop on Cross Language Information Processing (CLIP 2007), Genoa, July 9-10 (2007)

    Google Scholar 

  2. Saraswathi, S., Siddhiqaa, M., Kalaimagal, K.: Bilingual Information Retrieval System for English and Tamil. Journal of Computing 2(4) (April 2010)

    Google Scholar 

  3. Lazarinis, F., Jesus, S., John, V.: Current research issues and trends in non-English Web searching. Springer Science (2009)

    Google Scholar 

  4. Vijayanand, K., Seenivasan, R.P.: Named Entity Recognition and Transliteration for Telugu Language. Language in India, Special Volume: Problems of Parsing in Indian Languages (May 2011), www.languageinindia.com

  5. Sieg, A., Mobasher, B., Burke, R.: Learning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search. IEEE Intelligent Informatics Bulletin 8(1) (November 2007)

    Google Scholar 

  6. Carpineto, C., Romano, G., Snidero, M.: Mobile information Retrieval with Search Results Clustering: Prototypes and Evaluation. Journal of the American Society for Information Science and Technology 60(5), 877–895 (2009)

    Article  Google Scholar 

  7. Huo, Z., Zhao, J., Hu, X.: Web Data Management for Mobile Users, Network and Parallel Computing Workshops. In: IFIP International Conference on NPC Workshops, September 18-21 (2007)

    Google Scholar 

  8. Banu, W.A., Kader, P.S.A.: A Hybrid Context Based Approach for Web Information Retrieval. International Journal of Computer Applications, article 5 (2010)

    Google Scholar 

  9. Nasharuddin, N.A., Abdullah, M.: Cross-lingual Information Retrieval: State-of-the-Art. Electronic Journal of Computer Science and Information Technology 2 (2010)

    Google Scholar 

  10. Petrelli, D., Levin, S., Beaulieu, M., Sanderson, M.: Which user interaction for cross-language information retrieval? Design issues and reflections. Journal of the American Society for Information Science and Technology 57(5), 709–722

    Google Scholar 

  11. Damjanovic, V., Gasevic, D., Devedzic, V.: Semiotics for Ontologies and Knowledge Representation. In: Proc. of Wissens Management, pp. 571–574 (2005)

    Google Scholar 

  12. Zhou, D., Truran, M., Brailsford, T., Ashman, H.: A Hybrid Technique for English-Chinese Cross Language Information Retrieval. ACM Transactions on Asian Language Information Processing (2008)

    Google Scholar 

  13. Wang, X., Broder, A., Gabrilovich, E., Josifovski, V., Pang, B.: Cross-language query classification using web search for exogenous knowledge. In: Proceedings of the Second ACM International Conference on Web Search and Data Mining (February 2009)

    Google Scholar 

  14. Maeda, A., Kimura, F.: An Approach to Cross-Age and Cross-Cultural Information Access for Digital Humanities. In: Digital Resources for the Humanities and Arts 2008 Conference (DRHA 2008), Cambridge, U.K (September 2008)

    Google Scholar 

  15. Khan, A., Naveed, A.M.: Corpus Based Mapping of Urdu Characters for Cell Phones. In: Proceedings of the Conference on Language & Technology (2009)

    Google Scholar 

  16. Prasad, P., Varma, V.: Hindi and Telugu to English Cross Language Information Retrieval at CLEF 2006. In: Working Notes for the CLEF 2006 Workshop (Cross Language Adhoc Task), Alicante, Spain, September 20-22 (2006)

    Google Scholar 

  17. Manning, C.D., Schutze, H.: Foundations of Statistical Natural Language Processing. The MIT Press (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mavaluru, D., Shriram, R., Banu, W.A. (2012). Ensemble Approach for Cross Language Information Retrieval. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2012. Lecture Notes in Computer Science, vol 7182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28601-8_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-28601-8_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-28600-1

  • Online ISBN: 978-3-642-28601-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics