Infrastructure for Bangla Information retrieval in the context of ICT for Development

  • Nafid Haque
  • M. Hammad Ali
  • Matin Saad Abdullah
  • Mumit Khan

Abstract

In this paper, we talk about developing a search engine and information retrieval system for Bangla. Current work done in this area assumes the use of a particular type of encoding or the availability of particular facilities for the user. We wanted to come up with an implementation that did not require any special features or optimizations in the user end, and would perform just as well in all situations. For this purpose, we picked two case studies to work on in our effort to finding a suitable solution to the problem. While working on these cases, we encountered several problems and had to find our way around these problems. We had to pick and choose from a set of software packages for the one that would best serve our needs. We also had to take into consideration user convenience in using our system, for which we had to keep in mind the diverse demographics of people that might have need for such a system. Finally, we came up with the system, with all the desired features. Some possible future developments also came into mind in the course of our work, which are also mentioned in this paper.

Keywords

Expense Arena Extractor Harness Hone 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [1]
    Google Desktop, available online at http://desktop.google.com/about.htmlGoogle Scholar
  2. [2]
    Yahoo! Search, available online at http://search.yahoo.comGoogle Scholar
  3. [3]
    MSN Search, available online at http://search.msn.comGoogle Scholar
  4. [4]
    Erik Hatcher and Otis Gospodnetic, ‘Lucene in Action’, April 2006.Google Scholar
  5. [5]
    The Official Nutch Website – http://www.lucene.apache.org/nutchGoogle Scholar
  6. [6]
    Vicaya, available online at http://vicaya.sourceforge.netGoogle Scholar
  7. [7]
    Prothom Alo, the largest online daily newspaper in Bangla, available online at www.prothom-alo.netGoogle Scholar
  8. [8]
    D.Net – Development Research Network, www.dnet-bangladesh.orgGoogle Scholar
  9. [9]
    Pallitathya, a research program of D.Net on understanding information needs from a village perceptive, http://www.pallitathya.org/Google Scholar
  10. [10]
    Abolombon, a program of D.Net designed to improve access to legal information on governance and human rights issues, http://www.abolombon.org/Google Scholar
  11. [11]
    The Nutch wiki, available online at http://wiki.apache.org/nutch/Google Scholar
  12. [12]
    The Nutch tutorial for Version 0.7.x, available online at http://www.lucene.apache.org/nutch/tutorial.htmlGoogle Scholar
  13. [13]
    A step by step guideline on how to configure and use Tomcat, available online at http://www.coreservlets.com/Apache-Tomcat-Tutorial/Google Scholar
  14. [14]
    Weblog on enabling Tomcat to support UTF-8 Encoding - http://rollerweblogger.org/page/roller/20040415Google Scholar
  15. [15]
    FAQ on the World Summit Information Society at http://www.itu.int/wsis/Google Scholar

Copyright information

© Springer 2007

Authors and Affiliations

  • Nafid Haque
    • 1
  • M. Hammad Ali
    • 1
  • Matin Saad Abdullah
    • 1
  • Mumit Khan
    • 1
  1. 1.BRAC University66 MohakhaliBangladesh

Personalised recommendations