Infrastructure for Bangla Information Retrieval in the Context of ICT for Development

  • Nafid Haque
  • M. Hammad Ali
  • Matin Saad Abdullah
  • Mumit Khan


This paper describes the development of a search engine and information retrieval system for Bangla. Existing work in this area assumes a particular text encoding or the availability of particular facilities on the user's machine. We wanted an implementation that requires no special features or optimizations at the user end and performs equally well in all situations. To this end, we chose two case studies to work on in our effort to find a suitable solution. While working on these cases, we encountered several problems and had to work around them. We had to choose, from a set of software packages, the one that would best serve our needs. We also had to consider user convenience, keeping in mind the diverse demographics of the people who might need such a system. Finally, we arrived at a system with all the desired features. Possible future developments that came to mind in the course of our work are also mentioned in this paper.


Keywords: Search Engine, Digital Divide, Information Retrieval System, Legal Content, Local Machine





Copyright information

© Springer 2007

Authors and Affiliations

  • Nafid Haque (1)
  • M. Hammad Ali (1)
  • Matin Saad Abdullah (1)
  • Mumit Khan (1)

  1. BRAC University, 66 Mohakhali, Bangladesh
