Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7536))

  • 671 Accesses

Abstract

RISOT was a pilot task in FIRE 2011 which focused on the retrieval of automatically recognized text from machine printed sources. The collection used for search was a subset of the FIRE 2008 and 2010 Bengali test collections that contained 92 topics and 62,825 documents. Two teams participated, submitting a total of 12 monolingual runs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Harman, D.: Overview of the Fourth Text Retrieval Conference. In: The Fourth Text Retrieval Conference, Gaithersburg, MD, USA, pp. 1–24 (1995)

    Google Scholar 

  2. Kantor, P., Voorhees, E.: Report on the TREC-5 Confusion Track. In: The Fifth Text Retrieval Conference, Gaithersburg, MD, USA, pp. 65–74 (1996)

    Google Scholar 

  3. Baron, J., Lewis, D., Oard, D.: The TREC-2006 Legal Track. In: The Fifteenth Text Retrieval Conference, Gaithersburg, MD, USA (2006)

    Google Scholar 

  4. Tomlinson, S., Oard, D., Baron, J., Thompson, P.: Overview of the TREC 2007 Legal Track. In: The Sixteenth Text Retrieval Conference, Gaithersburg, MD, USA (2007)

    Google Scholar 

  5. Oard, D., Hedin, B., Tomlinson, S., Baron, J.: Overview of the TREC 2008 Legal Track. In: The Seventeenth Text Retrieval Conference, Gaithersburg, MD, USA (2008)

    Google Scholar 

  6. Hedin, B., Tomlinson, S., Baron, J., Oard, D.: Overview of the TREC 2009 Legal Track. In: The Eighteenth Text Retrieval Conference, Gaithersburg, MD, USA (2009)

    Google Scholar 

  7. Taghva, K., Borsack, J., Condit, A.: Results of Applying probabilistic IR to OCR Text. In: The Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, pp. 202–211 (1994)

    Google Scholar 

  8. Singhal, A., Salton, G., Buckley, C.: Length Normalization in Degraded Text Collections. In: Proceedings of the Fifth Annual Symposium on Document Analysis and Information Retrieval, Las Vegas, NV, USA, pp. 149–162 (1996)

    Google Scholar 

  9. Vincent, L.: Google Book Search: Document Understanding on a Massive Scale. In: Ninth International Conference on Document Analysis and Recognition, Curitiba, Brazil, pp. 819–823 (2007)

    Google Scholar 

  10. Garain, U., Chaudhuri, B.: Compound character recognition by run number based metric distance. In: Proceedings of the IS&T/SPIE 10th International Symposium on Electronic Imaging: Science & Technology, SPIE, San Jose, CA, USA, vol. 3305, pp. 90–97 (1998)

    Google Scholar 

  11. Sauvola, J., Kauniskangas, H., Doermann, D., Pietikainen, M.: Techniques for automated testing of document analysis algorithms. In: Brazilian Symposium on Document Image Analysis, Curitaba, Brazil, pp. 201–212 (1997)

    Google Scholar 

  12. Majumder, P., Mitra, M., Pal, D., Bandyopadhyay, A., Maiti, S., Pal, S., Modak, D., Sanyal, S.: The FIRE 2008 Evaluation Exercise. ACM Transactions on Asian Language Information Processing 9(3), 10:1–10:24 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Garain, U., Paik, J.H., Pal, T., Majumder, P., Doermann, D.S., Oard, D.W. (2013). Overview of the FIRE 2011 RISOT Task. In: Majumder, P., Mitra, M., Bhattacharyya, P., Subramaniam, L.V., Contractor, D., Rosso, P. (eds) Multilingual Information Access in South Asian Languages. Lecture Notes in Computer Science, vol 7536. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40087-2_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40087-2_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40086-5

  • Online ISBN: 978-3-642-40087-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics