Abstract
RISOT was a pilot task at FIRE 2011 that focused on the retrieval of automatically recognized text from machine-printed sources. The search collection was a subset of the FIRE 2008 and 2010 Bengali test collections, comprising 92 topics and 62,825 documents. Two teams participated, submitting a total of 12 monolingual runs.
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Garain, U., Paik, J.H., Pal, T., Majumder, P., Doermann, D.S., Oard, D.W. (2013). Overview of the FIRE 2011 RISOT Task. In: Majumder, P., Mitra, M., Bhattacharyya, P., Subramaniam, L.V., Contractor, D., Rosso, P. (eds) Multilingual Information Access in South Asian Languages. Lecture Notes in Computer Science, vol 7536. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40087-2_19
DOI: https://doi.org/10.1007/978-3-642-40087-2_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40086-5
Online ISBN: 978-3-642-40087-2
eBook Packages: Computer Science, Computer Science (R0)