Indian Statistical Institute at INEX 2008 Adhoc Track

Pal, Sukomal; Mitra, Mandar; Ganguly, Debasis; Maiti, Samaresh; Bandyopadhyay, Ayan; Sen, Aparajita; Mitra, Sukanya

doi:10.1007/978-3-642-03761-0_9

Sukomal Pal¹⁹,
Mandar Mitra¹⁹,
Debasis Ganguly¹⁹,
Samaresh Maiti¹⁹,
Ayan Bandyopadhyay¹⁹,
Aparajita Sen¹⁹ &
…
Sukanya Mitra¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5631))

Included in the following conference series:

International Workshop of the Initiative for the Evaluation of XML Retrieval

396 Accesses

Abstract

This paper describes the work that we did at Indian Statistical Institute towards XML retrieval for INEX 2008. Besides the Vector Space Model (VSM) that we have been using since INEX 2006, this year we implemented the Language Modeling (LM) approach in our text retrieval system (SMART) to retrieve XML elements against the INEX Adhoc queries. Like last year, we considered Content-Only (CO) queries and submitted three runs for the FOCUSED sub-task. Two runs are based on the Vector Space Model and one uses the Language Model. One of the VSM-based runs (VSMfbElts0.4) retrieves sub-document-level elements. Both the other runs (VSMfb and LM-nofb-0.20) retrieve elements only at the whole-document level. We applied blind feedback for both the VSM-based runs; no query expansion was used in the LM-based run. In general, the relative performance of our document-level runs is respectable (ranked 15/61 and 22/61 according to the official metric). Though our element retrieval run does reasonably (ranked 16/61 by iP[0.01]) according to the early-precision metrics, we think there is plenty of scope to improve our element retrieval strategy. Our immediate next task is therefore to focus on how to improve true element-level retrieval.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

INEX: Initiative for the Evaluation of XML Retrieval (2008), http://www.inex.otago.ac.nz
W3C: XPath-XML Path Language(XPath) Version 1.0, http://www.w3.org/TR/xpath
Salton, G.: A Blueprint for Automatic Indexing. ACM SIGIR Forum 16(2), 22–38 (1981)
Article Google Scholar
Buckley, C., Singhal, A., Mitra, M.: Using Query Zoning and Correlation within SMART: TREC5. In: Voorhees, E., Harman, D. (eds.) Proc. Fifth Text Retrieval Conference (TREC-5), NIST Special Publication 500-238 (1997)
Google Scholar
Hiemstra, D.: Using language models for information retrieval. PhD thesis, University of Twente (2001)
Google Scholar
Ganguly, D.: Implementing a language modeling framework for information retrieval. Master’s thesis, Indian Statistical Institute (2008)
Google Scholar
Mitra, M., Singhal, A., Buckley, C.: Improving automatic query expansion. In: SIGIR 1998, Melbourne, Australia, pp. 206–214. ACM, New York (1998)
Google Scholar
Pal, S., Mitra, M., Chakraborty, A.: Stability of inex 2007 evaluation measures. In: Proceedings of the Second International Workshop on Evaluating Information Access (EVIA), pp. 23–29 (2008), http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings7/pdf/EVIA2008/06-EVIA2008-PalS.pdf
Fuhr, N., Kamps, J., Lalmas, M., Malik, S., Trotman, A.: Overview of the INEX 2007 Ad Hoc Track. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 1–23. Springer, Heidelberg (2008)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Information Retrieval Lab, CVPR Unit, Indian Statistical Institute, Kolkata, India
Sukomal Pal, Mandar Mitra, Debasis Ganguly, Samaresh Maiti, Ayan Bandyopadhyay, Aparajita Sen & Sukanya Mitra

Authors

Sukomal Pal
View author publications
You can also search for this author in PubMed Google Scholar
Mandar Mitra
View author publications
You can also search for this author in PubMed Google Scholar
Debasis Ganguly
View author publications
You can also search for this author in PubMed Google Scholar
Samaresh Maiti
View author publications
You can also search for this author in PubMed Google Scholar
Ayan Bandyopadhyay
View author publications
You can also search for this author in PubMed Google Scholar
Aparajita Sen
View author publications
You can also search for this author in PubMed Google Scholar
Sukanya Mitra
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Science and Technology, Queensland University of Technology, GPO Box 2434, 4001, Brisband, Qld, Australia
Shlomo Geva
Archives and Information Studies/Humanities, University of Amsterdam, Turfdraagsterpad 9, 1012 XT, Amsterdam, The Netherlands
Jaap Kamps
Department of Computer Science, University of Otago, P.O. Box 56, 9054, Dunedin, New Zealand
Andrew Trotman

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pal, S. et al. (2009). Indian Statistical Institute at INEX 2008 Adhoc Track. In: Geva, S., Kamps, J., Trotman, A. (eds) Advances in Focused Retrieval. INEX 2008. Lecture Notes in Computer Science, vol 5631. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03761-0_9

Download citation

DOI: https://doi.org/10.1007/978-3-642-03761-0_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03760-3
Online ISBN: 978-3-642-03761-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics