Skip to main content

Parallel Information Retrieval with Query Expansion

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2367))

Abstract

An information retrieval (IR) system with query expansion on a low-cost high-performance PC cluster environment is implemented. The IR system stores document sets, it is indexed by the inverted-index-file (IIF), and the vector space model is used as ranking strategy. The query expansion is adding terms into the original query for raising retrieval effectiveness. In this work, the query expansion with the collocation-based similarity measure is used. In our parallel IR system, the inverted-index file (IIF) is partitioned into pieces using the lexical and the greedy declustering methods. For each incoming user’s query with multiple terms after query expansion, terms are sent to the corresponding nodes that contain the relevant pieces of the IIF to be evaluated in parallel. We study how query performance is affected by query expansion and two declustering methods using two standard Korean test collections. According to the experiments, the greedy method shows about 20% enhancement overall when compared with the lexical method.

This work was supported by Hankuk University of Foreign Studies Research Fund of 2002.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Park, S.H., Kwon, H.C.: An Improved Relevance Feedback for Korean Information Retrieval System. Proceedings of the 16th IASTED International Conference on Applied Informatics, IASTED/ACTA Press, Garmisch-Partenkirchen, Germany (1998) 65–68

    Google Scholar 

  2. Frakes, W., Baeza-Yates, R.: Information retrieval-data structures & algorithms. Prentice-Hall (1992)

    Google Scholar 

  3. Cormack, G.V., Clarke, C.L.A., Palmer, C.R., Kisman, D.I.E.: Fast Automatic Passage Ranking (MultiText Experiment for TREC-8). The proceedings of the Eighth Text Retrieval Conference (TREC-8), Gaithersburg, Maryland (1999) 735–741

    Google Scholar 

  4. Chung, Y.J., Kwon H.C., Chung, S.H., Ryu, K.R.: Declustering Web Content Indices for Parallel Information Retrieval, Lecture Notes in Artificial Intelligence 2109 (2001) 346–350

    Google Scholar 

  5. Xu, J., Croft, W.B.: Query Expansion Using Local and Global Document Analysis, The proceedings of the 19th ACM SIGIR International Conference on Research and Development in Information Retrieval, Zurich (1996) 4–11

    Google Scholar 

  6. Salton, G.: Automatic Text Processing. Addison-Wesley Publishing Company (1989) 313–319

    Google Scholar 

  7. Rijsbergen, C.J.V.: A Theoretical Basis for the Use of Cooccurrence Data in Information Retrieval, Journal of Documentation 33, 106–119

    Google Scholar 

  8. Kim, M.C., Choi, K.S.: A Comparison of Collocation-based Similarity Measures in Query Expansion, Information Processing and Management 35 (1999) 19–30

    Article  Google Scholar 

  9. Qiu, Y, Frei, H.P.: Concept Based Query Expansion, The Proceedings of the 16th ACM SIGIR International Conference on Research and Development in Information Retrieval, Pittsburgh (1993) 160–169

    Google Scholar 

  10. Kim, S.H., Seo, E.K., Lee, W.K., Kim, M.C., Kim, Y.H., Kim, J.K.: The Development of Test Collection for Automatic Indexer, Journal of the Korean Society for Information Management, 11(1), 81–102

    Google Scholar 

  11. Kang, H.K. Choi, K.S.: Two-level Document Ranking Using Mutual Information in Natural Language Information Retrieval, Information Processing and Management, 33(3). 289–306

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chung, Y. (2002). Parallel Information Retrieval with Query Expansion. In: Fagerholm, J., Haataja, J., Järvinen, J., Lyly, M., Råback, P., Savolainen, V. (eds) Applied Parallel Computing. PARA 2002. Lecture Notes in Computer Science, vol 2367. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48051-X_20

Download citation

  • DOI: https://doi.org/10.1007/3-540-48051-X_20

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43786-4

  • Online ISBN: 978-3-540-48051-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics