Evaluation of Result Merging Strategies for Metasearch Engines

  • Conference paper
Web Information Systems Engineering – WISE 2005 (WISE 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3806))

Abstract

Result merging is a key component in a metasearch engine. Once the results from various search engines are collected, the metasearch system merges them into a single ranked list. The effectiveness of a metasearch engine is closely related to the result merging algorithm it employs. In this paper, we investigate a variety of result merging algorithms based on a wide range of available information about the retrieved results, from their local ranks, their titles and snippets, to the full documents of these results. The effectiveness of these algorithms is then compared experimentally based on 50 queries from the TREC Web track and the 10 most popular general-purpose search engines. Our experiments yield two important results. First, simple result merging strategies can outperform Google. Second, merging based on the titles and snippets of retrieved results can outperform that based on the full documents.
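
To make the idea of merging concrete, the following is a minimal sketch of two of the kinds of evidence the abstract mentions: local ranks, and titles and snippets. It is not the paper's algorithm. The Borda-style rank scoring, the query-term overlap measure, and all names (`merge_by_local_rank`, `snippet_score`, the example URLs) are assumptions chosen only for illustration.

```python
from collections import defaultdict


def merge_by_local_rank(result_lists, depth=10):
    """Merge ranked URL lists using a Borda-style count over local ranks.

    Illustrative assumption: a result at 0-based local rank r in one
    engine's list earns (depth - r) points, and points are summed
    across engines.
    """
    scores = defaultdict(float)
    for results in result_lists:
        for rank, url in enumerate(results[:depth]):
            scores[url] += depth - rank
    # Highest score first; ties are broken alphabetically by URL.
    return sorted(scores, key=lambda url: (-scores[url], url))


def snippet_score(query, title, snippet):
    """Fraction of distinct query terms found in the title or snippet.

    Illustrative assumption: plain whitespace tokenization and
    lowercasing, with no stemming or term weighting.
    """
    terms = set(query.lower().split())
    if not terms:
        return 0.0
    text = set((title + " " + snippet).lower().split())
    return len(terms & text) / len(terms)


engine_a = ["a.example", "b.example", "c.example"]
engine_b = ["b.example", "d.example", "a.example"]
merged = merge_by_local_rank([engine_a, engine_b], depth=3)
```

In this sketch, `b.example` is ranked first because it appears near the top of both lists. A real merger would also need to detect duplicate results across engines and normalize URLs, which is omitted here.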

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lu, Y., Meng, W., Shu, L., Yu, C., Liu, KL. (2005). Evaluation of Result Merging Strategies for Metasearch Engines. In: Ngu, A.H.H., Kitsuregawa, M., Neuhold, E.J., Chung, JY., Sheng, Q.Z. (eds) Web Information Systems Engineering – WISE 2005. WISE 2005. Lecture Notes in Computer Science, vol 3806. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581062_5

  • DOI: https://doi.org/10.1007/11581062_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-30017-5

  • Online ISBN: 978-3-540-32286-3

  • eBook Packages: Computer Science, Computer Science (R0)
