Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Metasearch Engines

  • Weiyi Meng
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_217

Synonyms

Federated search engine

Definition

Metasearch is to utilize multiple other search systems (called component search systems) to perform simultaneous search. A metasearch engine is a search system that enables metasearch. To perform a basic metasearch, a user query is sent to multiple existing search engines by the metasearch engine; when the search results returned from the search engines are received by the metasearch engine, they are merged into a single ranked list and the merged list is presented to the user. Key issues include how to pass user queries to component search engines, how to extract correct search results from the result pages returned from component search engines, and how to merge the results returned from different component search sources. More sophisticated metasearch engines, especially those that have a large number of component search engines, also perform search engine selection (also referred to as database selection), i.e., identify the component...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Aslam J, Montague M. Models for metasearch. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2001. p. 276–84.Google Scholar
  2. 2.
    Callan J, Lu Z, Croft, WB. Searching distributed collections with inference networks. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1995.Google Scholar
  3. 3.
    Callan J, Connell M, Du A. Automatic discovery of language models for text databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1999.Google Scholar
  4. 4.
    Chang CH, Kayed M, Girgis MR, Shaalan KF. A survey of web information extraction systems. IEEE Trans Knowl Data Eng. 2006;18(10):1411–28.CrossRefGoogle Scholar
  5. 5.
    Dreilinger D, Howe A. Experiences with selecting search engines using metasearch. ACM Trans Inf Sys. 1997;15(3):195–222.CrossRefGoogle Scholar
  6. 6.
    Lee JH. Combining multiple evidence from different properties of weighting schemes. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1995. p. 180–8.Google Scholar
  7. 7.
    Liu KL, Meng W, Qiu J, Yu C, Raghavan V, Wu Z, Lu Y, He H, Zhao H. AllInOneNews: development and evaluation of a large-scale news metasearch engine. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2007. p. 1017–28.Google Scholar
  8. 8.
    Liu W, Meng X, Meng W. ViDE: a vision-based approach for deep web data extraction. IEEE Trans Knowl Data Eng. 2010;22(3):447–60.CrossRefGoogle Scholar
  9. 9.
    Lu Y, Meng W, Shu L, Yu C, Liu K. Evaluation of result merging strategies for metasearch engines. In: Proceedings of the 6th International Conference on Web Information Systems Engineering; 2005. p. 53–66.Google Scholar
  10. 10.
    Meng W, Yu C. Advanced metasearch engine technology. Morgan & Claypool Publishers; 2010.Google Scholar
  11. 11.
    Meng W, Wu Z, Yu C, Li Z. A highly scalable and effective method for metasearch. ACM Trans on Inf Sys. 2001;19(3):310–35.CrossRefGoogle Scholar
  12. 12.
    Selberg E, Etzioni O. The MetaCrawler architecture for resource aggregation on the web. IEEE Expert. 1997;12(1):11–4.CrossRefGoogle Scholar
  13. 13.
    Shokouhi M, Si L. Federated search. Found Trends Info Retrieval. 2011;5(1):1–102.CrossRefGoogle Scholar
  14. 14.
    Sleiman HA, Corchuelo R. A survey on region extractors from web documents. IEEE Trans Knowl Data Eng. 2013;25(9):1960–81.CrossRefGoogle Scholar
  15. 15.
    Yu C, Liu K, Meng W, Wu Z, Rishe N. A methodology to retrieve text documents from multiple databases. IEEE Trans Knowl Data Eng. 2002;14(6):1347–61.CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Department of Computer ScienceState University of New York at BinghamtonBinghamtonUSA

Section editors and affiliations

  • Cong Yu
    • 1
  1. 1.Google ResearchNew YorkUSA