Abstract
In this chapter we present our personal vision of what the next generation of Web search engines could be, and outline the main research challenges that derive from it. This vision is based on a single premise: people do not really want to search, they want to get tasks done. We motivate our work with current trends on the Web and, in particular, in Web search.
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Baeza-Yates, R., Raghavan, P. (2010). Chapter 2: Next Generation Web Search. In: Ceri, S., Brambilla, M. (eds) Search Computing. Lecture Notes in Computer Science, vol 5950. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12310-8_2
DOI: https://doi.org/10.1007/978-3-642-12310-8_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12309-2
Online ISBN: 978-3-642-12310-8
eBook Packages: Computer Science, Computer Science (R0)