Chapter 2: Next Generation Web Search

Baeza-Yates, Ricardo; Raghavan, Prabhakar

doi:10.1007/978-3-642-12310-8_2

Ricardo Baeza-Yates¹⁷ &
Prabhakar Raghavan¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5950))

1143 Accesses
26 Citations

Abstract

In this chapter we provide our personal vision of what could be the next generation of Web search engines, outlining the main research challenges that derive from it. This vision is based on a single premise: people do not really want to search, they want to get tasks done. We motivate our work by the current trends in the Web and, in particular, Web search.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alonso, O., Gertz, M., Baeza-Yates, R.: On the Value of Temporal Information in Information Retrieval. ACM SIGIR Forum 41(2), 35–41 (2007)
Article Google Scholar
Anderson, C.: The Long Tail: Why the Future of Business Is Selling Less of More. Hyperion (2006)
Google Scholar
Atserias, J., Zaragoza, H., Ciaramita, M., Attardi, G.: Semantically Annotated Snapshot of the English Wikipedia. In: Proceedings of the 6th International Conference on Language Resources and Evaluation, LREC (2008)
Google Scholar
Baeza-Yates, R.: Applications of Web Query Mining. In: Losada, D.E., Fernández-Luna, J.M. (eds.) ECIR 2005. LNCS, vol. 3408, pp. 7–22. Springer, Heidelberg (2005)
Chapter Google Scholar
Baeza-Yates, R., Calderón-Benavides, L., González-Caro, C.: The intention behind Web queries. In: Crestani, F., Ferragina, P., Sanderson, M. (eds.) SPIRE 2006. LNCS, vol. 4209, pp. 98–109. Springer, Heidelberg (2006)
Chapter Google Scholar
Baeza-Yates, R.: Graphs from Search Engine Queries. In: van Leeuwen, J., Italiano, G.F., van der Hoek, W., Meinel, C., Sack, H., Plášil, F. (eds.) SOFSEM 2007. LNCS, vol. 4362, pp. 1–8. Springer, Heidelberg (2007)
Chapter Google Scholar
Baeza-Yates, R., Tiberi, A.: Extracting Semantic Relations from Query Logs. In: ACM KDD 2007, San Jose, California, USA, August 2007, pp. 76–85 (2007)
Google Scholar
Baeza-Yates, R., Mika, P., Zaragoza, H.: Search, Web 2.0, and the Semantic Web. In: Benjamins, R. (ed.) Trends and Controversies: Near-Term Prospects for Semantic Technologies, January-February 2008. IEEE Intelligent Systems, vol. 23 (1), pp. 80–82 (2008)
Google Scholar
Baeza-Yates, R., Pereira, A., Ziviani, N.: Genealogical trees on the Web: A search engine user perspective. In: WWW 2008: Proceedings of the 17th international conference on World Wide Web, Beijing, China, pp. 367–376 (2008)
Google Scholar
Baeza-Yates, R., Gionis, A., Junqueira, F., Plachouras, V., Telloli, L.: On the feasibility of multi-site Web search engines. In: ACM CIKM 2009, Hong Kong, China, November 2009, pp. 425–434 (2009)
Google Scholar
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval, 2nd edn. Addison-Wesley, Reading (2010)
Google Scholar
Barroso, L.A., Hölzle, U.: The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines. Synthesis Lectures on Computer Architecture, vol. 6. Morgan Claypool, San Francisco (2009)
Google Scholar
Bohannon, P., Merugu, S., Yu, C., Agarwal, V., DeRose, P., Iyer, A., Jain, A., Kakade, V., Muralidharan, M., Ramakrishnan, R., Shen, W.: Purple SOX extraction management system. SIGMOD Record 37(4), 21–27 (2008)
Article Google Scholar
Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., Wiener, J.: Graph structure in the Web: Experiments and models. In: Proceedings of the Ninth Conference on World Wide Web, Amsterdam, Netherlands, pp. 309–320. ACM Press, New York (2000)
Google Scholar
Broder, A.: A taxonomy of Web search. SIGIR Forum 36(2) (2002)
Google Scholar
Broder, A.: The Future of Web Search: From Information Retrieval Information Supply. In: Etzion, O., Kuflik, T., Motro, A. (eds.) NGITS 2006. LNCS, vol. 4032, pp. 362–362. Springer, Heidelberg (2006)
Chapter Google Scholar
Chakrabarti, S.: Mining the Web: Discovering Knowledge from Hypertext Data. Morgan Kaufmann, San Francisco (2002)
Google Scholar
Chen, F., Doan, A., Yang, J., Ramakrishnan, R.: Efficient Information Extraction over Evolving Text Data. In: ICDE, pp. 943–952 (2008)
Google Scholar
Cooper, A.: A survey of query log privacy-enhancing techniques from a policy perspective. ACM Transactions on the Web (TWeb) 2(4) (2008)
Google Scholar
Cooper, B., Baldeschwieler, E., Fonseca, R., Kistler, J., Narayan, P., Neerdaels, C., Negrin, T., Ramakrishnan, R., Silberstein, A., Srivastava, U., Stata, R.: Building a Cloud for Yahoo! IEEE Data Eng. Bull. 32(1), 36–43 (2009)
Google Scholar
Dalvi, N., Kumar, R., Pang, B., Ramakrishnan, R., Tomkins, A., Bohannon, P., Keerthi, S., Merugu, S.: A Web of concepts. In: PODS, pp. 1–12 (2009)
Google Scholar
Doan, A., Naughton, J., Ramakrishnan, R., Baid, A., Chai, X., Chen, F., Chen, T., Chu, E., DeRose, P., Gao, B., Gokhale, C., Huang, J., Shen, W., Vuong, B.-Q.: Information extraction challenges in managing unstructured data. SIGMOD Record 37(4), 14–20 (2008)
Article Google Scholar
Goel, S., Broder, A., Gabrilovich, E., Pang, B.: Anatomy of the Long Tail: Ordinary People with Extraordinary Tastes. In: Third ACM Conference on Web Search and Data Mining (WSDM), New York (2010)
Google Scholar
Jansen, B.J., Booth, D.L., Spink, A.: Determining the user intent of Web search engine queries. In: Proc. of the 16th international conference on World Wide Web, pp. 1149–1150. ACM Press, New York (2007)
Chapter Google Scholar
Kilgarriff, A., Grefenstette, G.: Introduction to the special issue on the Web as corpus. Computational Linguistics 29(3), 333–347 (2003)
Article MathSciNet Google Scholar
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)
Book MATH Google Scholar
Mika, P.: Microsearch: An interface for semantic search. In: Proceedings of the Workshop on Semantic Search at the 5th European Semantic Web Conference, Tenerife, Spain (June 2008)
Google Scholar
Mika, P., Ciaramita, M., Zaragoza, H., Atserias, J.: Learning to Tag and Tagging to Learn: A Case Study on Wikipedia. IEEE Intelligent Systems 23(5), 27–33 (2008)
Google Scholar
Raghavan, P.: The Future of Search. In: 5th Gilbane Boston Conference: Where Content Management Meets Social Media, Boston (2008)
Google Scholar
Ramakrishnan, R., Tomkins, A.: Toward a PeopleWeb. Computer 40(8), 63–72 (2007)
Article Google Scholar
Shen, W., De Rose, P., Vu, L., Doan, A., Ramakrishnan, R.: Source-aware Entity Matching: A Compositional Approach. In: ICDE, pp. 196–205 (2007)
Google Scholar
Surdeanu, M., Ciaramita, M., Zaragoza, H.: Learning to Rank Answers on Large Online QA Collections. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-HLT (2008)
Google Scholar
Surowiecki, J.: The Wisdom of Crowds, Random House (2004)
Google Scholar
Zaragoza, H., Rode, H., Mika, P., Atserias, J., Ciaramita, M., Attardi, G.: Ranking Very Many Typed Entities on Wikipedia. In: CIKM 2007: Proceedings of the sixteenth ACM international conference on Information and Knowledge Management, Lisbon, Portugal (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Yahoo! Research, Barcelona, Spain & Sunnyvale, USA
Ricardo Baeza-Yates & Prabhakar Raghavan

Authors

Ricardo Baeza-Yates
View author publications
You can also search for this author in PubMed Google Scholar
Prabhakar Raghavan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Elettronica e Informazione, Politecnico di Milano, Piazza L. Da Vinci, 32, I20133, Milano, Italy
Stefano Ceri & Marco Brambilla &

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Baeza-Yates, R., Raghavan, P. (2010). Chapter 2: Next Generation Web Search. In: Ceri, S., Brambilla, M. (eds) Search Computing. Lecture Notes in Computer Science, vol 5950. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12310-8_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-12310-8_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12309-2
Online ISBN: 978-3-642-12310-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics