Skip to main content

Chapter 2: Next Generation Web Search

  • Chapter
Search Computing

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5950))

Abstract

In this chapter we provide our personal vision of what could be the next generation of Web search engines, outlining the main research challenges that derive from it. This vision is based on a single premise: people do not really want to search, they want to get tasks done. We motivate our work by the current trends in the Web and, in particular, Web search.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alonso, O., Gertz, M., Baeza-Yates, R.: On the Value of Temporal Information in Information Retrieval. ACM SIGIR Forum 41(2), 35–41 (2007)

    Article  Google Scholar 

  2. Anderson, C.: The Long Tail: Why the Future of Business Is Selling Less of More. Hyperion (2006)

    Google Scholar 

  3. Atserias, J., Zaragoza, H., Ciaramita, M., Attardi, G.: Semantically Annotated Snapshot of the English Wikipedia. In: Proceedings of the 6th International Conference on Language Resources and Evaluation, LREC (2008)

    Google Scholar 

  4. Baeza-Yates, R.: Applications of Web Query Mining. In: Losada, D.E., Fernández-Luna, J.M. (eds.) ECIR 2005. LNCS, vol. 3408, pp. 7–22. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  5. Baeza-Yates, R., Calderón-Benavides, L., González-Caro, C.: The intention behind Web queries. In: Crestani, F., Ferragina, P., Sanderson, M. (eds.) SPIRE 2006. LNCS, vol. 4209, pp. 98–109. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  6. Baeza-Yates, R.: Graphs from Search Engine Queries. In: van Leeuwen, J., Italiano, G.F., van der Hoek, W., Meinel, C., Sack, H., Plášil, F. (eds.) SOFSEM 2007. LNCS, vol. 4362, pp. 1–8. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  7. Baeza-Yates, R., Tiberi, A.: Extracting Semantic Relations from Query Logs. In: ACM KDD 2007, San Jose, California, USA, August 2007, pp. 76–85 (2007)

    Google Scholar 

  8. Baeza-Yates, R., Mika, P., Zaragoza, H.: Search, Web 2.0, and the Semantic Web. In: Benjamins, R. (ed.) Trends and Controversies: Near-Term Prospects for Semantic Technologies, January-February 2008. IEEE Intelligent Systems, vol. 23 (1), pp. 80–82 (2008)

    Google Scholar 

  9. Baeza-Yates, R., Pereira, A., Ziviani, N.: Genealogical trees on the Web: A search engine user perspective. In: WWW 2008: Proceedings of the 17th international conference on World Wide Web, Beijing, China, pp. 367–376 (2008)

    Google Scholar 

  10. Baeza-Yates, R., Gionis, A., Junqueira, F., Plachouras, V., Telloli, L.: On the feasibility of multi-site Web search engines. In: ACM CIKM 2009, Hong Kong, China, November 2009, pp. 425–434 (2009)

    Google Scholar 

  11. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval, 2nd edn. Addison-Wesley, Reading (2010)

    Google Scholar 

  12. Barroso, L.A., Hölzle, U.: The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines. Synthesis Lectures on Computer Architecture, vol. 6. Morgan Claypool, San Francisco (2009)

    Google Scholar 

  13. Bohannon, P., Merugu, S., Yu, C., Agarwal, V., DeRose, P., Iyer, A., Jain, A., Kakade, V., Muralidharan, M., Ramakrishnan, R., Shen, W.: Purple SOX extraction management system. SIGMOD Record 37(4), 21–27 (2008)

    Article  Google Scholar 

  14. Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., Wiener, J.: Graph structure in the Web: Experiments and models. In: Proceedings of the Ninth Conference on World Wide Web, Amsterdam, Netherlands, pp. 309–320. ACM Press, New York (2000)

    Google Scholar 

  15. Broder, A.: A taxonomy of Web search. SIGIR Forum 36(2) (2002)

    Google Scholar 

  16. Broder, A.: The Future of Web Search: From Information Retrieval Information Supply. In: Etzion, O., Kuflik, T., Motro, A. (eds.) NGITS 2006. LNCS, vol. 4032, pp. 362–362. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  17. Chakrabarti, S.: Mining the Web: Discovering Knowledge from Hypertext Data. Morgan Kaufmann, San Francisco (2002)

    Google Scholar 

  18. Chen, F., Doan, A., Yang, J., Ramakrishnan, R.: Efficient Information Extraction over Evolving Text Data. In: ICDE, pp. 943–952 (2008)

    Google Scholar 

  19. Cooper, A.: A survey of query log privacy-enhancing techniques from a policy perspective. ACM Transactions on the Web (TWeb) 2(4) (2008)

    Google Scholar 

  20. Cooper, B., Baldeschwieler, E., Fonseca, R., Kistler, J., Narayan, P., Neerdaels, C., Negrin, T., Ramakrishnan, R., Silberstein, A., Srivastava, U., Stata, R.: Building a Cloud for Yahoo! IEEE Data Eng. Bull. 32(1), 36–43 (2009)

    Google Scholar 

  21. Dalvi, N., Kumar, R., Pang, B., Ramakrishnan, R., Tomkins, A., Bohannon, P., Keerthi, S., Merugu, S.: A Web of concepts. In: PODS, pp. 1–12 (2009)

    Google Scholar 

  22. Doan, A., Naughton, J., Ramakrishnan, R., Baid, A., Chai, X., Chen, F., Chen, T., Chu, E., DeRose, P., Gao, B., Gokhale, C., Huang, J., Shen, W., Vuong, B.-Q.: Information extraction challenges in managing unstructured data. SIGMOD Record 37(4), 14–20 (2008)

    Article  Google Scholar 

  23. Goel, S., Broder, A., Gabrilovich, E., Pang, B.: Anatomy of the Long Tail: Ordinary People with Extraordinary Tastes. In: Third ACM Conference on Web Search and Data Mining (WSDM), New York (2010)

    Google Scholar 

  24. Jansen, B.J., Booth, D.L., Spink, A.: Determining the user intent of Web search engine queries. In: Proc. of the 16th international conference on World Wide Web, pp. 1149–1150. ACM Press, New York (2007)

    Chapter  Google Scholar 

  25. Kilgarriff, A., Grefenstette, G.: Introduction to the special issue on the Web as corpus. Computational Linguistics 29(3), 333–347 (2003)

    Article  MathSciNet  Google Scholar 

  26. Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)

    Book  MATH  Google Scholar 

  27. Mika, P.: Microsearch: An interface for semantic search. In: Proceedings of the Workshop on Semantic Search at the 5th European Semantic Web Conference, Tenerife, Spain (June 2008)

    Google Scholar 

  28. Mika, P., Ciaramita, M., Zaragoza, H., Atserias, J.: Learning to Tag and Tagging to Learn: A Case Study on Wikipedia. IEEE Intelligent Systems 23(5), 27–33 (2008)

    Google Scholar 

  29. Raghavan, P.: The Future of Search. In: 5th Gilbane Boston Conference: Where Content Management Meets Social Media, Boston (2008)

    Google Scholar 

  30. Ramakrishnan, R., Tomkins, A.: Toward a PeopleWeb. Computer 40(8), 63–72 (2007)

    Article  Google Scholar 

  31. Shen, W., De Rose, P., Vu, L., Doan, A., Ramakrishnan, R.: Source-aware Entity Matching: A Compositional Approach. In: ICDE, pp. 196–205 (2007)

    Google Scholar 

  32. Surdeanu, M., Ciaramita, M., Zaragoza, H.: Learning to Rank Answers on Large Online QA Collections. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-HLT (2008)

    Google Scholar 

  33. Surowiecki, J.: The Wisdom of Crowds, Random House (2004)

    Google Scholar 

  34. Zaragoza, H., Rode, H., Mika, P., Atserias, J., Ciaramita, M., Attardi, G.: Ranking Very Many Typed Entities on Wikipedia. In: CIKM 2007: Proceedings of the sixteenth ACM international conference on Information and Knowledge Management, Lisbon, Portugal (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Baeza-Yates, R., Raghavan, P. (2010). Chapter 2: Next Generation Web Search. In: Ceri, S., Brambilla, M. (eds) Search Computing. Lecture Notes in Computer Science, vol 5950. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12310-8_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12310-8_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12309-2

  • Online ISBN: 978-3-642-12310-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics