Abstract
Search applications have become very popular over the last two decades, one of the main drivers being the advent of the Web. Nevertheless, searching on the Web is very different to searching on smaller, often more structured collections such as digital libraries, local Web sites, and intranets. One way of helping the searcher locating the right information for a specific information need in such a collection is by providing well-structured domain knowledge to assist query modification and navigation. There are two main challenges which we will both address in this chapter: acquiring the domain knowledge and adapting it automatically to the specific interests of the user community. We will outline how in digital libraries a domain model can automatically be acquired using search engine query logs and how it can be continuously updated using methods resembling ant colony behaviour.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Agosti, M., Cisco, D., Di Nunzio, G.M., Masiero, I., Melucci, M.: i-TEL-u: A Query Suggestion Tool for Integrating Heterogeneous Contexts in a Digital Library. In: Lalmas, M., Jose, J., Rauber, A., Sebastiani, F., Frommholz, I. (eds.) ECDL 2010. LNCS, vol. 6273, pp. 397–400. Springer, Heidelberg (2010)
Agosti, M., Crivellari, F., Di Nunzio, G.M., Ioannidis, Y., Stamatogiannakis, L., Triantafillidi, M.-L., Vayanou, M.: Report on Search Engines and HTTP Log Analysis. Technical report TELplus D5.2, TELplus Project (2009)
Albakour, M.-D., Kruschwitz, U., Lucas, S.: Sentence-level attachment prediction. In: Cunningham, H., Hanbury, A., Rüger, S. (eds.) IRFC 2010. LNCS, vol. 6107, pp. 6–19. Springer, Heidelberg (2010)
Albakour, M.-D., Kruschwitz, U., Nanas, N., Kim, Y., Song, D., Fasli, M., De Roeck, A.: AutoEval: An evaluation methodology for evaluating query suggestions using query logs. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 605–610. Springer, Heidelberg (2011)
Anick, P.: Using Terminological Feedback for Web Search Refinement - A Log-based Study. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada, pp. 88–95 (2003)
Baeza-Yates, R., Saint-Jean, F.: A Three Level Search Engine Index Based in Query Log Distribution. In: Nascimento, M.A., de Moura, E.S., Oliveira, A.L. (eds.) SPIRE 2003. LNCS, vol. 2857, pp. 56–65. Springer, Heidelberg (2003)
Baeza-Yates, R., Tiberi, A.: Extracting semantic relations from query logs. In: Proceeding of the 13th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, San Jose, California, pp. 76–85 (2007)
Baraglia, R., Castillo, C., Donato, D., Nardini, F.M., Perego, R., Silvestri, F.: The Effects of Time on Query Flow Graph-based Models for Query Suggestion. In: Proceedings of RIAO 2010, Paris (2010)
Belkin, N.J.: Some(what) grand challenges for information retrieval. SIGIR Forum 42(1), 47–54 (2008)
Berghaus, B., Mandl, T., Womser-Hacker, C., Kluck, M.: An entry vocabulary module for a political science test collection. In: Business Information Systems. Lecture Notes in Business Information Processing, pp. 1–11 (2008)
Boldi, P., Bonchi, F., Castillo, C., Donato, D., Vigna, S.: Query suggestions using query-flow graphs. In: Proceedings of the 2009 Workshop on Web Search Click Data (WSCD 2009), pp. 56–63 (2009)
Brusilovsky, P., Cassel, L., Delcambre, L., Fox, E., Furuta, R., Garcia, D., Shipman, F., Bogen, P., Yudelson, M.: Enhancing digital libraries with social navigation: The case of ensemble. In: Lalmas, M., Jose, J., Rauber, A., Sebastiani, F., Frommholz, I. (eds.) ECDL 2010. LNCS, vol. 6273, pp. 116–123. Springer, Heidelberg (2010)
Callison-Burch, C.: Fast, cheap, and creative: Evaluating translation quality using Amazon’s Mechanical Turk. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 286–295. Association for Computational Linguistics (2009)
Chau, M., Fang, X., Sheng, O.R.L.: Analysis of the Query Logs of a Web Site Search Engine. Journal of the American Society for Information Science and Technology (JASIST) 56(13), 1363–1376 (2005)
Dignum, S., Kruschwitz, U., Fasli, M., Kim, Y., Song, D., Cervino, U., De Roeck, A.: Incorporating Seasonality into Search Suggestions Derived from Intranet Query Logs. In: Proceedings of the IEEE/WIC/ACM International Conferences on Web Intelligence (WI 2010), Toronto, pp. 425–430 (2010)
Fonseca, B.M., Golgher, P.B., de Moura, E.S., Pôssas, B., Ziviani, N.: Discovering search engine related queries using association rules. Journal of Web Engineering 2(4), 215–227 (2004)
Fonseca, B.M., Golgher, P.B., de Moura, E.S., Ziviani, N.: Using association rules to discover search engines related queries. In: Proceedings of the First Latin American Web Congress, pp. 66–71 (2003)
Gey, F.C., Buckland, M., Chen, A., Larson, R.R.: Entry vocabulary – a technology to enhance digital search. In: Proceedings of the First International Conference on Human Language Technology (2001)
Ghorab, M.R., Leveling, J., Zhou, D., Jones, G.J.F., Wade, V.: Identifying common user behaviour in multilingual search logs. In: Peters, C., Di Nunzio, G.M., Kurimo, M., Mostefa, D., Penas, A., Roda, G. (eds.) CLEF 2009. LNCS, vol. 6241, pp. 518–525. Springer, Heidelberg (2010)
Göker, A., He, D.: Analysing web search logs to determine session boundaries for user-oriented learning. In: Brusilovsky, P., Stock, O., Strapparava, C. (eds.) AH 2000. LNCS, vol. 1892, pp. 319–322. Springer, Heidelberg (2000)
Hawking, D.: Enterprise Search. In: Baeza-Yates, R., Ribeiro-Neto, B. (eds.) Modern Information Retrieval, 2nd edn., pp. 641–683. Addison-Wesley, Harlow (2011)
Jansen, B.J., Spink, A., Blakely, C., Koshman, S.: Defining a session on Web search engines. Journal of the American Society for Information Science and Technology (JASIST) 58(6), 862–871 (2007)
Jansen, B.J., Spink, A., Koshman, S.: Web Server Interaction with the Dogpile.com Metasearch Engine. Journal of the American Society for Information Science and Technology (JASIST) 58(5), 744–755 (2007)
Jansen, J., Spink, A., Taksa, I. (eds.): Handbook of Research on Web Log Analysis. IGI (2008)
Joachims, T., Granka, L., Pan, B., Hembrooke, H., Gay, G.: Accurately interpreting clickthrough data as implicit feedback. In: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil, pp. 154–161 (2005)
Joachims, T., Radlinski, F.: Search engines that learn from implicit feedback. IEEE Computer 40(8), 34–40 (2007)
Jones, R., Klinkner, K.L.: Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs. In: Proceeding of the 17th ACM Conference on Information and Knowledge Management (CIKM 2008), pp. 699–708 (2008)
Jones, R., Rey, B., Madani, O., Greiner, W.: Generating Query Substitutions. In: Proceedings of the 15th International World Wide Web Conference (WWW 2006), Edinburgh, pp. 387–396 (2006)
Kelly, D., Gyllstrom, K., Bailey, E.W.: A comparison of query and term suggestion features for interactive searching. In: Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, pp. 371–378 (2009)
Kruschwitz, U.: An Adaptable Search System for Collections of Partially Structured Documents. IEEE Intelligent Systems 18(4), 44–52 (2003)
Kruschwitz, U.: Intelligent Document Retrieval: Exploiting Markup Structure. The Information Retrieval Series, vol. 17. Springer, Heidelberg (2005)
Lungley, D., Kruschwitz, U.: Automatically maintained domain knowledge: Initial findings. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 739–743. Springer, Heidelberg (2009)
Manning, C., Prabhakar, R., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)
Markey, K.: Twenty-five years of end-user searching, Part 1: Research findings. Journal of the American Society for Information Science and Technology (JASIST) 58(8), 1071–1081 (2007)
Mat-Hassan, M., Levene, M.: Associating Search and Navigation Behavior Through Log Analysis. Journal of the American Society for Information Science and Technology (JASIST) 56(9), 913–934 (2005)
Nanas, N., Roeck, A.: Autopoiesis, the immune system, and adaptive information filtering. Natural Computing: an International Journal 8(2), 387–427 (2009)
Poblete, B., Baeza-Yates, R.: Query-Sets: Using Implicit Feedback and Query Patterns to Organize Web Documents. In: Proceedings of the 17th International World Wide Web Conference (WWW 2008), Beijing, pp. 41–50 (2008)
Sanderson, M., Croft, B.: Deriving concept hierarchies from text. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, CA, pp. 206–213 (1999)
Sherman, C.: Why Enterprise Search will never be Google-y. In: Enterprise Search Sourcebook, pp. 12–13 (2008)
Silvestri, F.: Mining Query Logs: Turning Search Usage Data into Knowledge. Foundations and Trends in Information Retrieval, vol. 4. Now Publisher (2010)
Smyth, B., Briggs, P., Coyle, M., O’Mahony, M.: Google shared. A case-study in social search. In: Houben, G.-J., McCalla, G., Pianesi, F., Zancanaro, M. (eds.) UMAP 2009. LNCS, vol. 5535, pp. 283–294. Springer, Heidelberg (2009)
Snow, R., O’Connor, B., Jurafsky, D., Ng, A.Y.: Cheap and Fast - But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks. In: Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pp. 254–263. Association for Computational Linguistics (2008)
Sowa, J.F.: Semantic networks. In: Shapiro, S.C. (ed.) Encyclopedia of Artificial Intelligence, pp. 1493–1511. John Wiley & Sons, Chichester (1992)
Spink, A., Jansen, B.J.: Web Search: Public Searching of the Web. The Information Science and Knowledge Management Series, vol. 6. Kluwer, Dordrecht (2004)
Surowiecki, J.: The Wisdom of Crowds. Anchor, New York (2005)
Teevan, J., Adar, E., Jones, R., Potts, M.A.S.: Information Re-Retrieval: Repeat Queries in Yahoo’s Logs. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, pp. 151–158 (2007)
Wang, P., Berry, M.W., Yang, Y.: Mining Longitudinal Web Queries: Trends and Patterns. Journal of the American Society for Information Science and Technology (JASIST) 54(8), 743–758 (2003)
White, M.: Making Search Work: Implementing Web, Intranet and Enterprise Search. Facet Publishing (2007)
White, R.W., Bilenko, M., Cucerzan, S.: Studying the Use of Popular Destinations to Enhance Web Search Interaction. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, pp. 159–166 (2007)
White, R.W., Ruthven, I.: A Study of Interface Support Mechanisms for Interactive Information Retrieval. Journal of the American Society for Information Science and Technology (JASIST) 57(7), 933–948 (2006)
Widdows, D., Dorow, B.: A Graph Model for Unsupervised Lexical Acquisition and Automatic Word-Sense Disambiguation. In: Proceedings of the 19th Conference on Computational Linguistics (COLING), Taipei, Taiwan, pp. 1093–1099 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kruschwitz, U. et al. (2011). Moving towards Adaptive Search in Digital Libraries. In: Bernardi, R., Chambers, S., Gottfried, B., Segond, F., Zaihrayeu, I. (eds) Advanced Language Technologies for Digital Libraries. NLP4DL AT4DL 2009 2009. Lecture Notes in Computer Science, vol 6699. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23160-5_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-23160-5_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23159-9
Online ISBN: 978-3-642-23160-5
eBook Packages: Computer ScienceComputer Science (R0)