BIS 2011: Business Information Systems Workshops pp 154-159 | Cite as
Advanced Resource Selection for Federated Enterprise Search
Conference paper
Abstract
Distributed information retrieval is a well-known approach for accessing heterogeneous, highly autonomous sources of unstructured information. Selecting and querying only a number of relevant sources can help improve its performance, but most resource selection algorithms are limited to syntactic comparisons.
We present a framework for applying resource selection in the context of a semantic federated product information system, and evaluate the performance of the well-known CORI resource selection algorithm in this context.
Keywords
Resource selection distributed information retrieval federated search enterprise search enterprise information systemsPreview
Unable to display preview. Download preview PDF.
References
- 1.Wauer, M., Schuster, D., Meinecke, J.: Aletheia: an architecture for semantic federation of product information from structured and unstructured sources. In: Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services, iiWAS 2010, pp. 325–332. ACM, New York (2010)Google Scholar
- 2.Callan, J.: Distributed information retrieval. In: Advances in Information Retrieval, pp. 127–150. Kluwer Academic Publishers (2000)Google Scholar
- 3.Callan, J.P., Lu, Z., Croft, W.B.: Searching distributed collections with inference networks. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1995, pp. 21–28. ACM, New York (1995)Google Scholar
- 4.Si, L., Lu, J., Callan, J.: Distributed information retrieval with skewed database size distributions. In: Proceedings of the 2003 Annual National Conference on Digital Government Research. dg.o 2003, pp. 1–6. Digital Government Society of North America (2003)Google Scholar
- 5.Si, L., Callan, J.: Relevant document distribution estimation method for resource selection. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2003, pp. 298–305. ACM, New York (2003)Google Scholar
- 6.Shokouhi, M.: Central-rank-based collection selection in uncooperative distributed information retrieval. In: Amati, G., Carpineto, C., Romano, G. (eds.) ECIR 2007. LNCS, vol. 4425, pp. 160–172. Springer, Heidelberg (2007)CrossRefGoogle Scholar
- 7.Thomas, P., Shokouhi, M.: SUSHI: scoring scaled samples for server selection. In: SIGIR 2009: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 419–426. ACM, New York (2009)Google Scholar
- 8.Hong, D., Si, L., Bracke, P., Witt, M., Juchcinski, T.: A joint probabilistic classification model for resource selection. In: Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010, pp. 98–105. ACM, New York (2010)Google Scholar
- 9.Arguello, J., Callan, J., Diaz, F.: Classification-based resource selection. In: Proceeding of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, pp. 1277–1286. ACM, New York (2009)Google Scholar
- 10.Ferrucci, D., Lally, A.: UIMA: an architectural approach to unstructured information processing in the corporate research environment. Nat. Lang. Eng. 10(3-4), 327–348 (2004)CrossRefGoogle Scholar
- 11.Clark, J., DeRose, S.: XML Path Language (XPath) version 1.0. Recommendation, World Wide Web Consortium (November 1999), http://www.w3.org/TR/xpath.html
- 12.Broder, A.: A taxonomy of web search. SIGIR Forum 36, 3–10 (2002)CrossRefGoogle Scholar
Copyright information
© Springer-Verlag Berlin Heidelberg 2011