Advertisement

Faceted Wikipedia Search

  • Rasmus Hahn
  • Christian Bizer
  • Christopher Sahnwaldt
  • Christian Herta
  • Scott Robinson
  • Michaela Bürgle
  • Holger Düwiger
  • Ulrich Scheel
Part of the Lecture Notes in Business Information Processing book series (LNBIP, volume 47)

Abstract

Wikipedia articles contain, besides free text, various types of structured information in the form of wiki markup. The type of wiki content that is most valuable for search are Wikipedia infoboxes, which display an article’s most relevant facts as a table of attribute-value pairs on the top right-hand side of the Wikipedia page. Infobox data is not used by Wikipedia’s own search engine. Standard Web search engines like Google or Yahoo also do not take advantage of the data. In this paper, we present Faceted Wikipedia Search, an alternative search interface for Wikipedia, which facilitates infobox data in order to enable users to ask complex questions against Wikipedia knowledge. By allowing users to query Wikipedia like a structured database, Faceted Wikipedia Search helps them to truly exploit Wikipedia’s collective intelligence.

Keywords

Faceted search faceted classification Wikipedia DBpedia knowledge representation 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Bizer, C.: The emerging web of linked data. IEEE Intelligent Systems 24, 87–92 (2009)CrossRefGoogle Scholar
  2. 2.
    Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Semantic Web Inf. Syst. 5(3), 1–22 (2009)Google Scholar
  3. 3.
    Chen, K.: Computing query previews in the flamenco system. Technical report, University of Berkeley (2004)Google Scholar
  4. 4.
    Bizer, C., et al.: Dbpedia - a crystallization point for the web of data. Journal of Web Semantics 7(3), 154–165 (2009)Google Scholar
  5. 5.
    English, J., Hearst, M., Sinha, R., Swearingen, K., Yee, K.-P.: Flexible search and navigation using faceted metadata. Technical report, University of Berkeley (2002)Google Scholar
  6. 6.
    Hearst, M., Elliott, A., English, J., Sinha, R., Swearingen, K., Yee, K.-P.: Finding the flow in web site search. Commun. ACM 45(9), 42–49 (2002)CrossRefGoogle Scholar
  7. 7.
    Hearst, M.A.: Uis for faceted navigation: Recent advances and remaining open problems. In: HCIR 2008 Second Workshop on Human-Computer Interaction and Information Retrieval. Microsoft (October 2008)Google Scholar
  8. 8.
    Kazama, J., Torisawa, K.: Exploiting wikipedia as external knowledge for named entity recognition. In: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (2007)Google Scholar
  9. 9.
    Klyne, G., Carroll, J.: Resource description framework (rdf): Concepts and abstract syntax - w3c recommendation (2004), http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/
  10. 10.
    Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, New York (2008)zbMATHGoogle Scholar
  11. 11.
    Metaweb Technologies. Freebase wikipedia extraction (wex) (2009), http://download.freebase.com/wex/
  12. 12.
    Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: A large ontology from wikipedia and wordnet. Journal of Web Semantics 6(3), 203–217 (2008)Google Scholar
  13. 13.
    Wu, F., Weld, D.: Automatically Refining the Wikipedia Infobox Ontology. In: Proceedings of the 17th World Wide Web Conference (2008)Google Scholar
  14. 14.
    Yitzhak, O.B., Golbandi, N., Har’el, N., Lempel, R., Neumann, A., Koifman, S.O., Sheinwald, D., Shekita, E., Sznajder, B., Yogev, S.: Beyond basic faceted search. In: WSDM 2008: Proceedings of the international conference on Web search and web data mining, pp. 33–44. ACM, New York (2008)CrossRefGoogle Scholar
  15. 15.
    Zaragoza, H., Rode, H., Mika, P., Atserias, J., Ciaramita, M., Attardi, G.: Ranking very many typed entities on wikipedia. In: CIKM 2007: Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, pp. 1015–1018. ACM, New York (2007)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Rasmus Hahn
    • 1
  • Christian Bizer
    • 2
  • Christopher Sahnwaldt
    • 1
  • Christian Herta
    • 1
  • Scott Robinson
    • 1
  • Michaela Bürgle
    • 1
  • Holger Düwiger
    • 1
  • Ulrich Scheel
    • 1
  1. 1.neofonie GmbHBerlinGermany
  2. 2.Freie Universität BerlinBerlinGermany

Personalised recommendations