Advertisement

A Technique for Information Retrieval from Microformatted Websites

  • J. Guadalupe Ramos
  • Josep Silva
  • Gustavo Arroyo
  • Juan C. Solorio
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5947)

Abstract

In this work, we introduce a new method for information extraction from the semantic web. The fundamental idea is to model the semantic information contained in the microformats of a set of web pages, by using a data structure called semantic network. Then, we introduce a novel technique for information extraction from semantic networks. In particular, the technique allows us to extract a portion—a slice—of the semantic network with respect to some criterion of interest. The slice obtained represents relevant information retrieved from the semantic network and thus from the semantic web. Our approach can be used to design novel tools for information retrieval and presentation, and for information filtering that was distributed along the semantic web.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Microformats.org. The Official Microformats Site (2009), http://microformats.org/
  2. 2.
    Khare, R., Çelik, T.: Microformats: a Pragmatic Path to the Semantic Web. In: WWW 2006: Proceedings of the 15th International Conference on World Wide Web, pp. 865–866. ACM, New York (2006)CrossRefGoogle Scholar
  3. 3.
    hCard. Simple, Open, Distributed Format for Representing People, Companies, Organizations, and Places (2009), http://microformats.org/wiki/hcard
  4. 4.
    Sowa, J.F. (ed.): Principles of Semantic Networks: Explorations in the Representation of Knowledge. Morgan Kaufmann, San Francisco (1991)zbMATHGoogle Scholar
  5. 5.
    Sowa, J.F.: Semantic Networks. In: Shapiro, S.C. (ed.) Encyclopedia of Artificial Intelligence. John Wiley & Sons, Chichester (1992)Google Scholar
  6. 6.
    Wang, W., Rada, R.: Structured Hypertext with Domain Semantics. ACM Transactions on Information Systems (TOIS) 16(4), 372–412 (1998)CrossRefGoogle Scholar
  7. 7.
    Mollá, D.: Learning of Graph-based Question Answering Rules. In: Proc. HLT/NAACL 2006 Workshop on Graph Algorithms for Natural Language Processing, pp. 37–44 (2006)Google Scholar
  8. 8.
    Silva, J.: A Program Slicing Based Method to Filter XML/DTD Documents. In: van Leeuwen, J., Italiano, G.F., van der Hoek, W., Meinel, C., Sack, H., Plášil, F. (eds.) SOFSEM 2007. LNCS, vol. 4362, pp. 771–782. Springer, Heidelberg (2007)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • J. Guadalupe Ramos
    • 1
  • Josep Silva
    • 2
  • Gustavo Arroyo
    • 2
  • Juan C. Solorio
    • 1
  1. 1.Instituto Tecnológico de La PiedadMéxico
  2. 2.DSICUniversidad Politécnica de ValenciaValenciaSpain

Personalised recommendations