Abstract
The Semantic Web seems to be evolving into a property-linked web of RDF data, conceptually divorced from (but physically housed in) the hyperlinked web of HTML documents. We discuss the Unified Web model that integrates the two webs and formalizes the structure and the semantics of interconnections between them. We also discuss the Hybrid Query Language which combines the Data and Information Retrieval techniques to provide a convenient and uniform way to retrieve data and documents from the Unified Web. We present the retrieval system SITAR and some preliminary results.
Keywords
- Semantic Web
- Information Retrieval
- Data Retrieval
- Hybrid Retrieval
- Unified Web
- Hybrid Query Language
Chapter PDF
References
Semantic Web Activity page, [Webpage], http://www.w3.org/2001/sw/
Prud’hommeaux, E., Seaborne, A. (eds.): SPARQL Query Language for RDF. W3C Working Draft (October 2006), http://www.w3.org/TR/rdf-sparql-query/
Adida, B., Birbeck, M. (eds.): RDFa. W3C Working Draft (2006), http://www.w3.org/TR/xhtml-rdfa-primer/
Immaneni, T., Thirunarayan, K.: Hybrid Retrieval from the Unified Web. In: Proceedings of the 22nd ACM Symposium on Applied Computing, Semantic Web and Applications Track (ACM SAC 2007), Seoul, Korea, (March 2007)
Thirunarayan, K.: On Embedding Machine-Processable Semantics into Documents. IEEE Transactions on Knowledge and Data Engineering 17(7), 1014–1018 (2005)
Kleinberg, J.: Authoritative sources in a hyperlinked environment. In: Proceedings of the 9th ACM-SIAM Symposium on Discrete Algorithms (1998)
Guha, R., McCool, R., Miller, E.: Semantic search. In: Proceedings of the Twelfth International Conference on World Wide Web, Budapest, Hungary, May 2003, ACM Press, New York (2003)
Apache Lucene, [Webpage], http://lucene.apache.org/
Hartmann, J., Sure, Y.: An Infrastructure for Scalable, Reliable Semantic Portals. IEEE Intelligent Systems 19(3), 58–65 (2004)
CyberNeko HTML Parser, [Webpage], http://people.apache.org/~andyc/neko/doc/html/
Jena ARP, [Webpage], http://www.hpl.hp.com/personal/jjc/arp/
Beckett, D.: SWAD-E Deliverable 10.2: Mapping Semantic Web Data with RDBMSes. Online Document (2003), http://www.w3.org/2001/sw/Europe/reports/scalable_rdbms_mapping_report/
Beckett, D.: SWAD-Europe Deliverable 10.1: Scalability and Storage: Survey of Free Software / Open Source RDF storage systems. Online Document (2002), http://www.w3.org/2001/sw/Europe/reports/rdf_scalable_storage_report/
Bailey, J., Bry, F., Furche, T., Schaffert, S.: Web and Semantic Web Query Languages: A Survey. In: Eisinger, N., Małuszyński, J. (eds.) Reasoning Web. LNCS, vol. 3564, pp. 35–133. Springer, Heidelberg (2005)
Haase, P., Broekstra, J., Eberhart, A., Volz, R.: A comparison of RDF query languages. In: McIlraith, S.A., Plexousakis, D., van Harmelen, F. (eds.) ISWC 2004. LNCS, vol. 3298, pp. 502–517. Springer, Heidelberg (2004)
Davies, J., Weeks, R., Krohn, U.: QuizRDF: Search technology for the semantic web. In: Workshop on Real World RDF and Semantic Web Applications, 11th International World Wide Web Conference, Hawaii, USA (2002)
Mayfield, J., Finin, T.: Information retrieval on the semantic web: Integrating inference and retrieval. In: Proceedings of the SIGIR 2003 Semantic Web Workshop (2003)
Ding, L., Pan, R., Finin, T.W., Joshi, A., Peng, Y., Kolari, P.: Finding and ranking knowledge on the semantic web. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 156–170. Springer, Heidelberg (2005)
Rocha, C., Schwabe, D., Aragao, M.P.: A Hybrid Approach for Searching in the Semantic Web. In: Proceedings of the 13th International World Wide Web Conference, New York, May 2004, pp. 374–383 (2004)
Zhang, L., Yu, Y., Zhou, J., Lin, C., Yang, Y.: An enhanced model for searching in semantic portals. In: Proceedings of the 14th International World Wide Web Conference, Chiba, Japan, May 2005, ACM Press, New York (2005)
Vallet, D., Fernández, M., Castells, P.: An Ontology-Based Information Retrieval Model. In: Gómez-Pérez, A., Euzenat, J. (eds.) ESWC 2005. LNCS, vol. 3532, pp. 455–470. Springer, Heidelberg (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Immaneni, T., Thirunarayan, K. (2007). A Unified Approach to Retrieving Web Documents and Semantic Web Data. In: Franconi, E., Kifer, M., May, W. (eds) The Semantic Web: Research and Applications. ESWC 2007. Lecture Notes in Computer Science, vol 4519. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72667-8_41
Download citation
DOI: https://doi.org/10.1007/978-3-540-72667-8_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72666-1
Online ISBN: 978-3-540-72667-8
eBook Packages: Computer ScienceComputer Science (R0)