Abstract
This article addresses how to improve the automated accessibility and visibility of information from Web Information Portals and in particular virtual library systems. Information from web information portals could provide great value to satisfy information needs. But most of this information stays hidden in data silos which are part of that section of the web that is not indexable by common search engines and is therefore called Deep Web. Shared vocabularies like Schema.org helped to increase machine readability of structured information on the web in general, but markup vocabularies didn’t increase the accessibility and visibility of information from data silos. This article addresses the limitations regarding the accessibility of information from data silos on the Deep Web and proposes an extension to Schema.org to fill the identified gaps. The extension improves the automated accessibility and visibility of information provided in web information portals by providing Dynamic Deep Linking capabilities to Deep Web data silos by lifting web forms of web information portals to the level of machine understandable semantic Web Query Interfaces.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Schema.org Actions: http://blog.schema.org/2014/04/announcing-schemaorg-actions.html
- 2.
Sitelinks Search: https://developers.google.com/structured-data/slsb-overview
- 3.
Semantic Deep Search Extension website: http://semdeepsearch.vocab-ext.appspot.com/
- 4.
EconBiz, subject portal for economics and business studies: https://www.econbiz.de/
References
Bergman, M.K.: White paper: the deep web: surfacing hidden value. J. Electron. Publish. 7(1), 1–17 (2001)
Blandford, A.: Google, public libraries, and the deep web. Dalhousie J. Interdiscip. Manage. 11 (2015)
Bizer, C., Heath, T., Berners-Lee, T.: Linked data-the story so far. In: Semantic Services, Interoperability and Web Applications: Emerging Concepts, pp. 205–227 (2009)
Ferrara, E., De Meo, P., Fiumara, G., Baumgartner, R.: Web data extraction, applications and techniques: a survey. Knowl.-Based Syst. 70, 301–323 (2014)
Furche, T., Gottlob, G., Grasso, G., Guo, X., Orsi, G., Schallhart, C.: The ontological key: automatically understanding and integrating forms to access the deep Web. VLDB J. 22(5), 615–640 (2013)
Henzinger, M.R.: Hyperlink analysis for the web. IEEE Internet Comp. 5(1), 45–50 (2001)
Klemenz, A.M., Tochtermann, K.: Semantification of Query Interfaces to Improve Access to Deep Web Content. SDA, pp. 104–111 (2013)
Lanthaler, M., Gütl C.: Hydra: a vocabulary for hypermedia-driven web apis. In: LDOW, vol. 996 (2013)
Purcell, K., Brenner, J., Rainie L.: Search engine use 2012 (2012)
Van de Sompel, H., Beit-Arie, O.: Open linking in the scholarly information environment using the OpenURL framework. New Rev. Inf. Netw. 7(1), 59–76 (2001)
Steiner, T., Troncy, R., Hausenblas, M.: How Google is using linked data today and vision for tomorrow. In: Proceedings of Linked Data in the Future Internet, vol. 700 (2010)
Wang, L., Hawbani, A., Wang, X.: Focused deep web entrance crawling by form feature classification. In: Wang, Yu., Xiong, H., Argamon, S., Li, X., Li, J. (eds.) BigCom 2015. LNCS, vol. 9196, pp. 79–87. Springer, Cham (2015). doi:10.1007/978-3-319-22047-5_7
Zhang, Z., He, B., Chen-Chuan Chang, K.: Understanding web query interfaces: best-effort parsing with hidden syntax. In: Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data, pp. 107–118. ACM (2004)
Zhao, F., Zhou, J., Nie, C., Huang, H., Jin, H.: SmartCrawler: a two-stage crawler for efficiently harvesting deep-web interfaces. IEEE Trans. Serv. Comput. 9(4), 608–620 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Klemenz, A.M., Tochtermann, K. (2017). Semantic Enrichment of Web Query Interfaces to Enable Dynamic Deep Linking to Web Information Portals. In: Kamps, J., Tsakonas, G., Manolopoulos, Y., Iliadis, L., Karydis, I. (eds) Research and Advanced Technology for Digital Libraries. TPDL 2017. Lecture Notes in Computer Science(), vol 10450. Springer, Cham. https://doi.org/10.1007/978-3-319-67008-9_45
Download citation
DOI: https://doi.org/10.1007/978-3-319-67008-9_45
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67007-2
Online ISBN: 978-3-319-67008-9
eBook Packages: Computer ScienceComputer Science (R0)