Semantic Enrichment of Web Query Interfaces to Enable Dynamic Deep Linking to Web Information Portals

  • Conference paper
Research and Advanced Technology for Digital Libraries (TPDL 2017)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 10450)

Abstract

This article addresses how to improve the automated accessibility and visibility of information from web information portals, in particular virtual library systems. Such information could be of great value in satisfying information needs, but most of it stays hidden in data silos. These silos belong to the part of the web that common search engines cannot index, known as the Deep Web. Shared vocabularies like Schema.org have helped to increase the machine readability of structured information on the web in general, but markup vocabularies have not increased the accessibility and visibility of information held in data silos. This article examines the limitations on accessing information from data silos on the Deep Web and proposes an extension to Schema.org to fill the identified gaps. The extension lifts the web forms of web information portals to the level of machine-understandable semantic Web Query Interfaces. This provides Dynamic Deep Linking capabilities to Deep Web data silos and thereby improves the automated accessibility and visibility of the information the portals provide.
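
To make the idea of a machine-readable query interface concrete, the following is a minimal sketch of how a client might turn annotated search markup into a deep link. It uses the existing Schema.org SearchAction convention (a `target` URL template plus a `query-input` declaration), not the extension proposed in the paper, whose terms are not given in this abstract. The portal URL, the placeholder name `search_term_string`, and the helper `build_deep_link` are assumptions chosen for illustration.

```python
from urllib.parse import quote

# Hypothetical markup a portal might publish (as JSON-LD) to describe its
# search form. The URL and placeholder name are illustrative assumptions.
search_markup = {
    "@context": "https://schema.org",
    "@type": "WebSite",
    "url": "https://portal.example.org/",
    "potentialAction": {
        "@type": "SearchAction",
        "target": "https://portal.example.org/search?q={search_term_string}",
        "query-input": "required name=search_term_string",
    },
}


def build_deep_link(markup: dict, **values: str) -> str:
    """Expand the SearchAction URL template into a concrete deep link."""
    action = markup["potentialAction"]
    # "query-input" has the form "required name=<placeholder>"; keep only
    # the key=value parts and look up the placeholder name.
    spec = dict(
        part.split("=", 1)
        for part in action["query-input"].split()
        if "=" in part
    )
    name = spec["name"]
    if name not in values:
        raise KeyError(f"missing value for required input {name!r}")
    # Percent-encode the user's query before substituting it into the template.
    encoded = quote(values[name], safe="")
    return action["target"].replace("{" + name + "}", encoded)


if __name__ == "__main__":
    print(build_deep_link(search_markup, search_term_string="deep web"))
```

A client that understands such markup can construct result-page URLs for arbitrary queries without scraping or guessing the form's parameters, which is the kind of dynamic deep linking the abstract refers to.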

Notes

  1. Schema.org Actions: http://blog.schema.org/2014/04/announcing-schemaorg-actions.html

  2. Sitelinks Search: https://developers.google.com/structured-data/slsb-overview

  3. Semantic Deep Search Extension website: http://semdeepsearch.vocab-ext.appspot.com/

  4. EconBiz, subject portal for economics and business studies: https://www.econbiz.de/

Author information

Correspondence to Arne Martin Klemenz.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Klemenz, A.M., Tochtermann, K. (2017). Semantic Enrichment of Web Query Interfaces to Enable Dynamic Deep Linking to Web Information Portals. In: Kamps, J., Tsakonas, G., Manolopoulos, Y., Iliadis, L., Karydis, I. (eds) Research and Advanced Technology for Digital Libraries. TPDL 2017. Lecture Notes in Computer Science, vol 10450. Springer, Cham. https://doi.org/10.1007/978-3-319-67008-9_45

  • DOI: https://doi.org/10.1007/978-3-319-67008-9_45

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67007-2

  • Online ISBN: 978-3-319-67008-9

  • eBook Packages: Computer Science, Computer Science (R0)
