Skip to main content

The Design of Deep Web Search Engine Based on Domain Knowledge

  • Conference paper
Future Control and Automation

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 173))

  • 2232 Accesses

Abstract

A new method of search data on Deep Web is proposed in this paper based on the concept of domain knowledge and the feature of Deep Web data. First they obtain the search interfaces on the Deep Web; and then do feature analysis and domain judge on them; at last, they classify and integrate various interfaces in line with diverse domains. This method has showed up higher degree of correlation of domains from the seek result, via testing several different domains. Compared with other methods, its recall and precision are also satisfactory.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bergman, M.K.: The deep Web: Surfacing hidden value. White Paper on the Deep Web (2001), http://www.brightplanet.com/pdf/deepwebwhitepaper.pdf

  2. Chang, K.C.-C., He, B., Li, C., et al.: Structured database on the web: Observations and Implications. SIGMOD Record 33(3), 61–70 (2004)

    Article  Google Scholar 

  3. Raghavan, S., Careia–Molina, H.: Crawling the hidden web. In: Proc. of the lnternational Conference on Vary Large Data Bases (VLDB), Rome, Italy (2001-2009)

    Google Scholar 

  4. Ipeirotis, P.G., Gravano, L., Sahami, M.: Probe, Count, and Classify: Categorizing Hidden Web Databases. In: ACM SIGMODZ 2001, Santa Bathara, Califomia, USA, May21-24 (2001)

    Google Scholar 

  5. Walny, J., Barbosa, D.: SemaForm: Semantic Wrapper Generation for Querying Deep Web Data Sources. In: 2009 International Conference on Web Information Systems and Mining (2009)

    Google Scholar 

  6. Wang, H., Liu, X., Zuo, W.: Using classifiers to find domain specific online databases automatically. Journal of Software 19(2), 246–256 (2008)

    Article  Google Scholar 

  7. Barbosa, L., Freire, J.: Combining classifiers to identify online databases. In: Proceedings of the 16th International Conference on World Wide Web, pp. 431–440. ACM Press, New York (2007)

    Chapter  Google Scholar 

  8. Lu, Y.Y., He, H., Zhao, H.K., Meng, W.Y., Yu, C.: Annotating structured data of the deep Web. In: Proc. of the IEEE 23rd Int’l Conf. on Data Engineering, pp. 376–385. IEEE Computer Society Press, Istanbul (2007)

    Chapter  Google Scholar 

  9. He, H., Meng, W.Y., Lu, Y.Y., Yu, C., Wu, Z.: Towards deeper understanding of the search interfaces of the deep Web. World Wide Web 10(2), 133–155 (2007)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Deng Rong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Rong, D., Hao, W., Xin, Z. (2012). The Design of Deep Web Search Engine Based on Domain Knowledge. In: Deng, W. (eds) Future Control and Automation. Lecture Notes in Electrical Engineering, vol 173. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31003-4_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-31003-4_40

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31002-7

  • Online ISBN: 978-3-642-31003-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics