Skip to main content

Using Document Features to Optimize Web Cache

  • Conference paper
  • First Online:
Artificial Neural Networks — ICANN 2001 (ICANN 2001)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2130))

Included in the following conference series:

Abstract

In this paper Web cache optimization using document features is proposed. The problem in Web cache optimization is to decide which strategy to use in replacement of cache objects. While commonly used policies use heuristic rules, proposed model predicts the value of each Web object by using features collected from the HTTP responses and from the HTML structure of the document. In a case study, generalized linear model and multilayer perceptron committee model are used to classify about 50000 Web documents according to their popularity. Results show that linear model does not find any correlation between the features and document popularity. MLP model gives better results, yielding mean classification percentages of 64 and 74 for the documents to be left or to be removed from the Web cache, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 189.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Douglis, F., Feldmann, A., Krishnamurthy, B., Mogul, J.: Rate of Change and Other Metrics: a Live Study of the World Wide Web. USENIX Symposium on Internet Technologies and Systems (1997)

    Google Scholar 

  2. Williams, S., Abrams, M., Standridge, R.G., Abdulla, G., and Fox, E.A.: Removal Policies in Network Caches for World-Wide Web Documents. ACM SIGCOMM’96 Conference (1996)

    Google Scholar 

  3. Davison, B.D.: A Survey of Proxy Cache Evaluation Techniques. 4th International Web Caching Workshop (1999)

    Google Scholar 

  4. Cao, P., Irani, S.: Cost-Aware WWW Proxy Caching Algorithms. USENIX Symposium on Internet Technologies and Systems (1997)

    Google Scholar 

  5. Rizzo, L., Vicisano, L.. Replacement Policies for a Proxy Cache. IEEE/ACM Transactions on Networking, 8(2) (2000) 158–170

    Article  Google Scholar 

  6. Foong, A.P., Hu, Y-H., Heisey, D.M.: Logistic Regression in an Adaptive Web Cache. IEEE Internet Computing 3(5) (1999) 27–36

    Article  Google Scholar 

  7. Koskela, T., Heikkonen, J., Kaski, K.: Modeling the Cacheability of HTML Documents. 9th International World Wide Web Conference (2000)

    Google Scholar 

  8. Bishop, C.M.: Neural Networks for Pattern Recognition. Oxford University Press (1995)

    Google Scholar 

  9. Riedmiller, M., Braun, H.: A Direct Adaptive Method for Faster Backpropagation Learning: The RPROP Algorithm. IEEE International Conference on Neural Networks (1993) 586–591

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Koskela, T., Heikkonen, J., Kaski, K. (2001). Using Document Features to Optimize Web Cache. In: Dorffner, G., Bischof, H., Hornik, K. (eds) Artificial Neural Networks — ICANN 2001. ICANN 2001. Lecture Notes in Computer Science, vol 2130. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44668-0_169

Download citation

  • DOI: https://doi.org/10.1007/3-540-44668-0_169

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42486-4

  • Online ISBN: 978-3-540-44668-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics