Skip to main content

A Solution to Resource Underutilization for Web Services Hosted in the Cloud

  • Conference paper
Book cover On the Move to Meaningful Internet Systems: OTM 2009 (OTM 2009)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5870))

Abstract

At the moment the service market is experiencing a continuous growth, as services allow easy and quick enhancing of new and existing applications. However, hosting services according to a common on-premise model is not sufficient for dealing with erratic, spike-prone service loads. A new more promising approach is hosting services in the cloud (utility computing), which enables dynamic resource allocation. The last provides an opportunity to meet average response time requirements even in case of long-term fluctuating loads. Unfortunately, in the presence of short term fluctuations the resources utilization has to stay under 50% in order to achieve response time of the same order as job sizes.

In this work we suggest to compensate the problem of underutilization caused by hosting low-latency services by means of allocating the remaining resources to time insensitive service requests. This solution uses load balancing combined with admission control and scheduling application server threads. The proposed approach is evaluated by means of experiments with the prototype conducted with Amazon’s EC2. The experimental results show that the servers utilization can be increased without penalizing low-latency requests.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Amazon Elastic Compute Cloud (Amazon EC2), http://www.amazon.com/gp/browse.html?node=201590011

  2. Apache Axis, http://ws.apache.org/axis/

  3. Apache Tomcat, http://tomcat.apache.org/

  4. Google App Engine, http://code.google.com/appengine

  5. HAProxy. The Reliable, High Performance TCP/HTTP Load Balancer, http://haproxy.1wt.eu/

  6. Java SE Runtime Environment 6.0, http://www.java.com/

  7. Sun Java Real-Time System, http://java.sun.com/javase/technologies/realtime/rts/

  8. Tsung, http://tsung.erlang-projects.org/

  9. Abdelzaher, T., Shin, K., Bhatti, N.: User-level qos-adaptive resource management in server end-systems (2003)

    Google Scholar 

  10. Abdelzaher, T.F., Shin, K.G., Bhatti, N.: Performance guarantees for web server end-systems: A control-theoretical approach. IEEE Trans. Parallel Distrib. Syst. 13(1), 80–96 (2002)

    Article  Google Scholar 

  11. Bolch, G., Greiner, S., de Meer, H., Trivedi, K.S.: Queueing Networks and Markov Chains: Modeling and Performance Evaluation with Computer Science Applications, 2nd edn., May 2006. Wiley/Blackwell (2006)

    Google Scholar 

  12. Booth, D., Haas, H., McCabe, F., Newcomer, E., Champion, M., Ferris, C., Orchard, D.: Web services architecture. Technical report, W3C (2004), http://www.w3.org/TR/ws-arch/

  13. Christensen, E., Curbera, F., Meredith, G., Weerawarana, S.: Web services description language (WSDL) 1.1 (March 2001)

    Google Scholar 

  14. Cooper, G., DiPippo, L., Esibov, L., Ginis, R., Johnston, R., Kortman, P., Krupp, P., Mauer, J., Squadrito, M., Thurasignham, B., Wohlever, S., Wolfe, V.: Real-time corba development at mitre, nrad, tripacific and uri. In: Proceedings of the Workshop on Middleware for Real-Time Systems and Services, San Francisco, CA (1997)

    Google Scholar 

  15. Denning, P.J.: Thrashing: Its causes and prevention. In: Proceedings AFIPS, Fall Joint Computer Conference (1968)

    Google Scholar 

  16. Heiss, H.-U., Wagner, R.: Adaptive load control in transaction processing systems. In: VLDB 1991: Proceedings of the 17th International Conference on Very Large Data Bases, pp. 47–54. Morgan Kaufmann Publishers Inc., San Francisco (1991)

    Google Scholar 

  17. Leung, J.Y.-T. (ed.): Handbook of Scheduling: Alogorithms, Models, and Performance Analysis. Chapman & Hall/CRC, Boca Raton (2004)

    Google Scholar 

  18. Levy, R., Nagarajarao, J., Pacifici, G., Spreitzer, A., Tantawi, A., Youssef, A.: Performance management for cluster based web services. IEEE Journal on Selected Areas in Communications, 247–261 (2005)

    Google Scholar 

  19. Lin, B., Dinda, P.A.: Vsched: Mixing batch and interactive virtual machines using periodic real-time scheduling. In: SC 2005: Proceedings of the 2005 ACM/IEEE conference on Supercomputing. IEEE Computer Society, Washington (2005)

    Google Scholar 

  20. Liu, Z., Niclausse, N., Jalpa-Villanueva, C.: Traffic model and performance evaluation of web servers. Perform. Eval. 46(2-3), 77–100 (2001)

    Article  MATH  Google Scholar 

  21. Ludwig, H.: Web services qos: External slas and internal policies - or: How do we deliver what we promise? In: Proc. 4th IEEE Int’l Conf Web Information System Eng. Workshops, pp. 115–120. IEEE CS Press, Los Alamitos (2003)

    Chapter  Google Scholar 

  22. Mitra, N.: Soap version 1.2 part 0.

    Google Scholar 

  23. Sha, L., Rajkumar, R., Lehoczky, J.P.: Priority inheritance protocols: An approach to real-time synchronization. IEEE Transactions on Computers 39(9), 1175–1185 (1990)

    Article  MathSciNet  Google Scholar 

  24. Siddhartha, P., Ganesan, R., Sengupta, S.: Smartware - a management infrastructure for web services. In: Proc. of the 1st Workshop on Web Services: Modeling, Architecture and Infrastructure (WSMAI 2003), Angers, France, April 2003, pp. 42–49. ICEIS Press (2003), In conjunction with ICEIS 2003

    Google Scholar 

  25. Song, L., Marin, G.A.: Generating realistic network traffic for security experiments. In: SoutheastCon, 2004. Proceedings, pp. 200–207. IEEE, Los Alamitos (2004)

    Chapter  Google Scholar 

  26. Uhlig, S., Bonaventure, O.: Understanding the long-term self-similarity of internet traffic. In: Smirnov, M., Crowcroft, J., Roberts, J., Boavida, F. (eds.) QofIS 2001. LNCS, vol. 2156, p. 286. Springer, Heidelberg (2001)

    Google Scholar 

  27. Urgaonkar, B., Pacifici, G., Shenoy, P., Spreitzer, M., Tantawi, A.: An analytical model for multi-tier internet services and its applications. In: SIGMETRICS 2005: Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, vol. 33, pp. 291–302. ACM Press, New York (2005)

    Chapter  Google Scholar 

  28. Waldspurger, C.A., Weihl, W.E.: Lottery scheduling: Flexible proportional-share resource management. In: Operating Systems Design and Implementation, pp. 1–11 (1994)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dyachuk, D., Deters, R. (2009). A Solution to Resource Underutilization for Web Services Hosted in the Cloud. In: Meersman, R., Dillon, T., Herrero, P. (eds) On the Move to Meaningful Internet Systems: OTM 2009. OTM 2009. Lecture Notes in Computer Science, vol 5870. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05148-7_42

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-05148-7_42

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-05147-0

  • Online ISBN: 978-3-642-05148-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics