Abstract
At the moment the service market is experiencing a continuous growth, as services allow easy and quick enhancing of new and existing applications. However, hosting services according to a common on-premise model is not sufficient for dealing with erratic, spike-prone service loads. A new more promising approach is hosting services in the cloud (utility computing), which enables dynamic resource allocation. The last provides an opportunity to meet average response time requirements even in case of long-term fluctuating loads. Unfortunately, in the presence of short term fluctuations the resources utilization has to stay under 50% in order to achieve response time of the same order as job sizes.
In this work we suggest to compensate the problem of underutilization caused by hosting low-latency services by means of allocating the remaining resources to time insensitive service requests. This solution uses load balancing combined with admission control and scheduling application server threads. The proposed approach is evaluated by means of experiments with the prototype conducted with Amazon’s EC2. The experimental results show that the servers utilization can be increased without penalizing low-latency requests.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Amazon Elastic Compute Cloud (Amazon EC2), http://www.amazon.com/gp/browse.html?node=201590011
Apache Axis, http://ws.apache.org/axis/
Apache Tomcat, http://tomcat.apache.org/
Google App Engine, http://code.google.com/appengine
HAProxy. The Reliable, High Performance TCP/HTTP Load Balancer, http://haproxy.1wt.eu/
Java SE Runtime Environment 6.0, http://www.java.com/
Sun Java Real-Time System, http://java.sun.com/javase/technologies/realtime/rts/
Abdelzaher, T., Shin, K., Bhatti, N.: User-level qos-adaptive resource management in server end-systems (2003)
Abdelzaher, T.F., Shin, K.G., Bhatti, N.: Performance guarantees for web server end-systems: A control-theoretical approach. IEEE Trans. Parallel Distrib. Syst. 13(1), 80–96 (2002)
Bolch, G., Greiner, S., de Meer, H., Trivedi, K.S.: Queueing Networks and Markov Chains: Modeling and Performance Evaluation with Computer Science Applications, 2nd edn., May 2006. Wiley/Blackwell (2006)
Booth, D., Haas, H., McCabe, F., Newcomer, E., Champion, M., Ferris, C., Orchard, D.: Web services architecture. Technical report, W3C (2004), http://www.w3.org/TR/ws-arch/
Christensen, E., Curbera, F., Meredith, G., Weerawarana, S.: Web services description language (WSDL) 1.1 (March 2001)
Cooper, G., DiPippo, L., Esibov, L., Ginis, R., Johnston, R., Kortman, P., Krupp, P., Mauer, J., Squadrito, M., Thurasignham, B., Wohlever, S., Wolfe, V.: Real-time corba development at mitre, nrad, tripacific and uri. In: Proceedings of the Workshop on Middleware for Real-Time Systems and Services, San Francisco, CA (1997)
Denning, P.J.: Thrashing: Its causes and prevention. In: Proceedings AFIPS, Fall Joint Computer Conference (1968)
Heiss, H.-U., Wagner, R.: Adaptive load control in transaction processing systems. In: VLDB 1991: Proceedings of the 17th International Conference on Very Large Data Bases, pp. 47–54. Morgan Kaufmann Publishers Inc., San Francisco (1991)
Leung, J.Y.-T. (ed.): Handbook of Scheduling: Alogorithms, Models, and Performance Analysis. Chapman & Hall/CRC, Boca Raton (2004)
Levy, R., Nagarajarao, J., Pacifici, G., Spreitzer, A., Tantawi, A., Youssef, A.: Performance management for cluster based web services. IEEE Journal on Selected Areas in Communications, 247–261 (2005)
Lin, B., Dinda, P.A.: Vsched: Mixing batch and interactive virtual machines using periodic real-time scheduling. In: SC 2005: Proceedings of the 2005 ACM/IEEE conference on Supercomputing. IEEE Computer Society, Washington (2005)
Liu, Z., Niclausse, N., Jalpa-Villanueva, C.: Traffic model and performance evaluation of web servers. Perform. Eval. 46(2-3), 77–100 (2001)
Ludwig, H.: Web services qos: External slas and internal policies - or: How do we deliver what we promise? In: Proc. 4th IEEE Int’l Conf Web Information System Eng. Workshops, pp. 115–120. IEEE CS Press, Los Alamitos (2003)
Mitra, N.: Soap version 1.2 part 0.
Sha, L., Rajkumar, R., Lehoczky, J.P.: Priority inheritance protocols: An approach to real-time synchronization. IEEE Transactions on Computers 39(9), 1175–1185 (1990)
Siddhartha, P., Ganesan, R., Sengupta, S.: Smartware - a management infrastructure for web services. In: Proc. of the 1st Workshop on Web Services: Modeling, Architecture and Infrastructure (WSMAI 2003), Angers, France, April 2003, pp. 42–49. ICEIS Press (2003), In conjunction with ICEIS 2003
Song, L., Marin, G.A.: Generating realistic network traffic for security experiments. In: SoutheastCon, 2004. Proceedings, pp. 200–207. IEEE, Los Alamitos (2004)
Uhlig, S., Bonaventure, O.: Understanding the long-term self-similarity of internet traffic. In: Smirnov, M., Crowcroft, J., Roberts, J., Boavida, F. (eds.) QofIS 2001. LNCS, vol. 2156, p. 286. Springer, Heidelberg (2001)
Urgaonkar, B., Pacifici, G., Shenoy, P., Spreitzer, M., Tantawi, A.: An analytical model for multi-tier internet services and its applications. In: SIGMETRICS 2005: Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, vol. 33, pp. 291–302. ACM Press, New York (2005)
Waldspurger, C.A., Weihl, W.E.: Lottery scheduling: Flexible proportional-share resource management. In: Operating Systems Design and Implementation, pp. 1–11 (1994)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dyachuk, D., Deters, R. (2009). A Solution to Resource Underutilization for Web Services Hosted in the Cloud. In: Meersman, R., Dillon, T., Herrero, P. (eds) On the Move to Meaningful Internet Systems: OTM 2009. OTM 2009. Lecture Notes in Computer Science, vol 5870. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05148-7_42
Download citation
DOI: https://doi.org/10.1007/978-3-642-05148-7_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05147-0
Online ISBN: 978-3-642-05148-7
eBook Packages: Computer ScienceComputer Science (R0)