A comprehensive evaluation of availability and operational cost for a virtualized server system using stochastic reward nets

Nguyen, Tuan Anh; Min, Dugki; Choi, Eunmi

doi:10.1007/s11227-017-2127-2

A comprehensive evaluation of availability and operational cost for a virtualized server system using stochastic reward nets

Published: 28 August 2017

Volume 74, pages 222–276, (2018)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

991 Accesses
14 Citations
Explore all metrics

A Correction to this article was published on 18 May 2018

This article has been updated

Abstract

Virtualized server systems, as a major underlying element in high-performance computing systems, require further studies on many aspects of dependability. Among the significant factors, the availability measures are crucial to deliver high-quality services. Previous studies presented various modeling and analysis results on system availability of a virtualized system with two servers using a continuous-time Markov chain. In this study, we propose a cluster model of m virtualized servers using stochastic reward nets (SRNs). We focused on the overall configuration of the entire system, and in the modeling, we considered the detailed interactions between the servers. The model incorporates specific techniques for high availability of the system: standby techniques, virtual machine (VM) live migration and VM failover techniques. Simplified failures and recovery behaviors of physical servers and VMs are taken into consideration. Various SRN models are developed based on different case studies in which the techniques to improve the system’s overall availability are incorporated one after another. We conducted comprehensive analyses on the models with significant metrics of interest including: steady-state availability (SSA), sensitivity analysis of the SSA, downtime cost and operational cost analyses. We propose to use reward functions featured in SRN as a solution to help ease the computation of operational costs. The study provides an analytical basis for system adjustment and configuration of virtualized systems in data centers, cloud computing in practice.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Multi-objective Virtual Machine Scheduling Algorithm in Fault Tolerance Aware Cloud Environments

A stochastic reward net-based assessment of reliability, availability and operational cost for a software-defined network infrastructure

Article 03 November 2018

QoS-Aware Service VM Provisioning in Clouds: Experiences, Models, and Cost Analysis

Change history

18 May 2018
The section “Acknowledgement” was incorrect in the original article. The correct section “Acknowledgement” is given below.

References

Nabi M, Toeroe M, Khendek F (2016) Availability in the cloud: state of the art. J Netw Comput Appl 60:54–67. doi:10.1016/j.jnca.2015.11.014
Article Google Scholar
Stansberry M (2013) 2013 data center industry survey. Uptime Institute, LLC, Washington
Google Scholar
Thein T, Chi SD, Park JS (2008) Availability modeling and analysis on virtualized clustering with rejuvenation. IJCSNS Int J Comput Sci Netw Secur 8(9):72–80
Google Scholar
Myint MTH, Thein T (2010) Availability improvement in virtualized multiple servers with software rejuvenation and virtualization. In: 2010 Fourth International Conference on Secure Software Integration and Reliability Improvement. IEEE, pp 156–162. doi:10.1109/SSIRI.2010.19. ISBN: 978-1-4244-7435-6. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5502845
Kim DS, Machida F, Trivedi KS (2009) Availability modeling and analysis of a virtualized system. In: 2009 15th IEEE Pacific Rim International Symposium on Dependable Computing, vol 1. IEEE, pp 365–371. doi:10.1109/PRDC.2009.64. ISBN: 978-0-7695-3849-5. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5368189
Machida F, Kim DS, Trivedi KS (2013) Modeling and analysis of software rejuvenation in a server virtualized system with live VM migration. Perform Eval 70(3):212–230. doi:10.1016/j.peva.2012.09.003
Article Google Scholar
Machida F, Kim DS, Trivedi KS (2010) Modeling and analysis of software rejuvenation in a server virtualized system. In: 2010 IEEE Second International Workshop on Software Aging and Rejuvenation. IEEE, pp 1–6. doi:10.1109/WOSAR.2010.5722098. ISBN: 978-1-61284-344-5. http://www.mendeley.com/research/modeling-analysis-software-rejuvenation-server-virtualized-system/http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5722098
Matos RDS, Maciel PRM, Machida F, Kim DS, Trivedi KS (2012) Sensitivity analysis of server virtualized system availability. IEEE Trans Reliab 61(4):994–1006. doi:10.1109/TR.2012.2220711
Article Google Scholar
Nguyen TA, Kim DS, Park JS (2014) A comprehensive availability modeling and analysis of a virtualized servers system using stochastic reward nets. Sci World J 2014:1–18. doi:10.1155/2014/165316
Google Scholar
Nanda S, Chiueh T (2005) A survey of virtualization technologies. Technical report, SUNY. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.74.371
Barham P, Dragovic B, Fraser K, Hand S, Harris T, Ho A, Neugebauer R, Pratt I, Warfield A (2003) Xen and the art of virtualization. In: Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles—SOSP’03, vol 37. ACM Press, New York, NY, USA, p 164. doi:10.1145/945445.945462. ISBN: 1581137575. http://dl.acm.org/citation.cfm?id=945445.945462. http://portal.acm.org/citation.cfm?doid=945445.945462
Sahoo J, Mohapatra S, Lath R (2010) Virtualization: a survey on concepts, taxonomy and associated security issues. In: 2010 Second International Conference on Computer and Network Technology (ICCNT). doi:10.1109/ICCNT.2010.49
Morrill H, Beard M, Clitherow D (2008) Achieving continuous availability of IBM systems infrastructures. IBM Syst J 47(4):493–503. doi:10.1147/SJ.2008.5386519
Article Google Scholar
Loveland S, Dow EM, LeFevre F, Beyer D, Chan PF (2008) Leveraging virtualization to optimize high-availability system configurations. IBM Syst J 47(4):591–604. doi:10.1147/SJ.2008.5386515
Article Google Scholar
Radhakrishnan R, Mark K, Powell B (2008) IT service management for high availability. IBM Syst J 47(4):549–561. doi:10.1147/SJ.2008.5386521
Article Google Scholar
Ameen RY, Hamo AY (2013) Survey of server virtualization. Int J Comput Sci Inf Secur 11(3): 65–74. arXiv: 1304.3557. http://arxiv.org/abs/1304.3557
Adeshiyan T, Attanasio CR, Farr EM, Harper RE, Pelleg D, Schulz C, Spainhower LF, Ta-Shma P, Tomek LA (2009) Using virtualization for high availability and disaster recovery. IBM J Res Dev 53(4):8:1–8:11. doi:10.1147/JRD.2009.5429062
Article Google Scholar
Mikkilineni R, Kankanhalli G (2010) Using virtualization to prepare your data center for “real-time assurance of business continuity”. In: 2010 19th IEEE International Workshops on Enabling Technologies: Infrastructures for Collaborative Enterprises. IEEE, pp 76–81. doi:10.1109/WETICE.2010.18. ISBN: 978-1-4244-7216-1. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5541978
Thein T, Chi SD, Park JS (2008) Improving fault tolerance by virtualization and software rejuvenation. In: 2008 Second Asia International Conference on Modelling & Simulation (AMS). IEEE, pp 855–860. doi:10.1109/AMS.2008.75. ISBN: 978-0-7695-3136-6. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=4530587
Xu J, Li X, Zhong Y, Zhang H (2014) Availability modeling and analysis of a single-server virtualized system with rejuvenation. J Softw 9(1):129–139. doi:10.4304/jsw.9.1.129-139
Article Google Scholar
Han L, Xu J (2013) Availability models for virtualized systems with rejuvenation. J Comput Inf Syst 20:8389–8396. doi:10.12733/jcis8586
Google Scholar
Sahner R, Trivedi K, Puliafito A (1997) Performance and reliability analysis of computer systems (an example-based approach using the Sharpe software). IEEE Trans Reliab 46(3):441–441. doi:10.1109/TR.1997.664017
Article MATH Google Scholar
Nguyen TA, Park JS (2014) Availability Modeling and Analysis in a Virtualized Servers Network, Seoul, Korea
Gray J, Siewiorek D (1991) High-availability computer systems. Computer 24(9):39–48. doi:10.1109/2.84898
Article Google Scholar
Pham C, Kalbarczyk Z, Iyer RK (2012) Toward a high availability cloud: techniques and challenges. In: IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN 2012). IEEE, pp 1–6. doi:10.1109/DSNW.2012.6264687. ISBN: 978-1-4673-2266-9. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6264687
Lumpp T, Schneider J, Holtz J, Mueller M, Lenz N, Biazetti a, Petersen D (2008) From high availability and disaster recovery to business continuity solutions. IBM Syst J 47(4):605–619. doi:10.1147/SJ.2008.5386516
Article Google Scholar
Mondal SK, Muppala JK, Machida F, Trivedi KS (2014) Computing defects per million in cloud caused by virtual machine failures with replication. In: 2014 IEEE 20th Pacific Rim International Symposium on Dependable Computing. IEEE, pp 161–168. doi:10.1109/PRDC.2014.29. ISBN: 978-1-4799-6474-1. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6974785
Mondal SK, Muppala JK (2014) Energy modeling of virtual machine replication schemes with checkpointing in data centers. In: 2014 IEEE Fourth International Conference on Big Data and Cloud Computing. IEEE, pp 633–640. doi:10.1109/BDCloud.2014.37. ISBN: 978-1-4799-6719-3. http://ieeexplore.ieee.org/articleDetails.jsp?arnumber=7034853. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=7034853
Mondal S, Muppala J, Machida F (2016) Virtual machine replication on achieving energy-efficiency in a cloud. Electronics 5(3):37. doi:10.3390/electronics5030037
Article Google Scholar
Cully B, Lefebvre G, Meyer D, Feeley M, Hutchinson N, Warfield A (2008) Remus: high availability via asynchronous virtual machine replication. In: NSDI’08 Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation. USENIX Association, pp 161–174. ISBN: 111- 999-5555-22-1. http://dl.acm.org/citation.cfm?id=1387589.1387601
Polze A, Troger P, Salfner F (2011) Timely virtual machine migration for pro-active fault tolerance. In: 2011 14th IEEE International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing Workshops. IEEE, pp 234–243. doi:10.1109/ISORCW.2011.42. ISBN: 978-1-4577-0303-4. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5753533
Chang F, Ji M, Leung ST, MacCormick J, Perl S, Zhang L (2002) Myriad: cost-effective disaster tolerance. In: FAST ’02 Proceedings of the 1st USENIX Conference on File and Storage Technologies. USENIX Association, p 8. http://dl.acm.org/citation.cfm?id=1083323.1083334
Trivedi KS, Kim DS, Roy A, Medhi D (2009) Dependability and security models. In: Proceedings of the 2009 7th International Workshop on the Design of Reliable Communication Networks. DRCN, pp 11–20. doi:10.1109/DRCN.2009.5340029. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5340029
Trivedi KS, Kim DS, Ghosh R (2013) System availability assessment using stochastic models. Appl Stoch Models Bus Ind 29(2):94–109. doi:10.1002/asmb.951
Article MathSciNet Google Scholar
Bailey D, Frank-Schultz E, Lindeque P, Temple JL III (2008) Three reliability engineering techniques and their application to evaluating the availability of IT systems: an introduction. IBM Syst J 47(4):577–589. doi:10.1147/SJ.2008.5386507
Article Google Scholar
Smith WE, Trivedi KS, Tomek LA, Ackaret J (2008) Availability analysis of blade server systems. IBM Syst J 47(4):621–640. doi:10.1147/SJ.2008.5386524
Article Google Scholar
Ever E (2016) Performability analysis of cloud computing centers with large numbers of servers. J Supercomput 1–27. doi:10.1007/s11227-016-1906-5. ISSN: 0920-8542. http://link.springer.com/10.1007/s11227-016-1906-5
Muppala JK, Ciardo G, Trivedi KS (1994) Stochastic reward nets for reliability prediction. Commun Reliab Maintainab Serviceability 9–20. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.16.1102
Seshadri S, Muench PH, Chiu L, Koltsidas I, Ioannou N, Haas R, Liu Y, Mei M, Blinick S (2014) Software defined just-in-time caching in an enterprise storage system. IBM J Res Dev 58(2/3):7:1–7:13. doi:10.1147/JRD.2014.2303595
Article Google Scholar
Maciel P, Matos R, Callou G, Silva B, Barreto D, Araujo J, Araujo J, Alves V, Worth S (2014) Performance evaluation of sheepdog distributed storage system. In: 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC). IEEE, pp 3370–3375. doi:10.1109/SMC.2014.6974448. ISBN: 978-1-4799-3840-7. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6974448
Clitherow D, Brookbanks M, Clayton N, Spear G (2008) Combining high availability and disaster recovery solutions for critical IT environments. IBM Syst J 47(4):563–575. doi:10.1147/SJ.2008.5386509
Article Google Scholar
Cocchiara R, Davis H, Kinnaird D (2008) Data center topologies for mission-critical business systems. IBM Syst J 47(4):695–706. doi:10.1147/SJ.2008.5386510
Article Google Scholar
Andrzejak A, Silva L (2008) Using machine learning for non-intrusive modeling and prediction of software aging. In: NOMS 2008—2008 IEEE Network Operations and Management Symposium. IEEE, pp 25–32. doi:10.1109/NOMS.2008.4575113. ISBN: 978-1-4244-2065-0. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=4575113
Alonso J, Torres J, Berral JL, Gavalda R (2010) Adaptive on-line software aging prediction based on machine learning. In: Proceedings of 2010 IEEE/IFIP International Conference on Dependable Systems & Networks (DSN). IEEE, pp 507–516. doi:10.1109/DSN.2010.5544275. ISBN: 978-1-4244-7500-1. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5544275
Ciardo G, Muppala J, Trivedi K (1989) SPNP: stochastic Petri net package. In: Proceedings of the Third International Workshop on Petri Nets and Performance Models, PNPM89, pp 142–151. doi:10.1109/PNPM.1989.68548. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=68548. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.87.9688
Melo M, Maciel P, Araujo J, Matos R, Araujo C (2013) Availability study on cloud computing environments: live migration as a rejuvenation mechanism. In: 2013 43rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN). IEEE, pp 1–6. doi:10.1109/DSN.2013.6575322. ISBN: 978-1-4673-6472-0. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6575322
Sousa E, Lins F, Tavares E, Cunha P, Maciel P (2014) A modeling approach for cloud infrastructure planning considering dependability and cost requirements. IEEE Trans Syst Man Cybern Syst (99): 1–1 (2014). doi:10.1109/TSMC.2014.2358642. ISSN: 2168-2216. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6913553
Sousa E, Silva E, Lins F, Tavares E, Maciel P (2014) Dependability evaluation of cloud infrastructures. In: 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC). IEEE, pp 1282–1287. doi:10.1109/SMC.2014.6974091. ISBN: 978-1-4799-3840-7. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6974091
Clark C, Fraser K, Hand S, Hansen JG, Jul E, Limpach C, Pratt I, Warfield A (2005) Live migration of virtual machines. In: Proceedings of the 2nd Conference on Symposium on Networked Systems Design & Implementation. vol 2(Vmm), pp 273–286. http://dl.acm.org/citation.cfm?id=1251203.1251223. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.120.374
Kourai K, Chiba S (2011) Fast software rejuvenation of virtual machine monitors. IEEE Trans Dependable Secure Comput 8(6):839–851. doi:10.1109/TDSC.2010.20
Article Google Scholar
Melo C, Matos R, Dantas J, Maciel P (2017) Capacity-oriented availability model for resources estimation on private cloud infrastructure. In: 2017 IEEE 22nd Pacific Rim International Symposium on Dependable Computing (PRDC). IEEE, pp 255–260. doi:10.1109/PRDC.2017.49. ISBN: 978-1-5090-5652-1. http://ieeexplore.ieee.org/document/7920629/
Patel P, Ranabahu A, Sheth A (2009) Service level agreement in cloud computing. Kno.e.sis Publications. http://corescholar.libraries.wright.edu/knoesis/78
Anala M, Kashyap M, Shobha G (2013) Application performance analysis during live migration of virtual machines. doi:10.1109/IAdCC.2013.6514252
Ma F, Liu F, Liu Z (2010) Live virtual machine migration based on improved pre-copy approach. doi:10.1109/ICSESS.2010.5552416
Strunk A, Dargie W (2013) Does live migration of virtual machines cost energy? doi:10.1109/AINA.2013.137
Strunk A (2012) Costs of virtual machine live migration: a survey. In: 2012 IEEE Eighth World Congress on Services. IEEE, pp 323–329. doi:10.1109/SERVICES.2012.23. ISBN: 978-1-4673-3053-4. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6274069
Trivedi KS, Sigmon TM (1979) A performance comparison of optimally designed computer systems with and without virtual memory. In: Proceedings of the 6th Annual Symposium on Computer Architecture—ISCA’79. ACM Press, New York, NY, USA, pp 117–121. doi:10.1145/800090.802900. http://dl.acm.org/citation.cfm?id=800090.802900
Trivedi KS (1980) Designing linear storage hierarchies so as to maximize reliability subject to cost and performance constraints. In: Proceedings of the 7th Annual Symposium on Computer Architecture—ISCA’80. ACM Press, New York, NY, USA, pp 211–217. doi:10.1145/800053.801928. http://dl.acm.org/citation.cfm?id=800053.801928
Longo F, Distefano S, Bruneo D, Scarpa M (2015) Dependability modeling of software defined networking. Comput Netw 83:280–296. doi:10.1016/j.comnet.2015.03.018
Article Google Scholar
Distefano S, Longo F, Trivedi KS (2012) Investigating dynamic reliability and availability through statespace models. Comput Math Appl 64(12):3701–3716. doi:10.1016/j.camwa.2012.02.038
Article MathSciNet MATH Google Scholar
Distefano S, Trivedi KS (2013) Non-markovian state-space models in dependability evaluation. Q Reliab Eng Int 29(2):225–239. doi:10.1002/qre.1305
Article Google Scholar
Distefano S, Longo F, Scarpa M, Trivedi KS (2014) Non-markovian modeling of a bladecenter chassis midplane. Springer, pp 255–269. doi:10.1007/978-3-319-10885-8_18. http://link.springer.com/10.1007/978-3-319-10885-8_18
Machida F, Kim DS, Trivedi KS (2013) Modeling and analysis of software rejuvenation in a server virtualized system with live VM migration. In: Performance Evaluation, vol 70. IEEE, pp 212–230. doi:10.1016/j.peva.2012.09.003. ISBN: 9781612843469. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5722098
Trivedi KS (2001) Probability and statistics with reliability, queuing and computer science applications, 2nd edn. Wiley-Interscience, Chichester, p 830. ISBN: 0-471-33341-7. http://www.amazon.com/Probability-Statistics-Reliability-Queueing-Applications/dp/0471333417. http://dl.acm.org/citation.cfm?id=501883
Trivedi KS, Vasireddy R, Trindale D, Nathan S, Castro R (2006) Modeling high availability systems. In: 2006 12th Pacific Rim International Symposium on Dependable Computing (PRDC’06). IEEE, pp 154–164. doi:10.1109/PRDC.2006.45. ISBN: 0-7695-2724-8. http://www.computer.org/csdl/proceedings/prdc/2006/2724/00/27240154-abs.html
Deshpande U, Kulkarni U, Gopalan K (2012) Inter-rack live migration of multiple virtual machines. In: Proceedings of the 6th International Workshop on Virtualization Technologies in Distributed Computing Date—VTDC’12. ACM Press, New York, NY, USA, p 19. doi:10.1145/2287056.2287062. ISBN: 9781450313445. http://dl.acm.org/citation.cfm?doid=2287056.2287062
Svärd P, Hudzia B, Tordsson J, Elmroth E, Svärd P, Hudzia B, Tordsson J, Elmroth E (2011) Evaluation of delta compression techniques for efficient live migration of large virtual machines. ACM SIGPLAN Not 46(7):111. doi:10.1145/2007477.1952698
Article Google Scholar
Hines MR, Deshpande U, Gopalan K (2009) Post-copy live migration of virtual machines. ACM SIGOPS Oper Syst Rev 43(3):14. doi:10.1145/1618525.1618528
Article Google Scholar
Hirofuchi T, Nakada H, Itoh S, Sekiguchi S (2011) Reactive consolidation of virtual machines enabled by postcopy live migration. In: Proceedings of the 5th International Workshop on Virtualization Technologies in Distributed Computing—VTDC’11. ACM Press, New York, NY, USA, p 11. doi:10.1145/1996121.1996125. ISBN: 9781450307017. http://portal.acm.org/citation.cfm?doid=1996121.1996125
Sahni S, Varma V (2012) A hybrid approach to live migration of virtual machines. In: 2012 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM). IEEE, pp 1–5. doi:10.1109/CCEM.2012.6354587. ISBN: 978-1-4673-4422-7. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6354587. http://ieeexplore.ieee.org/document/6354587/
Yin F, Liu W, Song J (2014) Live virtual machine migration with optimized three-stage memory copy. Springer, Berlin, pp 69–75. doi:10.1007/978-3-642-40861-8_11. http://link.springer.com/10.1007/978-3-642-40861-8_11
Maleszewski J, Sosnowski J (2018) Managing and enhancing performance benchmarks. In: Zamojski W, Mazurkiewicz J, Sugier J, Walkowiak T, Kacprzyk J (eds) Advances in Dependability Engineering of Complex Systems: Proceedings of the Twelfth International Conference on Dependability and Complex Systems DepCoS-RELCOMEX, July 2–6, 2017, Brunów, Poland. Springer, Cham, pp 287–297. doi:10.1007/978-3-319-59415-6_28. ISBN: 978-3-319-59415-6. http://dx.doi.org/10.1007/978-3-319-59415-6_28
Shojafar M, Canali C, Lancellotti R, Abawajy J (2016) Adaptive computing-plus-communication optimization framework for multimedia processing in cloud systems. IEEE Trans Cloud Comput 1–1. doi:10.1109/TCC.2016.2617367. ISSN: 2168-7161. http://ieeexplore.ieee.org/document/7590092/
Ghosh R, Longo F, Naik VK, Trivedi KS (2012) Modeling and performance analysis of large scale IaaS clouds. doi:10.1016/j.future.2012.06.005. http://www.sciencedirect.com/science/article/pii/S0167739X12001410
Ghosh R, Longo F, Frattini F, Russo S, Trivedi K (2014) Scalable analytics for IaaS cloud availability. IEEE Trans Cloud Comput (99) 1–1. doi:10.1109/TCC.2014.2310737. ISSN: 2168-7161. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6762970
Guida M, Longo M, Postiglione F, Trivedi KS, Yin X (2013) Semi-Markov models for performance evaluation of failure-prone IP multimedia subsystem core networks. Proc Inst Mech Eng Part O J Risk Reliab 227(3):290–301. doi:10.1177/1748006X13485191
Google Scholar
Shojafar M, Javanmardi S, Abolfazli S, Cordeschi N (2015) FUGE: a joint meta-heuristic approach to cloud job scheduling algorithm using fuzzy theory and a genetic method. Cluster Comput 18(2):829–844. doi:10.1007/s10586-014-0420-x
Article Google Scholar
Constazltinescu C, Trivedi K (1994) A stochastic reward net model for dependability analysis of real-time computing systems. In: Proceedings of 2nd IEEE Workshop on Real-Time Applications. IEEE Computer Society Press, pp 142–146. doi:10.1109/RTA.1994.316162. ISBN: 0-8186-6375-8. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=316162
Liu Y, Trivedi KS (2006) Survivability quantification: the analytical modeling approach. Int J Perform Eng 2(1):29. http://www.ijpe-online.com/p3-survivability-quantification-the-analytical-modeling-approach.html#axzz3v44nELpI

Download references

Acknowledgements

We appreciate and thank a number of peer reviewers and colleagues who are devoted to give valuable and constructive comments and suggestions. This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2017-2016-0-00465) supervised by the IITP (Institute for Information & communications Technology Promotion).

Author information

Authors and Affiliations

Office of Research, University-Industry Cooperation Foundation, Konkuk University, Seoul, South Korea
Tuan Anh Nguyen
Department of Computer Engineering, Konkuk University, Seoul, South Korea
Tuan Anh Nguyen & Dugki Min
School of Management Information Systems, Kookmin University, Seoul, South Korea
Eunmi Choi

Authors

Tuan Anh Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Dugki Min
View author publications
You can also search for this author in PubMed Google Scholar
Eunmi Choi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Tuan Anh Nguyen or Dugki Min.

Appendices

Appendix 1: Stochastic reward nets

Stochastic reward net (SRN) is used in sufficiently modeling many hardware and software structures of real-time computing systems [78, 79]. To build SRN model, we use three main components: places, transitions and arcs. Arcs only connect place(s) to transition(s) and transition(s) to places. There is an integer number of entities named token denoted by dot sign or integer number in the places. Transition can be enabled to transport tokens from and to places called firing. The state or condition of the system is decided by location of tokens [22, 64]. That means, a set of current location of tokens in SRN models reflects the state or condition of the system, called marking. Guard is a Boolean condition attached to each transition to perform marking dependence. To succinctly describe many complex behaviors, marking-dependent firing rates of transitions are applied as a function of the current marking. This dependency is denoted by \( \# \) sign next to the transition. More general dependencies are often needed and hence allowed in the SRN formalism [64]. There are other features such as input arcs, inhibit arcs, multiplicities, so that SRN models can be simplified. In the following sections, we present very detailed description of the SRN models for small-scale VSS.

Appendix 2: Definition of guard functions in SRN models

See Table 7.

Table 7 Definition of guard functions and entities used in the SRN models

Full size table

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nguyen, T.A., Min, D. & Choi, E. A comprehensive evaluation of availability and operational cost for a virtualized server system using stochastic reward nets. J Supercomput 74, 222–276 (2018). https://doi.org/10.1007/s11227-017-2127-2

Download citation

Published: 28 August 2017
Issue Date: January 2018
DOI: https://doi.org/10.1007/s11227-017-2127-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A comprehensive evaluation of availability and operational cost for a virtualized server system using stochastic reward nets

Abstract

Access this article

Similar content being viewed by others

A Multi-objective Virtual Machine Scheduling Algorithm in Fault Tolerance Aware Cloud Environments

A stochastic reward net-based assessment of reliability, availability and operational cost for a software-defined network infrastructure

QoS-Aware Service VM Provisioning in Clouds: Experiences, Models, and Cost Analysis

Change history

18 May 2018

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Appendices

Appendix 1: Stochastic reward nets

Appendix 2: Definition of guard functions in SRN models

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A comprehensive evaluation of availability and operational cost for a virtualized server system using stochastic reward nets

Abstract

Access this article

Similar content being viewed by others

A Multi-objective Virtual Machine Scheduling Algorithm in Fault Tolerance Aware Cloud Environments

A stochastic reward net-based assessment of reliability, availability and operational cost for a software-defined network infrastructure

QoS-Aware Service VM Provisioning in Clouds: Experiences, Models, and Cost Analysis

Change history

18 May 2018

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Appendices

Appendix 1: Stochastic reward nets

Appendix 2: Definition of guard functions in SRN models

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation