# Combined performance and availability analysis of distributed resources in grid computing

- 261 Downloads
- 2 Citations

## Abstract

This paper presents a mathematical model to evaluate the performance of grid resources when availability of the resources is taken into account. The proposed model uses continuous time Markov chains (CTMCs) to model the failure-repair behavior of a grid resource. In grid computing environment, a resource not only may fail during task execution, but also it can cancel its membership at any time. Hence, the proposed CTMC considers the availability of a grid resource to a grid user in both failure and membership refusal situations. After modeling the availability of the resource, the mean sojourn time of grid tasks in each of the availability states is estimated. Assigning the mean sojourn times of the tasks as performance levels to each of the CTMC’s states, a Markov reward model (MRM) representing the combined performance and availability measures is obtained. Computing the cumulative state probability of the CTMC and multiplying reward rates of the MRM’s states to each of the corresponding state probabilities, the expected accumulated sojourn time of grid tasks in each of the grid resources is achieved. An illustrative example is presented and the results obtained from the proposed model are reported in cases where various scheduling disciplines are considered inside the grid resource to simultaneously service grid and local tasks.

## Keywords

Grid computing Performance Availability Markov reward model Mean sojourn time## Abbreviations

- FTCS
Fault tolerant computer system

- CTMC
Continuous time Markov chain

- MRM
Markov reward model

- RMS
Resource management system

- GSPN
Generalized stochastic Petri net

- SAN
Stochastic activity network

## List of symbols

- \(t\)
Time

- \(i\)
Index of the system

- \(\alpha \)
Failure rate of the resource

- \(\beta \)
Repair rate of the resource

- \(\lambda _\mathrm{l}\)
Local tasks arrival rate

- \(\mu _\mathrm{l}\)
Resource service rate for local tasks

- \(\lambda _\mathrm{g}\)
Grid tasks arrival rate

- \(\mu _\mathrm{g}\)
Resource service rate for grid tasks

- \(\mu \)
Total service rate of the resource

- \(Z(t)\)
\(\hbox {t}\ge 0\), a random process representing the related CTMC

- \(\Omega \)
State space of CTMC

- \(N\)
Number of states in CTMC

- \(Q\)
Generator matrix

- \(P(t)\)
Transient probability vector

- \(p_{0}\)
Initial probability vector

- \(\pi \)
Steady state probability vector

- \(L(t)\)
Cumulative state probability vector during time period \(\left[ {0,t} \right) \)

- \(X(t)\)
Instantaneous reward rate of the related MRM

- \(r\)
Reward rate vector over \(Z( t)\)

- \(\Phi (t)\)
Accumulative reward over the period \(\left[ {0,t} \right) \)

- \({E\left[ {X(t)}\right] }\)
Expected instantaneous reward rate

- \({E\left[ X \right] }\)
Expected steady state reward

- \({E\left[ {\Phi (t)} \right] }\)
Expected accumulated reward rate

## Notes

### Acknowledgments

The authors would like to thank Iran Telecommunication Research Center (ITRC) for their support. This research was also supported by the MSIP (Ministry of Science, ICT and Future Planning), Korea, under the CPRC (Communications Policy Research Center) support program supervised by the KCA (Korea Communications Agency)(KCA-1194100004).

## References

- 1.Foster I, Kesselman C (2004) The Grid 2: blueprint for a new computing infrastructure, 2nd edn. Morgan Kaufmann, San FranciscoGoogle Scholar
- 2.Krauter K, Buyya R, Maheswaran M (2002) A taxonomy and survey of grid resource management systems for distributed computing. Softw Pract experience 32(2):135–164CrossRefzbMATHGoogle Scholar
- 3.Levitin G, Dai Y-S (2007) Service reliability and performance in grid system with star topology. Reliab Eng System Saf 92(1):40–46Google Scholar
- 4.Dai Y-S, Levitin G (2006) Reliability and performance of tree-structured grid services. IEEE Trans Reliab 55(2):337–349CrossRefGoogle Scholar
- 5.Azgomi MA, Entezari-Maleki R (2010) Task scheduling modelling and reliability evaluation of grid services using coloured Petri nets. Futur Gener Comput Systems 26(8):1141–1150CrossRefGoogle Scholar
- 6.Entezari-Maleki R, Movaghar A (2011) Availability modeling of grid computing environments using SANs. In: The 19th International Conference on Software, Telecommunications and Computer Networks (SoftCOM 2011), Split, Croatia, September 15–17, 2011, pp 1–6Google Scholar
- 7.Caron E, Garonne V, Tsaregorodtsev A (2007) Definition, modeling and simulation of a grid computing scheduling system for high throughput computing. Futur Gener Comput Systems 23(8):968–976CrossRefGoogle Scholar
- 8.Parsa S, Entezari-Maleki R (2009) Modeling and throughput analysis of grid task scheduling using stochastic Petri nets. In: The 2009 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA’09), Las Vegas, July 13–16, 2009, pp 458–464Google Scholar
- 9.Berten V, Devillers R, Louchard G (2007) FIFO queuing of constant length fully synchronous jobs. In: The 4th International Conference on Grid Service Engineering and Management (GSEM ’07), Leipzig, September 25–26, 2007, pp 81–95Google Scholar
- 10.Jayasinghe M, Tari Z, Zeephongsekul P, Zomaya AY (2011) Task assignment in multiple server farms using preemptive migration and flow control. J Parallel Distributed Comput 71(12):1608–1621CrossRefzbMATHGoogle Scholar
- 11.Zhang L, Zhang B, Zhang Y, Zeng L (2011) Efficient scheduling strategies in high performance computing service platform for Shanghai Colleges. In: The 3rd International Conference on Computer Research and Development (ICCRD’11), Shanghai, March 11–13, 2011, pp 176–180Google Scholar
- 12.Entezari-Maleki R, Movaghar A (2012) A probabilistic task scheduling method for grid environments. Futur Gener Comput Systems 28(3):513–524CrossRefGoogle Scholar
- 13.Adabi S, Movaghar A, Rahmani AM, Beigy H (2013) Market\_based grid resource allocation using new negotiation model. J Netw Comput Appl 36(1):543–565CrossRefGoogle Scholar
- 14.Parsa S, Entezari-Maleki R (2012) Task dispatching approach to reduce the number of waiting tasks in grid environments. J Supercomput 59(1):469–485CrossRefGoogle Scholar
- 15.Parsa S, Entezari-Maleki R (2012) A queuing network model for minimizing the total makespan of computational grids. Comput Electr Eng 38(4):827–839CrossRefGoogle Scholar
- 16.Trivedi KS, Muppala JK, Woolet SP, Haverkort BR (1992) Composite performance and dependability analysis. Perform Eval 14(2–3):197–215CrossRefzbMATHGoogle Scholar
- 17.Reibman AL (1990) Modeling the effect of reliability on performance. IEEE Trans Reliab 39(3):314–320CrossRefGoogle Scholar
- 18.Dai Y-S, Xie M, Poh K-L (2008) Availability modeling and cost optimization for the grid resource management system. IEEE Trans Systems Man Cybern Part A Systems Hum 38(1):170–179CrossRefGoogle Scholar
- 19.Parsa S, Parand F-A (2012) Estimation of service reliability and performance in grid environment. J King Saud Univ Eng Sci 24(2):151–157Google Scholar
- 20.Trivedi KS, Ciardo G, Malhotra M, Sahner RA (1993) Dependability and performability analysis. NASA Contractor Report 191565, Report no. 93–85, November 1993Google Scholar
- 21.Trivedi KS, Malhotra M (1993) Reliability and performability techniques and tools: a survey. In: The 7th ITG/GI Conference on Measurement, Modelling and Evaluation of Computer and Communication Systems (MMB), Germany, September 1993, pp 27–48Google Scholar
- 22.Meyer JF (1980) On evaluating the performability of degradable computing systems. IEEE Trans Comput c–29(8):720–731CrossRefGoogle Scholar
- 23.Ma Y, Han JJ, Trivedi KS (2001) Composite performance and availability analysis of wireless communication networks. IEEE Trans Vehicular Technol 50(5):1216–1223CrossRefGoogle Scholar
- 24.Beaudry MD (1978) Performance-related reliability measures for computing systems. IEEE Transact Comput C–27(6):540–547CrossRefGoogle Scholar
- 25.Hussain H et al (2013) A survey on resource allocation in high performance distributed computing systems. Parallel Comput 39(11):709–736CrossRefMathSciNetGoogle Scholar
- 26.Meyer JF (1982) Closed-form solutions of performability. IEEE Transact Comput C–31(7):648–657CrossRefGoogle Scholar
- 27.Balasangameshwaraa J, Rajub N (2012) A hybrid policy for fault tolerant load balancing in grid computing environments. J Netw Comput Appl 35(1):412–422CrossRefGoogle Scholar
- 28.Sun X-H, Wu M (2007) Quality of service of grid computing: resource sharing. In: The 6th International Conference on Grid and Cooperative Computing (GCC’07), Los Alamitos, August 16–18, 2007, pp 395–402Google Scholar
- 29.Bolch G, Greiner S, Meer HD, Trivedi KS (2006) Queueing networks and Markov chains: modeling and performance evaluation with computer science applications, 2nd edn., Wiley, New YorkGoogle Scholar
- 30.Allen AO (1990) Probability, statistics, and queueing theory with computer science applications, 2nd edn., Harcourt Brace Jovanovich, New YorkGoogle Scholar