Risk Assessment Modeling in Grids at Component Level: Considering Grid Resources as Repairable

Conference paper
Part of the Advances in Intelligent and Soft Computing book series (AINSC, volume 151)


Service level agreements (SLAs), as formal contractual agreements, increase the confidence level between the End user and the Grid Resource provider, as compared to the best effort approach. On the other end, SLAs fall short of assessing the risk in acceptance of the SLA, risk assessment in Grid computing fills that gap. The current approaches to risk assessment are based on node level risk assessment. This work is differentiated by that it provides risk assessment information at the granularity level of components. A risk assessment model at the component level based on Non-Homogeneous Poisson Process (NHPP) model is proposed. Grid failure data is used for the experimentation at the component level. The Grid risk model selection is validated by using a goodness of fit test along with graphical approaches. The experimental results provide detailed risk assessment information at the component level which can be used by Grid Resource provider to manage and use the Grid resources efficiently.


Risk Assessment Repairable Grid resources Probability of Failure 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Foster, I., Kesselman, C., Tuecke, S.: The anatomy of the grid: Enabling scalable virtual organizations. Int. J. High Perform. Comput. Appl. 15(3), 200–222 (2001)CrossRefGoogle Scholar
  2. 2.
    Krivtsov, V.: Practical extensions to NHPP application in repairable system reliability analysis. Reliability Engineering & System Safety 92(5), 560–562 (2007)CrossRefGoogle Scholar
  3. 3.
    (Los Alamos National Laboratories), http://institute.lanl.gov/data/lanldata.shtml
  4. 4.
    Crow, L.H.: Reliability analysis for complex, repairable systems. Tech. Rep. (December 1975)Google Scholar
  5. 5.
    Burchard, L.O., et al.: A Quality-of-Service architecture for future grid computing applications. In: Proceedings of the 19th IEEE IPDPS 2005, p. 132a (2005)Google Scholar
  6. 6.
    Leff, A., Rayfield, J.T., Dias, D.M.: Service-level agreements and commercial grids. IEEE Internet Computing 7(4), 44–50 (2003)CrossRefGoogle Scholar
  7. 7.
    Ludwig, H., et al.: A service level agreement language for dynamic electronic services. In: Advanced Issues of E-Commerce and Web-Based Information Systems, pp. 25–32 (2002)Google Scholar
  8. 8.
    Padgett, J., Djemame, K., Dew, P.: Grid-Based SLA Management. In: Sloot, P.M.A., Hoekstra, A.G., Priol, T., Reinefeld, A., Bubak, M. (eds.) EGC 2005. LNCS, vol. 3470, pp. 1076–1085. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  9. 9.
    Foster, I.: What is the grid? - a three point checklist. GRIDtoday 1(6) (July 2002)Google Scholar
  10. 10.
    Schopf, J.M., Nitzberg, B.: Grids: The top ten questions. Sci. Program. 10, 103–111 (2002)Google Scholar
  11. 11.
    Schwiegelshohn, U., et al.: Perspectives on grid computing. Future Gener. Comput. Syst. 26(8), 1104–1115 (2010)CrossRefGoogle Scholar
  12. 12.
    Talia, D., et al.: Using SLA for resource management and scheduling - a survey grid middleware and services. In: Grid Middleware and Services, ch. 22, pp. 335–347. Springer, Boston (2008)CrossRefGoogle Scholar
  13. 13.
    Djemame, K., et al.: Brokering of risk-aware service level agreements in grids. Concurrency Computat.: Pract. Exper. 23(13), 1558–1582 (2011)CrossRefGoogle Scholar
  14. 14.
    Sahai, A., et al.: Specifying and monitoring guarantees in commercial grids through SLA. In: Proceedings of the CCGRID 2003. IEEE Computer Society, Washington, DC (2003)Google Scholar
  15. 15.
    Rigdon, S.E., Basu, A.P.: Statistical Methods for the Reliability of Repairable Systems. Wiley-Interscience (2000)Google Scholar
  16. 16.
    Zadeh, L.A.: Fuzzy Set as a Basis for a Theory of Possibility. Fuzzy Sets and Systems 1, 3–28 (1978)MathSciNetMATHCrossRefGoogle Scholar
  17. 17.
    Haimes, Yacov, Y.: Risk Modeling, Assessment, and Management, 3rd edn. Wiley series in system engineering (1998)Google Scholar
  18. 18.
    Djemame, K., et al.: Introducing risk management into the grid. In: Second IEEE International Conference on e-Science and Grid Computing, p. 28. The Netherlands (December 2006)Google Scholar
  19. 19.
    Hovestadt, M., et al.: The first step of introducing risk management for prepossessing slas. In: IEEE International Conference on Services Computing, SCC 2006, pp. 36–43 (2006)Google Scholar
  20. 20.
    Hamada, M.S., et al.: Bayesian Reliability. Springer series in statistics. Springer (2008)Google Scholar
  21. 21.
    Alsoghayer, R., Djemame, K.: Probabilistic risk assessment for resource provision in grids. In: Djemame, K. (ed.) Proceedings of the 25th UK Performance Engineering Workshop, Leeds, UK, pp. 99–110 (July 2009)Google Scholar
  22. 22.
    Sangrasi, A., Djemame, K.: Component level risk assessment in grids: A probablistic risk model and experimentation. In: Proceedings of the 5th IEEE International Conference on Digital Ecosystems and Technologies Conference (DEST), pp. 68–75 (May 2011)Google Scholar
  23. 23.
    Carlsson, C., Fullér, R.: Risk Assessment of SLAs in Grid Computing with Predictive Probabilistic and Possibilistic Models. In: Greco, S., Pereira, R.A.M., Squillante, M., Yager, R.R., Kacprzyk, J. (eds.) Preferences and Decisions. SFSC, vol. 257, pp. 11–29. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  24. 24.
    Gourlay, I., et al.: Reliability and risk in grid resource brokering. In: 2nd IEEE DEST (2008)Google Scholar
  25. 25.
    Bunday, B.D.: Statistical methods in reliability theory and practice, ser. Maths and its applications. Statistics, Operational Research, and Computational Maths. Ellis Horwood (1991)Google Scholar
  26. 26.
    Krautsevich, L., et al.: Risk-Aware Usage Decision Making in Highly Dynamic Systems. In: International Conference on Internet Monitoring and Protection, vol. 0, pp. 29–34 (2010)Google Scholar
  27. 27.
    Iosup, A., et al.: On the dynamic resource availability in Grids. In: Proceedings of the 8th IEEE/ACM International Conference on Grid Computing, USA, September 19-21, pp. 26–33 (2007)Google Scholar
  28. 28.
    Hovestadt, M., et al.: The first step of introducing risk management for prepossessing slas. In: IEEE International Conference on Services Computing, SCC 2006, pp. 36–43 (2006)Google Scholar
  29. 29.
    Pinheiro, E., et al.: Failure trends in a large disk drive population. In: Proceedings of the 5th USENIX Conference on File and Storage Technologies (February 2007)Google Scholar
  30. 30.
    Lingrand, D., et al.: Optimization of Jobs Submission on the EGEE Production Grid: Modeling Faults Using Workload. Journal of Grid Computing 8, 305–321 (2010)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  1. 1.School of ComputingUniversity of LeedsLeedsUK

Personalised recommendations