Skip to main content
Log in

A new framework of complex system reliability with imperfect maintenance policy

  • S.I.: Statistical Reliability Modeling and Optimization
  • Published:
Annals of Operations Research Aims and scope Submit manuscript

Abstract

The interactions and dependencies between software and hardware are often neglected in modeling system reliability in the past few decades due to the mathematical complexity. However, many system failures occurred from the interactions or simultaneous occurrences of software and hardware. This paper first proposes a new diagram of categorizing system-level failures and further incorporates such a diagram into the development of complex system reliability framework. System-level failures result from software subsystem, hardware subsystem, and the interactions of software and hardware subsystems. The focus of this study is on the investigation of the interactions failures generated from the interactions of software and hardware subsystems. In addition to the considerations of total hardware failures, software-induced hardware failures, and hardware-induced software failures introduced by Zhu and Pham (Mathematics 7(11):1049, 2019), we further introduce the partial hardware failures that can be respectively induced by hardware and software to explicitly demonstrate the dependencies and interactions between software and hardware. Hence, a new complex system reliability framework is developed based on such system-level failure categorization with the Markov process. Furthermore, the numerical examples are studied to illustrate the impacts on system reliability with the changes of state transition parameters that modeling the interactions of software and hardware subsystems. Finally, we have studied two maintenance policies of the proposed complex system reliability model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  • Avizienis, A., Laprie, J. C., Randell, B., & Landwehr, C. (2004). Basic concepts and taxonomy of dependable and secure computing. IEEE Transactions on Dependable and Secure Computing, 1(1), 11–33.

    Article  Google Scholar 

  • Bar-Yam, Y. (2019). Dynamics of complex systems. Cambridge: CRC Press.

    Book  Google Scholar 

  • Dessouky, G., Gens, D., Haney, P., Persyn, G., Kanuparthi, A., & Khattri, H., et al. (2019). Hardfails: insights into software-exploitable hardware bugs. In: Proceedings of 28th USENIX Security Symposium, Santa Clara, California, USA (pp. 213–230).

  • Evtyushkin, D., Riley, R., Abu-Ghazaleh, N. C. A. E., & Ponomarev, D. (2018). Branchscope: A new side-channel attack on directional branch predictor. ACM SIGPLAN Notices, 53(2), 693–707.

    Article  Google Scholar 

  • Ferraiuolo, A., Xu, R., Zhang, D., Myers, A. C., & Suh, G. E. (2017). Verification of a practical hardware security architecture through static information flow analysis. In: Proceedings of the twenty-second international conference on architectural support for programming languages and operating systems, Xi’an, China (pp. 555–568).

  • Friedman, M. A., Tran, P. Y., & Goddard, P. L. (1992). Reliability techniques for combined hardware and software systems. In: Proceedings of the annual reliability and maintainability symposium, Las Vegas, NV, USA.

  • Goel, A. L., & Okumoto, K. (1979). Time-dependent error-detection rate model for software reliability and other performance measures. IEEE Transactions on Reliability, 28(3), 206–211.

    Article  Google Scholar 

  • Gras, B., Razavi, K., Bos, H., & Giuffrida, C. (2018). Translation leak-aside buffer: Defeating cache side-channel protections with {TLB} attacks. In: Proceedings of 27th {USENIX} security symposium, Baltimore, Maryland, USA (pp. 955–972).

  • Gu, Y., Fan, C., Liang, L., & Zhang, J. (2019). Reliability calculation method based on the Copula function for mechanical systems with dependent failure. Annals of Operations Research. https://doi.org/10.1007/s10479-019-03202-5.

    Article  Google Scholar 

  • Hecht, H., & Hecht, M. (1986). Software reliability in the system context. IEEE Transactions on Software Engineering, 1, 51–58.

    Article  Google Scholar 

  • Jerraya, A., & Wolf, W. (2005). Hardware/software interface codesign for embedded systems. Computer, 38(2), 63–69.

    Article  Google Scholar 

  • Jing, Q., Vasilakos, A. V., Wan, J., Lu, J., & Qiu, D. (2014). Security of the internet of things: Perspectives and challenges. Wireless Networks, 20(8), 2481–2501.

    Article  Google Scholar 

  • Kanoun, K., & Ortalo-Borrel, M. (2000). Fault-tolerant system dependability-explicit modeling of hardware and software component-interactions. IEEE Transactions on Reliability, 49(4), 363–376.

    Article  Google Scholar 

  • Kapur, P. K., Pham, H., Aggarwal, A. G., & Kaur, G. (2012). Two dimensional multi-release software reliability modeling and optimal release planning. IEEE Transactions on Reliability, 61(3), 758–768.

    Article  Google Scholar 

  • Laprie, J. C., & Kanoun, K. (1992). X-ware reliability and availability modeling. IEEE Transactions on Software Engineering, 2, 130–147.

    Article  Google Scholar 

  • Laprie, J. C., Kanoun, K., Béounes, C., & Kaâniche, M. (1991). The KAT (Knowledge-Action-Transformation) approach to the modeling and evaluation of reliability and availability growth. IEEE Transactions on Software Engineering, 4, 370–382.

    Article  Google Scholar 

  • Lorén, S., & de Maré, J. (2015). Maintenance for reliability—A case study. Annals of Operations Research, 224(1), 111–119.

    Article  Google Scholar 

  • Lung, C. H., Zhang, X., & Rajeswaran, P. (2016). Improving software performance and reliability in a distributed and concurrent environment with an architecture-based self-adaptive framework. Journal of Systems and Software, 121, 311–328.

    Article  Google Scholar 

  • Minamino, Y., Inoue, S., & Yamada, S. (2016). NHPP-based change-point modeling for software reliability assessment and its application to software development management. Annals of Operations Research, 244(1), 85–101.

    Article  Google Scholar 

  • Moghimi, A., Wichelmann, J., Eisenbarth, T., & Sunar, B. (2019). Memjam: A false dependency attack against constant-time crypto implementations. International Journal of Parallel Programming, 47(4), 538–570.

    Article  Google Scholar 

  • Park, J., & Baik, J. (2015). Improving software reliability prediction through multi-criteria based dynamic model selection and combination. Journal of Systems and Software, 101, 236–244.

    Article  Google Scholar 

  • Park, J., Kim, H. J., Shin, J. H., Baik, J. (2012). An embedded software reliability model with consideration of hardware related software failures. In: Proceedings of 2012 IEEE sixth international conference on software security and reliability, Gaithersburg, MD, USA (pp. 207–214).

  • Pham, H., & Wang, H. (2000). Optimal (τ, T) opportunistic maintenance of a k-out-of-n: G system with imperfect PM and partial failure. Naval Research Logistics (NRL), 47(3), 223–239.

    Article  Google Scholar 

  • Pignal, P. I. (1988). An analysis of hardware and software availability exemplified on the IBM 3725 communication controller. IBM Journal of Research and Development, 32(2), 268–278.

    Article  Google Scholar 

  • Ross, S. M. (2014). Introduction to probability models. Cambridge: Academic Press.

    Google Scholar 

  • Sinuany-Stern, Z. (1999). Reliability and maintenance in production control-an introduction. Annals of Operations Research, 91, 1–10.

    Article  Google Scholar 

  • Stark, G. E. (1987). Dependability evaluation of integrated hardware/software systems. IEEE Transactions on Reliability, 36(4), 440–444.

    Article  Google Scholar 

  • Tao, F., Zuo, Y., Da Xu, L., & Zhang, L. (2014). IoT-based intelligent perception and access of manufacturing resource toward cloud manufacturing. IEEE Transactions on Industrial Informatics, 10(2), 1547–1557.

    Article  Google Scholar 

  • Teng, X., Pham, H., & Jeske, D. R. (2006). Reliability modeling of hardware and software interactions, and its applications. IEEE Transactions on Reliability, 55(4), 571–577.

    Article  Google Scholar 

  • Van Driel, W. D., Schuld, M., Wijgers, R., & Van Kooten, W. E. J. (2014). Software reliability and its interaction with hardware reliability. In: Proceedings of 2014 15th international conference on thermal, mechanical and mulit-physics simulation and experiments in microelectronics and microsystems, Ghent, Belgium (pp. 1–8).

  • Wan, J., Tang, S., Shu, Z., Li, D., Wang, S., Imran, M., et al. (2016). Software-defined industrial internet of things in the context of industry 4.0. IEEE Sensors Journal, 16(20), 7373–7380.

    Article  Google Scholar 

  • Wang, J., Bai, G., Li, Z., & Zuo, M. J. (2020). A general discrete degradation model with fatal shocks and age-and state-dependent nonfatal shocks. Reliability Engineering and System Safety, 193, 106648.

    Article  Google Scholar 

  • Wang, F., & Li, H. (2018). Distribution modeling for reliability analysis: Impact of multiple dependences and probability model selection. Applied Mathematical Modelling, 59, 483–499.

    Article  Google Scholar 

  • Wang, H., & Pham, H. (1996). Optimal age-dependent preventive maintenance policies with imperfect maintenance. International Journal of Reliability, Quality and Safety Engineering, 3(02), 119–135.

    Article  Google Scholar 

  • Wang, Y., & Pham, H. (2011). Modeling the dependent competing risks with multiple degradation processes and random shock using time-varying copulas. IEEE Transactions on Reliability, 61(1), 13–22.

    Article  Google Scholar 

  • Wang, S., Wan, J., Li, D., & Zhang, C. (2016a). Implementing smart factory of industrie 4.0: An outlook. International Journal of Distributed Sensor Networks, 12(1), 3159805.

    Article  Google Scholar 

  • Wang, S., Wan, J., Zhang, D., Li, D., & Zhang, C. (2016b). Towards smart factory for industry 4.0: A self-organized multi-agent system with big data based feedback and coordination. Computer Networks, 101, 158–168.

    Article  Google Scholar 

  • Welke, S. R., Johnson, B. W., & Aylor, J. H. (1995). Reliability modeling of hardware/software systems. IEEE Transactions on Reliability, 44(3), 413–418.

    Article  Google Scholar 

  • Xu, D., He, J., & Yang, Z. (2019). Reliability prediction based on Birnbaum–Saunders model and its application to smart meter. Annals of Operations Research. https://doi.org/10.1007/s10479-019-03429-2.

    Article  Google Scholar 

  • Yang, G. (2007). Life cycle reliability engineering. Hoboken: Wiley.

    Book  Google Scholar 

  • Yarom, Y., Genkin, D., & Heninger, N. (2017). CacheBleed: A timing attack on OpenSSL constant-time RSA. Journal of Cryptographic Engineering, 7(2), 99–112.

    Article  Google Scholar 

  • Zeng, Y., Xing, L., Zhang, Q., & Jia, X. (2019). An analytical method for reliability analysis of hardware-software co-design system. Quality and Reliability Engineering International, 35(1), 165–178.

    Article  Google Scholar 

  • Zhao, X., Cai, J., Mizutani, S., & Nakagawa, T. (2020a). Preventive replacement policies with time of operations, mission durations, minimal repairs and maintenance triggering approaches. Journal of Manufacturing Systems. https://doi.org/10.1016/j.jmsy.2020.04.003.

    Article  Google Scholar 

  • Zhao, X., Chen, M., & Nakagawa, T. (2020b). Periodic replacement policies with shortage and excess costs. Annals of Operations Research. https://doi.org/10.1007/s10479-020-03566-z.

    Article  Google Scholar 

  • Zhao, X., Wu, C., Wang, X., & Sun, J. (2020c). Reliability analysis of k-out-of-n: F balanced systems with multiple functional sectors. Applied Mathematical Modelling, 82, 108–124.

    Article  Google Scholar 

  • Zhou, H., Wang, S., Qi, F., & Gao, S. (2019). Maintenance modeling and operation parameters optimization for complex production line under reliability constraints. Annals of Operations Research. https://doi.org/10.1007/s10479-019-03228-9.

    Article  Google Scholar 

  • Zhu, M., & Pham, H. (2018a). A two-phase software reliability modeling involving with software fault dependency and imperfect fault removal. Computer Languages, Systems and Structures, 53, 27–42.

    Article  Google Scholar 

  • Zhu, M., & Pham, H. (2018b). A software reliability model incorporating martingale process with gamma-distributed environmental factors. Annals of Operations Research. https://doi.org/10.1007/s10479-018-2951-7.

    Article  Google Scholar 

  • Zhu, M., & Pham, H. (2018c). A multi-release software reliability modeling for open source software incorporating dependent fault detection process. Annals of Operations Research, 269(1–2), 773–790.

    Article  Google Scholar 

  • Zhu, M., & Pham, H. (2019). A novel system reliability modeling of hardware, software, and interactions of hardware and software. Mathematics, 7(11), 1049.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mengmeng Zhu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhu, M. A new framework of complex system reliability with imperfect maintenance policy. Ann Oper Res 312, 553–579 (2022). https://doi.org/10.1007/s10479-020-03852-w

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10479-020-03852-w

Keywords

Navigation