Skip to main content
Log in

Improving the fault-tolerance of software-defined networks with dynamic overlay agreement

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

Software-Defined Networking (SDN) emerges as one of the leading technologies to address the pressing networking problems such as network virtualization and data center complexity issues using programmable switches and controllers. To well exchange information and have coordinated control for the SDN environment, the multi-controller SDN network has been proposed. Unfortunately, this network architecture still faces the problem of faulty controllers that may forge the command arbitrarily, and then the whole network will be broken down easily. Thus, it is important to propose the fault-tolerance scheme under the SDN network system. In this paper, a new SDN environment called the Multi-Controller Overlay Groups (MCOG) where each device is managed by the controllers is proposed. Besides, the degree of the faulty influence will be revised to define the different bounds of fault tolerance in the MCOG environment. Furthermore, the new protocol named General Dynamic Multi-Controller Agreement (GDMCA) protocol is proposed to solve the Byzantine Agreement (BA) problem in the MCOG environment. Based on the proof of complexity, it can show that the proposed GDMCA protocol is optimal under the MCOG environment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Aslan, M., Matrawy, A.: Adaptive consistency for distributed SDN controllers, 17th International Telecommunications Network Strategy and Planning Symposium (Networks), November 2015, 150–157

  2. Dridi, L., Zhani, M.F.: SDN-Guard: DoS attacks mitigation in SDN networks, 5th IEEE International Conference on Cloud Networking, December 2016, 212–217

  3. Misra, N. Saha and R. Bhakta: Traffic-aware rule-cache assignment in SDN: Security Implications, 2020 IEEE International Conference on Communications Workshops (ICC Workshops), Dublin, Ireland, 2020, 1–6

  4. Chattopadhyay, S., Chatterjee, S., Nandi, S., Chakraborty, S.: Aloe: Fault-Tolerant network management and Orchestration framework for IoT applications, in IEEE Transactions on Network and Service Management. https://doi.org/10.1109/TNSM.2020.3008426

  5. Jia, Z., Sun, Y., Liu, Q., Dai, S., Liu, C.: cRetor: an SDN-based routing scheme for data centers with regular topologies. IEEE Access 8, 116866–116880 (2020)

    Article  Google Scholar 

  6. Q. Qin; K. Poularakis; G. Iosifidis; L. Tassiulas: SDN controller placement at the edge: optimizing delay and overheads, IEEE International Conference on Computer Communications (Infocom), 2018

  7. Huang, H., Guo, S., Liang, W., Li, K., Ye, B., Zhuang, W.: Near-optimal routing protection for in-band software-defined heterogeneous networks. IEEE J. Sel. Areas Commun. 34(11), 2918–2934 (2016)

  8. Wu, D., Nie, X., Asmare, E., Arkhipov, D. I., Qin, Z., Li, R., McCann, J. A., Li, K.: Mobility management and flow scheduling in software defined urban IoT. In: IEEE Transactions on Parallel and Distributed Systems, vol. 31, no. 6, pp. 1400–1418, 1 June 2020

  9. Blial, O., Mamoun, B. M., Benaini, R.: An overview on SDN architectures with multiple controllers. J. Comput. Netw. Commun. 2, 1–8 (2016)

  10. Ho, C.C., Wang, K., Hsu, Y. H.: A fast consensus algorithm for multiple controllers in software-defined networks, 18th International Conference on Advanced Communication Technology (ICACT), March 2016, 112–116

  11. Qiu, C., Song, Q., Zhao, C., Xu, F.: Multiple Controllers Sleeping Management in Green Software-Defined Wireless Networking, 16th International Symposium on Communications and Information Technologies (ISCIT), November 2016, 525–530

  12. Yongkai, Z., Hang, Y., Lijun, Z., Guobao, L., Ge, L.: Multiple SDN Controller Orchestration for Financial Cloud, International Conference on Security of Smart Cities, Industrial Control System and Communications (SSIC), July 2016, 1–6

  13. Chourishi, D., Miri, A., Milic, M., Ismaeel, S.: Role-Based Multiple Controllers for Load Balancing and Security in SDN, IEEE Canada International Humanitarian Technology Conference (IHTC), September 2015, 1–4

  14. Yu, J., Wang, Y., Pei, K., Zhang, S., Li, J.: A Load Balancing Mechanism for multiple SDN Controllers based on Load Informing Strategy, IEEE 18th Asia-Pacific Network Operations and Management Symposium (APNOMS), October 2016, 1–4.

  15. Corici, M., Reichel, B., Bochow, B., Magedanz, T.: An SDN-based Solution for Increasing Flexibility and Reliability of Dedicated Network Environments, IEEE 21st International Conference Emerging Technologies and Factory Automation (ETFA), September 2016, 1–6

  16. Li, H., Jiang, G., Chai, R.: Energy consumption optimization based joint routing and flow allocation algorithm for Software-Defined networking, IEEE 19th International Symposium on Wireless Personal Multimedia Communications (WPMC), November 2016, 311–316

  17. Lamport, L., Shostak, R., Pease, M.: The Byzantine generals problem. ACM Trans. Progr. Lang. Syst. 4(3), 382–401 (1982)

    Article  Google Scholar 

  18. Lynch, N. A., Fischer, M. J., Fowler, R. J.: A simple and efficient Byzantine generals algorithm. In: IEEE Symposium on Reliability in Distributed Software and Database Systems, February 1982, 46–52

  19. Lim, J.B., Jung, D., Suh, T., Yu, H.: Unstructured Membership Management for Byzantine Fault Tolerance in Clouds, 27th International Conference on Architecture of Computing Systems (ARCS), March 2014, 1–8

  20. Pease, M., Shostak, R., Lamport, L.: Reaching Agreement in Presence of Faults. Journal of ACM 27(2), 228–234 (1980)

    Article  MathSciNet  Google Scholar 

  21. Barborak, M., Malek, M., Dahbura, A.: The consensus problem in fault-tolerant computing. ACM Comput. Surv. 25(2), 171–220 (1993)

    Article  Google Scholar 

  22. Baracha, G., Toeug, S.: Asynchronous consensus and broadcast protocols. J. ACM 32(4), 824–840 (1985)

    Article  MathSciNet  Google Scholar 

  23. Grossman, R.L.: The Case for Cloud Computing. IT Professional 11(2), 23–27 (2009)

    Article  Google Scholar 

  24. Chiang, M. L., Cheng, T. Y.: Restudy the Byzantine Agreement Problem under Multi-Controller Groups Software-Defined Networks (Master’s thesis), Available from https://hdl.handle.net/11296/3z5sk4

  25. ElDefrawy, K., Kaczmarek, T.: Byzantine Fault Tolerant Software-Defined Networking Controllers, IEEE 40th Annual Computer Software and Applications Conference, August 2016, 2, 208–213

  26. Li, H., Li, P., Guo, S., Nayak, A.: Byzantine-Resilient Secure Software-Defined Networks with Multiple Controllers in Cloud. IEEE Transactions on Cloud Computing 2(4), 436–447 (2014)

    Article  Google Scholar 

  27. Karmakar, K. K., Varadharajan, V., Tupakula, U.: Mitigating attacks in software defined networks. Clust. Comput. 22, 1143–1157 (2019)

    Article  Google Scholar 

  28. Görkemli, B., Parlakışık, A.M., Civanlar, S., Ulaş, A., Murat, A.: Tekalp: Dynamic Management of Control Plane Performance in Software-Defined Networks, IEEE NetSoft Conference and Workshops (NetSoft), July 2016, 68–72

  29. Jafarian, T.; Masdari, M.; Ghaffari, A. et al.: A survey and classification of the security anomaly detection mechanisms in software defined networks, Cluster Computing, 2020

  30. Reitblatt, M., Foster, N., Rexford, J., Walker, D.: Consistent updates for software-defined networks: Change you can believe in, in Proceedings of the 10th ACM Workshop on Hot Topics in Networks, November 2011, 1–6

  31. Khairi, S., Raouyane, B., Bellafkih, M.: Novel QoE monitoring and management architecture with eTOM for SDN-based 5G networks. Cluster Computing 23, 1–12 (2020)

    Article  Google Scholar 

  32. Liao, D., Wu, Y., Wu, Z., Zhu, Z., Zhang, W., Sun, G., Chang, V.: AI-based software-defined virtual network function scheduling with delay optimization. Clust. Comput. 22, 13897–13909 (2019)

  33. Darabseh, A., Freris, N. M.: A software-defined architecture for control of IoT cyberphysical systems. Clust. Comput. 22, 1107–1122 (2019)

    Article  Google Scholar 

  34. Heller, B., Sherwood, R., Mckeown, N.: The controller placement problem, in the first workshop on Hot topics in software defined networks (HotSDN '12). 2012, 7–12

  35. Yu, M., Rexford, J., Freedman, M. J., Wang, J.: Scalable Flow-Based Networking with DIFANE. ACM SIGCOMM Comput. Commun. Rev. 40(4), 351–362 (2010)

    Article  Google Scholar 

  36. Koponen, T., Casado, M., Gude, N., Stribling, J., Poutievskiy, L., Zhuy, M., Ramanathany, R., Iwataz, Y., Inouez, H., Hamaz, T., Shenker, S.: Onix: A Distributed Control Platform for Large-scale Production Networks, in the 9th USENIX conference on Operating systems design and implementation (OSDI'10). 2010

  37. Yeganeh, S. H., Ganjali, Y.: Kandoo: A framework for efficient and scalable, in the first workshop on Hot topics in software defined networks(HotSDN '12). 2012. p. 19–24

  38. Jain, S., Kumar, A., Mandal, S., Ong, J., Poutievski, L., Singh, A., Venkata, S., Wanderer, J., Zhou, J., Zhu, M., Zolla, J., Hölzle, U., Stuart, S., Vahdat, A.: B4: Experience with a globally-deployed software defined WAN, in ACM SIGCOMM 2013 conference on SIGCOMM, (SIGCOMM’13). August 12–16, 2013: Hong Kong, China. p. 3–14

  39. Chandra, T. D., Griesemer, R., Redstone, J.: Paxos made live - an engineering perspective, in the twenty-sixth annual ACM symposium on principles of distributed computing (PODC '07). 2007: New York, NY, USA. pp. 398–407

  40. Molina, G.: Application of Byzantine Agreement in Database Systems. ACM Transactions on Database Systems 11(1), 24–27 (1986)

    Google Scholar 

  41. Gifford, G. K.: Weighted voting for replicated data, technical report, in Proceedings of the 7th ACM symposium on operating system principles (SOSP), December 1979, 150–162

  42. Wang, S. C.: A study of application of byzantine agreement flight management system, in Proceedings of Aero. & Astro. Conference (AASRC), 1990, 651–660

  43. Yan, K. Q., Chin, Y. H.: Achieving Byzantine agreement in a processor & links fallible network, conference proceedings, 8th annual phoenix conference on computers and communications, August 1989, 407–412

  44. Wang, S. C., Chin, Y. H., Yan, K. Q.: Consensus problem on a generalized network connected by unreliable transmission media, in Proceedings of international conference on computing and information, August 1990, 413–417

  45. Song, S., Park, H., Choi, B. Y., Choi, T., Zhu, H.: Control path management framework for enhancing software-defined network (SDN) reliability. IEEE Trans. Netw. Serv. Manage. 14(2), 302–316 (2017)

    Article  Google Scholar 

  46. Tootoonchian, A., Ganjali Y.: A distributed control plane for OpenFlow, in Proceedings of the Internet Network Management Conference on Research on Enterprise Networking (INM/WREN), April 2010, 3.-3

  47. Koponen, T., Casado, M., Gude, N., Stribling, J., Poutievski, L., Zhu, M., Ramanathan, R., Iwata, Y., Inoue, H., Hama, T., Shenker, S.: Onix: A distributed control platform for large-scale production networks, in Proceedings of the 9th USENIX conference on Operating Systems Design and Implementation (OSDI), October 2010, 351–364

  48. Fonseca, P., Bennesby, R., Mota, E., Passito, A.: A replication component for resilient OpenFlow-based networking, IEEE Network Operations and Management Symposium (NOMS), June 2012, 933–939

  49. Yan, K. Q., Chin, Y. H., Wang, S. C.: Optimal agreement protocol in malicious faulty processors and faulty links. IEEE Trans. Knowl. Data Eng. 4(3), 266–280 (1992)

  50. Wang, S. C., Wang, S.S., Yan, K. Q.: Optimal malicious agreement in a virtual subnet-based cloud computing environment, The eighth international conference on internet and web applications and services, September 2013, 224–229

  51. Fischer, M.: The Consensus Problem in Unreliable Distributed Systems (A Brief Survey), Lecture Notes in Computer Science, in Proceedings of the International Fault Tolerance Computer Conference, August 1983, 127–140

  52. Bar-Noy, D., Dolev, C., Dwork, R.: Strong: Shifting gears: changing algorithms on the fly to expedite Byzantine agreement. In: Proceedings of the 6th Annual ACM Symposium on Principles of Distributed Computing, April 1987, 42–51

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mao-Lun Chiang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hsieh, HC., Chiang, ML. & Chang, TY. Improving the fault-tolerance of software-defined networks with dynamic overlay agreement. Cluster Comput 24, 2597–2614 (2021). https://doi.org/10.1007/s10586-020-03224-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-020-03224-w

Keywords

Navigation