Distributed Computing

, Volume 24, Issue 6, pp 299–321 | Cite as

Consensus in the presence of mortal Byzantine faulty processes

  • Josef Widder
  • Martin Biely
  • Günther Gridling
  • Bettina Weiss
  • Jean-Paul Blanquart
Open Access
Article

Abstract

We consider the problem of reaching agreement in distributed systems in which some processes may deviate from their prescribed behavior before they eventually crash. We call this failure model “mortal Byzantine”. After discussing some application examples where this model is justified, we provide matching upper and lower bounds on the number of faulty processes, and on the required number of rounds in synchronous systems. We then continue our study by varying different system parameters. On the one hand, we consider the failure model under weaker timing assumptions, namely for partially synchronous systems and asynchronous systems with unreliable failure detectors. On the other hand, we vary the failure model in that we limit the occurrences of faulty steps that actually lead to a crash in synchronous systems.

Keywords

Consensus Byzantine fault Distributed algorithm Fault tolerance Complexity 

References

  1. 1.
    Charron-Bost B., Schiper A.: Uniform consensus is harder than consensus. J. Algorithms 51(1), 15–37 (2004)CrossRefMATHMathSciNetGoogle Scholar
  2. 2.
    Delporte-Gallet, C., Fauconnier, H., Horn, S.L., Toueg, S.: Fast fault-tolerant agreement algorithms. In: Proceedings of the 24th ACM Symposium on Principles of Distributed Computing (PODC’05), pp. 169–178. ACM Press, New York, USA (2005)Google Scholar
  3. 3.
    Lynch N.: Distributed Algorithms. Morgan Kaufman Publishers, San Francisco (1996)MATHGoogle Scholar
  4. 4.
    Lamport L., Shostak R., Pease M.: The Byzantine generals problem. ACM Trans. Program. Lang. Syst. 4(3), 382–401 (1982)CrossRefMATHGoogle Scholar
  5. 5.
    Nesterenko, M., Arora, A.: Dining philosophers that tolerate malicious crashes. In: Proceedings of the 22nd International Conference on Distributed Computing Systems (ICDCS’02), pp. 191–198. Vienna, Austria (2002)Google Scholar
  6. 6.
    Fischer, M.J., Lynch, N.A., Merritt, M.: Easy impossibility proofs for distributed consensus problems. In: Proceedings of the Fourth Annual ACM Symposium on Principles of Distributed Computing, PODC ’85, pp. 59–70. ACM, New York, USA (1985)Google Scholar
  7. 7.
    Fischer M.J., Lynch N.: A lower bound for the time to assure interactive consistancy. Inf. Process. Lett. 14(4), 198–202 (1982)CrossRefMathSciNetGoogle Scholar
  8. 8.
    Dwork C., Lynch N., Stockmeyer L.: Consensus in the presence of partial synchrony. J. ACM 35(2), 288–323 (1988)CrossRefMathSciNetGoogle Scholar
  9. 9.
    Chandra T.D., Toueg S.: Unreliable failure detectors for reliable distributed systems. J. ACM 43(2), 225–267 (1996)CrossRefMATHMathSciNetGoogle Scholar
  10. 10.
    Doudou, A., Garbinato, B., Guerraoui, R., Schiper, A.: Muteness failure detectors: specification and implementation. In: Proceedings 3rd European Dependable Computing Conference (EDCC-3). Lecture Notes in Computer Science 1667, vol. 1667, pp. 71–87. Springer, Prague, Czech Republic (1999)Google Scholar
  11. 11.
    Doudou, A., Schiper, A.: Muteness detectors for consensus with Byzantine processes. In: Proceedings of the 17th ACM Symposium on Principles of Distributed Computing (PODC-17). Puerto Vallarta, Mexico (1998)Google Scholar
  12. 12.
    Bazzi, R.A., Herlihy, M.: Enhanced fault-tolerance through Byzantine failure detection. In: 13th International Conference on Principles of Distributed Systems (OPODIS), Lecture Notes in Computer Sciences, vol. 5923, pp. 129–143. Springer (2009)Google Scholar
  13. 13.
    Dijkstra, E.W.: On the role of scientific thought. In: Selected Writings on Computing: A Personal Perspective, pp. 60–66. Springer, New York (1982). (EWD 447)Google Scholar
  14. 14.
    Dolev D., Reischuk R., Strong H.R.: Early stopping in Byzantine agreement. J. ACM 37(4), 720–741 (1990)CrossRefMATHMathSciNetGoogle Scholar
  15. 15.
    Elrad T., Francez N.: Decomposition of distributed programs into communication-closed layers. Sci. Comput. Programm. 2(3), 155–173 (1982)CrossRefMATHGoogle Scholar
  16. 16.
    Aguilera, M.K., Delporte-Gallet, C., Fauconnier, H., Toueg, S.: Consensus with Byzantine failures and little system synchrony. In: DSN ’06: Proceedings of the International Conference on Dependable Systems and Networks, pp. 147–155. IEEE Computer Society, Washington, DC, USA (2006). doi:10.1109/DSN.2006.22
  17. 17.
    Bracha G., Toueg S.: Asynchronous consensus and broadcast protocols. J. ACM 32(4), 824–840 (1985)CrossRefMathSciNetGoogle Scholar
  18. 18.
    Srikanth T., Toueg S.: Simulating authenticated broadcasts to derive simple fault-tolerant algorithms. Distrib. Comput. 2, 80–94 (1987)CrossRefGoogle Scholar
  19. 19.
    Fischer M.J., Lynch N.A., Paterson M.S.: Impossibility of distributed consensus with one faulty process. J. ACM 32(2), 374–382 (1985)CrossRefMATHMathSciNetGoogle Scholar
  20. 20.
    Perry K.J., Toueg S.: Distributed agreement in the presence of processor and communication faults. IEEE Trans. Softw. Eng. SE-12(3), 477–482 (1986)Google Scholar
  21. 21.
    Dolev D.: The Byzantine generals strike again. J. Algorithms 3(1), 14–30 (1982)CrossRefMATHMathSciNetGoogle Scholar
  22. 22.
    Fitzi, M., Maurer, U.M.: From partial consistency to global broadcast. In: Proceedings of the 32nd Annual ACM Symposium on Theory of Computing (STOC), pp. 494–503 (2000)Google Scholar
  23. 23.
    Pease M., Shostak R., Lamport L.: Reaching agreement in the presence of faults. J. ACM 27(2), 228–234 (1980)CrossRefMATHMathSciNetGoogle Scholar
  24. 24.
    Castro, M., Liskov, B.: Practical Byzantine fault tolerance. In: 3rd Symposium on Operating Systems Design and Implementation (1999)Google Scholar
  25. 25.
    Correia M., Neves N.F., Lung L.C., Veríssimo P.: Low complexity Byzantine-resilient consensus. Distrib. Comput. 17, 237–249 (2005)CrossRefGoogle Scholar
  26. 26.
    Doudou, A., Garbinato, B., Guerraoui, R.: Encapsulating failure detection: From crash to Byzantine failures. In: Reliable Software Technologies—Ada-Europe 2002. Lecture Notes in Computer Science 2361, pp. 24–50. Springer, Vienna, Austria (2002)Google Scholar
  27. 27.
    Malkhi, D., Reiter, M.: Unreliable intrusion detection in distributed computations. In: Proceedings of the 10th Computer Security Foundations Workshop (CSFW97), pp. 116–124. Rockport, MA, USA (1997)Google Scholar
  28. 28.
    Abd-El-Malek, M., Granger, G.R., Goodson, G.R., Reiter, M.K., Wylie, J.J.: Fault-scalable Byzantine fault-tolerant services. In: 20th ACM Symposium on Operating Systems Principles (SOSP’05), pp. 59–74 (2005)Google Scholar
  29. 29.
    Correia M., Neves N.F., Veríssimo P.: From consensus to atomic broadcast: Time-free Byzantine-resistant protocols without signatures. Comput. J. 49(1), 82–96 (2006)CrossRefGoogle Scholar
  30. 30.
    Martin J.P., Alvisi L.: Fast Byzantine consensus. IEEE Trans. Dependable Secur. Comput. 3(3), 202–215 (2006)CrossRefGoogle Scholar
  31. 31.
    Anceaume, E., Delporte-Gallet, C., Fauconnier, H., Hurfin, M., Le Lann, G.: Designing modular services in the scattered Byzantine failure model. In: 3rd International Symposium on Parallel and Distributed Computing (ISPDC 2004), pp. 262–269. IEEE Computer Society (2004)Google Scholar
  32. 32.
    Anceaume, E., Delporte-Gallet, C., Fauconnier, H., Hurfin, M., Widder, J.: Clock synchronization in the Byzantine-recovery failure model. In: International Conference On Principles Of Distributed Systems OPODIS 2007. Lecture Notes in Computer Science, pp. 90–104. Springer, Guadeloupe, French West Indies (2007)Google Scholar
  33. 33.
    Azadmanesh M.H., Kieckhafer R.M.: New hybrid fault models for asynchronous approximate agreement. IEEE Trans. Comput. 45(4), 439–449 (1996)CrossRefMATHGoogle Scholar
  34. 34.
    Biely, M.: An optimal Byzantine agreement algorithm with arbitrary node and link failures. In: Proceedings of 15th Annual IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS’03), pp. 146–151. Marina Del Rey, USA (2003)Google Scholar
  35. 35.
    Thambidurai, P.M., Park, Y.K.: Interactive consistency with multiple failure modes. In: Proceedings of 7th Symposium on Reliable Distributed Systems, pp. 93–100 (1988)Google Scholar
  36. 36.
    Fischer, M., Lamport, L.: Byzantine generals and transaction commit protocols. Technical Report 62, SRI International (1982)Google Scholar
  37. 37.
    Hermant J.F., Lann G.: Fast asynchronous uniform consensus in real-time distributed systems. IEEE Trans. Comput. 51(8), 931–944 (2002)CrossRefGoogle Scholar
  38. 38.
    Chandra T.D., Hadzilacos V., Toueg S.: The weakest failure detector for solving consensus. J. ACM 43(4), 685–722 (1996)CrossRefMATHMathSciNetGoogle Scholar
  39. 39.
    Charron-Bost B., Hutle M., Widder J.: In search of lost time. Inf. Process. Lett. 110(21), 928–933 (2010)CrossRefMathSciNetGoogle Scholar
  40. 40.
    Baldoni R., Hélary J.M., Raynal M., Tangui L.: Consensus in Byzantine asynchronous systems. J. Discret. Algorithms 1(2), 185–210 (2003)CrossRefMATHGoogle Scholar
  41. 41.
    Friedman R., Mostéfaoui A., Raynal M.: Simple and efficient oracle-based consensus protocols for asynchronous Byzantine systems. IEEE Trans. Dependable Secur. Comput. 2(1), 46–56 (2005)CrossRefGoogle Scholar
  42. 42.
    Kihlstrom, K.P., Moser, L.E., Melliar-Smith, P.M.: Solving consensus in a Byzantine environment using an unreliable fault detector. In: Proceedings of the International Conference on Principles of Distributed Systems (OPODIS), pp. 61–75. Chantilly, France (1997)Google Scholar
  43. 43.
    Kihlstrom K.P., Moser L.E., Melliar-Smith P.M.: Byzantine fault detectors for solving consensus. Comput. J. 46(1), 16–35 (2003)CrossRefMATHGoogle Scholar
  44. 44.
    Aguilera M.K., Chen W., Toueg S.: Failure detection and consensus in the crash-recovery model. Distrib. Comput. 13(2), 99–125 (2000)CrossRefGoogle Scholar
  45. 45.
    Delporte-Gallet, C., Fauconnier, H., Freiling, F.C., Penso, L.D., Tielmann, A.: From crash-stop to permanent omission: automatic transformation and weakest failure detectors. In: 21st International Symposium on Distributed Computing (DISC). Lecture Notes in Computer Science, vol. 4731, pp. 165–178. Springer (2007)Google Scholar
  46. 46.
    Widder, J., Gridling, G., Weiss, B., Blanquart, J.P.: Synchronous consensus with mortal Byzantines. In: Proceedings of the International Conference on Dependable Systems and Networks (DSN’07), pp. 102–111. Edinburgh, UK (2007)Google Scholar
  47. 47.
    Choy, M., Singh, A.K.: Efficient fault tolerant algorithms for resource allocation in distributed systems. In: Proceedings of the Twenty-fourth Annual ACM Symposium on Theory of Computing, STOC ’92, pp. 593–602. ACM, New York, USA (1992)Google Scholar
  48. 48.
    Yamauchi, Y., Masuzawa, T., Bein, D.: Adaptive containment of time-bounded Byzantine faults. In: 12th International Symposium Stabilization, Safety, and Security of Distributed Systems (SSS 2010). Lecture Notes in Computer Science, vol. 6366, pp. 126–140. Springer (2010)Google Scholar
  49. 49.
    Turpin R., Coan A.B.: Extending binary Byzantine agreement to multivalued Byzantine agreement. Inf. Process. Lett. 18(2), 73–76 (1984)CrossRefGoogle Scholar
  50. 50.
    Mostefaoui A., Raynal M., Tronel F.: From binary consensus to multivalued consensus in asynchronous message-passing systems. Inf. Process. Lett. 73(5–6), 207–212 (2000)CrossRefMathSciNetGoogle Scholar
  51. 51.
    Zhang J., Chen W.: Bounded cost algorithms for multivalued consensus using binary consensus instances. Inf. Process. Lett. 109(17), 1005–1009 (2009)CrossRefMATHGoogle Scholar

Copyright information

© The Author(s) 2011

Authors and Affiliations

  • Josef Widder
    • 1
  • Martin Biely
    • 2
  • Günther Gridling
    • 3
  • Bettina Weiss
    • 3
  • Jean-Paul Blanquart
    • 4
  1. 1.Formal Methods in Systems Engineering Group E184/4Technische Universität wienViennaAustria
  2. 2.Ecole polytechnique Fédérale de Lausanne (EPFL)LausanneSwitzerland
  3. 3.Embedded Computing Systems Group E182/2Technische Universität wienViennaAustria
  4. 4.Astrium SatellitesToulouseFrance

Personalised recommendations