Abstract
In this paper, we consider lightweight decentralised algorithms for achieving consensus in distributed systems. Each member of a distributed group has a private value from a fixed set consisting of, say, two elements, and the goal is for all members to reach consensus on the majority value. We explore variants of the voter model applied to this problem. In the voter model, each node polls a randomly chosen group member and adopts its value. The process is repeated until consensus is reached. We generalise this so that each member polls a (deterministic or random) number of other group members and changes opinion only if a suitably defined super-majority has a different opinion. We show that this modification greatly speeds up the convergence of the algorithm, as well as substantially reducing the probability of it reaching consensus on the incorrect value.
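To make the update rule concrete, here is a minimal simulation sketch of the \((m,d)\) polling algorithm on a complete graph. This is our illustration rather than the authors' code: it assumes polling with replacement (a node may even sample itself), and it simulates the embedded jump chain by picking a uniformly random node at each step, which is equivalent to running independent Exp(1) clocks at the nodes.

```python
import random

def polling_consensus(N, m, d, p1=0.4, max_steps=10**6, seed=0):
    """Simulate the (m, d) polling rule on a complete graph of N nodes.

    At each step a uniformly chosen node polls m nodes (with replacement)
    and flips its opinion if at least d of the sampled opinions disagree
    with its own.  Returns the consensus value, or None if consensus is
    not reached within max_steps.
    """
    rng = random.Random(seed)
    # A fraction p1 of the nodes initially hold opinion 1, the rest 0.
    opinions = [1 if i < int(p1 * N) else 0 for i in range(N)]
    ones = sum(opinions)
    for _ in range(max_steps):
        if ones == 0 or ones == N:
            return opinions[0]  # consensus reached
        v = rng.randrange(N)    # node whose exponential clock rings
        sample = [opinions[rng.randrange(N)] for _ in range(m)]
        disagree = sum(1 for s in sample if s != opinions[v])
        if disagree >= d:       # super-majority disagrees: adopt their value
            ones += 1 if opinions[v] == 0 else -1
            opinions[v] = 1 - opinions[v]
    return None

# The classical voter model is the special case m = d = 1; with m = 5,
# d = 4 and a 40/60 split, consensus on the majority value 0 is typical.
print(polling_consensus(N=200, m=5, d=4))
```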
References
Abdullah, M.A., Draief, M.: Consensus on the initial global majority by local majority polling for a class of sparse graphs (2012). arXiv:1209.5025
Benezit, F., Thiran, P., Vetterli, M.: Interval consensus: from quantized gossip to voting. In: Proceedings of IEEE ICASSP, pp. 3661–3664 (2009)
Cooper, C., Elsässer, R., Ono, H., Radzik, T.: Coalescing random walks and voting on connected graphs (2013). arXiv:1204.4106
Donnelly, P., Welsh, D.: Finite particle systems and infection models. Math. Proc. Camb. Phil. Soc. 94, 167–182 (1983)
Doyle, P.G., Snell, J.L.: Random Walks and Electrical Networks. The Mathematical Association of America, Washington, D.C. (1984)
Draief, M., Vojnovic, M.: Convergence speed of binary interval consensus. SIAM J. Control Optim. 50, 1087–1109 (2012)
Hassin, Y., Peleg, D.: Distributed probabilistic polling and applications to proportionate agreement. In: Proceedings of 26th International Colloquium on Automata, Languages & Programming, pp. 402–411 (1999)
Hendrickx, J.M., Olshevsky, A., Tsitsiklis, J.N.: Distributed anonymous discrete function computation and averaging. IEEE Trans. Automat. Control 56(10), 2276–2289 (2011)
Kanoria, Y., Montanari, A.: Majority dynamics on trees and the dynamic cavity method. Ann. Appl. Probab. 21(5), 1694–1748 (2011)
Kashyap, A., Basar, T., Srikant, R.: Quantized consensus. Automatica 43(7), 1192–1203 (2007)
Mosk-Aoyama, D., Shah, D.: Fast distributed algorithms for computing separable functions. IEEE Trans. Inf. Theory 54(7), 2997–3007 (2008)
Mossel, E., Neeman, J., Tamuz, O.: Majority dynamics and aggregation of information in social networks. Auton. Agent Multi-Agent Syst. 28(3), 408–429 (2014)
Olshevsky, A., Tsitsiklis, J.: Convergence speed in distributed consensus and averaging. SIAM J. Control Optim. 48, 33–55 (2009)
Perron, E., Vasudevan D., Vojnovic, M.: Using three states for binary consensus on complete graphs. In: Proceedings of IEEE Infocom, IEEE Communications Society, New York (2009)
Shang, S., Cuff, P., Hui, P., Kulkarni, S.: An upper bound on the convergence time for quantized consensus. In: Proceedings of IEEE Infocom, IEEE Communications Society, New York (2013)
Surowiecki, J.: The Wisdom of Crowds: Why the Many Are Smarter Than the Few and How Collective Wisdom Shapes Business, Economies, Societies and Nations. Doubleday, New York (2004)
Acknowledgments
The authors would like to thank the reviewers and the associate editor for their helpful and insightful comments, which have improved this paper.
Appendix: Proof of Theorems
Throughout the proofs we adopt the convention that any summation whose lower index is larger than its upper index is taken to be zero.
Proof (of Theorem 1)
Let \(X(t)\) denote the number of nodes in state 1 at time \(t\). As we are considering the \((m,m)\) algorithm on a complete graph with \(N\) nodes, the dynamics of \(X\) are governed by a continuous time Markov chain on the state space \(\{ 0,1,\ldots , N \}\), with absorbing states at \(0\) and \(N\), and with transition rates given by
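For orientation, a plausible form of these rates, assuming each node updates at rate 1, polls \(m\) group members uniformly at random with replacement, and flips precisely when all \(m\) sampled opinions differ from its own, is
\[ q(i,\,i+1) = (N-i)\Bigl(\frac{i}{N}\Bigr)^{m}, \qquad q(i,\,i-1) = i\Bigl(\frac{N-i}{N}\Bigr)^{m}. \]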
We are interested in the probability of \(X\) hitting state \(N\) before state \(0\), and hence being absorbed in state \(N\).
If the initial system state is \(X(0)=i<N/2\), then hitting state \(N\) is equivalent to the system reaching consensus on the incorrect (minority) value. Recall from (1) that we defined
as the hitting probability of state \(N\) from an initial state with \(i\) nodes in state 1. As discussed in Sect. 3.1, we obtain by conditioning on the first jump that the hitting probabilities \(h_N(\frac{i}{N})\), \(i=0,1,\ldots ,N\), satisfy the recursion (4).
In order to solve this recursion, we make use of an analogy with electrical resistor networks (see Doyle and Snell [5]), as outlined in Sect. 3.1. Let \(R_i\) be the resistance between nodes \(i\) and \(i-1\) in a series network of \(N\) resistors, and let \(V_i\) denote the voltage at node \(i\), i.e. the node between resistors \(R_i\) and \(R_{i+1}\). Set \(V_0=0\) and \(V_N=1\). If we normalise \(R_1=1\), and take
then the hitting probabilities \(h_N(\frac{i}{N})\), \(i=1,\ldots ,N-1\) satisfy the same Eq. (3) as the voltages \(V_i\), \(i=1,\ldots ,N-1\) in the resistor network described above. We now solve for these voltages.
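For orientation, under the \((m,m)\) rates sketched above, the resistances would satisfy
\[ \frac{R_{i+1}}{R_i} \;=\; \frac{p_{i,\,i-1}}{p_{i,\,i+1}} \;=\; \frac{i\,\bigl((N-i)/N\bigr)^{m}}{(N-i)\,\bigl(i/N\bigr)^{m}} \;=\; \Bigl(\frac{N-i}{i}\Bigr)^{m-1}. \]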
We obtain from the above recursion for the resistor values that
where we use the convention that an empty product is equal to 1. Now, by Ohm’s law, the current through the series resistor network is given by \(1/(R_1+R_2+\ldots +R_N)\). Consequently, the voltage at node \(i\) is \(V_i=\sum _{k=1}^i R_k/\sum _{k=1}^N R_k\). Since this is the same as the hitting probability \(h_N(i/N)\), we obtain that
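Under the recursion sketched above, and using \(\prod _{j=1}^{k-1} (N-j)/j = \binom{N-1}{k-1}\), this plausibly amounts to
\[ R_k = \binom{N-1}{k-1}^{m-1}, \qquad h_N\Bigl(\frac{i}{N}\Bigr) = \frac{\sum _{k=1}^{i} \binom{N-1}{k-1}^{m-1}}{\sum _{k=1}^{N} \binom{N-1}{k-1}^{m-1}}. \]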
For \(\alpha \in [0,1]\) such that \(\alpha N\) is not an integer, we defined \(h_N(\alpha )\) in (2) by linear interpolation. Since \(h_N(i/N)\) is increasing in \(i\), it follows that
We shall obtain an upper bound on the numerator and a lower bound on the denominator in the expression above. First observe that, if \(i\le \alpha N\), then
and so
for all \(i\ge 1\). It follows that
Now, if \(\alpha \in (0,1/2)\) as in the statement of the theorem, then
for a constant \(c\) that does not depend on \(N\). Combining this with Stirling’s formula, we obtain that
where we recall that \(H(\cdot )\) denotes the binary entropy function. Likewise, we obtain using Stirling’s formula that
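The standard estimate invoked in both bounds is the Stirling approximation of binomial coefficients,
\[ \binom{N}{\lfloor \alpha N \rfloor } = \exp \bigl( N H(\alpha ) + O(\log N) \bigr), \qquad H(\alpha ) = -\alpha \log \alpha - (1-\alpha )\log (1-\alpha ), \]
which is exponentially tight, with only a polynomial correction in \(N\).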
Substituting (6) and (7) in (5), we obtain that
where \(c>0\) is a constant that may depend on \(\alpha \) but does not depend on \(N\). This establishes the first claim of the theorem.
In order to establish the second claim about logarithmic equivalence, we need a lower bound on \(h_N(\alpha )\) that matches the above upper bound to within a term that is subexponential in \(N\). We obtain such a bound from
by obtaining a lower bound on the numerator and an upper bound on the denominator. First, an upper bound on the denominator is as follows:
since \(H(1/2)=\log 2\). Next, we can get a lower bound on the numerator by simply replacing the sum with its last term, corresponding to \(i=\lfloor \alpha N \rfloor -1\). By Stirling’s formula, this gives a lower bound which is identical to the upper bound in (6), up to a constant. Hence, we get
Since this agrees with the upper bound on \(h_N\) up to a factor polynomial in \(N\), the claimed logarithmic equivalence follows. This completes the proof of the theorem. \(\square \)
Proof (of Theorem 2)
We consider the \((m,d)\) algorithm. Let \(X(t)\) denote the number of nodes in state 1 at time \(t\), and let \(Z_x\) denote a Binomial (\(m,x\)) random variable. Then \(X(t)\) evolves as a continuous time Markov chain with transition rates
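A plausible form of the rates in (8), consistent with the jump probabilities used below in the proof of Lemma 1, is
\[ q(i,\,i+1) = (N-i)\,\mathbb {P}(Z_{i/N} \ge d), \qquad q(i,\,i-1) = i\,\mathbb {P}(Z_{i/N} \le m-d); \]
that is, each node updates at rate 1 and flips precisely when at least \(d\) of its \(m\) sampled opinions disagree with its own.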
As before, we denote the hitting probability of state \(N\) given that we start in state \(i\) as
We extend the definition of \(h_N\) to \([0,1]\) by linear interpolation, as in (2).
By conditioning on the first transition, we observe that the hitting probabilities satisfy the recurrence relation (4), with the transition probabilities \(p_{i,\,i+1}\) and \(p_{i,\,i-1}\) being proportional to the rates specified above. Hence, as in the proof of Theorem 1, we can solve the recurrence using an electrical network analogy. The hitting probabilities \(h_N(i/N)\) satisfy the same equations as the voltages \(V_i\) in a series resistor network with \(V_0=0\) and \(V_N=1\), where the resistances are related by
Using the definition of the function \(g\) in the statement of Theorem 2, we can rewrite this as \(R_{i+1}= g(i/N) R_i\). Taking \(R_1=1\) without loss of generality, and solving for the voltages in this resistor network, we obtain
As usual, we take empty sums to be 0 and empty products to be 1. In order to obtain an upper bound on \(h_N(i/N)\), we seek an upper bound on the numerator and a lower bound on the denominator. We shall bound the numerator from above by \(N\) times the largest summand, and the denominator from below by a single summand, corresponding to \(i=\lfloor N/2 \rfloor \).
We assumed in the statement of Theorem 2 that \(m\ge 2\) and \(d>m/2\). Now,
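Given the relation \(R_{i+1} = g(i/N) R_i\) and the rates sketched above, a plausible reading of \(g\) is
\[ g(x) \;=\; \frac{x\,\mathbb {P}(Z_{1-x} \ge d)}{(1-x)\,\mathbb {P}(Z_{x} \ge d)} \;=\; \frac{\sum _{k=d}^{m} \binom{m}{k} (1-x)^{k} x^{m-k+1}}{\sum _{k=d}^{m} \binom{m}{k} x^{k} (1-x)^{m-k+1}}, \]
using \(\mathbb {P}(Z_x \le m-d) = \mathbb {P}(Z_{1-x} \ge d)\).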
Comparing the summands term by term, their ratio is \(\bigl (\frac{x}{1-x}\bigr )^{m-2k+1}\) for index \(k\). This is at least 1 for all \(x<1/2\), because \(m-2k+1\) is negative or zero for all \(k\ge d>m/2\). Therefore, we conclude that \(g(x)\ge 1\) for all \(x\le 1/2\). Using this fact, we see from (9) that the summands in the numerator are non-decreasing in \(j\), and hence the largest summand corresponds to \(j=i\). Noting, also, that the function \(h_N\) is non-decreasing, we obtain that
Taking logarithms and letting \(N\) tend to infinity, we get
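A plausible shape for the resulting bound, with the polynomial prefactor absorbed into the error term, is
\[ \frac{1}{N} \log h_N(\alpha ) \;\le \; -\frac{1}{N} \sum _{i=\lceil \alpha N \rceil }^{\lfloor N/2 \rfloor -1} \log g\Bigl(\frac{i}{N}\Bigr) + O\Bigl(\frac{\log N}{N}\Bigr). \]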
In order to show that the above sum converges to the Riemann integral \(\int _{\alpha }^{1/2} \log g(x)\,dx\), as claimed in the theorem, it suffices to show that the function \(\log g\) is continuous over this compact interval. Now, from (10), the function \(g\) is a ratio of polynomials that are bounded away from zero on the interval \([\alpha ,1/2]\), and the claim follows. \(\square \)
Proof (of Theorem 3)
We consider the \((m,d)\) algorithm. Let \(X(t)\) denote the number of nodes in state 1 at time \(t\), and let \(Z_x\) denote a Binomial (\(m,x\)) random variable. Then \(X(t)\) evolves as a continuous time Markov chain with transition rates given by (8). We shall bound the time to consensus by introducing a simpler Markov chain whose associated hitting times stochastically dominate those of the consensus process. This new chain can be viewed as a simple random walk with a negative drift and a reflecting upper boundary.
Fix \(\epsilon \in (0,1)\). We shall define a birth–death Markov chain \(Y\) on the integers \(\{ 0,1, \ldots , N\}\) by specifying the transition probabilities of the jump chain and the holding times in each state. The definitions involve parameters \(\beta \in (0,1)\) and \(c_1,c_2 >0\) that will be specified later. The transition probabilities of the jump chain are given by
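A plausible reconstruction of these probabilities, matching the verbal description that follows, is
\[ p_{i,\,i-1} = \beta = 1 - p_{i,\,i+1} \;\; \text{for } 0<i<\tfrac{(1-\epsilon )N}{2}, \qquad p_{i,\,i-1} = p_{i,\,i+1} = \tfrac{1}{2} \;\; \text{for } \tfrac{(1-\epsilon )N}{2} \le i \le \tfrac{(1+\epsilon )N}{2}, \]
with the mirror-image probabilities (upward drift \(\beta \)) for \(\tfrac{(1+\epsilon )N}{2}<i<N\),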
while \(p_{0,\,0}=p_{N,\,N}=1\). The holding times are exponentially distributed, with rates \(c_1 i\) in state \(i<(1-\epsilon )N/2\), \(c_1 (N-i)\) for \(i>(1+\epsilon )N/2\), and \(c_2 N\) for \((1-\epsilon )N/2 \le i \le (1+\epsilon )N/2\). Note that the jump probabilities and holding times are symmetric about \(N/2\), and that \(0\) and \(N\) are absorbing states. The jump chain behaves as a symmetric random walk in the central section, and as a random walk with drift towards the boundaries in the outer sections.
We shall show that, for a suitable choice of \(\beta \), \(c_1\) and \(c_2\), the time to absorption of the consensus process \(X(t)\) is stochastically dominated by that of \(Y(t)\). By explicitly bounding the latter, we shall obtain a bound on the time to reach consensus. Recall that \(Z_x\) denotes a Binomial (\(m,x\)) random variable. We take
Define \(\tilde{X}(t) = \min \{ X(t), N-X(t) \}\), and \(\tilde{Y}(t) = \min \{ Y(t), N-Y(t) \}\). Then \(\tilde{X}\) and \(\tilde{Y}\) are Markov processes on \(\{ 0,1,\ldots ,\lfloor N/2 \rfloor \}\), with the same jump rates and holding times as \(X\) and \(Y\), respectively, except at state \(\lfloor N/2 \rfloor \) where there is no upward jump, and where the holding time is suitably modified to account for the censored jumps. Moreover, the modified processes \(\tilde{X}\) and \(\tilde{Y}\) have a unique absorbing state at 0, and the same time to absorption as \(X\) and \(Y\), respectively.
The claim of the theorem will be immediate from the following two lemmas. In fact, they establish that \(\tau ^0_N(x)\) is no bigger than \(\tilde{\tau }^0_N(x)\) (defined below), and that the latter is \(O(\log N)\). To show that \(\tau ^0_N(x)\) is in fact \(\Theta (\log N)\), observe that for consensus to be reached, either every node in the minority state has to have updated its opinion at least once, or every node in the majority state needs to have done so. As each node updates its opinion after independent Exp(1) times, this involves the maximum of \(O(N)\) independent Exp(1) random variables, which has a mean of order \(\log N\). \(\square \)
Lemma 1
For given initial conditions \(\tilde{X}(0) \le \tilde{Y}(0)\), the stochastic processes \(\tilde{X}(t)\) and \(\tilde{Y}(t)\) can be coupled in such a way that \(\tilde{X}(t) \le \tilde{Y}(t)\) for all \(t\ge 0\). In particular, the process \(\tilde{X}\) first falls below level \(\lfloor \alpha N \rfloor \), and likewise hits the absorbing state 0, no later than \(\tilde{Y}\) does the same.
Lemma 2
Let \(x\in (0,1/2)\) and \(\alpha \in (0,x)\) be given. Fix \(\epsilon \in (0,1-2x)\), and let the processes \(Y\) and \(\tilde{Y}\) be defined as above. Let
where the subscript on the expectation denotes the initial state \(\tilde{Y}(0)\). Then, \(\tilde{\tau }^{\alpha }_N(x)=O(1)\) and \(\tilde{\tau }^0_N(x)=O(\log N)\).
Proof (of Lemma 1)
Let \(i< N/2\). The jump probability from \(i\) to \(i-1\) for the Markov process \(\tilde{X}(t)\) is the same as for \(X(t)\) and is given by
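Given the rates (8), this jump probability is presumably
\[ p_{i,\,i-1} \;=\; \frac{i\,\mathbb {P}(Z_{i/N} \le m-d)}{i\,\mathbb {P}(Z_{i/N} \le m-d) + (N-i)\,\mathbb {P}(Z_{i/N} \ge d)}. \]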
If \(i<(1-\epsilon )N/2\), then this quantity is no smaller than the corresponding jump probability \(\beta \) for \(\tilde{Y}(t)\), defined in (12); this is because \(\mathbb {P}(Z_{i/N} \le m-d)\) is decreasing in \(i\), while \(\mathbb {P}(Z_{i/N} \ge d)\) is increasing. Also, if \(d\) is bigger than \(m/2\), as we assume, then, for all \(i\) between \((1-\epsilon )N/2\) and \(N/2\), the jump probability of \(X\) from \(i\) to \(i-1\) is no smaller than a half, since
Next, we compare the holding times in different states for the two processes. The rate of moving out of state \(i\) for the \(\tilde{X}\) process is
whereas for the \(\tilde{Y}\) process it is
As \(\mathbb {P}(Z_{i/N}\le m-d)\) is decreasing in \(i\), it is clear that this rate is greater for the \(\tilde{X}\) process if \(i<(1-\epsilon )N/2\). It can be seen using (13) that it is also greater for \((1-\epsilon )N/2 \le i < \lfloor N/2 \rfloor \).
Thus, we have shown that the jump probability from \(i\) to \(i-1\) for \(\tilde{X}\) is no smaller than that for \(\tilde{Y}\), and that the holding time in each state is no greater (in the standard stochastic order). The existence of the claimed coupling follows from these facts. Indeed, such a coupling can be constructed by letting the processes evolve independently when \(\tilde{X}(t) \ne \tilde{Y}(t)\), but, when they are equal, sampling the residual holding times and the subsequent jumps jointly so as to respect the desired ordering. \(\square \)
Proof (of Lemma 2)
Define \(\hat{Y}(t) = \min \{ \tilde{Y}(t), \lceil \frac{(1-\epsilon )N}{2} \rceil \}\). Then, \(\hat{Y}(t)\) is a semi-Markov process. The associated jump chain is Markovian with the same transition probabilities as the \(\tilde{Y}(t)\) process, except that the only possible transition from \(\lceil \frac{(1-\epsilon )N}{2} \rceil \) is to \(\lceil \frac{(1-\epsilon )N}{2} \rceil -1\). The holding times in all states other than \(\lceil \frac{(1-\epsilon )N}{2} \rceil \) are exponential with the same rates as in the process \(Y(t)\), but the holding times in \(\lceil \frac{(1-\epsilon )N}{2} \rceil \) have the distribution of the exit time of the process \(Y(t)\) from the interval \(\{ \lceil \frac{(1-\epsilon )N}{2} \rceil , \ldots , \lceil \frac{(1+\epsilon )N}{2} \rceil \}\) on each visit.
Let \(t_j\) denote the mean time spent by the process \(\hat{Y}(t)\) in state \(j\) during each visit. Then \(t_j = 1/(jc_1)\) for \(j<\lceil \frac{(1-\epsilon )N}{2} \rceil \). In order to compute \(t_{\lceil (1-\epsilon )N/2 \rceil }\), we first note that, by well-known results for the symmetric random walk, the mean number of steps for \(Y(t)\) to exit the interval \(\{ \lceil \frac{(1-\epsilon )N}{2} \rceil , \ldots , \lceil \frac{(1+\epsilon )N}{2} \rceil \}\) after entering it at the boundary is \(\lceil \epsilon N \rceil \). Moreover, the mean holding time in each state in this interval is \(1/(Nc_2)\) by definition. Hence, the mean exit time for \(Y(t)\) from this interval, which is also the mean holding time for \(\hat{Y}(t)\) in \(\lceil \frac{(1-\epsilon )N}{2} \rceil \), is \(\epsilon /c_2\). This is a constant that does not depend on \(N\).
Now, \(\tilde{\tau }_N^0(x)\) is the mean time for \(\tilde{Y}\), and hence \(\hat{Y}\), to hit 0. By decomposing this into the number of visits to each intermediate state, and the expected time in each state during each visit, we can write
where \(f_{ij}\) denotes the probability that \(\hat{Y}\), started in state \(i\), hits state \(j\) before 0, and \(n_j\) denotes the mean number of visits to state \(j\) conditional on ever visiting it. If we let \(f_{jj}\) denote the return probability to state \(j\) before hitting 0, then the number of visits to \(j\) is geometrically distributed with parameter \(f_{jj}\) by the Markov property, and so \(n_j = 1/(1-f_{jj})\). Using well-known results for the gambler’s ruin problem, we have
where we recall that \(\beta >1/2\) is the transition probability from \(i\) to \(i-1\) for the jump chain associated with \(\hat{Y}(t)\), which is a biased random walk with reflection at \(\lceil (1-\epsilon ) N/2 \rceil \). By conditioning on the first step, we also have
Solving for \(f_{jj}\) and substituting in \(n_j=1/(1-f_{jj})\), we obtain after some tedious calculations that
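One plausible reading of these calculations, writing \(r = \beta /(1-\beta ) > 1\): conditioning on the first step gives \(f_{jj} = (1-\beta )\cdot 1 + \beta f_{j-1,\,j}\) (from above \(j\), the walk must pass through \(j\) on its way to 0), and the gambler's ruin formula gives \(f_{j-1,\,j} = (r^{j-1}-1)/(r^{j}-1)\), whence
\[ n_j \;=\; \frac{1}{1-f_{jj}} \;=\; \frac{r^{j}-1}{\beta \, r^{j-1}(r-1)} \;\le \; \frac{r}{\beta (r-1)} \;=\; \frac{1}{2\beta -1}. \]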
But \(\beta \) is a constant that does not depend on \(N\), and is strictly bigger than a half under the assumption that \(d>m/2\) made in the statement of the theorem. Hence, we see that the expected number of returns to any state is bounded uniformly by a finite constant.
Now, substituting the results obtained above for \(n_j\) and \(t_j\) in (14), and noting that \(f_{\lfloor xN \rfloor ,j} \le 1\) for all \(j\) as it is a probability, we obtain that
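The bound plausibly takes the shape
\[ \tilde{\tau }^0_N(x) \;\le \; \frac{1}{2\beta -1} \Bigl( \frac{\epsilon }{c_2} + \frac{1}{c_1} \sum _{j=1}^{\lceil (1-\epsilon )N/2 \rceil } \frac{1}{j} \Bigr) \;=\; O(\log N), \]
using the uniform bound on \(n_j\) and the harmonic sum \(\sum _{j \le n} 1/j = O(\log n)\).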
In fact, a slightly more careful analysis shows that the time to absorption is \(O(\log \tilde{Y}(0))\), which is a tighter bound when the initial condition grows more slowly than a constant fraction of \(N\), i.e. when the population is already close to consensus to start with.
The derivation of the upper bound on \(\tilde{\tau }^{\alpha }_N(x)\) is very similar, except that we get an analogue of (14) where the sum only runs over \(j\ge \alpha N\). Moreover, the terms in the sum have the interpretation of hitting probabilities and number of returns before crossing the level \(\alpha N\), which is bounded by the hitting probabilities and number of returns before hitting 0. The details are omitted. \(\square \)
Proof (of Theorem 4)
Let \(X(t)\) be the number of nodes in state 1 at time \(t\). The dynamics of \(X\) are governed by a continuous time Markov chain. Before stating the transition rates, we define two functions \(p_1\) and \(p_2\) by
and
where \(Z_{m,\,x}\) denotes a random variable with the Bin \((m,x)\) distribution as before, but we now make the dependence on \(m\) explicit in the notation. Then the transition rates of the Markov chain are
where \(\mathbb {E}_{(M,\,D)}\) is expectation taken with respect to the random choice of \((M,D)\), the algorithm to be used. Finding the hitting probability and the times to consensus follows exactly the same process as in the deterministic case, with minor changes. Therefore, we do not include the proofs here. \(\square \)
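For completeness, a plausible form of the definitions and rates in this proof, by direct analogy with (8), is
\[ p_1(x) = \mathbb {P}(Z_{M,\,x} \ge D), \qquad p_2(x) = \mathbb {P}(Z_{M,\,x} \le M-D), \]
so that
\[ q(i,\,i+1) = (N-i)\,\mathbb {E}_{(M,\,D)}\bigl[ p_1(i/N) \bigr], \qquad q(i,\,i-1) = i\,\mathbb {E}_{(M,\,D)}\bigl[ p_2(i/N) \bigr]. \]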