Refined Mean Field Analysis: The Gossip Shuffle Protocol Revisited

Gast, Nicolas; Latella, Diego; Massink, Mieke

doi:10.1007/978-3-030-50029-0_15

Nicolas Gast¹⁰,
Diego Latella¹¹ &
Mieke Massink¹¹

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 12134))

Included in the following conference series:

International Conference on Coordination Languages and Models

1077 Accesses
1 Citations

Abstract

Gossip protocols form the basis of many smart collective adaptive systems. They are a class of fully decentralised, simple but robust protocols for the distribution of information throughout large scale networks with hundreds or thousands of nodes. Mean field analysis methods have made it possible to approximate and analyse performance aspects of such large scale protocols in an efficient way that is independent of the number of nodes in the network. Taking the gossip shuffle protocol as a benchmark, we evaluate a recently developed refined mean field approach. We illustrate the gain in accuracy this can provide for the analysis of medium size models analysing two key performance measures: replication and coverage. We also show that refined mean field analysis requires special attention to correctly capture the coordination aspects of the gossip shuffle protocol.

This research has been partially supported by the Italian MIUR project PRIN 2017FTXR7S “IT-MaTTerS” (Methods and Tools for Trustworthy Smart Systems).

You have full access to this open access chapter, Download conference paper PDF

FlyFast: A Mean Field Model Checker

Community-Based Gossip Algorithm for Distributed Averaging

Topological network features determine convergence rate of distributed average algorithms

Article Open access 17 December 2022

Keywords

1 Introduction and Related Work

Many collective adaptive systems rely on the decentralised distribution of information. Gossip protocols have been proposed as a paradigm that can provide a stable, scalable and reliable method for such decentralised spreading of information [2,3,4, 6, 8, 9, 16, 21, 22]. The basic mechanism of information spreading followed by a gossip shuffle protocol is that nodes exchange part of the data they keep in their cache with randomly selected peers in pairwise synchronous communications on a regular basis.

Interesting performance aspects of such gossip protocols are the replication of a newly inserted fresh data element in a network and the dynamics of network coverage. Replication of a data element occurs when nodes exchange the data element in pairwise communication. Network coverage concerns the fraction of the population of network nodes that have “seen” the data element since its introduction into the network, even if they may no longer have it in their cache due to further exchanges with other peers.

Traditionally, these performance measures have been studied based on simulation models. However, when large populations of nodes are involved, such simulations may be very resource consuming. Recently these protocols have been studied using classic mean field approximation techniques [1, 2]. In that classic approach the full stochastic model of a gossip network, i.e. one in which each node is modelled individually, is replaced by a much simpler model in which the pairwise synchronous interactions between individual nodes are replaced by the average effect that all those interactions have on a single node and then the model of this single node is studied in the context of the overall average network behaviour. Of course, the average effects may change over time as nodes change their local states. This is taken into account in a mean field model by letting the probabilities of interactions depend on the fraction of nodes that are in a particular local state. Compared to traditional simulation methods, mean field approximation techniques scale very well to large populations because these techniques are independent of the exact population size^{Footnote 1} allowing analysis that is orders of magnitudes faster than discrete event simulation. This method of derivation of a mean field model from a large population of interacting objects relies on what is known as the assumption of “propagation of chaos” (also called “statistical independence” or “decoupling of joint probabilities”) [7, 10, 17, 19]. The assumption is based on the fact that when the number of interacting nodes becomes very large, their interactions tend to behave as if they were statistically independent.

In this paper we revisit an analysis of the gossip shuffle protocol by Bahkshi et al. in [1, 2, 4] by using a refined mean field approximation for discrete time population models that we developed in [12, 13], and which was in turn inspired by an earlier result for continuous time population models presented in [11].

Contributions. The main contribution of this short paper (full version in [14]) is a novel benchmark (clock-synchronous) DTMC population model of the gossip shuffle protocol analysed using refined mean field analysis [12, 13]. In particular:

We show that, by using the refined mean field, a more accurate approximation can be obtained, compared to classical mean field approximation, for medium size populations for this gossip protocol, but that this requires a novel model that reflects the synchronisation effects of the pairwise interaction of the original protocol.
The refined mean field results we obtained are very close both to those of independent Java based simulation from the literature in [2] (taken as “ground truth” for comparison with our results) and to those of the event simulation of the model itself, but several orders of magnitude faster and independent of the system size.

Like classic mean field approaches, the refined approach is also highly scalable and computationally non-intensive. Therefore it is an interesting candidate for being integrated with other analysis approaches such as (on-the-fly) mean field model checking [18], which is planned in future work. The current study aims at providing further insight in the feasibility of applying the refined mean field approach, that implies the use of symbolic differentiation, on larger benchmark examples and in the possible complications of such an analysis that need to be taken into consideration.

2 Benchmark Gossip Shuffle Protocol

We consider the gossip shuffle protocol described in [1, 2, 15]. This particular version has been extensively studied by Bahkshi et al., leading to an analytical model of the gossip protocol [3], a classical mean field model [2] and a Java implementation^{Footnote 2} of a simulator for the protocol [1, 2], which makes it a very suitable candidate of a real-world application that allows for the comparison of our results with those available in the literature. Figure 1 recalls the pseudo code of a generic shuffle protocol (adapted from [1]). Further details can be found in [1, 2].

Two main key measures that are of interest for this protocol are the transient aspects of the replication of a newly introduced element in the network and that of the coverage of the network, i.e. the fraction of network nodes that have seen the new data element when time is passing. These measures depend on a number of characteristics of the network. In the following we use N to denote the size of the network, i.e. the number of gossiping nodes, n to denote the number of different data items in the network, c to denote the size of the cache and s to denote the size of the selected items from the cache to be exchanged with a neighbour. In the context of this work, and for comparison with the results presented in [1], the network is assumed to be fully connected. We consider a discrete time variant of the protocol with a maximal delay between two subsequent active data-exchanges of a node denoted by $G_{ max }$.

3 Background

In the sequel we use theoretical results on discrete time mean field approximation [7, 12, 19]. We briefly recall the notation and main results in the following. We consider a population model of a system composed of $0 < N \in \mathrm {I\!N}$ identical interacting objects, i.e. a (model of a) system of size N. We assume that the set $\{0,\ldots , n-1\}$ of local states of each object is finite; we refer to [12] for a discussion on how to deal with infinite dimensional models. Time is discrete and the behaviour of the system is characterised by a (time homogeneous) discrete time Markov chain (DTMC) $X^{(N)}(t)=(X_1^{(N)}(t), \ldots , X_N^{(N)}(t))$, where $X_i^{(N)}(t)$ is the state of object i at time t, for $i=1,\ldots , N$.

The occupancy measure vector at time t of the model is the row-vector DTMC $M^{(N)}(t)=(M_0^{(N)}(t), \ldots , M_{n-1}^{(N)}(t))$ where, for $j=0,\ldots , n-1$, the stochastic variable $M_j^{(N)}(t)$ denotes the fraction of objects in state j at time t, over the total population of N objects:

$$\begin{aligned} M_j^{(N)}(t) = \frac{1}{N}\sum _{i=1}^N 1_{\{X_i^{(N)}(t)=j\}} \end{aligned}$$

and $1_{\{x=j\}}$ is equal to 1 if $x=j$ and 0 otherwise. At each time step $t \in \mathrm {I\!N}$ each object performs a local transition, possibly changing its state. The transitions of any two objects are assumed to be independent from each other, while the transition probabilities of an object may depend also on M(t), thus, for large N, the probabilistic behaviour of an object is characterised by the one-step transition probability $n \times n$ matrix $\mathbf {K}(m)$, where $\mathbf {K}_{ij}(m)$ is the probability for the object to jump from state i to state j when the occupancy measure vector is $m \in \mathcal{U}^n$, the unit simplex of $\mathrm {I\!R}_{\ge 0}^n$, that is, $\mathcal{U}^n=\{m \in [0,1]^n \;|\; \sum _{i=1}^{n} m_i =1\}$. In this paper, for simplicity, we assume $\mathbf {K}(m)$ to be a continuous function of $m$ that does not depend on N. In the sequel, for reasons of presentation, we provide a graphical specification of the relevant models. The computation of matrix $\mathbf {K}(m)$ from such a model specification is straightforward.

3.1 Discrete Time Classical Mean Field Approximation

Below we recall Theorem 4.1 of [19] on classic mean field approximation, under the simplifying assumptions mentioned above:

Theorem

4.1 of [19] (Convergence to Mean Field). Assume that the initial occupancy measure $M^{(N)}(0)$ converges almost surely to the deterministic limit $\mu (0)$. Define $\mu (t)$ iteratively by (for $t \ge 0$):

$$\begin{aligned} \mu (t+1) = \mu (t) \, \mathbf {K}(\mu (t)). \end{aligned}$$

(1)

Then for any fixed time t, almost surely, $ \lim _{N \rightarrow \infty } M^{(N)}(t) = \mu (t). $

The above result thus allows one to use, for large N, a deterministic approximation $\mu $ of the average behaviour of a discrete population model.

3.2 Discrete Time Refined Mean Field Approximation

The following corollary illustrates the relationship between the refined mean field result and the classic convergence theorem:

Corollary

1(i) of [12] Under the assumptions of Theorem 1 of [12], it holds that for any coordinate i and any time-step $t \in \mathrm {I\!N}$

$$\begin{aligned} {\mathchoice{\mathbb {E}\left[ M^{(N)}_i(t)\right] }{\mathbb {E}[M^{(N)}_i(t)]}{\mathbb {E}[M^{(N)}_i(t)]}{\mathbb {E}[M^{(N)}_i(t)]}} = \mu _i(t) + \frac{V_i(t)}{N} + o\left( \frac{1}{N}\right) . \end{aligned}$$

In other words, the expected value of the fraction of the objects in local state i of the full stochastic model with population size N at time t, is equal to the classic limit mean field value $\mu _i(t)$ plus a factor $V_i(t)$, divided by the population size N plus a residual amount of order $o\left( \frac{1}{N}\right) $. $V_i(t)$ satisfies a linear recurrence relation that uses differentiation of functions and the covariance of $\mu (t)$, as shown in Theorem 1 of [12] (see also [14]), and can be implemented efficiently using symbolic differentiation software packages. It is easy to see that the larger is N the smaller this additional factor gets. Essentially, the refined mean field takes not only the first moment (the mean) but also the second moment (variance) into consideration in the approximation. In [12] we have applied this discrete time refined mean field approximation on a number of examples ranging from the well-known epidemic model SEIR to wireless networks. Here we investigate its application to a novel model of the more complex gossip shuffle protocol.

A proof-of-concept implementation of both the classical and the refined mean field techniques and a discrete event simulator has been developed by one of the authors of the present paper in F${\#}$ using the DiffSharp package [5] for symbolic differentiation. The results in this paper have been obtained using this implementation which can be found at [20].

4 Refined Mean Field Approximation of the Gossip Shuffle Protocol

The classical mean field model of the gossip protocol in [1], and aggregated versions thereof in [14], are based on the principle of decoupling of joint probabilities [7, 19] and on a careful study of the pairwise probabilities of the various possible outcomes of a shuffle between two gossip nodes. This model provides reasonable accuracy for systems with tens of thousands of nodes or more. However, discrete event simulation of this model for medium size systems shows that it does not respect important properties of the original gossip shuffle protocol, in particular the property that the new data element never gets lost from the system. We have found that this is caused by an inaccurate modelling of the effects of coordination between interacting nodes (see [14] for details).

We present a novel model in which (1) the system can never completely loose the inserted data element and (2) the model reflects the effects of the pairwise interaction between nodes satisfying basic properties of the original gossip shuffle protocol while still adhering to the principle of decoupling of joint probabilities. We distinguish the effects of a node getting a data element through exchanging it with another node–in which case the total number of replicas of the data element in the system remains the same–or through replication, i.e. the other node retains its copy of the data element and the global number of the data element in the system increases by one. With reference to Fig. 2, for what concerns point (1) above, we introduce the state PD to the model representing that there always is a gossip node in the network that possesses the data element.

To address point (2), we introduce states FD and LD to distinguish between the effect of interactions between gossip nodes. State FD represents the fact that the gossip node received the data element for the first time via an exchange of the data element with another node. State LD also represents the fact that the node received the data element via an exchange, but that it had already seen the data element in the past. Note that we can retrieve the total number of gossip nodes in the system that do not possess the data element as the sum of the nodes that are in states FD, LD, I and O because for each node in state FD (LD, resp.) there is a node in the network that just lost its data element in the synchronous shuffle with our current node. A gossip node can also get involved in an interaction in which the data element is replicated, i.e. a node gives it to another one but also retains a copy itself, and one in which two nodes, both possessing the data element, interact and one of them looses its copy. Note that in this model it is not possible that both nodes loose their copy in a single interaction. The conditional probabilities of pairs of interacting nodes obtaining or loosing the data element can be expressed in terms of n (number of different data elements), c (size of the cache) and s (number of selected elements for exchange), as follows^{Footnote 3}:

The probability functions of the state transitions in the model below depend on m, i.e. the occupancy measure vector, the conditional probabilities, the ‘no collision’ probability $\mathsf{noc}$, and $G_{ max }$ (see [14] for further details).

Figure 3 shows the replication as sum of the number of nodes in states D and PD and the coverage as the sum of the number of nodes in D, PD, FD, LD and O for a network with $N=100$, $n=500$, $c=100$ and $s=50$ with initially one node in state PD and all the others in state I. Besides the classic and refined mean field approximations for the model in Fig. 2 and the Java simulation results of the actual shuffle protocol, Fig. 3 also shows the average of the model simulation. Note the good approximation of the simulation results by the refined mean field even in this very small network. Similarly good results have been found for a system with N = 2,500 shown in Fig. 4. A first comparison of the (non-optimised) performance of the implementation in F$\#$ of the analysis for N = 2,500, producing the results in Fig. 4, is: 0.5 s (classic mean field); 25.5 s (refined mean field^{Footnote 4}); 7 m 1.4 s (fast model simulation [19], 500 runs); 3 h 42 m 41.5 s (Java simulation, 500 runs) on a MacBook Pro, Intel i7, 16 GB.

5 Conclusions

We have developed a novel mean field model for the shuffle gossip protocol with which more accurate approximations for medium size gossip protocols can be obtained via refined mean field approximation techniques. This model respects key aspects of the protocol such as the effects of different kinds of interactions and the fact that a new data element cannot be lost by the system as a whole. Accurate approximation of medium size systems is important because many practical systems consist of many, but not a huge number of, components and simulation of such systems is still a resource consuming effort. A refined mean field approximation can provide very fast and accurate approximations.

Notes

1.
As long as this size is large enough to obtain a sufficiently accurate approximation. The computational complexity of these techniques do depend on the number of local states of an object in a population.
2.
We thank Rena Bahkshi for sharing her Java simulator source code with us.
3.
$P(A'B'|AB)$ is the conditional probability of the state of an active-passive pair AB to have state $A'B'$ after their interaction, where $A,B,A',B' \in \{O,D\}$, see [1, 2].
4.
Recall that the mean field analyses times are independent of the size of the system.

References

Bakhshi, R.: Gossiping models - formal analysis of epidemic protocols. Ph.D. thesis, Vrije Universiteit Amsterdam, January 2011. http://www.cs.vu.nl/en/Images/Gossiping_Models_van_Rena_Bakhshi_tcm210-256906.pdf
Bakhshi, R., Cloth, L., Fokkink, W., Haverkort, B.R.: Mean-field framework for performance evaluation of push-pull gossip protocols. Perform. Eval. 68(2), 157–179 (2011). https://doi.org/10.1016/j.peva.2010.08.025
Article Google Scholar
Bakhshi, R., Gavidia, D., Fokkink, W., van Steen, M.: An analytical model of information dissemination for a gossip-based protocol. Comput. Netw. 53(13), 2288–2303 (2009). https://doi.org/10.1016/j.comnet.2009.03.017
Article MATH Google Scholar
Bakhshi, R., Gavidia, D., Fokkink, W., van Steen, M.: A modeling framework for gossip-based information spread. In: Eighth International Conference on Quantitative Evaluation of Systems, QEST 2011, Aachen, Germany, 5–8 September 2011, pp. 245–254. IEEE Computer Society (2011). https://doi.org/10.1109/QEST.2011.39
Baydin, A.G., Pearlmutter, B.A., Radul, A.A., Siskind, J.M.: Automatic differentiation in machine learning: a survey. J. Mach. Learn. Res. 18, 153:1–153:43 (2018). http://jmlr.org/papers/v18/17-468.html
MathSciNet MATH Google Scholar
Birman, K.: The promise, and limitations, of gossip protocols. Oper. Syst. Rev. 41(5), 8–13 (2007). https://doi.org/10.1145/1317379.1317382
Article Google Scholar
Bortolussi, L., Hillston, J., Latella, D., Massink, M.: Continuous approximation of collective system behaviour: a tutorial. Perform. Eval. 70(5), 317–349 (2013)
Article Google Scholar
Frei, R., Serugendo, G.D.M.: Advances in complexity engineering. Int. J. Bio-Inspired Comput. 3(4), 199–212 (2011). https://doi.org/10.1504/IJBIC.2011.041144
Article Google Scholar
Frei, R., Serugendo, G.D.M.: Concepts in complexity engineering. Int. J. Bio-Inspired Comput. 3(2), 123–139 (2011). https://doi.org/10.1504/IJBIC.2011.039911
Article Google Scholar
Gast, N., Gaujal, B.: A mean field approach for optimization in discrete time. Discrete Event Dyn. Syst. 21(1), 63–101 (2011). https://doi.org/10.1007/s10626-010-0094-3
Article MathSciNet MATH Google Scholar
Gast, N., Houdt, B.V.: A refined mean field approximation. Proc. ACM Meas. Anal. Comput. Syst. 1(2), 33:1–33:28 (2017). https://doi.org/10.1145/3154491
Article Google Scholar
Gast, N., Latella, D., Massink, M.: A refined mean field approximation of synchronous discrete-time population models. Perform. Eval. 126, 1–21 (2018). https://doi.org/10.1016/j.peva.2018.05.002
Article Google Scholar
Gast, N., Latella, D., Massink, M.: A refined mean field approximation for synchronous population processes. In: Workshop on Mathematical Performance Modeling and Analysis (MAMA 2018), pp. 30–32. ACM (2019). ACM SIGMETRICS Perform. Eval. Rev
Google Scholar
Gast, N., Latella, D., Massink, M.: Refined mean field analysis of the gossip shuffle protocol - extended version - (2020). arXiv:2004.07519v1
Gavidia, D., Voulgaris, S., van Steen, M.: A gossip-based distributed news service for wireless mesh networks. In: Conference on Wireless On demand Network Systems and Services (WONS), pp. 59–67. IEEE Computer Society (2006)
Google Scholar
Jelasity, M.: Gossip. In: Serugendo, G.D.M., Gleizes, M.P., Karageorgos, A. (eds.) Self-organising Software - From Natural to Artificial Adaptation. NCS, pp. 139–162. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-17348-6_7
Chapter Google Scholar
Latella, D., Loreti, M., Massink, M.: On-the-fly PCTL fast mean-field approximated model-checking for self-organising coordination. Sci. Comput. Program. 110, 23–50 (2015). https://doi.org/10.1016/j.scico.2015.06.009
Article Google Scholar
Latella, D., Loreti, M., Massink, M.: FlyFast: a mean field model checker. In: Legay, A., Margaria, T. (eds.) TACAS 2017. LNCS, vol. 10206, pp. 303–309. Springer, Heidelberg (2017). https://doi.org/10.1007/978-3-662-54580-5_18
Chapter Google Scholar
Le Boudec, J., McDonald, D.D., Mundinger, J.: A generic mean field convergence result for systems of interacting objects. In: Fourth International Conference on the Quantitative Evaluation of Systems (QEST 2007), Edinburgh, Scotland, UK, 17–19 September 2007, pp. 3–18. IEEE Computer Society (2007)
Google Scholar
Massink, M.: Refined mean field F$\#$ implementation and gossip shuffle model. https://github.com/mimass/RefinedMF
Pianini, D., Beal, J., Viroli, M.: Improving gossip dynamics through overlapping replicates. In: Lluch Lafuente, A., Proença, J. (eds.) COORDINATION 2016. LNCS, vol. 9686, pp. 192–207. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-39519-7_12
Chapter Google Scholar
Voulgaris, S., Jelasity, M., van Steen, M.: A robust and scalable peer-to-peer gossiping protocol. In: Moro, G., Sartori, C., Singh, M.P. (eds.) AP2PC 2003. LNCS (LNAI), vol. 2872, pp. 47–58. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-25840-7_6
Chapter MATH Google Scholar

Download references

Author information

Authors and Affiliations

INRIA, University Grenoble Alpes, Grenoble, France
Nicolas Gast
Consiglio Nazionale delle Ricerche, Istituto di Scienza e Tecnologie dell’Informazione ‘A. Faedo’, CNR, Pisa, Italy
Diego Latella & Mieke Massink

Authors

Nicolas Gast
View author publications
You can also search for this author in PubMed Google Scholar
Diego Latella
View author publications
You can also search for this author in PubMed Google Scholar
Mieke Massink
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mieke Massink .

Editor information

Editors and Affiliations

Project-team SPIRALS, Inria Lille – Nord Europe, Villeneuve d’Ascq, France
Simon Bliudze
School of Computing, University of Kent, Canterbury, UK
Laura Bocchi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gast, N., Latella, D., Massink, M. (2020). Refined Mean Field Analysis: The Gossip Shuffle Protocol Revisited. In: Bliudze, S., Bocchi, L. (eds) Coordination Models and Languages. COORDINATION 2020. Lecture Notes in Computer Science(), vol 12134. Springer, Cham. https://doi.org/10.1007/978-3-030-50029-0_15

Download citation

DOI: https://doi.org/10.1007/978-3-030-50029-0_15
Published: 10 June 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-50028-3
Online ISBN: 978-3-030-50029-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Federation for Information Processing (opens in a new tab)

Refined Mean Field Analysis: The Gossip Shuffle Protocol Revisited

Abstract