Multiple change points detection and clustering in dynamic networks

Corneli, Marco; Latouche, Pierre; Rossi, Fabrice

doi:10.1007/s11222-017-9775-1

Multiple change points detection and clustering in dynamic networks

Published: 26 September 2017

Volume 28, pages 989–1007, (2018)
Cite this article

Statistics and Computing Aims and scope Submit manuscript

1010 Accesses
17 Citations
Explore all metrics

Abstract

The increasing amount of data stored in the form of dynamic interactions between actors necessitates the use of methodologies to automatically extract relevant information. The interactions can be represented by dynamic networks in which most existing methods look for clusters of vertices to summarize the data. In this paper, a new framework is proposed in order to cluster the vertices while detecting change points in the intensities of the interactions. These change points are key in the understanding of the temporal interactions. The model used involves non-homogeneous Poisson point processes with cluster-dependent piecewise constant intensity functions and common discontinuity points. A variational expectation maximization algorithm is derived for inference. We show that the pruned exact linear time method, originally developed for change points detection in univariate time series, can be considered for the maximization step. This allows the detection of both the number of change points and their location. Experiments on artificial and real datasets are carried out, and the proposed approach is compared with related methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The Bethe Hessian and Information Theoretic Approaches for Online Change-Point Detection in Network Data

Article 01 July 2021

Detecting network backbones against time variations in node properties

Article 24 July 2019

Detecting Stable Communities in Link Streams at Multiple Temporal Scales

Notes

Interaction times are assumed to be strictly positive and $\nu _{M+1}$ is assumed to fall outside the time interval [0, T]: $\mathbb {P}(\nu _{M+1} \ge T) = 1$.
Since the whole time interval [0, T] is considered, the intensity functions can be assumed (exceptionally) left continuous in $t=T$. In formulas: $\lambda _{kg}(T)=\lambda _{kg}(\eta _{D-1}$).
We use the same notation to denote the set of $K(K+1)/2$ intensity functions and the tensor because under our assumptions they correspond to two different views of the same object. Notice that the frontal slices of ${\varvec{\lambda }}$ are symmetric $K\times K$ matrices since we are dealing with undirected graphs.
In the framework of change point detection for time series, a cost function to be minimized is associated with each time segment. In contrast, we introduce a gain function to be consistent with the problem formulation used so far. However, the two definitions are equivalent since the cost function can be thought as a gain function multiplied by $-\,1$.
The same holds for the selected number of groups K (see Sect. 3.5).
Notice that in case the minimal partition is used, $X_{ij}^{(u)}$ is trivially equal to one for the pair (i, j) interacting at $t_{u-1}$ and zero for all the other pairs.
Map data are available from http://www.openstreetmap.org and copyrighted OpenStreetMap contributors.

References

Achab, M., Bacry, E., Gaïffas, S., Mastromatteo, I., Muzy, J.F.: Uncovering causality from multivariate Hawkes integrated cumulants. ArXiv preprint arXiv:1607.06333 (2016)
Airoldi, E., Blei, D., Fienberg, S., Xing, E.: Mixed membership stochastic blockmodels. J. Mach. Learn. Res. 9, 1981–2014 (2008)
MATH Google Scholar
Boullé, M.: Optimum simultaneous discretization with data grid models in supervised classification: a Bayesian model selection approach. Adv. Data Anal. Classif. 3(1), 39–61 (2009)
Casteigts, A., Flocchini, P., Quattrociocchi, W., Santoro, N.: Time-varying graphs and dynamic networks. Int. J. Parallel Emerg. Distrib. Syst. 27(5), 387–408 (2012). doi:10.1080/17445760.2012.668546
Article Google Scholar
Corneli, M., Latouche, P., Rossi, F.: Block modelling in dynamic networks with non-homogeneous poisson processes and exact ICL. Soc. Netw. Anal. Min. 6(1), 1–14 (2016a). doi:10.1007/s13278-016-0368-3
Article Google Scholar
Corneli, M., Latouche, P., Rossi, F.: Exact ICL maximization in a non-stationary temporal extension of the stochastic block model for dynamic networks. Neurocomputing 192, 81–91 (2016). doi:10.1016/j.neucom.2016.02.031
Article Google Scholar
Daley, D.J., Vere-Jones, D.: An Introduction to the Theory of Point Processes. Volume I: Elementary Theory and Methods. Springer, Berlin (2003)
MATH Google Scholar
Daudin, J.J., Picard, F., Robin, S.: A mixture model for random graphs. Stat. Comput. 18(2), 173–183 (2008)
Article MathSciNet Google Scholar
Dempster, A.P., Rubin, D.B., Laird, N.M.: Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B (Methodol.) 39(1), 1–38. http://www.jstor.org/stable/2984875 (1977)
Dubois, C., Butts, C., Smyth, P.: Stochastic block modelling of relational event dynamics. In: International Conference on Artificial Intelligence and Statistics, Volume 31 of the Journal of Machine Learning Research Proceedings, pp. 238–246 (2013)
Durante, D., Dunson, D.B., et al.: Locally adaptive dynamic networks. Ann. Appl. Stat. 10(4), 2203–2232 (2016)
Article MathSciNet MATH Google Scholar
Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3–5), 75–174 (2010)
Article MathSciNet Google Scholar
Friel, N., Rastelli, R., Wyse, J., Raftery, A.E.: Interlocking directorates in Irish companies using a latent space model for bipartite networks. In: Proceedings of the National Academy of Sciences, vol. 113, no. 24, pp. 6629–6634. doi:10.1073/pnas.1606295113. http://www.pnas.org/content/113/24/6629.full.pdf (2016)
Guigourès, R., Boullé, M., Rossi, F.: A triclustering approach for time evolving graphs. In: Co-clustering and Applications, IEEE 12th International Conference on Data Mining Workshops (ICDMW 2012), Brussels, Belgium, pp. 115–122. doi:10.1109/ICDMW.2012.61 (2012)
Guigourès, R., Boullé, M., Rossi, F.: Discovering patterns in time-varying graphs: a triclustering approach. In: Advances in Data Analysis and Classification, pp. 1–28. doi:10.1007/s11634-015-0218-6 (2015)
Hanneke, S., Fu, W., Xing, E.P., et al.: Discrete temporal models of social networks. Electron. J. Stat. 4, 585–605 (2010)
Article MathSciNet MATH Google Scholar
Hawkes, A.G.: Point spectra of some mutually exciting point processes. J. R. Stat. Soc. Ser. B (Methodol.) 33(3), 438–443 (1971)
Ho, Q., Song, L., Xing, E.P.: Evolving cluster mixed-membership blockmodel for time-evolving networks. In: International Conference on Artificial Intelligence and Statistics, pp. 342–350 (2011)
Hoff, P., Raftery, A., Handcock, M.: Latent space approaches to social network analysis. J. Am. Stat. Assoc. 97(460), 1090–1098 (2002)
Article MathSciNet MATH Google Scholar
Jackson, B., Sargle, J., Barnes, D., Arabhi, S., Alt, A., Giomousis, P., Gwin, E., Sangtrakulcharoen, P., Tan, L., Tsai, T.: An algorithm for optimal partitioning of data on an interval. In: Signal Processing Letters, pp. 105–108 (2005)
Jernite, Y., Latouche, P., Bouveyron, C., Rivera, P., Jegou, L., Lamassé, S.: The random subgraph model for the analysis of an ecclesiastical network in Merovingian Gaul. Ann. Appl. Stat. 8(1), 55–74 (2014)
Article MathSciNet MATH Google Scholar
Killick, R., Fearnhead, P., Eckley, I.A.: Optimal detection of changepoints with a linear computational cost. J. Am. Stat. Assoc. 107(500), 1590–1598 (2012). doi:10.1080/01621459.2012.737745
Article MathSciNet MATH Google Scholar
Kim, M., Leskovec, J.: Nonparametric multi-group membership model for dynamic networks. Adv. Neural Inf. Process. Syst. 25, 1385–1393 (2013)
Google Scholar
Kolda, T.G., Bader, B.W.: Tensor decompositions and applications. SIAM Rev. 51(3), 455–500 (2009)
Article MathSciNet MATH Google Scholar
Krivitsky, P.N., Handcock, M.S.: A separable model for dynamic networks. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 76(1), 29–46 (2014)
Article MathSciNet Google Scholar
Latouche, P., Birmelé, E., Ambroise, C.: Overlapping stochastic block models with application to the French political blogosphere. Ann. Appl. Stat., 5(1) 309–336 (2011)
Lewis, P., Shedler, G.: Simulation of nonhomogeneous poison processes by thinning. Naval Res. Logist. Q. 26(3), 403–413 (1979)
Article MATH Google Scholar
Matias, C., Miele, V.: Statistical clustering of temporal networks through a dynamic stochastic block model. J. R. Stat. Soc. Ser. B 79(4), 1119–1141 (2017)
Matias, C., Rebafka, T., Villers, F.: Estimation and clustering in a semiparametric Poisson process stochastic block model for longitudinal networks. arXiv:1512.07075 e-prints (2015)
Nouedoui, L., Latouche, P.: Bayesian non parametric inference of discrete valued networks. In: 21-th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN 2013), Bruges, Belgium, pp. 291–296 (2013)
Nowicki, K., Snijders, T.: Estimation and prediction for stochastic blockstructures. J. Am. Stat. Assoc. 96(455), 1077–1087 (2001)
Article MathSciNet MATH Google Scholar
Rand, W.M.: Objective criteria for the evaluation of clustering methods. J. Am. Stat. Assoc. 66(336), 846–850 (1971)
Article Google Scholar
Robins, G., Pattison, P., Kalish, Y., Lusher, D.: An introduction to exponential random graph (p*) models for social networks. Soc. Netw. 29(2), 173–191 (2007)
Article Google Scholar
Sarkar, P., Moore, A.W.: Dynamic social network analysis using latent space models. ACM SIGKDD Explor. Newsl. 7(2), 31–40 (2005)
Article Google Scholar
Sewell, D.K., Chen, Y.: Latent space models for dynamic networks. J. Am. Stat. Assoc. 110(512), 1646–1657 (2015)
Article MathSciNet MATH Google Scholar
Sewell, D.K., Chen, Y.: Latent space models for dynamic networks with weighted edges. Soc. Netw. 44, 105–116 (2016)
Article Google Scholar
Snijders, T.A.: Stochastic actor-oriented models for network change. J. Math. Sociol. 21(1–2), 149–172 (1996)
Article MATH Google Scholar
von Luxburg, U.: A tutorial on spectral clustering. Stat. Comput. 17(4), 395–416 (2007). doi:10.1007/s11222-007-9033-z
Article MathSciNet Google Scholar
Wang, Y., Wong, G.: Stochastic blockmodels for directed graphs. J. Am. Stat. Assoc. 82, 8–19 (1987)
Article MathSciNet MATH Google Scholar
Xing, E.P., Fu, W., Song, L.: A state-space mixed membership blockmodel for dynamic network tomography. Ann. Appl. Stat. 4(2), 535–566 (2010). doi:10.1214/09-AOAS311
Article MathSciNet MATH Google Scholar
Xu, H., Farajtabar, M., Zha, H.: Learning granger causality for Hawkes processes. In: Proceedings of the 33rd International Conference on Machine Learning, pp. 1717–1726 (2016)
Xu, K.S., Hero III, A.O.: Dynamic stochastic blockmodels for time-evolving social networks. IEEE J Sel Top Signal Process. 8(4), 552–562 (2014)
Yang, T., Chi, Y., Zhu, S., Gong, Y., Jin, R.: Detecting communities and their evolutions in dynamic social networks a Bayesian approach. Mach. Learn. 82(2), 157–189 (2011)
Article MathSciNet MATH Google Scholar
Zreik, R., Latouche, P., Bouveyron, C.: The dynamic random subgraph model for the clustering of evolving networks. Comput. Stat. 32(2), 501–533 (2016). doi:10.1007/s00180-016-0655-5
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Laboratoire SAMM, Université Paris 1 Panthéon-Sorbonne, 90 rue de Tolbiac, 75634, Paris Cedex 13, France
Marco Corneli, Pierre Latouche & Fabrice Rossi

Authors

Marco Corneli
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Latouche
View author publications
You can also search for this author in PubMed Google Scholar
Fabrice Rossi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marco Corneli.

Appendix: Proofs

1.1 A.1 Proof of Proposition 1

Proof

Notice, first, that the central factor on the r.h.s. of the equality in Eq. (3) can be written as

$$\begin{aligned} \prod _{m=1}^M \lambda _{Z_{i_m}Z_{j_m}}(\nu _m)=\prod _{m=1}^M\prod _{j>i}^N\left( \lambda _{Z_i Z_j}(\nu _m)\right) ^{{\mathbf {1}}_{\mathcal {A}^{(i,j)}}(\nu _m)}, \end{aligned}$$

where ${\mathbf {1}}_{{\mathcal {G}}}(\cdot )$ is the indicator function on a set ${\mathcal {G}}$ and $\mathcal {A}^{(ij)}$ has been defined in (1). By inverting the product on the right-hand side, because of the indicator function we get

$$\begin{aligned}&\prod _{j>i}^N \prod _{m=1}^M \left( \lambda _{Z_i Z_j}(\nu _m)\right) ^{{\mathbf {1}}_{\mathcal {A}^{(i,j)}}(\nu _m)}=\prod _{j>i}^N \prod _{m=1}^{M^{(i,j)}}\lambda _{Z_iZ_j}(\nu _m^{(i,j)})\\&\quad =\prod _{j>i}^N \prod _{m=1}^{M^{(i,j)}}\left[ \prod _{k,g}^K (\lambda _{kg}(\nu _m^{(i,j)}))^{Z_{ik}Z_{jg}}\right] , \end{aligned}$$

where $\nu _m^{(i,j)}$ are the interaction times in the set $\mathcal {A}^{(i,j)}$, whose cardinality is $M^{(i,j)}$. Thanks (5), the following holds

$$\begin{aligned}&\prod _{j>i}^N \prod _{m=1}^{M^{(i,j)}}\left[ \prod _{k,g}^K (\lambda _{kg}(\nu _m^{(i,j)}))^{Z_{ik}Z_{jg}}\right] \nonumber \\&\quad =\prod _{j>i}^N \prod _{m=1}^{M^{(i,j)}}\left[ \prod _{k,g}^K \prod _{d=1}^{D} \lambda _{kgd}^{Z_{ik}Z_{jg} {\mathbf {1}}_{[\eta _{d-1}, \eta _d[}(\nu _m^{(i,j)})}\right] \nonumber \\&\quad =\prod _{k,g}^K \prod _{d=1}^{D}\lambda _{kgd}^{\sum _{j>i}^N Z_{ik}Z_{jg}(M^{(i,j)}(\eta _d) - M^{(i,j)}(\eta _{d-1}))}. \end{aligned}$$

(19)

Note that the last equality employs the definition of counting process

$$\begin{aligned} M^{(i,j)}(t)=\sum _{m=1}^{M^{(i,j)}}{\mathbf {1}}_{]0,t]}(\nu _m^{(i,j)}). \end{aligned}$$

By replacing (19) into (3) and using that

$$\begin{aligned} \Lambda _{Z_iZ_j}(T)=\sum _{k,g}^K \Lambda _{kg}(T)Z_{ik}Z_{jg}=\sum _{d=1}^D \sum _{k,g}^K \lambda _{kgd}\Delta _d Z_{ik}Z_{jg}, \end{aligned}$$

it suffices to take the logarithm of the likelihood and the proposition is proven. $\square $

1.2 A.2 Proof of Proposition 3

Proof

The following objective function is taken into account

$$\begin{aligned} \mathcal {L} \left( q(Z);K,\eta , D, {\varvec{\lambda }}, \pi \right) + \sum _{i=1}^N l_i\left( \sum _{k=1}^K\tau _{ik} -1\right) . \end{aligned}$$

This function has to be maximized with respect to both $\tau $ and the N Lagrange multipliers $l_1, \ldots , l_N$, introduced to take into account the normality of the lines of $\tau $. The most difficult step consists in taking the partial derivative of the objective function with respect to $\tau _{i_0k_0}$. We first focus on those terms of $\mathcal {L}(\cdot )$ depending on d

$$\begin{aligned}&Q(\tau ,\theta ):=-\sum _{d=1}^{D}\sum _{k,g}^K \left( \lambda _{kgd}\Delta _d \left( \sum _{i=1}^N\sum _{j>i}^N \tau _{ik} \tau _{jg}\right) \right. \\&\left. \quad - \log (\lambda _{kgd})\left( \sum _{i=1}^N\sum _{j>i}^N \tau _{ik}\tau _{jg}X_{ij}^{(d)}\right) \right) . \end{aligned}$$

Hence

$$\begin{aligned}&\frac{\partial Q(\tau ,\theta )}{\partial \tau _{i_0k_0}}=\sum _{d=1}^{D} \frac{\partial }{\partial \tau _{i_0k_0}}\left( -\sum _{i=1}^N\sum _{j>i}^N \sum _{k,g}^K\tau _{ik} \tau _{jg} \lambda _{kgd}\Delta _d \right) \\&\qquad + \sum _{d=1}^{D} \frac{\partial }{\partial \tau _{i_0 k_0}}\left( \sum _{i=1}^N\sum _{j>i}^N \sum _{k,g}^K\tau _{ik}\tau _{jg}X_{ij}^{(d)}\log (\lambda _{kgd})\right) \\&\quad =-\sum _{d=1}^{D}\left[ \sum _{j>i_0}^N \sum _{g=1}^K \tau _{jg}\Delta _d \lambda _{k_0 g d} + \sum _{j<i_0}^N \sum _{g=1}^K\tau _{jg}\Delta _d \lambda _{g k_0 d}\right] \\&\qquad +\sum _{d=1}^{D}\left[ \sum _{j>i_0}^N \sum _{g=1}^K \tau _{jg} X_{i_0 j}^{(d)}\log (\lambda _{k_0 g d}) + \sum _{j<i_0}^N \sum _{g=1}^K\tau _{jg} X_{j i_0}^{(d)}\log (\lambda _{g k_0 d})\right] \\&\quad =-\sum _{d=1}^{D}\left[ \sum _{j\ne i_0}^N \sum _{g=1}^K \tau _{jg} \Delta _d\lambda _{k_0 g d} - \sum _{j \ne i_0}^N \sum _{g=1}^K\tau _{jg} X_{i_0 j}^{(d)}\log (\lambda _{k_0 g d})\right] , \end{aligned}$$

where the last equality comes from the symmetry (i.e., interactions are undirected) of the frontal slices of tensors X and ${\varvec{\lambda }}$. Notice that the last term in the above equation is the function inside the exponential in Proposition 3. The remaining terms of $\mathcal {L}(\cdot )$, not involving d, can be differentiated straightforward. Imposing the partial derivatives of $\mathcal {L}(\cdot )$ equal to zero and using the above equation lead to the following system

$$\begin{aligned} {\left\{ \begin{array}{ll} \log (\tau _{i_0 k_0})= \log (\pi _{k_0})- \frac{\partial Q(\tau , \theta )}{\partial \tau _{i_0k_0}} + l_{i_0} -1 &{} \\ \sum _{k_0=1}^K \tau _{i_0 k_0}=1 &{}\qquad \forall (i_0,k_0). \end{array}\right. } \end{aligned}$$

The solution is obtained straightforward after some manipulations, and this concludes the proof. $\square $

1.3 A.3 Proof of Proposition 4

Proof

The following definitions are introduced to keep the notation uncluttered

$$\begin{aligned} S_{kg}:&=\sum _{j>i}^N \tau _{ik}\tau _{jg}\\ Y_{kg}^{[s,t[}:&=\sum _{j>i}^N\tau _{ik}\tau _{jg}(M^{(i,j)}(t)-M^{(i,j)}(s)) \qquad \forall s<t. \end{aligned}$$

Moreover, for every $t_{u_e}<t_{u_f}<t_{u_g}$, the following shorthand notation is used

$$\begin{aligned} \Delta ^{e,f}:=t_{u_f}-t_{u_e} \end{aligned}$$

and similarly for $\Delta ^{f,g}$ and $\Delta ^{e,g}$. Hence, we get

$$\begin{aligned}&{\mathcal {G}}([t_{u_e}, t_{u_g}[)=\sum _{k,g}\left[ \max _{\lambda _{kg}^{e,g} \in ]0, +\infty [}\left( -\lambda _{kg}^{e,g}\Delta ^{e,g}S_{kg} + \log (\lambda _{kg}^{e,g})Y_{kg}^{[t_{u_e}, t_{u_g}[}\right) \right] \\&\quad =\sum _{k,g}\left[ \max _{\lambda _{kg}^{e,g} \in ]0, +\infty [}\left( -\lambda _{kg}^{e,g}\Delta ^{e,f}S_{kg} + \log (\lambda _{kg}^{e,g})Y_{kg}^{[t_{u_e}, t_{u_f}[}\right) \right] \\&\qquad +\sum _{k,g}\left[ \max _{\lambda _{kg}^{e,g} \in ]0, +\infty [}\left( -\lambda _{kg}^{e,g}\Delta ^{f,g}S_{kg} + \log (\lambda _{kg}^{e,g})Y_{kg}^{[t_{u_f}, t_{u_g}[}\right) \right] \\&\quad \le \sum _{k,g}\left[ \max _{\lambda _{kg}^{e,f} \in ]0, +\infty [}\left( -\lambda _{kg}^{e,f}\Delta ^{e,f}S_{kg} + \log (\lambda _{kg}^{e,f})Y_{kg}^{[t_{u_e}, t_{u_f}[}\right) \right] \\&\qquad +\sum _{k,g}\left[ \max _{\lambda _{kg}^{f,g} \in ]0, +\infty [}\left( -\lambda _{kg}^{f,g}\Delta ^{f,g}S_{kg} + \log (\lambda _{k,g}^{f,g})Y_{kg}^{[t_{u_f}, t_{u_g}[}\right) \right] \\&\quad = {\mathcal {G}}([t_{u_e}, t_{u_f}[) + {\mathcal {G}}([t_{u_f}, t_{u_g}[), \end{aligned}$$

where the first and the last equalities come from the definition of ${\mathcal {G}}(\cdot )$. This concludes the proof. $\square $

Rights and permissions

Reprints and permissions

About this article

Cite this article

Corneli, M., Latouche, P. & Rossi, F. Multiple change points detection and clustering in dynamic networks. Stat Comput 28, 989–1007 (2018). https://doi.org/10.1007/s11222-017-9775-1

Download citation

Received: 09 January 2017
Accepted: 12 September 2017
Published: 26 September 2017
Issue Date: September 2018
DOI: https://doi.org/10.1007/s11222-017-9775-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multiple change points detection and clustering in dynamic networks

Abstract

Access this article

Similar content being viewed by others

The Bethe Hessian and Information Theoretic Approaches for Online Change-Point Detection in Network Data

Detecting network backbones against time variations in node properties

Detecting Stable Communities in Link Streams at Multiple Temporal Scales

Notes

References

Author information

Authors and Affiliations

Corresponding author

Appendix: Proofs

1.1 A.1 Proof of Proposition 1

Proof

1.2 A.2 Proof of Proposition 3

Proof

1.3 A.3 Proof of Proposition 4

Proof

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Multiple change points detection and clustering in dynamic networks

Abstract

Access this article

Similar content being viewed by others

The Bethe Hessian and Information Theoretic Approaches for Online Change-Point Detection in Network Data

Detecting network backbones against time variations in node properties

Detecting Stable Communities in Link Streams at Multiple Temporal Scales

Notes

References

Author information

Authors and Affiliations

Corresponding author

Appendix: Proofs

Appendix: Proofs

1.1 A.1 Proof of Proposition 1

Proof

1.2 A.2 Proof of Proposition 3

Proof

1.3 A.3 Proof of Proposition 4

Proof

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation