A Decentralized Technique for Robust Probabilistic Mixture Modelling of a Distributed Data Set

El Attar, Ali; Pigeau, Antoine; Gelgon, Marc

doi:10.1007/978-3-642-24013-3_29

Part of the book series: Studies in Computational Intelligence (SCI, volume 382)

Abstract

This paper deals with a machine learning task, namely probability density estimation, in the case where the data set is composed of subsets hosted on the nodes of a distributed system. Focusing on mixture models and assuming a set of local probability distribution estimates, we demonstrate how to combine the local estimates in a dynamic, robust and decentralized fashion by gossiping a global probabilistic model over the data set. Experiments are reported to illustrate the proposal.
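
As a rough illustration of what combining two local mixture estimates during one gossip exchange can look like, the sketch below merges two one-dimensional Gaussian mixtures by taking the union of their components and reweighting them by the number of samples behind each estimate. It is not the algorithm of the chapter, which is not given in the abstract: the names `Gaussian`, `merge_mixtures` and `mixture_mean`, the sample-count weighting, and the example numbers are all assumptions made for illustration, and the robustness and model-reduction steps a practical scheme would need are left out.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Gaussian:
    """One 1-D mixture component (hypothetical structure, for illustration)."""
    weight: float  # mixing proportion inside its mixture
    mean: float
    var: float


def merge_mixtures(a: List[Gaussian], n_a: int,
                   b: List[Gaussian], n_b: int) -> Tuple[List[Gaussian], int]:
    """Union of the components of a and b, reweighted by sample counts.

    Assumption: each node knows how many samples its local estimate is
    based on, so the merged weights still sum to 1.
    """
    total = n_a + n_b
    merged = [Gaussian(g.weight * n_a / total, g.mean, g.var) for g in a]
    merged += [Gaussian(g.weight * n_b / total, g.mean, g.var) for g in b]
    return merged, total


def mixture_mean(mixture: List[Gaussian]) -> float:
    """Mean of the mixture density: sum of weight * component mean."""
    return sum(g.weight * g.mean for g in mixture)


# Two nodes with local estimates (made-up values).
node_a = [Gaussian(0.5, 0.0, 1.0), Gaussian(0.5, 4.0, 1.0)]  # from 100 samples
node_b = [Gaussian(1.0, 10.0, 2.0)]                          # from 300 samples

merged, n_total = merge_mixtures(node_a, 100, node_b, 300)
for g in merged:
    print(g)
print("samples:", n_total, "mean:", mixture_mean(merged))
```

Repeating such exchanges between randomly chosen pairs of nodes would spread information through the network, but the number of components grows with every merge, so a real scheme has to reduce the merged mixture after each exchange; that step is deliberately omitted here.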

Author information

Authors and Affiliations

LINA (UMR CNRS 6241), Université de Nantes, Nantes, France
Ali El Attar, Antoine Pigeau & Marc Gelgon

Editor information

Editors and Affiliations

Faculty of Technology, Policy and Management, Intelligent Interactive Dynamic Systems, Section Systems Engineering, Delft University of Technology, Jaffalaan 5, 2628BX, Delft, The Netherlands
F. M. T. Brazier
D-CIS lab, P.O. Box 90, 2600 AB, Delft, The Netherlands
Kees Nieuwenhuis & Gregor Pavlin
Faculty of Technology, Policy and Management, Section Systems Engineering, Delft University of Technology, Jaffalaan 5, 2628BX, Delft, The Netherlands
Martijn Warnier
Faculty of Automatics, Computers and Electronics, Software Engineering Department, University of Craiova, Bvd. Decebal Nr. 107, 200440, Craiova, Romania
Costin Badica

About this paper

Cite this paper

El Attar, A., Pigeau, A., Gelgon, M. (2011). A Decentralized Technique for Robust Probabilistic Mixture Modelling of a Distributed Data Set. In: Brazier, F.M.T., Nieuwenhuis, K., Pavlin, G., Warnier, M., Badica, C. (eds) Intelligent Distributed Computing V. Studies in Computational Intelligence, vol 382. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24013-3_29

DOI: https://doi.org/10.1007/978-3-642-24013-3_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24012-6
Online ISBN: 978-3-642-24013-3
eBook Packages: Engineering, Engineering (R0)
