Degenerate Expectation-Maximization Algorithm for Local Dimension Reduction

Lin, Xiaodong; Zhu, Yu

doi:10.1007/978-3-642-17103-1_25

Xiaodong Lin²³ &
Yu Zhu²⁴

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organisation ((STUDIES CLASS))

1527 Accesses
3 Citations

Abstract

Dimension reduction techniques based on principal component analysis (PCA) and factor analysis are commonly used in statistical data analysis. The effectiveness of these methods is limited by their global nature. Recent efforts have focused on relaxing global restrictions in order to identify subsets of data that are concentrated on lower dimensional subspaces. In this paper, we propose an adaptive local dimension reduction method, called the Degenerate Expectation-Maximization Algorithm (DEM). This method is based on the finite mixture model. We demonstrate that the DEM yields significantly better results than the local PCA (LPCA) and other related methods in a variety of synthetic and real datasets. The DEM algorithm can be used in various applications ranging from clustering to information retrieval.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Archer, C, and Leen, T. K. (1999). “Optimal Dimension Reduction and Transform Coding with Mixture Principal Components,” International Joint Conference on Neural Networks (IJCNN), IEEE.
Google Scholar
Ciuperca, G., Ridolfi, A., and Idier, J. (2003). “Penalized Maximum Likelihood Estimator for Normal Mixtures,” Scandinavian Journal of Statistics, 30, 45–59.
Article MathSciNet MATH Google Scholar
Hastie, T., and Stuetzle, W. (1989). “Principal Curves,” Journal of the American Statistical Association, 84, 502–516.
Article MathSciNet MATH Google Scholar
Hathaway, R. J. (1985). “A Constrained Formulation of Maximum-Likelihood Estimation for Normal Mixture Distributions,” Annals of Statistics, 13, 795–800.
Article MathSciNet MATH Google Scholar
Hinton, G. E., and Ghahramani, Z. (1997). “Generative Models for Discovering Sparse Distributed Representations,” Philosophical Transactions Royal Society B, 352, 1177–1190.
Article Google Scholar
Kambhatla, N., and Leen, T. K. (1997). “Dimension Reduction by Local Principal Component Analysis,” Neural Computation, 9, 1793–1516.
Article Google Scholar
Kiefer, J., and Wolfowitz, J. (1956). “Consistency of the Maximum Likelihood Estimates in the Presence of Infinitely Many Incidental Parameters,” Annals of Mathematical Statistics, 27, 887–906.
Article MathSciNet MATH Google Scholar
Kohonen, T. (1989). Self-Organization and Associative Memory (3rd ed.), Springer-Verlag, Berlin.
Book Google Scholar
Kohonen, T. (1990). “The Self-Organizing Map,” in Proceedings of the IEEE, 78, pp. 1464–1479.
Article Google Scholar
Lin, X. (2003). Finite Mixture Models for Clustering, Dimension Reduction and Privacy Preserving Data Mining, Ph.D. Thesis, Purdue University.
Google Scholar

Download references

Author information

Authors and Affiliations

Statistical and Applied Mathematical Science Institute, USA
Xiaodong Lin
Purdue University, USA
Yu Zhu

Authors

Xiaodong Lin
View author publications
You can also search for this author in PubMed Google Scholar
Yu Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Leanna House Institute of Statistics and Decision Sciences, Duke University, 27708, Durham, NC, USA
David Banks
Department of Mathematics, Illinois Institute of Technology, 10 West 32nd Street, 60616-3793, Chicago, IL, USA
Frederick R. McMorris
Faculty of Management, Rutgers University, 180 University Avenue, 07102-1895, Newark, NJ, USA
Phipps Arabie
Institute of Decision Theory, University of Karlsruhe, Kaiserstr. 12, 76128, Karlsruhe, Germany
Wolfgang Gaul

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lin, X., Zhu, Y. (2004). Degenerate Expectation-Maximization Algorithm for Local Dimension Reduction. In: Banks, D., McMorris, F.R., Arabie, P., Gaul, W. (eds) Classification, Clustering, and Data Mining Applications. Studies in Classification, Data Analysis, and Knowledge Organisation. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17103-1_25

Download citation

DOI: https://doi.org/10.1007/978-3-642-17103-1_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22014-5
Online ISBN: 978-3-642-17103-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics