Maximum Entropy Linear Manifold for Learning Discriminative Low-Dimensional Representation

Czarnecki, Wojciech Marian; Jozefowicz, Rafal; Tabor, Jacek

doi:10.1007/978-3-319-23528-8_4

Wojciech Marian Czarnecki¹⁰,
Rafal Jozefowicz¹¹ &
Jacek Tabor¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9284))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

4775 Accesses
4 Citations

Abstract

Representation learning is currently a very hot topic in modern machine learning, mostly due to the great success of the deep learning methods. In particular low-dimensional representation which discriminates classes can not only enhance the classification procedure, but also make it faster, while contrary to the high-dimensional embeddings can be efficiently used for visual based exploratory data analysis.

In this paper we propose Maximum Entropy Linear Manifold (MELM), a multidimensional generalization of Multithreshold Entropy Linear Classifier model which is able to find a low-dimensional linear data projection maximizing discriminativeness of projected classes. As a result we obtain a linear embedding which can be used for classification, class aware dimensionality reduction and data visualization. MELM provides highly discriminative 2D projections of the data which can be used as a method for constructing robust classifiers.

We provide both empirical evaluation as well as some interesting theoretical properties of our objective function such us scale and affine transformation invariance, connections with PCA and bounding of the expected balanced accuracy error.

Download to read the full chapter text

Chapter PDF

Feature Reduction Using Locally Linear Embedding and Distance Metric Learning

Interpretable Discriminative Dimensionality Reduction and Feature Selection on the Manifold

Manifold Learning in Data Mining Tasks

Keywords

References

Bache, K., Lichman, M.: UCI machine learning repository (2013). http://archive.ics.uci.edu/ml
Bhatia, R.: Matrix analysis, vol. 169. Springer Science & Business Media (1997)
Google Scholar
Chang, C.C., Lin, C.J.: Libsvm: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2(3), 27 (2011)
Google Scholar
Cover, T.M., Thomas, J.A.: Elements of information theory, 2nd edn. Willey-Interscience, NJ (2006)
MATH Google Scholar
Czarnecki, W.M.: On the consistency of multithreshold entropy linear classifier. Schedae Informaticae (2015)
Google Scholar
Czarnecki, W.M., Tabor, J.: Multithreshold entropy linear classifier: Theory and applications. Expert Systems with Applications (2015)
Google Scholar
Geng, Q., Wright, J.: On the local correctness of 1-minimization for dictionary learning. In: 2014 IEEE International Symposium on Information Theory (ISIT), pp. 3180–3184. IEEE (2014)
Google Scholar
Goodfellow, I.J., et al.: Challenges in representation learning: a report on three machine learning contests. In: Lee, M., Hirose, A., Hou, Z.-G., Kil, R.M. (eds.) ICONIP 2013, Part III. LNCS, vol. 8228, pp. 117–124. Springer, Heidelberg (2013)
Chapter Google Scholar
Hinton, G., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Computation 18(7), 1527–1554 (2006)
Article MathSciNet MATH Google Scholar
Jozefowicz, R., Czarnecki, W.M.: Fast optimization of multithreshold entropy linear classifier (2015). arXiv preprint arXiv:1504.04739
Karampatziakis, N., Mineiro, P.: Discriminative features via generalized eigenvectors. In: Proceedings of the 31st International Conference on Machine Learning (ICML 2014), pp. 494–502 (2014)
Google Scholar
Levy, O., Goldberg, Y.: Neural word embedding as implicit matrix factorization. In: Advances in Neural Information Processing Systems (NIPS 2014), pp. 2177–2185 (2014)
Google Scholar
Mairal, J., Bach, F., Ponce, J., Sapiro, G.: Online dictionary learning for sparse coding. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 689–696. ACM (2009)
Google Scholar
Principe, J.C., Xu, D., Fisher, J.: Information theoretic learning. Unsupervised Adaptive Filtering 1, 265–319 (2000)
Google Scholar
Silverman, B.W.: Density estimation for statistics and data analysis, vol. 26. CRC Press (1986)
Google Scholar
Suykens, J.A., Van Gestel, T., De Brabanter, J., De Moor, B., Vandewalle, J.: Least squares support vector machines, vol. 4. World Scientific (2002)
Google Scholar
Tabor, J., Spurek, P.: Cross-entropy clustering. Pattern Recognition 47(9), 3046–3059 (2014)
Article Google Scholar
Wang, L.: Support Vector Machines: theory and applications, vol. 177. Springer Science & Business Media (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Mathematics and Computer Science, Jagiellonian University, Krakow, Poland
Wojciech Marian Czarnecki & Jacek Tabor
Google, New York, USA
Rafal Jozefowicz

Authors

Wojciech Marian Czarnecki
View author publications
You can also search for this author in PubMed Google Scholar
Rafal Jozefowicz
View author publications
You can also search for this author in PubMed Google Scholar
Jacek Tabor
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wojciech Marian Czarnecki .

Editor information

Editors and Affiliations

University of Bari Aldo Moro, Bari, Italy
Annalisa Appice
University of Porto, Porto, Portugal
Pedro Pereira Rodrigues
University of Porto - CRACS/INESC TEC, Porto, Portugal
Vítor Santos Costa
University of Porto - INESC TEC, Porto, Portugal
Carlos Soares
University of Porto - INESC TEC, Porto, Portugal
João Gama
University of Porto - INESC TEC, Porto, Portugal
Alípio Jorge

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Czarnecki, W.M., Jozefowicz, R., Tabor, J. (2015). Maximum Entropy Linear Manifold for Learning Discriminative Low-Dimensional Representation. In: Appice, A., Rodrigues, P., Santos Costa, V., Soares, C., Gama, J., Jorge, A. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2015. Lecture Notes in Computer Science(), vol 9284. Springer, Cham. https://doi.org/10.1007/978-3-319-23528-8_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-23528-8_4
Published: 29 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23527-1
Online ISBN: 978-3-319-23528-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Maximum Entropy Linear Manifold for Learning Discriminative Low-Dimensional Representation

Abstract

Chapter PDF

Similar content being viewed by others

Feature Reduction Using Locally Linear Embedding and Distance Metric Learning

Interpretable Discriminative Dimensionality Reduction and Feature Selection on the Manifold

Manifold Learning in Data Mining Tasks

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Maximum Entropy Linear Manifold for Learning Discriminative Low-Dimensional Representation

Abstract

Chapter PDF

Similar content being viewed by others

Feature Reduction Using Locally Linear Embedding and Distance Metric Learning

Interpretable Discriminative Dimensionality Reduction and Feature Selection on the Manifold

Manifold Learning in Data Mining Tasks

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation