
An algorithm of non-negative matrix factorization with the nearest neighbor after per-treatments


Abstract

Clustering is a hot topic in machine learning. For high-dimensional data, nonnegative matrix factorization (NMF) is a crucial technique for clustering. However, NMF has two disadvantages. First, NMF clusters data in the original space, where outliers and noise weaken the clustering results. Second, NMF does not take into consideration the local structure of the data, which is beneficial for clustering. To address these two disadvantages, a new algorithm called nonnegative matrix factorization with the nearest neighbor after per-treatments (PNNMF) is proposed. Per-treatments are used to alleviate the effects of outliers and noise. After the per-treatments, credible connected components generated by the nearest neighbor of the data are chosen to capture local structure. Moreover, a new initialization for the basis matrix is proposed based on these credible connected components. Experiments on real data sets confirm the effectiveness of PNNMF.
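To make the pipeline concrete, here is a minimal Python sketch of the nearest-neighbor connected-component idea and the centroid-based initialization it enables, assuming a nonnegative data matrix with one sample per column. The component selection shown (simply keeping the largest components) is a hypothetical placeholder, not the paper's credibility criterion.

```python
# Illustrative sketch only, not the authors' exact PNNMF procedure.
# Builds the 1-nearest-neighbor graph, extracts its connected components,
# and initializes the basis matrix U from component centroids.
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import connected_components

def nn_components(X):
    """X: (m, n) data matrix, one sample per column.
    Returns the number of connected components of the 1-NN graph
    and a component label for each sample."""
    n = X.shape[1]
    d = np.linalg.norm(X[:, :, None] - X[:, None, :], axis=0)  # pairwise distances
    np.fill_diagonal(d, np.inf)
    nn = d.argmin(axis=1)                      # each sample's nearest neighbor
    g = csr_matrix((np.ones(n), (np.arange(n), nn)), shape=(n, n))
    return connected_components(g, directed=False)

def init_basis(X, labels, k):
    """Hypothetical initialization: centroids of the k largest components
    become the columns of the basis matrix U (shape (m, k))."""
    sizes = np.bincount(labels)
    top = np.argsort(sizes)[::-1][:k]          # assumes at least k components
    U = np.stack([X[:, labels == c].mean(axis=1) for c in top], axis=1)
    return np.maximum(U, 1e-12)                # keep U strictly nonnegative
```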


Data availability statement

The datasets analysed during the current study are available on the homepage of Deng Cai (http://www.cad.zju.edu.cn/home/dengcai/).


Acknowledgments

This work is supported by the National Natural Science Foundation of China (11961010, 61967004).

Author information


Corresponding author

Correspondence to Xiangli Li.

Ethics declarations

Conflict of interest statement

The authors declare that they have no conflicts of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix


To prove Theorem 1, we first give a definition and two lemmas.

Definition 1: \(G(v, v^{\prime})\) is an auxiliary function of F(v) if \(G(v, v^{\prime})\) satisfies: (1) \(G(v, v^{\prime}) \geqslant F(v)\); (2) G(v, v) = F(v).

Lemma 1: If G is an auxiliary function of F, then F is non-increasing under the update

$$ \begin{array}{@{}rcl@{}} v^{t+1} = \mathop{\arg\min}_{v} G(v, v^{t}). \end{array} $$

Proof: \(F(v^{t+1}) = G(v^{t+1}, v^{t+1}) \leqslant G(v^{t+1}, v^{t}) \leqslant G(v^{t}, v^{t}) = F(v^{t})\). \(_{\square}\)
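The mechanism behind Lemma 1 is the standard majorize-minimize argument. As a minimal numeric sketch (not from the paper), take a one-dimensional quadratic F and a quadratic majorizer G whose curvature dominates that of F; minimizing G at each step never increases F:

```python
# Minimal majorize-minimize demo: G(v, v_t) >= F(v) with G(v_t, v_t) = F(v_t),
# so v_{t+1} = argmin_v G(v, v_t) gives F(v_{t+1}) <= F(v_t) (Lemma 1).
def F(v):
    return (v - 3.0) ** 2

def argmin_G(v_t, c=2.0):
    # G(v, v_t) = F(v_t) + F'(v_t)(v - v_t) + c (v - v_t)^2; any c >= 1
    # (half the curvature of F) makes G a majorizer of F.
    dF = 2.0 * (v_t - 3.0)
    return v_t - dF / (2.0 * c)

v = 10.0
for _ in range(10):
    v_next = argmin_G(v)
    assert F(v_next) <= F(v) + 1e-12   # F is non-increasing, as Lemma 1 states
    v = v_next
print(round(v, 4))                      # approaches the minimizer v = 3
```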

We can rewrite the cost function (16) as

$$ \begin{array}{@{}rcl@{}} O &=& tr({X^{P}}^{T}X^{P}) - 2\sum\limits^{n}_{i=1}\sum\limits^{k}_{j=1} V_{ij}(U^{T}X^{P})_{ji} + \sum\limits^{n}_{i=1}\sum\limits^{k}_{j=1} V_{ij}(U^{T}UV^{T})_{ji} \\ && + \lambda \sum\limits^{k}_{i=1}\sum\limits^{n}_{j=1} V^{T}_{ij}((H^{T}-B^{T})(H-B)V)_{ji}. \end{array} $$

Now consider the element \(v_{ab}\) in V. To update \(v_{ab}\), we denote the part of O related to \(v_{ab}\) as:

$$ \begin{array}{@{}rcl@{}} F_{ab} &=& -2v_{ab}({X^{P}}^{T}U)_{ab}+v_{ab}(VU^{T}U)_{ab}+(VU^{T}U)_{ab}v_{ab} \\ && +\lambda v_{ab}((H^{T}-B^{T})(H-B)V)_{ab}+\lambda (V^{T}(H^{T}-B^{T})(H-B))_{ba}v_{ab}. \end{array} $$

Based on \(F_{ab}\), we can calculate its first-order and second-order derivatives as follows:

$$ \begin{array}{@{}rcl@{}} F_{ab}^{\prime} = -2({X^{P}}^{T}U)_{ab}+2(VU^{T}U)_{ab}+2\lambda ((H^{T}-B^{T})(H-B)V)_{ab}. \end{array} $$
$$ \begin{array}{@{}rcl@{}} F_{ab}^{\prime\prime} = 2\sum\limits_{i=1}^{m}U_{ib}^{2}+2\lambda \sum\limits_{i=1}^{n}(H_{ia}-B_{ia})^{2}. \end{array} $$

Lemma 2: Function

$$ \begin{array}{@{}rcl@{}} G(v, v^{t}_{ab}) = F_{ab}(v^{t}_{ab})+F^{\prime}_{ab}(v^{t}_{ab})(v-v^{t}_{ab})+\frac{(VU^{T}U+\lambda H^{T}HV+\lambda B^{T}BV)_{ab}}{v^{t}_{ab}}(v-v^{t}_{ab})^{2} \end{array} $$

is an auxiliary function of \(F_{ab}\).

Proof: It is obvious that \(G(v^{t}_{ab}, v^{t}_{ab}) = F_{ab}(v^{t}_{ab})\). Next we will prove that \(G(v, v^{t}_{ab}) \geqslant F_{ab}(v)\).

Because \(F_{ab}(v)\) is a quadratic function, the Taylor expansion of \(F_{ab}(v)\) at \(v_{ab}^{t}\) is

$$ \begin{array}{@{}rcl@{}} F_{ab}(v) = F_{ab}(v^{t}_{ab})+F^{\prime}_{ab}(v^{t}_{ab})(v-v^{t}_{ab})+(\sum\limits_{i=1}^{m}U_{ib}^{2}+\lambda \sum\limits_{i=1}^{n}(H_{ia}-B_{ia})^{2})(v-v^{t}_{ab})^{2}. \end{array} $$

For \(G(v, v^{t}_{ab}) \geqslant F_{ab}(v)\) to hold, we only need

$$ \begin{array}{@{}rcl@{}} \frac{(VU^{T}U+\lambda H^{T}HV+\lambda B^{T}BV)_{ab}}{v^{t}_{ab}} \geqslant \sum\limits_{i=1}^{m}U_{ib}^{2}+\lambda \sum\limits_{i=1}^{n}(H_{ia}-B_{ia})^{2}. \end{array} $$
(20)

We can rewrite (20) as

$$ \begin{array}{@{}rcl@{}} (VU^{T}U+\lambda H^{T}HV+\lambda B^{T}BV)_{ab} \geqslant v^{t}_{ab}(U^{T}U)_{bb}+\lambda (H^{T}H-H^{T}B-B^{T}H+B^{T}B)_{aa}v^{t}_{ab}. \end{array} $$
(21)

Because U, V, H and B are all nonnegative, we have

$$ \begin{array}{@{}rcl@{}} (VU^{T}U+\lambda H^{T}HV+\lambda B^{T}BV)_{ab} &\geqslant& v_{ab}^{t}(U^{T}U)_{bb}+\lambda (H^{T}H+B^{T}B)_{aa}v^{t}_{ab} \\ &\geqslant& v^{t}_{ab}(U^{T}U)_{bb}+\lambda (H^{T}H-H^{T}B-B^{T}H+B^{T}B)_{aa}v^{t}_{ab}. \end{array} $$

This means that (20) holds, and Lemma 2 is proved. \(_{\square}\)
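As a quick sanity check (not part of the paper), inequality (21) can be verified elementwise for random nonnegative matrices; the dimensions below are arbitrary:

```python
# Numeric spot-check of inequality (21) for random nonnegative U, V, H, B.
import numpy as np

rng = np.random.default_rng(0)
m, n, k, lam = 6, 8, 3, 0.5                    # arbitrary sizes and lambda
U, V = rng.random((m, k)), rng.random((n, k))
H, B = rng.random((n, n)), rng.random((n, n))

lhs = V @ (U.T @ U) + lam * (H.T @ H @ V + B.T @ B @ V)            # (n, k)
D = (H - B).T @ (H - B)                                            # (H-B)^T (H-B)
rhs = V * np.diag(U.T @ U)[None, :] + lam * np.diag(D)[:, None] * V
assert np.all(lhs >= rhs - 1e-12)              # (21) holds elementwise
```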

Now we give the proof of Theorem 1.

Proof: For each element in V, we can find its auxiliary function \(G(v, v^{t}_{ab})\). Because \(G(v, v^{t}_{ab})\) is a quadratic function of v, its first-order derivative is:

$$ \begin{array}{@{}rcl@{}} G^{\prime}(v, v^{t}_{ab})=F^{\prime}_{ab}(v_{ab}^{t})+\frac{2(VU^{T}U+\lambda H^{T}HV+\lambda B^{T}BV)_{ab}}{v_{ab}^{t}}(v-v^{t}_{ab}). \end{array} $$

Setting the derivative to zero yields the update rule:

$$ \begin{array}{@{}rcl@{}} v^{t+1}_{ab} &=& v^{t}_{ab}-\frac{v^{t}_{ab}}{2} \frac{F^{\prime}_{ab}(v_{ab}^{t})}{(VU^{T}U+\lambda H^{T}HV+\lambda B^{T}BV)_{ab}} \\ &=& v^{t}_{ab}-\frac{v^{t}_{ab}}{2} \frac{2(-{X^{P}}^{T}U+VU^{T}U+\lambda (H^{T}-B^{T})(H-B)V)_{ab}}{(VU^{T}U+\lambda H^{T}HV+\lambda B^{T}BV)_{ab}} \\ &=& \frac{({X^{P}}^{T}U+\lambda H^{T}BV+\lambda B^{T}HV)_{ab}}{(VU^{T}U+\lambda H^{T}HV+\lambda B^{T}BV)_{ab}}v^{t}_{ab}. \end{array} $$

This is the same as update rule (19). For U, the update rule can be proved in the same way.

This completes the proof of Theorem 1. \(_{\square}\)
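In matrix form the derived rule, together with a standard NMF-style update for U, is straightforward to implement. The sketch below assumes \(X^{P}\) is the pre-treated m-by-n data matrix and H, B are the n-by-n matrices appearing in (16); since this excerpt does not show the U rule, the plain NMF update for U is an assumption, justified by the regularizer being independent of U.

```python
# One round of multiplicative updates; the V-update is rule (19) from Theorem 1.
import numpy as np

def pnnmf_step(Xp, U, V, H, B, lam, eps=1e-12):
    """Xp: (m, n) pre-treated data; U: (m, k) basis; V: (n, k) coefficients;
    H, B: (n, n) matrices from the regularizer in (16); lam: lambda."""
    num_V = Xp.T @ U + lam * (H.T @ (B @ V) + B.T @ (H @ V))
    den_V = V @ (U.T @ U) + lam * (H.T @ (H @ V) + B.T @ (B @ V)) + eps
    V = V * num_V / den_V                       # rule (19)

    # Assumed standard NMF update for U (the regularizer does not involve U).
    U = U * (Xp @ V) / (U @ (V.T @ V) + eps)
    return U, V
```

Per Theorem 1, the V-step never increases the objective \(\|X^{P}-UV^{T}\|_{F}^{2}+\lambda\, tr(V^{T}(H-B)^{T}(H-B)V)\), which gives a cheap convergence check when iterating pnnmf_step.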

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article


Cite this article

Jia, M., Li, X. & Zhang, Y. An algorithm of non-negative matrix factorization with the nearest neighbor after per-treatments. Multimed Tools Appl 82, 30669–30688 (2023). https://doi.org/10.1007/s11042-023-14571-2

