Unsupervised feature selection via joint local learning and group sparse regression

Wu, Yue; Wang, Can; Zhang, Yue-qing; Bu, Jia-jun

doi:10.1631/FITEE.1700804

Unsupervised feature selection via joint local learning and group sparse regression

Published: 18 May 2019

Volume 20, pages 538–553, (2019)
Cite this article

Frontiers of Information Technology & Electronic Engineering Aims and scope Submit manuscript

Yue Wu^1,2,
Can Wang ORCID: orcid.org/0000-0002-5890-4307^1,2,
Yue-qing Zhang¹ &
…
Jia-jun Bu^1,2

122 Accesses
1 Citation
Explore all metrics

Abstract

Feature selection has attracted a great deal of interest over the past decades. By selecting meaningful feature subsets, the performance of learning algorithms can be effectively improved. Because label information is expensive to obtain, unsupervised feature selection methods are more widely used than the supervised ones. The key to unsupervised feature selection is to find features that effectively reflect the underlying data distribution. However, due to the inevitable redundancies and noise in a dataset, the intrinsic data distribution is not best revealed when using all features. To address this issue, we propose a novel unsupervised feature selection algorithm via joint local learning and group sparse regression (JLLGSR). JLLGSR incorporates local learning based clustering with group sparsity regularized regression in a single formulation, and seeks features that respect both the manifold structure and group sparse structure in the data space. An iterative optimization method is developed in which the weights finally converge on the important features and the selected features are able to improve the clustering results. Experiments on multiple real-world datasets (images, voices, and web pages) demonstrate the effectiveness of JLLGSR.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Unsupervised feature selection based on self-representation sparse regression and local similarity preserving

Article 18 December 2017

GOLFS: feature selection via combining both global and local information for high dimensional clustering

Article 03 August 2023

Unsupervised feature selection via local structure learning and sparse learning

Article 28 November 2017

References

Belkin M, Niyogi P, 2001. Laplacian eigenmaps and spectral techniques for embedding and clustering. 14^th Int Conf on Neural Information Processing Systems: Natural and Synthetic, p. 585–591.
Google Scholar
Bellman RE, 1961. Adaptive Control Processes: a Guided Tour. Princeton University Press, Princeton, NJ.
Book MATH Google Scholar
Cai D, Zhang C, He X, 2010. Unsupervised feature selection for multi-cluster data. 16^th Int Conf on Knowledge Discovery and Data Mining, p. 333–342. https://doi.org/10.1145/1835804.1835848
Google Scholar
Chang XJ, Nie FP, Yang Y, et al., 2016. Convex sparse PCA for unsupervised feature learning. ACM Trans Knowl Dis Data, 11(1):3. https://doi.org/10.1145/2910585
Google Scholar
Cheung Y, Zeng H, 2009. Local kernel regression score for selecting features of high-dimensional data. IEEE Trans Knowl Data Eng, 21(12):1798–1802. https://doi.org/10.1109/TKDE.2009.23
Article Google Scholar
Doquire G, Verleysen M, 2013. Mutual information-based feature selection for multilabel classification. Neuro-computing, 122:148–155. https://doi.org/10.1016/j.neucom.2013.06.035
MATH Google Scholar
Du L, Shen YD, 2015. Unsupervised feature selection with adaptive structure learning. 21^st Int Conf on Knowledge Discovery and Data Mining, p. 209–218. https://doi.org/10.1145/2783258.2783345
Google Scholar
Fanty M, Cole R, 1990. Spoken letter recognition. Conf on Advances in Neural Information Processing Systems, p. 220–226. https://doi.org/10.3115/116580.116725
Google Scholar
Georghiades AS, Belhumeur PN, Kriegman DJ, 2001. From few to many: illumination cone models for face recognition under variable lighting and pose. IEEE Trans Patt Anal Mach Intell, 23(6):643–660. https://doi.org/10.1109/34.927464
Article Google Scholar
Guyon I, Elisseeff A, 2003. An introduction to variable and feature selection. J Mach Learn Res, 3:1157–1182. https://doi.org/10.1162/153244303322753616
MATH Google Scholar
Guyon I, Weston J, Barnhill S, et al., 2002. Gene selection for cancer classification using support vector machines. Mach Learn, 46(1–3):389–422. https://doi.org/10.1023/A:1012487302797
Article MATH Google Scholar
Han YH, Wu F, Tian Q, et al., 2012. Image annotation by input-output structural grouping sparsity. IEEE Trans Image Proc, 21(6):3066–3079. https://doi.org/10.1109/TIP.2012.2183880
Article MathSciNet MATH Google Scholar
Han YH, Yang Y, Yan Y, et al., 2015. Semisupervised feature selection via spline regression for video semantic recognition. IEEE Trans Neur Netw Learn Syst, 26(2):252–264. https://doi.org/10.1109/TNNLS.2014.2314123
Article MathSciNet Google Scholar
He X, Niyogi P, 2004. Locality preserving projections. Conf on Advances in Neural Information Processing Systems, p. 153–160.
Google Scholar
He X, Cai D, Niyogi P, 2005. Laplacian score for feature selection. Conf on Advances in Neural Information Processing Systems, p. 507–514.
Google Scholar
Hou CP, Nie FP, Li XL, et al., 2014. Joint embedding learning and sparse regression: a framework for unsupervised feature selection. IEEE Trans Cybern, 44(6):793–804. https://doi.org/10.1109/TCYB.2013.2272642
Article Google Scholar
Hull JJ, 1994. A database for handwritten text recognition research. IEEE Trans Patt Anal Mach Intell, 16(5):550–554. https://doi.org/10.1109/34.291440
Article Google Scholar
Jiang Y, Ren JT, 2011. Eigenvalue sensitive feature selection. 28^th Int Conf on Machine Learning, p. 89–96.
Google Scholar
Jolliffe IT, 2002. Principal Component Analysis (2^nd Ed.). Springer, New York.
MATH Google Scholar
Krizhevsky A, 2009. Learning Multiple Layers of Features from Tiny Images. Science Department, University of Toronto, Tech, Toronto.
Google Scholar
Kuhn HW, 1955. The Hungarian method for the assignment problem. Nav Res Log Q, 2(1–2):83–97. https://doi.org/10.1002/nav.3800020109
Article MathSciNet MATH Google Scholar
Lee KC, Ho J, Kriegman DJ, 2005. Acquiring linear subspaces for face recognition under variable lighting. IEEE Trans Patt Anal Mach Intell, 27(5):684–698. https://doi.org/10.1109/TPAMI.2005.92
Article Google Scholar
Luo MN, Nie FP, Chang XJ, et al., 2018. Adaptive unsupervised feature selection with structure regularization. IEEE Trans Neur Netw Learn Syst, 29(4):944–956. https://doi.org/10.1109/TNNLS.2017.2650978
Article Google Scholar
Munkres J, 1957. Algorithms for the assignment and transportation problems. J Soc Ind Appl Math, 5(1):32–38. https://doi.org/10.1137/0105003
Article MathSciNet MATH Google Scholar
Nie FP, Xiang SM, Jia YQ, et al., 2008. Trace ratio criterion for feature selection. 23^rd Int Conf on Artificial Intelligence, p. 671–676.
Google Scholar
Nie FP, Xiang SM, Song YQ, et al., 2009. Orthogonal locality minimizing globality maximizing projections for feature extraction. Opt Eng, 48(1):017202. https://doi.org/10.1117/1.3067869
Article Google Scholar
Nie FP, Huang H, Cai X, et al., 2010a. Efficient and robust feature selection via joint l _2,1-norms minimization. 23^rd Int Conf on Neural Information Processing Systems, p. 1813–1821.
Google Scholar
Nie FP, Xu D, Tsang IWH, et al., 2010b. Flexible manifold embedding: a framework for semi-supervised and unsupervised dimension reduction. IEEE Trans Image Proc, 19(7):1921–1932. https://doi.org/10.1109/TIP.2010.2044958
Article MathSciNet MATH Google Scholar
Nie FP, Zeng ZN, Tsang IW, et al., 2011. Spectral embedded clustering: a framework for in-sample and out-of-sample spectral clustering. IEEE Trans Neur Netw, 22(11):1796–1808. https://doi.org/10.1109/TNN.2011.2162000
Article Google Scholar
Nie FP, Wang XQ, Jordan MI, et al., 2016a. The constrained Laplacian rank algorithm for graph-based clustering. 30^th AAAI Conf on Artificial Intelligence, p. 1969–1976.
Google Scholar
Nie FP, Zhu W, Li XI, 2016b. Unsupervised feature selection with structured graph optimization. 30^th AAAI Conf on Artificial Intelligence, p. 1302–1308.
Google Scholar
Peng HC, Long FH, Ding C, 2005. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Patt Anal Mach Intell, 27(8):1226–1238. https://doi.org/10.1109/TPAMI.2005.159
Article Google Scholar
Roweis ST, Saul LK, 2000. Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500):2323–2326. https://doi.org/10.1126/science.290.5500.2323
Article Google Scholar
Sun YJ, Todorovic S, Goodison S, 2010. Local-learning-based feature selection for high-dimensional data analysis. IEEE Trans Patt Anal Mach Intell, 32(9):1610–1626. https://doi.org/10.1109/TPAMI.2009.190
Article Google Scholar
Tan MK, Wang L, Tsang IW, 2010. Learning sparse SVM for feature selection on very high dimensional datasets. 27^th Int Conf on Machine Learning, p. 1047–1054.
Google Scholar
Tenenbaum JB, de Silva V, Langford JC, 2000. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319–2323. https://doi.org/10.1126/science.290.5500.2319
Article Google Scholar
Tibshirani R, 1996. Regression shrinkage and selection via the Lasso. J R Stat Soc B, 58(1):267–288.
MathSciNet MATH Google Scholar
Verleysen M, 2003. Learning high-dimensional data. In: Ablameyko S, Goras L, Gori M (Eds.), Limitations and Future Trends in Neural Computation. IOS Press, Amsterdam, p. 141–162.
Google Scholar
Wang D, Nie FP, Huang H, 2014. Unsupervised feature selection via unified trace ratio formulation and K-means clustering (TRACK). European Conf on Machine Learning and Knowledge Discovery in Databases, p. 306–321. https://doi.org/10.1007/978-3-662-44845-8_20
Google Scholar
Wu Y, Wang C, Bu JJ, et al., 2016. Group sparse feature selection on local learning based clustering. Neurocom-puting, 171:1118–1130. https://doi.org/10.1016/j.neucom.2015.07.045
Article Google Scholar
Yang Y, Shen HT, Ma ZG, et al., 2011. l _2,1-norm regularized discriminative feature selection for unsupervised learning. 22^nd Int Joint Conf on Artificial Intelligence, p. 1589–1594. https://doi.org/10.5591/978-1-57735-516-8/ijcai11-267
Google Scholar
Zeng H, Cheung YM, 2009. Feature selection for local learning based clustering. 13^th Pacific-Asia Conf on Advances in Knowledge Discovery and Data Mining, p. 414–425. https://doi.org/10.1007/978-3-642-01307-2_38
Google Scholar
Zeng H, Cheung YM, 2011. Feature selection and kernel learning for local learning-based clustering. IEEE Trans Patt Anal Mach Intell, 33(8):1532–1547. https://doi.org/10.1109/TPAMI.2010.215
Article Google Scholar
Zhao Z, Liu H, 2007. Spectral feature selection for supervised and unsupervised learning. 24^th Int Conf on Machine Learning, p. 1151–1157. https://doi.org/10.1145/1273496.1273641
Google Scholar
Zou H, Hastie T, 2005. Regularization and variable selection via the elastic net. J R Stat Soc Ser B, 67(2):301–320. https://doi.org/10.1111/j.1467-9868.2005.00503.x
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

The experiment is supported by Cheng-wei YAO in the Experiment Center of the College of Computer Science and Technology, Zhejiang University.

Author information

Authors and Affiliations

Zhejiang Provincial Key Laboratory of Service Robot, College of Computer Science and Technology, Zhejiang University, Hangzhou, 310027, China
Yue Wu, Can Wang, Yue-qing Zhang & Jia-jun Bu
Joint Institute of Frontier Technologies, Alibaba-Zhejiang University, Hangzhou, 310027, China
Yue Wu, Can Wang & Jia-jun Bu

Authors

Yue Wu
View author publications
You can also search for this author in PubMed Google Scholar
Can Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yue-qing Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jia-jun Bu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Can Wang.

Additional information

Project supported by Alibaba-Zhejiang University Joint Institute of Frontier Technologies and Zhejiang Provincial Key Research and Development Plan (No. 2017C01012)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wu, Y., Wang, C., Zhang, Yq. et al. Unsupervised feature selection via joint local learning and group sparse regression. Frontiers Inf Technol Electronic Eng 20, 538–553 (2019). https://doi.org/10.1631/FITEE.1700804

Download citation

Received: 02 December 2017
Accepted: 09 March 2018
Published: 18 May 2019
Issue Date: April 2019
DOI: https://doi.org/10.1631/FITEE.1700804

Key words

CLC number

TP391.4

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Unsupervised feature selection via joint local learning and group sparse regression

Abstract

Access this article

Similar content being viewed by others

Unsupervised feature selection based on self-representation sparse regression and local similarity preserving

GOLFS: feature selection via combining both global and local information for high dimensional clustering

Unsupervised feature selection via local structure learning and sparse learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Key words

CLC number

Navigation

Unsupervised feature selection via joint local learning and group sparse regression

Abstract

Access this article

Similar content being viewed by others

Unsupervised feature selection based on self-representation sparse regression and local similarity preserving

GOLFS: feature selection via combining both global and local information for high dimensional clustering

Unsupervised feature selection via local structure learning and sparse learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Key words

CLC number

Search

Navigation