A Two-Step Penalized Regression Method with Networked Predictors

Luo, Chong; Pan, Wei; Shen, Xiaotong

doi:10.1007/s12561-011-9051-4

Published: 04 January 2012

Volume 4, pages 27–46 (2012)
Statistics in Biosciences

Chong Luo¹, Wei Pan¹ & Xiaotong Shen²

Abstract

Penalized regression incorporating the prior dependency structure of predictors can be effective in high-dimensional data analysis (Li and Li in Bioinformatics, 24:1175–1182, 2008). Pan et al. (Biometrics, 66:474–484, 2010) proposed a penalized regression method for better outcome prediction and variable selection by smoothing parameters over a given predictor network, which can be applied to the analysis of microarray data with a given gene network. In this paper, we develop two modifications to their method for further performance enhancement. First, we employ convex programming and show its improved performance over an approximate optimization algorithm implemented in their original proposal. Second, we perform bias reduction after initial variable selection through a new penalty, leading to better parameter estimates and outcome prediction. Simulations have demonstrated substantial performance improvement of the proposed modifications over the original method.
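
The two-step idea summarized in the abstract (select variables with a network-aware penalty, then reduce the shrinkage bias on the selected set) can be illustrated with a rough sketch. The abstract does not state the penalties used in either step, so everything below is an assumption made for illustration: step 1 substitutes an L1 plus graph-Laplacian penalty in the spirit of Li and Li (2008), solved by proximal gradient descent, and step 2 substitutes a plain least-squares refit on the selected predictors for the paper's new penalty. The function names, the toy chain network, and the tuning values `lam1` and `lam2` are all hypothetical.

```python
import numpy as np

def soft_threshold(z, t):
    """Elementwise soft-thresholding: the proximal operator of t * ||.||_1."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def step1_select(X, y, L, lam1, lam2, n_iter=5000):
    """Stand-in for step 1 (NOT the paper's penalty): minimize
       (1/2n)||y - Xb||^2 + lam1*||b||_1 + (lam2/2)*b'Lb
    by proximal gradient descent (ISTA)."""
    n, p = X.shape
    H = X.T @ X / n + lam2 * L               # Hessian of the smooth part
    step = 1.0 / np.linalg.eigvalsh(H)[-1]   # 1 / Lipschitz constant
    Xty = X.T @ y / n
    b = np.zeros(p)
    for _ in range(n_iter):
        b = soft_threshold(b - step * (H @ b - Xty), step * lam1)
    return b

def step2_refit(X, y, support):
    """Stand-in for step 2 (NOT the paper's penalty): unpenalized
    least squares restricted to the selected predictors."""
    b = np.zeros(X.shape[1])
    if support.any():
        b[support] = np.linalg.lstsq(X[:, support], y, rcond=None)[0]
    return b

# Toy data (hypothetical): 10 predictors, the first 4 are active and
# form a chain 0-1-2-3 in the predictor network.
rng = np.random.default_rng(0)
n, p = 200, 10
X = rng.standard_normal((n, p))
beta_true = np.zeros(p)
beta_true[:4] = 2.0
y = X @ beta_true + 0.5 * rng.standard_normal(n)

A = np.zeros((p, p))                         # adjacency matrix of the network
for i, j in [(0, 1), (1, 2), (2, 3)]:
    A[i, j] = A[j, i] = 1.0
L = np.diag(A.sum(axis=1)) - A               # unnormalized graph Laplacian

b1 = step1_select(X, y, L, lam1=0.3, lam2=0.1)
support = np.abs(b1) > 1e-8                  # variables kept by step 1
b2 = step2_refit(X, y, support)

err1 = np.abs(b1 - beta_true).max()          # worst-case error after step 1
err2 = np.abs(b2 - beta_true).max()          # worst-case error after step 2
```

In this toy setting the L1 term shrinks the active coefficients toward zero in step 1, and the refit in step 2 removes most of that shrinkage, which is the general motivation for a second, bias-reducing stage. A faithful implementation would replace both stand-ins with the penalties defined in the paper and would likely use a convex programming solver such as the CVX package cited in the reference list.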

References

Binder H, Schumacher M (2008) Comment on “Network-constrained regularization and variable selection for analysis of genomic data”. Bioinformatics 24:2566–2568
Bondell HD, Reich BJ (2008) Simultaneous regression shrinkage, variable selection, and supervised clustering of predictors with OSCAR. Biometrics 64:115–123
Efron B, Hastie T, Johnstone I, Tibshirani R (2004) Least angle regression. Ann Stat 32:407–499
Grant M, Boyd S, Ye Y (2010) CVX: Matlab software for disciplined convex programming. Available at http://www.stanford.edu/boyd/cvx
Higgins ME, Claremont M, Major JE, Sander C, Lash AE (2007) CancerGenes: a gene selection resource for cancer genome projects. Nucleic Acids Res 35 (Suppl 1):D721–D726
Horvath S, Zhang B, Carlson M, Lu KV, Zhu S, Felciano RM, Laurance MF, Zhao W, Shu Q, Lee Y, Scheck AC, Liau LM, Wu H, Geschwind DH, Febbo PG, Kornblum HI, Cloughesy TF, Nelson SF, Mischel PS (2006) Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular target. Proc Natl Acad Sci USA 103:17402–17407
Li C, Li H (2008) Network-constrained regularization and variable selection for analysis of genomic data. Bioinformatics 24:1175–1182
Li C, Li H (2010) Variable selection and regression analysis for graph-structured covariates with an application to genomics. Ann Appl Stat 4:1498–1516
Meinshausen N (2007) Relaxed Lasso. Comput Stat Data Anal 52:374–393
Meinshausen N, Bühlmann P (2010) Stability selection (with discussion). J R Stat Soc B 72:417–473
Pan W, Xie B, Shen X (2010) Incorporating predictor network in penalized regression with application to microarray data. Biometrics 66:474–484
Shen X, Pan W, Zhu Y (2011) Likelihood-based selection and sharp parameter estimation. To appear in JASA. Available on-line at http://www.sph.umn.edu/biostatistics/research/reports.asp
Tibshirani R (1996) Regression shrinkage and selection via the Lasso. J R Stat Soc B 58:267–288
Tibshirani R, Rosset S, Zhu J, Knight K (2005) Sparsity and smoothness via the fused Lasso. J R Stat Soc B 67:91–108
Wei Z, Li H (2007) A Markov random field model for network-based analysis of genomic data. Bioinformatics 23:1537–1544
Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J R Stat Soc B 68:49–67
Zhao P, Yu B (2004) Boosted Lasso. Tech rep, Dept of Statistics, UC-Berkeley
Zhu Y, Shen X, Pan W (2009) Network-based support vector machine for classification of microarray samples. BMC Bioinform 10(Suppl 1):S21
Zhou S (2010) Thresholded Lasso for high dimensional variable selection and statistical estimation. Manuscript
Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc Ser B 67:301–320

Author information

Authors and Affiliations

Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, 55455–0392, USA
Chong Luo & Wei Pan
School of Statistics, University of Minnesota, Minneapolis, MN, 55455, USA
Xiaotong Shen

Corresponding author

Correspondence to Wei Pan.

About this article

Cite this article

Luo, C., Pan, W. & Shen, X. A Two-Step Penalized Regression Method with Networked Predictors. Stat Biosci 4, 27–46 (2012). https://doi.org/10.1007/s12561-011-9051-4

Received: 11 April 2011
Accepted: 11 December 2011
Published: 04 January 2012
Issue Date: May 2012
DOI: https://doi.org/10.1007/s12561-011-9051-4
