Skip to main content
Log in

A distributed algorithm for fitting generalized additive models

  • Published:
Optimization and Engineering Aims and scope Submit manuscript

Abstract

Generalized additive models are an effective regression tool, popular in the statistics literature, that provides an automatic extension of traditional linear models to nonlinear systems. We present a distributed algorithm for fitting generalized additive models, based on the alternating direction method of multipliers (ADMM). In our algorithm the component functions of the model are fit independently, in parallel; a simple iteration yields convergence to the optimal generalized additive model. This is in contrast to the traditional approach of backfitting, where the component functions are fit sequentially. We illustrate the method on different classes of problems such as generalized additive, logistic, and piecewise constant models, with various types of regularization, including those that promote smoothness and sparsity.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  • Becker S, Bobin J, Candès EJ (2011a) NESTA: a fast and accurate first-order method for sparse recovery. SIAM J Imaging Sci 4(1):1–39

    Article  MathSciNet  MATH  Google Scholar 

  • Becker S, Candès EJ, Grant M (2011b) Templates for convex cone problems with applications to sparse signal recovery. Math Program Comput 3(3)

  • Bottomley C, Van Belle V, Pexsters A, Papageorghiou A, Mukri F, Kirk E, Van Huffel S, Timmerman D, Bourne T (2011) A model and scoring system to predict outcome of intrauterine pregnancies of uncertain viability. Ultrasound Obstet Gynecol 37(5):588–595

    Article  Google Scholar 

  • Boyd S, Vandenberghe L (2004) Convex optimization. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  • Boyd S, Parikh N, Chu E, Peleato B, Eckstein J (2010) Distributed optimization and statistical learning via the alternating direction method of multipliers. Found Trends Mach Learn 3(1):1–122

    Article  MATH  Google Scholar 

  • Douglas J, Rachford HH (1956) On the numerical solution of heat conduction problems in two and three space variables. Trans Am Math Soc 82:421–439

    Article  MathSciNet  MATH  Google Scholar 

  • Everett H (1963) Generalized Lagrange multiplier method for solving problems of optimum allocation of resources. Oper Res 11(3):399–417

    Article  MathSciNet  MATH  Google Scholar 

  • Friedman JH, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33(1):1–22

    Google Scholar 

  • Guisan A, Edwards T, Hastie T (2002) Generalized linear and generalized additive models in studies of species distributions: setting the scene. Ecol Model 157:89–100

    Article  Google Scholar 

  • Hastie T, Tibshirani R (1986) Generalized additive models. Stat Sci 1(3):297–318

    Article  MathSciNet  Google Scholar 

  • Hastie T, Tibshirani R (1995) Generalized additive models for medical research. In: Encyclopedia for biostatistics, vol 4, pp 187–196

    Google Scholar 

  • Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference and prediction. Springer, Berlin

    Book  Google Scholar 

  • Hegland M, McIntosh I, Turlach B (1999) A parallel solver for generalised additive models. Comput Stat Data Anal 31:377–396

    Article  MATH  Google Scholar 

  • Kim S-J, Koh K, Lustig M, Boyd S, Gorinevsky D (2007) An interior-point method for large-scale 1-regularized least squares. IEEE J Sel Top Signal Process 1(4):606–617

    Article  Google Scholar 

  • Reinsch CH (1967) Smoothing by spline functions. Numer Math 10:177–183

    Article  MathSciNet  MATH  Google Scholar 

  • Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc, Ser B, Stat Methodol 58(1):267–288

    MathSciNet  MATH  Google Scholar 

  • Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J R Stat Soc, Ser B, Stat Methodol 68:49–67

    Article  MathSciNet  MATH  Google Scholar 

  • Zhao P, Rocha G, Yu B (2009) The composite absolute penalties family for grouped and hierarchical variable selection. Ann Stat 37(6A):3468–3497

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Arezou Keshavarz.

Additional information

This material is based upon work supported by AFOSR grant FA9550-09-1-0704, by DARPA XDATA FA8750-12-2-0306, and by NASA grant NNX07AEIIA.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chu, E., Keshavarz, A. & Boyd, S. A distributed algorithm for fitting generalized additive models. Optim Eng 14, 213–224 (2013). https://doi.org/10.1007/s11081-013-9215-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11081-013-9215-9

Keywords

Navigation