Confidence Intervals for Maximin Effects in Inhomogeneous Large-Scale Data

  • Dominik Rothenhäusler
  • Nicolai Meinshausen
  • Peter Bühlmann
Conference paper
Part of the Abel Symposia book series (ABEL, volume 11)

Abstract

One challenge of large-scale data analysis is that the assumption of an identical distribution for all samples is often not realistic. An optimal linear regression might, for example, be markedly different for distinct groups of the data. Maximin effects have been proposed as a computationally attractive way to estimate effects that are common across all data without fitting a mixture distribution explicitly. So far just point estimators of the common maximin effects have been proposed in Meinshausen and Bühlmann (Ann Stat 43(4):1801–1830, 2015). Here we propose asymptotically valid confidence regions for these effects.

Keywords

Confidence Region Actual Coverage Affine Projection Asymptotic Confidence Interval Ridge Penalty 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    Aitkin, M., Rubin, D.: Estimation and hypothesis testing in finite mixture models. J. R. Stat. Soc. Ser. B 47, 67–75 (1985)MATHGoogle Scholar
  2. 2.
    Breiman, L.: Bagging predictors. Mach. Learn. 24, 123–140 (1996)MathSciNetMATHGoogle Scholar
  3. 3.
    Bühlmann, P., Meinshausen, N.: Magging: maximin aggregation for inhomogeneous large-scale data. IEEE Proc. (2014, preprint). arxiv:1409.2638Google Scholar
  4. 4.
    DeSarbo, W., Cron, W.: A maximum likelihood methodology for clusterwise linear regression. J. Classif. 5, 249–282 (1988)CrossRefMathSciNetMATHGoogle Scholar
  5. 5.
    Fan, J., Zhang, W.: Statistical estimation in varying coefficient models. Ann. Stat. 27, 1491–1518 (1999)CrossRefMathSciNetMATHGoogle Scholar
  6. 6.
    Figueiredo, M., Jain, A.: Unsupervised learning of finite mixture models. IEEE Trans. Pattern Anal. Mach. Intell. 24, 381–396 (2002)CrossRefGoogle Scholar
  7. 7.
    Hastie, T., Tibshirani, R.: Varying-coefficient models. J. R. Stat. Soc. Ser. B 55, 757–796 (1993)MathSciNetMATHGoogle Scholar
  8. 8.
    McLachlan, G., Peel, D.: Finite Mixture Models. Wiley, New York (2004)Google Scholar
  9. 9.
    Meinshausen, N., Bühlmann, P.: Maximin effects in inhomogeneous large-scale data. Ann. Stat. 43(4), 1801–1830 (2015)CrossRefGoogle Scholar
  10. 10.
    Pinheiro, J., Bates, D.: Mixed-Effects Models in S and S-PLUS. Springer, New York (2000)CrossRefMATHGoogle Scholar
  11. 11.
    Van de Geer, S., Bühlmann, P., Ritov, Y., Dezeure, R.: On asymptotically optimal confidence regions and tests for high-dimensional models. Ann. Stat. 42(3), 1166–1202 (2014)CrossRefMATHGoogle Scholar
  12. 12.
    Zhang, C.H., Zhang, S.S.: Confidence intervals for low dimensional parameters in high dimensional linear models. J. R. Stat. Soc. Ser. B (Stat Methodol.) 76(1), 217–242 (2014)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • Dominik Rothenhäusler
    • 1
  • Nicolai Meinshausen
    • 1
  • Peter Bühlmann
    • 1
  1. 1.Seminar für StatistikETH ZürichZürichSwitzerland

Personalised recommendations