Abstract
Although there are many papers on variable selection methods based on mean model in the finite mixture of regression models, little work has been done on how to select significant explanatory variables in the modeling of the variance parameter. In this paper, we propose and study a novel class of models: a skew-normal mixture of joint location and scale models to analyze the heteroscedastic skew-normal data coming from a heterogeneous population. The problem of variable selection for the proposed models is considered. In particular, a modified Expectation-Maximization(EM) algorithm for estimating the model parameters is developed. The consistency and the oracle property of the penalized estimators is established. Simulation studies are conducted to investigate the finite sample performance of the proposed methodologies. An example is illustrated by the proposed methodologies.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
J Engel, A F Huele. A generalized linear modeling approach to robust design, Technometrics, 1996, 38(4): 365–373.
R E Park. Estimation with heteroscedastic error terms, Econometrica, 1966, 34: 888.
A C Harvey. Estimating regression models with multiplicative heteroscedasticity, Econometrica, 1976, 44(3): 461–465.
M Aitkin. Modelling variance heterogeneity in normal regression using GLIM, J R Statist Soc Ser C, 1987, 36(3): 332–339.
L C Wu, Z Z Zhang, D K Xu. Variable selection in joint mean and variance models of Box-Cox transformation, J Appl Statist, 2012, 39(12): 2543–2555.
L C Wu, Z Z Zhang, D K Xu. Variable selection in joint location and scale models of the skew-normal distribution, J Stat Comput Simul, 2013, 83(7): 1266–1278.
L C Wu, G L Tian, Y Q Zhang, T Ma. Variable selection in joint location, scale and skewness models with a skew-t-normal distribution, Stat Interface, 2017, 10(2): 217–277.
S M Goldfeld, R E Quandt. A Markov model for switching regression, J Econometrics, 1973, 1(1): 3–15.
W S DeSarbo, W L Cron. A maximum likelihood methodology for cluster wise linear regressions, J Classification, 1988, 5(2): 249–282.
P N Jones, G J McLachlan. Fitting finite mixture models in a regression context, Aust N Z J Stat, 1992, 34(2): 233–240.
K Jedidi, V Ramaswamy, W S DeSarbo. On estimating finite mixtures of multivariate regression and simultaneous equation models, Struct Equ Model, 1996, 3(3): 266–289.
G McLachlan, D Peel. Finite Mixture Models, Wiley, New York, 2000.
A Khalili, J Chen. Variable selection in finite mixture of regression models, J Am Statist Assoc, 2007, 102(479): 1025–1038.
A Khalili. New estimation and feature selection methods in mixture-of-experts models, Canad J Statist, 2010, 38(4): 519–539.
A Khalili, S L Lin. Regularization in finite mixture of regression models with diverging number of parameters, Biometrics, 2011, 69(2): 436–446.
A Khalili. An overview of the new feature selection methods in finite mixture of regression models, J Ital Stat Assoc, 2011, 10(2): 201–235.
Y T Du, A Khalili, G N Johanna, J S Russell. Simultaneous fixed and random effects selection in finite mixture of linear mixed-effects models, Canad J Statist, 2013, 41(4): 596–616.
E Ormoz, F Eskandari. Variable selection in finite mixture of semiparametric regression models, Comm Statist Theory Methods, 2016, 45(3): 695–711.
K J Lee, R B Chen, Y N Wu. Bayesian variable selection for finite mixture model of linear regressions, Comput Statist Data Anal, 2016, 95: 1–16.
A Khalili, J Chen, D A Stephens. Regularization and selection in Gaussian mixture of autoregressive models, Canad J Statist, 2017, 45(4): 356–374.
Q G Tang, J Karunamun. Robust variable selection for finite mixture regression models, Ann Inst Statist Math, 2018, 70(3): 489–521.
M Liu, T I Lin. A skew-normal mixture regression model, Educ Psychol Meas, 2014, 74(1): 139–162.
E Cepeda, D Gamerman. Bayesian modeling of variance heterogeneity in normal regression models, Braz J Probab Stat, 2001, 14: 207–221.
J T Taylor, A P Verbyla. Joint modelling of location and scale parameters of the t distribution, Statist Model, 2004, 4(2): 91–112.
Z Z Zhang, D R Wang. Simultaneous variable selection for heteroscedastic regression models, Sci China Math, 2011, 54(3): 515–530.
L C Wu, H Q Li. Variable selection for joint mean and dispersion models of the inverse Gaussian distribution, Metrika, 2012, 75(6): 795–808.
W Zhao, R Zhang. Variable selection of varying dispersion Student-t regression models, J Syst Sci Complex, 2015, 28(4): 961–977.
A Azzalini. A class of distributions which includes the normal ones, Scand J Stat, 1985, 12: 171–178.
H Teicher. Identifiability of finite mixtures, Ann Math Stat, 1963, 34: 1265–1269.
N Atienza, J Garcia-Heras, J M Munoz-Pichardo. A new condition for identifiability of finite mixture distributions, Metrika, 2006, 63(2): 215–221.
C E G Otiniano, P N Rathie, L C S M Ozelim. On the identifiability of finite mixture of Skew-Normal and Skew-t distributions, Stat Probabil Lett, 2015, 106: 103–108.
C Hennig. Identifiability of models for clusterwise linear regression, J Classification, 2000, 17(2): 273–296.
P Wang, M L Puterman, I Cockburn, N Le. Mixed Poisson regression models with covariate dependent rates, Biometrics, 1996, 52: 381–400.
J Fan, R Li. Variable selection via nonconvave penalized likelihood and its oracle properties, J Am Statist Assoc, 2001, 96(456): 1348–1360.
R Tibshirani. Regression shrinkage and selection via the lasso, J R Statist Soc Ser B, 1996, 58(1): 267–288.
A Antoniadis. Wavelets in statistics: A Review, J Ital Stat Assoc, 1997, 6(2): 97–130.
A P Dempster, N M Laird and D B Rubin. Maximum likelihood from incomplete data via the EM algorithm, J R Statist Soc Ser B, 1977, 39(1): 1–22.
R A Jacobs, M I Jordan, S J Nowlan and G E Hinton. Adaptive mixtures of local experts, Neural Comput, 1991, 3(1): 79–87.
Author information
Authors and Affiliations
Corresponding authors
Additional information
Supported by the National Natural Science Foundation of China(11861041).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the articles Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the articles Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wu, Lc., Yang, Sq. & Tao, Y. Variable selection for skew-normal mixture of joint location and scale models. Appl. Math. J. Chin. Univ. 36, 475–491 (2021). https://doi.org/10.1007/s11766-021-3774-x
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11766-021-3774-x