Uncovering Cluster Structure and Group-Specific Associations: Variable Selection in Multivariate Mixture Regression Models

Tadesse, Mahlet G.; Mortier, Frédéric; Monni, Stefano

doi:10.1007/978-3-319-31323-8_21

Mahlet G. Tadesse²,
Frédéric Mortier³ &
Stefano Monni⁴

Part of the book series: Springer Proceedings in Mathematics & Statistics ((PROMS,volume 157))

955 Accesses

Abstract

Variable selection for mixture of regression models has been the focus of much research in recent years. These models combine the ideas of mixture models, regression models, and variable selection to uncover group structures and key relationships between data sets. The objective is to identify homogeneous groups of objects and determine the cluster-specific subsets of covariates modulating the outcomes. In this chapter we review frequentist and Bayesian methods we have proposed to address in a unified manner the problems of cluster identification and cluster-specific variable selection in the context of mixture of regression models. These methods have a wide range of applications, in particular in the context of high-dimensional data analysis. We illustrate their performance in two diverse areas: one in ecology for modeling species-rich ecosystems and the other in genomics for integrating data from different genomic sources.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Biernacki, C., Celeux, G., Govaert, G.: Assessing a mixture model for clustering with the integrated completed likelihood. IEEE Trans. Pattern Anal. Mach. Intell. 22, 719–725 (2000)
Article Google Scholar
George, E., McCulloch, R.: Approaches for Bayesian variable selection. Stat. Sin. 7, 339–373 (1997)
MATH Google Scholar
Geyer, C.: Markov chain Monte Carlo maximum likelihood. In: Keramigas, E. (ed.) Computing Science and Statistics, pp. 156–163. Interface Foundation, Fairfax (1991)
Google Scholar
Gupta, M., Ibrahim, J.G.: Variable selection in mixture modeling for the discovery of gene regulatory networks. J. Am. Stat. Assoc. 102, 867–880 (2007)
Article MathSciNet MATH Google Scholar
Khalili, A., Chen, J.: Variable selection in finite mixture of regression models. J. Am. Stat. Assoc. 102, 1025–1038 (2007)
Article MathSciNet MATH Google Scholar
Metropolis, N., Rosenbluth, A., Rosenbluth, M., Teller, A., Teller, E.: Equations of state calculations by fast computing machines. J. Chem. Phys. 21, 1087–1091 (1953)
Article Google Scholar
Monni, S., Tadesse, M.G.: A stochastic partitioning method to associate high-dimensional responses and covariates (with discussion). Bayesian Anal. 4, 413–464 (2009)
Article MathSciNet MATH Google Scholar
Morley, M., Molony, C.M., Weber, T.M., Devlin, J.L., Ewens, K.G., Spielman, R.S., Cheung, V.G.: Genetic analysis of genome-wide variation in human gene expression. Nature 430, 743–747 (2004)
Article Google Scholar
Mortier, F., Ouédraogo, D.-Y., Claeys, F., Tadesse, M.G., Cornu, G., Baya, F., Benedet, F., Freycon, V., Gourlet-Fleury, S., Picard, N.: Mixture of inhomogeneous matrix models for species-rich ecosystems. Environmetrics 26, 39–51 (2015)
Article MathSciNet Google Scholar
Städler, N., Bühlmann, P., van de Geer, S.: ℓ1-penalization for mixture regression models. Test 19, 209–256 (2010)
Article MathSciNet MATH Google Scholar
Zou, H.: The adaptive lasso and its oracle properties. J. Am. Stat. Assoc. 101, 1418–1429 (2006)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics and Statistics, Georgetown University, 37th & O Streets NW, Washington, DC, 20057, USA
Mahlet G. Tadesse
UPR Biens et Services des Ecosystémes Forestiers Tropicaux (B&SEF), CIRAD, TA C-105/D Campus International de Baillarguet, 34398, Montpellier Cedex 5, France
Frédéric Mortier
Department of Mathematics, American University of Beirut, 11-0236, Riad El Solh, Beirut, 1107 2020, Lebanon
Stefano Monni

Authors

Mahlet G. Tadesse
View author publications
You can also search for this author in PubMed Google Scholar
Frédéric Mortier
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Monni
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mahlet G. Tadesse .

Editor information

Editors and Affiliations

Dept of Mathematics and Economics, Virginia State University, Petersburg, Virginia, USA
Bourama Toni

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tadesse, M.G., Mortier, F., Monni, S. (2016). Uncovering Cluster Structure and Group-Specific Associations: Variable Selection in Multivariate Mixture Regression Models. In: Toni, B. (eds) Mathematical Sciences with Multidisciplinary Applications. Springer Proceedings in Mathematics & Statistics, vol 157. Springer, Cham. https://doi.org/10.1007/978-3-319-31323-8_21

Download citation

DOI: https://doi.org/10.1007/978-3-319-31323-8_21
Published: 20 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-31321-4
Online ISBN: 978-3-319-31323-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics