Abstract
When fitting a Gaussian mixture regression model to observed data, estimating a between-group contrast can be a practical issue. One can use the estimate to compare the effects of a particular covariate or a set of covariates across different subpopulations. By applying fiducial generalized pivotal quantities, a small-sample solution is proposed in this paper to obtain interval estimates of between-group contrasts. Specifically, a Markov chain Monte Carlo sampler, which takes the membership uncertainty of each individual into account, is designed to generate realizations from the target distributions for computing the required interval estimates. A plant virus transmission study is first introduced as a motivating example for the present study. Next, the observed data are analyzed to illustrate the proposed method. Based on the simulation results, it is further shown that the proposed method can maintain the empirical coverage rates sufficiently close to the nominal level.
Supplementary materials accompanying this paper appear on-line.


Similar content being viewed by others
References
Boiteau, G., Singh, M., Singh, R. P. Tai, G. C. C. and Turner, T. R. (1998). Rate of spread of PVY\(^n\) by alate Myzus persicae (Sulzer) from infected to healthy plants under laboratory conditions. Potato Research, 41, 335–344.
Chow, G. C. (1960). Tests of equality between sets of coefficients in two linear regressions. Econometrika, 28, 591–605.
Dempster, A. P., Laird, N. M. and Rubin, D. B. (1977) Maximum likelihood from incomplete data via the EM algorithm. Journal of Royal Statistical Society. Series B, 39, 1–22.
Fisher, R. A. (1935). The fiducial argument in statistical inference. Annals of Eugenics, 6, 391–398.
Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A. and Rubin, D. B. (2013). Bayesian data analysis. 3rd edition. Chapman & Hall, London.
Geweke, J. (1992). Evaluating the accuracy of sampling based approaches to the calculation of posterior moments. Bayesian Statistics, 4, 169–193.
Hannig, J. (2009). On generalized fiducial inference. Statistica Sinica, 19, 491–544.
Hannig, J., Iyer, H., Lai, R. C. S. and Lee, T. C. M. (2016). Generalized fiducial inference: a review and new results. Journal of the American Statistical Association, 111, 1346–1361.
Hannig, J., Iyer, H. and Patterson, P. (2006). Fiducial generalized confidence intervals. Journal of the American Statistical Association, 101, 254–269.
Hong, L., Ye, Z. S. and Ling, R. (2018). Environmental risk assessment of emerging contaminants using degradation data. Journal of Agricultural, Biological, and Environmental Statistics, 23, 390–409.
Hurn, M., Justel, A. and Robert, C. (2003). Estimating mixtures of regressions. Journal of Computational and Graphical Statistics, 1, 55–79.
Iyer, H. and Patterson, P. (2002). A recipe for constructing generalized pivotal quantities and generalized confidence intervals. Technical Report, Department of Statistics, Colorado State University.
Jasra, A., Holmes, C. C. and Stephens, D. A. (2005). Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modeling. Statistical Science, 20, 50–67.
Kang, Q. and Vahl, C. I. (2014). Statistical analysis in the safety evaluation of genetically-modified crops: equivalence tests. Crop Science, 54, 2183–2200.
Khuri, A. I. (1986). Exact tests for comparison of correlated response model with an unknown dispersion matrix. Technometrics, 28, 347–357.
Kim, S. H. and Cohen, A. S. (1998). On the Behrens-Fisher problem: a review. Journal of Educational and Behavioral Statistics, 23, 356–377.
Koschat, M. A. and Weerahandi, S. (1992). Chow-type tests under heteroscedasticity. Journal of Business and Economic Statistic, 10, 221–228.
Lee, H. I., Chen, H., Kishino, H. and Liao, C. T. (2016). A reference population-based conformance proportion. Journal of Agricultural, Biological, and Environmental Statistics, 21, 684–697.
Li, X. (2009). A generalized \(p\)-value approach for comparing the means of several log-normal populations. Statistics and Probability Letters, 79, 1404–1408.
Louis, T. A. (1982). Finding the observed information matrix when using the EM algorithm. Journal of Royal Statistical Society. Series B, 44, 226–233.
McLachlan, G. J. and Peel, D. (2000). Finite mixture models. Wiley, New York.
Smith, P. J. and Choi, S. C. (1982). Simple tests to compare two dependent regression lines. Technometrics, 24, 123–126.
Tian, L. and Wu, J. (2007). Inferences on the mean response in a log-regression model: the generalized variable approach. Statistics in Medicine, 26, 5180–5188.
Tsai, S. F. (2014). A generalized test variable approach for grain yield comparisons of rice. Journal of Applied Statistics, 41, 2627–2638.
Tsui, K. W., and Weerahandi, S. (1989). Generalized \(p\)-values in significance testing of hypotheses in the presence of nuisance parameters. Journal of the American Statistical Association, 84, 602–607.
Turner, T. R. (2000). Estimating the propagation rate of a viral infection of potato plants via mixtures of regressions. Journal of Royal Statistical Society. Series C, 49, 371–384.
van der Voet, H., Goedhart, P. W. and Schmidt, K. (2017). Equivalence testing using existing reference data: an example with genetically modified and conventional crops in animal feeding studies. Food and Chemical Toxicology, 109, 472–485.
Weerahandi, S. (1993). Generalized confidence intervals. Journal of the American Statistical Association, 88, 899–905.
Zellner, A. (1962). An efficient method of estimating seemingly unrelated regressions and tests for aggregated bias. Journal of the American Statistical Association, 57, 348–375.
Acknowledgements
I would like to thank the editor, an associate editor and two anonymous referees for their insightful comments and constructive suggestions. I am also grateful to Computer and Information Networking Center, National Taiwan University for the support of high-performance computing facilities. Shin-Fu Tsai’s research is supported in part by Ministry of Science and Technology, Taiwan (Grant No. MOST 107-2118-M-002-002-MY2).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Appendix: Simulation Results
Appendix: Simulation Results
See Tables 4, 5, 6, 7, 8 and 9.
Rights and permissions
About this article
Cite this article
Tsai, SF. Comparing Coefficients Across Subpopulations in Gaussian Mixture Regression Models. JABES 24, 610–633 (2019). https://doi.org/10.1007/s13253-019-00364-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13253-019-00364-4


