Skip to main content
Log in

Comparing Coefficients Across Subpopulations in Gaussian Mixture Regression Models

  • Published:
Journal of Agricultural, Biological and Environmental Statistics Aims and scope Submit manuscript

Abstract

When fitting a Gaussian mixture regression model to observed data, estimating a between-group contrast can be a practical issue. One can use the estimate to compare the effects of a particular covariate or a set of covariates across different subpopulations. By applying fiducial generalized pivotal quantities, a small-sample solution is proposed in this paper to obtain interval estimates of between-group contrasts. Specifically, a Markov chain Monte Carlo sampler, which takes the membership uncertainty of each individual into account, is designed to generate realizations from the target distributions for computing the required interval estimates. A plant virus transmission study is first introduced as a motivating example for the present study. Next, the observed data are analyzed to illustrate the proposed method. Based on the simulation results, it is further shown that the proposed method can maintain the empirical coverage rates sufficiently close to the nominal level.

Supplementary materials accompanying this paper appear on-line.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+
from $39.99 /Month
  • Starting from 10 chapters or articles per month
  • Access and download chapters and articles from more than 300k books and 2,500 journals
  • Cancel anytime
View plans

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Boiteau, G., Singh, M., Singh, R. P. Tai, G. C. C. and Turner, T. R. (1998). Rate of spread of PVY\(^n\) by alate Myzus persicae (Sulzer) from infected to healthy plants under laboratory conditions. Potato Research, 41, 335–344.

    Article  Google Scholar 

  • Chow, G. C. (1960). Tests of equality between sets of coefficients in two linear regressions. Econometrika, 28, 591–605.

    Article  MathSciNet  Google Scholar 

  • Dempster, A. P., Laird, N. M. and Rubin, D. B. (1977) Maximum likelihood from incomplete data via the EM algorithm. Journal of Royal Statistical Society. Series B, 39, 1–22.

    MathSciNet  MATH  Google Scholar 

  • Fisher, R. A. (1935). The fiducial argument in statistical inference. Annals of Eugenics, 6, 391–398.

    Article  Google Scholar 

  • Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A. and Rubin, D. B. (2013). Bayesian data analysis. 3rd edition. Chapman & Hall, London.

    MATH  Google Scholar 

  • Geweke, J. (1992). Evaluating the accuracy of sampling based approaches to the calculation of posterior moments. Bayesian Statistics, 4, 169–193.

    MathSciNet  Google Scholar 

  • Hannig, J. (2009). On generalized fiducial inference. Statistica Sinica, 19, 491–544.

    MathSciNet  MATH  Google Scholar 

  • Hannig, J., Iyer, H., Lai, R. C. S. and Lee, T. C. M. (2016). Generalized fiducial inference: a review and new results. Journal of the American Statistical Association, 111, 1346–1361.

    Article  MathSciNet  Google Scholar 

  • Hannig, J., Iyer, H. and Patterson, P. (2006). Fiducial generalized confidence intervals. Journal of the American Statistical Association, 101, 254–269.

    Article  MathSciNet  Google Scholar 

  • Hong, L., Ye, Z. S. and Ling, R. (2018). Environmental risk assessment of emerging contaminants using degradation data. Journal of Agricultural, Biological, and Environmental Statistics, 23, 390–409.

    Article  MathSciNet  Google Scholar 

  • Hurn, M., Justel, A. and Robert, C. (2003). Estimating mixtures of regressions. Journal of Computational and Graphical Statistics, 1, 55–79.

    Article  MathSciNet  Google Scholar 

  • Iyer, H. and Patterson, P. (2002). A recipe for constructing generalized pivotal quantities and generalized confidence intervals. Technical Report, Department of Statistics, Colorado State University.

  • Jasra, A., Holmes, C. C. and Stephens, D. A. (2005). Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modeling. Statistical Science, 20, 50–67.

    Article  MathSciNet  Google Scholar 

  • Kang, Q. and Vahl, C. I. (2014). Statistical analysis in the safety evaluation of genetically-modified crops: equivalence tests. Crop Science, 54, 2183–2200.

    Article  Google Scholar 

  • Khuri, A. I. (1986). Exact tests for comparison of correlated response model with an unknown dispersion matrix. Technometrics, 28, 347–357.

    Article  MathSciNet  Google Scholar 

  • Kim, S. H. and Cohen, A. S. (1998). On the Behrens-Fisher problem: a review. Journal of Educational and Behavioral Statistics, 23, 356–377.

    Article  Google Scholar 

  • Koschat, M. A. and Weerahandi, S. (1992). Chow-type tests under heteroscedasticity. Journal of Business and Economic Statistic, 10, 221–228.

    Google Scholar 

  • Lee, H. I., Chen, H., Kishino, H. and Liao, C. T. (2016). A reference population-based conformance proportion. Journal of Agricultural, Biological, and Environmental Statistics, 21, 684–697.

    Article  MathSciNet  Google Scholar 

  • Li, X. (2009). A generalized \(p\)-value approach for comparing the means of several log-normal populations. Statistics and Probability Letters, 79, 1404–1408.

    Article  MathSciNet  Google Scholar 

  • Louis, T. A. (1982). Finding the observed information matrix when using the EM algorithm. Journal of Royal Statistical Society. Series B, 44, 226–233.

    MathSciNet  MATH  Google Scholar 

  • McLachlan, G. J. and Peel, D. (2000). Finite mixture models. Wiley, New York.

    Book  Google Scholar 

  • Smith, P. J. and Choi, S. C. (1982). Simple tests to compare two dependent regression lines. Technometrics, 24, 123–126.

    Article  Google Scholar 

  • Tian, L. and Wu, J. (2007). Inferences on the mean response in a log-regression model: the generalized variable approach. Statistics in Medicine, 26, 5180–5188.

    Article  MathSciNet  Google Scholar 

  • Tsai, S. F. (2014). A generalized test variable approach for grain yield comparisons of rice. Journal of Applied Statistics, 41, 2627–2638.

    Article  MathSciNet  Google Scholar 

  • Tsui, K. W., and Weerahandi, S. (1989). Generalized \(p\)-values in significance testing of hypotheses in the presence of nuisance parameters. Journal of the American Statistical Association, 84, 602–607.

    MathSciNet  Google Scholar 

  • Turner, T. R. (2000). Estimating the propagation rate of a viral infection of potato plants via mixtures of regressions. Journal of Royal Statistical Society. Series C, 49, 371–384.

    Article  MathSciNet  Google Scholar 

  • van der Voet, H., Goedhart, P. W. and Schmidt, K. (2017). Equivalence testing using existing reference data: an example with genetically modified and conventional crops in animal feeding studies. Food and Chemical Toxicology, 109, 472–485.

    Article  Google Scholar 

  • Weerahandi, S. (1993). Generalized confidence intervals. Journal of the American Statistical Association, 88, 899–905.

    Article  MathSciNet  Google Scholar 

  • Zellner, A. (1962). An efficient method of estimating seemingly unrelated regressions and tests for aggregated bias. Journal of the American Statistical Association, 57, 348–375.

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

I would like to thank the editor, an associate editor and two anonymous referees for their insightful comments and constructive suggestions. I am also grateful to Computer and Information Networking Center, National Taiwan University for the support of high-performance computing facilities. Shin-Fu Tsai’s research is supported in part by Ministry of Science and Technology, Taiwan (Grant No. MOST 107-2118-M-002-002-MY2).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shin-Fu Tsai.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Appendix: Simulation Results

Appendix: Simulation Results

See Tables 4, 5, 6, 7, 8 and 9.

Table 4 Empirical coverage rates of 90% confidence intervals.
Table 5 Empirical coverage rates of 95% confidence intervals.
Table 6 Empirical coverage rates of 99% confidence intervals.
Table 7 Average width of 90% confidence intervals.
Table 8 Average width of 95% confidence intervals.
Table 9 Average width of 99% confidence intervals.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tsai, SF. Comparing Coefficients Across Subpopulations in Gaussian Mixture Regression Models. JABES 24, 610–633 (2019). https://doi.org/10.1007/s13253-019-00364-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13253-019-00364-4

Keywords