New two-sample tests for skewed populations and their connection to theoretical power of Bootstrap-t test

Wang, Haiyan; Tong, Bo; Zhang, Huaiyu; Li, Xukun

doi:10.1007/s11749-017-0530-x

Original Paper
Published: 24 March 2017

TEST, Volume 26, pages 661–683 (2017)

Haiyan Wang¹ (ORCID: orcid.org/0000-0002-2523-0465), Bo Tong², Huaiyu Zhang¹ & Xukun Li¹

Abstract

Various tests are available to compare the means of two populations. Tests for skewed data, however, are not well studied, even though they are often needed in pharmaceutical studies and agricultural economics. In particular, no results are available for power and sample size calculation for a two-sample Bootstrap-t test in skewed populations. In this paper, we propose new, easy-to-compute tests and study their theoretical properties. The work starts with the derivation of a second-order Edgeworth expansion for the pooled two-sample t-statistic. New rejection regions are then formed from the Cornish–Fisher expansion of quantiles. The new tests account for the first-order and second-order population skewnesses that are ignored in the two-sample t test. We report the theoretical type I error accuracy and power of the newly proposed tests and of the large-sample t test. We also provide detailed conditions under which the proposed tests give better power than the two-sample large-sample test. Our new tests, \(\mathrm{TCF}_1\) and \(\mathrm{TCF}\), are asymptotically equivalent to the Bootstrap-t test up to \(O(N^{-1})\) and \(O(N^{-3/2})\), respectively. Compared with commonly used two-sample parametric and nonparametric tests, the new tests are computationally efficient, give better power for skewed data with moderate sample sizes, and provide sample size calculations to achieve a desired power at a given significance level. Empirical studies confirmed our theoretical results.
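
For readers unfamiliar with the benchmark mentioned in the abstract, the following is a minimal sketch of a generic two-sample Bootstrap-t test built on the pooled t-statistic. It is a textbook-style illustration and not the authors' \(\mathrm{TCF}_1\) or \(\mathrm{TCF}\) procedure, whose rejection regions are given only in the full paper. The function names `pooled_t` and `bootstrap_t_test`, the one-sided alternative (mean of x greater than mean of y), and the default of 2000 resamples are all assumptions made for this example.

```python
import numpy as np


def pooled_t(x, y, shift=0.0):
    """Pooled two-sample t-statistic for (mean(x) - mean(y) - shift)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    m, n = len(x), len(y)
    # Pooled variance estimate from the two unbiased sample variances.
    sp2 = ((m - 1) * np.var(x, ddof=1) + (n - 1) * np.var(y, ddof=1)) / (m + n - 2)
    return (x.mean() - y.mean() - shift) / np.sqrt(sp2 * (1.0 / m + 1.0 / n))


def bootstrap_t_test(x, y, n_boot=2000, seed=0):
    """One-sided bootstrap-t test of H0: mean(x) = mean(y) vs H1: mean(x) > mean(y).

    Returns the observed statistic and the bootstrap p-value.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    t_obs = pooled_t(x, y)
    observed_diff = x.mean() - y.mean()
    t_star = np.empty(n_boot)
    for b in range(n_boot):
        # Resample each group separately, with replacement.
        xb = rng.choice(x, size=len(x), replace=True)
        yb = rng.choice(y, size=len(y), replace=True)
        # Centre at the observed difference so the resampled statistic
        # mimics the null distribution of the studentized difference.
        t_star[b] = pooled_t(xb, yb, shift=observed_diff)
    p_value = float(np.mean(t_star >= t_obs))
    return t_obs, p_value


if __name__ == "__main__":
    # Illustrative right-skewed data (made up for this sketch).
    rng = np.random.default_rng(1)
    x = rng.exponential(scale=1.0, size=30) + 0.5
    y = rng.exponential(scale=1.0, size=30)
    print(bootstrap_t_test(x, y))
```

Each call costs `n_boot` recomputations of the statistic. That resampling cost is what a closed-form rejection region, such as those the abstract describes, avoids.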

Acknowledgements

The authors would like to thank the two anonymous referees whose comments have led to significant improvement of the paper.

Author information

Authors and Affiliations

Department of Statistics, Kansas State University, 101 Dickens Hall, Manhattan, KS, 66506, USA
Haiyan Wang, Huaiyu Zhang & Xukun Li
Biometrics, Data and Statistical Sciences, Research and Development Department R43V, AbbVie Inc., North Chicago, IL, 60064, USA
Bo Tong

Corresponding author

Correspondence to Haiyan Wang.

Additional information

Bo Tong and Huaiyu Zhang have contributed equally to this paper.

This work was partially supported by a grant from the Simons Foundation (#246077) to Haiyan Wang.

Electronic supplementary material

The electronic supplementary material consists of the following files.

Online Resource 1 - Regularity conditions (PDF 43KB).

Online Resource 2 - Technical Proof (PDF 88KB).

Online Resource 3 - Figures S1, S2, and details of simulation studies (PDF 146KB).

About this article

Cite this article

Wang, H., Tong, B., Zhang, H. et al. New two-sample tests for skewed populations and their connection to theoretical power of Bootstrap-t test. TEST 26, 661–683 (2017). https://doi.org/10.1007/s11749-017-0530-x

Received: 17 July 2016
Accepted: 16 March 2017
Published: 24 March 2017
Issue Date: September 2017
DOI: https://doi.org/10.1007/s11749-017-0530-x
