Skip to main content
Log in

New two-sample tests for skewed populations and their connection to theoretical power of Bootstrap-t test

  • Original Paper
  • Published:
TEST Aims and scope Submit manuscript

Abstract

Various tests are available to compare the means of two populations. Tests for skewed data, however, are not well studied even though they are often needed in pharmaceutical study and agricultural economics. In particular, there is no available result to give power and sample size calculation for a two-sample Bootstrap-t test in skewed populations. In this paper, we propose easy-to-compute new tests and study their theoretical properties. The proposed work starts with derivation of a second-order Edgeworth expansion for the pooled two-sample t-statistic. Then new test rejection regions are formed based on Cornish–Fisher expansion of quantiles. The new tests account for first-order and second-order population skewnesses that were ignored in two-sample t test. We report the theoretical type I error accuracy and power of the newly proposed tests and the large sample t test. We also provide the detailed conditions under which the proposed tests give better power than the two-sample large sample test. Our new tests, \(\hbox {TCF}_1\) and TCF, are asymptotically equivalent to Bootstrap-t test up to \(O(N^{-1})\) and \(O(N^{-3/2})\), respectively. Compared with commonly used two-sample parametric and nonparametric tests, the new tests are computationally efficient, give better power for skewed data with moderate sample size, and provide sample size calculation to achieve desired power at a given significance level. Empirical studies confirmed our theoretical results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

References

  • An L, Ahmed ES (2008) Improving the performance of kurtosis estimator. Comput Stat Data Anal 52:2669–2681

    Article  MathSciNet  MATH  Google Scholar 

  • Barndorff-Nielsen O, Hall P (1988) On the level-error after bartlett adjustment of the likelihood ratio statistic. Biometrika 75:374–378

    Article  MathSciNet  MATH  Google Scholar 

  • Beran R (1988) Prepivoting test statistics: A bootstrap view of asymptotic refinements. Journal of the American Statistical Association 83:682–697

    MathSciNet  MATH  Google Scholar 

  • Davison AC, Hinkley DV (1997) Bootstrap methods and their application. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

  • Efron B (1979) Bootstrap methods: another look at the jackknife. Ann Stat 7:1–26

    Article  MathSciNet  MATH  Google Scholar 

  • Efron B, Tibshirani R (1993) An introduction to the bootstrap. Chapman and Hall, New York

    Book  MATH  Google Scholar 

  • Fisher NI, Hall P (1990) On bootstrap hypothesis testing. Aust J Stat 32:177–190

    Article  MathSciNet  Google Scholar 

  • Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, Coller H, Loh ML, Downing JR, Caligiuri MA, Bloomfield CD, Lander ES (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286(5439):531–537

    Article  Google Scholar 

  • Hall P (1992) The bootstrap and Edgeworth expansion. Springer, New York

    Book  MATH  Google Scholar 

  • Hinkley D (1988) Bootstrap methods. J R Stat Soc B 50:321–337

    MathSciNet  MATH  Google Scholar 

  • Ott RL, Longnecker MT (2008) An introduction to statistical methods and data analysis. Duxbury Press, Michigan

    Google Scholar 

  • Phillip D, Zhou XH (2006) Nonparametric statistical methods for cost-effectiveness analyses. Biometrics 62:576–588

    Article  MathSciNet  MATH  Google Scholar 

  • Shao J, Tu D (1995) The jackknife and bootstrap. Springer, New York, NY

    Book  MATH  Google Scholar 

  • Tibshirani R, Hastie T, Narasimhan B, Chu G (2002) Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc Natl Acad Sci USA 99:6567–6572

    Article  Google Scholar 

  • Xu J (2010) Asymptotic expansion of the non null distribution of the two-sample t-statistic under non normality with application in power comparison. Commun Statist Theory Methods 39:1915–1921

    Article  MathSciNet  MATH  Google Scholar 

  • Xu J, Cui X, Gupta AK (2009) Improved statistics for contrasting means of two samples under non-normality. Br J Math Stat Psychol 62:21–40

    Article  MathSciNet  Google Scholar 

  • Zhou XH, Philip D (2005) Nonparametric confidence intervals for the one- and two-sample problems. Biostatistics 6:187–200

    Article  MATH  Google Scholar 

Download references

Acknowledgements

The authors would like to thank the two anonymous referees whose comments have led to significant improvement of the paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Haiyan Wang.

Additional information

Bo Tong and Huaiyu Zhang have contributed equally to this paper.

This work was partially supported by a grant by Simons foundation (#246077) to Haiyan Wang.

Electronic supplementary material

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, H., Tong, B., Zhang, H. et al. New two-sample tests for skewed populations and their connection to theoretical power of Bootstrap-t test. TEST 26, 661–683 (2017). https://doi.org/10.1007/s11749-017-0530-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11749-017-0530-x

Keywords

Mathematics Subject Classification

Navigation