Classical Monofactorial (Parametric and Non-parametric) Tests

Brezina, Vaclav

doi:10.1007/978-3-030-46216-1_20


Abstract

This chapter focuses on the use of monofactorial statistical tests to compare two or more groups of speakers or two or more corpora. It starts with a discussion of the Null-Hypothesis Significance Testing Procedure (NHSTP) and its applications to corpus data. The chapter then presents seven statistical procedures, showing their principles, equations, and underlying assumptions: the chi-squared test, the t-test, the Mann-Whitney U test, ANOVA, the Kruskal-Wallis test, Pearson’s correlation and non-parametric correlations. Effect sizes and confidence intervals are also discussed. Particular attention is paid to the distinction between parametric and non-parametric tests and their respective assumptions. The ‘Practical Guide with R’ section offers readers a step-by-step guide to running the tests discussed in the chapter in the statistical package R.

The writing of this chapter has been supported by the UK Economic and Social Research Council (grants ES/R008906/1 and EP/P001559/1).


Notes

1. Relative frequencies are used because speaker samples are of unequal sizes (number of tokens). They are calculated as (absolute frequency / number of tokens in the sample) × basis for normalisation (e.g. 1000).
2. An independent variable (also known as an explanatory variable or a predictor variable) is a variable that is used to explain linguistic patterns measured as dependent variables. In corpus research, independent variables are typically related to the context (like the genre in this example) or to speaker characteristics (gender, age, etc.).
3. If we have two or more samples from each speaker and are interested in the difference in their language between sample 1, sample 2, etc. (e.g. linguistic change or development), we are dealing with a so-called repeated measures design, which requires a different version of the statistic (Crowder 1990).
4. Generally, the cut-off point depends on how much chance we are comfortable with in our discipline. Without testing the whole population (which is usually impracticable), we will always operate in the realm of probability (p-values) and will never have 100% certainty. Imagine that you have a jar full of sweets of different flavours, some of which you like and some of which you dislike. Would you be willing to pick one at random? Would your answer change if you knew that one of these sweets was poisoned?
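
The normalisation described in Note 1 can be sketched in a few lines of Python. This is only an illustration: the function name `relative_frequency`, the speaker labels and all the counts are hypothetical and do not come from the chapter, which uses R for its own practical guide.

```python
def relative_frequency(absolute_frequency, sample_tokens, basis=1000):
    """Normalise an absolute frequency to a rate per `basis` tokens."""
    if sample_tokens <= 0:
        raise ValueError("sample_tokens must be positive")
    # Multiply before dividing: absolute frequency / tokens x basis.
    return absolute_frequency * basis / sample_tokens


# Hypothetical speaker samples of unequal size (illustrative numbers only).
samples = {
    "speaker_A": {"hits": 12, "tokens": 4000},
    "speaker_B": {"hits": 30, "tokens": 15000},
}

relative = {
    name: relative_frequency(s["hits"], s["tokens"])
    for name, s in samples.items()
}

for name, value in relative.items():
    print(f"{name}: {value:.2f} per 1000 tokens")
```

In this made-up example speaker B produces more hits in absolute terms, yet has the lower relative frequency once the difference in sample size is taken into account, which is why the normalised values are the ones to compare.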

References

Balakrishnan, N., Voinov, V., & Nikulin, M. S. (2013). Chi-squared goodness of fit tests with applications. Oxford: Academic.
Boneau, C. A. (1960). The effects of violations of assumptions underlying the t test. Psychological Bulletin, 57(1), 49–64. https://doi.org/10.1037/h0041412.
Bonett, D. G., & Wright, T. A. (2000). Sample size requirements for estimating Pearson, Kendall and Spearman correlations. Psychometrika, 65(1), 23–28. https://doi.org/10.1007/BF02294183.
Brezina, V. (2018). Statistics in corpus linguistics: A practical guide. Cambridge: Cambridge University Press.
Brezina, V., & Meyerhoff, M. (2014). Significant or random? A critical review of sociolinguistic generalisations based on large corpora. International Journal of Corpus Linguistics, 19(1), 1–28. https://doi.org/10.1075/ijcl.19.1.01bre.
Brezina, V., McEnery, T., & Wattam, S. (2015). Collocations in context: A new perspective on collocation networks. International Journal of Corpus Linguistics, 20(2), 139–173. https://doi.org/10.1075/ijcl.20.2.01bre.
Cabin, R. J., & Mitchell, R. J. (2000). To Bonferroni or not to Bonferroni: When and how are the questions. Bulletin of the Ecological Society of America, 81(3), 246–248.
Cohen, J. (1988). Statistical power analysis for the behavioural sciences (2nd ed.). Hillsdale: Erlbaum.
Cohen, J. (1995). The earth is round (p < 0.05): Rejoinder. American Psychologist, 50(12), 1103. https://doi.org/10.1037/0003-066X.50.12.1103.
Cooper, H. M., Hedges, L. V., & Valentine, J. C. (Eds.). (2009). The handbook of research synthesis and meta-analysis (pp. 103–126). New York: Russell Sage Foundation.
Corder, G. W., & Foreman, D. I. (2009). Nonparametric statistics for non-statisticians: A step-by-step approach. New York: Wiley.
Cox, D. R., & Donnelly, C. A. (2011). Principles of applied statistics. Cambridge: Cambridge University Press.
Crowder, M. J. (1990). Analysis of repeated measures. New York: Chapman and Hall.
de Winter, J. C. F., Gosling, S. D., & Potter, J. (2016). Comparing the Pearson and Spearman correlation coefficients across distributions and sample sizes: A tutorial using simulations and empirical data. Psychological Methods, 21(3), 273–290. https://doi.org/10.1037/met0000079.
Duncan, G. T., & Layard, M. W. J. (1973). A Monte-Carlo study of asymptotically robust tests for correlation coefficients. Biometrika, 60(3), 551–558. https://doi.org/10.2307/2335004.
Everitt, B. (2006). The Cambridge dictionary of statistics (3rd ed.). Cambridge: Cambridge University Press.
Fisher, R. A. (1935). The design of experiments. London: Oliver and Boyd.
Gayen, A. K. (1951). The frequency distribution of the product moment correlation in random samples of any size drawn from non-normal universes. Biometrika, 38, 219–247. https://doi.org/10.2307/2332329.
Greenwood, P. E., & Nikulin, M. S. (1996). A guide to chi-squared testing (Vol. 280). New York: Wiley.
Gries, S. T. (2013). Statistics for linguistics with R: A practical introduction (2nd ed.). Berlin: Mouton de Gruyter.
Kapadia, A. S., Chan, W., & Moyé, L. (2005). Mathematical statistics with applications. Boca Raton: CRC Press.
Kepes, S., Banks, G. C., & Oh, I. S. (2014). Avoiding bias in publication bias research: The value of “null” findings. Journal of Business and Psychology, 29(2), 183–203. https://doi.org/10.1007/s10869-012-9279-0.
Kerby, D. S. (2014). The simple difference formula: An approach to teaching nonparametric correlation. Innovative Teaching, 3, 1–9. https://doi.org/10.2466/11.IT.3.1.
Kilgarriff, A. (2005). Language is never, ever, ever, random. Corpus Linguistics and Linguistic Theory, 1(2), 263–275. https://doi.org/10.1515/cllt.2005.1.2.263.
Kowalski, C. J. (1972). On the effects of non-normality on the distribution of the sample product moment correlation coefficient. Applied Statistics, 21(1), 1–12. https://doi.org/10.2307/2346598.
Kruskal, W. H., & Wallis, W. A. (1952). Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association, 47(260), 583–621. https://doi.org/10.2307/2280779.
Lumley, T., Diehr, P., Emerson, S., & Chen, L. (2002). The importance of the normality assumption in large public health data sets. Annual Review of Public Health, 23(1), 151–169. https://doi.org/10.1146/annurev.publhealth.23.100901.140546.
Mann, H. B., & Whitney, D. R. (1947). On a test of whether one of two random variables is stochastically larger than the other. Annals of Mathematical Statistics, 18(1), 50–60. https://doi.org/10.1214/aoms/1177730491.
Miller, R. G. (1997). Beyond ANOVA: Basics of applied statistics. New York: Chapman and Hall/CRC.
Plonsky, L. (2015). Statistical power, p values, descriptive statistics, and effect sizes: A “back-to-basics” approach to advancing quantitative methods in L2 research. In Advancing quantitative methods in second language research (pp. 43–65). London: Routledge.
Rasch, D., Kubinger, K. D., & Moder, K. (2011). The two-sample t test: Pre-testing its assumptions does not pay off. Statistical Papers, 52(1), 219–231. https://doi.org/10.1007/s00362-009-0224-x.
Rayson, P., Berridge, D., & Francis, B. (2004). Extending the Cochran rule for the comparison of word frequencies between corpora. In: Proceedings from 7th international conference on statistical analysis of textual data, JADT 2004, pp. 926–936.
Ruxton, G. D., & Neuhäuser, M. (2010). When should we use one-tailed hypothesis testing? Methods in Ecology and Evolution, 1(2), 114–117. https://doi.org/10.1111/j.2041-210X.2010.00014.x.
Schmider, E., Ziegler, M., Danay, E., Beyer, L., & Bühner, M. (2010). Is it really robust? Reinvestigating the robustness of ANOVA against violations of the normal distribution assumption. Methodology: European Journal of Research Methods for the Behavioral and Social Sciences, 6(4), 147–151. https://doi.org/10.1027/1614-2241/a000016.
Schönbrodt, F. D., & Perugini, M. (2013). At what sample size do correlations stabilize? Journal of Research in Personality, 47(5), 609–612. https://doi.org/10.1016/j.jrp.2013.05.009.
Schucany, W. R., & Tony Ng, H. K. (2006). Preliminary goodness-of-fit tests for normality do not validate the one-sample Student t. Communications in Statistics-Theory and Methods, 35(12), 2275–2286. https://doi.org/10.1080/03610920600853308.
Shaffer, J. P. (1995). Multiple hypothesis testing. Annual Review of Psychology, 46(1), 561–584.
Sherman, G. R. (1954). The “Student” T test when applied to non-normal populations. Department of Statistics, Stanford University.
Sheskin, D. (2004). Handbook of parametric and nonparametric statistical procedures (3rd ed.). London: Chapman & Hall/CRC.
Shingala, M. C., & Rajyaguru, A. (2015). Comparison of post hoc tests for unequal variance. International Journal of New Technologies in Science and Engineering, 2(5), 22–33.
Sprent, P. (2011). Fisher exact test. In M. Lovric (Ed.), International encyclopedia of statistical science (pp. 524–525). Berlin/Heidelberg: Springer.
Trafimow, D., & Marks, M. (2015). Editorial. Basic and Applied Social Psychology, 37, 1–2.
Upton, G. J. G. (1992). Fisher’s exact test. Journal of the Royal Statistical Society. Series A (Statistics in Society), 155(3), 395–402. https://doi.org/10.2307/2982890.
Yates, F. (1934). Contingency table involving small numbers and the χ² test. Supplement to the Journal of the Royal Statistical Society, 1(2), 217–235.
Zimmerman, D. W. (1998). Invalidation of parametric and nonparametric statistical tests by concurrent violation of two assumptions. The Journal of Experimental Education, 67(1), 55–68. https://doi.org/10.1080/00220979809598344.

Author information

Authors and Affiliations

Lancaster University, Lancaster, UK
Vaclav Brezina


Corresponding author

Correspondence to Vaclav Brezina.

Editor information

Editors and Affiliations

FNRS Centre for English Corpus Linguistics, Language and Communication Institute, UCLouvain, Louvain-la-Neuve, Belgium
Magali Paquot
Department of Linguistics, University of California, Santa Barbara, CA, USA
Stefan Th. Gries

1 Electronic Supplementary Materials

20_correlation (ZIP 8 kb)

About this chapter

Cite this chapter

Brezina, V. (2020). Classical Monofactorial (Parametric and Non-parametric) Tests. In: Paquot, M., Gries, S.T. (eds) A Practical Handbook of Corpus Linguistics. Springer, Cham. https://doi.org/10.1007/978-3-030-46216-1_20

DOI: https://doi.org/10.1007/978-3-030-46216-1_20
Published: 05 May 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-46215-4
Online ISBN: 978-3-030-46216-1
eBook Packages: Religion and Philosophy; Philosophy and Religion (R0)
