Abstract
Methods for the Behavioral, Educational, and Social Sciences (MBESS; Kelley, 2007b) is an open source package for R (R Development Core Team, 2007b), an open source statistical programming language and environment. MBESS implements methods that are not widely available elsewhere, yet are especially helpful for the idiosyncratic techniques used within the behavioral, educational, and social sciences. The major categories of functions are those that relate to confidence interval formation for noncentralt, F, and Χ2 parameters, confidence intervals for standardized effect sizes (which require noncentral distributions), and sample size planning issues from the power analytic and accuracy in parameter estimation perspectives. In addition, MBESS contains collections of other functions that should be helpful to substantive researchers and methodologists. MBESS is a long-term project that will continue to be updated and expanded so that important methods can continue to be made available to researchers in the behavioral, educational, and social sciences.
Article PDF
Similar content being viewed by others
References
Algina, J., &Olejnik, S. (2000). Determining sample size for accurate estimation of the squared multiple correlation coefficient.Multivariate Behavioral Research,35, 119–136.
Bonett, D. G., &Wright, T. A. (2000). Sample size requirements for estimating Pearson, Kendall and Spearman correlations.Psychometrika,65, 23–28.
Brown, B. W., Lovato, J., Russel, K., & Halvorsen, K. (2006). Asymptotic power package (Version 1.2) [Computer software and manual]. Retrievable from www.cran.r-project.org/.
Browne, M. W., &Cudeck, R. (1993). Alternative ways of assessing model fit. In K. A. Bollen & J. S. Long (Eds.),Testing structural equation models (pp. 136–162). Newbury Park, CA: Sage.
Champely, S. (2006). Power package (Version 1.0) [Computer software and manual]. Retrievable from www.cran.r-project.org/.
Cohen, J. (1988).Statistical power analysis for the behavioral sciences (2nd ed.). Hillsdale, NJ: Erlbaum.
Cohen, J. (1994). The earth is round (p<.05).American Psychologist,49, 997–1003.
Cumming, G., &Finch, S. (2001). A primer on the understanding, use, and calculation of confidence intervals that are based on central and noncentral distributions.Educational & Psychological Measurement,61, 532–574.
de Leeuw, J. (2006).R in psychometrics and psychometrics in R. Paper presented at the UseR! conference, Vienna.
Ding, C. G. (1996). On the computation of the distribution of the square of the sample multiple correlation coefficient.Computational Statistics & Data Analysis,22, 345–350.
Doran, H. C., &Lockwood, J. R. (2006). Fitting value-added models in R.Journal of Educational & Behavioral Statistics,31, 205–230.
Everitt, B. S. (2005).An R and S-Plus companion to multivariate analysis. New York: Springer.
Fox, J. (2002).An R and S-Plus companion to applied regression. Thousand Oaks, CA: Sage.
Gatsonis, C., &Sampson, A. R. (1989). Multiple correlation: Exact power and sample size calculations.Psychological Bulletin,106, 516–524.
Grissom, R. J., &Kim, J. J. (2005).Effect sizes for research: A broad practical approach. Mahwah, NJ: Erlbaum.
Hahn, G., &Meeker, W. (1991).Statistical intervals: A guide for practitioners. New York: Wiley.
Kelley, K. (2005). The effects of nonnormal distributions on confidence intervals around the standardized mean difference: Bootstrapping as an alternative to parametric confidence intervals.Educational & Psychological Measurement,65, 51–69.
Kelley, K. (2007a). Confidence intervals for standardized effect sizes: Theory, application, and implementation.Journal of Statistical Software,20, 1–24.
Kelley, K. (2007b). Methods for the Behavioral, Educational, and Social Sciences (MBESS) [Computer software and manual]. Retrievable from www.cran.r-project.org/.
Kelley, K. (2007c). Sample size planning for the coefficient of variation from the accuracy in parameter estimation approach.Behavior Research Methods,39, 755–766.
Kelley, K. (2007d). Sample size planning for the squared multiple correlation coefficient: Accuracy in parameter estimation via narrow confidence intervals. Manuscript submitted for publication.
Kelley, K., Lai, K., &Wu, P.-J. (2008). Using R for data analysis: A best practice for research. In J. Osbourne (Ed.),Best practices in quantitative methods (pp. 535–572). Newbury Park, CA: Sage.
Kelley, K., &Maxwell, S. E. (2003). Sample size for multiple regression: Obtaining regression coefficients that are accurate, not simply significant.Psychological Methods,8, 305–321.
Kelley, K., &Maxwell, S. E. (2008). Sample size planning for multiple regression: Power and accuracy for omnibus and targeted effects. In J. Brannon, P. Alasuutari, & L. Bickman (Eds.),Sage handbook of social research methods. Thousand Oaks, CA: Sage.
Kelley, K., Maxwell, S. E., &Rausch, J. R. (2003). Obtaining power or obtaining precision: Delineating methods of sample size planning.Evaluation & the Health Professions,26, 258–287.
Kelley, K., &Rausch, J. R. (2006). Sample size planning for the standardized mean difference: Accuracy in parameter estimation via narrow confidence intervals.Psychological Methods,11, 363–385.
Kraemer, H. C., &Thiemann, S. (1987).How many subjects? Beverly Hills, CA: Sage.
Kupper, L. L., &Hafner, K. B. (1989). How appropriate are popular sample size formulas.American Statistician,43, 101–105.
Lai, K., & Kelley, K. (2007).Sample size planning for standardized ANCOVA and ANOVA contrasts: Obtaining narrow confidence intervals. Manuscript submitted for publication.
Lee, Y. S. (1971). Tables of the upper percentage points of the multiple correlation.Biometrika,59, 175–189.
Lipsey, M. W. (1990).Design sensitivity: Statistical power for experimental research. Newbury Park, CA: Sage.
Meehl, P. E. (1997). The problem is epistemology, not statistics: Replace significance tests by confidence intervals and quantify accuracy of risky numerical predictions. In L. L. Harlow, S. A. Mulaik, & J. H. Steiger (Eds.),What if there were no significance tests? (pp. 393–426). Mahwah, NJ: Erlbaum.
Murphy, K. R., &Myors, B. (2004).Statistical power analysis: A simple and general model for traditional and modern hypothesis tests (2nd ed.). Mahwah, NJ: Erlbaum.
Pinheiro, J., Bates, D., DebRoy, S., & Sarkar, D. (2006). NLME: Linear and nonlinear mixed effects models [Computer software and manual]. Retrieved from www.cran.r-project.org/.
R Development Core Team (2007a). The comprehensive R archive network. Vienna: R Foundation for Statistical Computing. Available at www.cran.r-project.org.
R Development Core Team (2007b). R: A language and environment for statistical computing [Computer software and manual]. R Foundation for Statistical Computing. Retrieved from www.r-project.org.
R Development Core Team (2007c). R Project Internet home page: www.r-project.org/. R Foundation for Statistical Computing.
Rencher, A. C. (2000).Linear models in statistics. New York: Wiley.
Rosenthal, R., Rosnow, R. L., &Rubin, D. B. (2000).Contrasts and effect sizes in behavioral research: A correlational approach. New York: Cambridge University Press.
Sampson, A. R. (1974). A tale of two regressions. Journal of the American Statistical Association,69, 682–689.
Schmidt, F. L. (1996). Statistical significance testing and cumulative knowledge in psychology: Implications for training of researchers.Psychological Methods,1, 115–129.
Smithson, M. (2001). Correct confidence intervals for various regression effect sizes and parameters: The importance of noncentral distributions in computing intervals.Educational & Psychological Measurement,61, 605–632.
Smithson, M. (2003).Confidence intervals. Thousand Oaks, CA: Sage.
Steiger, J. H. (2004). Beyond the F test: Effect size confidence intervals and tests of close fit in the analysis of variance and contrast analysis.Psychological Methods,9, 164–182.
Steiger, J. H., &Fouladi, R. T. (1997). Noncentrality interval estimation and the evaluation of statistical methods. In L. L. Harlow, S. A. Mulaik, & J. H. Steiger (Eds.),What if there were no significance tests? (pp. 221–257). Mahwah, NJ: Erlbaum.
Thompson, B. (2002). What future quantitative social science research could look like: Confidence intervals for effect sizes.Educational Researcher,31, 25–32.
Venables, W. N., Smith, D. M., & the R Development Core Team. (2007).An introduction to R. Vienna: R Development Core Team.
Wilkinson, L., &The American Psychological Association Task Force on Statistical Inference (1999). Statistical methods in psychology: Guidelines and explanations.American Psychologist,54, 594–604.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was sponsored in part by a Proffitt Fellowship for Educational Research.
Rights and permissions
About this article
Cite this article
Kelley, K. Methods for the Behavioral, Educational, and Social Sciences: An R package. Behavior Research Methods 39, 979–984 (2007). https://doi.org/10.3758/BF03192993
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03192993