
Combining reaction-time distributions to conserve shape

Behavior Research Methods

Abstract

To improve the estimate of the shape of a reaction-time distribution, it is sometimes desirable to combine several samples, drawn from different sessions or different subjects. How should these samples be combined? This paper provides an evaluation of four combination methods: two that are currently in use (the bin-means histogram, often called "Vincentizing", and quantile averaging) and two that are new (linear-transform pooling and shape averaging). The evaluation makes use of a modern method for describing the shape of a distribution, based on L-moments, rather than the traditional method, based on central moments. Also provided is an introduction to shape descriptors based on L-moments, whose advantages over central moments (less bias and less sensitivity to outliers) are demonstrated. Whether traditional or modern shape descriptions are employed, the combination methods currently in use, especially the bin-means histogram, prove to be substantially inferior to the new methods. Averaged bin means themselves are less deficient when estimating differences between distribution shapes, as in delta plots, but are nonetheless inferior to linear-transform pooling.


Code availability

R code for the simulations will be made available by the author to anyone who plans to pursue these investigations further.

Notes

  1. The meaning of "kurtosis", when it is defined traditionally as the standardized fourth central moment, is unclear (Westfall, 2014). The interpretation is clearer when it is defined in terms of L-moments, discussed below and in Appendix A. The corresponding density functions in Panels A and B of Fig. D1 have the same values of L-skewness, but differ in L-kurtosis, with values of L-kurtosis smaller in Panel B. The light curve in Panel A has about the same L-kurtosis as the broken curve in Panel C (0.175 vs. 0.172), but differs in L-skewness (0.100 vs. 0.172).

  2. Such speech detection variations can be as great as 100 ms, larger than the word-frequency effect often of primary interest (see Table 2 in Andrews & Heathcote, 2001, and Table 4 in Spieler & Balota, 1997). Detection delays can be reduced by using a speech detector that applies appropriately different but low thresholds to high and low audio frequencies, along with filtering that rejects the brief non-speech lip noises that exceed the low thresholds. Alternatively, estimates of such effects can be used to correct the RTs before combining data from different words. Without either suitable instrumentation or such correction, the shapes of distributions of combined RTs may be misleading.

  3. Thomas and Ross (1980) and Jiang et al. (2004) showed that quantile averaging has desirable properties when the parent distributions of the components belong to the same LSF, and Thomas and Ross (1980) recommended the use of Q-Q plots to test this. However, I have seen no evidence in the psychological literature of this test having been applied as a prelude to quantile averaging. Furthermore, the evaluations described below show that with realistic sample sizes, quantile averaging is inadequate even when the samples are from members of the same LSF.

  4. For an example of applying L-moments in the analysis of the shapes of RT distributions, see Sternberg & Backus, 2015.

  5. Because of the standardization by μ2, β1 and β2 are dimensionless quantities: the same (β1, β2) point represents distributions that can differ in location and/or scale.

  6. Traditionally, β1 is defined as \({\mu }_{3}^{2}/{\mu }_{2}^{3}\), and the skewness measure as \(\sqrt{{\beta }_{1}}\) (Kendall & Stuart, 1969, Chapter 3). For the purpose of this paper, β1 will be defined as \({\mu }_{3}/{\mu }_{2}^{3/2}\).

  7. I use the term "family of distributions", or just "family" to indicate a theoretical distribution with the values of its parameters unspecified, and the term "distribution" to denote a particular member of that family, with specified parameter values.

  8. The variability of these measures, emphasized for C-moments of RT data by Ratcliff (1979) and Heathcote et al. (1991), increases with the variance of the underlying distribution. Thus, when their use is anticipated, experimental methods are called for that reduce this variance, such as extensive practice, rest periods, adequate warning signals, and performance incentives and feedback.

  9. L-skewness and L-kurtosis have often been represented by τ1 and τ2, respectively, by analogy to β1 and β2. To avoid their being confused with τ, often used for the exponential parameter of the ex-Gaussian distribution, popular in psychology, this paper will use λs3 and λs4, etc., instead, where the "s" indicates the standardization.

  10. When considering the sums of stochastically independent random variables, it should be noted that (functions of) L-moments do not share the very useful additivity property of the cumulants of a distribution (κr), which are functions of the C-moments. For example, when x and y are stochastically independent, then, for their sum, x + y, the cumulants are additive: κr (x + y) = κr (x) + κr (y), (r ≥ 2). It follows, for example, that the variance, third moment, and higher cumulants of the duration of a sequence of stochastically independent iterated processes all increase linearly with the number of iterations.

  11. These two distributions are among those that have been applied to reaction-time data; see, e.g., Luce (1986) and Anders et al. (2016).

  12. These levels of bias for L-skewness and L-kurtosis can be further reduced by adopting the estimates developed by Withers & Nadarajah (2011).

  13. Examples of distributions that are represented by regions, rather than points or curves, are the four-parameter Kappa distribution (Hosking & Wallis, 1997; Kjeldsen et al., 2017) and the generalized lambda distribution (Asquith, 2007).

  14. But see, e.g., Asquith, 2014; Hosking, 2007; Karvanen & Nuutinen, 2008.

  15. To derive the density function of the distribution of combined samples when using the shape averaging combination method, or to generate random samples from the combined distribution, Headrick’s procedure (or an equivalent) would be necessary.

  16. I have not included the method devised by Cousineau et al. (2016), because it requires estimating the minimum RT, which depends on assuming a feature of the shape of the distribution.

  17. Referring to these two methods, Jiang et al. (2004, p. 187), who distinguish them, claimed that "In practice, the two variants yield highly similar results." In what follows we shall see that one can be substantially inferior to the other, especially if the number of bins in bin-means histograms is as small as has been recommended. See footnote 21.

  18. The R function I used to obtain bin means is provided in Appendix G.

  19. The midpoint of the jth bin is \((p_j + p_{j+1})/2\). It has sometimes been claimed that the jth bin mean is the quantile associated with the jth bin midpoint. This is (approximately) true if and only if the distribution of observations within the jth bin is (approximately) symmetric. Thus, bin means for different samples are quantiles whose associated probabilities are likely to differ.

  20. The bin means have sometimes been called "Vincentiles" (Van Zandt, 2000). They are also quantiles, of course, as are any values within the bounds of the sample, but these quantiles are not defined in terms of the proportions to which they correspond.

  21. According to Ratcliff (1979, p. 459), whose examples included nine bins (Fig. 2) and 24 bins (Fig. 4), the procedure can be used with "as few as ten observations per subject". According to Heathcote (1996), who recommends between 4 and 19 bins, and whose example (Fig. 7) has 19 bins (p. 435), "For good ex-Gaussian fits, Vincentizing requires as few as 20 observations per condition". He also suggests (p. 436) that "Vincent averaging combines data across subjects or conditions without distorting the shape of the underlying distribution." According to Andrews & Heathcote (2001, p. 517), "Average vincentiles approximately preserve the shape of underlying distributions for each participant, as long as the individual participants’ distributions are unimodal and smooth." In what follows, the validity of these claims is tested.

  22. I used Hyndman & Fan’s (1996) type-8 quantile estimator. This method might be improved by using a better quantile estimator, such as the one proposed by Navruz & Ozdemir (2020).

  23. This method (which they call "Vincentizing") has been thoroughly discussed by Jiang et al. (2004), Rouder & Speckman (2004), and by Thomas & Ross (1980), who also discuss variants that make use of transformed observations. The differences between quantile histograms and bin-means histograms (sometimes called "Vincent histograms") are discussed by Van Zandt (2000). See also footnote 2 in Andrews & Heathcote (2001).

  24. This last measure was used by Wolfe et al. (2010), who called it the "x-score transform".

  25. Qn is given by the 0.25 quantile of the distances \(\{|x_i - x_j|;\ i < j\}\).

  26. The linear-transform method was used by Roberts and Sternberg in their "summation test" (1993; Sternberg & Backus, 2015, Appendix), but using the median and the interquartile range as measures of location and scale, respectively, which are not the choices that minimize the mean estimation error in my simulations (Appendix E).

  27. A contrasting view is expressed by Ratcliff and Smith (2010, p. 90), according to whom "Invariance of distribution shape is one of the most powerful constraints on models of RT distributions. That the diffusion model predicts this invariance is a strong argument in support of its use in performing process decomposition of RT data." See Sternberg & Backus (2015) for a discussion of arguments for (and evidence against) shape invariance.

  28. See Footnote 10, and, for a recent validation of their use for inferring distributional shape, Section 6.3 in Sternberg (2016).

  29. The order statistics of a sample are the sample values in ascending order. One approach to computing the rth L-moment, λr, of a sample would be to determine the mean of each of the mth order statistics (m = 1, 2, …, r) over all possible subsets of r values of the sample, and then to combine these means as in the equations for the λs below. See Wang (1996). For example, suppose a sample of size 5 with values 1, 2, 3, 4, and 5, and we want to calculate λ3 for that sample. The set of all possible subsets of size 3 is [(1,2,3), (1,2,4), (1,2,5), (1,3,4), (1,3,5), (1,4,5), (2,3,4), (2,3,5), (2,4,5), (3,4,5)]. (These would appear with equal probability in a random sample.) The second order statistics of this set are 2, 2, 2, 3, 3, 4, 3, 3, 4, and 4. Their mean is E(X2:3) = 3.0. Similarly, E(X1:3) = 1.5 and E(X3:3) = 4.5. Thus, λ3 = [4.5 − (2 × 3.0) + 1.5]/3 = 0, as expected, given that the sample, thought of as a distribution, is symmetric.

  30. Thus, unlike the measure of kurtosis, β2, based on the standardized fourth central moment, the meaning of L-kurtosis is clear.
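Three of the four combination methods discussed in these notes operate directly on the samples and can be sketched compactly (shape averaging requires fitting a distribution, e.g., via Headrick's procedure, and is not sketched). These are minimal Python illustrations of the ideas only, not the author's R implementations (his bin-means function appears in Appendix G); in the pooling sketch, the mean and SD are used as location and scale purely for illustration, since the error-minimizing choices are a topic of the paper's Appendix E.

```python
from math import floor
from statistics import mean, pstdev

def bin_means(sample, nbins):
    """Bin-means ("Vincentized") values: means of successive
    equal-count bins of the sorted sample."""
    xs = sorted(sample)
    n = len(xs)
    edges = [round(j * n / nbins) for j in range(nbins + 1)]
    return [sum(xs[a:b]) / (b - a) for a, b in zip(edges, edges[1:])]

def quantile_type8(sample, p):
    """Hyndman & Fan (1996) type-8 quantile: linear interpolation
    of the order statistics at h = (n + 1/3)p + 1/3."""
    xs = sorted(sample)
    n = len(xs)
    h = min(max((n + 1/3) * p + 1/3, 1.0), float(n))  # clamp to sample range
    j = min(int(floor(h)), n - 1)   # 1-based index of the lower order statistic
    return xs[j - 1] + (h - j) * (xs[j] - xs[j - 1])

def quantile_average(samples, probs):
    """Quantile averaging: the combined p-quantile is the mean of
    the samples' p-quantiles."""
    return [mean(quantile_type8(s, p) for s in samples) for p in probs]

def pool_linear_transform(samples):
    """Linear-transform pooling: standardize each sample by its own
    location and scale estimates, pool the standardized values, then
    restore the average location and scale.  Mean and SD are
    illustrative choices only."""
    locs = [mean(s) for s in samples]
    scales = [pstdev(s) for s in samples]
    loc0, scale0 = mean(locs), mean(scales)
    return sorted(loc0 + scale0 * (x - m) / sd
                  for s, m, sd in zip(samples, locs, scales) for x in s)
```

For example, `bin_means(range(1, 11), 5)` returns `[1.5, 3.5, 5.5, 7.5, 9.5]`; and when two samples are linear transforms of one another (members of the same LSF), `pool_linear_transform` maps them onto identical standardized values, illustrating why that method conserves shape in the LSF case.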

References

  • An, H., Zhang, K., Oja, H., & Marron, J. (2023). Variable screening based on Gaussian Centered L-moments. Computational Statistics and Data Analysis, 179, 107632.

  • Anders, R., Alario, F.-X., & Van Maanen, L. (2016). The shifted Wald distribution for response time data analysis. Psychological Methods, 21, 309–327.

  • Andrews, S., & Heathcote, A. (2001). Distinguishing common and task-specific processes in word identification: A matter of some moment? Journal of Experimental Psychology: Learning, Memory, and Cognition, 27, 514–544.

  • Asquith, W. H. (2007). L-moments and TL-moments of the generalized lambda distribution. Computational Statistics & Data Analysis, 51, 4484–4496.

  • Asquith, W. H. (2011). Distributional analysis with L-moment statistics using the R environment for statistical computing. CreateSpace Independent Publishing Platform.

  • Asquith, W. H. (2014). Parameter estimation for the 4-parameter asymmetric exponential power distribution by the method of L-moments using R. Computational Statistics & Data Analysis, 71, 955–970.

  • Asquith, W. H. (2021). lmomco: L-moments, censored L-moments, trimmed L-moments, L-comoments, and many distributions. R package version 2.3.7, Texas Tech University.

  • Balota, D. A., & Yap, M. J. (2011). Moving beyond the mean in studies of mental chronometry: The power of response time distributional analyses. Current Directions in Psychological Science, 20, 160–166.

  • Balota, D. A., Yap, M. J., Cortese, M. J., & Watson, J. M. (2008). Beyond mean response latency: Response time distributional analysis of semantic priming. Journal of Memory and Language, 59, 495–523.

  • Christie, L. S., & Luce, R. D. (1956). Decision structure and time relations in simple choice behavior. Bulletin of Mathematical Biophysics, 18, 89–112.

  • Colonius, H., & Vorberg, D. (1994). Distribution inequalities for parallel models with unlimited capacity. Journal of Mathematical Psychology, 38, 35–58.

  • Cousineau, D., Thivierge, J.-P., Harding, B., & Lacouture, Y. (2016). Constructing a group distribution from individual distributions. Canadian Journal of Experimental Psychology, 70, 253–277.

  • Dawson, M. R. W. (1988). Fitting the ex-Gaussian equation to reaction time distributions. Behavior Research Methods, Instruments, & Computers, 20, 54–57.

  • De Jong, R., Liang, C.-C., & Lauber, E. (1994). Conditional and unconditional automaticity: A dual-process model of effects of spatial stimulus-response correspondence. Journal of Experimental Psychology: Human Perception and Performance, 20, 731–750.

  • Ellinghaus, R., & Miller, J. (2018). Delta plots with negative-going slopes as a potential marker of decreasing response activation in masked semantic priming. Psychological Research Psychologische Forschung, 82, 590–599.

  • Gondan, M., & Vorberg, D. (2021). Testing trisensory interactions. Journal of Mathematical Psychology, 101, 102513. https://doi.org/10.1016/j.jmp.2021.102513

  • Headrick, T. C. (2011). A characterization of power method transformations through L-moments. Journal of Probability and Statistics, 2011, Article ID 497463.

  • Heathcote, A. (1996). RTSYS: A DOS application for the analysis of reaction time data. Behavior Research Methods, Instruments, & Computers, 28, 427–445.

  • Heathcote, A., Popiel, S. J., & Mewhort, D. J. (1991). Analysis of response time distributions: An example using the Stroop task. Psychological Bulletin, 109, 340–347.

  • Hohle, R. H. (1965). Inferred components of reaction times as functions of foreperiod duration. Journal of Experimental Psychology, 69, 382–386.

  • Hosking, J. R. M. (1990). L-moments: Analysis and estimation of distributions using linear combinations of order statistics. Journal of the Royal Statistical Society B, 52, 105–124.

  • Hosking, J. R. M. (1992). Moments or L-moments? An example comparing two measures of distributional shape. The American Statistician, 46, 186–189.

  • Hosking, J. R. M. (1996). Some theoretical results concerning L-moments. Research Report RC 14492 (revised version), IBM Research Division, Yorktown Heights, NY.

  • Hosking, J. R. M. (2006). On the characterization of distributions by their L-moments. Journal of Statistical Planning and Inference, 136, 193–198.

  • Hosking, J. R. M. (2007). Some theory and practical uses of trimmed L-moments. Journal of Statistical Planning and Inference, 43, 299–314.

  • Hosking, J. R. M. (2019). lmom: L-moments. R package version 2.8. https://CRAN.R-project.org/package=lmom. Accessed December 2022.

  • Hosking, J. R. M., & Wallis, J. R. (1997). Regional frequency analysis: An approach based on L-moments. Cambridge University Press.

  • Hyndman, R. J., & Fan, Y. (1996). Sample quantiles in statistical packages. American Statistician, 50, 361–365. https://doi.org/10.2307/2684934

  • Jiang, Y., Rouder, J. N., & Speckman, P. L. (2004). A note on the sampling properties of the Vincentizing (quantile averaging) procedure. Journal of Mathematical Psychology, 48, 186–195.

  • Johnson, N. L., Kotz, S., & Balakrishnan, N. (1994). Continuous univariate distributions (Vol. 1). Wiley.

  • Johnson, R. L., Staub, A., & Fleri, A. M. (2012). Distributional analysis of the transposed-letter neighborhood effect on naming latency. Journal of Experimental Psychology: Learning, Memory, and Cognition, 38, 1773–1779.

  • Jones, M. C. (2004). On some expressions for variance, covariance, skewness and L-moments. Journal of Statistical Planning and Inference, 126, 97–106.

  • Karvanen, J., & Nuutinen, A. (2008). Characterizing the generalized lambda distribution by L-moments. Computational Statistics & Data Analysis, 52, 1971–1983.

  • Kendall, M. G., & Stuart, A. (1969). The advanced theory of statistics (Vol. 1, 3rd ed.). Hafner.

  • Kjeldsen, T. R., Ahn, H., & Prosdocimi, I. (2017). On the use of a four-parameter kappa distribution in regional frequency analysis. Hydrological Sciences Journal, 62, 1354–1363.

  • Little, D. R., Altieri, N., Fific, M., & Yang, C.-T. (Eds.) (2017). Systems factorial technology. Academic.

  • Lombardi, L., D’Alessandro, M., & Colonius, H. (2019). A new nonparametric test for the race model inequality. Behavior Research Methods, 51, 2290–2301.

  • Luce, R. D. (1986). Response times. Oxford University Press.

  • Mackenzie, I. G., Mittelstadt, V., Ulrich, R., & Leuthold, H. (2022). The role of temporal order of relevant and irrelevant dimensions within conflict tasks. Journal of Experimental Psychology: Human Perception and Performance. Advance online publication. https://doi.org/10.1037/xhp0001032

  • Maechler, M., Rousseeuw, P., Croux, C., Todorov, V., Ruckstuhl, A., Salibian-Barrera, M., Verbeke, T., Koller, M., Conceicao, E. L., & di Palma, M. A. (2022). robustbase: Basic Robust Statistics. R package version 0.95-0. http://CRAN.R-project.org/package=robustbase. Accessed December 2022.

  • Marzuki, M., Randeu, W. L., Kozu, T., Shimomai, T., Schonhuber, M., & Hashiguchi, H. (2012). Estimation of raindrop size distribution parameters by maximum likelihood and L-moment methods: Effect of discretization. Atmospheric Research, 112, 1–11.

  • McGill, W. J. (1963). Stochastic latency mechanisms. In R. D. Luce, R. R. Bush, & E. Galanter (Eds.), Handbook of mathematical psychology (pp. 309–360). Wiley.

  • Miller, J. O. (1982). Divided attention: Evidence for coactivation with redundant signals. Cognitive Psychology, 14, 247–279.

  • Miller, J., & Schwarz, W. (2021). Delta plots for conflict tasks: An activation-suppression race model. Psychonomic Bulletin & Review. https://doi.org/10.3758/s13423-021-01900-5

  • Miller, R., Scherbaum, S., Heck, D. W., Goschke, T., & Enge, S. (2017). On the relation between the (censored) shifted Wald and the Wiener distribution as measurement models for choice response times. Applied Psychological Measurement, 42, 116–135.

  • Mittelstädt, V., Miller, J., Leuthold, H., Mackenzie, I. G., & Ulrich, R. (2022). The time-course of distractor-based activation modulates effects of speed–accuracy tradeoffs in conflict tasks. Psychonomic Bulletin & Review, 29, 837–854.

  • Navruz, G., & Ozdemir, F. (2020). A new quantile estimator with weights based on a subsampling approach. British Journal of Mathematical and Statistical Psychology. https://doi.org/10.1111/bmsp.12198

  • Pearson, E. S. (1963). Some problems arising in approximating to probability distributions, using moments. Biometrika, 50, 95–112.

  • R Core Team. (2022). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/. Accessed December 2022.

  • Ratcliff, R. (1979). Group reaction time distributions and an analysis of distribution statistics. Psychological Bulletin, 86, 446–461.

  • Ratcliff, R., & Smith, P. L. (2010). Perceptual discrimination in static and dynamic noise: The temporal relation between perceptual encoding and decision making. Journal of Experimental Psychology: General, 139, 70–94.

  • Reynolds, A., & Miller, J. (2009). Display size effects in visual search: Analyses of reaction time distributions as mixtures. The Quarterly Journal of Experimental Psychology, 62, 988–1009.

  • Roberts, S., & Sternberg, S. (1993). The meaning of additive reaction-time effects: Tests of three alternatives. In D. E. Meyer & S. Kornblum (Eds.), Attention and performance XIV: Synergies in experimental psychology, artificial intelligence, and cognitive neuroscience: A silver jubilee (pp. 611–653). MIT Press.

  • Rouder, J. N., & Speckman, P. L. (2004). An evaluation of the Vincentizing method of forming group-level response time distributions. Psychonomic Bulletin & Review, 11, 419–427.

  • Rousseeuw, P. J., & Croux, C. (1993). Alternatives to the median absolute deviation. Journal of the American Statistical Association, 88, 1273–1283.

  • Royston, P. (1992). Which measures of skewness and kurtosis are best? Statistics in Medicine, 11, 333–343.

  • Schwarz, W., & Miller, J. (2012). Response time models of delta plots with negative-going slopes. Psychonomic Bulletin & Review, 19, 555–574.

  • Schweickert, R., & Giorgini, M. (1999). Response time distributions: Some simple effects of factors selectively influencing mental processes. Psychonomic Bulletin & Review, 6, 269–288.

  • Schweickert, R., Fisher, D. L., & Sung, K. (2012). Discovering cognitive architecture by selectively influencing mental processes. World Scientific.

  • Selaman, O. S., Said, S., & Putuhena, F. J. (2007). Flood frequency analysis for Sarawak using Weibull, Gringorten and L-moments formula. Journal of The Institution of Engineers, Malaysia, 68, 43–52.

  • Snodgrass, J. G., Luce, R. D., & Galanter, E. (1967). Some experiments on simple and choice reaction time. Journal of Experimental Psychology, 75, 1–17.

  • Spieler, D. H., & Balota, D. A. (1997). Bringing computational models of word naming down to the item level. Psychological Science, 8, 411–416.

  • Stasinopoulos, M., & Rigby, R. (2022). gamlss.dist: Distributions for Generalized Additive Models for Location Scale and Shape. R package version 6.0-3. https://CRAN.R-project.org/package=gamlss.dist. Accessed December 2022.

  • Sternberg, S. (1964). Estimating the distribution of additive reaction-time components. Paper presented at the meeting of the Psychometric Society, Niagara Falls, Canada, October.

  • Sternberg, S. (1973). Evidence against self-terminating memory search from properties of RT distributions. Paper presented at a meeting of the Psychonomic Society, St. Louis, November.

  • Sternberg, S. (2016). In defence of high-speed memory scanning. The Quarterly Journal of Experimental Psychology, 69, 2020–2075.

  • Sternberg, S., & Backus, B. T. (2015). Sequential processes and the shapes of reaction time distributions. Psychological Review, 122, 830–837.

  • Stone, M. (1960). Models for choice-reaction time. Psychometrika, 25, 251–260.

  • Thomas, E. A. C., & Ross, B. H. (1980). On appropriate procedures for combining probability distributions within the same family. Journal of Mathematical Psychology, 21, 136–152.

  • Townsend, J. T., & Ashby, F. G. (1983). Stochastic modeling of elementary psychological processes. Cambridge University Press.

  • Townsend, J. T., & Nozawa, G. (1995). Spatio-temporal properties of elementary perception: An investigation of parallel, serial, and coactive theories. Journal of Mathematical Psychology, 39, 321–359.

  • Ulrych, T. J., Velis, D. R., Woodbury, A. D., & Sacchi, M. D. (2000). L-moments and C-moments. Stochastic Environmental Research and Risk Assessment, 14, 50–68.

  • Van Zandt, T. (2000). How to fit a response time distribution. Psychonomic Bulletin & Review, 7, 424–465.

  • Vickers, D. (1979). Decision processes in visual perception. Academic.

  • Vogel, R. M., & Fennessey, N. M. (1993). L moment diagrams should replace product moment diagrams. Water Resources Research, 29, 1745–1752.

  • Vorberg, D. (1981). Reaction time distributions predicted by serial, self-terminating models of memory search. In S. Grossberg (Ed.), Mathematical psychology and psychophysiology: Proceedings of the symposium in applied mathematics of the American Mathematical Society and the Society for Industrial and Applied Mathematics (pp. 301–318). American Mathematical Society.

  • Wang, Q. J. (1996). Direct sample estimators of L moments. Water Resources Research, 32, 3617–3619.

  • Westfall, P. H. (2014). Kurtosis as peakedness, 1905–2014. R.I.P. The American Statistician, 68, 191–195.

  • Wheeler, B. (2022). SuppDists: Supplementary Distributions. R package version 1.1-9.7. https://CRAN.R-project.org/package=SuppDists. Accessed December 2022.

  • Wilcox, R. R. (2017). Introduction to robust estimation and hypothesis testing (4th ed.). Academic Press.

  • Withers, C. S., & Nadarajah, S. (2011). Bias-reduced estimates for skewness, kurtosis, L-skewness and L-kurtosis. Journal of Statistical Planning and Inference, 141, 3839–3861.

  • Wolfe, J. M., Palmer, E. M., & Horowitz, T. S. (2010). Reaction time distributions constrain models of visual search. Vision Research, 50, 1304–1311.

  • Yantis, S., Meyer, D. E., & Smith, J. E. K. (1991). Analyses of multinomial mixture distributions: New tests for stochastic models of cognition and action. Psychological Bulletin, 110, 350–374.


Acknowledgements

I thank Vincent Hurtubise and Mark Davidson for computer support.

Author information

Correspondence to Saul Sternberg.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The computations on which this report is based were conducted in R (R Core Team, 2022). Packages used include lmom (Hosking, 2019), lmomco (Asquith, 2021), robustbase (Maechler et al., 2022), SuppDists (Wheeler, 2022), and gamlss.dist (Stasinopoulos & Rigby, 2022).

Appendices

Appendix A: definitions and properties of L-moments

The first six L-moments are defined as follows: Let Xm:n be the mth order statistic in a sample of size n, and let E denote expectation (Footnote 29).

$$\begin{array}{l}\lambda_1=E(X)\\\lambda_2=E(X_{2:2}-X_{1:2})/2\\\lambda_3=E(X_{3:3}-2X_{2:3}+X_{1:3})/3\\\lambda_4=E(X_{4:4}-3X_{3:4}+3X_{2:4}-X_{1:4})/4\\\lambda_5=E(X_{5:5}-4X_{4:5}+6X_{3:5}-4X_{2:5}+X_{1:5})/5\\\lambda_6=E(X_{6:6}-5X_{5:6}+10X_{4:6}-10X_{3:6}+5X_{2:6}-X_{1:6})/6\end{array}$$

To understand these L-moments, it helps to rewrite the expressions inside the parentheses as follows:

For λ3

$$\left(X_{3:3}-X_{2:3}\right)-(X_{2:3}-X_{1:3}).$$

Thus, λ3 will be zero if the distribution is symmetric, such that the mean separation between the middle and third order-statistic is equal to that between the middle and first order-statistic; if it is larger (smaller), λ3 will be positive (negative).
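This sign behavior can be checked by brute force, averaging the order statistics over every size-3 subsample, as described in Footnote 29. A small Python sketch (illustrative only; the exhaustive approach is impractical for large samples, and the function name is mine):

```python
from itertools import combinations

def l3_exhaustive(sample):
    """lambda_3 via its definition: average X_{m:3} over all size-3
    subsamples, then combine as E(X_{3:3} - 2 X_{2:3} + X_{1:3}) / 3."""
    triples = [sorted(t) for t in combinations(sample, 3)]
    k = len(triples)
    e = [sum(t[m] for t in triples) / k for m in range(3)]  # E(X_{m+1:3})
    return (e[2] - 2 * e[1] + e[0]) / 3
```

For the symmetric sample 1, …, 5 this returns 0 (the worked example of Footnote 29); replacing 5 by 10 stretches the upper tail and makes λ3 positive.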

For λ4

$$\left(X_{4:4}-X_{1:4}\right)-3(X_{3:4}-X_{2:4}).$$

If the means of the four order-statistics are equally spaced, the separation between the means of the two middle ones will be one third of that between the first and fourth, and we will have λ4 = 0. If the separation is less (more) than one third, e.g., if either or both tails are heavy (light) relative to the middle of the distribution, λ4 will be positive (negative).

For λ5

Footnote 30

$$\left[\left(X_{5:5}-X_{4:5}\right)-(X_{2:5}-X_{1:5})\right]-3\left[\left(X_{4:5}-X_{3:5}\right)-(X_{3:5}-X_{2:5})\right].$$

The value of λ5 indicates how much of the asymmetry measured by λ3 is due to the tails (large first term) rather than the middle region of the distribution (large second term).

For λ6

$$\left[\left(X_{6:6}-X_{1:6}\right)-5\left(X_{4:6}-X_{3:6}\right)\right]-5\left[\left(X_{5:6}-X_{4:6}\right)+\left(X_{3:6}-X_{2:6}\right)-2\left(X_{4:6}-X_{3:6}\right)\right].$$

This will be large if either tail and/or the peak (first term) is large relative to the "shoulders" (second term) of the distribution. Thus, unlike λ4, and supplementing it, λ6 distinguishes the shoulders from other parts of the distribution.

For k ≥ 3, the L-moments are usually standardized: λsk = λk / λ2, which defines L-skewness (λs3) and L-kurtosis (λs4).

The standardized L-moments are constrained by several inequalities, including |λsk| < 1 (k ≥ 3); λs4 ≥ −1/4; λs6 ≥ −1/6; and 4λs4 ≥ 5(λs3)² − 1 (Hosking, 1996; Jones, 2004). Given the limited ranges of λsk when k ≥ 3, small differences in their values matter greatly.
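Sample L-moments need not be computed by enumerating subsamples: the direct estimator of Wang (1996), cited above, weights each order statistic by the signed count of size-r subsamples in which it occupies each rank. A minimal Python sketch (my own implementation, exact but unoptimized):

```python
from math import comb

def sample_lmoment(x, r):
    """Direct estimator of the r-th sample L-moment (Wang, 1996):
    x_(i) is weighted by the number of size-r subsamples in which it
    is the (r-k)-th smallest, with the signs of the definition."""
    xs = sorted(x)
    n = len(xs)
    total = 0.0
    for i, xi in enumerate(xs, start=1):
        w = sum((-1) ** k * comb(r - 1, k) * comb(i - 1, r - 1 - k)
                * comb(n - i, k) for k in range(r))
        total += w * xi
    return total / (r * comb(n, r))
```

For the sample 1, 2, 3, 4, 10 this gives λ1 = 4, λ2 = 2, λ3 = 1, and λ4 = 1, hence λs3 = λs4 = 0.5, consistent with the inequalities above; for the equally spaced sample 1, 2, 3, 4 it gives λ4 = 0, as the interpretation of λ4 suggests.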

Appendix B: bias of shape measures based on L-moments and C-moments

Among the advantages of measures of distribution shape based on L-moments over those based on C-moments is that the former are less biased, especially for small samples. This difference is demonstrated by the findings pictured below (Fig. B1 and B2).

Fig. B1

Bias versus Sample Size of Two Pairs of Shape Measures for Seven Ex-Gaussian Distributions. Plotted values are based on means of 5000 replications. Estimates of the same shape based on different sample sizes are connected by line segments. Panel A: Shape measures β1 and β2, based on central moments. Panel B: Shape measures λs3 and λs4, based on L-moments. Centers of the large open circles represent the "true values" of these measures (based on samples of size 10,000). Panel A shows that even for sample sizes as large as 100 and 200 (triangles), the estimated shape values based on central moments are far from the true values, especially for more skewed distributions, whereas the corresponding estimates in Panel B, based on L-moments, have relatively little bias

Fig. B2

Bias versus Sample Size of Two Pairs of Shape Measures for Seven Inverse-Gaussian (Wald) Distributions. Results are similar to those for the ex-Gaussian distributions in Fig. B1

Appendix C: outlier effects on shape measures based on L-moments and C-moments

Among the advantages of measures of distribution shape based on L-moments over those based on C-moments is that the former are less influenced by outliers. This difference is demonstrated by the findings pictured below, which show the effects on mean shape measures of introducing one high outlier among 200 RTs, for seven ex-Gaussian distributions (Fig. C1) and seven inverse-Gaussian (Wald) distributions (Fig. C2).

Fig. C1

Effects on mean shape measures of introducing one high outlier among 200 RTs, for seven ex-Gaussian distributions. Plotted values are based on means of 10,000 replications. For each distribution and each replication, the outlier was introduced by adding 3 × SD to the largest observation among 200 sampled observations. Shape measures based on L-moments (λs3 and λs4) and based on C-moments (β1 and β2) were determined for each sample with and without the outlier. Panel A: Means of the two skewness measures with and without the outlier, after linear normalization such that without the outlier, measures for seven different distributions increase linearly from zero to one. Small filled circles and large open circles represent values of λs3 and β1, respectively. Panel B: Means of the two kurtosis measures with and without the outlier, represented similarly

Fig. C2

The same analysis as in Fig. C1, but with samples drawn from seven inverse-Gaussian (Wald) distributions

Appendix D: density functions of nine basic distributions

Fig. D1

Density Functions of the Nine Basic Distributions. Panel A: Generalized logistic distributions. From the distribution with highest to lowest peak, values of L-skewness are 0.400, 0.250, and 0.100, and values of L-kurtosis are 0.300, 0.219, and 0.175, respectively. Panel B: Shifted inverse Gaussian (Wald) distributions. From the distribution with highest to lowest peak, values of L-skewness are 0.400, 0.249, and 0.100, and values of L-kurtosis are 0.232, 0.162, and 0.129, respectively. Panel C: Ex-Gaussian distributions. From the distribution with highest to lowest peak, values of L-skewness are 0.264, 0.172, and 0.050, and values of L-kurtosis are 0.181, 0.172, and 0.138, respectively. All distributions were adjusted to have means of 400 units and standard deviations of 45 units

Appendix E: choice among variants of linear-transform pooling

Tables E1, E2, E3, E4, E5 and E6 show the mean estimation error (Tables E1A, E2A, etc.) and the mean bias (Tables E1B, E2B, etc.) for each of the thirty variants of the linear-transform pooling method, defined by combining five measures of location: the mean, the 20% winsorized mean (winmn), the 20% trimmed mean (trimmn), the mean of the 0.25 and 0.75 quantiles (quartmn), and the median, with six measures of scale: Qn (Rousseeuw & Croux, 1993), the second L-moment (lm2), the square root of the 20% winsorized variance (swv; Wilcox, 2017), the median absolute deviation (mad), the interquartile range (iqr), and the standard deviation (sdev). The values in each table are the means from two independent simulations, each involving 500 replications, averaged over the nine basic distributions. Sample sizes of 60 and 100 were chosen because they seemed to approximate the sizes most likely to be used.

Table E1 Ten samples, same-shape parents, N = 60
Table E2 Ten samples, same-shape parents, N = 100
Table E3 Four samples, same-shape parents, N = 60
Table E4 Four samples, same-shape parents, N = 100
Table E5 Four samples, different-shape parents, N = 60
Table E6 Four samples, different-shape parents, N = 100

When ten samples from the same parent distributions were combined (Tables E1 and E2) the mean estimation error was smallest for the mean + Qn variant. When four samples from the same parent distributions were combined (Tables E3 and E4) the mean estimation error was smallest for the mean + lm2 variant. However, all six tables show that the mean bias associated with the mean + lm2 variant is considerably greater than with the mean + Qn variant, while their mean estimation errors do not differ greatly. I therefore selected the mean + Qn variant of the linear-transform method for comparison with the other methods.

It is worth noting, however, that relative to the differences in mean estimation error produced by the various methods (Fig. 2), the range of values of the mean estimation error produced by the thirty variants of the linear-transform pooling method is small. For example, the range when four samples are combined and N = 100 (Table E4) is (4.35, 4.55), a difference of 0.20, while the range of means across all methods for this case (Fig. 2B) is (4.45, 8.95), a difference of 4.50.

Appendix F: why are bin-means histograms so bad?

A hint as to the reasons for the large bias of the results of bin-means averaging is provided by noting how the bin-means histogram represents the tails of a distribution. Recall that to represent a distribution, a set of m bins between m + 1 equally spaced proportions is defined, and the means {Xk} (k = 1, 2, …, m) of the values in each bin are determined. The bin-means histogram contains m − 1 equal-area rectangles, bounded by successive pairs of these means. As noted by van Zandt (2000, p. 430), such histograms exclude the data below the smallest and above the greatest of the m means. To demonstrate this, I created a distribution by combining a uniformly distributed center with low and high tails. A standard histogram of this distribution, together with its density function, is shown in Panel A of Fig. F1. The same data are represented as bin-means histograms with 6 and 15 bins, respectively, in Panels B and C of Fig. F1. Note how much area under the density function lies outside the range of the bin-means histograms.
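The exclusion of the tails can be demonstrated in a few lines of R (a sketch, not the article's own code): the bin means of even a large sample span a narrower range than the sample itself.

```r
# Demonstration that bin means span a narrower range than the data
# (a sketch, not the article's own code).
set.seed(2)
x <- sort(c(runif(800, 300, 500), rnorm(200, 400, 80)))  # uniform center plus tails
m <- 6
bins <- cut(seq_along(x), breaks = m, labels = FALSE)    # m equal-count bins
binmeans <- tapply(x, bins, mean)
range(x)         # full range of the data
range(binmeans)  # narrower: data below the smallest and above the largest
                 # bin mean lie outside the bin-means histogram
```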

Fig. F1

Standard and Bin-Means Histograms. Representations of a distribution by standard and bin-means histograms. Panel A. Standard histogram of a distribution composed of a central uniformly distributed portion together with low and high tails. Also shown is a density function for the distribution. Panel B. Bin-means histogram of the same data with 6 bins. The same density function is shown. The red regions under the x-axis indicate values in the distribution that are excluded from the bin-means histogram. Panel C. Bin-means histogram of the same data with 15 bins. The same density function is shown. The red regions under the x-axis indicate values in the distribution that are excluded from the bin-means histogram

The L-moments associated with the sample values and with the two bin-means histograms are shown in Table F1. Consistent with the bias shown in Figs. 4, 5, and 6, we see that the values associated with the bin-means histograms are underestimates.

Table F1 L-moments of sample and bin-means histogram representations

Appendix G: R code for selected functions

The following function starts with a sample and generates bin means ("Vincentiles") with nbin bins.

figure a
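Since the function itself appears only as an image in the published article, the following is a reconstruction of what the surrounding text describes (a sketch, not the article's own code): the sorted sample is divided into nbin equal-count bins and the mean of each bin is returned.

```r
# Sketch of a bin-means ("Vincentile") function (reconstruction; the
# article's own code is shown only as an image).
binmeans <- function(sample, nbin) {
  x <- sort(sample)
  # assign the sorted observations to nbin (approximately) equal-count bins
  bins <- cut(seq_along(x), breaks = nbin, labels = FALSE)
  as.vector(tapply(x, bins, mean))
}
```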

Quantiles were generated using the type-8 quantile function:

figure b
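The type-8 estimator is built into R's quantile() function, so the call is likely of the following form (a sketch; the article's own code is shown only as an image):

```r
# Sketch of a type-8 quantile function (reconstruction; the article's own
# code is shown only as an image).  Type 8 gives approximately
# median-unbiased quantile estimates regardless of the distribution.
quants <- function(sample, probs) {
  quantile(sample, probs = probs, type = 8, names = FALSE)
}
```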

Given averaged quantiles, a sample was generated using the following function (a deterministic version of inverse transform sampling), which distributes nobs observations uniformly in each interval, thus creating a total of obsns = nobs*(length(quantiles)-1) observations.

The same function was used to generate a sample from bin-means averaging, replacing quantiles by bin means.

figure c
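Because this function, too, appears only as an image, the following is a reconstruction of what the text describes (a sketch, not the article's own code): nobs evenly spaced observations are placed within each interval between successive quantiles.

```r
# Sketch of deterministic inverse-transform sampling (reconstruction; the
# article's own code is shown only as an image).  nobs observations are
# distributed uniformly within each interval between successive quantiles,
# giving nobs * (length(quantiles) - 1) observations in all.
quantiles_to_sample <- function(quantiles, nobs) {
  nint <- length(quantiles) - 1
  out <- numeric(nint * nobs)
  for (i in seq_len(nint)) {
    lo <- quantiles[i]
    hi <- quantiles[i + 1]
    p <- (seq_len(nobs) - 0.5) / nobs   # evenly spaced interior proportions
    out[((i - 1) * nobs + 1):(i * nobs)] <- lo + p * (hi - lo)
  }
  out
}
```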

Appendix H: R code to implement linear-transform pooling and shape averaging

Each sample of reaction-time data is a vector. The input to the program, called "sample.list", is a list of these vectors. The program outputs are "pool" and "meanshape".

figure d
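Since the program itself appears only as an image, the linear-transform pooling step can be sketched as follows (a reconstruction using the mean + Qn variant selected in Appendix E; robustbase::Qn is an assumed dependency, and the shape-averaging output "meanshape" is omitted because its computation is not reproduced here):

```r
# Sketch of linear-transform pooling with the mean + Qn variant
# (reconstruction, not the article's own code).  Each sample is linearly
# transformed to a common location and scale (the means of the per-sample
# values), and the transformed samples are concatenated to form the pool.
library(robustbase)  # provides the Qn scale estimator

pool_samples <- function(sample.list) {
  locs   <- sapply(sample.list, mean)
  scales <- sapply(sample.list, Qn)
  target.loc   <- mean(locs)
  target.scale <- mean(scales)
  pooled <- lapply(sample.list, function(x)
    target.loc + (x - mean(x)) * target.scale / Qn(x))
  unlist(pooled)
}
```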


Cite this article

Sternberg, S. Combining reaction-time distributions to conserve shape. Behav Res 56, 1164–1191 (2024). https://doi.org/10.3758/s13428-023-02084-7
