1 Introduction

The bootstrap method, as introduced by Efron [13], is a nonparametric statistical method proposed to quantify the variability of sample estimates. The method has been widely used in the literature for a variety of statistical problems [17], as it is easy to apply and generally provides good results. When the underlying distribution is unknown, the bootstrap method can be of great practical use [10].

For univariate real-valued data, Efron [13] introduced the bootstrap method, which is used in many real-world applications; see Efron and Tibshirani [17], Davison and Hinkley [10] and Berrar [5] for more details. For an original data set of size n, bootstrap samples of size n are created by random sampling with replacement, and the function of interest is computed on each bootstrap sample. The empirical distribution of the results can be used as a proxy for the distribution of the function of interest. For the case of finite support, Banks [4] presented a smoothed bootstrap method based on linear interpolation between consecutive observations. Banks’ bootstrap method starts by ordering the n observations of the original sample, where it is assumed that there are no ties, and taking the \(n+1\) intervals of the partition of the support created by the n ordered observations. Each interval is assigned probability \(\frac{1}{n+1}\). To generate one Banks’ bootstrap sample, n intervals are resampled with replacement, and then one observation is drawn uniformly from each chosen interval. Banks’ bootstrap method therefore samples from the whole support, and ties occur with probability 0 in the bootstrap samples, in contrast to Efron’s method, which is restricted to resampling from the original data set [13]. For underlying distributions with infinite support, Coolen and BinHimd [8] generalised Banks’ bootstrap method by assuming distribution tail(s) for the first and last interval.

Efron [14] presented the bootstrap method for right-censored data, which is widely used in survival analysis; see Efron and Tibshirani [16, 17]. This bootstrap version is very similar to the method presented for univariate real-valued data: multiple bootstrap samples of size n are created by resampling from the original sample, and the function of interest is computed on each bootstrap sample. The empirical distribution of the resulting values can be used as a proxy for the distribution of the function of interest. Al Luhayb et al. [2] generalised Banks’ bootstrap method based on the right-censoring \(A_{(n)}\) assumption [9]. The generalised bootstrap method produced better results than Efron’s method for right-censored data; see Al Luhayb [1] and Al Luhayb et al. [2] for more details.

Efron and Tibshirani [16] introduced the bootstrap method for bivariate data, where again multiple bootstrap samples are generated by resampling from the original data set, and the function of interest is computed on each bootstrap sample. The empirical distribution of the resulting values can be a good proxy for the distribution of the function of interest. However, Efron’s bootstrap method often produces poor results when working with small data sets. To address this issue, Al Luhayb et al. [3] proposed three new smoothed bootstrap methods. These methods apply Nonparametric Predictive Inference to the marginals and model the dependence structure using parametric and nonparametric copulas. The new bootstrap methods have been shown to produce more accurate results. For further details, we refer the reader to Al Luhayb [1] and Al Luhayb et al. [3].

Classical statistical methods are widely used for testing statistical hypotheses, although their underlying assumptions are not always met, especially with complex data sets. To avoid these issues, Efron’s bootstrap method, which is easy to implement and provides good approximations, has been used to test statistical hypotheses [16, 23, 24]. However, it may not be suitable for small data sets, and its bootstrap samples may include ties. To overcome these limitations, smoothed bootstrap methods have been proposed by Banks [4], Al Luhayb et al. [2] and Al Luhayb et al. [3] for real-valued data, right-censored data and bivariate data, respectively. This paper investigates the use of these bootstrap methods for hypothesis testing and compares their results with those of Efron’s methods.

This paper is organised as follows: Sect. 2 provides an overview of several bootstrap methods for real-valued univariate data, right-censored univariate data, and real-valued bivariate data. To illustrate their application, an example with data from the literature is presented in Sect. 3, using Efron’s and Banks’ bootstrap methods for hypothesis testing. Section 4 compares the smoothed bootstrap methods and Efron’s bootstrap methods through simulations of various hypothesis tests, namely quartile tests, two-sample median tests, and Pearson and Kendall correlation tests. Firstly, the smoothed bootstrap methods and Efron’s bootstrap methods for real-valued univariate data and right-censored univariate data are used to compute the Type I error rates for quartile tests. Secondly, the achieved significance level is used to compute the Type I error rate for two-sample median tests. Lastly, for real-valued bivariate data, the smoothed bootstrap methods and Efron’s bootstrap method are compared in computing the Type I error rates for Pearson and Kendall correlation tests. The final section provides some concluding remarks.

2 Bootstrap Methods for Different Data Types

In real-world applications, using traditional statistical methods can be challenging due to the mathematical assumptions involved. Bootstrap methods provide a computer-based way of conducting statistical inference that does not require complex formulas. This paper demonstrates the use of different bootstrap methods for hypothesis testing. This section provides an overview of bootstrap methods that can be applied to real-valued data, right-censored data, and bivariate data.

2.1 Bootstrap Methods for Real-Valued Univariate Data

In this section, we will discuss two bootstrap methods for data that include only real-valued observations, namely Efron’s bootstrap method and Banks’ bootstrap method [4, 13]. These methods are used to measure the variability of sample estimates for a given function of interest \(\theta (F)\), where F is a continuous distribution defined on the interval \([a, b]\). Suppose we have n independent and identically distributed random quantities \(X_{1}, X_{2}, \ldots , X_{n}\) from the distribution F and the corresponding observations are \(x_{1}, x_{2}, \ldots , x_{n}\).

Efron’s bootstrap method [13] is a nonparametric method proposed to measure the variability of sample estimates. It uses the empirical distribution function of the original sample, where each observation has the same probability of being selected. To create B resamples of size n, we randomly select observations with replacement from the original sample. We then calculate the function of interest \(\hat{\theta }\) for each bootstrap sample to obtain \(\hat{\theta }_{1}, \hat{\theta }_{2}, \ldots , \hat{\theta }_{B}\). The empirical distribution of these results approximates the sampling distribution of \(\theta (F)\). Efron’s bootstrap method is commonly used for hypothesis testing and has been shown to provide reliable results [17].
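To make the mechanics concrete, here is a minimal Python sketch of Efron’s resampling step, assuming numpy; the data, seed and function name are illustrative, not the paper’s implementation.

```python
import numpy as np

rng = np.random.default_rng(seed=1)

def efron_bootstrap(x, stat, B=1000):
    """Efron's bootstrap: resample with replacement, recompute the statistic."""
    x = np.asarray(x)
    n = len(x)
    samples = rng.choice(x, size=(B, n), replace=True)  # B resamples of size n
    return np.array([stat(s) for s in samples])

# Example: bootstrap distribution of the sample median for illustrative data.
x = rng.normal(loc=0.0, scale=1.0, size=30)
theta_star = efron_bootstrap(x, np.median)
```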

Banks’ bootstrap method [4] is a smoothed bootstrap method for real-valued univariate data. The original data points are ordered as \(x_{(1)}, x_{(2)}, \ldots , x_{(n)}\), and the sample space \([a, b]\) is divided into \(n+1\) intervals by the observations, where the end points \(x_{(0)}\) and \(x_{(n+1)}\) are equal to a and b, respectively. Each interval \((x_{(i)}, x_{(i+1)})\) for \(i= 0, 1, 2, \ldots , n\) is assigned a probability of \(\frac{1}{n+1}\). To create a bootstrap sample, we randomly select n intervals with replacement, and then sample one observation uniformly from each selected interval. Based on the bootstrap sample, we calculate the function of interest and repeat this process B times to obtain \(\hat{\theta }_{1}, \hat{\theta }_{2}, \ldots , \hat{\theta }_{B}\). The empirical distribution of these values approximates the sampling distribution of \(\theta (F)\). Banks’ bootstrap method is used for hypothesis testing in this paper and will be compared to Efron’s bootstrap method in Sect. 4.
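A corresponding sketch of one Banks’ bootstrap sample on a known finite support \([a, b]\); again a minimal illustration under the no-ties assumption, with the helper name ours.

```python
import numpy as np

rng = np.random.default_rng(seed=1)

def banks_bootstrap_sample(x, a, b):
    """One Banks' bootstrap sample: resample n of the n+1 intervals, then
    draw one value uniformly from each chosen interval."""
    grid = np.concatenate(([a], np.sort(x), [b]))   # n+2 end points
    n = len(x)
    idx = rng.integers(0, n + 1, size=n)            # n intervals, prob 1/(n+1) each
    return rng.uniform(grid[idx], grid[idx + 1])    # uniform draw per interval

# Example: bootstrap medians for data on (0, 1).
x = rng.beta(1.2, 3.2, size=20)
meds = [np.median(banks_bootstrap_sample(x, 0.0, 1.0)) for _ in range(1000)]
```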

2.2 Bootstrap Methods for Right-Censored Univariate Data

This section presents Efron’s bootstrap method [14] and the smoothed bootstrap method for right-censored data [1, 2]. Let \(T_{1},T_{2},\ldots ,T_{n}\) be independent and identically distributed event time random variables from a distribution F supported on \(\mathbb {R}^{+}\), and let \(C_{1},C_{2},\ldots ,C_{n}\) be independent and identically distributed censoring random variables from a distribution G supported on \(\mathbb {R}^{+}\). Furthermore, let \((X_{1}, D_{1}), (X_{2}, D_{2}), \ldots , (X_{n}, D_{n})\) be the observed right-censored random variables, where each pair is derived by

$$\begin{aligned} X_{i}=\left\{ \begin{array}{ll} T_{i} & \quad \text {if} \ \ T_{i}\le C_{i} \ \ \text {(uncensored)} \\ C_{i} & \quad \text {if} \ \ T_{i}>C_{i} \ \ \text {(censored)} \end{array} \right. \end{aligned}$$
(1)
$$\begin{aligned} D_{i}=\left\{ \begin{array}{ll} 1 & \quad \text {if} \ \ X_{i}=T_{i} \ \ \text {(uncensored)} \\ 0 & \quad \text {if} \ \ X_{i}=C_{i} \ \ \text {(censored)} \end{array} \right. \end{aligned}$$
(2)

where \(i= 1, 2, \ldots , n\). Let \((x_{1},d_{1}),(x_{2},d_{2}),\ldots ,(x_{n},d_{n})\) be the observations of the corresponding random quantities \((X_{1},D_{1}),(X_{2},D_{2}),\ldots ,(X_{n},D_{n})\), and let \(\theta (F)\) be the function of interest, which can be estimated by \(\theta (\hat{F})\).

Efron [14] proposed a nonparametric bootstrap method for data with right-censored observations. This method is similar to the one he proposed for real-valued data. The empirical distribution function of the original sample is used, so that each observation has an equal probability of \(\frac{1}{n}\), regardless of whether it is an event or a censored observation. To apply this method, B bootstrap samples of size n are generated by randomly selecting observations, as \((x_{i}, d_{i})\) pairs, from the original dataset with replacement. The function of interest is then calculated on each bootstrap sample. This process results in values \(\hat{\theta }_{1}, \hat{\theta }_{2}, \ldots , \hat{\theta }_{B}\), whose empirical distribution can be a good estimate of the sampling distribution of \(\theta (F)\). This bootstrap method is useful for testing the equality of average lifetimes across two populations [25], and it has been shown to provide good results in many statistical inference problems; see Efron [15] and Efron and Tibshirani [16, 17] for more details.
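The sketch below illustrates this pair-resampling scheme together with a small Kaplan–Meier median. It is a minimal sketch assuming no tied observation times; `km_median` returns `None` when the KM curve never reaches 0.50, the situation handled by the options E\(_{(1)}\)–E\(_{(3)}\) in Sect. 4.1.

```python
import numpy as np

rng = np.random.default_rng(seed=1)

def km_median(x, d):
    """Median from the Kaplan-Meier estimator (assumes no ties);
    returns None if the estimated survival never drops to 0.50."""
    order = np.argsort(x)
    x, d = x[order], d[order]
    n, s = len(x), 1.0
    for i in range(n):
        if d[i] == 1:                    # event: survival curve drops
            s *= (n - i - 1) / (n - i)   # n - i observations still at risk
            if s <= 0.50:
                return x[i]
    return None

def efron_rc_bootstrap_medians(x, d, B=1000):
    """Resample (x_i, d_i) pairs with replacement; collect KM medians."""
    x, d = np.asarray(x), np.asarray(d)
    n = len(x)
    meds = []
    for _ in range(B):
        idx = rng.integers(0, n, size=n)
        meds.append(km_median(x[idx], d[idx]))
    return meds
```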

Another method for right-censored data is the smoothed bootstrap method introduced by Al Luhayb [1] and Al Luhayb et al. [2]. This method generalises Banks’ bootstrap method to right-censored data and is based on the generalisation of the A\(_{(n)}\) assumption to data containing right-censored observations, proposed by Coolen and Yan [9]. To implement this method, the data support is divided into \(n+1\) intervals by the original observations, and the right-censored A\(_{(n)}\) assumption is used to assign specific probabilities to these intervals. For each bootstrap sample, n intervals are resampled according to the assigned probabilities, and one observation is sampled from each chosen interval. Performing these steps B times creates B bootstrap samples. The function of interest is then computed on each bootstrap sample, resulting in the values \(\hat{\theta }_{1}, \hat{\theta }_{2}, \ldots , \hat{\theta }_{B}\). The empirical distribution of these values is used to estimate the sampling distribution of \(\theta (F)\). In this paper, we use the smoothed bootstrap method for hypothesis testing and compare its performance to Efron’s bootstrap method, with the comparison results presented in Sect. 4.
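The resampling mechanics can be sketched as follows. This is only a structural illustration: the helper `rc_an_interval_probs` is a hypothetical placeholder returning equal masses so the sketch runs, whereas the actual method assigns the interval probabilities via the right-censored A\(_{(n)}\) assumption of Coolen and Yan [9], which we do not reproduce here.

```python
import numpy as np

rng = np.random.default_rng(seed=1)

def rc_an_interval_probs(x, d):
    """Placeholder: the real method derives these n+1 probabilities from the
    right-censored A_(n) assumption [9]; equal masses are used here only to
    keep the sketch runnable."""
    n = len(x)
    return np.full(n + 1, 1.0 / (n + 1))

def smoothed_rc_bootstrap_sample(x, d, upper):
    """One smoothed bootstrap sample: resample n intervals with the assigned
    probabilities, then draw uniformly within each chosen interval."""
    grid = np.concatenate(([0.0], np.sort(x), [upper]))
    probs = rc_an_interval_probs(x, d)
    n = len(x)
    idx = rng.choice(n + 1, size=n, p=probs)
    return rng.uniform(grid[idx], grid[idx + 1])
```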

2.3 Bootstrap Methods for Bivariate Data

In this section, we will discuss Efron’s bootstrap method [16] and three smoothed bootstrap methods for bivariate data [1, 3]. Let \((X_i, Y_i) \in \mathbb {R}^{2}\), for \(i= 1, 2, \ldots , n\), denote independent and identically distributed random variables with distribution H. The observations corresponding to \((X_i, Y_i)\) are \((x_i, y_i)\). We are interested in \(\theta {(H)}\), which is estimated by \(\theta {(\hat{H})}\). To implement the bootstrap, Efron and Tibshirani [16] used the empirical distribution: B bootstrap samples of size n are created by resampling with equal probability from the observed data, the function of interest is calculated on each bootstrap sample, and the empirical distribution of the resulting B values is used as a proxy for the distribution of the function of interest. This is the same approach as for univariate data. Several references use this bootstrap method for hypothesis testing; see e.g. Dolker et al. [11], MacKinnon [19] and Hesterberg [18].

In their recent work, Al Luhayb [1] and Al Luhayb et al. [3] proposed three different smoothed bootstrap methods for estimating the distribution of a function of interest. The first, referred to as SBSP, is based on the semi-parametric predictive method proposed by Muhammad [20]. The second, referred to as SBNP, is based on the nonparametric predictive method introduced by Muhammad et al. [21]. These two methods divide the sample space into \((n+1)^2\) squares (blocks hereafter), each assigned a certain probability. The third method, referred to as SEB, is based on uniform kernels: each data point is surrounded by a block of size \(b_X \times b_Y\) with the observation at the centre of its block, where \(b_X\) and \(b_Y\) are the chosen bandwidths for the kernel. To create a bootstrap sample, n blocks are resampled according to the assigned probabilities, and one observation is sampled from each chosen block. This process is repeated multiple times, typically \(B=1000\) times, and the function of interest is calculated on each bootstrap sample. This results in B values, and the empirical distribution of these values is used to estimate the distribution of the function of interest.
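Of the three, the SEB method is the simplest to sketch: resample data points with equal probability and jitter each within its uniform-kernel block. The code below is a minimal illustration; the bandwidth values and function name are ours, and boundary handling is ignored.

```python
import numpy as np

rng = np.random.default_rng(seed=1)

def seb_sample(xy, b_x, b_y):
    """One SEB bootstrap sample: pick n points with replacement, then draw
    uniformly from the b_x-by-b_y block centred on each chosen point."""
    xy = np.asarray(xy)
    n = len(xy)
    idx = rng.integers(0, n, size=n)
    jitter = np.column_stack([rng.uniform(-b_x / 2, b_x / 2, size=n),
                              rng.uniform(-b_y / 2, b_y / 2, size=n)])
    return xy[idx] + jitter

# Example with illustrative bandwidths on bivariate normal data.
data = rng.normal(size=(50, 2))
boot = seb_sample(data, b_x=0.4, b_y=0.4)
```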

3 Example

In this section, we will explore an example using data from the literature on the maximum flow rates over a 100-year period at gauging stations on rivers in North Carolina [6]. The data are presented in Table 1, which shows the maximum flow rates in gallons per second. Our goal is to investigate whether the median of the data is equal to 5400 gallons per second, based on a 90% confidence interval constructed with Efron’s bootstrap method and with Banks’ bootstrap method.

Table 1 Yearly maximum flow rates (gallons per second) at a gauging station in North Carolina
Table 2 The \(90\%\) confidence intervals for the median based on Efron’s bootstrap method and Banks’ bootstrap method

To conduct the test, we generate 1000 bootstrap samples from the original data with each of the two bootstrap methods. We then calculate the median of each bootstrap sample and define the 90% bootstrap confidence interval for the median by taking the 50th and 950th of the ordered values.
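A minimal sketch of this percentile-interval computation, with hypothetical flow values standing in for Table 1 (the published data are not reproduced here) and Efron resampling for brevity:

```python
import numpy as np

rng = np.random.default_rng(seed=1)

# Hypothetical flow rates (gallons per second) standing in for Table 1.
flows = np.array([3200., 4100., 4800., 5300., 5600., 6100., 6900., 7400.,
                  8200., 9700.])

B = 1000
medians = np.sort([np.median(rng.choice(flows, size=len(flows), replace=True))
                   for _ in range(B)])
ci = (medians[49], medians[949])   # 50th and 950th ordered values: 90% interval
print("90% bootstrap CI for the median:", ci)
# Fail to reject H0: median = 5400 exactly when 5400 lies inside the interval.
```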

If the value 5400 is included in the confidence interval, we fail to reject the null hypothesis; otherwise, we reject it. Table 2 presents the 90% confidence intervals for the median based on both Efron’s and Banks’ bootstrap methods. As the value 5400 falls within both confidence intervals, we fail to reject the null hypothesis.

4 Comparison of the Bootstrap Methods

Hypothesis tests based on the bootstrap method are a type of computer-based statistical technique. Thanks to advancements in computational power, these tests have become practical for real-world applications. The basic idea behind the bootstrap method is simple to understand and does not rely on complex mathematical assumptions. In this section, we conduct various tests for different types of data using the bootstrap methods explained in Sect. 2.

4.1 Hypothesis Tests for Quartiles

In this section, we calculate the Type I error rates of quartile hypothesis tests based on the bootstrap methods presented in Sect. 2.2, which are used when the data contain right-censored observations. To determine how well the bootstrap methods perform, we simulate datasets that include right-censored observations from two different scenarios. In the first scenario, event times follow the Beta distribution with shape parameters \(\alpha =1.2\) and \(\beta =3.2\), and censoring times follow the Uniform distribution with parameters \(a=0\) and \(b=1.82\). The second scenario is defined as \(T\sim \text {Log-Normal}(\mu =0,\sigma =1)\) and \(C\sim \text {Weibull}(\alpha =3, \beta =3.7)\), where \(\alpha \) is the shape parameter and \(\beta \) is the scale parameter (see Appendix). In both scenarios, the censoring proportion p in the generated datasets is \(15\%\), which is achieved through the choice of the parameters of the censoring distribution. For more information on how to fix the censoring proportion, we refer the reader to Wan [26] and Al Luhayb [1].

To compare Efron’s bootstrap method with the smoothed bootstrap method, we generate \(N=1000\) datasets from each scenario. For each dataset, we apply each method \(B=1000\) times, resulting in 1000 bootstrap samples per method. We then compute the quartile of interest for each bootstrap sample and use the resulting values to define the \(100(1-2\alpha )\%\) bootstrap confidence interval for the quartile. We count one if the quartile value specified in the null hypothesis is not included in the confidence interval; otherwise, we count zero. We repeat this procedure for all \(N=1000\) generated datasets and take the proportion of rejections over the 1000 trials. This proportion is the Type I error rate of the quartile hypothesis test with significance level \(2\alpha \).
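The following Python sketch illustrates this Monte Carlo loop for the percentile-interval test. It is a minimal illustration, not the paper’s implementation: the data generator and bootstrap statistic are passed in as functions, and the usage example applies Efron resampling of the median to uncensored Beta(1.2, 3.2) data.

```python
import numpy as np
from scipy.stats import beta

rng = np.random.default_rng(seed=1)

def type1_rate(gen_data, boot_stat, theta0, N=1000, B=1000, two_alpha=0.10):
    """Monte Carlo Type I error rate of a percentile-bootstrap test."""
    rejections = 0
    for _ in range(N):
        data = gen_data()
        stats = np.sort([boot_stat(data) for _ in range(B)])
        k = int(B * two_alpha / 2)               # k = 50 when B=1000, 2*alpha=0.10
        lo, hi = stats[k - 1], stats[B - k - 1]  # 50th and 950th ordered values
        if not (lo <= theta0 <= hi):             # H0 value outside the interval
            rejections += 1
    return rejections / N

# Illustrative use: Efron resampling of the median, uncensored Beta data,
# H0 set at the true median so every rejection is a Type I error.
theta0 = beta.ppf(0.5, 1.2, 3.2)
gen = lambda: rng.beta(1.2, 3.2, size=10)
stat = lambda d: np.median(rng.choice(d, size=len(d), replace=True))
print(type1_rate(gen, stat, theta0, N=200))      # smaller N for a quick check
```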

It is important to note that Efron’s bootstrap samples often include censored observations, so we use the Kaplan–Meier (KM) estimator to find their quartiles. Suppose we are interested in the median; we need a time t such that \(\hat{S}(t)=0.50\) in each bootstrap sample. In some samples no such t exists, because the KM estimator never reaches 0.50. In this case, we consider three possible solutions. The first is to discard all bootstrap samples whose median cannot be found, so the \(100(1-2\alpha )\%\) bootstrap confidence interval for the median is based on fewer than 1000 bootstrap samples. This option is referred to as E\(_{(1)}\). The second is to set the median equal to the maximum event time of that bootstrap sample. This is Efron’s suggestion, applied to each bootstrap sample whose median cannot be found by the KM estimator [12]. This option is referred to as E\(_{(2)}\). Finally, we fit an Exponential tail with rate parameter \(\hat{\lambda }^{*}=-\ln (\hat{S}(t_{max}))/t_{max}\), where \(t_{max}\) is the maximum event time of the bootstrap sample and \(\hat{S}(\cdot )\) is the KM estimator. This allows us to find the corresponding median as \(X_{med}=-\ln (0.50)/\hat{\lambda }^{*}\). This suggestion is presented in Brown et al. [7], and we refer to it as E\(_{(3)}\). In the last two cases, the confidence interval is guaranteed to be based on 1000 bootstrap sample medians.
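As a small worked illustration of the E\(_{(3)}\) fallback, the following sketch computes the exponential-tail median from the last KM point; the function name is ours and the numbers are invented for illustration.

```python
import numpy as np

def exp_tail_median(t_max, s_tmax):
    """E_(3) fallback (Brown et al. [7]): fit an exponential tail through the
    last Kaplan-Meier point (t_max, S(t_max)) and read off the median."""
    lam = -np.log(s_tmax) / t_max      # requires 0 < S(t_max) < 1
    return -np.log(0.50) / lam

# If the KM curve still sits at S = 0.62 at the largest event time 2.4,
# the fitted tail gives the median:
print(exp_tail_median(2.4, 0.62))      # about 3.48
```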

In the tables, NA denotes the number of Efron’s bootstrap samples in which the quartile cannot be found, while ABS denotes the number of cases in which a bootstrap sample containing only right-censored observations is replaced by another sample that includes at least one event time. Both counts are out of 1,000,000 bootstrap samples.

We consider three different strategies for the smoothed bootstrap method when sampling observations from the \(n+1\) intervals partitioning the sample space. The first strategy, denoted by SB, is to sample uniformly from all intervals. The second strategy, denoted by SB\(_{\text {exp}}\), is to assume an exponential tail for each interval and sample from these tails to create the bootstrap samples. The third strategy is to sample uniformly from all intervals except the last interval, for which we sample from the exponential tail; we refer to this strategy as SB\(_{\text {Lexp}}\). Comparing these strategies shows how the tail assumptions affect the smoothed bootstrap results.

Table 3 Type I error rates with significance level \(2\alpha =0.10\), \(T\sim \text {Beta}(\alpha =1.2,\beta =3.2)\), \(C\sim \text {Unif}(a=0,b=1.82)\) and \(p=0.15\)
Table 4 Type I error rates with significance level \(2\alpha =0.05\), \(T\sim \text {Beta}(\alpha =1.2,\beta =3.2)\), \(C\sim \text {Unif}(a=0,b=1.82)\) and \(p=0.15\)
Table 5 Type I error rates with significance level \(2\alpha =0.10\), \(T\sim \text {Log-Normal}(\mu =0,\sigma =1)\), \(C\sim \text {Weibull}(\alpha =3,\beta =3.7)\) and \(p=0.15\)
Table 6 Type I error rates with significance level \(2\alpha =0.05\), \(T\sim \text {Log-Normal}(\mu =0,\sigma =1)\), \(C\sim \text {Weibull}(\alpha =3,\beta =3.7)\) and \(p=0.15\)

Tables 3 and 4 show the Type I error rates for the quartile hypothesis tests with significance levels 0.10 and 0.05 for datasets simulated in the first scenario. When the sample size is 10, the smoothed bootstrap with its three strategies, SB, SB\(_{\text {exp}}\) and SB\(_{\text {Lexp}}\), provides lower discrepancies between actual and nominal error rates for all quartile tests than Efron’s bootstrap with its three options, E\(_{(1)}\), E\(_{(2)}\) and E\(_{(3)}\). The superiority of the smoothed bootstrap methods is due not only to the event observations obtained in the smoothed bootstrap samples, but also to the fact that the KM estimator used in Efron’s bootstrap samples is often unable to find the quartiles, particularly the second and third. In 1,000,000 bootstrap samples, the first, second and third quartiles cannot be found in 228, 3736 and 32,821 bootstrap samples, respectively. As the sample size increases to 50, 100 and 500, both methods provide good results, but Efron’s method is better, and the numbers of NA and ABS cases decrease toward zero, so E\(_{(1)}\), E\(_{(2)}\) and E\(_{(3)}\) give identical results. At these larger sample sizes, SB, SB\(_{\text {exp}}\) and SB\(_{\text {Lexp}}\) also provide approximately equal outcomes.

In the second scenario, we should note that the data space is \((0,\infty )\), which is different from the first scenario where the support is (0, 1), so the last intervals for the smoothed method are not bounded. In this case, we can only use smoothed bootstrap assumptions SB\(_{\text {exp}}\) and SB\(_{\text {Lexp}}\), not SB. Tables 5 and 6 present the results of Type I error rates for the quartiles’ hypothesis tests with significance levels of 0.10 and 0.05, respectively. The SB\(_{\text {exp}}\) and SB\(_{\text {Lexp}}\) methods again outperform Efron’s method in defining the Type I error rates when the sample size is small. As the sample size gets large, both methods perform well, as observed in Tables 3 and 4.

In the special case where the data include only event times and no censored observations, we use Banks’ bootstrap method and Efron’s bootstrap method, presented in Sect. 2.1, to compute the Type I error rates for the quartile hypothesis tests. In the simulations, we use \(\text {Beta}(\alpha =1.2,\beta =3.2)\) to create the datasets and repeat the same comparison procedure as in the previous simulations. Tables 7 and 8 present the Type I error rates for the quartile hypothesis tests based on Banks’ and Efron’s methods with significance levels of 0.10 and 0.05, respectively. Banks’ bootstrap method performs better, particularly when \(n=10\) and \(2\alpha =0.05\). As the sample size gets larger, both methods perform well.

Table 7 Type I error rates with significance level \(2\alpha =0.10\), \(\text {Beta}(\alpha =1.2,\beta =3.2)\) and \(p=0\)
Table 8 Type I error rates with significance level \(2\alpha =0.05\), \(\text {Beta}(\alpha =1.2,\beta =3.2)\) and \(p=0\)

4.2 The Two-Sample Problem

When conducting a hypothesis test \(H_{0}: \theta _{1}=\theta _{2}\) against \(H_1: \theta _1 \ne \theta _2\), where \(\theta _{1}\) and \(\theta _{2}\) represent the function of interest in the first and second populations respectively, the achieved significance level (ASL) is used to draw a conclusion. The ASL is defined as the probability of observing a value at least as large as \(\hat{\theta }=\hat{\theta }_{1}-\hat{\theta }_{2}\) when the null hypothesis is true,

$$\begin{aligned} ASL=\text {Prob}_{H_{0}} \{\hat{\theta }^{*}\ge \hat{\theta }\} \end{aligned}$$
(3)

The smaller the value of ASL, the stronger the evidence against \(H_{0}\). The value \(\hat{\theta }\) is fixed at its observed value, and the quantity \(\hat{\theta }^{*}\) has the null hypothesis distribution, which is the distribution of \(\hat{\theta }\) if \(H_{0}\) is true [17].

Efron and Tibshirani [17] used the achieved significance level to test whether the two populations have equal mean or not. Suppose we have two samples \({\textbf {z}}=\{z_{1},z_{2},\ldots ,z_{n}\}\) and \({\textbf {y}}=\{y_{1},y_{2},\ldots ,y_{m}\}\) from possibly different probability distributions, and we wish to test the null hypothesis \(H_{0}: \theta _{1}=\theta _{2}\). Efron’s bootstrap method is used to approximate the ASL value, then \(H_{0}\) is rejected when \(\widehat{ASL}<2\alpha \). The algorithm to test the null hypothesis based on the bootstrap methods is as follows

(i) Combine the \({\textbf {z}}\) and \({\textbf {y}}\) samples, so we get a sample \({\textbf {x}}\) of size \(n+m\). Thus, \({\textbf {x}}=\{z_{1},z_{2},\ldots ,z_{n},y_{1},y_{2},\ldots ,y_{m}\}\).

(ii) Draw B bootstrap samples of size \(n+m\) with replacement from \({\textbf {x}}\), and call the first n observations \({\textbf {z}}^{*b}\) and the remaining m observations \({\textbf {y}}^{*b}\), for \(b=1,2,\ldots ,B\).

(iii) For each bootstrap sample, compute the means of \({\textbf {z}}^{*b}\) and \({\textbf {y}}^{*b}\), then find \(A^{*b}=\overline{{\textbf {z}}}^{*b}-\overline{{\textbf {y}}}^{*b}\), \(b=1,2,\ldots ,B\).

(iv) The achieved significance level can be approximated by

$$\begin{aligned} \widehat{ASL}=\frac{1}{B}\sum _{b=1}^{B}\mathbb {1}\{A^{*b}\ge A_{obs}\} \end{aligned}$$
(4)

where \(\mathbb {1}\{\cdot \}\) is the indicator function, \(A_{obs}=\overline{{\textbf {z}}}-\overline{{\textbf {y}}}\), and \(\overline{{\textbf {z}}}\) and \(\overline{{\textbf {y}}}\) are the sample means of the two original samples.
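A minimal Python sketch of this pooled-resampling ASL approximation, written for a generic statistic passed as a function; the name `asl_two_sample` and the test data are illustrative.

```python
import numpy as np

rng = np.random.default_rng(seed=1)

def asl_two_sample(z, y, stat=np.mean, B=1000):
    """Approximate the ASL of H0: theta_1 = theta_2 by resampling from the
    pooled sample, following steps (i)-(iv) above."""
    a_obs = stat(z) - stat(y)                      # observed difference
    pooled = np.concatenate([z, y])
    n, m = len(z), len(y)
    count = 0
    for _ in range(B):
        x_star = rng.choice(pooled, size=n + m, replace=True)
        if stat(x_star[:n]) - stat(x_star[n:]) >= a_obs:
            count += 1
    return count / B

# Illustrative data; H0 is rejected at level 2*alpha when the estimate
# falls below 2*alpha.
z = rng.normal(0.0, 1.0, size=25)
y = rng.normal(0.3, 1.0, size=30)
print(asl_two_sample(z, y))
```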

In this section, we employ this strategy to examine whether two samples have the same median (\(Q_{2}^{1}=Q_{2}^{2}\)) or not. To conduct these tests, we use the bootstrap methods presented in Sect. 2.2 and make comparisons through simulations. Specifically, we calculate the Type I error rate for the following hypothesis test:

$$\begin{aligned} H_{0}: Q_{2}^{1}=Q_{2}^{2} \ \ \ \text {versus} \ \ \ H_{1}: Q_{2}^{1}\ne Q_{2}^{2} \end{aligned}$$
(5)

In order to compare the different bootstrap methods through simulation, we first generate two datasets of size n using the second scenario proposed in Sect. 4.1. We compute the medians of these datasets, \(\hat{Q}_{2}^{1}\) and \(\hat{Q}_{2}^{2}\), and calculate \(A_{obs}=\hat{Q}_{2}^{1}-\hat{Q}_{2}^{2}\). Next, we combine the two datasets into a new dataset of size 2n. Then, for each bootstrap method, we draw 1000 samples of size 2n, and call the first n observations \({\textbf {z}}^{*b}\) and the remaining n observations \({\textbf {y}}^{*b}\), for \(b=1,2,\ldots ,B\). We compute \(A^{*b}=\hat{Q}_{2}({\textbf {z}}^{*b})-\hat{Q}_{2}({\textbf {y}}^{*b})\) for each bootstrap sample, resulting in 1000 \(A^{*}\) values. Finally, we calculate the ASL value and reject \(H_{0}\) if \(\widehat{ASL}<2\alpha \). We repeat this process \(N=1000\) times and take the proportion of rejected null hypotheses over the 1000 trials; the method whose proportion is closest to \(2\alpha \) is considered the best. The final results of the simulations are presented in Tables 9 and 10 for two different significance levels.

As the sample space of the underlying distribution is \((0,\infty )\), we only consider SB\(_{\text {exp}}\) and SB\(_{\text {Lexp}}\) for the smoothed bootstrap method. For Efron’s method, we consider E\(_{(2)}\) and E\(_{(3)}\), as they are guaranteed to find the median of each set in each bootstrap sample. Tables 9 and 10 present the Type I error rates of the hypothesis test in Equation (5) with significance levels of 0.10 and 0.05, respectively. The SB\(_{\text {exp}}\) and SB\(_{\text {Lexp}}\) methods generally provide lower actual Type I error rates than E\(_{(2)}\) and E\(_{(3)}\) at different sample sizes. However, E\(_{(2)}\) and E\(_{(3)}\) provide smaller discrepancies between the actual and nominal Type I error levels, especially when the sample size is small. When \(n=500\), all methods provide almost identical results.

Table 9 Type I error rates with significance level \(2\alpha =0.10\), and all samples created by \(T\sim \text {Log-Normal}(\mu =0,\sigma =1), C\sim \text {Weibull}(\alpha =3,\beta =3.7), \text {where} \ \ p=0.15\)
Table 10 Type I error rates with significance level \(2\alpha =0.05\), and all samples created by \(T\sim \text {Log-Normal}(\mu =0,\sigma =1), C\sim \text {Weibull}(\alpha =3,\beta =3.7), \text {where} \ \ p=0.15\)
Table 11 Type I error rates with significance level \(2\alpha =0.10\), the first samples from \(T\sim \text {Log-Normal}(\mu =0,\sigma =1), C\sim \text {Weibull}(\alpha =3,\beta =3.7), \text {where} \ \ p=0.15\) and the second samples from \(T\sim \text {Weibull}(\alpha =1,\beta =1.443), C\sim \text {Exponential}(\lambda =0.12), \text {where} \ \ p=0.15\)
Table 12 Type I error rates with significance level \(2\alpha =0.05\), the first samples from \(T\sim \text {Log-Normal}(\mu =0,\sigma =1), C\sim \text {Weibull}(\alpha =3,\beta =3.7), \text {where} \ \ p=0.15\) and the second samples from \(T\sim \text {Weibull}(\alpha =1,\beta =1.443), C\sim \text {Exponential}(\lambda =0.12), \text {where} \ \ p=0.15\)

In previous simulations, we created both samples in each run from a single scenario, but now we want to create samples from two different scenarios. In each run, the first sample is created from \(T\sim \text {Log-Normal}(\mu =0,\sigma =1)\) and \(C\sim \text {Weibull}(\alpha =3,\beta =3.7)\), while the second sample is created from \(T\sim \text {Weibull}(\alpha =1,\beta =1.443)\) and \(C\sim \text {Exponential}(\lambda =0.12)\), where \(p=0.15\) in both scenarios (see Appendix). We aim to investigate how the bootstrap methods perform when the two samples have different distributions but the same median (which is equal to 1). Tables 11 and 12 show the Type I error rates with significance levels of 0.10 and 0.05, respectively. All methods perform well at different sample sizes, and the results are close to the nominal size \(2\alpha \), particularly when the sample size is large.

4.3 Pearson Correlation Test

In Sect. 2.3, we presented smoothed bootstrap methods alongside Efron’s method. We compute the Type I error rate to compare the methods, where a method is considered superior if its Type I error rate is closer to the significance level \(2\alpha \). In this section, we simulate data sets from two different distributions to compare the methods. In the first scenario, we generate data sets from the Gumbel copula, where the marginals X and Y both follow the standard uniform distribution. The second scenario uses the Clayton copula, where X follows the normal distribution with mean 1 and standard deviation 1, and Y follows the normal distribution with mean 5 and standard deviation 3. For both scenarios, we consider three dependence levels of \(\rho \) and three sample sizes with two significance levels. We also report the dependence parameters of the copulas and the corresponding concordance measure, Kendall’s \(\tau \). The cumulative distribution functions of the Gumbel copula and the Clayton copula are, respectively, given by [22]

$$\begin{aligned} C_{g}(u,v|\theta _{g})=\exp \left( -\left[ (-\ln (u))^{\theta _{g}}+(-\ln (v))^{\theta _{g}}\right] ^{1/\theta _{g}}\right) \end{aligned}$$
(6)
$$\begin{aligned} C_{c}(u,v|\theta _{c})=\max \left[ \left( u^{-\theta _{c}}+v^{-\theta _{c}}-1\right) ^{-1/\theta _{c}}, 0\right] \end{aligned}$$
(7)

where the arguments u and v are uniformly distributed on [0, 1].
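As an illustration, the Clayton copula in Equation (7) can be sampled by conditional inversion, and the scenario’s normal marginals obtained through the probability integral transform. The sketch below assumes \(\theta _{c}>0\); the value \(\theta _{c}=2\) (Kendall’s \(\tau =0.5\)) is only illustrative, not one of the paper’s settings.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(seed=1)

def clayton_sample(n, theta):
    """Draw n pairs (u, v) from the Clayton copula (theta > 0) by inverting
    the conditional distribution of v given u."""
    u = rng.uniform(size=n)
    w = rng.uniform(size=n)                 # conditional probability level
    v = (u ** (-theta) * (w ** (-theta / (1 + theta)) - 1) + 1) ** (-1 / theta)
    return u, v

# Second scenario: map the uniform marginals to Normal(1,1) and Normal(5,3).
u, v = clayton_sample(100, theta=2.0)       # theta = 2 gives Kendall's tau = 0.5
x = norm.ppf(u, loc=1, scale=1)
y = norm.ppf(v, loc=5, scale=3)
```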

To compute the Type I error rate for the null hypothesis \(\rho =\rho ^{\star }\) based on a bootstrap method, we create \(N=1000\) data sets with sample size n and dependence level \(\rho =\rho ^{\star }\) from one of the scenarios presented above. For each generated data set, we apply each bootstrap method \(B=1000\) times and compute the Pearson correlation of each bootstrap sample. We order the 1000 bootstrapped Pearson correlation values from lowest to highest and obtain the \(100(1-2\alpha )\%\) bootstrap confidence interval. If the null hypothesis value is not included in the confidence interval, we reject \(H_{0}\) and count 1; otherwise, we do not reject \(H_{0}\) and count 0. The proportion of rejections over the 1000 trials is the Type I error rate.

Table 13 presents the Type I error rates based on the bootstrap methods, where the significance level is 0.10. For a small sample size of \(n=10\), the SBSP and SBNP methods provide error rates closer to the nominal rate of 0.10 than Efron’s and the smoothed Efron’s methods, and the SBNP method is the best when \(\rho = 0.4\) and 0.8. When n increases to 50 and 100, all methods show smaller discrepancies between the actual and nominal error rates, with the SBNP method superior in most cases.

With a significance level of 0.05, the actual Type I error rates based on the bootstrap methods are listed in Table 14. The SBSP and SBNP methods again provide lower discrepancies between the nominal and actual Type I error rates compared to Efron’s and the smoothed Efron’s methods, especially when \(n=10\). When the sample size increases to 50 and 100, all methods perform better, but the SBNP method is the best one in most settings.

Table 13 Type I error rates with significance level 0.10, Gumbel copula, \(X\sim \text {Unif}(0,1)\) and \(Y\sim \text {Unif}(0,1)\)
Table 14 Type I error rates with significance level 0.05, Gumbel copula, \(X\sim \text {Unif}(0,1)\) and \(Y\sim \text {Unif}(0,1)\)
Table 15 Type I error rates with significance level 0.10, Clayton copula, \(X\sim \text {Normal}(\mu =1,\sigma =1)\) and \(Y\sim \text {Normal}(\mu =5,\sigma =3)\)
Table 16 Type I error rates with significance level 0.05, Clayton copula, \(X\sim \text {Normal}(\mu =1,\sigma =1)\) and \(Y\sim \text {Normal}(\mu =5,\sigma =3)\)

In the second scenario, we simulate \(N=1000\) data sets with dependence level \(\rho =\rho ^{\star }\) and compute the Type I error rates using the bootstrap methods, as shown in Tables 15 and 16. For \(n=10\), the SBSP method provides the closest results to the nominal error rates at most levels of \(\rho \). As n increases to 50 and 100, its performance worsens for \(H_{0}: \rho =0.8\) because the underlying distribution is not symmetric. At these larger sample sizes, the SBNP, Efron and SEB methods perform better than the SBSP method, particularly the SBNP method, which provides the lowest discrepancies between the nominal and actual error rates in most cases at both significance levels of 0.10 and 0.05. However, when \(n=10\) and \(\rho =0\) or 0.4, the SBNP method provides error rates far below the nominal level.

4.4 Kendall Correlation Test

In Sect. 4.3, we computed the Type I error rate for the Pearson correlation test using different sample sizes and dependence levels. In this section, we repeat the same comparisons using the Kendall correlation test. We use the same scenarios, generating datasets with \(n=10, 50\) and 100, and dependence levels of \(\tau =0, 0.4\) and 0.8, with significance levels of 0.10 and 0.05.

Table 17 Type I error rates of Kendall correlation test with significance level 0.10, Gumbel copula, \(X\sim \text {Unif}(0,1)\) and \(Y\sim \text {Unif}(0,1)\)
Table 18 Type I error rates of Kendall correlation test with significance level 0.05, Gumbel copula, \(X\sim \text {Unif}(0,1)\) and \(Y\sim \text {Unif}(0,1)\)

To generate data sets and apply the bootstrap methods, we will use the Gumbel copula, where both marginals follow Uniform(0,1). From Tables 17 and 18, we can see that the SBSP method performs well when \(\tau =0\) across all different sample sizes. However, it performs poorly as the sample size increases for \(\tau =0.4\) and 0.8. This is in contrast to the results based on SBNP, Efron’s, and smoothed Efron’s methods. These methods provide lower error rates than the nominal levels when the sample size is small at all different dependence levels. As n increases to 50 and 100, the error rates become closer to the nominal level \(2\alpha \).

Table 19 Type I error rates of Kendall correlation test with significance level 0.10, Clayton copula, \(X\sim \text {Normal}(\mu =1,\sigma =1)\) and \(Y\sim \text {Normal}(\mu =5,\sigma =3)\)
Table 20 Type I error rates of Kendall correlation test with significance level 0.05, Clayton copula, \(X\sim \text {Normal}(\mu =1,\sigma =1)\) and \(Y\sim \text {Normal}(\mu =5,\sigma =3)\)

Tables 19 and 20 present the Type I error rates for the Kendall correlation test at different dependence levels with significance levels of 0.10 and 0.05, respectively. When \(\tau =0\) and \(n=10\), the error rate based on the SBNP method is far below the nominal level \(2\alpha \), while the results of the other methods are close to the nominal levels. As the sample size increases to 50 and 100, all methods provide good results. If there is a strong relation between the variables, it is recommended to use either Efron’s bootstrap method or the SEB method. These methods produce good results because they affect the observations’ ranks, which are the basis for computing the Kendall correlation, much less than the SBSP and SBNP methods do.

5 Concluding Remarks

In this paper, we explored how the proposed smoothed bootstrap methods can be used to compute Type I error rates for different hypothesis tests, and we compared their results with those of Efron’s bootstrap methods through simulations. The smoothed bootstrap methods are applied to real-valued data, right-censored data and bivariate data. For real-valued data and right-censored data, we tested the null hypothesis that quartiles are equal to those of the underlying distributions. We also tested whether two sample medians are equal, regardless of whether the two samples come from the same underlying distribution. For bivariate data, we computed the Type I error rates for Pearson and Kendall correlation tests.

We found that the smoothed bootstrap methods perform better when the sample size is small for real-valued and right-censored data, providing lower discrepancies between actual and nominal error rates. As the sample size gets larger, all bootstrap methods provide good results, but Efron’s methods mostly perform better for the third quartile. For the two-sample median test, we use the achieved significance level to test whether the two samples have equal medians or not. All bootstrap methods performed well, and the Type I error rates are close to the nominal levels.

For the Pearson correlation test, the SBSP and SBNP methods lead to lower discrepancies between actual and nominal Type I error rates compared to Efron’s and smoothed Efron’s methods when the sample size is small. For large sample sizes, all methods provide good results. However, the SBNP method performs better in most dependence levels. In situations where the data distribution is asymmetric, the SBSP method does not perform well, particularly when \(\tau \) is not close to zero, which results from the Normal copula assumption.

For the Kendall correlation test, it is recommended to use either Efron’s bootstrap method or the SEB method, particularly when the underlying distribution is asymmetric and the Kendall correlation is strong. Their influence on the observations’ ranks is much smaller than that of the SBSP and SBNP methods. When \(\tau =0\) and the sample size is small, all bootstrap methods perform well, and as n gets larger, their performance improves and the Type I error rates become closer to the nominal level \(2\alpha \).

In conclusion, we used the bootstrap methods for real-valued data, right-censored data and bivariate data to compute Type I error rates for different hypothesis tests. Future research could focus on applying these bootstrap methods to compute power or Type II error rates for some hypothesis tests.