1 Introduction

This paper examines the Bakshi et al. (2003) (BKM) risk-neutral estimators with explicit skewness and kurtosis as both the input and benchmark and finds the condition of strike prices required to bound the errors of skewness. The BKM estimators have been widely used in the literature and are used by both academics and practitioners; for example, the BKM estimator for skewness is used to calculate the Chicago Board Options Exchange (CBOE) skewness (SKEW) index,Footnote 1 the forward-looking tail-risk or crash risk indicator of the S&P 500 index. Although the errors of the BKM estimators have been studied by Aschakulporn and Zhang (2022) (AZ), the relationship between the model’s parameters and skewness is complicated. This paper analyzes the BKM risk-neutral estimators using simulated option prices generated by a closed-form option-pricing formula with skewness and kurtosis (derived using the Gram–Charlier density)—in doing so, the estimated values can be compared directly with their true values.

A literature review regarding the use and importance of the BKM estimators is presented in AZ. Four papers have examined BKM errors—AZ, Ammann and Feser (2019), Liu and van der Heijden (2016), and Lee and Yang (2015)—with AZ being the key piece of the literature. Prior to AZ, the most advanced model used was the Bates (1996) model—a combination of Heston’s (1993) stochastic volatility and Merton’s (1976) jump. AZ use the Duffie et al. (2000) double-jump affine jump-diffusion model and derive the skewness analytically to get the closed-form solution; this removes the approximation aspect of previous literature. However, due to the complexity of the affine jump-diffusion model, the entire range of their parameters cannot be examined exhaustively, and the relationship between skewness and the model parameters is also lengthy.

Following AZ, the errors of BKM risk-neutral moment estimators caused by truncation and discretization are tested using simulated option prices. The BKM estimators are derived in a setting that has options with a continuum of strikes over an infinite domain; this, however, does not match reality, as options data have discrete strikes over a finite domain. These discrete strikes cause discretization errors, and a finite domain causes truncation errors. Having complete control over options allows the errors of the BKM estimators to be analyzed without external influences in a controlled manner. Using simulated option prices, the BKM estimators can be applied and the estimates can be compared with the true moments calculated from the model originally used to create the options. Similar to AZ, the true moments need not be found from simulations or approximated. The true moments are instead found analytically. For AZ, their formula is quite lengthy; whereas for the Gram–Charlier density, the true moments are specified, not obtained, as the input parameters are the moments themselves. Following Zhang and Xiang (2008), option prices are generated using an option pricing formula with skewness and excess kurtosis (hereinafter, kurtosis). The Gram–Charlier density is used as it provides a way to introduce higher-order moments, namely skewness and kurtosis, to the standard normal distribution. This method has been used by many, including Jarrow and Rudd (1982), Corrado and Su (1996), Longstaff (1995), Backus et al. (1997), Kochard (1999), and Corrado (2007). Using AZ’s framework, the effects of constant and linear extrapolation, and linear, cubic spline, and kernel interpolation on the BKM estimators are similarly tested. In addition, this paper extends the analysis of AZ to examine the volatility, cubic, and quartic contracts used to calculate the BKM estimators in depth. The error of the approximate convexity term used to derive the BKM estimators is also examined.

The remainder of this paper is organized as follows. Section 2 presents an option pricing model with skewness and kurtosis. Section 3 presents the method used to create options prices and the method used to quantify and analyze the errors and convergence of the BKM method. Section 4 describes the data. Section 5 provides the numerical results, and Sect. 6 concludes. The appendix gives the details of key derivations, including part A which briefly presents the BKM risk-neutral moment estimators.

2 Option pricing formula with skewness and kurtosis

The return of the Black-Scholes model is normally distributed. The return has no skewness or excess kurtosis. The Gram–Charlier series is a statistical method that is commonly used to introduce skewness and kurtosis (and higher-order moments) to the normal distribution.Footnote 2 Combining the two, a parsimonious option-pricing formula with skewness and kurtosis can be derived. The two additional parameters, the skewness and kurtosis, are the parameters of interest for this paper. As there are only two additional parameters, examining their effects on errors comprehensively is possible.

2.1 Option pricing using the Gram–Charlier density

This paper follows Zhang and Xiang (2008) to derive the option-pricing formula. They first specify the underlying stock price at maturity, \(S_T\), to be

$$\begin{aligned} S_{T} = F_{t}^{T} e^{\left( -\frac{1}{2}\sigma ^2 + \mu _{c} \right) \tau + \sigma \sqrt{\tau } y}, \end{aligned}$$
(1)

where \(F_{t}^{T}\), \(\sigma \), \(\tau \), \(\mu _{c}\), and \(y\) is the forward price, standard deviation, time to maturity (\(T - t\)), convexity adjustment term, and a random variable, respectively. Using the Gram–Charlier density, if y has probability density

$$\begin{aligned} f\left( y \right) = n\left( y \right) - \frac{\lambda _1}{3!} \frac{d^3 n\left( y \right) }{d y^3} + \frac{\lambda _2}{4!} \frac{d^4 n\left( y \right) }{d y^4}, \end{aligned}$$
(2)

where \(n\left( y \right) = \frac{1}{\sqrt{2\pi }} e^{-\frac{y^2}{2}}\), then \(y\) will have a mean of zero, variance of one, skewness equal to \(\lambda _1\) and an excess kurtosis of \(\lambda _2\). The forward price, \(F_{t}^{T}\), is related to the current stock price, \(S_{t}\), by \(F_{t}^{T} = S_{t} e^{\left( r - q \right) \tau }\), where \(r\) is the risk-free rate and \(q\) is the continuous dividend rate. The convexity adjustment term is required to keep this model arbitrage-free by ensuring that the stock price satisfies the martingale condition (\(F_{t}^{T} = E_{t}^{\mathcal{Q}}\left[ S_{T} \right] \)) in the risk-neutral world and was found to be

$$\begin{aligned} \mu _c = -\frac{1}{\tau } \ln \left[ 1 + \frac{\lambda _1}{3!} \left( \sigma \sqrt{\tau } \right) ^3 + \frac{\lambda _2}{4!} \left( \sigma \sqrt{\tau } \right) ^4 \right] . \end{aligned}$$
(3)

Using the risk-neutral valuation formula of Harrison and Kreps (1979) and Harrison and Pliska (1981), Proposition 1, the European call option pricing formula with skewness and kurtosis can be derived.

Proposition 1

The price of a European call option with skewness and kurtosis is

$$\begin{aligned} \begin{aligned} c_{t}&= F_{t}^{T} e^{-r \tau } N(d_1) - K e^{-r \tau } N(d_2) + K e^{-r \tau } \left( \frac{\lambda _1}{3!} A + \frac{\lambda _2}{4!} B \right) \sigma \sqrt{\tau } \end{aligned} \end{aligned}$$
(4)

where

$$\begin{aligned} \begin{aligned} A&= -\left( d_2 - \sigma \sqrt{\tau } \right) n(d_2), \qquad B = -\left( 1 - d_2^2 + \sigma \sqrt{\tau } d_2 - \sigma ^2 \tau \right) n(d_2), \\ d_2&= \frac{\ln (F_{t}^{T}/K) + \left( -\frac{1}{2}\sigma ^2 + \mu _{c} \right) \tau }{\sigma \sqrt{\tau }}, \qquad d_1 = d_2 + \sigma \sqrt{\tau }. \end{aligned} \end{aligned}$$

The European put option price can be found using the put-call parity:

$$\begin{aligned} c_{t} - p_{t} = F_{t}^{T} e^{-r \tau } - K e^{-r \tau }. \end{aligned}$$
(5)

Remark 1

If the skewness, \(\lambda _1\), and excess kurtosis, \(\lambda _2\), are both zero, then the convexity adjustment term \(\mu _{c}\) will also be zero and Eq. (4) will be equivalent to Black and Scholes (1973).

Remark 2

There is a minor error in Zhang and Xiang (2008), which, once fixed, allows the final result to be expressed more concisely. In Appendix B of Zhang and Xiang (2008), the proof of proposition 2, the call price is given as:

$$\begin{aligned} c_0^{*}&= \left[ F_0 e^{-r \tau } N\left( d_1 \right) - K e^{-r \tau } N\left( d_2 \right) \right] \left[ 1 + \frac{\lambda _1}{3!} \left( \sigma \sqrt{\tau } \right) ^3 + \frac{\lambda _1}{4!} \left( \sigma \sqrt{\tau } \right) ^4 \right] \\&\quad + K e^{-r \tau } \left( \frac{\lambda _1}{3!} A + \frac{\lambda _1}{4!} B \right) \sigma \sqrt{\tau }, \end{aligned}$$

where

$$\begin{aligned} A&= -\left( d_2 - \sigma \sqrt{\tau } \right) n(d_2) + \sigma ^2 \tau N\left( d_2 \right) , \\ B&= -\left( 1 - d_2^2 + \sigma \sqrt{\tau } d_2 - \sigma ^2 \tau \right) n(d_2) + \left( \sigma \sqrt{\tau } \right) ^3 N\left( d_2 \right) \end{aligned}$$

and \(d_1\) and \(d_2\) are the same as Eq. (4). The first term \(F_0 e^{-r \tau }\) should be \(F_0 e^{\left( \mu _{c} - r \right) \tau }\). With the additional \(\mu _{c}\) term, the formula can be simplified to Eq. (4).

Remark 3

The results of Kochard (1999) and Schlögl (2013) are equivalent to Eq. (4), however, due to their choice of arrangement, their final expression was not simplified further. Kochard (1999) incorrectly defines \(d_2\) as \(\frac{\mu _{\tau } - \ln \left( \frac{X}{S_0} \right) }{\sigma _{\tau }}\), missing the dividend term, in Appendix 2.1. Correcting this gives \(d_2 = \frac{\mu _{\tau } - \delta \tau - \ln \left( \frac{X}{S_0} \right) }{\sigma _{\tau }}\). Ki et al. (2005) also derive an equivalent expression but did not simplify the expression.

Remark 4

Backus et al. (1997) derive a similar formula; however, their martingale restriction is an approximation of the convexity adjustment term (Eq. (3)). In the proof of proposition 1 in the appendix, they incorrectly define their \(I_3\) with the third derivative instead of the fourth. Correcting this gives, \(I_3 = \int _{w^{*}}^{\infty }{\left( S_t e^{\mu _{n} + \sigma _{n} w} - K \right) \varphi ''''\left( w \right) dw}\).

Remark 5

Corrado (2007) makes the risk-neutral stock price model a martingale in the risk-neutral measure by adding

$$\begin{aligned} -\left( \frac{\lambda _1}{3!} \left( \sigma \sqrt{\tau } \right) ^2 + \frac{\lambda _2}{4!} \left( \sigma \sqrt{\tau } \right) ^3 \right) y \end{aligned}$$

to Eq. (2). Equation (4) can be transformed to Corrado (2007) by setting \(\mu _c = 0\) in \(d_2\).

Remark 6

The option pricing formula (Eq. (4)) can be transformed into the Brown and Robinson (2002) correction of Corrado and Su (1996) by replacing \(F_t^{T}\) with \(S_t e^{\left( r - q - \mu _{c} \right) \tau }\). Corrado and Su (1996) does not have a martingale restriction to ensure that the model satisfies the no-arbitrage principle.

Remark 7

Heston and Rossi (2017) correctly apply the Gram–Charlier density in a general form which encompasses both Corrado and Su (1996) and Eq. (4), by setting their \(\mu \) to \(\ln S_0 + r - q - \frac{1}{2}\sigma ^2\) and \(\ln S_0 + r - q + \mu _{c} - \frac{1}{2}\sigma ^2\), respectively—using their notation. They also extend from using Hermite polynomials to logistic polynomials (another orthogonal polynomial) and show that orthogonal polynomials can be used to span option payoffs. However, Hermite polynomials tend to diverge unlike logistic polynomials.

Remark 8

An advantage of using the Gram–Charlier density is that it can be easily extended to have an arbitrary number of additional moments/cumulants while satisfying the martingale condition.

For the purposes of this paper, the Gram–Charlier density (Eq. (2)), including only the skewness and kurtosis term is used. Figure 1 shows the region of skewness and kurtosis, which ensures a valid density function. The derivation of this region and an alternate formulation of the density is shown in the appendix, part D.

Fig. 1
figure 1

Gram–Charlier valid region. This figure shows the region (shaded area) in which pairs of skewness and kurtosis values will yield a valid probability density function using the Gram–Charlier density. The curve represents the skewness and kurtosis required to make the density function vanish. Point A corresponds to the origin and points B to O correspond to various values of \(x\) between \(-\infty\) and \(\infty \)

This formulation (Eq. 4) is ideal for testing the BKM estimators. The stock price model satisfies the no-arbitrage principle, unlike Corrado and Su (1996). The martingale restriction is not approximated like Backus et al. (1997). The expression is clearer and more elegant compared to Kochard (1999), Schlögl (2013), and Ki et al. (2005). The relationship between the model’s skewness and excess kurtosis and inputs are clear as they are the same, unlike Corrado (2007) and Chateau and Dufresne (2017).

3 Methodology

To analyze the errors of BKM risk-neutral moment estimators, a controlled environment with clear benchmarks and parameters would be ideal. Part A of the appendix presents a brief derivation of the BKM estimator. The BKM estimator requires the use of option prices which can be provided by simulated option prices generated based on the Gram–Charlier density. The Gram–Charlier density is ideal, as it provides a malleable framework with skewness and kurtosis that are explicitly specified as parameters. These parameters are used as direct benchmarks against the moments calculated by the BKM estimator.

The procedure to test the BKM estimator is to (1) generate simulated option prices (Sect. 3.1), (2) apply the BKM estimator (Part A of the appendix), and (3) compare the estimates with their true values (Sect. 3.2). In addition to this, various interpolation and extrapolation techniques could be tested by introducing these techniques before applying the BKM estimator. This procedure follows the error analysis of AZ with the addition of a more comprehensive and in-depth analysis. Instead of testing two data points in two different market periods, the entire range of parameters (2515 points) is tested using the Gram–Charlier approach. Each contract used to calculate the BKM estimators are examined individually and with a smaller step size in Sect. 5.4. The rare disaster concern index (RIX) from Gao et al. (2018) and simple variance swaps (SVIX) from Martin (2016) are also tested.

3.1 Simulated option prices using the Gram–Charlier density

The options prices generated by Eq. (4) will be for an underlying asset with a specified return distribution. These values can be generated with specific sets of strike prices to test various errors. As the skewness and kurtosis is bounded by the valid region of the Gram–Charlier density (shown in Fig. 1), 2515 different pairs of skewness and kurtosis values within the valid region were chosen for analysis. These points are shown in Fig. 2.

Using the Gram–Charlier approach, the BKM estimator errors can be found directly. Similarly, other estimators can be tested with this approach. The CBOE VIX, rare disaster concern index, RIX, from Gao et al. (2018) and simple variance swaps, SVIX, from Martin (2016) derived under the Gram–Charlier model is shown in the appendix, part E.

3.2 Errors from truncation and discretization

Following AZ, the truncation and discretization errors are the two sources of errors that were tested. The same definitions are used to allow for consistency. These are presented in part B of the appendix for convenience. For calculations, the maximum and minimum strike prices are rounded to the nearest dollar and the boundary controlling factor, a, is varied between 0.05 and 0.95 in 0.01 intervals.Footnote 3 The step size, \(\Delta K\), has been chosen to vary from 1 to 50, in increments of 1.Footnote 4

For each combination of a and \(\Delta K\), of which there are 4550 (91 × 50), the mesh of points (pairs of skewness and kurtosis) within the valid Gram–Charlier region have been used to generate simulated option prices (as described in Sect. 3.1). From this, the discretized BKM method is applied to each set of options to find the risk-neutral moments. The estimation error is defined as

$$\begin{aligned} \text{ Estimation}\, \text{ Error }:= \text{Estimated}\, \text{Moment } - \text{True }\, \text{Moment}, \end{aligned}$$
(6)

following AZ.

The error for each point in the valid Gram–Charlier region for a specific combination of a and \(\Delta K\) is averaged to find the average error for the whole valid region. The maximum of the absolute value of the error over the valid region was also recorded.

The specification of each parameter is shown in Table 1, and the skewness-kurtosis points used in the valid Gram–Charlier region are shown in Fig. 2.

Table 1 Specification of parameters
Fig. 2
figure 2

Points in the valid Gram–Charlier region used for calculations. Each point used for calculations has been marked with a black dot. The number of points is 2515. Skewness ranges between \(-1.05\) and 1.05 and kurtosis between 0 and 4

An example of the error calculation for skewness and kurtosis done for the points from Fig. 2 are shown in Fig. 3. From this, both the average of the errors and the maximum absolute error for each region are plotted at their respective boundary controlling factor and step size.

Fig. 3
figure 3

Example of the errors for skewness and kurtosis. These figures show the errors between the estimated skewness (a) and kurtosis (b) (using the BKM method) and their true values. Based on the valid Gram–Charlier region, of which 2515 pairs of true skewness and kurtosis values were used to calculate the error surface. For this set of figures, \(K \in [500, 8000]\) (\(a = 0.25\)) and \(\Delta K = 1\)

3.3 Errors from the approximated convexity adjustment term

The approximation made by BKM (Eq. A2) can be analyzed analytically by entering the Black and Scholes (1973) stock price model into the approximation,Footnote 5

$$\begin{aligned} \begin{aligned}&E_{t}^{\mathcal{Q}} \left[ R\left( t, \tau \right) \right] \approx \left( r - \frac{1}{2} \sigma ^2 \right) \tau \\&\quad + \frac{\left( 32 \left( r \tau \right) ^3 - 24 \left( r \tau \right) ^2 \sigma ^2 \tau + 8 r \tau \left( \sigma ^2 \tau \right) ^2 + 48 r \tau \sigma ^2 \tau - \left( \sigma ^2 \tau \right) ^3 - 16 \left( \sigma ^2 \tau \right) ^2 \right) \sigma ^2 \tau }{384} \end{aligned} \end{aligned}$$
(7)

and comparing the BKM return to the Black-Scholes expected log return \(\left( r - \frac{1}{2} \sigma ^2 \right) \tau \). Clearly, the returns are not the same. This additional term corresponds to the convexity adjustment term \(\mu _c\) as calculated by the BKM method. This is found by comparing the expected log returns of the BKM method and Gram–Charlier density return:

$$\begin{aligned} E_{t}^{\mathcal{Q}} \left[ R\left( t, \tau \right) \right]&= E_{t}^{\mathcal{Q}} \left[ \ln \frac{S_T}{S_t} \right] = \left( r - \frac{1}{2} \sigma ^2 + \mu _{c} \right) \tau \end{aligned}$$
(8)
$$\begin{aligned} \implies \mu _{c, BKM}&= \frac{1}{\tau } E_{t}^{\mathcal{Q}} \left[ R\left( t, \tau \right) \right] - \left( r - \frac{1}{2} \sigma ^2 \right) \end{aligned}$$
(9)

Using Eq. (8) in place of the BKM \(\mu \), the errors caused by BKM’s approximation can be calculated.

3.4 Error reduction by using interpolation and extrapolation

Following AZ, various interpolation and extrapolation techniques are applied to test whether the errors of the BKM estimator can be reduced. Constant and linear extrapolation as well as linear, cubic spline, and Gaussian kernel interpolation were applied to the implied volatility curve. What differs from AZ is that interpolation is done so that the largest step size between each strike is no larger than 0.1% of the forward price (with a minimum of $1), and the extrapolation is done so that the domain contains strike prices between 3/4 and 4/3 of the forward price. In AZ, interpolation was done to 0.05% of the forward price and the boundary controlling factor was 1/4—resulting in smaller step sizes over a larger domain.

4 Data

As this paper uses simulated option prices to test for estimation errors, the data required are minimal and can be set arbitrarily. However, to direct the results towards US markets, specifically the S&P 500, the risk-free rate, volatility, and time to maturity have been set to 2.4%,Footnote 6 0.20, and one month, respectively. These values partially reflect the current market conditions in April 2019. The current market volatility based on the CBOE VIXFootnote 7 is oddly low, so the standard deviation has been chosen to be 0.20. A one month time to maturity has been chosen as these tend to be the most liquid (shortest term contracts).

5 Numerical results

5.1 BKM estimation

The mean errors of the BKM standard deviation, skewness, and kurtosis estimators are shown in Table 2. The mean error is the mean of the estimation error (Eq. (6)) calculated over each valid Gram–Charlier region. These errors show that, as expected, the smaller the boundary controlling factor, a, the smaller the approximation error. Similarly, a smaller step size, \(\Delta K\), corresponds to a smaller discretization error. As these two approach zero, the discretization and truncation of the BKM method tend towards an integral over an infinite range. For the average of skewness errors to be less than \(10^{-3}\), the boundary controlling factor and step size, as shown in Table 2, must be less than 0.75 and $20, respectively. A more robust way to limit errors to be below \(10^{-3}\) is done by using the maximum value of the absolute error rather than the mean. This is shown in Table 3. The constraint is stronger and is reflected in the allowable boundary controlling factor and step size. The boundary controlling factor and step size to ensure that the errors are bounded by \(10^{-3}\) must also be bounded by 0.75 and \(\$2\), respectively. The step size is dependent on the forward price; to generalize this result, the step size must be less than 0.1% of the forward price. Comparing the errors of the second to fourth moments shows that the BKM estimator, for these values, is more accurate for lower-order moment estimators. These results are consistent with AZ for a non-volatile market period. During volatile market periods, AZ show that the errors of the BKM skewness estimator are accurate when the boundary controlling factor is 0.25 and the step size is 0.05% of the forward price ($1).

Table 2 Lookup table for mean errors
Table 3 Lookup table for maximum absolute errors

A visual form of Table 2 is presented in Fig. 4. This figure shows that the errors caused by truncation of the integral causes less predictable behaviour, whereas changes in the step size do not seem to affect the error as unpredictably. Although the truncation error is less predictable, when the boundary controlling factor is below 0.75, the behaviour is stable.

Fig. 4
figure 4

Sensitivity of the Average Error for Skewness. Using the Gram–Charlier density, simulated option prices with known standard deviation \(\sigma \), skewness \(\lambda _1\), and kurtosis \(\lambda _2\) were created for a forward price \(F_{t}^{T}\), time to maturity \(\tau \), risk-free rate r, and standard deviation \(\sigma \), have been arbitrarily set to $2000, one month, 2.4%, and 0.20, respectively. In total, 2515 different pairs of skewness and kurtosis (within the Gram–Charlier region) were used. The errors are shown for a varying degree of fineness of strike discretization (\(\Delta K \in [1, 50]\)). The range of strikes used is also varied symmetrically by adjusting \(a \in [0.05, 0.95]\) for \([K_{\min }, K_{\max }] = [F_{t}^{T} \times a, F_{t}^{T}/a]\) (rounded to the nearest dollar). These errors are the averaged errors of the entire valid Gram–Charlier region

Figure 5 shows the projection of Fig. 4 to show just the errors with respect to the step size. Due to the large errors caused by truncation, Fig. 5 shows errors with boundary controlling factor values of 0.05, 0.40, and 0.75. The error boundary of \(10^{-3}\) is shown in red. The same has been done for standard deviation and is shown in Fig. 7. Figures 6 and 8 show the same errors but with respect to the boundary controlling factors for step sizes of 1, 10, and 25.

Fig. 5
figure 5

Errors w.r.t. \({\Delta K}\). These figures have been created for boundary controlling factor values of 0.05, 0.40, and 0.75 and for step sizes of \(\Delta K\) from 1 to 50, to 25, and to 10. The error boundary of \(10^{-3}\) is shown in red (Color figure online)

Fig. 6
figure 6

Errors w.r.t. \({a}\). These figures have been created for step sizes of 1, 10, and 25 and for boundary controlling factors of a from 0.05 to 0.95, to 0.75, and to 0.50. The error boundary of \(10^{-3}\) is shown in red (Color figure online)

Fig. 7
figure 7

Standard deviation errors w.r.t. \({\Delta K}\). These figures have been created for boundary controlling factor values of 0.05, 0.40, and 0.75 and for step sizes of \(\Delta K\) from 1 to 50, to 25, and to 10. The error boundary of \(10^{-3}\) is shown in red (Color figure online)

Fig. 8
figure 8

Standard deviation errors w.r.t. \({a}\). These figures have been created for step sizes of 1, 10, and 25 and for boundary controlling factors of a from 0.05 to 0.95, to 0.75, and to 0.50. The error boundary of \(10^{-3}\) is shown in red (Color figure online)

The skewness errors can easily be restricted below \(10^{-3}\) by fixing \(\Delta K = \$2\) and \(a \le 0.75\). Kurtosis, however, has much larger errors.

The standard deviation, skewness, and kurtosis using the BKM method have been calculated for a skewness and kurtosis of \(-1\) and 2.5, respectively. This has been done for a boundary controlling factor value of 0.05, 0.40, and 0.75 and step size of 1 to 50. This is shown in Fig. 9. Figure 10 shows the same errors with respect to the boundary controlling factors.

Fig. 9
figure 9

Values w.r.t. \({\Delta K}\). These figures have been created for boundary controlling factor values of 0.05, 0.40, and 0.75 and for step sizes of \(\Delta K\) from 1 to 50, to 25, and to 10. The true value is shown in red. These figures have been plotted for a standard deviation (\(\sigma \)), skewness (\(\lambda _1\)), and kurtosis (\(\lambda _2\)) of 0.20, \(-1\), and 2.5, respectively (Color figure online)

Fig. 10
figure 10

Values w.r.t. \({a}\). These figures have been created for step sizes of 1, 10, and 25 and for boundary controlling factors of a from 0.05 to 0.95, to 0.75, and to 0.50. The true value is shown in red. These figures have been plotted for a standard deviation (\(\sigma \)), skewness (\(\lambda _1\)), and kurtosis (\(\lambda _2\)) of 0.20, \(-1\), and 2.5, respectively (Color figure online)

To examine the independence of the skewness estimator from kurtosis, the skewness is estimated when the skewness, the boundary controlling factor, and step size are set to \(-1\), 0.25, and 1, respectively, whilst adjusting the kurtosis within the valid Gram–Charlier region. The same has been done for kurtosis when it is set to 2.5. As neither the true skewness nor the true kurtosis has been chosen to be zero, the relative error for each estimator can, therefore, be calculated. The (absolute) relative error is defined as \(\text{ Relative Error } = \left| \frac{x - x_{true}}{x_{true}} \right|\), where x is the estimated value, and \(x_{true}\) is the true value. The maximum(minimum) relative error for skewness and kurtosis were found to be \(4.476 \times 10^{-04}(4.476 \times 10^{-04})\) and \(1.272 \times 10^{-03}(1.245 \times 10^{-03})\), respectively. These relative error curves are monotonically increasing and decreasing, as shown in Fig. 11. These values cannot be used to compare the sensitivity of skewness and kurtosis directly. The difference of the maximum and minimum values for skewness and kurtosis are \(5.866 \times 10^{-08}\) and \(2.663 \times 10^{-05}\), respectively. From this, the effects of kurtosis on the estimation of skewness and vice versa are shown to be insignificant. Therefore, for this case, the estimation of skewness is independent of kurtosis and vice versa.

Fig. 11
figure 11

Independence of Skewness and Kurtosis. These figures show the values and relative errors of skewness when kurtosis is changed (a), and kurtosis when skewness is changed (b). The boundary controlling factor and step size are set to 0.25 and 1, respectively. When testing skewness, the true skewness is set to \(-1\). For kurtosis, the true value is set to 2.5 (Color figure online)

5.2 Convexity adjustment term errors

The errors of using the Black-Scholes return in place of BKM’s approximation have been tabulated in Tables 4 and 5 and presented in Figs. 12 and 13.

Table 4 Errors for alternative convexity adjustment term
Table 5 Maximum value of the absolute error for alternative convexity adjustment term
Fig. 12
figure 12

Errors w.r.t. \({\Delta K}\) with Alternative Convexity Adjustment Term. These figures have been created for boundary controlling factor values of 0.05, 0.40, and 0.75 and for step sizes of \(\Delta K\) from 1 to 50, to 25, and to 10. The error boundary of \(10^{-3}\) is shown in red (Color figure online)

Fig. 13
figure 13

Errors w.r.t. \({a}\) with Alternative Convexity Adjustment Term. These figures have been created for step sizes of 1, 10, and 25 and for boundary controlling factors of a from 0.05 to 0.95, to 0.75, and to 0.50. The error boundary of \(10^{-3}\) is shown in red (Color figure online)

Tables 4 and 5 have been combined with Tables 2 and 3 in Tables 6 and 7, respectively, to allow for ease of comparison.

Table 6 Comparison of errors for the original and alternative convexity adjustment term
Table 7 Comparison of maximum value of the absolute error for the original and alternative convexity adjustment term

In general, there is a slight decrease in both mean and absolute maximum errors for skewness when the boundary controlling factor is less than 0.75. For larger boundary controlling factors, the alternate convexity adjustment term increases the errors.

5.3 Interpolation and extrapolation

The results of interpolation and extrapolation are shown in Tables 8 and 9, respectively.

Table 8 Errors for interpolation
Table 9 Errors for extrapolation

Table 8 shows that all interpolation techniques reduce the magnitude of skewness error for almost all points. There are a few interpolated points which have errors with greater magnitudes than the non-interpolated case; however, this is due to the oscillating nature of the error. All three interpolation techniques yield much more stable results. Cubic spline performs the best, followed by linear interpolation, then Gaussian kernel regression. The linear interpolation option shows how smoothing the implied volatility curve is more beneficial than interpolating with the option prices directly. As the trapezium rule is used for integration, this effectively is equivalent to linear interpolation of the option prices, which corresponds to the no interpolation or “None” case in Table 8. A reason for this is the shape of the implied volatility curve compared with the option price curve, with respect to the strike price or moneyness. Under the Black-Scholes model, the implied volatility curve is simply a constant line, whereas the price curve is much more complicated. Simply put, linear interpolation of the implied volatility curve is effectively interpolating the price curve non-linearly. Table 8 also shows that if cubic spline interpolation is used, the condition of having a minimum step size no greater than 0.1% of the forward price can be relaxed to 2.5% of the forward price. Table 9 shows that both constant and linear extrapolation, in general, decreases the errors. Constant extrapolation tends to yield more consistent results than linear extrapolation; however, linear extrapolation yields more accurate results. Linear extrapolation, although occasionally more accurate, can lead to negative implied volatilities, depending on the end points of the existing implied volatility curve. Constant extrapolation essentially assumes that the tails of the return are normally distributed, in the same way as Black-Scholes’ model has constant volatility. If the BKM skewness estimator is used directly with no interpolation or extrapolation techniques on S&P 500 index options, based on Eq. (4), the errors are expected to exceed \(10^{-3}\). However, this can be easily remediated by using cubic spline interpolation with linear interpolation. For more stable and reliable results, cubic spline interpolation paired with constant extrapolation may be more appropriate, depending on the breadth of strikes available. These results are also consistent with AZ—although, the conditions are more strict as the affine jump-diffusion model is able to model more volatile markets.

5.4 In-depth error analysis

A closer inspection of errors shows that the current level of granularity of the step size \(\Delta K = 1\) is not sufficient to show the overall form of the relationship between the errors and step size. By reducing the step size to \(\Delta K = 0.001\), this reveals a form which is closer to the true form. This is shown in Fig. 14.

Fig. 14
figure 14

Granularity of step size \({\Delta K}\). The errors for skewness for a boundary controlling factor of \(a = 0.05\) with varying step sizes between 1 and 50 shows that, in general, as the step size gets larger, so do the errors. When the fineness of the granularity is increased from \(\Delta K = 1\) (black line) to \(\Delta K = 0.001\) (red line), the shape of the relationship between the error and the step size is revealed. The forward price \(F_{t}^{T}\), time to maturity \(\tau \), risk-free rate r, standard deviation \(\sigma \), skewness \(\lambda _1\), and kurtosis \(\lambda _2\) are equal to $2000, one month, 2.4%, 0.20, \(-1\), and 2.5, respectively. The error model and envelope (\(E^{\pm }\)) is given by \(\displaystyle \text{ Error } \,\text{ Model } = X \beta ^{s} \sin \left( \frac{2\pi }{X\beta ^{\omega }} \Delta K \right) + X \beta ^{t}\) and \(E^{\pm } = X\left( \beta ^{t} \pm \beta ^{s} \right) \) where \(\beta ^{s} = \begin{bmatrix} 2.311 \times 10^{-04}&1.433 \times 10^{-05} \end{bmatrix}^{T}\), \(\beta ^{t} = \begin{bmatrix} 6.726 \times 10^{-05}&5.366 \times 10^{-06} \end{bmatrix}^{T}\), \(\beta ^{\omega } = \begin{bmatrix} 6.980 \times 10^{-04}&4.967 \times 10^{-04} \end{bmatrix}^{T}\), and \(X = \begin{bmatrix} \Delta K&\left( \Delta K \right) ^2 \end{bmatrix}\)

The error does not decrease exponentially with respect to the step size, but more so quadratically. The error envelope, positive \((+)\) and negative \((-)\), was found using a least-squares approximation of a relatively simple parsimonious quadratic curve:

$$\begin{aligned} \beta _1 \Delta K + \beta _2 (\Delta K)^2. \end{aligned}$$
(10)

The coefficients \(\beta _1^{+}\), \(\beta _2^{+}\), \(\beta _1^{-}\), and \(\beta _2^{-}\) were found to be \(2.984 \times 10^{-04}\), \(1.969 \times 10^{-05}\), \(-1.639 \times 10^{-04}\), and \(-8.959 \times 10^{-06}\), respectively. Using these values, the asymmetric envelope can be decomposed into two components, a symmetric envelope and a trend term. The symmetric envelope has parameters \(\beta ^{s}_1\) and \(\beta ^{s}_2\) equal to \(2.311 \times 10^{-04}\) and \(1.433 \times 10^{-05}\), respectively. The trend component has parameters \(\beta ^{t}_1\) and \(\beta ^{t}_2\) equal to \(6.726 \times 10^{-05}\) and \(5.366 \times 10^{-06}\), respectively. The error is oscillating with an increasing period. To capture this behaviour, Eq. (10) seems to be suitable. The increasing period increases approximately quadratically with \(\beta ^{\omega }_1\) and \(\beta ^{\omega }_2\) equal to \(6.980 \times 10^{-04}\) and \(4.967 \times 10^{-04}\), respectively. This model is also presented in Fig. 14. The methodology used to obtain these parameters is presented in the appendix, part G.

As the risk-neutral moments are composed of multiple contracts, which in itself produces errors, the errors of each contract with respect to the step size are shown in Fig. 15.Footnote 8

Fig. 15
figure 15

Contract Errors w.r.t. \({\Delta K}\). The relative errors for the volatility, cubic, and quartic contracts for a boundary controlling factor of \(a = 0.05\) with varying step sizes between 1 and 50 show that, in general, as the step size gets larger, so do the errors. The forward price \(F_{t}^{T}\), time to maturity \(\tau \), risk-free rate r, standard deviation \(\sigma \), skewness \(\lambda _1\), and kurtosis \(\lambda _2\) are equal to $2000, one month, 2.4%, 0.20, \(-1\), and 2.5, respectively. The value of the volatility, cubic, and quartic contracts are \(3.327 \times 10^{-03}\), \(-1.884 \times 10^{-04}\), and \(6.071 \times 10^{-05}\), respectively. The relative error is defined as \(\text{ Relative }\, \text{ Error } := \frac{\text{ Estimated } - \text{ True }}{\text{ True }}\)

The errors of the volatility contract can be modelled the same way as the skewness estimator. The same error model is not appropriate for the other contracts. The coefficients for the volatility contract (relative) errors are \(\beta ^{s} = \begin{bmatrix} 1.487 \times 10^{-04}&8.976 \times 10^{-06} \end{bmatrix}^{T}\), \(\beta ^{t} = \begin{bmatrix} 2.539 \times 10^{-05}&4.631 \times 10^{-06} \end{bmatrix}^{T}\), and \(\beta ^{\omega } = \begin{bmatrix} 2.773 \times 10^{-05}&5.113 \times 10^{-04} \end{bmatrix}^{T}\). The relative error is defined as

$$\begin{aligned} \text{ Relative Error } := \frac{\text{ Estimated } - \text{ True }}{\text{ True }}. \end{aligned}$$
(11)

6 Conclusion

The importance of the BKM method is clear. It is used by many practitioners and academics as an indicator of tail risk and also to calculate the CBOE SKEW. This paper demonstrates a method to create simulated option prices using the Gram–Charlier density. These simulated prices are created by specifying the mean, variance, skewness, and kurtosis. By doing so, the risk-neutral moment estimators can be compared with the true values. The results show that in order to estimate skewness within \(10^{-3}\) of the true value, the range of strikes (\(K_{\min }, K_{\max }\)) must contain at least 3/4 to 4/3 of the forward price and have a step size (\(\Delta K\)) of no more than 0.1% of the forward price. Rather than using the absolute error as the measure, if the average error is used, then the step size restriction can be relaxed to 1% of the forward price. Under the same boundary controlling factor and step size specification, absolute errors of the kurtosis and standard deviation are bounded by \(5 \times 10^{-3}\) and \(10^{-4}\), respectively.

In general, markets are not able to satisfy the conditions required to calculate skewness with errors less than \(10^{-3}\), hence, interpolation and extrapolation techniques are required. Cubic spline interpolation performs better than linear interpolation and Gaussian kernel regression. For extrapolation, there is a trade-off between stability and accuracy, with constant extrapolation being more stable than the more accurate linear extrapolation. Using both cubic spline and either constant or linear regressions will help reduce discretization and truncation errors of skewness.

AZ provide the prequel to this paper by examining the BKM risk-neutral skewness estimator under the affine jump-diffusion model. The results of this paper are consistent with AZ for market periods that are not too volatile. When volatile periods are included, AZ recommend that linear interpolation to 0.05% of the forward price and constant extrapolation to include strikes at least a quarter to quadruple the forward price (\(a = 1/4\)) should be performed on the implied volatility curve in order to bound the errors of the BKM skewness estimator to be less than \(10^{-3}\). This condition is stricter than the \(a = 3/4\) and \(\Delta K = 0.1\% F_{t}^{T}\) found using the Gram–Charlier approach—this is expected. The Gram–Charlier density introduces skewness and excess kurtosis, whereas the Duffie et al. (2000) model has parameters, such as the instantaneous variance, \(v_{t}\), mean-reverting speed, \(\kappa \), long-term mean, \(\theta \), diffusive volatility, \(\sigma \), correlation between the stock and variance, \(\rho \), jump intensity, \(\lambda \), mean jump size in variance, \(\mu _{y}\), mean jump size in price, \(\mu _{x} + \rho _{J} y\), variance of jump size in price, \(\sigma _{x}^2\), and correlation between jumps, \(\rho _{J}\), which can be combined to get the skewness. With the additional freedom given by the number of additional parameters and the model’s specification, the affine jump-diffusion model can better model the market. However, due to the complexity of the Duffie et al. (2000) model, the errors of the BKM estimators were not analyzed as comprehensively or as in-depth by AZ.

Beyond AZ, this paper finds that the errors of skewness oscillate with respect to the step size. Increasing the granularity of the step size decreases the error approximately quadratically, not exponentially. However, this oscillation can be significantly reduced using interpolation techniques. Also, it was found that errors caused by the approximation (Eq. (A2)) used to derive the BKM estimator is negligible. In addition to the risk-neutral skewness, the risk-neutral standard deviation, risk-neutral kurtosis, VIX, RIX, and SVIX are also examined.