1 Introduction

Martingale estimating functions, introduced in Bibby and Sørensen (1995), provide a well-established method for inference in discretely observed diffusion processes when the likelihood function is unknown or too complicated. The idea behind martingale estimating functions is to provide a simple approximation of the true likelihood which forms a martingale and hence, under suitable regularity assumptions, leads to consistent and asymptotically normal estimators. One way of approximating the likelihood function is by a Taylor expansion, leading to linear and quadratic martingale estimating functions, cf. Bibby and Sørensen (1995). Another possibility is to use the eigenfunctions of the associated diffusion operator, cf. Kessler and Sørensen (1999). In this context a suitable optimality concept was introduced by Godambe and Heyde (1987) and Heyde (1988). For a general theory of asymptotic statistics for diffusion processes we refer, e.g., to Höpfner (2014).

Our aim in this paper is to estimate the dimensionality or index parameter \(\vartheta \in \Theta \subset (-\frac{1}{2},\infty )\) of a classical one-dimensional Bessel process given by the stochastic differential equation

$$\begin{aligned} \left\{ \begin{array}{ll} \,{\mathrm {d}}Y_t &{}=\,{\mathrm {d}}B_t+\left( \vartheta +\frac{1}{2}\right) \frac{1}{Y_t} \,{\mathrm {d}}t,\\ Y_0&{}=y_0>0, \end{array} \right. \end{aligned}$$
(1.1)

where B denotes a standard Brownian motion. Since a Bessel process is non-ergodic, we transform it into a stationary and ergodic process by adding a mean-reverting term with speed of mean reversion \(\alpha >0\) to the drift, which we call the modified Bessel process in the following. The two processes are then related by the well-known space-time transformation of a Bessel process. Since the eigenfunctions of the associated diffusion operator of the modified Bessel process are known, we base our martingale estimating function on these eigenfunctions and follow the lines of Kessler and Sørensen (1999).

For the estimating function based on the first eigenfunction we obtain an explicit formula for the estimator, which depends on the observations only through their squares. We see that the estimator coincides with the one obtained from a linear martingale estimating function for the Cox-Ingersoll-Ross process, which is the square of the modified Bessel process. We discuss optimality in the sense of Godambe and Heyde. Note that Overbeck and Ryden (1997) also established local asymptotic normality for estimators in the Cox-Ingersoll-Ross model.

Furthermore, we consider martingale estimating functions based on the first two eigenfunctions and discuss the improvement of the asymptotic variance. In this case we do not get an explicit result for the estimator anymore.

Note that our results for the Bessel process may also be used to estimate the multiplicity parameter k of a one-dimensional Dunkl process, a special jump diffusion given by the generator

$$\begin{aligned} L_k u(x)=u^{\prime \prime }(x)+k\left( \frac{2}{x}u^{\prime }(x)+\frac{u(-x)-u(x)}{x^2}\right) ,\;\; k\ge 0. \end{aligned}$$

By the last term in the generator we see that the associated process possesses jumps due to a reflection, which lead to a sign change. Hence, the modulus of this Dunkl process is a Bessel process with dimensionality parameter \(k-1/2\), cf. Chybiryakov et al. (2008). For the Dunkl process the multiplicity parameter is of special interest, since it determines the jump activity, namely for \(k\ge \frac{1}{2}\) a Dunkl process has a finite jump activity, whereas for \(k<1/2\) we have infinite jump activity.

Furthermore, the technique of transforming a non-ergodic process into an ergodic one via a space-time transformation may also be used for larger classes of polynomial diffusion processes given by a generalization of the stochastic differential equation of a Bessel process. We introduce these processes and provide results for the martingale estimating function based on the first eigenfunction.

The paper is organised as follows: in Sect. 2 we collect the basic facts on the processes, Sect. 3 is devoted to martingale estimating functions based on the first eigenfunction for Bessel processes, while in Sect. 4 we provide an extension to a larger class of polynomial diffusions. Section 5 considers estimators based on two eigenfunctions for Bessel processes.

2 Basic results on Bessel processes and a stationary modification

In this section we introduce the basic results on the underlying diffusions, which we will need in the following for the theory of martingale estimating functions. Our aim is to estimate the parameter \(\vartheta \in \Theta \subset (-\frac{1}{2},\infty )\) of a classical one-dimensional Bessel process. Since a Bessel process is non-ergodic and most results on parameter estimation for diffusions are developed for ergodic diffusions, we start by introducing a modification of a Bessel process which is ergodic.

We consider the stochastic differential equation

$$\begin{aligned} \left\{ \begin{array}{ll} \,{\mathrm {d}}X_t &{}=\,{\mathrm {d}}B_t+\left[ \left( \vartheta +\frac{1}{2}\right) \frac{1}{X_t}-\alpha X_t\right] \,{\mathrm {d}}t,\\ X_0&{}=x_0>0 \end{array} \right. \end{aligned}$$
(2.1)

for a Brownian motion B, some fixed \(\alpha >0\) and the parameter of interest \(\vartheta \in \Theta \subset (-\frac{1}{2},\infty )\). Equation (2.1) is similar to the equation defining a Bessel process except for the additional drift term \(-\alpha X_t \,{\mathrm {d}}t\), which we add to ensure ergodicity and stationarity. The associated generator is

$$\begin{aligned} L_\vartheta f(x) =\frac{1}{2}f''(x)+\left[ \left( \vartheta +\frac{1}{2}\right) \frac{1}{x}-\alpha x\right] f'(x). \end{aligned}$$

In order to determine the density of \((X_t)_{t\ge 0}\), we consider the space-time transformation

$$\begin{aligned} X_t=\exp (-\alpha t)Y_{\frac{\exp (2\alpha t)-1}{2\alpha }} \end{aligned}$$
(2.2)

for a Bessel process \((Y_t)_{t\ge 0}\) with index \(\vartheta \), which immediately follows by Itô’s formula. For simplicity, we use the notation \(f(t):=\exp (-\alpha t)\) and \(g(t):=\frac{\exp (2\alpha t)-1}{2\alpha }\)

$$\begin{aligned} \,{\mathrm {d}}X_t&{\mathop {=}\limits ^{(2.2)}}\,{\mathrm {d}}(f(t)Y_{g(t)})=f(t)\,{\mathrm {d}}Y_{g(t)}+Y_{g(t)}\,{\mathrm {d}}f(t)\\&\ = f(t) \left[ \,{\mathrm {d}}B_{g(t)} +\left( \vartheta +\frac{1}{2} \right) \frac{1}{Y_{g(t)}} \,{\mathrm {d}}g(t) \right] +Y_{g(t)}f'(t)\,{\mathrm {d}}t\\&\ = f(t)\sqrt{g'(t)}\,{\mathrm {d}}W_t+\left( \vartheta +\frac{1}{2}\right) \frac{f(t)g'(t)}{Y_{g(t)}}\,{\mathrm {d}}t -\alpha f(t)Y_{g(t)} \,{\mathrm {d}}t\\&{\mathop {=}\limits ^{(2.2)}}f(t)\sqrt{g'(t)}\,{\mathrm {d}}W_t+\left( \vartheta +\frac{1}{2}\right) \frac{f(t)^2g'(t)}{X_t}\,{\mathrm {d}}t -\alpha X_t\,{\mathrm {d}}t\\&\ =\,{\mathrm {d}}W_t+\left( \vartheta +\frac{1}{2}\right) \frac{1}{X_t}\,{\mathrm {d}}t -\alpha X_t\,{\mathrm {d}}t \end{aligned}$$

for some Brownian motion W as \(f(t)^2g'(t)=1\) and \(f'(t)=-\alpha f(t)\). Therefore, we derive the distribution of \((X_t)_{t\ge 0}\) by using the well-known distribution of the Bessel process \((Y_t)_{t\ge 0}\), namely

$$\begin{aligned} P(Y_t\le z|Y_0=x)=\frac{2}{(2t)^\vartheta \Gamma (\vartheta +1)}\int _0^z j_\vartheta \left( \frac{ixy}{t}\right) e^{-\frac{x^2+y^2}{2t}}y^{2\vartheta +1}\,{\mathrm {d}}y; \quad x, z >0, \end{aligned}$$

where

$$\begin{aligned}&j_\vartheta (z):=\frac{\Gamma (\vartheta +1)}{\Gamma (\vartheta +\frac{1}{2})\Gamma (\frac{1}{2})}\int _{-1}^{1}e^{isz}(1-s^2)^{\vartheta -\frac{1}{2}}\,{\mathrm {d}}s \end{aligned}$$

is the Bessel function with index \(\vartheta \) [see for instance Itô and McKean (1974)]. Hence, we obtain

$$\begin{aligned} P(X_t\le z|X_0=x)&{\mathop {=}\limits ^{(2.2)}}P(Y_{\frac{\exp (2\alpha t)-1}{2\alpha }}\le \exp (\alpha t)z|Y_0=x)\\&= C_{\vartheta ,\alpha ,t}\int _0^{z} j_\vartheta \left( ixy\frac{2\alpha \exp (\alpha t)}{\exp (2\alpha t)-1}\right) \exp \left( -\alpha \frac{x^2+y^2\exp (2\alpha t)}{\exp (2\alpha t)-1}\right) y^{2\vartheta +1} \,{\mathrm {d}}y \end{aligned}$$

with

$$\begin{aligned} C_{\vartheta ,\alpha ,t}:=\frac{2\alpha ^\vartheta (\exp (2\alpha t))^{\vartheta +1}}{\Gamma (\vartheta +1)(\exp (2\alpha t)-1)^\vartheta }. \end{aligned}$$
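To make the space-time transformation (2.2) concrete, the following minimal sketch (our own illustration, not part of the original derivation) simulates a Bessel path by an Euler-Maruyama scheme and reads off the modified process on an equidistant grid; the parameter values, the step size and the positivity guard are ad-hoc choices.

```python
import numpy as np

rng = np.random.default_rng(0)
theta, alpha, y0 = 3.0, 1.0, 0.1      # example values, not from the paper
T, dt = 2.0, 1e-3                     # horizon for X and Euler step for Y

g = lambda t: (np.exp(2 * alpha * t) - 1) / (2 * alpha)   # time change in (2.2)

n_steps = int(np.ceil(g(T) / dt))
Y = np.empty(n_steps + 1)
Y[0] = y0
for k in range(n_steps):              # Euler step for dY = dB + (theta + 1/2)/Y dt
    Y[k + 1] = Y[k] + (theta + 0.5) / Y[k] * dt + np.sqrt(dt) * rng.standard_normal()
    Y[k + 1] = max(Y[k + 1], 1e-6)    # crude guard against crossing zero

t_grid = np.linspace(0.0, T, 201)
idx = np.minimum(np.round(g(t_grid) / dt).astype(int), n_steps)
X = np.exp(-alpha * t_grid) * Y[idx]  # X_t = exp(-alpha t) * Y_{g(t)}, cf. (2.2)
```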

We denote the density of \(X_{\Delta }\) with starting point x by \(p_\vartheta (x,\cdot ,\Delta )\) and the distribution of \(X_\Delta \) by \(P_{\vartheta }\). In the following, we check that \((X_t)_{t\ge 0}\) is indeed stationary and ergodic and determine the invariant measure. The density of the scale measure for a fixed \(\xi \in (0,\infty )\) is defined as

$$\begin{aligned} s(x)&:=\exp \left( -2\int _{\xi }^x\left( \vartheta +\frac{1}{2}\right) \frac{1}{y}-\alpha y\,{\mathrm {d}}y \right) \\&=\left( \frac{x}{\xi }\right) ^{-(2\vartheta +1)} e^{\alpha (x^2-\xi ^2)}. \end{aligned}$$

Note that, due to the singularity in the drift, we initially have to consider some positive interior point \(\xi \).

By Sørensen (2012, p. 9) and Skorokhod (1989) we may deduce that \((X_t)_{t\ge 0}\) is ergodic as we see that the conditions

$$\begin{aligned}&\int _0^\xi s(x) \,{\mathrm {d}}x=\infty ,\quad \int _\xi ^\infty s(x)\,{\mathrm {d}}x=\infty \quad \text {and} \quad \int _0^\infty \frac{1}{s(x)}\,{\mathrm {d}}x<\infty \end{aligned}$$

are satisfied.

As the invariant measure is defined via the speed measure \(m(\,{\mathrm {d}}x):=\frac{1}{s(x)}\,{\mathrm {d}}x\), we obtain by a straightforward calculation that the density of the invariant probability measure is given by

$$\begin{aligned} \mu _\vartheta (x)=\frac{2\alpha ^{\vartheta +1}}{\Gamma (\vartheta +1)}x^{2\vartheta +1}e^{-\alpha x^2} \end{aligned}$$

on \((0,\infty )\) with respect to the Lebesgue measure (Sørensen 2012, Eq. (1.15)).
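Since \(\mu _\vartheta \) is the density of \(\sqrt{G}\) for a Gamma-distributed random variable G with shape \(\vartheta +1\) and rate \(\alpha \), sampling from the stationary law is straightforward; a short sketch (our own illustration):

```python
import numpy as np

def sample_stationary(n, theta, alpha, rng=None):
    """Draw n samples from the invariant density mu_theta: X^2 ~ Gamma(theta + 1, rate alpha)."""
    rng = rng or np.random.default_rng()
    return np.sqrt(rng.gamma(shape=theta + 1, scale=1.0 / alpha, size=n))
```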

For the calculation of the asymptotic variance we will need the symmetric distribution \(Q_\Delta ^\vartheta \) of two consecutive observations \(X_{(i-1)\Delta }\) and \(X_{i\Delta }\) on \((0,\infty )^2\). It is given by

$$\begin{aligned} Q_\Delta ^\vartheta (\,{\mathrm {d}}x,\,{\mathrm {d}}y)&=\mu _\vartheta (x) p_\vartheta (x,y,\Delta )\,{\mathrm {d}}x\,{\mathrm {d}}y\\&=C_\vartheta j_\vartheta \left( ixy\frac{2\alpha \exp (\alpha \Delta )}{\exp (2\alpha \Delta )-1}\right) \exp \left( -\frac{\alpha \exp (2\alpha \Delta )}{\exp (2\alpha \Delta )-1}(x^2+y^2)\right) (xy)^{2\vartheta +1} \,{\mathrm {d}}y \,{\mathrm {d}}x \end{aligned}$$

with

$$\begin{aligned} C_\vartheta :=\frac{4\alpha ^{2\vartheta } (\exp (2\alpha \Delta ))^{\vartheta +1}}{\Gamma (\vartheta +1)^2(\exp (2\alpha \Delta )-1)^\vartheta }. \end{aligned}$$

3 Martingale estimating functions based on eigenfunctions

In this section we proceed similarly to Bibby and Sørensen (1995) and Kessler and Sørensen (1999) to construct martingale estimating functions for our parameter of interest \(\vartheta \). The concepts in these papers are based on ergodic diffusions. As Bessel processes are non-ergodic, we constructed the ergodic and stationary version in (2.1). Let \(X_{\Delta },\dots ,X_{n\Delta }\) be discrete observations of the process. We consider the eigenfunctions of the generator

$$\begin{aligned} L_\vartheta f(x) =\frac{1}{2}f''(x)+\left[ \left( \vartheta +\frac{1}{2}\right) \frac{1}{x}-\alpha x\right] f'(x), \end{aligned}$$

which are the solutions of \(L_\vartheta \phi _\eta =-\lambda _\eta \phi _\eta \) given by

$$\begin{aligned} \lambda _\eta =2\alpha \eta , \quad \phi _\eta (x,\vartheta )= \sum _{k=0}^{\eta } \frac{(-\eta )_k}{(\vartheta + 1 )_k k!} (\alpha x^2)^k,\quad \eta \in {\mathbb {N}} \end{aligned}$$

with the Pochhammer symbols \((x)_0:=1\) and \((x)_k:=\frac{\Gamma (x+k)}{\Gamma (x)}=x (x+1) \dots (x+k-1)\) for \(k\in {\mathbb {N}}\), cf. (Rösler and Voit 2008, 2.58 Corollary (i)). According to (Kessler and Sørensen 1999, Sect. 5, Eigenfunctions and Martingales), the property

$$\begin{aligned} \int _0^\infty (\phi _\eta '(x,\vartheta ))^2\mu _\vartheta (\,{\mathrm {d}}x)= \frac{2\alpha ^{\vartheta +1}}{\Gamma (\vartheta +1)}\int _0^\infty (\phi _\eta '(x,\vartheta ))^2x^{2\vartheta +1}e^{-\alpha x^2} \,{\mathrm {d}}x<\infty \end{aligned}$$

for the polynomials \(\phi _\eta \) is sufficient to deduce

$$\begin{aligned} \mathrm {E}_\vartheta (\phi _\eta (X_{i\Delta },\vartheta )|X_{(i-1)\Delta })=e^{-\lambda _\eta \Delta }\phi _\eta (X_{(i-1)\Delta },\vartheta ) \end{aligned}$$

by Itô’s formula. Consequently, we may use the general theory on estimators based on eigenfunctions given in Kessler and Sørensen (1999). However, in our case we may calculate the involved quantities and obtain explicit results. For the first eigenfunction \(\phi _1(x,\vartheta )=1-\frac{\alpha x^2}{\vartheta +1}\) we consider the estimator based on the martingale estimating function

$$\begin{aligned} G_n(\vartheta )&=\sum _{i=1}^{n}(\phi _1(X_{i\Delta },\vartheta )-e^{-\lambda _1\Delta }\phi _1(X_{(i-1)\Delta },\vartheta ) )\\&= n(1-e^{-2\alpha \Delta })+\sum _{i=1}^n\left( e^{-2\alpha \Delta }\frac{\alpha X_{(i-1)\Delta }^2}{\vartheta +1}-\frac{\alpha X_{i\Delta }^2}{\vartheta +1}\right) . \end{aligned}$$

The unique solution of \(G_n({\widehat{\vartheta }}_n)=0\) is

$$\begin{aligned} {\widehat{\vartheta }}_n=\frac{\alpha \sum _{i=1}^{n} (X_{i\Delta }^{2}-X_{(i-1)\Delta }^{2} e^{-2\alpha \Delta })}{n(1-e^{-2\alpha \Delta })}-1. \end{aligned}$$
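In code the estimator is a one-liner. The following sketch (our own illustration) assumes that the sample is passed as the array \((X_0, X_\Delta ,\dots ,X_{n\Delta })\), i.e. including the starting value entering the first increment.

```python
import numpy as np

def theta_hat(x, delta, alpha):
    """Explicit estimator from the first eigenfunction; x = (X_0, X_Delta, ..., X_{n Delta})."""
    e = np.exp(-2 * alpha * delta)
    n = len(x) - 1
    return alpha * np.sum(x[1:] ** 2 - e * x[:-1] ** 2) / (n * (1 - e)) - 1.0
```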

Now, we may deduce consistency and asymptotic normality along the same lines as for general martingale estimating functions.

Theorem 3.1

For every true value \(\vartheta _0 \in \Theta \subset (-\frac{1}{2},\infty )\), we have

  (i) \({\widehat{\vartheta }}_n\rightarrow \vartheta _0\) in probability and

  (ii) \(\sqrt{n}({\widehat{\vartheta }}_n-\vartheta _0)\rightarrow N(0,\sigma ^2(\vartheta _0))\) in distribution

under \(P_{\vartheta _0}\) with \(\sigma ^2(\vartheta _0):=(\vartheta _0+1)\frac{1+e^{-2\alpha \Delta }}{1-e^{-2\alpha \Delta }}.\)

Proof

We define

$$\begin{aligned} g(x,y,\vartheta ):= 1-\frac{\alpha y^2}{\vartheta +1}-e^{-2\alpha \Delta }\left( 1-\frac{\alpha x^2}{\vartheta +1}\right) \end{aligned}$$

which is continuously differentiable with respect to \(\vartheta \). The absolute value of the derivative

$$\begin{aligned} \frac{\partial }{\partial \vartheta } g(x,y,\vartheta )= \frac{\alpha }{(\vartheta +1)^2}(y^2-e^{-2\alpha \Delta }x^2) \end{aligned}$$

is dominated by \(4\alpha (y^2+e^{-2\alpha \Delta }x^2)\), which is independent of \(\vartheta \) and square integrable with respect to \(Q_\Delta ^{\vartheta _0}\). Moreover, the symmetry in x and y of the density of \(Q_\Delta ^{\vartheta _0}\) implies

$$\begin{aligned} f(\vartheta _0)&:=\int _0^\infty \int _0^\infty \frac{\partial }{\partial \vartheta } g(x,y,\vartheta _0) Q_\Delta ^{\vartheta _0}(\,{\mathrm {d}}x,\,{\mathrm {d}}y)\\&=\underbrace{\frac{\alpha }{(\vartheta _0+1)^2}(1-e^{-2\alpha \Delta })}_{>0} \underbrace{\int _0^\infty \int _0^\infty x^2 Q_\Delta ^{\vartheta _0}(\,{\mathrm {d}}x,\,{\mathrm {d}}y)}_{>0}\not =0, \end{aligned}$$

which completes the proof of (i) and (ii) according to (Kessler and Sørensen 1999, Theorem 4.3).

Due to (Kessler and Sørensen 1999, Theorem 4.3), the asymptotic variance is given by \(\sigma ^2(\vartheta _0)=\frac{v(\vartheta _0)}{f^2(\vartheta _0)}\) with

$$\begin{aligned} v(\vartheta _0)&:=\int _0^\infty \int _0^\infty g^2(x,y,\vartheta _0) Q_\Delta ^{\vartheta _0}(\,{\mathrm {d}}x,\,{\mathrm {d}}y){\mathop {=}\limits ^{!}}\frac{1-e^{-4\alpha \Delta }}{\vartheta _0+1}. \end{aligned}$$

Because of the symmetry of \(Q_\Delta ^{\vartheta _0}\) and

$$\begin{aligned} g^2(x,y,\vartheta )&= (1-e^{-2\alpha \Delta })^2+\frac{\alpha ^2}{(\vartheta +1)^2}y^4+\frac{\alpha ^2 e^{-4\alpha \Delta }}{(\vartheta +1)^2}x^4-(1-e^{-2\alpha \Delta })\frac{2\alpha }{\vartheta +1}y^2\\&\quad +(1-e^{-2\alpha \Delta }) \frac{2\alpha e^{-2\alpha \Delta }}{\vartheta +1}x^2-\frac{2\alpha ^2 e^{-2\alpha \Delta }}{(\vartheta +1)^2}x^2y^2, \end{aligned}$$

we get

$$\begin{aligned} v(\vartheta _0)= & {} (1-e^{-2\alpha \Delta })^2\left( 1-\frac{2\alpha }{\vartheta _0+1}\int _0^\infty \int _0^\infty x^2 Q_\Delta ^{\vartheta _0}(\,{\mathrm {d}}x,\,{\mathrm {d}}y)\right) \\&\quad +\, \frac{\alpha ^2(1+e^{-4\alpha \Delta })}{(\vartheta _0+1)^2}\int _0^\infty \int _0^\infty x^4 Q_\Delta ^{\vartheta _0}(\,{\mathrm {d}}x,\,{\mathrm {d}}y)\\&\quad -\, \frac{2\alpha ^2 e^{-2\alpha \Delta }}{(\vartheta _0+1)^2}\int _0^\infty \int _0^\infty x^2y^2 Q_\Delta ^{\vartheta _0}(\,{\mathrm {d}}x,\,{\mathrm {d}}y). \end{aligned}$$

Furthermore, we can calculate

$$\begin{aligned} \int _0^\infty \int _0^\infty x^{2n}Q_\Delta ^{\vartheta _0}(\,{\mathrm {d}}x,\,{\mathrm {d}}y)&= \int _0^\infty \int _0^\infty x^{2n}\mu _{\vartheta _0} (x) p(x,y,\Delta )\,{\mathrm {d}}x\,{\mathrm {d}}y \\&=\int _0^\infty x^{2n}\mu _{\vartheta _0} (x)\,{\mathrm {d}}x=\frac{\Gamma (n+{\vartheta _0}+1)}{\alpha ^n\Gamma ({\vartheta _0}+1)}. \end{aligned}$$

By calculating \(\,{\mathrm {E}}\,(X^2_{i\Delta } \, \vert \, X_{(i-1)\Delta }=x)\) explicitly, we conclude

$$\begin{aligned} \int _{0}^\infty \int _{0}^\infty x^2y^2 Q_\Delta ^{\vartheta _0}(\,{\mathrm {d}}x,\,{\mathrm {d}}y)&= \int _{0}^\infty \int _{0}^\infty x^2y^2 \mu _{\vartheta _0}(x) p(x,y,\Delta )\,{\mathrm {d}}y \,{\mathrm {d}}x \\&= \int _{0}^\infty x^2 \,{\mathrm {E}}\,(X^2_{i\Delta } \, \vert \, X_{(i-1)\Delta }=x) \mu _{\vartheta _0}(x) \,{\mathrm {d}}x\\&= \int _{0}^{\infty } x^2\left( x^2e^{-2\alpha \Delta }-\frac{\vartheta _0+1}{\alpha }(e^{-2\alpha \Delta }-1)\right) \mu _{\vartheta _0}(x) \,{\mathrm {d}}x\\&= \frac{(\vartheta _0+1)^2}{\alpha ^2}+e^{-2\alpha \Delta }\frac{\vartheta _0+1}{\alpha ^2}. \end{aligned}$$

Applying these formulas we establish

$$\begin{aligned} \sigma ^2(\vartheta _0)&=\frac{v(\vartheta _0)}{f^2(\vartheta _0)}=(\vartheta _0+1)\frac{1+e^{-2\alpha \Delta }}{1-e^{-2\alpha \Delta }}. \end{aligned}$$

\(\square \)

Let us discuss the results. Looking at the asymptotic variance, we see that it decreases as \(\alpha \Delta \) increases. This seems surprising at first glance, since it implies that the asymptotic variance decreases when the distance between observations increases, while the mean-reverting parameter \(\alpha \) is kept fixed. Note that we have the observation scheme \(X_\Delta , \dots , X_{n\Delta }\); hence \(n\rightarrow \infty \) and \(\Delta \rightarrow 0\) such that \(n\Delta \rightarrow \infty \) would correspond to continuous observations. However, keep in mind that equidistant observations of the stationary version correspond to observations of the underlying Bessel process whose spacing grows exponentially, so the observation interval of the Bessel process grows quickly. This might capture the non-stationary behaviour of the original Bessel process. Furthermore, we see that the asymptotic variance tends to infinity as the mean-reverting parameter tends to zero.

Having a closer look at the estimator, we see that it only depends on the square of the observations, hence we could reformulate our problem and consider the squared process \(Y_t:=X_t^2\). Itô’s formula yields

$$\begin{aligned} \,{\mathrm {d}}Y_t= 2\sqrt{Y_t}\,{\mathrm {d}}B_t+(2\vartheta +2-2\alpha Y_t)\,{\mathrm {d}}t, \end{aligned}$$

an equation describing a Cox Ingersoll Ross process. We consider now the canonical linear martingale estimating function

$$\begin{aligned} {\widetilde{G}}_n(\vartheta )&:=\sum _{i=1}^n (Y_{i\Delta }- \mathrm {E}(Y_{i\Delta }|Y_{(i-1)\Delta }))\\&=\sum _{i=1}^n (Y_{i\Delta }- Y_{(i-1)\Delta }e^{-2\alpha \Delta }+\frac{\vartheta +1}{\alpha }(e^{-2\alpha \Delta }-1))\\&=-\frac{\vartheta +1}{\alpha }G_n(\vartheta ). \end{aligned}$$

For \(\vartheta >-\frac{1}{2}\) the unique solution of \({\widetilde{G}}_n({\widehat{\vartheta }}_n)=0\) is again

$$\begin{aligned} {\widehat{\vartheta }}_n=\frac{\alpha \sum _{i=1}^{n} (X_{i\Delta }^{2}-X_{(i-1)\Delta }^{2} e^{-2\alpha \Delta })}{n(1-e^{-2\alpha \Delta })}-1. \end{aligned}$$

Hence, we see that the two estimators coincide. In Theorem 3.1 we have already established the consistency and asymptotic normality of \({\widehat{\vartheta }}_n\).
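The Cox-Ingersoll-Ross representation also suggests a convenient way to generate synthetic observations, since the transition law of \(Y_t=X_t^2\) is a scaled noncentral chi-square distribution and can therefore be sampled exactly. The following sketch (our own illustration; the parameter values are arbitrary and the function theta_hat refers to the sketch above) simulates one sample path and applies the explicit estimator.

```python
import numpy as np

def simulate_squared_process(n, delta, theta, alpha, x0=1.0, rng=None):
    """Exact sampling of Y = X^2 via the noncentral chi-square CIR transition (own sketch)."""
    rng = rng or np.random.default_rng()
    e = np.exp(-2 * alpha * delta)
    c = (1 - e) / (2 * alpha)            # scale of the transition
    df = 2 * (theta + 1)                 # degrees of freedom
    y = np.empty(n + 1)
    y[0] = x0 ** 2
    for i in range(n):
        y[i + 1] = c * rng.noncentral_chisquare(df, y[i] * e / c)
    return np.sqrt(y)                    # back to X

x = simulate_squared_process(n=1000, delta=0.5, theta=3.0, alpha=1.0, rng=np.random.default_rng(1))
print(theta_hat(x, delta=0.5, alpha=1.0))  # should be close to 3 for large n
```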

The next step is to increase the flexibility of \({\widetilde{G}}_n\) by adding the weight \(g_{i-1}\) depending on the parameter of interest and the previous observation

$$\begin{aligned} \sum _{i=1}^{n} g_{i-1}(\vartheta ,X_{(i-1)\Delta }) \left( X_{i\Delta }^2-X_{(i-1)\Delta }^2e^{-2\alpha \Delta }+\frac{\vartheta +1}{\alpha }(e^{-2\alpha \Delta }-1)\right) , \end{aligned}$$

where \(g_{i-1}\) is \(\sigma (X_\Delta ,\dots , X_{(i-1)\Delta })\)-measurable and continuously differentiable in order to keep the martingale property. Using the same technique, we search for the optimal estimator with the smallest asymptotic variance. This second approach via linear martingale estimating functions for the squared process allows us to easily determine the optimal weight, cf. Heyde (1988) and Godambe and Heyde (1987). By Bibby and Sørensen (1995, Eq. (2.10)) the optimal weight is given by

$$\begin{aligned} g_{i-1}(\vartheta ,X_{(i-1)\Delta }):= \frac{\frac{\,{\mathrm {d}}}{\,{\mathrm {d}}\vartheta } \,{\mathrm {E}}\,(X_{i\Delta }^2 \, \vert \, X_{(i-1)\Delta })}{\varphi (X_{(i-1)\Delta },\vartheta )}=\frac{1}{\frac{\vartheta +1}{\alpha }(1-e^{-2\alpha \Delta })+2X_{(i-1)\Delta }^2e^{-2\alpha \Delta }}, \end{aligned}$$

where \(\varphi (X_{(i-1)\Delta },\vartheta )\) is the conditional variance of \(X_{i\Delta }^2\) given \(X_{(i-1)\Delta }\). Unfortunately, the equation defining the optimal estimator

$$\begin{aligned}&\sum _{i=1}^{n} \frac{1}{\frac{\vartheta +1}{\alpha }(1-e^{-2\alpha \Delta })+2X_{(i-1)\Delta }^2e^{-2\alpha \Delta }} \\&\quad \times \left( X_{i\Delta }^2-X_{(i-1)\Delta }^2e^{-2\alpha \Delta }+\frac{\vartheta +1}{\alpha }(e^{-2\alpha \Delta }-1)\right) =0 \end{aligned}$$

is not explicitly solvable with respect to \(\vartheta \). Nevertheless, we can determine the improvement in the asymptotic variance. Following the same lines as (Bibby and Sørensen 1995, Theorem 3.2), we have to establish the finiteness of

$$\begin{aligned} \,{\mathrm {E}}\,_{\mu _{\vartheta _0}}\left( g_{i-1}(\vartheta _0,X_{(i-1)\Delta })\frac{\,{\mathrm {d}}}{\,{\mathrm {d}}\vartheta } \,{\mathrm {E}}\,(X_{i\Delta }^2 \, \vert \, X_{(i-1)\Delta })\right)&=\,{\mathrm {E}}\,_{\mu _{\vartheta _0}} \left( \frac{1}{\vartheta _0+1+\frac{2\alpha e^{-2\alpha \Delta }}{1-e^{-2\alpha \Delta }} X_{(i-1)\Delta }^2}\right) \\&< \frac{1}{\vartheta _0+1}, \end{aligned}$$

the reciprocal of the asymptotic variance, i.e. the asymptotic information. Consequently, we can deduce that a lower bound for the optimal asymptotic variance is given by \(\vartheta _0 +1\).
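Although the optimal estimating equation admits no closed-form root, it can be solved numerically without difficulty. A sketch using a bracketed root search (our own illustration; it assumes that the estimating function changes sign on the chosen interval):

```python
import numpy as np
from scipy.optimize import brentq

def optimal_theta(x, delta, alpha, lo=-0.499, hi=50.0):
    """Root of the optimal estimating equation; x = (X_0, X_Delta, ..., X_{n Delta})."""
    e = np.exp(-2 * alpha * delta)
    prev, curr = x[:-1] ** 2, x[1:] ** 2

    def G(theta):
        w = 1.0 / ((theta + 1) / alpha * (1 - e) + 2 * prev * e)   # optimal weight g_{i-1}
        return np.sum(w * (curr - prev * e + (theta + 1) / alpha * (e - 1)))

    return brentq(G, lo, hi)
```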

Fig. 1 The asymptotic behavior for \(\alpha =1, x_0=0.1, \vartheta _0=3\)

Figure 1 shows the simulated asymptotic information of the optimal estimator (triangles) and of \({\widehat{\vartheta }}_n\) (dots), each based on 10,000 replications with \(n=1000\). The solid line corresponds to the asymptotic information of \({\widehat{\vartheta }}_n\) calculated in Theorem 3.1. The dotted line represents the bound computed above. As the lines nearly touch around \(\Delta =3\), the improvement achieved by the optimal estimator quickly tends to zero. From \(\Delta =1\) onwards the simulated asymptotic information is almost the same for both estimators. For smaller values the improvement is clearly visible, but we need not accept such a high variance, since we can choose \(\alpha \Delta \) such that the asymptotic variance is close to the lower bound.

We take a closer look at the asymptotic variance of \({\widehat{\vartheta }}_n\), which decreases monotonically in \(\alpha \Delta \):

$$\begin{aligned} \lim \limits _{\alpha \Delta \rightarrow \infty } (\vartheta _0+1)\frac{1+e^{-2\alpha \Delta }}{1-e^{-2\alpha \Delta }}=\vartheta _0+1. \end{aligned}$$

Due to the fast convergence to the lower bound \(\vartheta _0+1\), we can for practical purposes restrict ourselves to the estimator \({\widehat{\vartheta }}_n\) and hence have an explicit estimator.
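For practical purposes, Theorem 3.1 also yields approximate confidence intervals by plugging the estimate into the asymptotic variance; a minimal sketch (our own addition, not part of the paper):

```python
import numpy as np
from scipy.stats import norm

def confidence_interval(theta_est, n, delta, alpha, level=0.95):
    """Plug-in interval based on the asymptotic normality in Theorem 3.1."""
    e = np.exp(-2 * alpha * delta)
    sigma2 = (theta_est + 1) * (1 + e) / (1 - e)          # asymptotic variance
    half = norm.ppf(0.5 + level / 2) * np.sqrt(sigma2 / n)
    return theta_est - half, theta_est + half
```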

4 An extension to some polynomial diffusion processes

Next, we aim to extend the previously developed technique to a larger class of processes. We consider non-ergodic polynomial processes solving the stochastic differential equation

$$\begin{aligned} \left\{ \begin{array}{ll} \,{\mathrm {d}}Y_{t,p} &{}= Y_{t,p}^{\frac{p+1}{2}}\,{\mathrm {d}}B_t+\left( \vartheta +\frac{1}{2}\right) Y_{t,p}^p \,{\mathrm {d}}t,\\ Y_{0,p}&{}=x_0>0 \end{array} \right. \end{aligned}$$
(4.1)

for a Brownian motion B, the parameter of interest \(\vartheta \in \Theta \subset (-\frac{1}{2},\infty )\) and an additional parameter \(p<1\). Note that for \(p=-1\) we recover the Bessel process. We briefly analyze a martingale estimator based on the first eigenfunction using the same technique as before. Using the space-time transformation

$$\begin{aligned} X_{t,p}:= e^{-\alpha t}Y_{\frac{e^{(1-p)\alpha t}-1}{(1-p)\alpha },p} \end{aligned}$$

for some \(\alpha >0\), we obtain by Itô's formula an ergodic and stationary version

$$\begin{aligned} \left\{ \begin{array}{ll} \,{\mathrm {d}}X_{t,p} &{}= X_{t,p}^{\frac{p+1}{2}}\,{\mathrm {d}}B_t+\left[ \left( \vartheta +\frac{1}{2}\right) X_{t,p}^p-\alpha X_{t,p}\right] \,{\mathrm {d}}t,\\ X_{0,p}&{}=x_0>0. \end{array} \right. \end{aligned}$$
(4.2)

The corresponding generator can be stated as

$$\begin{aligned} L_{\vartheta ,p} f(x) =\frac{1}{2}x^{p+1}f''(x)+\left[ \left( \vartheta +\frac{1}{2}\right) x^{p}-\alpha x\right] f'(x). \end{aligned}$$

With a similar calculation as for \(\mu _\vartheta \), we obtain the invariant measure

$$\begin{aligned} \mu _{\vartheta ,p}(x) =\frac{1-p}{\Gamma \left( \frac{2\vartheta +2}{1-p}\right) } \left( \frac{2\alpha }{1-p}\right) ^{\frac{2\vartheta +2}{1-p}}x^{2\vartheta +1} e^{-\frac{2\alpha }{1-p}x^{1-p}} \end{aligned}$$

on \((0,\infty )\) with respect to the Lebesgue measure. After a brief calculation we get

$$\begin{aligned} \phi _{1,p}(x)=x^{1-p}-\frac{2\vartheta +1-p}{2\alpha } \end{aligned}$$

as the first eigenfunction of the generator \(L_{\vartheta ,p}\) with eigenvalue \(\lambda _{1,p}=(1-p)\alpha \). Let \(X_{\Delta ,p},\dots , X_{n\Delta ,p}\) be discrete observations of (4.2). We consider the estimator based on the martingale estimating function

$$\begin{aligned} G_{n,p}(\vartheta )&=\sum _{i=1}^{n}(\phi _{1,p}(X_{i\Delta ,p},\vartheta )-e^{-\lambda _{1,p}\Delta }\phi _{1,p}(X_{(i-1)\Delta ,p},\vartheta ) )\\&= \sum _{i=1}^n\left( X_{i\Delta ,p}^{1-p}-e^{-(1-p)\alpha \Delta } X_{(i-1)\Delta ,p}^{1-p}\right) -\frac{2\vartheta +1-p}{2\alpha } n(1-e^{-(1-p)\alpha \Delta }). \end{aligned}$$

The unique solution of \(G_{n,p}({\widehat{\vartheta }}_{n,p})=0\) is

$$\begin{aligned} {\widehat{\vartheta }}_{n,p}=\frac{\alpha \sum _{i=1}^{n} (X_{i\Delta ,p }^{1-p}-X_{(i-1)\Delta ,p}^{1-p} e^{-(1-p)\alpha \Delta })}{n(1-e^{-(1-p)\alpha \Delta })}-\frac{1-p}{2}. \end{aligned}$$
(4.3)
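The estimator (4.3) is again fully explicit; a direct transcription (our own sketch, with the observations passed including the starting value) reads as follows. For \(p=-1\) it reduces to the Bessel estimator of Sect. 3.

```python
import numpy as np

def theta_hat_poly(x, delta, alpha, p):
    """Estimator (4.3); x = (X_{0,p}, X_{Delta,p}, ..., X_{n Delta,p}), p < 1."""
    e = np.exp(-(1 - p) * alpha * delta)
    z = x ** (1 - p)
    n = len(x) - 1
    return alpha * np.sum(z[1:] - e * z[:-1]) / (n * (1 - e)) - (1 - p) / 2.0
```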

Next, we review how this estimator is related to a linear martingale estimating function. Application of Itô's formula yields

$$\begin{aligned} \,{\mathrm {d}}X_{t,p}^{1-p}&=\left[ (1-p)\left( \vartheta +\frac{1}{2}\right) +\frac{(1-p)(-p)}{2} -\alpha (1-p)X_{t,p}^{1-p}\right] \,{\mathrm {d}}t +(1-p)X_{t,p}^{\frac{1-p}{2}}\,{\mathrm {d}}B_t, \end{aligned}$$

hence we can determine the conditional mean \(f(t):=\,{\mathrm {E}}\,(X_{t,p}^{1-p}|X_{t_0,p})\) by solving the differential equation

$$\begin{aligned} \left\{ \begin{array}{ll} f'(t)&{}=(1-p)\left( \vartheta +\frac{1}{2}\right) +\frac{(1-p)(-p)}{2}-\alpha (1-p) f(t),\\ f(t_0)&{}=X_{t_0,p}^{1-p}. \end{array} \right. \end{aligned}$$

Thus, we receive the linear martingale estimating function

$$\begin{aligned} {\widetilde{G}}_{n,p}(\vartheta )&:=\sum _{i=1}^n (X_{i\Delta ,p}^{1-p}- \mathrm {E}(X_{i\Delta ,p}^{1-p}|X_{(i-1)\Delta ,p}))\\&=\sum _{i=1}^n \left( X_{i\Delta ,p}^{1-p}- X_{(i-1)\Delta ,p}^{1-p}e^{-(1-p)\alpha \Delta }+\frac{2\vartheta +1-p}{2\alpha }(e^{-(1-p)\alpha \Delta }-1)\right) \end{aligned}$$

and see that the unique solution of \({\widetilde{G}}_{n,p}({\widehat{\vartheta }}_{n,p})=0\) is again (4.3).

Theorem 4.1

For every true value \(\vartheta _0 \in \Theta \subset (-\frac{1}{2},\infty )\), we have

  (i) \({\widehat{\vartheta }}_{n,p}\rightarrow \vartheta _0\) in probability and

  (ii) \(\sqrt{n}({\widehat{\vartheta }}_{n,p}-\vartheta _0)\rightarrow N(0,\sigma ^2(\vartheta _0))\) in distribution

under \(P_{\vartheta _0}\) with \(\sigma ^2(\vartheta _0):=\frac{(1-p)( \vartheta _0+1)e^{-(1-p)\alpha \Delta }}{1-e^{-(1-p)\alpha \Delta }}+\frac{(2\vartheta _0+1-p)(1-p)}{4}.\)

Proof

Obviously, \(\sigma ^2(\vartheta _0)\in (0,\infty )\). According to (Bibby and Sørensen 1995, Theorem 3.2), the convergences (i) and (ii) hold if the equation

$$\begin{aligned} \sigma ^2(\vartheta _0)=\frac{v(\vartheta _0)}{f(\vartheta _0)^2} \end{aligned}$$

holds, where

$$\begin{aligned} f(\vartheta )&:= -\,{\mathrm {E}}\,_{\mu _{\vartheta ,p}}\left( \frac{\partial }{\partial \vartheta } \,{\mathrm {E}}\,(X_{\Delta ,p}^{1-p}\, \vert \, X_{0,p})\right) \\&= -\,{\mathrm {E}}\,_{\mu _{\vartheta ,p}}\left( -\frac{1}{\alpha }\left( e^{-(1-p)\alpha \Delta }-1\right) \right) \\&= \frac{e^{-(1-p)\alpha \Delta }-1}{\alpha },\\ v(\vartheta )&:= \,{\mathrm {E}}\,_{\mu _{\vartheta ,p}}(\varphi (X_{0,p},\vartheta )), \end{aligned}$$

and \(\varphi \) is the conditional variance of \(X_{\Delta ,p}^{1-p}\) given \(X_{0,p}\) determined by

$$\begin{aligned} \varphi (X_{0,p},\vartheta )&=\frac{(1-p)X_{0,p}^{1-p}}{\alpha }(e^{-(1-p)\alpha \Delta }- e^{-2(1-p)\alpha \Delta })\\&\quad +\, \frac{(2\vartheta +1-p)(1-p)}{4\alpha ^2} (1-e^{-(1-p)\alpha \Delta })^2. \end{aligned}$$

Note that this formula can also be derived via the solution of a differential equation. By establishing

$$\begin{aligned} \,{\mathrm {E}}\,_{\mu _{\vartheta }}(X_{0,p}^{1-p})&=\frac{1-p}{\Gamma \left( \frac{2\vartheta +2}{1-p}\right) } \left( \frac{2\alpha }{1-p}\right) ^{\frac{2\vartheta +2}{1-p}}\int _0^\infty x^{2\vartheta +2-p} e^{-\frac{2\alpha }{1-p}x^{1-p}} \,{\mathrm {d}}x\\&=\frac{1-p}{\Gamma \left( \frac{2\vartheta +2}{1-p}\right) } \left( \frac{2\alpha }{1-p}\right) ^{\frac{2\vartheta +2}{1-p}}\int _0^\infty \left( \frac{1-p}{2\alpha }y\right) ^{\frac{2\vartheta +2}{1-p}} e^{-y} \frac{\,{\mathrm {d}}y}{2\alpha }\\&=\frac{1-p}{2\alpha \Gamma \left( \frac{2\vartheta +2}{1-p}\right) } \Gamma \left( \frac{2\vartheta +2}{1-p}+1\right) \\&=\frac{1-p}{2\alpha }\frac{2\vartheta +2}{1-p}=\frac{\vartheta +1}{\alpha }, \end{aligned}$$

we conclude

$$\begin{aligned} v(\vartheta )&=\frac{(1-p)(\vartheta +1)}{\alpha ^2}e^{-(1-p)\alpha \Delta }(1-e^{-(1-p)\alpha \Delta })\\&\quad + \, \frac{(2\vartheta +1-p)(1-p)}{4\alpha ^2}(1-e^{-(1-p)\alpha \Delta })^2 \end{aligned}$$

and hence the equation \(\sigma ^2(\vartheta _0)=\frac{v(\vartheta _0)}{f(\vartheta _0)^2}\) is valid. \(\square \)

We want to increase the flexibility of \({\widetilde{G}}_{n,p}\) using the same scheme as for \({\widetilde{G}}_n={\widetilde{G}}_{n,-1}\). According to Heyde (1988) and Godambe and Heyde (1987), we once more obtain the optimal weight

$$\begin{aligned} g_{i-1,p}(\vartheta ,X_{(i-1)\Delta })&:= \frac{\frac{\,{\mathrm {d}}}{\,{\mathrm {d}}\vartheta } \,{\mathrm {E}}\,(X_{i\Delta ,p}^{1-p} \, \vert \, X_{(i-1)\Delta ,p})}{\varphi (X_{(i-1)\Delta ,p},\vartheta )}\\&=\frac{1}{\frac{(2\vartheta +1-p)(1-p)}{4\alpha }(1-e^{-(1-p)\alpha \Delta })+(1-p)X_{(i-1)\Delta ,p}^{1-p}e^{-(1-p)\alpha \Delta }} \end{aligned}$$

for the estimating function

$$\begin{aligned} \sum _{i=1}^{n} g_{i-1,p}(\vartheta ,X_{(i-1)\Delta ,p}) \left( X_{i\Delta ,p}^{1-p}- X_{(i-1)\Delta ,p}^{1-p}e^{-(1-p)\alpha \Delta }+\frac{2\vartheta +1-p}{2\alpha }(e^{-(1-p)\alpha \Delta }-1)\right) , \end{aligned}$$

cf. Bibby and Sørensen (1995, Eq. (2.10)). As before, we cannot explicitly derive the estimator as a solution of

$$\begin{aligned}&\sum _{i=1}^{n}\frac{1}{\frac{(2\vartheta +1-p)(1-p)}{4\alpha }(1-e^{-(1-p)\alpha \Delta })+(1-p)X_{(i-1)\Delta ,p}^{1-p}e^{-(1-p)\alpha \Delta }} \\&\quad \times \left( X_{i\Delta ,p}^{1-p}- X_{(i-1)\Delta ,p}^{1-p}e^{-(1-p)\alpha \Delta }+\frac{2\vartheta +1-p}{2\alpha }(e^{-(1-p)\alpha \Delta }-1)\right) =0, \end{aligned}$$

but we can analyze the improvement with respect to the estimator \({\widehat{\vartheta }}_{n,p}\). Following the same lines as (Bibby and Sørensen 1995, Theorem 3.2), we have to establish the finiteness of

$$\begin{aligned}&\,{\mathrm {E}}\,_{\mu _{\vartheta _0}}\left( g_{i-1,p}(\vartheta _0,X_{(i-1)\Delta ,p})\frac{\,{\mathrm {d}}}{\,{\mathrm {d}}\vartheta } \,{\mathrm {E}}\,(X_{i\Delta ,p}^{1-p} \, \vert \, X_{(i-1)\Delta ,p})\right) \\&\quad =\,{\mathrm {E}}\,_{\mu _{\vartheta _0}} \left( \frac{1}{\frac{(2\vartheta _0+1-p)(1-p)}{4}+\frac{(1-p)e^{-(1-p)\alpha \Delta }}{\alpha (1-e^{-(1-p)\alpha \Delta })}X_{(i-1)\Delta ,p}^{1-p}}\right) \\&\quad < \frac{4}{(2\vartheta _0+1-p)(1-p)}, \end{aligned}$$

the reciprocal of the asymptotic variance, to achieve consistency and asymptotic normality. Comparing this result to the limit

$$\begin{aligned} \lim \limits _{\alpha \Delta \rightarrow \infty }\sigma ^2(\vartheta _0)&=\lim \limits _{\alpha \Delta \rightarrow \infty }\frac{(1-p)( \vartheta _0+1)e^{-(1-p)\alpha \Delta }}{1-e^{-(1-p)\alpha \Delta }}+\frac{(2\vartheta _0+1-p)(1-p)}{4}\\&= \frac{(2\vartheta _0+1-p)(1-p)}{4}, \end{aligned}$$

we recognize fast convergence to the lower bound of the optimal estimator's asymptotic variance. This result, which resembles the case of the Bessel process, justifies the restriction to the explicit estimator \({\widehat{\vartheta }}_{n,p}\) from a practical point of view.

5 Estimator based on two eigenfunctions

Now, we turn back to the Bessel process and try to improve the asymptotic variance further by considering martingale estimating functions based on two eigenfunctions. This approach, however, suffers from the drawback that we no longer obtain explicit results for the estimators; we do obtain explicit results for the asymptotic variance, at least for weights depending only on the unknown parameter.

As in the previous sections we start with a class of martingale estimating functions with weight depending on the unknown parameter only. We consider

$$\begin{aligned} H_n(\vartheta ):=\sum _{i=1}^{n} \sum _{j=1}^{2} \beta _j(\vartheta )\left( \phi _j(X_{i\Delta },\vartheta )-e^{-\lambda _j(\vartheta )\Delta } \phi _j(X_{(i-1)\Delta },\vartheta )\right) , \end{aligned}$$

where \(\beta _1\) and \(\beta _2\) are continuously differentiable functions only depending on \(\vartheta \). Under suitable conditions on the interplay between the weights \(\beta _i\) and the eigenfunctions, we can easily achieve a consistent and asymptotically normal estimator.

Theorem 5.1

If for every \(\vartheta \in \Theta \)

$$\begin{aligned} f(\beta _1,\beta _2,\vartheta ):=\beta _1(\vartheta )\frac{1-e^{-2\alpha \Delta }}{\vartheta +1}+\beta _2(\vartheta )\frac{1-e^{-4\alpha \Delta }}{(\vartheta +1)(\vartheta +2)}\not =0 \end{aligned}$$

is satisfied, then there exists a solution of \(H_n({\widehat{\vartheta }}_{n,2})=0\) with probability tending to one as \(n\rightarrow \infty \) under \(P_{\vartheta _0}\). Furthermore, for every true value \(\vartheta _0 \in \Theta \subset (-\frac{1}{2},\infty )\) we have

  (i) \({\widehat{\vartheta }}_{n,2}\rightarrow \vartheta _0\) in probability and

  (ii) \(\sqrt{n}({\widehat{\vartheta }}_{n,2}-\vartheta _0)\rightarrow N\left( 0,\frac{v(\beta _1,\beta _2,\vartheta _0)}{f^2(\beta _1,\beta _2,\vartheta _0)}\right) \) in distribution

under \(P_{\vartheta _0}\) with

$$\begin{aligned} v(\beta _1,\beta _2,\vartheta _0):=\beta _1^2(\vartheta _0)\frac{1-e^{-4\alpha \Delta }}{\vartheta _0+1}+\beta _2^2(\vartheta _0)\frac{2-2e^{-8\alpha \Delta }}{(\vartheta _0+1)(\vartheta _0+2)}. \end{aligned}$$

Proof

Since by assumption \(f(\cdot , \cdot , \vartheta )\not = 0\) for every \(\vartheta \in \Theta \), we conclude \(\beta _1(\vartheta )\not =0\) or \(\beta _2(\vartheta )\not = 0\) and consequently \(v(\cdot , \cdot ,\vartheta )\not = 0 \) for every \(\vartheta \in \Theta \). Using again (Kessler and Sørensen 1999, Theorem 4.3), we only have to establish the formulas for f and v. In the calculations below we need the following straightforward properties:

  (a) \(Q_\Delta ^\vartheta \) is symmetric,

  (b) \(\int _0^\infty \phi _1(x,\vartheta )\phi _2(x,\vartheta )\mu _\vartheta (x)\,{\mathrm {d}}x=0\),

  (c) \(\int _0^\infty \phi _j(x,\vartheta )\mu _\vartheta (x)\,{\mathrm {d}}x=0\),

  (d) \(\int _0^\infty x^{2\eta } \mu _{\vartheta }(x)\,{\mathrm {d}}x=\frac{\Gamma (\eta +\vartheta +1)}{\alpha ^\eta \Gamma (\vartheta +1)}\) for \(\eta \in {\mathbb {N}}\).

Step 1 As in (Kessler and Sørensen 1999, Condition 4.2 (a)) we define f by

$$\begin{aligned} f(\beta _1,\beta _2,\vartheta ):=\sum _{i=1}^2 \int _0^\infty \int _0^\infty \frac{\partial }{\partial \vartheta } \beta _i(\vartheta )\left( \phi _i(x,\vartheta )-e^{-\lambda _i\Delta } \phi _i(y,\vartheta )\right) Q_\Delta ^\vartheta (\,{\mathrm {d}}x,\,{\mathrm {d}}y). \end{aligned}$$

The first step is to obtain the explicit expression given in Theorem 5.1. We can easily calculate the two summands

$$\begin{aligned}&\int _0^\infty \int _0^\infty \frac{\partial }{\partial \vartheta } \beta _1(\vartheta )\left( \phi _1(x,\vartheta )-e^{-2\alpha \Delta } \phi _1(y,\vartheta )\right) Q_\Delta ^\vartheta (\,{\mathrm {d}}x,\,{\mathrm {d}}y)\\&\quad {\mathop {=}\limits ^{\text {(a)}}}(1-e^{-2\alpha \Delta })\int _{0}^\infty \int _0^\infty \frac{\partial }{\partial \vartheta } \beta _1(\vartheta )\phi _1(x,\vartheta ) Q_\Delta ^\vartheta (\,{\mathrm {d}}x,\,{\mathrm {d}}y)\\&\quad {\mathop {=}\limits ^{\text {(c)}}}(1-e^{-2\alpha \Delta }) \beta _1(\vartheta ) \int _{0}^\infty \frac{\partial }{\partial \vartheta } \phi _1(x,\vartheta ) \mu _\vartheta (x)\,{\mathrm {d}}x\\&\quad =(1-e^{-2\alpha \Delta }) \beta _1(\vartheta ) \int _{0}^\infty \frac{\alpha x^2}{(\vartheta +1)^2} \mu _\vartheta (x)\,{\mathrm {d}}x\\&\quad {\mathop {=}\limits ^{\text {(d)}}}\beta _1(\vartheta )\frac{ 1-e^{-2\alpha \Delta }}{\vartheta +1} \end{aligned}$$

and similarly

$$\begin{aligned}&\int _0^\infty \int _0^\infty \frac{\partial }{\partial \vartheta } \beta _2(\vartheta )\left( \phi _2(x,\vartheta )-e^{-4\alpha \Delta } \phi _2(y,\vartheta )\right) Q_\Delta ^\vartheta (\,{\mathrm {d}}x,\,{\mathrm {d}}y)\\&\quad =\beta _2(\vartheta )\frac{1-e^{-4\alpha \Delta }}{(\vartheta +1)(\vartheta +2)}. \end{aligned}$$

Step 2 According to (Kessler and Sørensen 1999, Theorem 4.3), we obtain

$$\begin{aligned} v(\vartheta )=\sum _{i,j=1}^{2} \beta _i(\vartheta )\beta _j(\vartheta )\alpha _{ij}(\vartheta ) \end{aligned}$$

with

$$\begin{aligned} \alpha _{ij}:=\int _0^\infty \int _0^\infty \left( \phi _i(y,\vartheta )-e^{-\lambda _i\Delta }\phi _i(x,\vartheta )\right) \,\,\cdot \,\,\left( \phi _j(y, \vartheta )-e^{-\lambda _j\Delta }\phi _j(x,\vartheta )\right) Q_\Delta (\,{\mathrm {d}}x,\,{\mathrm {d}}y). \end{aligned}$$

In the following we explicitly compute these integrals, starting with \(\alpha _{11}\). If we take a look at the proof of Theorem 3.1, we recognize the already calculated value

$$\begin{aligned} \alpha _{11}=\int _0^\infty \int _0^\infty \left( 1-\frac{\alpha y^2}{\vartheta +1}-e^{-2\alpha \Delta }\left( 1-\frac{\alpha x^2}{\vartheta +1}\right) \right) ^2 Q_\Delta (\,{\mathrm {d}}x,\,{\mathrm {d}}y)=\frac{1-e^{-4\alpha \Delta }}{\vartheta +1}. \end{aligned}$$

For the next term \(\alpha _{12}\), it holds

$$\begin{aligned}&\int _0^\infty \int _0^\infty \left( \phi _1(y,\vartheta )-e^{-2\alpha \Delta }\phi _1(x,\vartheta )\right) \cdot \left( \phi _2(y,\vartheta )-e^{-4\alpha \Delta }\phi _2(x,\vartheta )\right) Q_\Delta (\,{\mathrm {d}}x,\,{\mathrm {d}}y)\\&\quad {\mathop {=}\limits ^{\text {(a)},\text {(b)}}}-(e^{-2\alpha \Delta }+e^{-4\alpha \Delta })\int _0^\infty \int _0^\infty \phi _1(y,\vartheta )\phi _2(x,\vartheta ) Q_\Delta (\,{\mathrm {d}}x, \,{\mathrm {d}}y)\\&\quad = -(e^{-2\alpha \Delta }+e^{-4\alpha \Delta })\int _0^\infty \int _0^\infty \left( 1- \frac{\alpha y^2}{\vartheta +1}\right) p(x,y,\Delta )\,{\mathrm {d}}y \phi _2(x,\vartheta ) \mu _\vartheta (x) \,{\mathrm {d}}x\\&\quad {\mathop {=}\limits ^{\text {(c)}}}(e^{-2\alpha \Delta }+e^{-4\alpha \Delta })\int _0^\infty \frac{\alpha }{\vartheta +1}\,{\mathrm {E}}\,_{\mu _{\vartheta }}(X_\Delta ^2 \, \vert \, X_0=x) \phi _2(x,\vartheta ) \mu _\vartheta (x) \,{\mathrm {d}}x\\&\quad =(e^{-2\alpha \Delta }+e^{-4\alpha \Delta })\int _0^\infty \left( \frac{\alpha }{\vartheta +1}x^2e^{-2\alpha \Delta }+1-e^{-2\alpha \Delta } \right) \phi _2(x,\vartheta ) \mu _\vartheta (x) \,{\mathrm {d}}x\\&\quad {\mathop {=}\limits ^{\text {(c)}}}\frac{\alpha (e^{-4\alpha \Delta }+e^{-6\alpha \Delta })}{\vartheta +1}\int _0^\infty x^2 \phi _2(x,\vartheta ) \mu _\vartheta (x) \,{\mathrm {d}}x\\&\quad =\frac{\alpha (e^{-4\alpha \Delta }+e^{-6\alpha \Delta })}{\vartheta +1}\int _0^\infty \left( x^2-\frac{2\alpha x^4}{\vartheta +1} +\frac{\alpha ^2x^6}{(\vartheta +1)(\vartheta +2)} \right) \mu _\vartheta (x) \,{\mathrm {d}}x\\&\quad {\mathop {=}\limits ^{\text {(d)}}}\frac{(e^{-4\alpha \Delta }+e^{-6\alpha \Delta })}{\vartheta +1}\left( \vartheta +1-2(\vartheta +2)+\vartheta +3\right) \\&\quad =0 \end{aligned}$$

and similarly we obtain for \(\alpha _{22}\)

$$\begin{aligned}&\int _0^\infty \int _0^\infty \left( \phi _2(y,\vartheta )-e^{-4\alpha \Delta }\phi _2(x,\vartheta )\right) ^2Q_\Delta (\,{\mathrm {d}}x,\,{\mathrm {d}}y)\\&\quad {\mathop {=}\limits ^{\text {(a)}}}(1+e^{-8\alpha \Delta })\int _0^\infty \phi _2^2(x,\vartheta )\mu _{\vartheta }(x)\,{\mathrm {d}}x \\&\qquad -\, 2e^{-4\alpha \Delta } \int _0^\infty \int _0^\infty \phi _2(x,\vartheta )\phi _2(y,\vartheta ) Q_\Delta (\,{\mathrm {d}}x, \,{\mathrm {d}}y)\\&\quad =\frac{2-2e^{-8\alpha \Delta }}{(\vartheta +1)(\vartheta +2)}. \end{aligned}$$

\(\square \)

Our aim is now to find weights \(\beta _i\) which lead to the smallest asymptotic variance as \(\alpha \Delta \rightarrow \infty \). Therefore, we define for fixed \(\vartheta \in \Theta \) the approximating functions

$$\begin{aligned}&{\widetilde{v}}(\beta _1,\beta _2):=\frac{\beta _1^2(\vartheta )}{\vartheta +1}+\frac{2\beta _2^2(\vartheta )}{(\vartheta +1)(\vartheta +2)},\\&{\widetilde{f}}(\beta _1,\beta _2):=\frac{\beta _1(\vartheta )}{\vartheta +1}+\frac{\beta _2(\vartheta )}{(\vartheta +1)(\vartheta +2)}, \end{aligned}$$

for which

$$\begin{aligned} \lim \limits _{\alpha \Delta \rightarrow \infty }\left| \frac{v(\vartheta )}{f^2(\vartheta )} - \frac{{\widetilde{v}}(\vartheta )}{{\widetilde{f}}^2(\vartheta )} \right| =0 \end{aligned}$$

holds. This property justifies the search for the global minimum of

$$\begin{aligned} (\beta _1,\beta _2)\mapsto \frac{{\widetilde{v}}(\beta _1,\beta _2)}{{\widetilde{f}}^2(\beta _1,\beta _2)}. \end{aligned}$$

To establish the minimum we first simplify the function

$$\begin{aligned} \frac{{\widetilde{v}}(\beta _1,\beta _2)}{{\widetilde{f}}^2(\beta _1,\beta _2)}&= \frac{\frac{\beta _1^2}{\vartheta +1}+\frac{2\beta _2^2}{(\vartheta +1)(\vartheta +2)} }{\left( \frac{\beta _1}{\vartheta +1}+\frac{\beta _2}{(\vartheta +1)(\vartheta +2)}\right) ^2}\\&=(\vartheta +1)(\vartheta +2)\frac{(\vartheta +2)\beta _1^2+2\beta _2^2 }{\left( (\vartheta +2)\beta _1+\beta _2\right) ^2} \end{aligned}$$

and determine the first derivatives

$$\begin{aligned} \frac{\,{\mathrm {d}}}{\,{\mathrm {d}}\beta _1}\frac{{\widetilde{v}}(\beta _1,\beta _2)}{{\widetilde{f}}^2(\beta _1,\beta _2)}&= 2(\vartheta +1)(\vartheta +2)^2\frac{\beta _1\beta _2-2\beta _2^2}{ \left( (\vartheta +2)\beta _1+\beta _2\right) ^3}, \\ \frac{\,{\mathrm {d}}}{\,{\mathrm {d}}\beta _2}\frac{{\widetilde{v}}(\beta _1,\beta _2)}{{\widetilde{f}}^2(\beta _1,\beta _2)}&= 2(\vartheta +1)(\vartheta +2)^2\frac{2\beta _1\beta _2-\beta _1^2 }{\left( (\vartheta +2)\beta _1+\beta _2\right) ^3}. \end{aligned}$$

Taking into account the properties of the \(\beta _i\) in Theorem 5.1, we obtain as critical points \(\beta _1=2\beta _2\not =0\) with value

$$\begin{aligned} \frac{{\widetilde{v}}(2\beta _2,\beta _2)}{{\widetilde{f}}^2(2\beta _2,\beta _2)}&=\frac{2(\vartheta +1)(\vartheta +2) }{2\vartheta +5}. \end{aligned}$$

In order to check whether these critical points are indeed minima, we consider \(\beta _1\not = 2\beta _2\) and see

$$\begin{aligned}&\frac{{\widetilde{v}}(\beta _1,\beta _2)}{{\widetilde{f}}^2(\beta _1,\beta _2)}-\frac{2(\vartheta +1)(\vartheta +2) }{2\vartheta +5}=(\vartheta +1)(\vartheta +2)^2\frac{(\beta _1-2\beta _2)^2}{(2\vartheta +5)\left( (\vartheta +2)\beta _1+\beta _2\right) ^2}>0. \end{aligned}$$

Hence, these critical points are global minima. Finally, we may specify the improvement of the asymptotic variance

$$\begin{aligned}&\vartheta +1-\frac{2(\vartheta +1)(\vartheta +2) }{2\vartheta +5} =(\vartheta +1)\frac{2\vartheta +5-2(\vartheta +2)}{2\vartheta +5} =\frac{\vartheta +1}{2\vartheta +5}>0 \end{aligned}$$

if we consider the asymptotic behaviour \(\alpha \Delta \rightarrow \infty \). Hence, we see that the relative improvement compared to \(\vartheta +1\), the bound of the asymptotic variance in the case of only one eigenfunction, is \(\frac{1}{2\vartheta +5}\) and decreases as \(\vartheta \) increases. However, for the boundary case \(\vartheta =-1/2\) we get an improvement of \(25\%\). For the case \(\vartheta =0\), which for a Dunkl process separates finite from infinite jump activity, we still get an improvement of \(20\%\).
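With the asymptotically optimal constant weights \(\beta _1=2\beta _2\), the estimating equation \(H_n(\vartheta )=0\) can be solved numerically. A sketch (our own illustration; it assumes a sign change of \(H_n\) on the search interval):

```python
import numpy as np
from scipy.optimize import brentq

def two_eigenfunction_theta(x, delta, alpha, lo=-0.499, hi=50.0):
    """Root of H_n with weights beta_1 = 2, beta_2 = 1; x = (X_0, X_Delta, ..., X_{n Delta})."""
    prev, curr = x[:-1], x[1:]
    e1, e2 = np.exp(-2 * alpha * delta), np.exp(-4 * alpha * delta)

    def phi1(z, th):
        return 1 - alpha * z ** 2 / (th + 1)

    def phi2(z, th):
        u = alpha * z ** 2
        return 1 - 2 * u / (th + 1) + u ** 2 / ((th + 1) * (th + 2))

    def H(th):
        return np.sum(2 * (phi1(curr, th) - e1 * phi1(prev, th))
                      + (phi2(curr, th) - e2 * phi2(prev, th)))

    return brentq(H, lo, hi)
```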

As a second step, we may consider weights which also depend on the observations. Note that although we may determine the optimal weights as solutions to a system of linear equations with coefficients depending on higher-order conditional moments, which is theoretically feasible, we cannot provide an explicit result for the optimal asymptotic variance. Hence, we are not able to quantify the improvement compared to the simpler weights above.

If we take into account weights \(a_j^\star \) that additionally depend on the previous observation, i.e. if we consider estimating functions

$$\begin{aligned} \sum _{i=1}^{n}\sum _{j=1}^{2} a^\star _j(X_{(i-1)\Delta },\vartheta )(\phi _j(X_{i\Delta },\vartheta )-e^{-\lambda _j\Delta }\phi _j(X_{(i-1)\Delta },\vartheta )), \end{aligned}$$

the optimal weights in the sense of Godambe and Heyde (1987) are given in (Kessler and Sørensen 1999, p. 305). The weights \(a_j^\star \) are specified by the equation

$$\begin{aligned} \left( \begin{array}{ll} a_{11} &{} a_{12}\\ a_{12} &{} a_{22} \end{array}\right) \left( \begin{array}{l} a_1^\star \\ a_2^\star \end{array}\right) = \left( \begin{array}{l} b_1\\ b_2 \end{array}\right) \end{aligned}$$

with

$$\begin{aligned} a_{ij}(x,\vartheta )&:=\int _{0}^{\infty }(\phi _i(y,\vartheta )-e^{-\lambda _i\Delta }\phi _i(x,\vartheta ))(\phi _j(y,\vartheta )-e^{-\lambda _j\Delta }\phi _j(x,\vartheta ))p_\vartheta (x,y,\Delta )\,{\mathrm {d}}y \end{aligned}$$

for \(1\le i \le j \le 2\) and

$$\begin{aligned} b_j(x,\vartheta ):= - \int _0^\infty \frac{\,{\mathrm {d}}}{\,{\mathrm {d}}\vartheta } (\phi _j(y,\vartheta )-e^{-\lambda _j\Delta }\phi _j(x,\vartheta ))p_\vartheta (x,y,\Delta )\,{\mathrm {d}}y \end{aligned}$$

for \(j=1,2\). Hence,

$$\begin{aligned} a_{11}(x,\vartheta )&= \frac{(1-e^{-2\alpha \Delta })^2}{\vartheta +1}+\frac{2\alpha x^2}{(\vartheta +1)^2}( e^{-2\alpha \Delta }-e^{-4\alpha \Delta }),\\ a_{12}(x,\vartheta )&=\frac{2\alpha ^2}{(\vartheta +1)^2}\varphi (x,\vartheta )- \frac{\alpha ^3}{(\vartheta +1)^2(\vartheta +2)}\\&\quad \times \, \left( \,{\mathrm {E}}\,(X_{i\Delta }^6\, \vert \, X_{(i-1)\Delta }=x)- \,{\mathrm {E}}\,(X_{i\Delta }^2\, \vert \, X_{(i-1)\Delta }=x)\,{\mathrm {E}}\,(X_{i\Delta }^4\, \vert \, X_{(i-1)\Delta }=x)\right) ,\\ a_{22}(x,\vartheta )&=\frac{\alpha ^4}{(\vartheta +1)^2(\vartheta +2)^2} \,{\mathrm {E}}\,\left( \left( X_{i\Delta }^4- \,{\mathrm {E}}\,(X_{i\Delta }^4\, \vert \, X_{(i-1)\Delta }=x)\right) ^2\, \vert \, X_{(i-1)\Delta }=x\right) \\&\quad +\, \frac{4\alpha ^2}{(\vartheta +1)^2}\varphi (x,\vartheta )-\frac{4\alpha ^3}{(\vartheta +1)^2(\vartheta +2)}\\&\quad \times \, \left( \,{\mathrm {E}}\,(X_{i\Delta }^6\, \vert \, X_{(i-1)\Delta }=x)- \,{\mathrm {E}}\,(X_{i\Delta }^2\, \vert \, X_{(i-1)\Delta }=x)\,{\mathrm {E}}\,(X_{i\Delta }^4\, \vert \, X_{(i-1)\Delta }=x)\right) \end{aligned}$$

and

$$\begin{aligned} b_1(x,\vartheta )&=- \frac{1-e^{-2\alpha \Delta }}{\vartheta +1},\\ b_2(x,\vartheta )&=\frac{2\alpha x^2(e^{-2\alpha \Delta }-e^{-4\alpha \Delta })}{(\vartheta +1)(\vartheta +2)}+\frac{2\vartheta +3}{(\vartheta +1)(\vartheta +2)}(1-e^{-2\alpha \Delta })^2-\frac{2}{\vartheta +1}(1-e^{-2\alpha \Delta }). \end{aligned}$$