Abstract
The uncertain hypothesis test is a statistical tool that uses uncertainty theory to decide, from observed data, whether a hypothesis is correct. As an application of the uncertain hypothesis test, this paper proposes a method to test whether an uncertain differential equation fits the observed data. Some numerical examples are provided to demonstrate the test method. Finally, both an uncertain currency model and a stochastic currency model are used to model US Dollar to Chinese Yuan (USD–CNY) exchange rates. The results show that the uncertain currency model fits the exchange rates well, but the stochastic currency model does not.
1 Introduction
Uncertainty theory, founded by Liu (2007) and refined by Liu (2009), has been successfully applied to many fields, such as science, engineering, finance and the environment. Among these applications, uncertain statistics was first introduced by Liu (2010) as a set of mathematical methods to collect, analyze and interpret data based on uncertainty theory. As a part of uncertain statistics, the uncertain hypothesis test is a statistical tool based on uncertainty theory to determine whether some hypotheses are correct or not according to observed data. This work was initiated by Ye and Liu (2022). Following that, the uncertain hypothesis test has been applied to other research areas of uncertain statistics, including uncertain regression analysis and uncertain time series analysis.
Uncertain regression analysis is used to explore the relationship between explanatory variables and response variables with uncertainty theory. Yao and Liu (2018) first proposed that the disturbance term of a regression model is not a random variable but an uncertain variable, which marks the beginning of uncertain regression analysis. On this basis, Lio and Liu (2018) estimated the uncertain disturbance term and obtained an interval estimate for predicting the response variables. In order to test whether an uncertain regression model is a good fit to observed data, Ye and Liu (2022) proposed a test method based on the uncertain hypothesis test. As applications of uncertain regression analysis and the uncertain hypothesis test, Liu (2021b) used the uncertain growth model to study the number of COVID-19 infections in China, and Ye (2022a) used the linear uncertain regression model to explore the relationship between labour income share and four influencing factors: trade openness, financial development, government intervention and industrial structure.
Uncertain time series analysis is used to predict future values from previously observed values based on uncertainty theory. Yang and Liu (2019) first proposed that the disturbance term of a time series model is an uncertain variable instead of a random variable, which marks the beginning of uncertain time series analysis. In order to test whether an uncertain time series model is a good fit to observed data, Ye (2022b) proposed a test method based on the uncertain hypothesis test. As applications of uncertain time series analysis and the uncertain hypothesis test, Ye and Yang (2021) studied the number of COVID-19 infections in China, Ye (2022c) discussed the birth rates in China, and Ye (2022d) investigated the grain yield in China.
As another part of uncertain statistics, the uncertain differential equation (Liu 2008) is used to model the time evolution of a dynamic system. When using an uncertain differential equation in practice, we first need to estimate its unknown parameters so that the equation fits the observed data as closely as possible. For that purpose, Yao and Liu (2020) first proposed the moment estimation based on the difference scheme of the uncertain differential equation. Following that, Yang et al. (2020) presented the minimum cover estimation, Sheng et al. (2020) investigated the least squares estimation, and Liu and Liu (2022) introduced the maximum likelihood estimation. However, these estimation methods rely on the difference scheme and are not suitable when the time intervals between observations are not short enough. In order to deal with this problem, Liu and Liu (2022) presented the concept of residual to establish a connection between an uncertain differential equation and observed data, and proposed a new method of parameter estimation based on residuals. Up to now, uncertain differential equations have been widely applied in many fields such as finance (Liu 2013), chemical reaction (Tang and Yang 2021), electric circuit (Liu 2021a), pharmacokinetics (Liu and Yang 2021), software reliability (Liu and Kang 2022), COVID-19 (Lio and Liu 2021), Alibaba stock (Liu and Liu 2022) and so on.
With the help of residuals, this paper employs the uncertain hypothesis test to determine whether an uncertain differential equation is a good fit to the observed data. The rest of the paper is organized as follows. Section 2 introduces some basic knowledge about the uncertain hypothesis test. Section 3 uses the uncertain hypothesis test to determine whether a set of observed data follows a given linear uncertainty distribution. On this basis, Sect. 4 provides a method to test whether an uncertain differential equation fits the observed data, and gives two numerical examples to illustrate the test method. In Sect. 5, the uncertain currency model is applied to USD–CNY exchange rates. Some conclusions are drawn in Sect. 6. Finally, as a comparison with the uncertain currency model in Sect. 5, the stochastic currency model is also applied to USD–CNY exchange rates in the appendix, and the results show that the uncertain currency model fits the exchange rates well, but the stochastic currency model does not.
2 Preliminaries
This section introduces some basic knowledge of the uncertain hypothesis test. Let \(\xi \) be an uncertain variable with uncertainty distribution \(\Phi _{\theta }\) where \(\theta \) is an unknown parameter. Consider the following hypotheses:
\[H_0: \theta =\theta _0 \quad \text{versus}\quad H_1: \theta \ne \theta _0\]
where \(H_0\) is called a null hypothesis, and \(H_1\) is called an alternative hypothesis. Assume
\[z_1,z_2,\ldots ,z_n\]
are a set of observed data of the uncertain variable \(\xi \). A rejection region for the null hypothesis \(H_0\) is a set \(W\subset \Re ^n\). If the vector of observed data belongs to the rejection region W, i.e.,
\[(z_1,z_2,\ldots ,z_n)\in W,\]
then we reject \(H_0\). Otherwise, we accept \(H_0\). A core problem is how to choose a suitable rejection region W for the given hypothesis \(H_0\).
Definition 1
(Ye and Liu 2022) Let \(\xi \) be a population with uncertainty distribution \(\Phi _{\theta }\) where \(\theta \) is an unknown parameter. A rejection region \(W\subset \Re ^n\) is said to be a test for the two-sided hypotheses \(H_0: \theta =\theta _0\) versus \(H_1: \theta \ne \theta _0\) at significance level \(\alpha \) if
(i) for any \((z_1,z_2,\ldots ,z_n)\in W\), there are at least \(\alpha \) of indexes i’s with \(1\le i\le n\) such that
(ii) for some \(\theta \ne \theta _0\) and some \((z_1,z_2,\ldots ,z_n)\in W\), there are more than \(1-\alpha \) of indexes i’s with \(1\le i\le n\) and at least \(\alpha \) of indexes j’s with \(1\le j\le n\) such that
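In order to test whether a normal uncertainty distribution \(\mathcal {N}(e_0,\sigma _0)\) is a good fit to a set of observed data \(z_1,z_2,\ldots ,z_n\), Ye and Liu (2022) proved by Definition 1 that a test at significance level \(\alpha \) is
\[W=\left\{ (z_1,z_2,\ldots ,z_n): \text{there are at least } \alpha \text{ of indexes } i\text{'s with } 1\le i\le n \text{ such that } z_i<\Phi _0^{-1}\left( \frac{\alpha }{2}\right) \text{ or } z_i>\Phi _0^{-1}\left( 1-\frac{\alpha }{2}\right) \right\} \]
where \(\Phi _0^{-1}\) is the inverse uncertainty distribution of \(\mathcal {N}(e_0,\sigma _0)\), i.e.,
\[\Phi _0^{-1}(\alpha )=e_0+\frac{\sigma _0\sqrt{3}}{\pi }\ln \frac{\alpha }{1-\alpha }.\]
If the vector of observed data belongs to the test W, i.e.,
\[(z_1,z_2,\ldots ,z_n)\in W,\]
then the normal uncertainty distribution \(\mathcal {N}(e_0,\sigma _0)\) is not a good fit to the observed data. Otherwise, the normal uncertainty distribution \(\mathcal {N}(e_0,\sigma _0)\) is a good fit to the observed data.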
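As a worked illustration (not part of the original derivation), the two bounds of this test can be made explicit at \(\alpha =0.05\). The calculation uses the standard inverse uncertainty distribution of a normal uncertain variable, \(\Phi _0^{-1}(\alpha )=e_0+(\sigma _0\sqrt{3}/\pi )\ln (\alpha /(1-\alpha ))\), and rounds to three decimals:
\[\begin{aligned}
\Phi _0^{-1}(0.025)&=e_0+\frac{\sigma _0\sqrt{3}}{\pi }\ln \frac{0.025}{0.975}\approx e_0-2.020\,\sigma _0,\\
\Phi _0^{-1}(0.975)&=e_0+\frac{\sigma _0\sqrt{3}}{\pi }\ln \frac{0.975}{0.025}\approx e_0+2.020\,\sigma _0.
\end{aligned}\]
At this level, an observation counts against \(\mathcal {N}(e_0,\sigma _0)\) only if it lies more than about \(2.02\sigma _0\) away from \(e_0\), and the distribution is rejected once at least \(\alpha n\) of the n observations do so.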
3 Uncertain hypothesis test for linear uncertainty distribution
In this section, we would like to use the uncertain hypothesis test to determine whether a set of observed data follow a given linear uncertainty distribution.
Theorem 1
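Let \(\xi \) be an uncertain variable that follows a linear uncertainty distribution \(\mathcal {L}(a,b)\) with unknown parameters a and b. Then the test for the hypotheses
\[H_0: a=a_0 \text{ and } b=b_0 \quad \text{versus}\quad H_1: a\ne a_0 \text{ or } b\ne b_0\]
at significance level \(\alpha \) is
\[W=\left\{ (z_1,z_2,\ldots ,z_n): \text{there are at least } \alpha \text{ of indexes } i\text{'s with } 1\le i\le n \text{ such that } z_i<\Phi _0^{-1}\left( \frac{\alpha }{2}\right) \text{ or } z_i>\Phi _0^{-1}\left( 1-\frac{\alpha }{2}\right) \right\} \]
where \(\Phi _0^{-1}\) is the inverse uncertainty distribution of \(\mathcal {L}(a_0,b_0)\), i.e.,
\[\Phi _0^{-1}(\alpha )=(1-\alpha )a_0+\alpha b_0.\]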
Proof
In order to prove that W is a test, we need to verify that W simultaneously meets Conditions (i) and (ii) in Definition 1. For any \((z_1,z_2,\ldots ,z_n)\in W\), it follows from the definition of W that there are at least \(\alpha \) of indexes i’s with \(1\le i\le n\) such that
i.e.,
Thus W meets Condition (i) in Definition 1.
In order to prove Condition (ii), we take
and
It is clear that \((z_1,z_2,\ldots ,z_n)\in W\). Let \(\Phi _1\) denote the uncertainty distribution of \(\mathcal {L}(a_1,b_1)\). On the one hand, we have
\(i=1,2,\ldots ,n\). Thus
On the other hand, we have
Thus
\(i,j=1,2,\ldots ,n\). Therefore W meets Condition (ii) in Definition 1. \(\square \)
Example 1
Let us use uncertain hypothesis test to determine whether the linear uncertainty distribution \(\mathcal {L}(0,1)\) fits the 30 observed data
plotted in Fig. 1.
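Suppose \(\Phi ^{-1}\) is the inverse uncertainty distribution of \(\mathcal {L}(0,1)\), i.e.,
\[\Phi ^{-1}(\alpha )=\alpha .\]
Given a significance level \(\alpha =0.05\), we obtain
\[\Phi ^{-1}\left( \frac{\alpha }{2}\right) =0.025,\quad \Phi ^{-1}\left( 1-\frac{\alpha }{2}\right) =0.975.\]
It follows from \(\alpha \times 30=1.5\) and Theorem 1 that the test is
\[W=\left\{ (z_1,z_2,\ldots ,z_{30}): \text{there are at least 2 of indexes } i\text{'s with } 1\le i\le 30 \text{ such that } z_i<0.025 \text{ or } z_i>0.975\right\} .\]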
Since only the 4th observed datum \(0.989\notin [0.025,0.975]\), the vector of observed data does not belong to W. Thus the linear uncertainty distribution \(\mathcal {L}(0,1)\) is a good fit to the observed data.
Example 2
Let us use uncertain hypothesis test to determine whether the linear uncertainty distribution \(\mathcal {L}(5,6)\) fits the 30 observed data
plotted in Fig. 2.
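Suppose \(\Phi ^{-1}\) is the inverse uncertainty distribution of \(\mathcal {L}(5,6)\), i.e.,
\[\Phi ^{-1}(\alpha )=5(1-\alpha )+6\alpha .\]
Given a significance level \(\alpha =0.05\), we obtain
\[\Phi ^{-1}\left( \frac{\alpha }{2}\right) =5.025,\quad \Phi ^{-1}\left( 1-\frac{\alpha }{2}\right) =5.975.\]
It follows from \(\alpha \times 30=1.5\) and Theorem 1 that the test is
\[W=\left\{ (z_1,z_2,\ldots ,z_{30}): \text{there are at least 2 of indexes } i\text{'s with } 1\le i\le 30 \text{ such that } z_i<5.025 \text{ or } z_i>5.975\right\} .\]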
Since the 6th and 26th observed data \(5.014,5.985\notin [5.025,5.975]\), the vector of observed data belongs to W. Thus the linear uncertainty distribution \(\mathcal {L}(5,6)\) is not a good fit to the observed data.
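The decision rule used in Examples 1 and 2 can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' implementation. The function names are hypothetical, and the two samples are made up except for the outlying values 0.989, 5.014 and 5.985 quoted in the examples.

```python
def linear_inverse(a, b, alpha):
    """Inverse uncertainty distribution of L(a, b): (1 - alpha) * a + alpha * b."""
    return (1 - alpha) * a + alpha * b


def in_rejection_region(data, a0, b0, alpha=0.05):
    """True if `data` lies in the test W, i.e. L(a0, b0) is rejected at level alpha."""
    lower = linear_inverse(a0, b0, alpha / 2)
    upper = linear_inverse(a0, b0, 1 - alpha / 2)
    outliers = sum(1 for z in data if z < lower or z > upper)
    # "At least alpha of the n indexes" is read as outliers >= alpha * n.
    return outliers >= alpha * len(data)


# Hypothetical samples: only 0.989, 5.014 and 5.985 are taken from the text;
# the remaining points are made-up values inside the acceptance interval.
sample_1 = [0.05 + 0.03 * k for k in range(29)] + [0.989]
sample_2 = [5.05 + 0.03 * k for k in range(28)] + [5.014, 5.985]

print(in_rejection_region(sample_1, 0, 1))  # one outlier against a threshold of 1.5
print(in_rejection_region(sample_2, 5, 6))  # two outliers against a threshold of 1.5
```

With 30 observations and \(\alpha =0.05\), the threshold \(\alpha \times 30=1.5\) means that a single outlier is tolerated while two outliers lead to rejection, which matches the conclusions of the two examples.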
4 Uncertain differential equation
In this section, we employ the uncertain hypothesis test to determine whether an uncertain differential equation fits the observed data well. Consider an uncertain differential equation
\[\mathrm{d}X_t=f(t,X_t)\mathrm{d}t+g(t,X_t)\mathrm{d}C_t \tag{1}\]
where f and g are known continuous functions, and \(C_t\) is a Liu process. Assume
\[x_{t_1},x_{t_2},\ldots ,x_{t_n} \tag{2}\]
are observed values of some uncertain process \(X_t\) at times \(t_1,t_2,\ldots ,t_n\) with \(t_1<t_2<\cdots <t_n\), respectively. For each index i (\(2\le i\le n\)), let \(X_{t_i}\) be the solution of the updated uncertain differential equation
\[\left\{ \begin{array}{l}\mathrm{d}X_t=f(t,X_t)\mathrm{d}t+g(t,X_t)\mathrm{d}C_t\\ X_{t_{i-1}}=x_{t_{i-1}}\end{array}\right. \tag{3}\]
at time \(t_i\). Denote the uncertainty distribution of \(X_{t_i}\) by \(\Phi _{t_i}\). It is clear that
\[\Phi _{t_i}(X_{t_i}) \tag{4}\]
is a linear uncertain variable \(\mathcal {L}(0,1)\) whose uncertainty distribution is
\[\Phi (x)=\left\{ \begin{array}{ll}0,&\text{if } x\le 0\\ x,&\text{if } 0<x\le 1\\ 1,&\text{if } x>1\end{array}\right. \]
and inverse uncertainty distribution is
\[\Phi ^{-1}(\alpha )=\alpha .\]
Then, Liu and Liu (2022) called
\[\varepsilon _i=\Phi _{t_i}(x_{t_i}) \tag{5}\]
the ith residual of the uncertain differential equation (1) corresponding to the observed data (2) by substituting \(X_{t_i}\) with the observed value \(x_{t_i}\) in (4). Thus the residual \(\varepsilon _i\) may be regarded as a sample of the linear uncertainty distribution \(\mathcal {L}(0,1)\).
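If the uncertain differential equation (1) does fit the observed data (2) well, then the \(n-1\) residuals \(\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _n\) should follow the linear uncertainty distribution \(\mathcal {L}(0,1)\), i.e.,
\[\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _n\sim \mathcal {L}(0,1).\]
Thus, to test whether the uncertain differential equation (1) fits the observed data (2) well, we should test whether the linear uncertainty distribution \(\mathcal {L}(a,b)\) with \(a=0\) and \(b=1\) fits the residuals \(\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _n\) defined in (5), i.e.,
\[H_0: a=0 \text{ and } b=1 \quad \text{versus}\quad H_1: a\ne 0 \text{ or } b\ne 1.\]
To do so, it follows from Theorem 1 that the test at a given significance level \(\alpha \) (e.g. 0.05) is
\[W=\left\{ (\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _n): \text{there are at least } \alpha \text{ of indexes } i\text{'s with } 2\le i\le n \text{ such that } \varepsilon _i<\frac{\alpha }{2} \text{ or } \varepsilon _i>1-\frac{\alpha }{2}\right\} .\]
If the vector of the \(n-1\) residuals \(\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _n\) belongs to the test W, i.e.,
\[(\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _n)\in W,\]
then the uncertain differential equation (1) is not a good fit to the observed data (2). If
\[(\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _n)\notin W,\]
then the uncertain differential equation (1) is a good fit to the observed data (2).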
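The whole procedure can be sketched for one simple case in which the residuals have a closed form. The sketch assumes the illustrative equation \(\mathrm{d}X_t=\mu \mathrm{d}t+\sigma \mathrm{d}C_t\), which is not one of the paper's examples. For this equation the solution of the updated equation at time \(t_i\) is a normal uncertain variable with expected value \(x_{t_{i-1}}+\mu (t_i-t_{i-1})\) and standard deviation \(\sigma (t_i-t_{i-1})\). All function names and data below are hypothetical.

```python
import math


def normal_uncertainty_cdf(x, e, s):
    """Uncertainty distribution of a normal uncertain variable N(e, s)."""
    z = math.pi * (e - x) / (math.sqrt(3) * s)
    if z > 700:  # avoid overflow in exp; the value is 0 to machine precision
        return 0.0
    return 1.0 / (1.0 + math.exp(z))


def residuals(times, values, mu, sigma):
    """Residuals eps_2, ..., eps_n of dX_t = mu dt + sigma dC_t (assumed equation)."""
    eps = []
    for i in range(1, len(times)):
        dt = times[i] - times[i - 1]
        expected = values[i - 1] + mu * dt  # restart from the previous observation
        eps.append(normal_uncertainty_cdf(values[i], expected, sigma * dt))
    return eps


def is_good_fit(eps, alpha=0.05):
    """True if the residual vector is NOT in the rejection region W."""
    outliers = sum(1 for r in eps if r < alpha / 2 or r > 1 - alpha / 2)
    return outliers < alpha * len(eps)


# Made-up observations fluctuating around the line 1 + 0.5 t.
times = [0.1 * k for k in range(11)]
values = [1.0 + 0.5 * t + 0.02 * (-1) ** k for k, t in enumerate(times)]

eps_ok = residuals(times, values, mu=0.5, sigma=1.0)   # drift matches the data
eps_bad = residuals(times, values, mu=5.0, sigma=1.0)  # drift far too large
print(is_good_fit(eps_ok), is_good_fit(eps_bad))
```

Each residual is computed from the previous observation only, so the test checks the one-step-ahead behaviour of the equation rather than a single long trajectory started at \(t_1\).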
Example 3
Let us employ the uncertain hypothesis test to determine whether the uncertain differential equation
fits the 30 observed data in Table 1 on the time horizon from 0 to 2.19.
In Table 1, denote the observed values of \(X_t\) at times \(t_1,t_2,\ldots ,t_{30}\) by
respectively. For each index i (\(2\le i\le 30\)), we solve the updated uncertain differential equation
and obtain the uncertainty distribution of \(X_{t_i}\) as follows,
It follows from (5) that the ith residual is
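See Fig. 3. In order to test whether the uncertain differential equation (6) fits the observed data well, we should test whether the linear uncertainty distribution \(\mathcal {L}(0,1)\) fits the 29 residuals \(\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _{30}\). Given a significance level \(\alpha =0.05\), it follows from \(\alpha \times 29=1.45\) and Theorem 1 that the test is
\[W=\left\{ (\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _{30}): \text{there are at least 2 of indexes } i\text{'s with } 2\le i\le 30 \text{ such that } \varepsilon _i<0.025 \text{ or } \varepsilon _i>0.975\right\} .\]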
Since only \(\varepsilon _{21}=0.983\notin [0.025,0.975]\), we have \((\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _{30})\notin W\). Thus the uncertain differential equation (6) is a good fit to the observed data.
Example 4
Let us employ the uncertain hypothesis test to determine whether the uncertain differential equation
fits the 30 observed data in Table 2 on the time horizon from 0 to 9.5.
In Table 2, denote the observed values of \(X_t\) at times \(t_1,t_2,\ldots ,t_{30}\) by
respectively. For each i (\(2\le i\le 30\)), we can calculate the residual \(\varepsilon _i\) of the updated uncertain differential equation
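with the help of an algorithm proposed by Liu and Liu (2022). See Fig. 4. In order to test whether the uncertain differential equation (7) fits the observed data well, we should test whether the linear uncertainty distribution \(\mathcal {L}(0,1)\) fits the 29 residuals
\[\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _{30}.\]
Given a significance level \(\alpha =0.05\), it follows from \(\alpha \times 29=1.45\) and Theorem 1 that the test is
\[W=\left\{ (\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _{30}): \text{there are at least 2 of indexes } i\text{'s with } 2\le i\le 30 \text{ such that } \varepsilon _i<0.025 \text{ or } \varepsilon _i>0.975\right\} .\]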
Since only \(\varepsilon _{10}=0.987\notin [0.025,0.975]\), we have \((\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _{30})\notin W\). Thus the uncertain differential equation (7) is a good fit to the observed data.
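When the updated equation has no analytic solution, a residual has to be computed numerically. The following sketch shows one generic way to do this and is not necessarily the algorithm of Liu and Liu (2022). It relies on the Yao–Chen formula, by which the inverse uncertainty distribution of the solution at level \(\alpha \) is the \(\alpha \)-path solving the ordinary differential equation \(\mathrm{d}X_t^{\alpha }=f(t,X_t^{\alpha })\mathrm{d}t+|g(t,X_t^{\alpha })|\,\frac{\sqrt{3}}{\pi }\ln \frac{\alpha }{1-\alpha }\,\mathrm{d}t\). The residual is then the \(\alpha \) whose path passes through the next observation. The Euler scheme, the bisection search, the step count and all names are assumptions made for illustration.

```python
import math


def alpha_path_end(f, g, t0, x0, t1, alpha, steps=200):
    """Euler approximation of the alpha-path at time t1, started from X_{t0} = x0."""
    c = math.sqrt(3) / math.pi * math.log(alpha / (1 - alpha))
    h = (t1 - t0) / steps
    t, x = t0, x0
    for _ in range(steps):
        x += (f(t, x) + abs(g(t, x)) * c) * h
        t += h
    return x


def residual(f, g, t0, x0, t1, x1, tol=1e-8):
    """Bisection for the alpha in (0, 1) whose alpha-path reaches x1 at time t1."""
    lo, hi = 1e-12, 1 - 1e-12
    while hi - lo > tol:
        mid = (lo + hi) / 2
        # The end point of the alpha-path is increasing in alpha.
        if alpha_path_end(f, g, t0, x0, t1, mid) < x1:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


# Check on a hypothetical equation dX_t = 0.5 dt + dC_t, for which the
# closed-form residual is available for comparison.
eps_numeric = residual(lambda t, x: 0.5, lambda t, x: 1.0, 0.0, 1.0, 0.1, 1.09)
eps_exact = 1 / (1 + math.exp(math.pi * (1.05 - 1.09) / (math.sqrt(3) * 0.1)))
print(eps_numeric, eps_exact)
```

For equations with state-dependent coefficients the Euler step introduces a discretization error, so the number of steps should be increased until the computed residuals stabilize.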
5 Uncertain currency model
Example 5
Table 3 shows US Dollar to Chinese Yuan (USD–CNY) exchange rates (weekly average) in Forex Capital Markets (FXCM) from October 2019 to June 2021, which are plotted in Fig. 5.
Let \(i=1,2,\ldots ,91\) represent the weeks from October 1, 2019 to June 30, 2021, and denote the exchange rates by
\[x_1,x_2,\ldots ,x_{91}.\]
Assume the exchange rate \(X_t\) follows the uncertain differential equation
where m, a and \(\sigma \) are unknown parameters to be estimated, and \(C_t\) is a Liu process. Then, Liu and Liu (2022) suggested that the moment estimate \((m, a,\sigma )\) is the solution of the system of equations,
where
\(i=2,3,\ldots ,91\) are residuals of the uncertain differential equation (8). Solving the above system of equations, we get
Thus we obtain an uncertain currency model
where \(X_t\) represents the exchange rate.
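Finally, let us test whether the uncertain currency model (9) fits USD–CNY exchange rates. That is, we should test whether the linear uncertainty distribution \(\mathcal {L}(0,1)\) fits the 90 residuals
\[\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _{91}.\]
See Fig. 6.
Given a significance level \(\alpha =0.05\), it follows from \(\alpha \times 90=4.5\) and Theorem 1 that the test is
\[W=\left\{ (\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _{91}): \text{there are at least 5 of indexes } i\text{'s with } 2\le i\le 91 \text{ such that } \varepsilon _i<0.025 \text{ or } \varepsilon _i>0.975\right\} .\]
Since all residuals \(\varepsilon _i\), \(i=2,3,\ldots ,91\) are between 0.025 and 0.975, we have
\[(\varepsilon _2,\varepsilon _3,\ldots ,\varepsilon _{91})\notin W.\]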
Thus the uncertain currency model (9) is a good fit to USD–CNY exchange rates.
6 Conclusion
To test whether an uncertain differential equation fits the observed data well, this paper applied the uncertain hypothesis test to the residuals of the equation: the equation is judged a good fit if the residuals follow the linear uncertainty distribution \(\mathcal {L}(0,1)\). Some numerical examples were given to demonstrate the test method. Finally, both the uncertain currency model and the stochastic currency model were applied to USD–CNY exchange rates. The results showed that the uncertain currency model fits the exchange rates well, but the stochastic currency model does not.
References
Lio, W., & Liu, B. (2018). Residual and confidence interval for uncertain regression model with imprecise observations. Journal of Intelligent & Fuzzy Systems, 35(2), 2573–2583.
Lio, W., & Liu, B. (2021). Initial value estimation of uncertain differential equations and zero-day of COVID-19 spread in China. Fuzzy Optimization and Decision Making, 20(2), 177–188.
Liu, B. (2007). Uncertainty Theory (2nd ed.). Springer.
Liu, B. (2008). Fuzzy process, hybrid process and uncertain process. Journal of Uncertain Systems, 2(1), 3–16.
Liu, B. (2009). Some research problems in uncertainty theory. Journal of Uncertain Systems, 3(1), 3–10.
Liu, B. (2010). Uncertainty theory: A branch of mathematics for modeling human uncertainty. Springer.
Liu, B. (2013). Toward uncertain finance theory. Journal of Uncertainty Analysis and Applications, 1, 1.
Liu, Y. (2021a). Uncertain circuit equation. Journal of Uncertain Systems, 14(3), 2150018.
Liu, Y., & Liu, B. (2022). Estimating unknown parameters in uncertain differential equation by maximum likelihood estimation. Soft Computing, 26(6), 2773–2780.
Liu, Y., & Liu, B. (2022). Residual analysis and parameter estimation of uncertain differential equation. Fuzzy Optimization and Decision Making. https://doi.org/10.1007/s10700-021-09379-4.
Liu, Z. (2021b). Uncertain growth model for the cumulative number of COVID-19 infections in China. Fuzzy Optimization and Decision Making, 20(2), 229–242.
Liu, Z., & Yang, Y. (2021). Uncertain pharmacokinetic model based on uncertain differential equation. Applied Mathematics and Computation, 404, 126118.
Liu, Z., & Kang, R. (2022). Software belief reliability growth model based on uncertain differential equation. IEEE Transactions on Reliability. https://doi.org/10.1109/TR.2022.3154770.
Sheng, Y., Yao, K., & Chen, X. (2020). Least squares estimation in uncertain differential equations. IEEE Transactions on Fuzzy Systems, 28(10), 2651–2655.
Tang, H., & Yang, X. (2021). Uncertain chemical reaction equation. Applied Mathematics and Computation, 411, 126479.
Yang, X., & Liu, B. (2019). Uncertain time series analysis with imprecise observations. Fuzzy Optimization and Decision Making, 18(3), 263–278.
Yang, X., Liu, Y., & Park, G. (2020). Parameter estimation of uncertain differential equation with application to financial market. Chaos, Solitons and Fractals, 139, 110026.
Yao, K., & Liu, B. (2018). Uncertain regression analysis: An approach for imprecise observations. Soft Computing, 22(17), 5579–5582.
Yao, K., & Liu, B. (2020). Parameter estimation in uncertain differential equations. Fuzzy Optimization and Decision Making, 19(1), 1–12.
Ye, T., & Liu, B. (2022). Uncertain hypothesis test with application to uncertain regression analysis. Fuzzy Optimization and Decision Making, 21, 157–174.
Ye, T., & Yang, X. (2021). Analysis and prediction of confirmed COVID-19 cases in China with uncertain time series. Fuzzy Optimization and Decision Making, 20(2), 209–228.
Ye, T., & Zheng, H. (2022). Analysis of birth rates in China with uncertain time series model. Technical report.
Ye, T. (2022a). Analysis of labour income share in China with uncertain regression model. Technical report.
Ye, T. (2022b). Difference order selection in uncertain time series analysis. Technical report.
Ye, T. (2022d). Modeling grain yield in China with uncertain time series analysis. Technical report.
Acknowledgements
This work was supported by the National Natural Science Foundation of China Grant Nos.61873329 and 12026225.
Appendix: Stochastic currency model
Let us reconsider USD–CNY exchange rates (weekly average) from October 1, 2019 to June 30, 2021 in Example 5. See Table 3. Let \(i=1,2,\ldots ,91\) represent the weeks from October 1, 2019 to June 30, 2021, and denote the exchange rates by
\[x_1,x_2,\ldots ,x_{91}.\]
Assume the exchange rate \(X_t\) follows the stochastic differential equation
where m, a and \(\sigma \) are unknown parameters to be estimated, and \(W_t\) is a Wiener process. For any fixed parameters \(m, a, \sigma \) and index i \((2\le i\le 91)\), we solve the updated stochastic differential equation
and find that \(X_i\) is a normal random variable with expected value
and variance
Thus the probability distribution of the normal random variable \(X_i\) is
and \(\Phi _i(X_i)\) is always a uniform random variable \(\mathcal {U}(0,1)\) since
\[\Pr \{\Phi _i(X_i)\le x\}=\Pr \{X_i\le \Phi _i^{-1}(x)\}=\Phi _i(\Phi _i^{-1}(x))=x\]
for any x with \(0<x<1\). Substitute \(X_i\) with the corresponding observed value \(x_i\), and write
\[\varepsilon _i(m,a,\sigma )=\Phi _i(x_i).\]
Then \(\varepsilon _i(m,a,\sigma )\) is always a sample of the uniform probability distribution \(\mathcal {U}(0,1)\) and is called the ith residual of the stochastic differential equation (10). For each positive integer k, the kth sample moment of the 90 residuals \(\varepsilon _2(m,a,\sigma ),\varepsilon _3(m,a,\sigma ),\ldots ,\varepsilon _{91}(m,a,\sigma )\) is
\[\frac{1}{90}\sum _{i=2}^{91}\varepsilon _i^k(m,a,\sigma )\]
and the kth population moment of the uniform probability distribution \(\mathcal {U}(0,1)\) is
\[\frac{1}{k+1}.\]
Since the number of unknown parameters in the stochastic differential equation (10) is 3, the moment estimate \((m,a,\sigma )\) is obtained by equating the first 3 sample moments to the corresponding first 3 population moments. In other words, the moment estimate \((m,a,\sigma )\) should solve the system of equations,
\[\frac{1}{90}\sum _{i=2}^{91}\varepsilon _i^k(m,a,\sigma )=\frac{1}{k+1},\quad k=1,2,3\]
whose root is
Thus we obtain a stochastic currency model
where \(X_t\) represents the exchange rate.
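The moment conditions used in this estimation can be checked numerically for any given residual vector. The sketch below only evaluates the gaps between the first three sample moments and the population moments \(1/(k+1)\) of \(\mathcal {U}(0,1)\). It does not solve for \((m,a,\sigma )\), since in the estimation the residuals themselves depend on those parameters. The function names and the evenly spaced residuals are hypothetical.

```python
def sample_moment(residuals, k):
    """k-th sample moment of a list of residuals."""
    return sum(r ** k for r in residuals) / len(residuals)


def moment_gaps(residuals, order=3):
    """Sample moments minus the U(0,1) population moments 1/(k+1), k = 1..order."""
    return [sample_moment(residuals, k) - 1.0 / (k + 1) for k in range(1, order + 1)]


# Hypothetical residuals: 90 evenly spaced points in (0, 1), for which the
# three gaps are nearly zero.
even_residuals = [(i - 0.5) / 90 for i in range(1, 91)]
# Hypothetical residuals concentrated near zero, for which the gaps are large.
skewed_residuals = [0.2 * r for r in even_residuals]

print(moment_gaps(even_residuals))
print(moment_gaps(skewed_residuals))
```

A moment estimate is a parameter vector at which all three gaps vanish, so a root finder would call a function of this kind repeatedly while varying \((m,a,\sigma )\).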
Does the stochastic currency model (11) fit USD–CNY exchange rates? In order to answer this question, let us consider the 90 residuals
See Fig. 7. If the stochastic currency model (11) fits USD–CNY exchange rates well, then the residuals in Fig. 7 should be from the same population in the sense of probability theory. However, when we divide those residuals into two parts, i.e., the first half and the last half, and use the two-sample Kolmogorov–Smirnov test to determine whether the residuals of these two parts are from the same population, we obtain a p-value of \(5\times 10^{-16}\) via the function “kstest2” in Matlab. This means the residuals neither come from the same population nor are white noise in the sense of probability theory, let alone follow the uniform probability distribution \(\mathcal {U}(0,1)\). Thus the stochastic currency model (11) does not fit USD–CNY exchange rates. Furthermore, we could not find a better stochastic differential equation to fit the USD–CNY exchange rates. This is the reason why we use an uncertain differential equation rather than a stochastic differential equation to model the currency exchange rate.
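The split-sample check described above can be sketched in plain Python. This is an illustration of the two-sample Kolmogorov–Smirnov test with the asymptotic Kolmogorov distribution, not a reproduction of Matlab's "kstest2", so the p-values need not agree digit for digit. The two halves below are made-up residuals with a clear level shift, not the residuals of Fig. 7.

```python
import math


def ks_statistic(x, y):
    """Largest gap between the empirical distribution functions of x and y."""
    xs, ys = sorted(x), sorted(y)
    i = j = 0
    d = 0.0
    while i < len(xs) and j < len(ys):
        v = min(xs[i], ys[j])
        while i < len(xs) and xs[i] <= v:
            i += 1
        while j < len(ys) and ys[j] <= v:
            j += 1
        d = max(d, abs(i / len(xs) - j / len(ys)))
    return d


def ks_pvalue(d, n, m, terms=100):
    """Asymptotic p-value 2 * sum_{k>=1} (-1)^(k-1) * exp(-2 k^2 lam^2)."""
    lam = math.sqrt(n * m / (n + m)) * d
    if lam < 0.2:  # the series converges slowly here and the p-value is close to 1
        return 1.0
    p = 2 * sum((-1) ** (k - 1) * math.exp(-2 * k * k * lam * lam)
                for k in range(1, terms + 1))
    return min(max(p, 0.0), 1.0)


# Hypothetical residuals: the first half sits low, the second half sits high.
first_half = [0.1 + 0.004 * k for k in range(45)]
second_half = [0.6 + 0.008 * k for k in range(45)]

d = ks_statistic(first_half, second_half)
p = ks_pvalue(d, len(first_half), len(second_half))
print(d, p)
```

A very small p-value indicates that the two halves are unlikely to share one distribution, which is the argument used here against the stochastic currency model.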
Ye, T., Liu, B. Uncertain hypothesis test for uncertain differential equations. Fuzzy Optim Decis Making 22, 195–211 (2023). https://doi.org/10.1007/s10700-022-09389-w
DOI: https://doi.org/10.1007/s10700-022-09389-w