Portfolio Selection Based on EMD Denoising with Correlation Coefficient Test Criterion

Su, Kuangxi; Yao, Yinhong; Zheng, Chengli; Xie, Wenzhao

doi:10.1007/s10614-022-10345-4

Portfolio Selection Based on EMD Denoising with Correlation Coefficient Test Criterion

Published: 27 November 2022

Volume 63, pages 391–421, (2024)
Cite this article

Download PDF

Computational Economics Aims and scope Submit manuscript

Portfolio Selection Based on EMD Denoising with Correlation Coefficient Test Criterion

Download PDF

Kuangxi Su¹,
Yinhong Yao²,
Chengli Zheng ORCID: orcid.org/0000-0001-9719-6262³ &
…
Wenzhao Xie⁴

1 Citation
Explore all metrics

Abstract

Noise is an important factor affecting portfolio performance, how to construct an effective denoising strategy is becoming increasingly important for investors. In this study, we theoretically explain the impact of noise on portfolio and argue the necessity of denoising. Next, the empirical mode decomposition (EMD) denoising strategy based on the correlation coefficient test criterion is proposed to improve portfolio performance. In detail, EMD is used to decompose the noisy price, then, a series of correlation coefficient tests are performed to determine which intrinsic mode functions (IMFs) are noise. In the empirical analysis, we apply the proposed method to denoise the SSE 50 index’s constituents, and further test the out-of-sample performance under the mean–variance framework. The empirical results show that the proposed denoising method outperforms four common EMD, Ensemble EMD (EEMD) and wavelet denoising methods in return-risk ratio. The proposed method is the optimal denoising strategy, which can help investors improve portfolio performance to the greatest extent.

Portfolio allocation with CEEMDAN denoising algorithm

Article 14 July 2023

Research on regularized mean–variance portfolio selection strategy with modified Roy safety-first principle

Article Open access 29 June 2016

Portfolio selection: shrinking the time-varying inverse conditional covariance matrix

Article 16 November 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Portfolio selection problem has been one of the core issues of the modern investment theory (Ao et al., 2019). How to construct an effective portfolio to improve the out-of-sample performance is the focus in academia and industry (Ma et al., 2019). In practice, an often ignored fact is that noise is an important factor affecting portfolio performance (Kondor et al., 2007; Dessaint et al., 2019; Peress and Schmidt, 2020). Some studies indicate that denoising can significantly improve investors’ returns (Aloui and Jammazi, 2015; Zhu et al., 2019, 2021). However, the previous common denoising methods, especially empirical mode decomposition (EMD) denoising, have some weaknesses in portfolio management, such as inadequate or excessive denoising (He et al., 2017; Helong et al., 2019). To address these weaknesses, an EMD denoising strategy based on the correlation coefficient test criterion is proposed to improve portfolio performance.

The existence of noise originates from that individual investors have no access to inside information, they do not follow buy and hold strategies, and tend to select stocks with strong past returns (Black, 1986; Odean, 1999). A result from this concentrated trading is that prices tend to deviate from their fundamental values (Odean, 1999). Black (1986) labels these deviations as "noise". One often ignored fact is that the time series in financial market are easily interfered by noise, which may mislead the model fitting (Kondor et al., 2007). As results, the portfolio models may provide inaccurate results, investors who make decisions based on biased results will inevitably suffer losses. To eliminate noise interference, some researchers try to introduce data decomposition methods, such as the popular wavelet decomposition, into portfolio management. For example, Aloui & Jammazi (2015), Zhu et al. (2019, 2021) propose different denoising methods to construct portfolio models based on the wavelet decomposition technique, their empirical results indicate that the profitability, Sharpe ratio, and model accuracy have been improved after filtering the noise from original data. Overall, there are limited theoretical and empirical studies to investigate portfolio performance from a denoising perspective.

Except for the wavelet decomposition, EMD also receives extensive attention (Huang et al., 1998). Compare to wavelet decomposition, EMD does not require any prior assumptions about signal modes or system orders, and can directly decompose original data into finite intrinsic mode functions (IMFs) and a trend item. To date, it has shown outstanding advantages in decomposing financial data (Zhu et al., 2017; Yang et al., 2019). In this study, we use EMD instead of wavelet decomposition to construct different denoising strategies.

The key to EMD denoising is how to select the decomposed IMFs. It is generally accepted that different IMFs represent different fluctuation levels (Huang et al., 1998), the high-frequency IMFs are disordered and display minimal regularity, which are mainly caused by a series of factors that have short-term effects, such as bad weather and strikes, etc. Flandrin et al. (2004) consider these high-frequency components as noise and argue that the main information is concentrated in the low-frequency IMFs. Thus, there must be a key index, the IMFs after IMF$_{index}$ are regarded as the dominant modes, and the formers are considered as noise. Numerous studies follow this framework to denoise different types of data in engineering and medical fields, etc (Boudraa and Cexus, 2007; Nguyen and Kim, 2016). However, these denoising methods may not be suitable for finance data since the optimal denoising strategy highly depends on the data characteristic, i.e., different types of data have different optimal denoising strategies (Li et al., 2016; Nguyen and Kim, 2016; Zhu et al., 2019, 2021). In practice, the approach might face many weaknesses, such as inadequate or excessive denoising.

Therefore, a new EMD denoising strategy based on the correlation coefficient test criterion is proposed to improve portfolio performance. In detail, we first theoretically prove that noise can cause the optimal portfolio weights and effective frontier to deviate from their true positions. Thus, it is necessary to eliminate noise. Next, we apply EMD to decompose original noisy price and perform a series of correlation coefficient tests to identify which IMFs are noise. If the tests accept the null hypothesis, the IMFs are considered as noise. Conversely, they are considered as non-noisy components. Finally, we sum the non-noisy components and residual to construct the denoised price.

In the empirical analysis, the daily closing prices of 3180 trading days ranging from October 8, 2007 to October 30, 2020 are collected to test portfolio performance. Four quantitative indicators including Sharpe ratio, Sortino ratio, upside potential ratio and tracking error ratio, are used to deeply summarize out-of-sample performance. The empirical results show that the proposed denoising method outperforms common EMD, Ensemble EMD (EEMD) and wavelet denoising methods under the mean–variance framework. Besides, the portfolio performance is examined in four different subsamples, including bull, bear markets and two special periods, i.e., the 2007–2008 financial crisis and coronavirus disease 2019 (COVID-19) pandemic in 2020. The results reconfirm the superiority of the proposed denoising method. The simulation study by setting different parameters validates the above conclusions. Overall, the proposed denoising method can minimize noise interference, and help investors improve portfolio performance to the greatest extent.

This paper contributes to portfolio management in the following two dimensions. First, we theoretically analyze the impact of noise on the portfolio, and prove that noise causes the optimal portfolio and effective frontier to deviate from their true positions. In this way, the theoretical basis of denoising is argued. Second, we point out the weaknesses of common denoising methods applied to portfolio management and construct an EMD denoising strategy based on the correlation coefficient test criterion, whose portfolio performance significantly outperforms other common denoising methods.

Figure 1 plots the framework of this paper. Section 2 theoretically analyzes the motivation of denoising. Section 3 introduces the proposed EMD denoising method based on the correlation coefficient test criterion. As a comparison, four common EMD denoising methods are also described. Section 4 compares the portfolio performance of different denoising methods under the mean–variance framework with different sample periods. Section 5 further evaluates the robustness of the proposed denoising method through simulated data. The last section concludes the paper.

2 Portfolio Theory Under Noisy Environment

In this section, we decompose the noisy price into non-noisy component and noise, and further construct the mean–variance model under the noisy environment. By comparing the portfolio under non-noisy environment, we explain the impact of noise on portfolio and argue the necessity of denoising.

2.1 The Noisy Portfolio Returns

Due to the asymmetry and incompleteness of information, the stock prices are generally noisy (Black, 1986; Odean 1999). Considering the price $ x_i(t)$ of stock $i\,\,(i=1,\ldots ,k)$ at time $t \,\,(t=1,\ldots ,T)$ is composed of non-noisy component $s_i(t)$ and noise $n_i(t)$.

$$\begin{aligned} x_i(t)= s_i(t) + n_i(t),\,\, t=1,\ldots ,T \end{aligned}$$

(1)

where the noise $n_i(t)$ and non-noisy component $s_i(t)$ are uncorrelated, i.e., $\text {cov}(s_i(t),n_i(t)) = 0$. Then, the return $r_i(t)$ for stock i can be calculated as

$$\begin{aligned} \begin{array}{ll} r_i(t)&{}=\displaystyle \frac{x_i(t)-x_i(t-1)}{x_i(t-1)} =\displaystyle \frac{s_i(t)-s_i(t-1)+n_i(t)-n_i(t-1)}{x_i(t-1)} \\ &{}=\displaystyle \frac{s_i(t)-s_i(t-1)}{s_i(t-1)}\frac{s_i(t-1)}{x_i(t-1)}+ \frac{n_i(t)-n_i(t-1)}{n_i(t-1)}\frac{n_i(t-1)}{x_i(t-1)} \\ &{}=r_{i,s}(t)\displaystyle \frac{s_{i}(t-1)}{x_i(t-1)}+r_{i,n}(t)\frac{x_i(t-1)-s_i(t-1)}{x_i(t-1)} \\ &{}=\alpha _i(t-1) r_{i,s}(t)+(1-\alpha _i(t-1)) r_{i,n}(t) \end{array} \end{aligned}$$

(2)

where $r_{i,s}(t)=(s_i(t)-s_i(t-1))/{s_i(t-1)}$ is the return of non-noisy component. Similarly, $r_{i,n}(t)=(n_i(t)-n_i(t-1))/{n_i(t-1)}$ is the return for noise. $\alpha _{i}(t-1)=s_{i}(t-1)/{x_{i}(t-1)}$ denotes the share of non-noisy component in x(t). For reading convenience, the variables $r_i(t), r_{i,s}(t), r_{i,n}(t), x_i(t),s _i(t), n_i(t)$ and $\alpha _i(t-1)$ are denoted by $r_{i},r_{i, s},r_{i, n},x_{i},s_{i},n_{i}$ and $\alpha _{i}$, respectively. Furthermore, the noisy returns ${\varvec{r}}=(r_1,\ldots ,r_k)^{\tau }$ can be expressed as

$$\begin{aligned} \begin{array}{ll} {\varvec{r}} &{}={\varvec{\alpha \odot r_{s}+(1-\alpha ) \odot r_{n} }}\\ &{}={\varvec{R_{s}+R_{n}}} \end{array}\end{aligned}$$

(3)

where $(r_1,\ldots , r_k)^{\tau }$ and ${\varvec{\odot }}$ denote the transposition of ${\varvec{(}}r_1,\ldots ,r_k)$ and Hadamard product (Johnson, 1990). ${\varvec{r_s}}=(r_{1,s}, \ldots , r_{k,s})^{\tau }$ and ${\varvec{r_n}}=(r_{1,n},\ldots ,r_{k,n})^{\tau }$ present the noisy and non-noisy returns, their shares in the noisy returns are ${\varvec{\alpha }}=(\alpha _1, \ldots , \alpha _k)^{\tau }$ and ${\mathbf {1}}-\varvec{\alpha }=(1-\alpha _1, \ldots , 1-\alpha _k)^{\tau }$, respectively. Besides, we let ${\varvec{R_{s}}}=(R_{1,s}, \ldots , R_{k,s})^{\tau }$ and ${\varvec{R_{n}}}=(R_{1,n}, \ldots , R_{k,n})^{\tau }$ denote ${\varvec{\alpha \odot r_s}}$ and ${\varvec{(1-\alpha )\odot r_n}}$, where $ R_{i, s}=\alpha _{i} \odot r_{i, s}=r_{i, s}s_{i}/{x_{i}} $ and $R_{i,n}=(1-\alpha _i) \odot r_{i,n}=r_{i,n}n_i/x_i$.

Since the price $x_{i}$ is generally bounded, i.e., $M_1\le x_{i}\le M_2$, where $M_1$ and $M_2$ are constants. Besides, it is deduced that $\text {cov}(r_{i,s},r_{i,n})= \text {cov}(s_ir_{i,s}, n_ir_{i,n})=0$ based on $\text {cov}(s_i,n_i)=0$. Finally, the covariance $\text {cov}(R_{i,s}, R_{i,n})$ follows the inequality if considering $1/x_{i}$ as a coefficient term.

$$\begin{aligned} 0= \frac{1}{M_2^2}\text {cov}(s_{i} r_{i, s}, n_{i} r_{i, n}) \le \text {cov}(R_{i, s}, R_{i, n})=\text {cov}\left( \frac{s_{i}}{x_{i}} r_{i, s}, \frac{n_{i}}{x_{i}} r_{i, n}\right) \! \le \!\frac{1}{M_1^2}\text {cov}(s_{i} r_{i, s}, n_{i} r_{i, n})=0 \end{aligned}$$

(4)

Equation 4 shows that $cov(R_{i,s},R_{i,n})=0$, which means that the return $r_i$ are mainly composed of non-noisy component $R_{i,s}$ and noise $R_{i,n}$. Besides, we can deduce that $\text {cov}(R_{i,s},R_{j,n})=0,\ i\ne j$. In this way, the portfolio return $r_{p}$ is

$$\begin{aligned} r_{p}={\varvec{w^{\tau }r}}={\varvec{w^{\tau }(R_s+R_n)}} \end{aligned}$$

(5)

where ${{\varvec{w}}}=(w_i,\ldots ,w_k)^{\tau }$ are the portfolio weights, and $\text {cov}{\varvec{(R_s,R_n)=0}}$. Furthermore, we can obtain that the expectation and variance of the portfolio return $r_p$ are

$$\begin{aligned} \begin{array}{rl} {\mathbb {E}}(r_{p}) &{}={\varvec{w^{\tau }(\mu _{s}+\mu _{n})}} \\ var(r_{p}) &{}={\varvec{w^{\tau }\Sigma _{s}w}}+{\varvec{w^{\tau }\Sigma _nw}} \end{array}\end{aligned}$$

(6)

where ${\varvec{\mu _s}}$ and ${\varvec{\mu _n}}$ denote the expectations of non-noisy component ${\varvec{R}}_{s}$ and noise ${\varvec{R}}_{n}$. Similarly, ${\varvec{\Sigma _s}}$ and ${\varvec{\Sigma _n}}$ denote the covariance matrices of ${\varvec{R_s}}$ and ${\varvec{R_n}}$, respectively.

2.2 Mean–Variance Model Under Noisy Environment

Following Markowitz’s portfolio optimization framework (Markowitz 1952). The classical mean–variance portfolio model, which aims at minimizing portfolio variance under the given expected return ${\mathbb {E}} (r_p) = \mu _0$, can be expressed as

$$\begin{aligned} \begin{array}{ll} {\varvec{w}}(\mu _0)=\text {argmin} &{}{\varvec{w^{\tau } { \Sigma _s} w+w^{\tau } {\Sigma _n} w}}\\ \qquad \quad \quad \text {s.t.}&{} {\varvec{w^{\tau }(\mu _s+\mu _n)}} = \mu _0 \end{array} \end{aligned}$$

(7)

For calculation convenience, we consider an investor’s wealth might be partially allocated to the risk-free security and short sales are allowed, the restriction ${\varvec{w^{\tau }1}}=1$ is not included in Eq. (7). By using the Lagrange multiplier algorithm, the optimal solution can be obtained by solving $\mathop {min}\limits _{({{{\varvec{w}}},\lambda })} L({{\varvec{w}}},\lambda )$,

$$\begin{aligned} L({{\varvec{w}}},\lambda )={\varvec{ w^{\tau } {\Sigma _s} w+w^{\tau } {\Sigma _n} w}}-\lambda \left[ {\varvec{ w^{\tau }(\mu _s+\mu _n)}}- \mu _0\right] \end{aligned}$$

(8)

where ${\varvec{w}}$ is the optimal solution of Eq. (7) when the Lagrange function $L({{\varvec{w}}},\lambda )$ satisfies

$$\begin{aligned} \left\{ \begin{array}{l} \displaystyle \frac{{\partial L}}{{\partial {{\varvec{w}}}}} =2{\varvec{(\Sigma _s+\Sigma _n)w}}-\lambda {\varvec{(\mu _s+\mu _n)=0}}\\[3mm] \displaystyle \frac{{\partial L}}{{\partial \lambda }} ={\varvec{ w^{\tau }{\varvec{(\mu _s+\mu _n)}}}}- \mu _0 = 0 \end{array} \right. \end{aligned}$$

(9)

Then under the noisy environment, the optimal mean–variance portfolio weight vector ${\varvec{ w_{noise}^*}}$ is computed as

$$\begin{aligned} {\varvec{w_\mathrm{{noise}}^{*}}}=\mu _0{\varvec{\frac{(\Sigma _s+\Sigma _n)^{-1} (\mu _s+\mu _n)}{(\mu _s+\mu _n)^{\tau } (\Sigma _s+\Sigma _n)^{-1} (\mu _s+\mu _n)}}} \end{aligned}$$

(10)

Similarly, the optimal portfolio weight vector ${\varvec{ w_\mathrm{{nonnoise}}^*}}$ under the noise-free environment is calculated as follows:

$$\begin{aligned} {\varvec{ w_\mathrm{{nonnoise}}^{*}}}=\mu _0{\varvec{\frac{(\Sigma _s)^{-1} \mu _s}{\mu _s^{\tau } (\Sigma _s)^{-1} \mu _s}}} \end{aligned}$$

(11)

Equations (10), (11) show that noise affects portfolio weight not only through the covariance matrix but also through the expected return, which confirms the fact that noise is an important factor affecting portfolio performance. In practice, what investors need is the portfolio weight ${\varvec{ w_\mathrm{{nonnoise}}^*}}$ under non-noisy environment, however, due to the existence of noise, the actual portfolio weight they obtain is ${\varvec{ w_\mathrm{{noise}}^*}}$. As a result, it is difficult for investors to construct an effective diversification, therefore, it is necessary to use some appropriate denoising strategies to suppress the noise interference.

When focusing on noise, a common assumption in practice is that the mean of noise is 0, i.e., ${\varvec{\mu _n=0}}$ (Donoho and Johnstone, 1994). In this case, the optimal portfolio weight ${\varvec{w_{noise}^{\dag }}}$ under noisy environment is

$$\begin{aligned} {\varvec{w_{\mathrm {noise}}^{\dag }}}=\mu _0{\varvec{\frac{(\Sigma _s+\Sigma _n)^{-1} \mu _s}{\mu _s^{\tau } (\Sigma _s+\Sigma _n)^{-1} \mu _s}}} \end{aligned}$$

(12)

It is clear that noise affects portfolio performance only through the covariance matrix, which confirms the validity of previous studies to filter the covariance matrix (Daly et al., 2008; Tian and Zhao, 2020). However, when the assumption ${\varvec{\mu _n=0}}$ is not satisfied, only filtering the covariance matrix is not sufficient.

2.3 Mean–Variance Effective Frontier

When analyzing the interference of noise on portfolio variance, since the mean of returns is close to 0 in practice, we can consider a simple scenario, i.e., the assumption ${\varvec{\mu _n=0}}$ is satisfied. In this way, we bring Eq. (12) into Eq. (6), then, the portfolio variance under noisy environment is calculated as

$$\begin{aligned} \begin{array}{ll} \sigma ^2_{\mathrm {noise}}={\varvec{(w_{\mathrm {noise}}^{\dag })^{\tau }(\Sigma _s+ \Sigma _n) w_{\mathrm {noise}}^{\dag }}} =\displaystyle \frac{\mu _0^2}{{\varvec{\mu _s^{\tau } (\Sigma _s+\Sigma _n)^{-1} \mu _s}}} \end{array}\end{aligned}$$

(13)

If taking the portfolio variance $\sigma ^2_{\mathrm {noise}}$ and expected return $\mu _0$ as the axis, the shape of mean–variance effective frontier is a parabola that opens to the right and passes through the origin point. The reason for this result is that we impose certain constraints on the mean–variance model, such as ${\varvec{\mu _n=0}}$, etc. Similarly, the portfolio variance under the non-noisy environment is computed as

$$\begin{aligned} \begin{array}{ll} \sigma ^2_\mathrm{{nonnoise}}={\varvec{(w_\mathrm{{nonnoise}}^{*})^{\tau }\Sigma _s wv^{*}}}= \displaystyle \frac{\mu _0^2}{{\varvec{\mu _s^{\tau } \Sigma _s^{-1} \mu _s}}} \end{array}\end{aligned}$$

(14)

Equation (14) shows that noise causes the portfolio variance to deviate from the true position, which is consistent with the results of optimal portfolio weights. Besides, when comparing the portfolio variance under noisy and non-noisy environments, the magnitude between them can be obtained from the following equation.

$$\begin{aligned} \begin{array}{ll} \displaystyle \frac{\mu _0^2}{\sigma ^2_{\mathrm {noise}}}-\frac{\mu _0^2}{\sigma ^2_\mathrm{{nonnoise}}}&{}={\varvec{\mu _s^{\tau } (\Sigma _s+\Sigma _n)^{-1} \mu _s}}-{\varvec{\mu _s^{\tau } \Sigma _s^{-1} \mu _s}}\\ [-1mm] &{}=|{\varvec{\mu _s^{\tau } (\Sigma _s+\Sigma _n)^{-1} \mu _s}}|-{\varvec{|\mu _s^{\tau } \Sigma _s^{-1} \mu _s|}}\\ &{}={\varvec{| (\Sigma _s+\Sigma _n)^{-1}|\cdot | \mu _s^{\tau }\mu _s|}}-{\varvec{|\Sigma _s^{-1}|\cdot |\mu _s^{\tau } \mu _s|}}\\ &{}={\varvec{[\,| (\Sigma _s+\Sigma _n)^{-1}|- |\Sigma _s^{-1}|\,]\cdot |\mu _s^{\tau } \mu _s|}} \end{array}\end{aligned}$$

(15)

where ${\varvec{|\mu _s^{\tau } \mu _s|}}\ge 0$, the matrices ${\varvec{\Sigma _s}}$, ${\varvec{\Sigma _n}}$ and ${\varvec{\Sigma _s+\Sigma _n}}$ are positive definite. Based on the knowledge of higher algebra, the inverse matrices ${\varvec{\Sigma _s^{-1}}}$, ${\varvec{\Sigma _n^{-1}}}$ and ${\varvec{(\Sigma _s+\Sigma _n)^{-1}}}$ are also positive definite. Besides, it can be deduced that ${\varvec{|\Sigma _s+\Sigma _n|\ge |\Sigma _s|}}$,^{Footnote 1} and ${\varvec{|(\Sigma _s+\Sigma _n)^{-1}|\le |\Sigma _s^{-1}|}}$,^{Footnote 2} In this way, we can obtain the following inequality.

$$\begin{aligned} \begin{array}{ll} \displaystyle \frac{\mu _0^2}{\sigma ^2_{\mathrm {noise}}}\le \frac{\mu _0^2}{\sigma ^2_\mathrm{{nonnoise}}}\Longleftrightarrow \sigma ^2_{\mathrm {noise}}\ge \sigma ^2_\mathrm{{nonnoise}} \end{array}\end{aligned}$$

(16)

Equation (16) implies that noise increases the portfolio variance and shifts the mean–variance effective frontier to the right. Therefore, denoising is equivalent to changing from a noisy environment to a non-noisy environment. As consequence, the effective frontier will shift to the left compared to that of using original price, and the higher the denoising degree is, the farther the shift to the left will be. Figure 2 summarizes the mean–variance effective frontier for different scenarios.

2.4 Measures of Portfolio Performance

In practice, investors are more concerned about the return they can achieve under a certain level of risk tolerance (Moura et al., 2020). Thus, four common quantitative indicators are considered to evaluate portfolio performance, which include the Sharpe ratio, Sortino ratio, upside potential ratio, and tracking error ratio. The higher these indicators are, the better the effect of portfolio will be.

As we know, the Sharpe ratio, abbreviated SR, is the most common indicator adopted by investors to measure portfolio return.

$$\begin{aligned} SR= \frac{{\mathbb {E}}(r_p)}{\sqrt{{var}(r_p)} } \end{aligned}$$

(17)

Due to potential drawbacks of Sharpe ratio in evaluating portfolio performance, we apply the Sortino ratio, abbreviated SoR, to take account of the asymmetric pattern of financial volatility which cannot be captured via Sharpe ratio (Sortino and Van Der Meer, 1991).

$$\begin{aligned} SoR=\frac{{\mathbb {E}}(r_p)}{\sqrt{{\mathbb {E}}(min (r_p, 0))^{2}}} \end{aligned}$$

(18)

Additionally, as described by Sortino et al. (1999), we take into account the upside potential return, and use the upside potential ratio, abbreviated UPR, to study the information in the higher moment.

$$\begin{aligned} UPR=\frac{ {\mathbb {E}}(max(r_p, 0))}{\sqrt{{\mathbb {E}}(min (r_p, 0))^2}} \end{aligned}$$

(19)

Also, in order to quantify the differences between competing portfolio strategies, the tracking error ratio, abbreviated TR, is used to evaluate the error-tracking ability (Berger and Czudaj, 2020).

$$\begin{aligned} TR = \frac{{\mathbb {E}}(r_p - r_b)}{\sqrt{var(r_p - r_b)}} \end{aligned}$$

(20)

where $r_b$ denotes the portfolio based on original unfiltered return, which is defined as the benchmark. TR gives the tracking error, i.e. the difference between the evaluated portfolio return and the benchmark. Thus, a higher TR denotes that the portfolio performance on error-tracking is better.

3 EMD Denoising Methodology

Section 2 points out that noise is an important factor affecting portfolio performance, take a step forward, a new EMD denoising method is constructed to improve portfolio performance. The reason for preferring EMD to construct the denoising method is that compared to traditional denoising methods such as wavelet denoising, etc, it is adaptive and does not require any prior assumptions about signal pattern or system order, such as basis function, decomposition level, etc, which are important factors affecting the denoising results. For investors, how to choose the right parameters is a difficult task. Besides, EMD shows better properties in dealing with nonlinear and non-stationary data (Huang et al., 1998), and has been widely applied to decompose financial data (Zhu et al., 2017; Yang et al., 2019). To illustrate the superiority of the proposed denoising method, we thoroughly compare several common denoising methods and test the portfolio performance under the mean–variance framework.

3.1 Empirical Mode Decomposition

The EMD proposed by Johnson et al. (1998) decomposes original noisy price x(t) into a series of IMFs, which need to satisfy the following two conditions: (1) The extremum numbers and zero-crossing points must be equal or differ at most by one in the whole time series. (2) The mean value of the envelope defined by the local maxima and minima is zero at any point. With this definition, the noisy price x(t) can be decomposed according to Table 1:

Table 1 EMD algorithm

Portfolio Selection Based on EMD Denoising with Correlation Coefficient Test Criterion

Abstract

Similar content being viewed by others

Portfolio allocation with CEEMDAN denoising algorithm

Research on regularized mean–variance portfolio selection strategy with modified Roy safety-first principle

Portfolio selection: shrinking the time-varying inverse conditional covariance matrix

1 Introduction

2 Portfolio Theory Under Noisy Environment

2.1 The Noisy Portfolio Returns

2.2 Mean–Variance Model Under Noisy Environment

2.3 Mean–Variance Effective Frontier

2.4 Measures of Portfolio Performance

3 EMD Denoising Methodology

3.1 Empirical Mode Decomposition

3.2 Common EMD Denoising Methods

3.3 The Proposed Denoising Method

4 Empirical Analysis

4.1 Data Resource

4.2 Denoising Analysis

4.3 Optimal Portfolio Construction

4.4 Portfolio Performance Evaluation

4.4.1 Full Sample Analysis

4.4.2 Subsamples Analysis

5 Simulation Study

6 Conclusions

Availability of Data and Materials

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interest

Ethical Approval

Consent to Participate

Consent for Publication

Additional information

Publisher's Note

Appendices

Appendix 1

Appendix 2: Denoising Analysis

Appendix 3: Portfolio performance based on different wavelet soft threshold denoising methods

Appendix 4: Simulation study based on different sample lengths

Appendix 5: Robustness Test

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation