Abstract
We investigate the predictability of leading equity indices of 23 developed and 18 emerging markets with a set of 6406 technical trading rules over up to 66 years. Using a state-of-the-art test for superior predictive ability to control for data snooping bias, we find in-sample evidence for technical heuristics with significant outperformance over a simple buy-and-hold strategy in the majority of markets. The proportion of heuristics with superior performance is much higher among emerging market indices, and the predictability diminishes drastically over time in all markets. In particular, markets turn unpredictable in the last years of our sample. Moreover, the results are very sensitive to the introduction of moderate transaction costs. An out-of-sample analysis shows that the performance of technical rules is not persistent in the sense that recently best-performing rules perform significantly worse than simple buy-and-hold strategies in the future. Overall, our results cast serious doubt on whether investors could have earned any excess profits using the broad range of considered technical trading rules.
Similar content being viewed by others
1 Introduction
Whether technical analysis is capable of generating consistent profits is a matter of intense debate, both in research and in practice. According to several academic studies, technical trading rules,Footnote 1 which generate trading signals solely based on past price and volume information, are popular among professional and retail investors.Footnote 2 However, proponents of the efficient market hypothesis (Fama 1965, 1970) argue that technical analysis is meaningless, such as Malkiel (1999) who states that “technical analysts build their strategies on dreams of castles in the air” (p. 138) and that these investors follow an “abnormally dedicated cult” (p. 119).
The empirical literature mainly disagrees on whether technical analysis adds value for investors.Footnote 3 Most of these studies, however, focus exclusively on the leading US stock market indices and investigate narrow sets of technical trading heuristics. In addition to that, the research on technical trading rules generally faces the challenge of an accurate statistical analysis, which is necessary to produce reliable results. Data snooping bias is a major concern when assessing multiple models at the same time, which is often the case in the research of technical trading rules.Footnote 4 Since the risk of data snooping bias is immense when analyzing large sets of trading rules simultaneously, a statistical procedure to account for data snooping is inevitable. However, adequately dealing with data snooping is technically challenging, such that statistical tests that address this problem were not developed until after White (2000) introduced a “Reality Check” for data snooping. Irwin and Park (2007) argue that the technical rigor of previous studies is generally low due to outdated statistical procedures and that many of these positive contributions to the literature may be significantly flawed due to problems with data snooping.
This paper revisits the mixed results found by the previous literature, whose contributions are mainly limited in at least one of three dimensions: (i) the scope of investigated market data, (ii) the range of considered technical trading rules, and (iii) the adequacy of statistical tests to address concerns of data snooping bias. Our goal is to provide a comprehensive study that does not fall short on any of these points. We investigate the predictability of 23 developed and 18 emerging market indices with a set of 6406 common technical trading rules.Footnote 5 The sample includes daily close prices spanning up to 66 years.Footnote 6 We employ the “Stepwise Superior Predictive Ability Test” proposed by Hsu et al. (2010), which is a state-of-the-art multiple hypothesis test designed to assess the statistical significance of the performance of large sets of trading rules while controlling for data snooping bias. The test combines features of related multiple hypothesis tests of Hansen (2005) and Romano and Wolf (2005), and is able to identify as many predictive technical trading rules as possible in a stepwise procedure.Footnote 7
We apply the Stepwise Superior Predictive Ability Test to all stock indices in our sample and find statistically significant outperformance of technical trading rules over simple buy-and-hold strategies in 13 of the 23 developed countries and in 14 of the 18 emerging markets based on a Sharpe ratio (Sharpe 1966, 1994) performance measure.Footnote 8 Particularly for developed market indices, the number of rules that outperform the respective index is generally low. We then divide the sample periods into subsamples of 7 years in length to examine the evolution of predictive ability over time. We find a significant time trend in the predictive power of technical rules, which is highest during the first subperiods and declines sharply thereafter. Almost no developed market is predictable with the considered technical trading rules during the last two subperiods starting in 2002. In contrast, we still find that half of the emerging markets are predictable, with at least one rule during the period between 2002 and 2008. However, in the last subperiod between 2009 and 2016, almost all emerging market indices are unpredictable. These in-sample results are consistent with the predictions of the adaptive market hypothesis of Lo (2004). In contrast to the efficient market hypothesis, the adaptive market hypothesis assumes that markets evolve over time and that efficiency gradually increases as market participants are subject to a steady learning process. The degree of efficiency and the speed of adjustment depends on the competition among traders and their ability to learn. Consequently, initially superior (proprietary) trading strategies may eventually turn unprofitable due to either changing market environments or excessive competition as more participants adopt these strategies. In line with that, for all markets in our sample, the predictability through technical trading rules vanishes over time, and markets that are assumed to be less competitive generally exhibit higher predictability through technical trading rules.
Next, we test for the sensitivity of technical trading to transaction costs. In contrast to the previous literature, we refrain from calculating simple break-even transaction costs for technical rules. This approach treats transaction costs as exogenous and, thereby, neglects the impact of these costs on the performance of trading rules relative to each other.Footnote 9 We circumvent this issue by running the Stepwise Superior Predictive Ability Test multiple times with steadily increasing single-trip transaction costs. The analysis reveals that the predictive power of many rules is quickly offset by the introduction of low transaction costs. Only 5 of the 23 developed markets and 4 of the 18 emerging markets exhibit significantly outperforming trading rules for single-trip transaction costs of at least 20 basis points. The high sensitivity of technical trading performance to transaction costs is in line with prior findings in the literature (e.g., Allen and Karjalainen 1999; Bajgrowicz and Scaillet 2012; Ready 2002).
A significant part of this paper is devoted to an out-of-sample analysis on the applicability of technical heuristics in a way that aims to mimics the trading behavior of a real trader. Most of the previous literature on technical analysis examines only the ex-post performance of technical heuristics and neglects whether these heuristics actually exhibit out-of-sample persistent performance in the future. We identify the best trading rules in 3 year subperiods that are eligible for trading in each subsequent 3 year period. While the in-sample performance in terms of Sharpe ratios of the best trading rules is highly economically and statistically significant, the corresponding out-of-sample performance relative to buying and holding the respective index is either insignificant or negative and significant.Footnote 10 We also construct three equally weighted portfolios based on the out-of-sample technical trading algorithm applied to all 41 markets, as well as all developed markets and all emerging markets, respectively. Assuming transaction costs of zero, we find that none of the three portfolios generates a Sharpe ratio that is significantly different from the ones of the equally weighted buy-and-hold portfolios. For positive transaction costs, the technical trading portfolios significantly underperform the buy-and-hold portfolios most of the time. These findings suggest that the performance of best rules is not persistent and technical trading signals can be considered as noise.
We contribute to the empirical literature with a clear argument against the usefulness of technical trading rules in today’s markets. We consider this contribution significant for several reasons. To the best of our knowledge, this paper is the first broad-based analysis using the powerful and rigorous Stepwise Superior Predictive Ability Test for an analysis of technical trading rules in a comparative analysis of equity markets.Footnote 11 In addition, we leverage the test procedure to determine the distribution of rules with superior performance at different transaction cost levels. Unlike existing transaction cost analyses in the literature, we do not perform a pure break-even analysis, which may be biased since transaction costs are treated as exogenous. Another contribution is the out-of-sample test for technical trading performance. A similar persistence analysis has only been conducted by Bajgrowicz and Scaillet (2012), who apply it to a single time series of daily index data. To the best of our knowledge, our study provides the most comprehensive analysis of a large set of technical trading rules in terms of the number of investigated stock markets, the power of the applied testing procedure, and the rigor of the transaction cost and out-of-sample analysis. Given the general disagreement in the academic literature on the predictive ability of technical trading rules, our work aims to shed light on this ongoing debate.
The remainder of this paper is organized as follows. Section 2 discusses data snooping bias and multiple hypothesis tests in research on technical trading rules. Section 3 introduces the technical trading rules used in the empirical analysis of this paper. Section 4 deals with the performance measurement. Section 5 describes the data. Sections 6 and 7 present the in-sample and out-of-sample results, respectively. Section 8 concludes.
2 Data snooping bias and research on technical analysis
In the following, we provide a discussion of data snooping and appropriate testing procedures to cope with it in the context of technical analysis. While this section is mainly nontechnical, Appendix 1 introduces the basic steps required to implement the Stepwise Superior Predictive Ability Test by Hsu et al. (2010) used in the empirical part of this paper.
Data snooping becomes a problem if data are used more than once for statistical inference or to assess the applicability of multiple models (White 2000). This occurs frequently in empirical research as prior empirical findings likely motivate researchers to reexamine already investigated data (Lo and MacKinlay 1990b). Thus, datasets that happen to yield “desired” results are likely studied more intensively, while other datasets may be studied less frequently and the corresponding results are rarely published. With respect to that, Merton (1987) stresses the potential for biased results due to an overuse of certain data. He argues that standard statistical tests are inappropriate for these applications as they do not account for potential biases caused by data snooping. In addition, data snooping bias is a concern when multiple hypotheses are evaluated based on one dataset. One may (unintentionally) over adjust and fine tune the parametrization of technical heuristics based on past information, and succumb to the impression of superior performance. Thus, technical analysis research is highly susceptible to data snooping bias due to the large number of potentially testable trading rules. Bajgrowicz and Scaillet (2012) exemplify the danger of data snooping with the following exaggerated example: “[I]magine you put enough monkeys on type writers and that one of the monkeys writes ‘The Iliad’ in ancient Greek. Because of the sheer size of the sample, you are likely to find a lucky monkey once in a while. Would you bet any money that he is going to write ‘The Odyssey’ next?” (p. 474). The essence of data snooping bias is, therefore, that ample search for successful trading rules (or, more generally, for some models) in past data may likely yield superior but spurious results.
Classical hypothesis testing procedures disregard the impact of search on statistical significance in problems involving a high number of models to be evaluated. Following the argument of Hsu et al. (2016), if one aims to test for statistical significance of the maximum element from the vector \(\theta =\left( \theta _1,\theta _2,\dots ,\theta _m\right)\) (e.g., the highest performance measure from a set of m trading rules), a traditional test setup may simply formulate a null hypothesis based on the maximum element of \(\theta\), \(\theta ^{\max }= \max \left\{ \theta _1,\theta _2,\dots ,\theta _m\right\}\). However, according to White (2000), if an extensive specification search is required to identify the best-performing forecasting model, there is a high risk that the performance of this model is spurious. Even if one is only interested in the significance of the best model, the search process involved in identifying such a model implicitly turns the setup into a multiple hypothesis problem. That is because weaker models are also evaluated along the search process. As a result, neglecting the specification search and, thus, disregarding worse models in the test procedure of the best model increases the probability of committing a type I error of incorrectly rejecting the null hypothesis. Hence, for large m, one likely overstates the statistical significance of the highest performance in \(\theta\) and understates the actual probability of a type I error if the test procedure does not account for data snooping. This concern is particularly pronounced when testing technical trading rules, as the free choice of parameter values typically leads to many different (albeit correlated) models.
An early and simple approach to account for multiple hypotheses in statistical tests is provided by the popular Bonferroni correction. If one targets an overall significance level (often referred to as familywise error rate) of \(\alpha\) while testing m models simultaneously, the Bonferroni correction requires that each model is tested with a significance level of \(\alpha /m\). This approach might be suitable for applications where m is small. However, the test becomes very conservative for large m.Footnote 12 Also, the Bonferroni correction disregards a potentially complex correlation structure of trading signals of “related” trading rules.Footnote 13 The higher the correlation, the more conservative a Bonferroni test is. In the extreme case of a perfect correlation between all considered trading rules, a multiple hypothesis test is equivalent to a single hypothesis test. An appropriate test for evaluating the performance of technical trading rules should take this correlation into account.
White (2000) is the first to propose a joint testing method that considers the dependence structure of individual time series. His Reality Check tests whether the best trading rule from a potentially large set of trading rules shows significant outperformance of a benchmark such as a buy-and-hold strategy. White (2000) encounters the problem that the distribution of \(\theta ^{\max }\) from the vector \(\theta\) with correlated elements is highly complex or unknown (e.g., if the elements of \(\theta\) are correlated normal random variables, the distribution of \(\theta ^{\max }\) is unknown). White (2000) shows that a null hypothesis of the form \({\text{H}}_{0} :\, \max _{{k = 1, \ldots ,m}} \theta _{k} \le 0\) can be tested using the stationary bootstrap of Politis and Romano (1994). According to that, the original data are resampled in blocks of stochastic length to obtain pseudo time series that are used to compute an empirical bootstrap distribution for \(\theta ^{\max }\). The p-value for the best model is then calculated as the proportion of bootstrapped maximum performance measures that are larger than the actual sample maximum.
The Reality Check has two limitations from a statistical and a practical perspective. First, as Hansen (2005) notes, the Reality Check has low statistical power if many poorly performing trading rules are tested. This is because an increasing number of poor models disguises the good ones, resulting in lower test power.Footnote 14 Second, the Reality Check only tests for the significance of \(\theta ^{\max }\), such that there is no statistical evaluation of all strategies that are inferior to the best strategy. In practice, however, it may not be sufficient to just assess the statistical significance for the best trading rule, since one may be interested in all rules that exhibit real outperformance.
Both drawbacks of White’s (2000) Reality Check are addressed in advanced versions of the test. Hansen (2005) proposes the “Superior Predictive Ability Test,” an improved version of the Reality Check in which bootstrap samples are recentered when models perform particularly poorly, assigning lower weights to those models. Still, the Superior Predictive Ability Test also just tests the single best model, regardless of the total number of considered models. Both Romano and Wolf (2005) and Hsu et al. (2010) introduce stepwise versions of the Reality Check and the Superior Predictive Ability Test, respectively. These tests extend the existing counterpart by an algorithm that allows to identify all models with statistically significant outperformance in a stepwise procedure that asymptotically controls for the probability of committing a type I error. If the full set of models contains models with superior performance, these are removed from the full set and the procedure is reestimated over the remaining subset. This stepwise procedure terminates as soon as the test no longer identifies a model that exhibits superior performance at the prespecified significance level. For our empirical study, we employ the Stepwise Superior Predictive Ability Test of Hsu et al. (2010), as it is the most progressive test procedure by combining the Superior Predictive Ability Test of Hansen (2005) with a stepwise algorithm.Footnote 15
3 Technical trading rules
We use a sample of 6406 technical trading rules that are categorized in five major strategy classes. For some rules, we additionally use up to three trading filters. While we briefly introduce these five strategies and three trading filters here, Appendix 2 provides the technical definitions required to implement them for computer-based automated trading.
The relative strength index is a so-called oscillator, which was first introduced and studied by Levy (1967a, 1967b) and Wilder (1978). It defines a ratio of the sum of positive to the sum of all absolute price changes over a prespecified time period. The frequency and the magnitude of positive price changes raises the relative strength index. Analogously, negative price changes decrease the relative strength index. By definition, the index ranges from 0 to 100. Technical analysts interpret a high relative strength index (usually values of 70 and above are considered high) as an indication of a recently overbought stock that is expected to undergo a downward correction in the near future. A value of 30 or below is usually interpreted in the other direction.
Filter rules were studied by Alexander (1961, 1964) as well as Blume and Fama (1966). These rules assume that recent trends in stock prices are likely to continue in the future. Trading signals are generated once a stock shows a trend (i.e., positive or negative cumulative price changes of a certain magnitude) during a specific period of time. Adverse price movements indicate the reversal of trends.
Moving average rules attempt to identify price trends by smoothing the time series of past prices. Trading signals are triggered when two moving averages cross which is perceived as a reversal of a trend. Specifically, once the “slow” moving average (i.e., the smoother moving average due to an inclusion of more historical data) penetrates the “fast” moving average from above, the crossover event is interpreted as a bullish signal. Likewise, a penetration from below is interpreted as a bearish signal.
Support and resistance rules rely on the assumption that the security price has not moved below or above a certain price level several times in the recent past (i.e., the price has formed a “support level” from above or a “resistance level” from below). Once a price breaks through the support level or falls below the resistance level, it is assumed to continue moving in the respective direction.
Channel breakout rules are based on the idea that a security price moves within the boundaries of a certain price range. More specifically, a channel is defined by two parallel lines between which the price moves back and forth over a specific period of time. A break through the upper (lower) boundary of the channel is interpreted as the beginning of a positive (negative) trend.
We supplement the trading rules with up to three common trading filters.Footnote 16 These filters are designed to improve trading results by filtering out “unfavorable” trading signals ex ante.Footnote 17 First, a band filter sets an additional return by which the price of an asset must move in the desired direction to open a trading position after a trading signal is generated. It is, therefore, designed to prevent the entry into a position if the price movement is not strong enough. Second, a time-delay filter simply delays a trade by a specified time interval after a signal is generated. Third, a fixed-length filter defines a fixed holding period after which a position is automatically liquidated. Any intermediate trading signals are ignored. This filter aims to close a position before trends flatten out and the security potentially generates negative returns.
The filter, moving average, support and resistance, and channel breakout rules as well as their parameterizations are adopted from the seminal paper of Sullivan et al. (1999). In contrast to their study, this paper does not use on-balance volume rules (that require daily volume data) because volume data are not available for several indices in our sample. We, furthermore, add the relative strength index rules used by Hsu et al. (2016) to our sample of rules. The parameters of the trading rules are presented in Table 1 and Table 2 lists the considered parameter combinations. Overall, we study a total of 6406 trading rules, including 600 relative strength index rules, 497 filter rules, 2049 moving average rules, 1220 support and resistance rules, and 2040 channel breakout rules.
4 Performance measurement
Similar to the prior literature (e.g., Bajgrowicz and Scaillet 2012; Hsu et al. 2010, 2016; Sullivan et al. 1999), we use a Sharpe ratio criterion in all applications of the Stepwise Superior Predictive Ability Test to evaluate the performance of technical trading rules (see Appendix 1 for the implementation of the test procedure based on the Sharpe ratio criterion, which is introduced in this section). To formalize the underlying performance measurement, let there be a total of m trading heuristics to be tested and let \(r_{t}\) denote the one-period return of a certain risky asset and \(r_t^f\) denote the risk-free rate in period t, \(t = 1,\dots ,N\). Furthermore, \(\delta _{t-1,k}\) is a signal indicator for heuristic k, \(k = 1,\dots ,m\), which indicates the exposure to the security in period t (i.e., \(\delta _{t-1,k}= 1\) for a long, \(\delta _{t-1,k}= 0\) for a neutral, and \(\delta _{t-1,k}=-1\) for a short position).
Trading according to technical sell signals requires closing long positions and/or entering short positions, where the latter may be associated with several constraints such as lending fees, lacking assets to sell short, or governmental or institutional restrictions (see, e.g., Almazan et al. 2004; D’Avolio 2002; Nagel 2005). Short sale constraints are likely severe for certain markets and time periods in our sample. On the one hand, less-developed equity markets may not be endowed with the institutional framework for short selling (at reasonable costs). On the other hand, during earlier time periods, short selling was much more expensive than it became more recently (D’Avolio 2002; Duffie et al. 2002; Jones and Lamont 2002). Moreover, short selling costs are time varying and tend to increase substantially during periods with a high willingness of investors to short a stock.
We circumvent the problems associated with true short positions by using the “double-or-out” strategy proposed by Bessembinder and Chan (1995). According to that, whenever \(\delta _{t-1,k}= 0\), an investor holds one unit of the asset and earns its return \(r_t\). If \(\delta _{t-1,k}= 1\), an investor is required to double his or her exposure by borrowing money at the risk-free rate to finance the additional unit of the asset that results in a payoff of \(2r_t-r_{t}^f.\) Finally, whenever \(\delta _{t-1,k}= -1\), the asset is sold and the proceeds are invested in the risk-free security such that the payout is equal to \(r_{t}^f\). In summary, the return of trading rule k in period t under a double-or-out strategy reads as
and the associated Sharpe ratio is given by
where \({\bar{\pi }}_k-{\bar{r}}^f\) is the sample average of excess returns and \({\hat{\sigma }}_k\) is the corresponding standard deviation.
We evaluate the performance of technical trading rules against a benchmark. A natural choice for a benchmark model is the null model of staying out of the market (e.g., Brock et al. 1992). However, since active traders usually aim to evaluate their trading performance against a benchmark that is related to the risk–return profile of the traded asset, we chose a simple buy-and-hold strategy as benchmark model. Therefore, we set \(\pi _{t,0} = r_t \delta _{t,0}\), where \(\delta _{t,0}=1\) for all t such that \(\pi _{t,0} =r_t\).
Following Sullivan et al. (1999), our final performance measure is the difference between the Sharpe ratios generated by trading rule k and the benchmark,
where \({\hat{\sigma }}_0\) is the sample standard deviation of the return series of the benchmark.
Since we do not have interest rate data for most of the investigated markets, we set \(r_{t}^f = 0\) for all t. Even though this simplification may bias our results, we assume that the impact on the overall outcomes is negligible. First, if a trading rule exhibits more days with long than short exposure, the trading returns may be biased upwards as we disregard longer periods when the investor has to pay \(r_{t}^f\) for borrowing funds to invest in a second unit of the asset. In turn, the performance could be biased downwards in the opposite case. Second, if costs of borrowing vary systematically between periods with long and short exposure, the performance measure may be biased as well. Third, expression (3) shows that the impact of \({\bar{r}}^f\) is approximately offset if standard deviations are similar. However, the performance of trading rule k tends to be overestimated if \({\hat{\sigma }}_0 > {\hat{\sigma }}_k\) when neglecting \({\bar{r}}^f\). For those reasons, we examine whether the simplification could have a significant impact on the performance measure. We find that the vast majority of investigated trading rules are relatively balanced in terms of the time invested in short and long positions. Moreover, the returns from technical trading tend to be more volatile than those of the respective benchmarks (which is mainly due to the characteristics of the double-or-out strategy), suggesting that setting \(r_t^f=0\) may keep our results rather on the conservative side. Based on these findings, we expect that the likelihood of a significant upward bias should be low.
5 Data
The empirical study is based on the daily close prices of the leading stock market indices of 23 developed countries and 18 emerging markets. An overview of these markets and respective sample periods is provided in Table 3. The longest available time series is for the US S&P 500, which covers the period between January 1950 and May 2016, while the shortest is for the Colombian IGBC, which covers the period between July 2001 and May 2016. The index data are retrieved from Datastream and cover the maximum available sample periods ending in May 2016. Markets for which less than 14 years of data are available are not included to ensure appropriately long time series and to have at least two subsamples of 7 years each in the subperiod analysis. The classification of countries as either “developed” or “emerging” is based on the categorization of the International Monetary Fund (IMF 2016), which applied at the end of the sample period. The 23 developed countries are considered as “advanced economies” by the IMF. Likewise, the 18 emerging markets are categorized as either emerging or developing markets.Footnote 18 Obviously, some of the markets undergo significant economic development during the sample period in the sense that some countries were not considered developed at the beginning of the sample period, but are considered so at the end.Footnote 19 Clearly, a static classification of markets for longer time periods is not perfectly possible.
Given that many emerging economies in the sample experienced high levels of inflation in the past, we adjust stock market returns for inflation rates.Footnote 20 The annual inflation rates are obtained from World Bank Open Data. We compute inflation-adjusted daily returns as \(r_t^{\textit{adj}} = r_t^{\textit{unadj}} - \log (1+i_y)/252\), where \(r_t^{\textit{unadj}}\) is the unadjusted daily index return on day t, and \(i_y\) is the corresponding discrete inflation rate in year y.
Table 4 presents basic descriptive statistics for the 41 stock market indices as well as annual inflation rates of the corresponding countries. Not surprisingly, emerging market indices in panel B tend to have higher average annualized returns and are more volatile than most developed market indices in panel A. Similarly, with a few exceptions such as Croatia, these countries have much higher inflation rates on average.
The use of index data is widespread in studies of technical trading rules, which is mainly due to the availability of long time series. A major drawback of such analyses is the limitation that stock indices cannot be traded directly. Investments that are closely related to the risk and return profile of stock indices are usually only possible through certain derivative instruments or exchange-traded funds. An obvious alternative to conducting our analysis would be to use actual market data for the constituent stocks of the considered indices, which is, however, not available for many markets in our sample. In the absence of reasonable alternatives, we therefore also rely on index data, but note that a broad analysis of actual stock data could enrich the technical analysis literature.
The analysis of index data raises additional issues that may lead to an upward bias of results. As shown by Scholes and Williams (1977), nonsynchronous trading of index constituents can lead to measurement error due to spurious serial dependence of index returns. As Ready (2002) notes, especially on days with technical trading signals, there may be systematic differences between the close price and the opening price of a stock index on the next trading day. First, imbalances in buy/sell orders caused by excess buy/sell interest at the close could reappear at the opening of the following trading day and lead to a systematic price movement. Typically, technical signals tend to exploit positive serial dependence since buy (sell) signals are often observed on days with large positive (negative) price movements (Bessembinder and Chan 1995). Moreover, if at least some close prices of the stocks included in the index are stale, partial adjustments on subsequent trading days may result in price changes biased in the same direction as those on previous days. The concern of stale prices is relatively low for very liquid stocks, but it tends to be higher if stocks trade at low volume (Campbell et al. 1993). Hence, biases induced by nonsynchronous trading could be problematic for earlier time periods when trading volume used to be lower (e.g., Lo and Wang 2000) or for less liquid markets in the sample.Footnote 21
To mitigate concerns of overestimated trading performance, we use the common approach of delaying the actual entry into a position by one day after a signal is triggered (see, e.g., Bessembinder and Chan 1995; Ready 2002; Sullivan et al. 1999). More specifically, if a technical signal is generated on day t, it is utilized at the close price of day \(t+1\). Since order imbalances and partial adjustments of stale prices tend to normalize after one trading day, it is assumed that the close of the following day is the first trading opportunity for investors. Also, assuming that stock prices are equally likely to trade at the bid or the ask price one day after a trade signal occurs, delayed trading corrects for spread-induced trading costs.Footnote 22
6 In-sample results
6.1 Performance of technical trading rules
In this section, we present the results of the application of the Stepwise Superior Predictive Ability Test. While we examine subperiods and the impact of transaction costs in later sections, the following analysis addresses full sample periods of all 41 market indices under the assumption that trading is free of any costs.
The results are reported in Table 5, where panel A shows the results for developed countries and panel B shows the results for emerging markets, respectively. The column “Full sam.” lists the number of technical trading rules with superior performance against buying and holding the index (i.e., these rules exhibit a p-value of less than 0.1 according to the Stepwise Superior Predictive Ability Test based on the Sharpe ratio criterion). We find that 13 of the 23 developed market indices and 14 of the 18 emerging market indices are predictable with at least one technical trading rule. The highest degree of predictability among developed country indices in terms of the total number of rules with superior performance is observed for Hong Kong (290 or about 4.5% of all considered heuristics), Portugal (153 or about 2.4%), and Great Britain (101 or about 1.6%). Among the emerging market indices, predictability is highest for Peru (1700 or about 26.5%), Pakistan (805 or about 12.6%), and Bulgaria (556 or about 8.7%). Thus, while the share of predictable markets is not only higher among the sample of emerging market indices, these markets also exhibit a higher degree of predictability in terms of the proportion of rules with superior performance. For instance, 8 of the 13 predictable developed market indices have at most 14 outperforming rules, which corresponds to only about 0.2% of the full universe of considered technical trading rules.
This first analysis suggests that the majority of markets are predictable with technical rules, and, in particular, several emerging markets have a significant proportion of rules with superior performance. On the one hand, the high number of predictive rules in emerging markets may be due to lower market efficiency, as the stock markets of less developed countries tend to incorporate new information into prices more slowly and lack institutional framework to enhance efficiency (see, e.g., Araújo Lima and Tabak 2004; Bhuyan et al. 2008; Poshakwale 2002; Sharma and Thaker 2015). On the other hand, trading in these markets tends to incur comparatively high transaction costs (Lesmond 2005), which may then be picked up as profits through active trading strategies under a zero-trading cost scheme.Footnote 23 In fact, most technical trading rules generate trading signals very frequently, such that these rules would incur substantial transaction costs if applied to real markets. Later robustness checks will assess the sensitivity of performance to transaction costs.
Since investors usually strive for maximum profits, the best-performing technical trading rules play an important role among the whole set of examined rules. Therefore, Table 5 also reports basic characteristics for these specific rules for all markets.Footnote 24 For 23 (15) markets, support and resistance rules (moving average rules) have the best performance in terms of the Sharpe ratio criterion.Footnote 25 Moreover, the best-performing rules have significant excess Sharpe ratios of up to 1.29 (Portugal) for developed markets and 1.12 (Saudi Arabia) for emerging markets. These rules, furthermore, produce highly economically significant excess returns, which mainly range from more than 10% to around 40% per year.
Table 5 also shows that best-performing rules predominately exhibit long or short exposure to the market [i.e., 28 (39) of the 41 best rules are either long or short at least 80% (50%) of the time]. In several cases, the best rules are never market-neutral once an initial trading signal is triggered, as a signal converts a long position to a short position and vice versa (e.g., this is the case for all moving average rules without trading filters). For both developed and emerging market indices, best-performing trading rules tend to generate higher average returns per trade than short positions, and long signals are more likely to generate winning trades. This is in line with previous research by, for instance, Brock et al. (1992) who report that gains of long–short trading strategies are mainly driven by long positions, which is consistent with generally upward-trending equity markets in the long run.
The average number of transactions per year varies significantly among best rules. Many of them, however, generate trading signals very frequently. For example, the best rule for the Colombian market is a 5 day moving average that emits more than 70 trades per year on average with a mean holding period of just 4.3 days on the long side and 2.7 days on the short side. Accordingly, trading such rules may accumulate high transaction costs.
In the next two sections, we check for the robustness of our in-sample results by examining how the predictive ability of technical trading rules evolves over time, using subperiods of the full sample periods. Thereafter, we explore the impact of transaction costs.
6.2 Subperiod analysis
In a further step, we test whether the results for the full sample periods also hold during subperiods of 7 years.Footnote 26 Table 6 presents the results for developed country indices (panel A) and emerging market indices (panel B). Again, outperforming rules are defined as those rules that have a p-value of less than 0.1 according to the Stepwise Superior Predictive Ability Test using the Sharpe ratio criterion. For comparison purposes, we report the number of rules with superior performance over the entire sample periods, as already presented in Table 5, in the last column of the table.
For 20 of the 23 indices for which we find outperforming rules in at least one subperiod, the highest number of outperforming rules is observed in the first subperiod. In these cases, the proportion of predictive rules tends to decrease sharply after the first subperiods. Most notably, there are 350 outperforming rules for the Finnish stock market index between 1988 and 1994, 305 outperforming rules for the Bulgarian stock market index between 2002 and 2008, and 103 outperforming rules for the Brazilian stock market index between 1995 and 2001. While these are the results for the first subperiods, for all heuristics in the sample, the three indices become unpredictable during the subsequent subperiods. A decline in technical trading performance over time is also found by Kish and Kwon (2002) and LeBaron (2000) for major US stock indices and by Hsu et al. (2016), Neely et al. (2009), and Olsen (2004) for several foreign exchange pairs. Our results, furthermore, show that predictive power drops much earlier in developed markets than in emerging markets. In developed markets, predictive ability of technical trading rules disappears almost completely by the early 2000s. For emerging markets, the sharpest decline in predictability is observed between the penultimate (2002–2008) and the most recent (2009–2016) subperiods in terms of the share of predictable markets (see also the last row of the panel). In the last subperiod, no developed market index is predictable anymore and only two emerging market indices are still predictable with only one rule each.
There are several potential reasons for this negative trend in predictability. As discussed in the previous section, trade-intensive rules may perform particularly well before transaction costs. As trading costs have steadily declined in recent decades (e.g., Jones 2002), rules that earn high profits by exploiting a zero-cost system should, all else equal, be more successful during the early subperiods. Another explanation for our results is different levels of market efficiency over time, which is in line with the adaptive market hypothesis of Lo (2004). According to that, the first subperiods can be interpreted as “early evolutionary stages,” which are subject to greater market efficiency in the future, for instance, through gradual learning by market participants. Thus, technical trading strategies can generate superior performance over certain periods, and the more proprietary these strategies are, the longer they may yield attractive results. However, even very sophisticated trading strategies may eventually “die out” if market conditions change significantly, such as more intense competition among traders for strategy returns (Timmermann 2008; Timmermann and Granger 2004).Footnote 27 Closely related to the notion of growing market efficiency is the rise of institutional investors and professional algorithmic traders.Footnote 28 While institutional investors are not necessarily the better investors, greater competition among these investors can lead to an increase in know-how about trading techniques and drive the speed of processing market data.
In summary, the declining performance of technical trading rules across markets may not be surprising, as the rules studied in this paper are considered quite simple and were not ahead of their time at the beginning of the sample periods. The ease with which these naive technical trading rules can be copied suggests that initially predictive rules may become unfavorable relatively quickly. While at least some of the trading rules have performed significantly better than buy-and-hold strategies under a zero-transaction cost scheme in early subperiods, they all appear to be useless in current markets.
6.3 Transaction cost analysis
In the previous analyses, transaction costs were not taken into account. The superior performance found for several technical trading heuristics does not necessarily imply superior returns after transaction costs, since a zero-cost scheme is prone to overestimate the performance of highly trade-intensive heuristics. Transaction costs may also vary substantially across the examined markets (Lesmond 2005). In the following, we relax the strong simplification of trading without costs.
A proper transaction cost analysis is not trivial since reliable cost estimates for certain markets and time periods are usually not available. Therefore, a common approach in the literature is to calculate break-even transaction costs of the rules with superior performance at zero transaction costs (see, e.g., Bessembinder and Chan 1995, 1998; Hsu et al. 2016). For example, Hsu et al. (2016) simply calculate the break-even costs for the best trading rules they identified in a previous test for superior predictive ability. The drawback of this approach is that it does not reveal the maximum transaction cost per trade up to which a trading rule still exhibits statistically significant outperformance. In addition, technical trading rules that emit frequent signals are likely to be more sensitive to transaction costs compared with other rules. Thus, the distribution of rules with significant performance is most likely affected by the introduction or increase of transaction costs (see also Bajgrowicz and Scaillet 2012). To address these issues, we consider transaction costs before testing for statistical significance.
Our approach is simple, but to the best of our knowledge has not been used in the literature before. We increase the transaction costs in steps of five basis points and use the Stepwise Superior Predictive Ability Test at each transaction cost level to test whether technical trading rules have superior performance. As a result, we obtain the number of outperforming rules for each transaction cost level. Once the transaction costs have reached a level where no significant performance can be detected, the algorithm stops.
Table 7 reports the results of the transaction cost study (the results for zero transaction costs from Table 6 are also reported for ease of comparison). Regarding panel A for developed countries, for 10 of the 23 market indices (i.e., Canada, Finland, Great Britain, Greece, Hong Kong, Israel, Japan, Norway, Portugal, and the USA) at least one trading rule with significant outperformance at positive transaction costs exists. However, for these markets, the number of outperforming rules and the maximum level of transaction costs are mostly small. Trading rules with superior performance at transaction costs of more than ten basis points are only found in five markets. The highest single-trip transaction costs of 200 basis points are estimated for one rule in the Japanese stock market. This is followed by the stock indices from Hong Kong and Portugal, both of which are predictable with one rule for costs of up to 40 basis points per transaction. Of the 18 emerging markets, 11 markets (i.e., Bulgaria, Chile, China, Colombia, Croatia, Malaysia, Pakistan, Peru, the Philippines, Russia, and Saudi Arabia) are predictable for positive one-way transaction costs of at least five basis points (see panel B). The Bulgarian, Malaysian, Pakistani, and Peruvian stock market indices have the highest predictability for costs of at least 50 basis points per trade. With 112 outperforming rules at the 500 basis point level and four outperforming rules at the 1000 basis point level, the Bulgarian SOFIX exhibits the highest predictability. With a few exceptions, our results are consistent with previous findings in the literature that technical trading performance is quickly offset by moderate transaction costs (e.g., Allen and Karjalainen 1999; Bajgrowicz and Scaillet 2012; Ready 2002).
The impact of transaction costs for the full sample of technical trading rules is also shown in Fig. 1 for the US S&P 500 (panel A) and the Peruvian S&P/PVL Peru General (panel B) as examples for a developed and an emerging market, respectively.Footnote 29 Both panels plot the Sharpe ratio performance measure for all 6406 technical heuristics at transaction costs of 0 and 25 basis points per transaction. Generally, the impact of transaction costs is quite significant for rules that favor very frequent trading (these are often rules with lower parameter values which are mainly shown further to the left for each of the five classes of heuristics). For example, while the performance measures of all moving average rules before transaction costs range within the interval of \([-0.5,0.3]\) in the US market, performance suffers significantly when trading costs are added. With single-trip costs of 25 basis points, excess Sharpe ratios of moving average rules are as low as \(-\)3.5. Despite the meaningful impact of trading costs on performance, the example of the Peruvian market suggests that a large fraction of rules generate excess Sharpe ratios well above zero, which is not the case in the US market. Again, the higher degree of predictability of the Peruvian market could be due to relatively lower market efficiency and generally higher transaction costs required to trade in this market in reality.
In most markets, the estimated maximum transaction costs for a single transaction are small compared with what traders would likely have had to pay to trade these markets during the sample periods. For example, Chan and Lakonishok (1993) estimate single-trip trading costs of 14 basis points incurred by institutional traders for large US stocks between 1986 and 1988. According to Stoll and Whaley (1983), average trading costs were much higher in the preceding decades. Moreover, the emerging markets in our sample are likely to have much higher transaction costs. For the period from 1991 to 2000, Lesmond (2005) estimates variable trade costs of 0.5–1.0% for Malaysia, 0.6–0.9% for Poland (both depending on order volume), 0.8–1.05% for China, 1.5% for the Philippines (both for foreign investors), and 1.0% for Indonesia. Given these figures, our analysis casts doubt on whether markets could have been traded profitably with any of the investigated trading rules.
Next, we investigate the evolution of the average number of trades and average holding periods of outperforming rules at different transaction cost levels. Table 8 presents the relative change in the average number of trades for positive transaction costs.Footnote 30 For each market and transaction cost level, we average the number of trades for all rules with superior performance. To ensure comparability across markets, we normalize the average values for each transaction cost level of a market with the corresponding value when transaction costs are zero. Thus, the table shows changes in the average number of trades relative to the zero-transaction cost scenario (e.g., a value of 0.9 implies a 10% decrease in the average number of trades among all outperforming rules at a given transaction cost level compared with the corresponding average number of trades of all outperforming rules at zero transaction costs). The results indicate that the average number of trades decreases in transaction costs for most developed and emerging markets. Thus, as conjectured above, the share of outperforming rules with frequent trading signals declines for higher transaction costs. Table 9 presents the results for the average holding period conditional on transaction costs. The reported figures are calculated analogously to those presented in Table 8. According to that, in almost all markets, the average holding periods increase monotonically with transaction costs compared to the case of zero transaction costs.
The results from Tables 8 and 9 suggest that the share of outperforming rules with infrequent trading signals and longer average holding periods tends to increase as transaction costs rise. Based on that, one may expect relatively more outperforming rules which mimic a simple buy-and-hold strategy when transaction costs are high. However, this is not the case. Despite the longer average holding periods, we observe a decrease in the average time invested in 17 of the 21 markets for increasing transaction costs. Thus, the outperforming rules tend to exploit very specific price patterns and have longer periods with no exposure to the market when transaction costs are high.
7 Out-of-sample results
In this section, we address two fundamental problems that arise in analyses of technical trading rules over long sample periods. First, using technical trading rules requires active trading, and technical analysis has been shown to be used by amateur and professional investors who deliberately engage in active portfolio management (e.g., Faugère et al. 2013, Lease et al. 1980). Thus, the common convention in the literature of testing trading rules over periods of several decades most often diverges from the short-term oriented trading behavior of individuals who may aim to apply heuristics which they find most valuable at a given point in time. Second, backtests (such as those conducted in this paper so far) only provide ex-post information on whether trading rules could have been traded profitably. However, in-sample outperformance does not imply out-of-sample outperformance, nor does it give any indication of how to select the best rules to trade in the future. So far, we cannot answer the question whether a trader could have exploited the predictability of some of the investigated markets (at least if trading were free or very cheap) by implementing a profitable trading system based on the studied technical rules. We approach this task by performing an out-of-sample persistence analysis of technical trading performance in the following.
To test out-of-sample performance, we mimic the trading activities of a technical trader, similar to Bajgrowicz and Scaillet (2012). For each market, we divide the sample periods into 36 month subperiods and select the trading rules with the best performance in terms of the highest excess Sharpe ratio (we refer to these 36 month intervals in which in-sample performance of technical trading rules is evaluated as “testing periods”).Footnote 31 The best-performing rule during a testing period is used for trading during the subsequent 36 month period (which we refer to as “trading period”). When a trading signal is generated by the selected rule during a trading period, the index is purchased. A potentially open position is automatically closed at the end of a trading period. We apply the double-or-out strategy as in the previous analyses to mitigate true short positions.
Table 10 presents the results based on single-trip transaction costs of 0, 10, 25, and 50 basis points.Footnote 32 The reported performance measure is the difference between annualized Sharpe ratios for technical trading returns and the corresponding buy-and-hold benchmark returns. In addition to the out-of-sample performance, the table also reports the corresponding in-sample performance for each transaction cost level (i.e., the in-sample results are based on the best rules identified during the testing periods, which are used to evaluate the out-of-sample performance during subsequent trading periods). The statistical significance of the performance is evaluated using a hypothesis test proposed by Bailey and López de Prado (2012), which assesses the null hypothesis that two Sharpe ratios (computed from potentially nonnormal return distributions) are equal.
The analysis shows that the in-sample performance is sizable (with excess Sharpe ratios mostly above 1) and highly statistically significant even at single-trip transaction costs of 50 basis points. This may not be surprising since in-sample performance is by construction the result of extensive data snooping. More importantly, the out-of-sample performance is mostly insignificant or negative and significant. For higher single-trip transaction costs of 25 and 50 basis points, we observe significant out-of-sample underperformance in 14 and 18 markets, respectively, at least at the 10% significance level. The findings imply that best-performing technical trading rules do not generate persistent performance over relatively short time horizons, and that such a trading system is detrimental compared with simply buying and holding the respective market index. This impression is confirmed in panel C of Table 10, which reports results for equally weighted portfolios of developed market indices, emerging market indices, and all market indices, respectively. All in-sample performance measures are positive and statistically significant at the 1% level for the four transaction cost scenarios. In contrast, the out-of-sample performance measures are insignificant for single-trip costs of 0 basis points and turn negative and significant at positive transaction costs in most of the considered cases [e.g., for single-trip costs of 50 basis points, the performance measure equals − 0.315 (significant at the 5% level) for the developed markets portfolio as well as − 0.507 and − 0.385 for the emerging markets and all-markets portfolio (both significant at the 1% level)]. We reestimate the selection algorithm using excess raw returns as an alternative performance measure. The results presented in Table 11 in the appendix confirm our baseline findings obtained with the Sharpe ratio criterion.
Panels A–C in Fig. 2 plot the performance of the three portfolios over the entire sample period for single-trip transaction costs of 0 and 25 basis points. Here, performance is measured as the in-sample and out-of-sample excess returns (i.e., the daily differences between the equally weighted returns from technical trading and the equally weighted returns from the buy-and-hold benchmark strategy). All three panels show that the in-sample excess returns are substantial and comparatively unaffected by transaction costs. In contrast, out-of-sample performance is regressive most of the time, and all portfolios generate negative returns over the full sample period, even in the absence of transaction costs.
In summary, the results presented in this section demonstrate that past superior performance of technical trading rules does not persist in the near future. This seriously calls into question whether the studied rules could have been traded at any profit. Based on our simple selection algorithm for trading rules, technical trading is harmful for investment performance across the broad range of considered markets. Our results corroborate those of Bajgrowicz and Scaillet (2012), who also measure very poor out-of-sample performance of simple technical trading rules applied to the DIJA.
8 Conclusions
This paper provides a comprehensive analysis of a broad universe of simple technical trading rules applied to a total of 23 developed market indices and 18 emerging market indices using the Stepwise Superior Predictive Ability Test. The novelty of this test is its ability to identify the whole subset of trading rules with superior performance relative to a buy-and-hold benchmark, while accounting for data snooping bias. To the best of our knowledge, we are the first to apply this powerful statistical test to a large set of trading rules in a comparative analysis of multiple stock markets.
Our in-sample results show that technical trading rules have predictive power in some markets during relatively early periods when transaction costs are ignored. However, in recent years, the investigated technical rules do not have predictive power anymore. To evaluate the impact of transaction costs, we run a stepwise algorithm to determine the number of outperforming trading rules for different transaction cost levels. This analysis reveals a high sensitivity of trading performance to moderate single-trip transaction costs. Moreover, an out-of-sample analysis suggests that the performance of the best technical heuristics is generally not persistent over shorter time periods in the future. These in-sample best trading rules tend to significantly underperform out-of-sample even when transaction costs are low.
The existing literature on the profitability of technical trading rules is relatively comprehensive, but it shows inconclusive results and relies mainly on limited data or outdated statistical tests. We provide extensive empirical evidence that simple technical rules do not achieve data snooping-free outperformance of various stock indices. This is true even for markets that are considered far less information efficient than the extensively studied US stock market. Overall, our results cast doubt on the economic value of technical trading rules that have been found to generate superior performance by several previous studies based on tests with less statistical rigor. The results strongly suggest that the investigated trading signals are noise and that a trading strategy which follows these signals ultimately underperforms the market due to an accumulation of transaction costs.
Our analysis is limited with respect to several dimensions, so the results of this study should not be generalized beyond the scope of the trading rules and data examined in this paper. Clearly, the examined rules represent only a subset of the potentially testable technical trading rules and are among the simplest used in practice. There are several, more subtle technical strategies that are not part of the universe tested in this paper. Moreover, the analysis is limited to daily close prices of national stock market indices and does not provide evidence for more detailed data such as intraday stock prices. The boom in high-frequency algorithmic trading in recent years, driven by increasing computational capabilities, may have created highly specialized investors who generate excess returns through technical trading. More recent empirical evidence related to this subject by Batten et al. (2015) and Gebka et al. (2014) point in this direction. However, the nature, complexity, and trading horizon of these technical approaches drastically differ from the naive heuristics typically employed by individual investors.
Despite the poor performance of simple technical rules, they are still widely used by market participants. Future research should investigate whether there may be other, nonmonetary factors that motivate these investors to apply technical analysis. Potential preference-based reasons for trading technical signals and the behavior of technical traders have hardly been addressed in the finance literature.
Notes
Throughout this paper, we use the terms “technical trading rules,” “technical trading heuristics,” or “technical trading strategies” interchangeably.
For example, the survey studies by Hoffmann and Shefrin (2014) and Lease et al. (1974) suggest that about one-fourth to one-third of retail investors regularly use technical analysis to come up with trading decisions. Menkhoff (2010) shows that fund managers heavily rely on technical analysis to predict prices with a forecast horizon of less than two months. Survey responses analyzed by Cheung and Chinn (2001), Gehrig and Menkhoff (2003, 2004), and Menkhoff (1997) highlight the importance of technical trading for foreign exchange traders.
Research results in favor of the use of simple technical trading heuristics in Western equity markets are provided by Alexander (1961, 1964), Benington and Jensen (1970), Bessembinder and Chan (1995), Brock et al. (1992), Hsu and Kuan (2005), Levy (1967a, 1967b, 1968), Neely et al. (2010), Neftci (1991), and Sullivan et al. (1999), among others. The empirical findings of Allen and Karjalainen (1999), Bajgrowicz and Scaillet (2012), Blume and Fama (1966), Day and Wang (2002), and Neely (2003) present empirical evidence against the profitability of technical trading rules. In addition, Hsu et al. (2016), Neely et al. (1997), Neely and Weller (2011), and Sweeney (1986) investigate applications of technical analysis in foreign exchange markets and find mixed results. Comprehensive reviews of the technical analysis literature are provided by Farias Nazário et al. (2017) and Irwin and Park (2007). Adekoya et al. (2019) review the literature on stock market predictability based on machine learning techniques, most often combined with technical analysis. A new strand of literature examines whether technical analysis is beneficial in cryptocurrency markets. Detzel et al. (2020) show theoretically that moving averages can add value for trading assets with hard-to-value fundamentals such as Bitcoin. Empirical support for technical trading in cryptocurrency markets is presented by Ahmed et al. (2019), Bazán-Palomino and Svogun (2022), Cohen (2021), and Hudson and Urquhart (2021). Hackethal et al. (2022) document that individual investors trading structured retail products for cryptocurrencies are also more likely to use moving average heuristics.
Terms such as “p-hacking” or “data dredging” may be more common and are often used as synonyms for “data snooping” (see, for example, Harvey (2017) for a recent discussion of the problem of p-hacking in scientific research). The latter term is commonly used in the literature on technical analysis as well as on multiple hypothesis tests suitable for processing large sets of technical trading rules (e.g., Hansen 2005; Romano and Wolf 2005; Sullivan et al. 1999; White 2000). Therefore, we stick to this terminology in this paper as well.
It is important to note that our study focuses exclusively on the application of time series strategies at the index level and does not consider cross-sectional technical trading approaches such as momentum (Jegadeesh and Titman 1993; Rouwenhorst 1998) or contrarian strategies (Lo and MacKinlay 1990a). However, we include a wide range of moving average trading rules that can be considered a time series equivalent of cross-sectional momentum trading (see Marshall et al. 2017; Moskowitz et al. 2012). The previous literature has extensively covered cross-sectional momentum, with comprehensive reviews on the topic provided by, for example, Subrahmanyam (2018) and Wiest (2023).
The covered sample periods depend on data availability and first stock exchange notation of national stock indices.
A detailed discussion of data snooping and the test procedure to evaluate the performance of technical trading rules is provided in the next section.
In this paper, we refer to technical trading rules whose performance is significantly better than that of a buy-and-hold benchmark strategy as “outperforming rules,” “significantly performing rules,” or “predictive rules.” Similarly, we consider stock indices where the benchmark strategy is significantly outperformed by at least one technical trading rule as “predictable.” Technical rules that have the highest performance in the entire sample of trading rules are referred to as “best-performing trading rules.” However, these rules do not necessarily outperform the benchmark strategy with statistical significance.
In particular, the performance of technical rules that generate trading signals less frequently tends to be less sensitive to transaction costs. Thus, if transaction costs are high, heuristics with few trading signals may (all else equal) perform better than others that generate more frequent trading signals.
In-sample performance is measured with the trading rule with the highest Sharpe ratio during a 3 year period. In contrast, out-of-sample performance is measured with the Sharpe ratio of the trading rule that had the highest Sharpe ratio during the previous 3 year period.
Jiang et al. (2019) use the Stepwise Superior Predictive Ability Test to study technical trading in the Chinese stock market. They report evidence for superior in-sample performance of some rules compared with a buy-and-hold strategy. Furthermore, Chen et al. (2015) analyze the profitability of candlestick trading strategies applied to stocks listed in the Dow Jones Industrial Average (DJIA) and report positive results. Prior studies, such as the seminal paper by Sullivan et al. (1999) examines a similar set of rules in the US stock market. However, the paper relies on the Reality Check of White (2000), which can only evaluate the statistical significance of the best trading rule from a potentially large set of rules.
For instance, testing the 6406 trading rules used in our empirical analysis at \(\alpha = 0.1\) yields an individual significance level of less than 0.000016 for each trading rule.
The large number of trading rules used in this studied mainly results from slight parameter variations, so that the generated trading signals typically cluster within short time periods for heuristics with similar parameterization (see Etheber 2014).
This effect can be exemplified by the Bonferroni correction as noted by Hsu et al. (2016). If one raises the number of poor models, the familywise error rate declines and the hurdle to reject the null hypothesis for the performance of the best model increases. This makes it less likely that significant outperformance will be detected.
A recent strand of the literature introduces the false discovery rate methodology to evaluate the performance of portfolios formed with technical trading rules (cf., Bajgrowicz and Scaillet 2012). Introduced by Benjamini and Hochberg (1995) and first employed for finance applications by Barras et al. (2010), the technique reduces the number of false positives in multiple hypothesis testing by allowing a small fraction of false discoveries. The false discovery rate approach is a suitable alternative to the data snooping methods presented in this section.
To avoid confusion, we note that filter rules are a standalone class of technical trading heuristics, while trading filters are used to refine any type of technical trading rule.
The IMF does not differentiate between emerging and developing markets in a stricter sense. However, the countries in our list are generally considered as emerging markets by most analyst firms. For instance, Financial Times Stock Exchange (FTSE) Group (2016) lists all countries in our sample except for Argentina (which they have on their watch list as a candidate) as either “advanced emerging,” “secondary emerging,” or “frontier” countries in their Annual Country Classification Review from September 2016. For simplicity, we refer to all these countries as emerging markets throughout this paper.
For example, Hong Kong and Singapore are classified as developing countries by the United Nations in its 1974 World Economic Survey (United Nations 1975).
Unless otherwise noted, the index returns in this study are always adjusted for inflation. For the sake of simplicity, we will not highlight this at every point.
For example, Gatev et al. (2006) postpone trades of a pairs trading strategy by one trading day to reduce the risk of upward biased returns caused by potentially high bid–ask spreads on signal days.
According to Grossman and Stiglitz (1980), capital markets cannot be perfectly information efficient because a compensation for the costs associated with trading and obtaining information is necessary to encourage participation. Otherwise, there would be no incentive to trade. These costs may vary strongly across the markets in our sample.
Note that the summary statistics for time invested, mean return per trade, fraction of winners, number of trades per year, and mean holding period in Table 5 are not computed based on the double-or-out strategy (which is, however, used to calculate the performance measure) to give an “undistorted” view on the performance of the rules. For example, under the double-or-out scheme, short signals require a market-neutral position and result in returns of zero (or the risk-free rate), leading to statistics that are meaningless for evaluating the actual trading characteristics of technical trading rules.
This finding is similar to other studies on technical analysis. Sullivan et al. (1999) use an identical set of rules except for relative strength index rules and find that a 5 day moving average exhibits the highest Sharpe ratio based on daily market data of the DJIA. Similarly, Bajgrowicz and Scaillet (2012) report that moving averages and support and resistance rules perform best for the DJIA in several subperiods during 1897 and 2011. Hsu et al. (2010) analyze the predictive ability of 30 currency pairs and find that moving average heuristics perform the best in most cases.
The length of the subperiods does not significantly affect the overall results. We choose intervals of 7 years to have at least two subperiods per country of approximately equal length and to obtain samples that contain enough observations to have meaningful test power.
Moreover, the adaptive market hypothesis allows for cases in which markets can go through cycles of varying efficiency as well as changes in the structure and intensity of competition among market participants (Lo 2004). This may be one reason for the rare cases in which the number of outperforming rules rises again after a previous decline (e.g., the number of outperforming rules for Japan’s Nikkei drops to zero between 1960 and 1966 and then rises again to 18 in the subsequent subperiod).
Friedman (1996) reports that the share of institutional investors in the USA increased from 10% in 1950 to about 50% in 1990. Gompers and Metrick (2001) show that the share of large institutional investors in the stock market almost doubled between 1980 and 1996. More recently, high-frequency trading has captured a significant share of trading volume. Kaya (2016) reports that the share of high-frequency trading in the European stock market surged from nearly 0% in 2005 to over 50% in 2010. During the same period, high-frequency trading accounted for about 20% and more than 50% of the trading volume in the US stock market.
We choose the US market due to the longest available sample period and as it is the most investigated market in the literature. The Peruvian index is an example of a market that is predictable by comparatively many rules for both zero and moderate transaction costs.
For clarity, the table only reports results for markets that have at least one rule with statistically significant performance at least at the 10% level for single-trip transaction costs of five basis points, as presented in Table 7.
To check for robustness, we also employ excess raw returns as a second performance measure. The corresponding results are discussed later.
These transaction cost estimates are similar to the values considered in previous studies. Allen and Karjalainen (1999), for example, also use 10, 25, and 50 basis points to evaluate the impact of transaction costs on the performance of technical trading rules applied to the S&P 500 between 1928 and 1995. Ready (2002) uses single-trip trading costs of 13 basis points for the DJIA between 1962 and 1999, and Bajgrowicz and Scaillet (2012) study the DJIA for single-trip trading costs of 12.5 basis points for the period from 1897 to 2011.
Politis and Romano (1994) compute the variance estimator, \({\hat{\sigma }}_k^2\), for model k as \({\hat{\sigma }}_{k}^2 = {\hat{\gamma }}_{0,k}+2\sum _{i=1}^{N-i} \omega _{n,i} {\hat{\gamma }}_{i,k}\) with kernel weights under the stationary bootstrap given by \(\omega _{n,i} = \frac{N-i}{N}\left( 1-q\right) ^i + \frac{i}{N}\left( 1-q\right) ^{N-i}\), where q is the parameter of the geometric distribution, and with the empirical covariance \({\hat{\gamma }}_{i,k} = N^{-1}\sum _{j=1}^{N-i} \left( \pi _{j,k}-{\bar{\pi }}_{k}\right) \left( \pi _{j+i,k}-{\bar{\pi }}_{k}\right)\).
Hansen (2005) shows that poor models are irrelevant for the asymptotic null distribution. However, they cannot be neglected in finite samples as the impact on the distribution of the test statistic of models being little worse than the benchmark may be huge. Therefore, Hansen (2005) does not exclude any models but reduces the influence of the poorest ones. To do so, \(-\sqrt{2\log \log N}\) in expression (8) serves as a threshold and is a result of an application of the law of the iterated logarithm.
References
Adekoya, A.F., Asubam, B.A., Nti, I.K.: A systematic review of fundamental and technical analysis of stock market predictions. Artif. Int. Rev. 53(4), 3007–3057 (2020)
Ahmed, S., Grobys, K., Sapkota, N.: Technical trading rules in the cryptocurrency market. Finan. Res. Lett. 32, 101396 (2020)
Alexander, S.: Price movements in speculative markets: trends or random walks. Ind. Manag. Rev. 2(2), 7–26 (1961)
Alexander, S.: Price movements in speculative markets: trends or random walks, number 2. Ind. Manag. Rev. 5(2), 25–46 (1964)
AlKhazali, O.: Does infrequent trading make a difference on stock market efficiency? Evidence from the gulf cooperation council (GCC) countries. Stud. Econ. Finance 28(2), 96–110 (2011)
Allen, F., Karjalainen, R.: Using genetic algorithms to find technical trading rules. J. Finan. Econ. 51(2), 245–271 (1999)
Almazan, A., Brown, K.C., Carlson, M., Chapman, D.A.: Why constrain your mutual fund manager? J. Finan. Econ. 73(2), 289–321 (2004)
Araújo Lima, E.J., Tabak, B.M.: Tests of the random walk hypothesis for equity markets: evidence from China, Hong Kong and Singapore. Appl. Econ. Lett. 11(4), 255–258 (2004)
Bailey, D.H., López de Prado, M.: The Sharpe ratio efficient frontier. J. Risk 15(2), 3–44 (2012)
Bajgrowicz, P., Scaillet, O.: Technical trading revisited: false discoveries, persistence tests, and transaction costs. J. Finan. Econ. 106(3), 473–491 (2012)
Barras, L., Scaillet, O., Wermers, R.: False discoveries in mutual fund performance: measuring luck in estimated alphas. J. Finance 65(1), 179–216 (2010)
Batten, J.A., Lucey, B.M., McGroarty, F., Peat, M., Urquhart, A.: Does technical analysis beat the market? – Evidence From high frequency trading in gold and silver, Working Paper (2015)
Bazán-Palomino, W., Svogun, D.: Technical analysis in cryptocurrency markets: do transaction costs and bubbles matter? J. Int. Finan. Mark Institut. Money 79, 101601 (2022)
Benington, G.A., Jensen, M.C.: Random walks and technical theories: some additional evidence. J. Finance 25(2), 469–482 (1970)
Benjamini, Y., Hochberg, Y.: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. Royal Stat. Soci., Series B 57(1), 289–300 (1995)
Bessembinder, H., Chan, K.: The profitability of technical trading rules in the Asian stock markets. Pacif.-Basin Finance J. 3(2–3), 257–284 (1995)
Bessembinder, H., Chan, K.: Market efficiency and the returns to technical analysis. Finan. Manag. 27(2), 5–17 (1998)
Bhuyan, R., Mobarek, A., Mollah, S.: Market efficiency in emerging stock market: evidence from Bangladesh. J. Emerg. Market Finance 7(1), 17–41 (2008)
Blume, M.E., Fama, E.F.: Filter rules and stock-market trading. J. Bus. 39(1), 226–241 (1966)
Brock, W., Lakonishok, J., LeBaron, B.: Simple technical trading rules and the stochastic properties of stock returns. J. Finance 47(5), 1731–1764 (1992)
Camilleri, S.J., Green, C.J.: Stock market predictability: non-synchronous trading or inefficient markets? Evidence from the national stock exchange of India. Stud. Econ. Finance 31(4), 354–370 (2014)
Campbell, J.Y., Grossman, S.J., Wang, J.: Trading volume and serial correlation in stock returns. Quart. J. Econ. 108(4), 905–939 (1993)
Chan, L. K. C., & Lakonishok, J.: Institutional trades and intraday stock price behavior. J. Financ. Econ. 33(2), 173–199 (1993)
Chen, C.-W., Huang, C.-S., Lai, H.-W.: The impact of data snooping on the testing of technical analysis: an empirical study of Asian Stock markets. J. Asian Econ. 20(5), 580–591 (2009)
Chen, Y.-C., Hsu, Y.-C., Lu, T.-H.: Trend definition or holding strategy: what determines the profitability of candlestick charting? J. Bank. Finance 61(C), 172–183 (2015)
Cheung, Y.-W., Chinn, M.D.: Currency traders and exchange rate dynamics: a survey of the US market. J. Int. Money Finance 20(4), 439–471 (2001)
Cohen, G.: Optimizing candlesticks patterns for bitcoin’s trading systems. Rev. Quantit. Finance Account. 57(3), 1155–1167 (2021)
D’Avolio, G.: The market for borrowing stock. J. Finan. Econ. 66(2–3), 271–306 (2002)
Day, T.E., Wang, P.: Dividends, nonsynchronous prices, and the returns from trading the dow jones industrial average. J. Empir. Finance. 9(4), 431–454 (2002)
Detzel, A., Liu, H., Strauss, J., Zhou, G., Zhu, Y.: Learning and predictability via technical analysis: evidence from bitcoin and stocks with hard-to-value fundamentals. Finan. Manag. 50(1), 107–137 (2020)
Duffie, D., Gârleanu, N., Pedersen, L.H.: Securities lending, shorting, and pricing. J. Finance Econ. 66(2–3), 307–339 (2002)
Etheber, T.: Trading volume effects of moving average heuristics, Working Paper (2014)
Fama, E.F.: The behavior of stock market prices. J. Bus. 38(1), 34–105 (1965)
Fama, E.F.: Efficient capital markets, A review of theory and empirical work. J. Finance 25(2), 383–417 (1970)
Farias Nazário, R.T., Kimura, H., Silva, J.L., Sobreiro, V.A.: A literature review of technical analysis on stock markets. Quart. Rev. Econ. Finance 66(C), 115–126 (2017)
Faugère, C., Smith, D., Wang, Y.: Head and shoulders above the rest? The performance of institutional portfolio managers who use technical analysis, Working Paper (2013)
Friedman, B.: Economic implications of changing share ownership. J. Portf. Manag. 22(3), 59–70 (1996)
FTSE Group (2016) FTSE annual country classification review, retrieved from http://www.ftse.com/products/downloads/FTSE-Country-Classification-Update_latest.pdf on October 16, 2019
Gatev, E., Goetzmann, W.N., Rouwenhorst, K.G.: Pairs trading: performance of a relative-value arbitrage rule. Rev. Financial Stud. 19(3), 797–827 (2006)
Gebka, B., Hudson, R., Manahov, V.: Does high frequency trading affect technical analysis and market efficiency? And if so, how? J. Int. Financ. Mark., Institut. Money 28(1), 131–157 (2014)
Gehrig, T., Menkhoff, L.: Technical analysis in foreign exchange - the workhorse gains further ground, Working Paper (2003)
Gehrig, T., Menkhoff, L.: The use of flow analysis in foreign exchange: exploratory evidence. J. Int. Money Finance 23(4), 573–594 (2004)
Gompers, P., Metrick, A.: Institutional investors and equity prices. Quart. J. Econ. 116(1), 229–259 (2001)
Grossman, S.J., Stiglitz, J.E.: On the impossibility of informationally efficient markets. Am. Econ. Rev. 70(3), 393–408 (1980)
Hackethal, A., Hanspal, T., Lammer, D.M., Rink, K.: The characteristics and portfolio behavior of bitcoin investors - evidence from indirect cryptocurrency investments. Rev. Finance 26(4), 855–898 (2022)
Hansen, P.R.: A test for superior predictive ability. J. Bus. & Econ. Stat. 23(4), 365–380 (2005)
Harvey, C.R.: Presidential address: the scientific outlook in financial economics. J. Finance 72(4), 1399–1440 (2017)
Hoffmann, A.O.I., Shefrin, H.: Technical analysis and individual investors. J. Econ. Behav. Organ. 107(4), 487–511 (2014)
Hsu, P.-H., Hsu, Y.-C., Kuan, C.-M.: Testing the predictive ability of technical analysis using a new stepwise test without data snooping bias. J. Empir. Finance 17(3), 471–484 (2010)
Hsu, P.-H., Kuan, C.-M.: Reexamining the profitability of technical analysis with data snooping checks. J. Finan. Econ. 3(4), 606–628 (2005)
Hsu, P.-H., Taylor, M.P., Wang, Z.: Technical trading: is it still beating the foreign exchange market? J. Int. Econ. 102(C), 188–208 (2016)
Hudson, R., Urquhart, A.: Technical trading and cryptocurrencies. Ann. Operat. Res. 297(1), 191–220 (2021)
International monetary fund: world economic outlook april: too slow for too long, Washington (2016)
Irwin, S., Park, C.-H.: What do we know about the profitability of technical analysis? J. Econ. Surv. 21(4), 786–826 (2007)
Jegadeesh, N., Titman: Returns to buying winners and selling losers: implications for stock market efficiency. J. Finance 48(1), 65–91 (1993)
Jiang, F., Song, G., Tong, G.: Technical Analysis profitability without data snooping bias: evidence from Chinese stock market. Int. Rev. Finance 19(1), 191–206 (2019)
Jones, C.M.: A century of stock market liquidity and trading costs, Working Paper (2002)
Jones, C.M., Lamont, O.A.: Short-sale constraints and stock returns. J. Financial Econ. 66(2–3), 207–239 (2002)
Kaya, O.: High-frequency trading: reaching the limits (2016). retrieved from http://www.dbresearch.com/PROD/RPS_ENPROD/PROD0000000000454703/Research_Briefing%3A_High-frequency_trading.PDF on June 7, 2022
Kish, R.J., Kwon, K.-Y.: A comparative study of technical trading strategies and return predictability: an extension of Brock, Lakonishok, and LeBaron (1992) using NYSE and NASDAQ indices. Quart. Rev. Econ. Finance 42(3), 611–631 (2002)
Lease, R.C., Lewellen, W.G., Schlarbaum, G.G.: The individual investor: attributes and attitudes. J. Finance 29(2), 413–433 (1974)
Lease, R. C., Lewellen, W. G., & Schlarbaum, G. G.: Portfolio design and portfolio performance: the individual investor. J. Econ. Bus. 32(3), 185–197 (1980)
LeBaron, B.: The stability of moving average technical trading rules on the Dow Jones index. Derivatives Use Trad. Regul. 5(4), 324–338 (2000)
Lesmond, D.A.: Liquidity of emerging markets. J. Finan. Econ. 77(2), 411–452 (2005)
Levy, R.A.: Random walks: reality or myth. Finan. Anal. J. 23(6), 69–77 (1967)
Levy, R.A.: Relative strength as a criterion for investment selection. J. Finance 22(4), 595–610 (1967)
Levy, R.A.: Random walks: reality or myth - reply. Finan. Anal. J. 24(1), 129–132 (1968)
Lo, A.W.: The adaptive markets hypothesis. J. Portf. Manag. 30(5), 15–29 (2004)
Lo, A.W., MacKinlay, A.: When are contrarian profits due to stock market overreaction? Rev. Financial Stud. 3(2), 175–205 (1990)
Lo, A.W., MacKinlay, A.: Data snooping biases in tests of financial asset pricing models. Rev. Finan. Stud. 3(3), 431–467 (1990)
Lo, A.W., Wang, J.: Trading volume: definitions, data analysis, and implications of portfolio theory. Rev. Finan. Stud. 13(2), 257–300 (2000)
Malkiel, B.G.: A random walk down wall street, revised edition, United States, New York (1999)
Marshall, B.R., Nguyen, N., Visaltanachoti, N.: Time-series momentum versus technical analysis. Quantit. Finance 17(3), 405–421 (2017)
Menkhoff, L.: Examining the use of technical currency analysis. Int. J. Finance Eco. 2(4), 307–318 (1997)
Menkhoff, L.: The use of technical analysis by fund managers: international evidence. J. Bank. Finance 17(3), 547–564 (2010)
Merton, R.: On the current state of the stock market rationality hypothesis. In: Dornbusch, R., Fischer, S., Bossons, J. (eds.) Macroeconomics and finance: essays in honor of Franco Modigliani. United States, Cambridge (1987)
Moskowitz, T.J., Ooi, Y.H., Pedersen, L.H.: Time series momentum. J. Finan. Econ. 104(2), 228–250 (2012)
Nagel, S.: Short sales, institutional investors and the cross-section of stock returns. J. Finan. Econ. 78(2), 277–309 (2005)
Neely, C.J.: Risk-adjusted, ex ante, optimal technical trading rules in equity markets. Int. Rev. Econ. Finance 12(1), 69–87 (2003)
Neely, C.J., Dittmar, R., Weller, P.A.: Is technical analysis in the foreign exchange market profitable? A genetic programming approach. J. Finan. Quantit. Anal. 32(4), 405–426 (1997)
Neely, C. J., Rapach, D. E., Tu, J., Zhou, G.: Out-of-sample equity premium prediction: economic fundamentals vs. moving-average Rules, Federal Reserve Bank of St. Louis, Working Paper (2010)
Neely, C.J., Weller, P.A.: Technical analysis in the foreign exchange market, federal reserve bank of St. Louis, Working Paper (2011)
Neely, C.J., Weller, P.A., Ulrich, M.: The adaptive markets hypothesis: evidence from the foreign exchange market. J. Finan. Quantit. Anal. 44(2), 467–488 (2009)
Neftci, S.N.: Naive trading rules in financial markets and wiener-kolmogorov prediction theory: a study of “technical analysis’’. J. Bus. 64(4), 549–571 (1991)
Olsen, D.: Have trading rule profits in the currency markets declined over time? J. Bank. Finance 28(1), 85–105 (2004)
Politis, D.N., Romano, J.P.: The stationary bootstrap. J. Am. Stat. Associat. 89(428), 1303–1313 (1994)
Poshakwale, S.: The random walk hypothesis in the emerging Indian stock market. J. Bus. Finance Account. 29(9 & 10), 1275–1299 (2002)
Pring, M.J.: Technical analysis explained, 5th edn. United States, New York (2014)
Ready, M.J.: Profits from technical trading rules. Finan. Manag. 31(3), 43–61 (2002)
Romano, J.P., Wolf, M.: Stepwise multiple testing as formalized data snooping. Econometrica 73(4), 1237–1282 (2005)
Rouwenhorst, K.G.: International momentum strategies. J. Finance 53(1), 267–284 (1998)
Scholes, M., Williams, J.: Estimating betas from nonsynchronous data. J. Finan. Econ. 5(3), 309–327 (1977)
Sharma, A., Thaker, K.B.: Market efficiency in developed and emerging markets. Afro-Asian J. Finance Account. 5(4), 311–333 (2015)
Sharpe, W.F.: Mutual fund performance. J. Bus. 39(1), 119–138 (1966)
Sharpe, W.F.: The Sharpe ratio. J. Portf. Manag. 21(1), 49–58 (1994)
Stoll, H.R., Whaley, R.E.: Transaction costs and the small firm effect. J. Finan. Econ. 12(1), 57–79 (1983)
Subrahmanyam, A.: Equity market momentum: a synthesis of the literature and suggestions for future work. Pacific-Basin Finance J. 51, 291–296 (2018)
Sullivan, R., Timmermann, A., White, H.: Data-snooping, technical trading rule performance, and the bootstrap. J. Finance 54(5), 1647–1691 (1999)
Sweeney, R.J.: Beating the foreign exchange market. J. Finance 41(1), 163–182 (1986)
Timmermann, A.: Elusive return predictability. Int. J. Forecast. 24(1), 1–18 (2008)
Timmermann, A., Granger, C.W.J.: Efficient market theory and forecasting. Int. J. Forecast. 20(1), 15–27 (2004)
United Nations: world economic survey, 1974 - Part One. Mid-term review and appraisal of progress in the implementation of the international development strategy, New York, United States (1975)
White, H.: A reality check for data snooping. Econometrica 68(5), 1097–1126 (2000)
Wiest, T.: Momentum: what do we know 30 years after Jegadeesh and Titman’s seminal paper? Finan. Mark. Portf. Manag. 37(1), 95–114 (2023)
Wilder, J.W.: New concepts in technical trading systems. Greensboro, United States (1978)
Acknowledgements
The author thanks an anonymous referee as well as the editor, Markus Schmid, for their helpful suggestions.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1: Implementation of the stepwise superior predictive ability test
In this paper, the objective is to test whether trading according to signals of m technical trading rules outperforms a buy-and-hold strategy. Performance is measured in terms of the Sharpe ratio, such that the performance measure for a trading rule k is defined as \(\theta _{k}= \left( {{\bar{\pi }}_{k}-{\bar{r}}^{f}}\right) /{{\hat{\sigma }}_k} - \left( {{\bar{r}}-{\bar{r}}^{f}}\right) /{{\hat{\sigma }}_0}\) (cf., equation (3)). We use the Stepwise Superior Predictive Ability Test introduced by Hsu et al. (2010) to obtain p-values and control for data snooping bias.
Hsu et al. (2010) employ the stationary bootstrap by Politis and Romano (1994) to resample original time series. We outline the general procedure of the bootstrap using the matrix \(\Pi =\left( \pi _{t,k}\right) _{\begin{array}{c} t=1,\dots ,N \\ k=1,\dots ,m \end{array}}\) of strategy returns. The indices of each vector \(\pi _k = (\pi _{1,k},\pi _{2,k},\dots ,\pi _{N,k})^\prime\) are resampled yielding a pseudo-time series \(\pi _{k}^b \equiv \pi _{k}(n_{t,b})\) for the b-th bootstrap sample, \(b = 1,\dots ,B\). The indices \(n_{t,b}\) result from resampling the original time indices \(1,\dots ,N\) in blocks of stochastic lengths, where the block lengths follows a geometric distribution with parameter \(q \in \left( 0,1\right]\). Due to the properties of the geometric distribution, blocks have an expected length of 1/q. The resampling of blocks is implemented with two \(N \times B\) random matrices with independent entries \(u_{t,b}, v_{t,b} \sim \text {unif} \left( 0,1\right]\). The indices are initialized by setting
where \(\lceil \cdot \rceil\) is the ceiling function that rounds its argument to the nearest integer towards infinity. The indices for \(t=2,\dots ,N\) are then determined by
where \(\mathbbm {1}\left\{ \cdot \right\}\) is the indicator function. Thus, resampled indices are determined recursively, where the realization of the random variable \(v_{t,b}\) relative to q determines whether a new block is started, and the start index of a block is determined by the random variable \(u_{t,b}\). Finally, we obtain B bootstrapped matrices, \(\Pi ^b=\left( \pi _1^b, \pi _2^b,\dots ,\pi _m^b\right)\), \(b = 1,\dots ,B\), by using the b-th resample of indices to reorder all columns of \(\Pi\).
The stationary bootstrap is jointly applied to the time series of trading returns, the risk-free rate, and the benchmark returns, so that we can compute bootstrapped versions of \(\theta _k\) of the form
where \({\hat{\sigma }}_{k}^b\) and \({\hat{\sigma }}_{0}^b\) are the standard deviations of resampled strategy returns and benchmark returns based on bootstrap sample b using the bootstrapped variance estimator of Politis and Romano (1994).Footnote 33 This finally yields B vectors of bootstrapped performance measures, \(\theta ^b =\) \(\left( \theta _1^b,\theta _2^b,\dots ,\theta _m^b\right)\).
To implement the stepwise procedure, the elements of the vector of original performance measures, \(\theta = (\theta _1,\theta _2,\dots ,\theta _m)\), are sorted in descending order. Let, moreover, j and \(m_j\) denote the step of the stepwise test and the number of rejections of the null hypothesis (i.e., the number of models with true outperformance) at step j. To initialize the first step, let \(j = 1\) and \(m_0 = 0\). We compute an empirical null distribution \(\{\phi _j^b\}_{b=1,\dots ,B}\), where
with
Expression (8) formulates a measure to recenter the bootstrapped distribution for models performing particularly bad.Footnote 34 We use \(\{\phi _j^b\}_{b=1,\dots ,B}\) and compute its \(\left( 1-\alpha \right)\) quantile, \(q\left( \alpha \right)\), with \(\alpha\) being the allowed error rate. The subset of models with performance measures satisfying \(\sqrt{N}\theta _k > q\left( \alpha \right)\), \(k = m_{j-1}+1,\dots ,m\), are recorded as significantly outperforming models. If the number of rejected models at this step, \(m_j\), is zero, the stepwise procedure terminates. Otherwise, if \(m_j>0\), we set \(j=j+1\) and proceed with the next step by recomputing the empirical null distribution based on the new subset of models (i.e., the set of models that does not contain all models that have been identified as outperforming in the previous step).
The number of bootstraps, B, and the parameter of the geometric distribution determining the mean block length, q, are the two exogenous parameters of the stationary bootstrap of Politis and Romano (1994). White (2000) argues that 500 is a sufficiently large number of iterations for the stationary bootstrap. Regarding the parameter of the geometric distribution, Sullivan et al. (1999) show in a sensitivity analysis that the p-values computed with the Reality Check almost do not change if q is set to either 0.01, 0.1, or 0.5, respectively. Similarly, Hsu et al. (2016) also report only negligible sensitivity of results to the choice of bootstrap parameters, and Chen et al. (2009) find that the outcome of the bootstrap procedure is relatively insensitive to the choice of the two parameters. We, therefore, feel safe to set \(B= 500\) for the number of bootstrap resamples and \(q= 0.1\) as parameter for the geometric distribution in the empirical analysis of this study.
Appendix 2: Definition of trading filters and technical trading rules
1.1 Trading filters
1.1.1 Band filters
Band filters specify a multiplicative value, \(f_b>0\), which the price of a security is supposed to additionally exceed after a trading signal has been triggered. Thus, if a technical trading rule generates an unconditional long (short) signal, a long (short) position is only entered when the stock price subsequently increases (decreases) by another \(100 f_b\)%.
1.1.2 Time-delay filters
Time-delay filters impose a fixed time period, \(f_d\in {\mathbb {N}}\), by which the entry into a position is delayed. Thus, if a trading signal is generated in period t, a position is actually initiated in period \(t+f_d\).
1.1.3 Fixed-length filters
Fixed-length filters define a fixed holding period, \(f_l\in {\mathbb {N}}\), after which a position is automatically liquidated. More specifically, a position entered in period t will be liquidated in period \(t+f_l\), regardless of intermediate price movements or possible earlier signals. If a trading rule is not supplemented by a fixed-length filter, and unless otherwise specified, a short signal requires both the liquidation of a long position and the entry into a short position, and vice versa.
1.2 Technical trading rules
Relative Strength Index The relative strength index, \(RSI_t(n)\), in period t is the sum of the upward movements of a security price, \(P_t\), relative to the sum of the upward and absolute downward movements calculated over the last n periods. More formally, we compute the sum of upward and the sum of absolute downward movements as
Based on these expressions, the relative strength index is introduced as
Obviously, the RSI ranges from 0 to 100. Securities with extreme high or low relative strength index are considered overbought or oversold, respectively. Following Hsu et al. (2016), we consider a security to be oversold if its \(RSI_t(n)\le 50-v\), where v is a positive integer, for at least d periods. Then, a long signal is triggered when \(RSI_t(n)\) rises above \(50-v\). Short positions are initiated analogously.
1.2.1 Filter rules
The filter rules used by Sullivan et al. (1999) generate an initial long (short) signal as soon as the price of a security rises (falls) by more than \(100 x \%\). Subsequently, a long position is closed and a short position is entered once the price of the asset falls by at least \(100 x \%\) from the maximum price that occurred while holding that position. Similarly, a short position is closed and a long position is initiated when the price increases by at least \(100x \%\) from the minimum price observed during the holding period.
Let \(\tau\) denote the period in which the previous trading signal was triggered. A subsequent trading signal is generated in period \(t>\tau\) according to the following rules.
We also use an alternative definition for maxima and minima as introduced by Sullivan et al. (1999). According to that, a maximum (minimum) is defined as the most recent price that is higher (lower) than the prices of the previous e periods. This results in the following filter rules.
Both definitions of extrema lead to a permanent exposure to the asset as soon as the first trading signal is generated (i.e., there is no possibility for a market-neutral position). A third specification of the filter rules also allows for intermediate neutral positions. This is achieved by different percentage thresholds required to open or close a long or short position. While opening a position still requires a \(100x \%\) change in the asset price, closing a long (short) position requires a \(100y \%\) move below (above) the previous high (low). Let \(\tau\) denote the period in which the current position was initiated. The liquidation of this position in period t comes about as follows.
Moving Averages Let \(MA_t^n=n^{-1}\sum _{k=t-n+1}^t P_k\) be the simple moving average at time t with a rolling window of length n. A long (short) moving average trading signal is generated when the slow moving average, \(MA_t^{n_s}\), crosses the fast moving average, \(MA_t^{n_f}\), with \(n_s > n_f \ge 1\), from above (below):
Since Sullivan et al. (1999) also consider moving average rules with \(n_f=1\), the fast moving average in this case becomes \(MA_t^1=P_t\). In this case, we are actually dealing with a single moving average rule, where trading signals are generated by crossovers of the moving average with the asset price.
Support and Resistance In support and resistance trading rules, the asset is bought (sold short) when the close price \(P_t\) in period t exceeds (falls below) the high (low) price of the previous n trading days.
We also employ an alternative specification for extrema. As with the alternative specification used for filter rules, a maximum (minimum) is defined as the most recent price that is higher (lower) than the prices of the previous e periods. Consequently, trading signals in period t are triggered according to the following rules.
1.2.2 Channel breakouts
Trading channels denote price patterns where the minimum and maximum price of the last n trading days differ by no more than a fixed multiplicative value, x. A long (short) channel breakout signal is triggered when the price breaks out of the channel with an upward (downward) movement. This results in the following channel breakout rules.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Rink, K. The predictive ability of technical trading rules: an empirical analysis of developed and emerging equity markets. Financ Mark Portf Manag 37, 403–456 (2023). https://doi.org/10.1007/s11408-023-00433-2
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11408-023-00433-2
Keywords
- Technical analysis
- Predictive ability
- Multiple hypothesis testing
- Data snooping bias
- Market efficiency
- Transaction costs