INTRODUCTION

Pairs trading strategy is a market-neutral strategy that involves identification of pairs with statistical measures and execution of pairs trading when the predefined threshold is triggered. Theoretically, the idea of pairs trading is to take advantage of market inefficiencies. An equity analyst/trader identifies two stocks that move together and trade them every time the absolute distance between the price paths is above a particular threshold value. The price relationship between the two stocks tends to fluctuate around its average in the short term, while remaining stable in the long run. In order to make money, a trader sells the main asset with the highest price and buys its pair with the lowest price with the expectation of a price decrease and an increase for the assets, respectively. The cause relationship of pairs selection from ISE30 indices, as it will be detailed further, depends on daily selections of the pairs other than setting the pairs once in the training period and monitoring these pairs during the trading period.

There are several reasons (Bolgün et al., 2010) for the popularity of pairs trading. First, the procedure is simple to understand and execute. Second, valuation models, which are subjected to wide error margins, are not required as pairs trading is based on relative valuation and the position is often near market-neutral. Third, it is sufficiently flexible to accommodate various investment styles. Lastly, it normally does not evoke frequent intraday rebalancing, such that actual trading can be automated and is feasibly profitable (Chng and Xia, 2007).

The term statistical arbitrage encompasses a variety of strategies and investment programs. Their common features are: (i) trading signals are systematic, or rules-based, as opposed to driven by fundamentals; (ii) the trading book is market-neutral, in the sense that it has zero β with the market; and (iii) the mechanism for generating excess returns is statistical. The idea is to make many bets with positive expected returns, taking advantage of diversification across stocks, to produce a low-volatility investment strategy, which is uncorrelated with the market. Holding periods range from a few seconds to days, weeks or even longer.

This article is an upgraded version of our previous article dated 2010. We developed an alternative short strategy by using futures contracts instead of shorting stocks separately, which is very costly in Turkey. In order to create a market-neutral portfolio, we prefer to buy synthetic ETF composed of 30 stocks. The composition of this ETF is subjective to statistical measures, and in some cases we may use less than 30 stocks. We contributed an alternative investment strategy by employing Istanbul Stock Exchange (ISE) and Turkish Derivatives Exchange (Turkdex) instruments, and performance results showed us that this strategy is a powerful tool for asset managers.

The article proceeds as follows. In the next section, we provide a brief literature review and identify the main methods to implement pairs trading strategy. In the subsequent section, we describe the pairs trading data and methodology for the ISE30 stock & TURKDEX futures index pairs formation procedure and trading rules. In the penultimate section, we compare the performance results of pairs trading strategy. The final section concludes.

LITERATURE REVIEW

Pairs trading is elusive due to the lack of academic research. Although it is based on simple contrarian principles, pairs trading did not draw nearly as much academic attention as contrarian trading. Gatev et al (2006) implemented a strategy of selecting ‘pairs’ of stocks according to how much stock prices have moved together in the past, and then proved this simple trading rule based on daily data that generate profits that exceed transactions costs. Nath (2003) examines the implementation of a simple pairs trading strategy with automatic extreme risk control using the entire universe of securities in the highly liquid secondary market for U.S. government debt. It documents, from a practical viewpoint, the contrasts in the generic features of pairs trading with such securities compared with equities.

Price formation models, a cornerstone of the market microstructure literature, are the result of academic endeavors Glosten and Milgrom (1985); Easley and O’Hara (1987); Brown and Jennings (1989) to turn technical analysis from an art to a science.

The strategy is designed to profit from overreaction and subsequent mean reversion, that is, negative serial correlation in stock returns. Positive profits are reported in Lehmann (1990). However, Lo and MacKinlay (1990) show that contrarian profits could also be driven by delayed reaction or lead-lag effects between winner and loser stocks. In brief, if stock j reacts in the same direction as stock i but with a delay, then buying (selling) j subsequent to an increase (decrease) in i should generate profits, even if neither stocks overreact. Their results show that around 50 per cent of contrarian profits is generated by such lead-lag effects. The essence of Lo and MacKinlay (1990) is to highlight both negative serial covariance and positive cross-serial covariance ij in stock returns as two potential sources of contrarian profits.

The study by Jegadeesh and Titman (1995) extends that of Lo and MacKinlay (1990) by associating lead-lag effects with the dynamics of price reaction to common factors. Their analysis of contrarian profits include a more detailed set of stock price reaction scenarios covering under- and overreaction to common factors and idiosyncratic news. Unlikely Lo and MacKinlay (1990) study, Jegadeesh and Titman (1995) indicates that stock prices overreact to firm-specific information, but react with a delay to common factors and most of the contrarian profit is driven by overreaction to idiosyncratic news. This is consistent with the fact that overreaction to idiosyncratic news always generates contrarian profits, but overreaction to common factors may actually decrease contrarian profits. The essence of Jegadeesh and Titman (1995) is to show that common factor price reaction is a more appropriate measure of lead-lag effects than cross-serial covariance in total returns (Chng and Xia, 2007).

Hogan et al (2004) provide a framework for testing anomalies based on the principle of statistical arbitrage. Vidyamurthy (2004) details an implementation strategy based on a cointegration-based framework, without empirical results. Elliott et al (2005) apply a Kalman filter to estimate a parametric model of the spread. These methods can be shown to be applicable for special cases of the underlying equilibrium relationship between two stocks. A pairs trading strategy forcing an equilibrium relationship between the two stocks with little room for adaptation may lead to a conclusion of ‘non-tradeability’ at best and non-convergence at worst.Footnote 1 Do et al (2006); Engelberg et al (2008) showed that profits to this strategy are lower when the initial divergence is due to value-relevant news relating to one of the stocks.

Gerasimos (2011), investigated the performance and the trading characteristics of 62 German Exchange-Traded Funds during the period 11 April 2000–12 September 2006. German ETFs slightly underperform their benchmarks. By regression analysis, he revealed that the tracking error is positively affected by risk. Marshall et al (2011) proved the arbitrage opportunities in ETFs, which are known as less expected mispricing instruments. Their findings show that economically important ETF mispricing (S&P 500) is a reasonably frequent occurrence. They analyzed more exotic ETFs such as those that seek to provide two times, three times or the inverse of underlying index performance (short ETF). Recently, Schizas et al (2011) examined the performance of pairs trading strategy by using international ETFs from across the world. He proved the profitability of this strategy in the context of mostly traded 22 international ETFs including SPY and MSCI country index ETFs those are listed in AMEX.

DATA & METHODOLOGY

Data

The database for this research is based on the ISE-30 index shares of ISE and Turkdex Contracts between the periods of February 2005–July 2011. Although index composition is subject to change in each quarter by several criteria determined by the ISE Board of Directors, we studied the same stocks during our research. Selected stocks are presented in Appendix A with their sectoral information and market capitalization as of 19 December 2011.

All the stocks prices we used are dividend adjusted so that we did not have to make that adjustment again. For the index futures prices, we used the daily prices of the closest maturity contract. That is, when the nearest maturity contract expires at time t, the maturity of the new nearest maturity contract will be 2 months and the new price at t+1 will have more interest rate effect than the price at time t. This change that occurs once in every 2 months will have minor effects on our calculations, as our methodology will be based on the log-returns of the prices instead of the prices themselves.

Pairs formation with the constrained least squares method

In this research, we implemented a stock ETF pair by regression methodology in the ISE30 stocks’ daily returns and Turkdex Index, which is programmed by a pair trading model on Matlab.

Our trading rule requires taking long position on the ETF and at the same time taking short position on the stock index futures contract. As the ETF is highly correlated with the index, our portfolio will be market neutral.

In order to determine the weights in the ETF portfolio, we used the following constrained least squares regression.

such that

According to the equation above, stock index's daily returns are regressed on individual stocks’ daily returns for the last 125 days. The coefficients can directly be interpreted as the weights of the stocks in the ETF portfolio. In addition, applying the above constraints ensures that the sum of weights is 1 and individual stock weights are prevented to be less than 0 so that the ETF portfolio will not have any short positions.

Trading strategy and rule

After determining the weights for each stock in the ETF, we can figure out the price movements of the ETF and compare it with the index. As our trading strategy is going to be market neutral, we will buy the ETF and sell the index futures contract when the discrepancy (the futures price – ETF price difference, which from now on will be referred as spread) between the two increases. That is, we enter a position when the ETF is underpriced and index future is overpriced. Our main assumption is based on the fact that a relatively high spread is not sustainable. Therefore, when we observe such a case, we would expect the spread to return to its mean. That being said, the mean of the spread between the stock index and the ETF also changes because the weights in the ETF are recalculated for every trading day. Thus, we will adjust its mean by using a simple moving average for the last n days.

Specifically, we enter a position by buying the ETF and short selling the index future when the spread exceeds k standard deviation of its mean. And we close the position when the spread goes down to its mean.

At this point, one should also note that our parameters n and k (the number of days to calculate the moving average and standard deviation of it, respectively) can be optimized in order to achieve the best performing trading rule.

An example of pairs trading

The following figure illustrates the application of our trading rule.

  • We developed a synthetic ETF by using constrained least squares methods as defined in section ‘Pairs formation with the constrained least squares method’. This ETF composition is subject to change daily. The panel view of some coefficient calculation results is presented in Appendix B.

  • We are always long in ETF and short in Turkdex index futures contract, as our model requires short constraint in the spot equities market. Generally, we do not rebalance our position until exit time. This may affect our trading profit as well.

  • We enter the position when the blue line (spread) exceeds its k standard deviation and close our position when it goes below its mean.

  • Although the spread is relatively volatile, it tends to go back to its moving average quickly after exceeding the k standard deviation of its mean.

  • As shown in Figure 1, after the three trades, our capital increased by 27 per cent in 3 months.

    Figure 1
    figure 1

    Pairs trade example.

  • We included the daily interest income of the deposited collateral and overnight interest of our capital when we are out of trade position.

  • We excluded the commission and taxes in our calculations.

  • Statistical test results of implied strategy are presented in Appendix C.

PAIRS TRADING PERFORMANCE

The important decision in this trading strategy is determination of parameters. As discussed in the previous example, standard deviation of spread (k) is one of the key parameters in this alternative investment strategy. Another required parameter is the number of days (n) to implement standard deviation. The Sharpe and Sortino performance criteria of various combinations of parameters are also measured in order to determine the maximum trade profit. We optimized these measures by Matlab optimization tool. Optimization results are presented in Tables 1 and 2 for comparison purpose. As an example if we employed 1 standard deviation with 10 days lag, we could produce 0.7774 Sharpe against 1.0276 Sharpe performance measure with the 0.2 standard deviation in 10 days. It shows us that the used parameters have a big impact on performance results.

Table 1 Optimized sharpe ratios
Table 2 Optimized sortino ratios

After optimizing the parameters to maximize our performance ratios, we end up with the parameters n=19 and k=0.1. In Appendix D, for a different combination of parameters both performance ratios increase almost monotonically as the parameters approach to our optimized parameters.

As presented in Table 3, our strategy's Sharpe ratio is 1.68 and Sortino ratio is 5.52. During the same period, ISE-30 stock index's Sharpe and Sortino ratios were −0.09 and −0.13, respectively.

Table 3 Performance comparison of model portfolio and ISE 30 index

We started with 100 capital as of 25 August 2005, which is the starting period of our investment. Remember that the first 125 days were used as the formation period of our selected pairs. Pairs trading transaction details are shown partially in Table 4.

Table 4 Pairs trading log

A total of 144 trades were executed during the investment period; 128 of them were profitable trade, and the maximum drawdown was 15.68 per cent. As can be seen in Figure 2, most of the profit was generated after 2007.

Figure 2
figure 2

Pairs trading performance result.

Kelly ratio, the optimal size of each bet, is calculated as 87.5 per cent. This is quite high because the number of profitable trades are notably high relative to the number of unprofitable trades.

CONCLUSION

Pairs trading tries to exploit the co-movement of the prices of a pair of assets. It assumes that the relation that has been measured historically is stable. We developed an alternative investment strategy, which is always long in selected synthetic ETF and always short in the nearest Turkdex index futures contract. Synthetic ETF is composed of a maximum of 30 stocks but not required to buy and hold all of them passively as the current sector index ETFs. It may include less than 30 stocks, and in some cases more than 50 per cent weight could be given to only one selected stock in the ETF. The results indicate that pairs produced overall returns of 901 per cent (February 2005–July 2011), whereas ISE-30 index return for the same period was 111 per cent. Keeping in mind that our strategy is market neutral, this difference is notable.

Furthermore, an academic contribution of this research can be summarized as the dynamic synthetic ETF approach for the pairs formation and usage of futures contracts as an alternative short strategy. It is observed that parameters used in this research should be optimized. In addition, we should note that outperformance of this strategy may be the selection capability of our model. We used same stock list for 27 quarters, and performance attribution of those selected stocks may have outperformed the original ISE-30 stocks.