Detecting Leaders from Correlated Time Series

  • Di Wu
  • Yiping Ke
  • Jeffrey Xu Yu
  • Philip S. Yu
  • Lei Chen
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5981)

Abstract

Analyzing the relationships of time series is an important problem for many applications, including climate monitoring, stock investment, traffic control, etc. Existing research mainly focuses on studying the relationship between a pair of time series. In this paper, we study the problem of discovering leaders among a set of time series by analyzing lead-lag relations. A time series is considered to be one of the leaders if its rise or fall impacts the behavior of many other time series. At each time point, we compute the lagged correlation between each pair of time series and model them in a graph. Then, the leadership rank is computed from the graph, which brings order to time series. Based on the leadership ranking, the leaders of time series are extracted. However, the problem poses great challenges as time goes by, since the dynamic nature of time series results in highly evolving relationships between time series. We propose an efficient algorithm which is able to track the lagged correlation and compute the leaders incrementally, while still achieving good accuracy. Our experiments on real climate science data and stock data show that our algorithm is able to compute time series leaders efficiently in a real-time manner and the detected leaders demonstrate high predictive power on the event of general time series entities, which can enlighten both climate monitoring and financial risk control.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Bhuyan, R.: Information, alternative markets, and security price processes: A survey of literature. Finance 0211002, EconWPA (2002)Google Scholar
  2. 2.
    Box, G., Jenkins, G.M., Reinsel, G.: Time Series Analysis: Forecasting and Control. Prentice Hall, Englewood Cliffs (1994)MATHGoogle Scholar
  3. 3.
    Brent, R.P.: Algorithms for Minimization Without Derivatives. Dover Publications, New York (2002)MATHGoogle Scholar
  4. 4.
    Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. Comput. Netw. ISDN Syst. 30(1-7), 107–117 (1998)CrossRefGoogle Scholar
  5. 5.
    Chan, K.: A further analysis of the lead-lag relationship between the cash market and stock index futures market. Review of Financial Studies 5(1), 123–152 (1992)CrossRefGoogle Scholar
  6. 6.
    Dorr, D.H., Denton, A.M.: Establishing relationships among patterns in stock market data. In: Data & Knowledge Engineering (2008)Google Scholar
  7. 7.
    Granger, C.W.J.: Investigating causal relations by econometric models and cross-spectral methods. Econometrica 37(3), 424–438 (1969)CrossRefMathSciNetGoogle Scholar
  8. 8.
    Idé, T., Kashima, H.: Eigenspace-based anomaly detection in computer systems. In: KDD, pp. 440–449 (2004)Google Scholar
  9. 9.
    Idé, T., Papadimitriou, S., Vlachos, M.: Computing correlation anomaly scores using stochastic nearest neighbors. In: ICDM, pp. 523–528 (2007)Google Scholar
  10. 10.
    Meijering, E.: Chronology of interpolation: From ancient astronomy to modern signal and image processing. In: Proc. of the IEEE, pp. 319–342 (2002)Google Scholar
  11. 11.
    Papadimitriou, S., Sun, J., Yu, P.S.: Local correlation tracking in time series. In: ICDM, pp. 456–465 (2006)Google Scholar
  12. 12.
    Säfvenblad, P.: Lead-lag effects when prices reveal cross-security information. Working Paper Series in Economics and Finance 189, Stockholm School of Economics (September 1997)Google Scholar
  13. 13.
    Sakurai, Y., Papadimitriou, S., Faloutsos, C.: Braid: Stream mining through group lag correlations. In: SIGMOD, pp. 599–610 (2005)Google Scholar
  14. 14.
    Steinbach, M., Tan, P.-N., Kumar, V., Klooster, S.A., Potter, C.: Discovery of climate indices using clustering. In: KDD, pp. 446–455 (2003)Google Scholar
  15. 15.
    Tan, P.-N., Steinbach, M., Kumar, V.: Introduction to Data Mining. Addison-Wesley, Reading (2006)Google Scholar
  16. 16.
    von Storch, H., Zwiers, F.W.: Statistical Analysis in Climate Research. Cambridge University Press, Cambridge (2002)Google Scholar
  17. 17.
    Wichard, J.D., Merkwirth, C., Ogorzałlek, M.: Detecting correlation in stock market. Physica A: Statistical Mechanics and its Applications 344(1-2), 308–311 (2004)CrossRefMathSciNetGoogle Scholar
  18. 18.
    Zhu, Y., Shasha, D.: Statstream: Statistical monitoring of thousands of data streams in real time. In: VLDB, pp. 358–369 (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Di Wu
    • 1
  • Yiping Ke
    • 1
  • Jeffrey Xu Yu
    • 1
  • Philip S. Yu
    • 2
  • Lei Chen
    • 3
  1. 1.The Chinese University of Hong Kong 
  2. 2.University of IllinoisChicago
  3. 3.The Hong Kong University of Science and Technology 

Personalised recommendations