N-gram Events for Analysis of Financial Time Series

  • Igor Borovikov
  • Michael Sadovsky
Conference paper
Part of the Springer Proceedings in Complexity book series (SPCOM)


Discretizing a time series and encoding it as a string over a finite alphabet allows the application of information-theoretic methods developed for discrete signals. Computing the information values of n-grams extracted from such a string leads to the notion of events: occurrences of n-grams that possess specific properties, e.g. an abnormally high (or low) information value. We define the information value of an n-gram via maximum entropy lifts over frequency dictionaries. We also look for correlations between market events and n-gram events. The paper shows that the proposed method of time series analysis, when applied to event studies, may provide a new and insightful perspective.
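The pipeline summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation; the details below are assumptions made for illustration only: a binary up/down encoding of price changes, a sliding window of step 1 over a linear (non-circular) string, a lift from the (n-1)-gram dictionary of the form f(w[:-1]) · f(w[1:]) / f(w[1:-1]), an information value taken as the log-ratio of observed to lifted frequency, and a fixed threshold for flagging events. All function names and the threshold are hypothetical.

```python
from collections import Counter
from math import log


def encode_binary(prices):
    """Encode a price series as a string: '1' if the price rose, else '0' (assumed encoding)."""
    return "".join("1" if b > a else "0" for a, b in zip(prices, prices[1:]))


def frequency_dictionary(text, n):
    """Relative frequencies of all n-grams read with a sliding window of step 1."""
    counts = Counter(text[i:i + n] for i in range(len(text) - n + 1))
    total = sum(counts.values())
    return {gram: c / total for gram, c in counts.items()}


def information_values(text, n):
    """Log-ratio of observed n-gram frequency to its estimate lifted from shorter n-grams (n >= 2)."""
    observed = frequency_dictionary(text, n)
    shorter = frequency_dictionary(text, n - 1)
    # For n == 2 the shared core of the two overlapping (n-1)-grams is empty, with frequency 1.
    core = frequency_dictionary(text, n - 2) if n > 2 else {"": 1.0}
    values = {}
    for gram, f in observed.items():
        lifted = shorter[gram[:-1]] * shorter[gram[1:]] / core[gram[1:-1]]
        values[gram] = log(f / lifted)
    return values


def ngram_events(text, n, threshold):
    """Positions whose n-gram has an information value of magnitude >= threshold (assumed criterion)."""
    values = information_values(text, n)
    events = []
    for i in range(len(text) - n + 1):
        gram = text[i:i + n]
        if abs(values[gram]) >= threshold:
            events.append((i, gram, values[gram]))
    return events


if __name__ == "__main__":
    prices = [10.0, 10.5, 10.2, 10.8, 10.4, 10.9, 11.3, 11.1]  # made-up illustrative data
    text = encode_binary(prices)
    for position, gram, value in ngram_events(text, n=2, threshold=0.3):
        print(position, gram, round(value, 3))
```

Positions returned by `ngram_events` could then be aligned with the dates of the underlying series and compared against known market events. On a linear string the dictionaries of different n-gram lengths are only approximately consistent with each other, so this sketch is suited to illustration rather than to reproducing published results.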


Keywords: Input Text; Market Event; Information Capacity; Maximum Entropy Principle; Financial Time Series
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. Int. Labs, Foster City, USA
  2. Institute of Computational Modelling SB RAS, Krasnoyarsk, Russia
