Skip to main content

On Impact of Data Models on Predictability Assessment of Time Series

  • Chapter
  • First Online:
Data Analysis and Optimization

Part of the book series: Springer Optimization and Its Applications ((SOIA,volume 202))

  • 256 Accesses

Abstract

The main methods for predicting traffic in telecommunications networks are the methods and techniques of Machine Learning (ML).

Within the framework of the ML approach, the network traffic predictor is considered as a tool that uses one way or another accumulated statistics over time to draw conclusions about the future behavior of network traffic.

However, as the analysis of the literature shows, many modern MO tools do not work efficiently enough due to the pronounced non-linearity of traffic changes and non-stationarity.

This paper is an attempt at some ordering and categorization in a huge stream of publications on modern methods, techniques and models for predicting data of various nature.

For this, the concept of a conceptual model (CM) of algorithms for predicting the state of systems in the subject area is introduced. This concept is used widely used in modern Big Date and Software Engineering, is adapted here to represent specific probabilistic models of random sequences and processes of various nature (symbolic, binary, real). Under such a conceptual model, the paper refers to a formalized description of the relationships between elements, objects (both real and model) and goals of mathematical models and prediction algorithms, as well as the relationships between them, expressed in probabilistic terms.

This construction allows representing classes of models according to the types of predicted data, used probability measures, loss functions when making decisions about the acceptability of the received forecast.

Among the tasks of forecasting, the task of predicting signs of increments (direction of change) of the time series process is singled out separately. The paper proposes to use some results of the theory of random processes for a quick assessment of the predictability of signs of increments with acceptable ac-curacy. The proposed procedure is a simple heuristic rule for predicting the increment of two neighboring values of a random sequence. The connection with this approach for time series with known approaches for predicting binary sequences is shown.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)

    Article  MATH  Google Scholar 

  2. Predict Traffic of LTE Network, [Online] https://www.kaggle.com/naebolo/predict-traffic-of-lte-network. Accessed on 26 Oct 2020

  3. Chen, A., Law, J., Aibin, M.: A survey on traffic prediction techniques using artificial intelligence for communication networks. Telecom. 2(4), 518–535 (2021)

    Article  Google Scholar 

  4. Zhang, J., Tan, D., Han, Zhu, H.: From machine learning to deep learning: Progress in machine intelligence for rational drugv discovery. Drug Discov. Today. 22(11), 1680–1685 (2017)

    Article  Google Scholar 

  5. Rooba, R., Vallimayil, V.: Semantic aware future page prediction based on domain. Int J. Pure Appl. Math. 118(9), 911–919 (2018)

    Google Scholar 

  6. Ryabko, B.: Compression-based methods for nonparametric prediction and estimation of some characteristics of time series. IEEE Trans. on Inf. Theory. 55(9), 4309–4315 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  7. Brovelli, M., Sanso, F., Venuti, G.: A discussion on the Wiener–Kolmogorov prediction principle with easy-to compute and robust variants. J. Geod. 76, 673–683 (2003)

    Article  MATH  Google Scholar 

  8. Feder, M., Merhav, N.: Universal prediction. IEEE T. Inform. Theory. 44(6), 2124–2147 (1998)

    Article  MathSciNet  MATH  Google Scholar 

  9. Sharma, S..: Activation functions in neural networks (2019). Retrieved from https://towardsdatascience.com/activation-functions-neural-networks-1cbd9f8d91d6

  10. Bishop, C.M.: Pattern Recognition and Machine Learning. Springer Science + Business Media, New York (2009)

    MATH  Google Scholar 

  11. Fettke, P.: Conceptual Modelling and Artificial Intelligence, Joint Proceedings of Modellierung Short. Workshop and Tools & Demo Papers Workshop on Models in AI (2020)

    Google Scholar 

  12. Introduction to Unified Modeling Language (UML) 3rd INSPIRATION Training, GFA (December 4–5, 2012)

    Google Scholar 

  13. Kwiatkowska, M., Norman, G., Parker, D.: PRISM 4.0: verification of probabilistic real-time systems. In: In, Proc. 23rd International Conference on Computer Aided Verification (CAV’11), pp. 585–591. Vol. 6806 of LNCS, Springer (2011)

    Chapter  Google Scholar 

  14. Ryabko, B.: Prediction of random sequences and universal coding. Probl. Inf. Transm. 24(2), 3–14 (1988)

    MathSciNet  MATH  Google Scholar 

  15. Ikharo, A.B., Anyachebelu, K.T., Blamah, N.V., Abanihi, V.: Optimising self-similarity network traffic for better performance. Int J Sci Technol Res, Int J Sci Technol. Print ISSN: 2395-6011. https://doi.org/10.32628/IJSRST207413164

  16. Cesa-Bianchi, N., Lugosi, G.: On prediction of individual sequences. Ann. Stat. 27(6), 1865–1895 (1999)

    Article  MathSciNet  MATH  Google Scholar 

  17. Yu, P., Kuo, K.-S., Rilee, M.L., Yu, H.: Assessing Deep Neural Networks as Probability Estimatorsar. Xiv:2111.08239v1 [cs.LG] (2021)

    Google Scholar 

  18. Ryabko, B., Monarev, V.: Using information theory approach to randomness testing. J. Stat. Plan. Inference. 133, 95–110 (2005)

    Article  MathSciNet  MATH  Google Scholar 

  19. Miller, J.B., Sanjurjo, A.: Surprised by the Hot Hand Fallacy? A Truth in the Law of Small Numbers. arXiv:1902.01265v1 [econ.GN] (2019)

    MATH  Google Scholar 

  20. Nikravesh, A., Ajila, S.A., Lung, C.-H.: An autonomic prediction suite for cloud resource provisioning. J. Cloud Comput. Adv. Syst. Appl. 6, 3 (2017). https://doi.org/10.1186/s13677-017-0073-4

    Article  Google Scholar 

  21. Frenkel, S.: Theoretical aspects of a priori on-line assessment of data predictability in applied tasks 5th International Symposium on Cyber Security Cryptology and Machine Learning CSCML 2021. LNCS. 12716, 187–195 (2021)

    Google Scholar 

  22. Buket Coskun, B., Vardar-Acar, C., Demirtas, H.: A Generalized Correlated,: Random Walk, Converging to Fractional Brownian Motion. arXiv:1903.05424v3 (2019)

    Google Scholar 

  23. Ming, L., Jia-Yue, L.: On the Predictability of Long-Range Dependent Series. Mathematical Problems in Engineering Volume (2010). https://doi.org/10.1155/2010/397454

    Book  MATH  Google Scholar 

  24. Brignoli, D.: DDOS detection based on traffic self-similarity (n.d.). https://ir.canterbury.ac.nz/bitstream/handle/10092/2105/Thesis_fulltext.pdf;sequence=2

  25. Graf, S.: Statistically self-similar fractals. Prob. Th. Rel. Fields. 74, 357–392 (1987)

    Article  MathSciNet  MATH  Google Scholar 

  26. Park, R., Hernández-Campos, F., Le, L., Marron, J., Park, J., Pipiras, V., Smith, F., Smith, L., Trovero, M., Zhu, Z.: Long-range dependence analysis of internet traffic. J. Appl. Stat. 38(7), 1407–1433 (2011)

    Article  MathSciNet  Google Scholar 

  27. Megues, P., Molnar, S.: Analysis of Elephant Users in Broadband Network Traffic. 19th EUNICE Workshop on Advances in Communication Networking (2013). https://doi.org/10.1007/978-3-642-40552-5_4

    Book  Google Scholar 

  28. Leland, W.E., et al.: On the Self-Similar Nature of Ethernet Traffic (Extended Version), pp. 1–15. IEEE Press, Piscataway (1994)

    Google Scholar 

  29. Becchi M., From Poisson Processes to Self-Similarity: a Survey of Network Traffic Models. 2008., https://www.cse.wustl.edu/~jain/cse567-06/ftp/traffic_models1/index.html

    Google Scholar 

  30. Bosq, D., Nguyen, H.: A Course in Stochastic Processes. Stochastic Models and Statistical Inference, Kluwer, Dordrecht (1996)

    Book  MATH  Google Scholar 

  31. Pan, C., Wang, Y., Shi, H., Shi, J., Cai, R.: Network traffic prediction incorporating prior knowledge for an intelligent network. Sensors. 22(7), 2674 (2022)

    Article  Google Scholar 

  32. Lavasani, A., A., Eghlidos, T.: Practical next bit test for evaluating pseudorandom sequences. Comput. Sci. Eng. Electric. Eng. 16(1), 19–33 (2009)

    MATH  Google Scholar 

  33. Park, C., Hernandez, F., Le, L., Marron, J.S., Park, J., Pipiras, V., Smith, F.D., Smith, R.L., Trovero, M., Zhu, Z.: Long range dependence analysis of Internet traffic. Journal of Applied Statistics. 38, 1407–1433 (2004)

    Article  MathSciNet  Google Scholar 

  34. He, H.: 1 Shitao Cheng, 1 and Xiaofu Zhang, signal nonstationary degree evaluation method based on moving statistics theory. Shock. Vib. 2021., Article ID 5562110, 18 (2021). https://doi.org/10.1155/2021/5562110

    Article  Google Scholar 

  35. Box, G.E.P., Jenkins, G.M., Reinsel, G.C.: Time Series Analysis: Forecasting and Control. Wiley, New York (2008)

    Book  MATH  Google Scholar 

  36. Lyman, J., Edmonson, W., McCullough, C., Rao, M.: The predictability of continuous-time band limited processes. IEEE Trans. Signal Process. 48(2), 311–316 (2000)

    Article  Google Scholar 

  37. Song, W., Duan, S., Chen, D., Zio, E., Yan, W., Cai, F.: Finite iterative forecasting model based on fractional generalized Pareto motion. Fractal Fract. 6, 471 (2022)

    Article  Google Scholar 

  38. Loiseau, P., Gonçalves, P., Dewaele, G., Borgnat, P., Abry, P., Primet, P.: Investigating self-similarity and heavy-tailed distributions on a large-scale experimental facility. IEEE/ACM Trans. netw. 18, 1261–1274 (2010)

    Article  Google Scholar 

  39. Chen, A., Law, J., Aibin, M.: A survey on traffic prediction techniques using artificial intelligence for communication networks. Telecom. 2, 517–536 (2021)

    Article  Google Scholar 

  40. Vinchoff, C., Chung, N., Gordon, T., Lyford, L., Aibin, M.: Traffic Prediction in optical networks using graph convolutional generative adversarial networks. In: In Proceedings of the International Conference on Transparent Optical Networks, pp. 3–6. Bari, Italy (2020)

    Google Scholar 

  41. Aibin, M.: Deep Learning for Cloud Resources Allocation: Long-Short Term Memory in EONs. In Proceedings of the International Conference on Transparent Optical Networks, Angers, France, 9–13 July 2019; pp. 8–11

    Google Scholar 

  42. Yin, F., Wang, J., Guo, C. (eds.): A Boosting-Based Framework for Self-Similar and Non-linear Internet Traffic Prediction ISNN 2004, pp. 931–936. LNCS 3174 (2004)

    Google Scholar 

  43. Shi, Y., Fernando, B., Hartley, R.: Action Anticipation with RBF Kernelized Feature Mapping RNN. arXiv:1911.07806v3 [cs.CV] 11 Jul (2021)

    Google Scholar 

  44. Frenkel, S.: Predicting the direction of changes in the values of time Series for relatively small training samples. In: 6th International Symposium on Cyber Security Cryptology and Machine Learning CSCML 2021CSCML, Beer-Sheva, Israel, pp. 118–134. Proceedings, Lecture Notes in Computer Science (13301) (2022)

    Chapter  Google Scholar 

  45. The Influence of Long-Range Dependence on Traffic Prediction Sven A. M. Östring, H. Sirisena Published 11 June 2001 Computer Science ICC 2001. IEEE International Conference on Communications. Conference Record (Cat. No.01CH37240)

    Google Scholar 

  46. Nikravesh, A.Y., Ajila, S.A., Lung, C.-H.: An autonomic prediction suite for cloud. J. Cloud Comput. Adv. Syst. Applic. 6, 3 (2017). https://doi.org/10.1186/s13677-017-0073-4

    Article  Google Scholar 

  47. Zhao, A., Liu, Y.: Application of Nonlinear Combination Prediction Model for Network Traffic. 2nd International Conference on Electronic & Mechanical Engineering and Information Technology (EMEIT-2012). Proceedings, 2337–2340 (2012)

    Book  Google Scholar 

  48. Christoffersen, P., Diebold, F.: Financial asset returns, direction-of-change forecasting, and volatility dynamics. Manag. Sci. 52(8), 1273–1287 (2006)

    Article  Google Scholar 

  49. Sornette, D., Andersen, J.: Increments of uncorrelated time series can be predicted with a universal 75% probability of success. Int. J. Mod. Phys. 11(4), 713–720 (2000)

    Article  Google Scholar 

  50. Cloud, B.L., Dalmazo, L., Vilela, M.: Performance analysis of network traffic predictors. J. Netw Syst Manage. 25, 290–320 (2017)

    Article  Google Scholar 

  51. Aryan, M.: Efficient Methods for Large-Scale Empirical Risk Minimization. A Doctoral Thesis, Philadelphia, PA (2017)

    Google Scholar 

  52. Kleeman, R.: Information theory and dynamical system predictability. Entropy. 13, 612–649 (2011)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sergey Frenkel .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Frenkel, S. (2023). On Impact of Data Models on Predictability Assessment of Time Series. In: Goldengorin, B., Kuznetsov, S. (eds) Data Analysis and Optimization. Springer Optimization and Its Applications, vol 202. Springer, Cham. https://doi.org/10.1007/978-3-031-31654-8_7

Download citation

Publish with us

Policies and ethics