Skip to main content
Log in

Parameter estimation in abruptly changing dynamic environments using stochastic learning weak estimator

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Many real-life dynamical systems experience abrupt changes followed by almost stationary periods. In this paper, we consider streams of data exhibiting such abrupt behavior and investigate the problem of tracking their statistical properties in an online manner. Wedevise a tracking procedure where an estimator that is suitable for a stationary environment is combined together with an estimator suitable for a dynamic environment. The current estimate is based on the stationary estimator unless a statistically significant difference is observed between both estimators. The stationary estimate is deemed off track and a large update (jump) is given to get the stationary estimate back on track. We use the Stochastic Learning Weak Estimator (SLWE) as the dynamic estimator. The SLWE is known to be the state-of-the art solution to tracking the properties of non-stationary environments, due to its multiplicative update form. Therefore, the SLWE is a better choice to accompany a stationary estimator than the far more common sliding window based approach. A theoretically well founded statistical testing procedure is developed to detect a significant difference between the stationary and dynamical estimators. Although our procedure bears similarities to the event detection procedure suggested by Ross et al. (Pattern Recogn Lett, 33(2):191–198, 2012), it is rather well founded theoretically. First, Ross et al. ignore the uncertainty in the stationary estimator in the detection procedure. Second, the detection threshold is determined based on heuristics and therefore lacks a solid statistical foundation. Extensive simulation results, based on both synthetic and real-life data related to news topic classification, demonstrate that our estimation procedure is easy to tune and outperforms legacy works.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Agresti A, Kateri M (2011) Categorical data analysis. Springer

  2. Basseville M, Nikiforov IV (1993) Detection of abrupt changes: theory and application. Prentice-Hall, Inc.

  3. Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Statist Soc. Series B (Methodological), 289–300

  4. Bickel PJ, Doksum KA (2015) Mathematical statistics: basic ideas and selected topics. CRC Press

  5. Bifet A, Gavalda R (2007) Learning from time-changing data with adaptive windowing. In: Proceedings of the 2007 SIAM international conference on data mining. SIAM, pp 443–448

  6. Dasu T, Krishnan S, Venkatasubramanian S, Yi K (2006) An information-theoretic approach to detecting changes in multi-dimensional data streams. In: Proc. Symp. on the interface of statistics, computing science, and applications. Citeseer

  7. De Santo M, Percannella G, Sansone C, Vento M (2004) A multi-expert approach for shot classification in news videos. Image Anal Recogn, 564–571

  8. Dries A, Rückert U (2009) Adaptive concept drift detection. Anal Stat Data Min 2(5):311–327

    Article  MathSciNet  Google Scholar 

  9. Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Statist Softw 33(1):1

    Article  Google Scholar 

  10. Gama J, žliobaitė I, Bifet A, Pechenizkiy M, Bouchachia A (2014) A survey on concept drift adaptation. ACM Comput Surv 46(4):44,1–44,37

    Article  Google Scholar 

  11. Hawkins DM, Qiu P, Kang CW (2003) The changepoint model for statistical process control. J Qual Technol 35(4):355

    Article  Google Scholar 

  12. Hawkins DM, Zamba KD (2005) A change-point model for a shift in variance. J Qual Technol 37(1):21

    Article  Google Scholar 

  13. Hawkins DM, Zamba KD (2005) Statistical process control for shifts in mean or variance using a changepoint formulation. Technometrics 47(2):164–173

    Article  MathSciNet  Google Scholar 

  14. Ibrahim A, Martin MV (2009) Detecting and preventing the electronic transmission of illicit images and its network performance. In: International conference on digital forensics and cyber crime. Springer, pp 139–150

  15. Johnson RA, Wichern DW et al. (2014) Applied multivariate statistical analysis, vol 4. Prentice-Hall, New Jersey

    Google Scholar 

  16. Karr A. F. (2012) Probability. Springer, New York

    MATH  Google Scholar 

  17. Kifer D, Ben-David S, Gehrke J (2004) Detecting change in data streams. In: Proceedings of the Thirtieth international conference on very Large data bases, vol 30, pp 180–191. VLDB Endowment

    Chapter  Google Scholar 

  18. Klinkenberg R (2004) Learning drifting concepts: example selection vs. example weighting. Intell Data Anal 8(3):281–300

    Google Scholar 

  19. Koychev I (2000) Gradual forgetting for adaptation to concept drift. In: Proceedings of ECAI 2000 workshop current issues in spatio-temporal reasoning, pp 101–106

  20. Koychev I (2000) Gradual forgetting for adaptation to concept drift. In: Proceedings of ECAI 2000 workshop on current issues in spatio-temporal reasoning

  21. Koychev I, Lothian R (2006) Tracking drifting concepts by time window optimisation. In: Bramer M, Coenen F, Allen T (eds) Research and development in intelligent systems XXII. Springer, London, pp 46–59

    Chapter  Google Scholar 

  22. Koychev I, Schwab I (2000) Adaptation to drifting user’s interests. In: Proceedings of ECML2000 workshop: machine learning in new information age, pp 39–46

  23. Kulkarni P, Ade R (2014) Incremental learning from unbalanced data with concept class, concept drift and missing features: a review. Int J Data Min Knowl Manag Process (IJDKP) 4(6):15– 29

    Article  Google Scholar 

  24. Lowry CA, Woodall WH, Champ CW, Rigdon SE (1992) A multivariate exponentially weighted moving average control chart. Technometrics 34(1):46–53

    Article  Google Scholar 

  25. Lucas JM, Saccucci MS (1990) Exponentially weighted moving average control schemes: properties and enhancements. Technometrics 32(1):1–12

    Article  MathSciNet  Google Scholar 

  26. Manning CD, Schütze H (1999) Foundations of statistical natural language processing, vol 999. MIT Press

  27. Misra S, Kapri NR, Wolfinger BE (2015) Selfishness-aware target tracking in vehicular mobile wimax networks. Telecommun Syst 58(4):313–328

    Article  Google Scholar 

  28. Mohan R, Yazidi A, Feng B, Oommen BJ (2016) Dynamic ordering of firewall rules using a novel swapping window-based paradigm. In: Proceedings of the 6th International conference on communication and network security. ACM, pp 11–20

  29. Narendra KS, Thathachar MAL (2012) Learning automata: an introduction. Courier Corporation

  30. Oommen BJ, Misra S (2010) Fault-tolerant routing in adversarial mobile ad hoc networks: an efficient route estimation scheme for non-stationary environments. Telecommun Syst 44:159–169

    Article  Google Scholar 

  31. Oommen BJ, Yazidi A, Granmo O-C (2012) An adaptive approach to learning the preferences of users in a social network using weak estimators. J Inf Process Syst, 8(2)

    Article  Google Scholar 

  32. Oommen JB, Rueda L (2006) Stochastic learning-based weak estimation of multinomial random variables and its applications to pattern recognition in non-stationary environments. Pattern Recogn 39(3):328–341

    Article  Google Scholar 

  33. R Core Team (2017) R: a language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria

  34. Rikli N-E, Alnasser A (2016) Lightweight trust model for the detection of concealed malicious nodes in sparse wireless ad hoc networks. Int J Distrib Sensor Netw 12(7):1550147716657246

    Article  Google Scholar 

  35. Ross GJ, Adams NM (2012) Two nonparametric control charts for detecting arbitrary distribution changes. J Qual Technol 44(2): 102

    Article  Google Scholar 

  36. Ross GJ, Adams NM, Tasoulis DK, Hand DJ (2011) A nonparametric change point model for streaming data. Technometrics 53(4):379–389

    Article  MathSciNet  Google Scholar 

  37. Ross GJ, Adams NM, Tasoulis DK, Hand DJs (2012) Exponentially weighted moving average charts for detecting concept drift. Pattern Recogn Lett 33(2):191–198

    Article  Google Scholar 

  38. Ross GJ, Tasoulis DK, Adams NiM (2013) Sequential monitoring of a bernoulli sequence when the pre-change parameter is unknown. Comput Stat, 1–17

  39. Rueda L, Oommen BJ (2006) Stochastic automata-based estimators for adaptively compressing files with nonstationary distributions. IEEE Trans Syst Man Cybern Part B: Cybern 36(5):1196–1200

    Article  Google Scholar 

  40. Sebastião R, Gama J (2007) Change detection in learning histograms from data streams. In: Proceedings of the aritficial intelligence 13th portuguese conference on progress in artificial intelligence, EPIA’07. Springer-Verlag, Berlin, pp 112–123

  41. Shiryayev AN (1978) Optimal stopping rules. Springer

  42. Stensby A, Oommen BJ, Granmo O-C (2013) The use of weak estimators to achieve language detection and tracking in multilingual documents. Int J Pattern Recogn Artif Intell 27(04):1350011

    Article  MathSciNet  Google Scholar 

  43. Tartakovsky AG, Rozovskii BL, Blazek RB, Kim H (2006) A novel approach to detection of intrusions in computer networks via adaptive sequential and batch-sequential change-point detection methods. IEEE Trans Signal Process 54:3372–3382

    Article  Google Scholar 

  44. Tsymbal A, Pechenizkiy M, Cunningham P, Puuronen S (2008) Dynamic integration of classifiers for handling concept drift. Inf Fusion 9(1):56–68

    Article  Google Scholar 

  45. Widmer G, Kubat M (1996) Learning in the presence of concept drift and hidden contexts. Mach Learn 23(1):69–101

    Google Scholar 

  46. Yazidi A, Granmo O-C, Oommen BJ, Gerdes M, Reichert F (2011) A user-centric approach for personalized service provisioning in pervasive environments. Wirel Pers Commun 61(3):543–566

    Article  Google Scholar 

  47. Yazidi Anis, Oommen BJ (2016) Novel discretized weak estimators based on the principles of the stochastic search on the line problem. IEEE Trans Cybern 46(12):2732–2744

    Article  Google Scholar 

  48. Yazidi A, Oommen BJ, Horn G, Granmo O-C (2016) Stochastic discretized learning-based weak estimation: a novel estimation method for non-stationary environments. Pattern Recogn 60:430–443

    Article  Google Scholar 

  49. Zhan J, Oommen BJ, Crisostomo J (2011) Anomaly detection in dynamic systems using weak estimators. ACM Trans Internet Technol 11:3,1–3,16

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hugo Lewi Hammer.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hammer, H.L., Yazidi, A. Parameter estimation in abruptly changing dynamic environments using stochastic learning weak estimator. Appl Intell 48, 4096–4112 (2018). https://doi.org/10.1007/s10489-018-1205-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-018-1205-3

Keywords

Navigation