On the nature and impact of self-similarity in real-time systems

Hernández-Orallo, Enrique; Vila-Carbó, Joan

doi:10.1007/s11241-012-9146-0

On the nature and impact of self-similarity in real-time systems

Published: 20 January 2012

Volume 48, pages 294–319, (2012)
Cite this article

Real-Time Systems Aims and scope Submit manuscript

Enrique Hernández-Orallo¹ &
Joan Vila-Carbó¹

166 Accesses
Explore all metrics

Abstract

In real-time systems with highly variable task execution times simplistic task models are insufficient to accurately model and to analyze the system. Variability can be tackled using distributions rather than a single value, but the proper characterization depends on the degree of variability. Self-similarity is one of the deepest kinds of variability. It characterizes the fact that a workload is not only highly variable, but it is also bursty on many time-scales. This paper identifies in which situations this source of indeterminism can appear in a real-time system: the combination of variability in task inter-arrival times and execution times. Although self-similarity is not a claim for all systems with variable execution times, it is not unusual in some applications with real-time requirements, like video processing, networking and gaming.

The paper shows how to properly model and to analyze self-similar task sets and how improper modeling can mask deadline misses. The paper derives an analytical expression for the dependence of the deadline miss ratio on the degree of self-similarity and proofs its negative impact on real-time systems performance through system’s modeling and simulation. This study about the nature and impact of self-similarity on soft real-time systems can help to reduce its effects, to choose the proper scheduling policies, and to avoid its causes at system design time.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Digital Twin: Mitigating Unpredictable, Undesirable Emergent Behavior in Complex Systems

Evolutionary algorithms and their applications to engineering problems

Article Open access 16 March 2020

A survey on the evolution of stream processing systems

Article Open access 22 November 2023

Notes

Note that the aggregate process of W _t can be obtained as $W_{t}^{(m)}=\int_{(t-1)Tm}^{tTm}A(u) du$.
Assuming that all deadlines are the same is a weak condition. For different values of D _i, we can obtain the same results if we select, for example, the minimum deadline D=min_i D ⁱ (a tight condition) or the maximum deadline D=max_i D ⁱ (a loose condition).

References

Abdelzaher TF, Sharma V, Lu C (2004) A utilization bound for aperiodic tasks and priority driven scheduling. IEEE Trans Comput 53(3):334–350
Article Google Scholar
Abeni L, Buttazzo G (1999) QoS guarantee using probabilistic deadlines. In: Proc of the Euromicro confererence on real-time systems
Google Scholar
Abeni L, Buttazzo G (2004) Resource reservation in dynamic real-time systems. Real-Time Syst 37(2):123–167
Article Google Scholar
Anantharam V (1999) Scheduling strategies and long-range dependence. Queueing Syst 33(1–3):73–89
Article MathSciNet MATH Google Scholar
Beran J (1994) Statistics for long-memory processes. Chapman and Hall, London
MATH Google Scholar
Beran J, Sherman R, Taqqu M, Willinger W (1995) Long-range dependence in variable-bit-rate video traffic. IEEE Trans Commun 43(2):1566–1579
Article Google Scholar
Boxma O, Zwart B (2007) Tails in scheduling. SIGMETRICS Perform Eval Rev 34(4):13–20
Article Google Scholar
Brichet F, Roberts J, Simonian A, Veitch D (1996) Heavy traffic analysis of a storage model with long range dependent on/off sources. Queueing Syst 23(1):197–215
Article MathSciNet MATH Google Scholar
Crovella M, Bestavros A (1997) Self-similarity in world wide web traffic: evidence and possible causes. IEEE/ACM Trans Netw 5(6):835–846
Article Google Scholar
Dìaz J, Garcìa D, Kim K, Lee C, Bello LL, López J, Min LS, Mirabella O (2002) Stochastic analysis of periodic real-time systems. In: Proc of the 23rd IEEE real-time systems symposium, pp 289–300
Google Scholar
Erramilli A, Narayan O, Willinger W (1996) Experimental queueing analysis with long-range dependent packet traffic. IEEE/ACM Trans Netw 4(2):209–223
Article Google Scholar
Erramilli A, Roughan M, Veitch D, Willinger W (2002) Self-similar traffic and network dynamics. Proc IEEE 90(5):800–819
Article Google Scholar
Gardner M (1999) Probabilistic analysis and scheduling of critical soft real-time systems. Phd thesis, University of Illinois, Urbana-Champaign
Garrett MW, Willinger W (1994) Analysis, modeling and generation of self-similar vbr video traffic. In: ACM SIGCOMM
Google Scholar
Harchol-Balter M (2002) Task assignment with unknown duration. J ACM 49(2):260–288
Article MathSciNet Google Scholar
Harchol-Balter M (2007) Foreword: Special issue on new perspective in scheduling. SIGMETRICS Perform Eval Rev 34(4):2–3
Article Google Scholar
Harchol-Balter M, Downey AB (1997) Exploiting process lifetime distributions for dynamic load balancing. ACM Trans Comput Syst 15(3):253–285
Article Google Scholar
Hernandez-Orallo E, Vila-Carbo J (2007) Network performance analysis based on histogram workload models. In: Proceedings of the 15th international symposium on modeling, analysis, and simulation of computer and telecommunication systems (MASCOTS), pp 331–336
Google Scholar
Hernandez-Orallo E, Vila-Carbo J (2010) Analysis of self-similar workload on real-time systems. In: IEEE real-time and embedded technology and applications symposium (RTAS). IEEE Computer Society, Washington, pp 343–352
Google Scholar
Hernández-Orallo E, Vila-Carbó J (2010) Network queue and loss analysis using histogram-based traffic models. Comput Commun 33(2):190–201
Article Google Scholar
Hughes CJ, Kaul P, Adve SV, Jain R, Park C, Srinivasan J (2001) Variability in the execution of multimedia applications and implications for architecture. SIGARCH Comput Archit News 29(2):254–265
Article Google Scholar
Leland W, Ott TJ (1986) Load-balancing heuristics and process behavior. SIGMETRICS Perform Eval Rev 14(1):54–69
Article Google Scholar
Leland WE, Taqqu MS, Willinger W, Wilson DV (1994) On the self-similar nature of ethernet traffic (extended version). IEEE/ACM Trans Netw 2(1):1–15
Article Google Scholar
Liu CL, Layland JW (1973) Scheduling algorithms for multiprogramming in a hard-real-time environment. J ACM 20(1):46–61
Article MathSciNet MATH Google Scholar
Mandelbrot B (1965) Self-similar error clusters in communication systems and the concept of conditional stationarity. IEEE Trans Commun 13(1):71–90
Article Google Scholar
Mandelbrot BB (1969) Long run linearity, locally Gaussian processes, h-spectra and infinite variances. Int Econ Rev 10:82–113
Article MATH Google Scholar
Norros I (1994) A storage model with self-similar input. Queueing Syst 16(3):387–396
Article MathSciNet MATH Google Scholar
Norros I (2000) Queueing behavior under fractional Brownian traffic. In: Park K, Willinger W (eds) Self-similar network traffic and performance evaluation. Willey, New York, Chap 4
Google Scholar
Park K, Willinger W (2000) Self-similar network traffic: An overview. In: Park K, Willinger W (eds) Self-similar network traffic and performance evaluation. Willey, New York, Chap 1
Chapter Google Scholar
Paxson V, Floyd S (1995) Wide area traffic: the failure of Poisson modeling. IEEE/ACM Trans Netw 3(3):226–244
Article Google Scholar
Rolls DA, Michailidis G, Hernández-Campos F (2005) Queueing analysis of network traffic: methodology and visualization tools. Comput Netw 48(3):447–473
Article Google Scholar
Rose O (1995) Statistical properties of mpeg video traffic and their impact on traffic modeling in atm systems. In: Conference on local computer networks
Google Scholar
Roy N, Hamm N, Madhukar M, Schmidt DC, Dowdy L (2009) The impact of variability on soft real-time system scheduling. In: RTCSA ’09: Proceedings of the 2009 15th IEEE international conference on embedded and real-time computing systems and applications. IEEE Computer Society, Washington, pp 527–532
Chapter Google Scholar
Sha L, Abdelzaher T, Årzén KE, Cervin A, Baker T, Burns A, Buttazzo G, Caccamo M, Lehoczky J, Mok AK (2004) Real time scheduling theory: A historical perspective. Real-Time Syst 28(2):101–155
Article MATH Google Scholar
Taqqu MS, Willinger W, Sherman R (1997) Proof of a fundamental result in self-similar traffic modeling. SIGCOMM Comput Commun Rev 27(2):5–23
Article Google Scholar
Tia T, Deng Z, Shankar M, Storch M, Sun J, Wu L, Liu J (1995) Probabilistic performance guarantee for real-time tasks with varying computation times. In: Proc of the real-time technology and applications symposium, pp 164–173
Chapter Google Scholar
Vila-Carbó J, Hernández-Orallo E (2008) An analysis method for variable execution time tasks based on histograms. Real-Time Syst 38(1):1–37
Article MATH Google Scholar
Willinger W, Taqqu M, Erramilli A (1996) A bibliographical guide to self-similar traffic and performance modeling for modern high-speed networks. In: Stochastic networks: Theory and applications, pp 339–366
Google Scholar
Willinger W, Taqqu MS, Sherman R, Wilson DV (1997) Self-similarity through high-variability: statistical analysis of ethernet lan traffic at the source level. IEEE/ACM Trans Netw 5(1):71–86
Article Google Scholar

Download references

Acknowledgement

This work was developed under a grant from the European Union (FRESCOR-FP6/2005/IST/5-03402).

Author information

Authors and Affiliations

Departamento de Informática de Sistemas y Computadores, Universidad Politécnica de Valencia, Valencia, Spain
Enrique Hernández-Orallo & Joan Vila-Carbó

Authors

Enrique Hernández-Orallo
View author publications
You can also search for this author in PubMed Google Scholar
Joan Vila-Carbó
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Enrique Hernández-Orallo.

Appendix: Background on self-similarity

The analysis of stationary time series data or stochastic processes can reveal the property of self-similarity. This is the case, for example, of the CPU utilization over a sampling period. This has been well studied in network traffic. For a detailed discussion of self-similarity see Beran (1994).

1.1 A.1 Definition of self-similarity

A phenomenon that is self-similar looks the same at different scales on a dimension. This means that at different time scales (microseconds, milliseconds, seconds) the statistical properties are similar.

More formally, let {X _t:t=0,1,2,…} ({X _t} for short) be a discrete stationary stochastic process representing the evolution in time of a statistical distribution (for example, the workload of a system). Let σ ² be the variance and let r(k)=Cov (X _t,X _t+k)/σ ² be the autocorrelation function of {X _t}. We define $\{X_{t}^{(m)}\}$ as the m-aggregated process of {X _t}, which is obtained by aggregating and averaging the data in X _t by blocks of size m:

$$X^{(m)}_t=\frac{1}{m} (X_{mt} + X_{mt+1} + \cdots + X_{m(t+1)-1})$$

(20)

and r ^(m)(k) is defined as the autocovariance function of $\{X_{t}^{(m)}\}$. Aggregating really means multiplying the sampling period by a factor m. An important effect of the process aggregation is to smooth the traffic rate in each period. Therefore, the variation is reduced. The question of how this variation changes depending on the aggregation factor is related to the study of the self-similarity of a process.

A process is distributionally self-similar if the processes {X _t} and $\{X_{t}^{(m)}\}$ have the same distribution, up to a scaling factor. A self-similar process has the property that, when it is aggregated, the new process has the same autocorrelation function as the original one. Formally, the process {X _t} is called second-order self-similar with Hurst parameter $H=1-| \frac{\beta}{2} |$ if:

$$\left\{\begin{array}{l}\operatorname {Var}(X^{(m)}_t)=\sigma^2 m^{-\beta} \\r^{(m)}(k) = r(k)\end{array}\right.\quad m=1,2, \ldots $$

(21)

If the second condition only holds when m→∞, then a process is asymptotically second-order self-similar. If 0<H<0.5, the process is Short-Range Dependent (SRD), and if 0.5<H<1, the process is Long-Range Dependent (LRD).

The variance of the m-aggregated process can be obtained as:

$$\operatorname {Var}(X^{(m)}_t) = m^{2H-2} \cdot \operatorname {Var}(X_t) = m^{2H-2}\sigma^2 $$

(22)

Processes that are LRD exhibit correlations over a wide range of times scales, while processes that are SRD exhibit correlation functions that decay exponentially fast. This implies that time-aggregation quickly results in white noise characterized by the absence of any significant temporal correlations.

The Hurst parameter can be evaluated in several ways (Beran 1994). In this paper, we use the variance-time plot, which is the graph of the variance $\log(\operatorname {Var}(X^{(m)}_{t}))$ versus log(m). With this graph, we can obtain the Hurst parameter by fitting a least-square line with slope β=2H−2 through the resulting points, ignoring those for small m.

Since a self-similar process has observable bursts at a wide range of scales, it can exhibit long-range dependence: values at any instant are correlated with values at all future instants. Although the terms self-similarity and long-range dependence are not exactly equivalent, we use them in an interchangeable fashion throughout the paper.

One of the advantages of using self-similar models is that the degree of variation on multiple time scales can be expressed using only a single parameter: the Hurst parameter.

1.2 A.2 Fractional Brownian motion

A common way to model self-similar processes is using a fractional Brownian model. The fractional Brownian motion (denoted fBm) model can be viewed as an extension of the standard Brownian Motion models that have been used in heavy traffic analysis. Formally, the normalized fractional Brownian motion {Z _H(t):t≥0} is a continuous zero mean Gaussian process with stationary increments and variance |t|^2H. The correlation of the increments is characterized by the Hurst index, H. Unlike the standard Brownian motion, the fractional one has a long-range dependency property when H>1/2. Since this process is self-similar: Z _H(at)=|a|^H Z _H(t). Using the normalized fBM Z _H(t), we can define a new fBM A ^∗(t) with mean μ and variance σ ²:

$$A^*(t) = \mu t + \sigma Z_{H}(t) $$

(23)

In Sect. 4 we see that the proposed “on-off” workload model behaves statistically as this fractional Brownian motion.

1.3 A.3 Heavy-tailed distributions

A distribution is heavy-tailed if

$$P(X > x)\sim x^{-\alpha}, \quad \hbox{as} \ x \to \infty, 0 < \alpha < 2$$

(24)

That is, if the asymptotic shape of the distribution is hyperbolic, it is heavy tailed. Heavy-tailed distributions have a number of properties that are different from exponential or normal distributions. If α≤2, then the distribution has an infinite variance; if α≤1, then the distribution has an infinite mean. Thus, as α decreases, a larger portion of the probabilistic mass function may be present in the tail of the distribution. In practical terms, a random variable that follows a heavy-tailed distribution can have extremely large values with non-negligible probability.

The simplest heavy-tailed distribution is the Pareto distribution:

$$F(x)=P(X \leq x) =\left\{\begin{array}{l@{\quad }l}0 & t < d \\1-(\frac{d}{t})^\alpha & t \geq d, 1 < \alpha < 2\end{array}\right.$$

(25)

where d is the minimal value of the distribution and α is known as the Pareto index.

The problem with using the Pareto distribution is its infinite variance. This poses problems when simulating a system with heavy-tailed distributions. Specifically, in Crovella and Bestavros (1997), it is shown that simulations become infeasible for α<1.5. A practical approach for solving this problem is to bound the Pareto distribution (Harchol-Balter 2002). A Bounded-Pareto distribution is characterized by three parameters: the Pareto index α, the minimal value d, and the largest value p:

$$F(x)=\left\{\begin{array}{l@{\quad }l}0 & t < d \\1-\frac{(\frac{d}{t})^\alpha}{(\frac{d}{p})^\alpha} & d \leq t < p, 1 < \alpha < 2 \\1 & t \geq p\end{array}\right.$$

(26)

It is easy to see that if p is very high, the distribution is very similar to the unbounded one. Therefore, for large values of p it can be assumed that it is heavy-tailed (Harchol-Balter 2002).

Finally, note that self-similarity as well as heavy-tailed distributions are convenient mathematical idealizations and can never be fully validated from finite data sets. However, these idealizations are powerful mathematical tools for modeling important aspects of time series.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hernández-Orallo, E., Vila-Carbó, J. On the nature and impact of self-similarity in real-time systems. Real-Time Syst 48, 294–319 (2012). https://doi.org/10.1007/s11241-012-9146-0

Download citation

Published: 20 January 2012
Issue Date: May 2012
DOI: https://doi.org/10.1007/s11241-012-9146-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On the nature and impact of self-similarity in real-time systems

Abstract

Access this article

Similar content being viewed by others

Digital Twin: Mitigating Unpredictable, Undesirable Emergent Behavior in Complex Systems

Evolutionary algorithms and their applications to engineering problems

A survey on the evolution of stream processing systems

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Appendix: Background on self-similarity

1.1 A.1 Definition of self-similarity

1.2 A.2 Fractional Brownian motion

1.3 A.3 Heavy-tailed distributions

Rights and permissions

About this article

Cite this article

Keywords

Navigation

On the nature and impact of self-similarity in real-time systems

Abstract

Access this article

Similar content being viewed by others

Digital Twin: Mitigating Unpredictable, Undesirable Emergent Behavior in Complex Systems

Evolutionary algorithms and their applications to engineering problems

A survey on the evolution of stream processing systems

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Appendix: Background on self-similarity

Appendix: Background on self-similarity

1.1 A.1 Definition of self-similarity

1.2 A.2 Fractional Brownian motion

1.3 A.3 Heavy-tailed distributions

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation