Empirical Statistical Model for LTE Downlink Channel Occupancy

Hamid, Mohamed; Björsell, Niclas; Slimane, Slimane Ben

doi:10.1007/s11277-017-4205-4

Open access
Published: 10 May 2017

Volume 96, pages 855–866, (2017)
Wireless Personal Communications

Mohamed Hamid¹,², Niclas Björsell² & Slimane Ben Slimane¹


Abstract

This paper develops an empirical statistical channel occupancy model for downlink long-term evolution (LTE) cellular systems. The model is based on mixtures of statistical distributions for the holding times of the channels. The statistical distribution of the time during which the channels are free is also considered. The data is obtained through an extensive measurement campaign performed in Stockholm, Sweden. Two types of mixtures, of exponential and of log-normal distributions, are considered to fit the measurement findings. The log-likelihood of both mixtures is used as a quantitative measure of the goodness of fit. Moreover, finding the optimal number of linearly combined distributions using the Akaike information criterion is investigated. The results show that a good fit can be obtained with a mixture of either exponential or log-normal distributions. Even though the fitting is done for a representative case at a specific time and location, the model is applicable in general to LTE and, in a wider sense, to other cellular systems.

1 Introduction

The need for different data rates in mobile broadband systems has been growing rapidly in recent years. In that regard, long term evolution (LTE) has been provided by the 3rd generation partnership project (3GPP) as a standard for packet-based adaptive data rate systems [1]. LTE has been further developed into LTE advanced (LTE-A) to provide higher data rates and greater spectral efficiency [2]. For robust optimization of cellular systems in general and LTE systems in particular, the traffic demand of cellular networks needs to be modelled.

Besides resource optimization, several other problems in cellular networks, such as performance evaluation and billing, call for traffic modelling. Among the statistics used for traffic evaluation in cellular systems is the channel occupancy, which is defined as the time that a user occupies a channel in a cell while located in the serving area of that cell [3]. The channel usage of a cellular system is modelled as a two-state Markov chain process [4]. The first state is the busy state, in which the channel is assigned to a user, whereas the second state is the idle state, in which the channel is unassigned.

Many studies have been carried out to characterize the statistical distribution of cellular channel occupancy. In [5], it is shown that mobile telephony channel occupancy can be approximated by an exponential distribution. A great advantage of the exponential distribution is its tractability in finding analytical solutions for optimization problems. Therefore, the exponential distribution has been used intensively to model cellular channel occupancy; see [4] for an example. Nevertheless, many research findings have concluded that the exponential distribution matches empirical data poorly [6]. One of the main disagreements is the heavy-tail behaviour of the empirical channel occupancy, which is not properly characterized by exponential distributions. Therefore, heavy-tailed distributions are used as alternatives to model the cellular channel occupancy, among which the log-normal distribution is found to fit the empirical data better [7, 8].

Even though many studies have been carried out to model cellular channel occupancy, none of them considers LTE. LTE channel occupancy modelling therefore still needs to be studied, and doing so is the main contribution of this paper. Furthermore, this paper explores fitting the empirical cellular channel occupancy data to a mixture of either exponential or log-normal distributions, combined linearly, using LTE as an example of a cellular system.

Using a distribution mixture is motivated by retaining the simplicity of the exponential and log-normal distributions. Hence, we can avoid using more complicated distributions, such as the Beta and Kumaraswamy distributions [9], to model the cellular channel occupancy. Moreover, distribution mixtures are more general than single distributions and can be used to fit the data under different conditions. Consequently, algorithms developed for exponentially or log-normally distributed cellular channel occupancy can still be used with the corresponding mixtures, with small changes that account for the linear combination of several components.

The rest of this paper is structured as follows: Sect. 2 handles the theoretical aspects of the paper including the channel usage model and using distributions mixture to fit data. Section 3 shows the measurements setup and the fitting results. Finally, Sect. 4 concludes the paper.

2 Theory

The theoretical aspects of the paper are handled in this section. The section starts by presenting the Markov-based model for the channel occupancy. Following that, the mathematical framework for distribution mixture fitting is introduced. Finally, fitting mixtures of exponential and of log-normal distributions is studied in particular.

2.1 System Model

The LTE channel usage can be modelled as a two-state Markov process. The two states are the ON state, representing an occupied channel, and the OFF state, denoting an idle channel. The temporal lengths of the ON and OFF states are random variables (RVs), hereafter denoted x and y respectively. Figure 1 exhibits the channel usage model. The problem tackled throughout this paper is how to find statistical distributions that fit x and y.
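As an illustration of how samples of x and y could be obtained from an occupancy trace, the following minimal sketch splits a binary busy/idle sequence into ON and OFF run lengths. The function names, the uniform sampling interval `dt` and the toy sequence are assumptions made for this example; the measurement processing actually used in the paper is not described at this level of detail.

```python
from itertools import groupby


def on_off_durations(samples, dt=1.0):
    """Split a binary occupancy sequence into ON and OFF run lengths.

    samples: iterable of 0/1 flags (1 = channel busy), uniformly sampled.
    dt: assumed sampling interval; durations are returned in units of dt.
    Note: the first and last runs are cut by the observation window, so a
    careful analysis may want to discard them.
    """
    on, off = [], []
    for state, run in groupby(samples):
        length = sum(1 for _ in run) * dt
        (on if state else off).append(length)
    return on, off


def duty_cycle(samples):
    """Fraction of samples in which the channel is busy."""
    samples = list(samples)
    return sum(samples) / len(samples)


# Toy sequence, invented for illustration only.
occupancy = [0, 1, 1, 1, 0, 0, 1, 0, 0, 0, 1, 1]
x, y = on_off_durations(occupancy, dt=0.5)  # ON durations, OFF durations
rho = duty_cycle(occupancy)
```

The lists `x` and `y` play the role of the ON and OFF duration samples whose empirical distributions are fitted in the remainder of this section.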

The rest of this section provides the theoretical aspects of distributions mixture fitting in general and exponential and log-normal mixtures fitting in particular.

Without loss of generality, the RV x is considered in the coming parts of this paper. The same findings for x can be applied to y. Denote the empirical probability density function (pdf) of x as g(x). Then g(x) can be fitted with a linear combination of k pdfs as

$$\begin{aligned} g(x)\approx \sum \limits _{i=1}^{k}p_{i}f(x|\varvec{\Theta }_{i}), \end{aligned}$$

(1)

where

$$\begin{aligned} 0< p_{i} < 1 \quad \forall i ,\quad \sum \limits _{i=1}^{k}p_{i} = 1, \end{aligned}$$

$p_{i}$ is the weight of pdf number i, $f(\cdot )$ denotes a single pdf and $\varvec{\Theta }_{i}$ holds the distinct distribution parameters of pdf number i. For the whole mixture model, $\varvec{\Omega }$ contains all the distinct mixture parameters and is defined as

$$\begin{aligned} \varvec{\Omega } =\left [p_{1}, \dots , p_{k}, \varvec{\Theta }_{1}^{T},\dots , \varvec{\Theta }_{k}^{T}\right]^{T}, \end{aligned}$$

(2)

with $(\cdot )^{T}$ denoting the transpose.

Note that the formulation of $\varvec{\Omega }$ given in (2) assumes that the mixture is composed of distributions of the same type, which is the case considered in this paper. The goodness of fit is judged through the log-likelihood estimator, $L(x|\varvec{\Omega })$, with $f(x|\varvec{\Omega })$ denoting the fitted mixture pdf on the right-hand side of (1), found as

$$\begin{aligned} L(x|\varvec{\Omega }) = \int g(x)\text{ log }\left( \dfrac{f(x|\varvec{\Omega })}{g(x)}\right) dx. \end{aligned}$$

(3)
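The measure in (3) is the negative of the Kullback-Leibler divergence from g to the fitted mixture, so for normalized densities it is at most zero and equals zero only for a perfect fit. A minimal numerical sketch follows, assuming the empirical pdf is available as a histogram (bin edges and per-bin density); the function names and the synthetic stand-in density are assumptions for this example, not the paper's data.

```python
import math


def mixture_pdf(x, weights, pdfs):
    """Evaluate sum_i p_i * f_i(x), the right-hand side of Eq. (1)."""
    return sum(p * f(x) for p, f in zip(weights, pdfs))


def exp_pdf(lam):
    """Return the exponential pdf with rate lam as a callable."""
    return lambda x: lam * math.exp(-lam * x)


def fit_measure(edges, g, weights, pdfs):
    """Midpoint Riemann-sum approximation of Eq. (3).

    edges: histogram bin edges; g: empirical density in each bin.
    Empty bins (g == 0) are skipped, taking 0 * log(.) as 0.
    """
    total = 0.0
    for lo, hi, g_bin in zip(edges[:-1], edges[1:], g):
        if g_bin <= 0.0:
            continue
        mid = 0.5 * (lo + hi)
        f_mid = mixture_pdf(mid, weights, pdfs)
        total += g_bin * math.log(f_mid / g_bin) * (hi - lo)
    return total


# Synthetic stand-in for an empirical density (not measured data).
edges = [0.1 * n for n in range(101)]  # bins covering [0, 10]
mids = [0.5 * (a + b) for a, b in zip(edges[:-1], edges[1:])]
g = [math.exp(-m) for m in mids]

perfect = fit_measure(edges, g, [1.0], [exp_pdf(1.0)])  # model equals g
worse = fit_measure(edges, g, [1.0], [exp_pdf(3.0)])    # mismatched rate
```

Here `perfect` evaluates to zero and `worse` is negative, which matches the interpretation that values closer to zero indicate a better fit.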

2.2 Exponential Distributions Mixture Fitting

In [10] a linear combination of exponential pdfs is introduced to fit heavy-tailed data. For an exponential mixture distribution, pdf number i has the form in (4a), while the collection of the distinct parameters, $\varvec{\Omega }_{exp}$, is expressed in (4b).

$$\begin{aligned} f(x|\varvec{\Theta }_{i}) & = {} \lambda _{i}e^{-\lambda _{i} x}, \end{aligned}$$

(4a)

$$\begin{aligned} \varvec{\Omega }_{exp} & = {} \begin{pmatrix} p_{1} &{} \lambda _{1} \\ \vdots &{} \vdots \\ p_{k} &{} \lambda _{k} \\ \end{pmatrix}. \end{aligned}$$

(4b)

The rest of this subsection shows how to find $\varvec{\Omega }_{exp}$, following the procedure of [10]. Finding $\varvec{\Omega }_{exp}$ is a recursive procedure that starts with fitting the tail and moves backwards. Starting from the assumption that the part of the tail where $x>c_1$ can be fitted exclusively with the first exponential distribution, then

$$\begin{aligned} p_{1}e^{-\lambda _{1}c_1} & = {} F^{c}(c_{1}), \end{aligned}$$

(5a)

$$\begin{aligned} \sum \limits _{i=2}^{k} p_{i}e^{-\lambda _{i}x} & = {} 0\quad \hbox {for } x>c_1. \end{aligned}$$

(5b)

where $F^{c}(x)$ is the empirical complementary cumulative distribution function (CCDF) of x. Similarly, $p_{1}e^{-\lambda _{1}bc_1} = F^{c}(bc_{1})$ where $b>1$. Accordingly, the first pair, $(\lambda _1,p_1)$ is found as

$$\begin{aligned} \lambda _{1} & = {} \dfrac{1}{(b-1)c_{1}}\text{ ln }\left( \dfrac{F^{c}(c_{1})}{F^{c}(bc_{1})} \right) , \end{aligned}$$

(6a)

$$\begin{aligned} p_{1} & = {} F^{c}(c_{1})e^{\lambda _{1}c_{1}}. \end{aligned}$$

(6b)

Following the same idea, the pairs $(\lambda _i,p_i)$ for $2 \le i \le k$ are found as

$$\begin{aligned} \lambda _{i} & = {} \dfrac{1}{(b-1)c_{i}}\text{ ln }\left( \dfrac{F_{i}^{c}(c_{i})}{F_{i}^{c}(bc_{i})} \right) , \end{aligned}$$

(7a)

$$\begin{aligned} p_{i} & = {} F_{i}^{c}(c_{i})e^{\lambda _{i}c_{i}}, \end{aligned}$$

(7b)

where

$$\begin{aligned} c_i & = {} c_{1}\alpha ^{-(i-1)}, \quad \alpha > b,\\ F_{i}^{c}(c_i) & = {} F^{c}(c_i)-\sum \limits _{j=1}^{i-1}p_{j}e^{-\lambda _{j}c_{i}},\\ F_{i}^{c}(bc_i) & = {} F^{c}(bc_i)-\sum \limits _{j=1}^{i-1}p_{j}e^{-\lambda _{j}bc_{i}} \end{aligned}$$

and

$$\begin{aligned} F_{1}^{c}(x) = F^{c}(x). \end{aligned}$$

Finally the last pair $(\lambda _k,p_k)$ is found as

$$\begin{aligned} p_{k} & = {} 1-\sum \limits _{j=1}^{k-1}p_{j}, \end{aligned}$$

(8a)

$$\begin{aligned} \lambda _{k} & = {} \dfrac{1}{c_{k}}\text{ ln }\left( \dfrac{p_{k}}{F_{k}^{c}(c_{k})} \right) . \end{aligned}$$

(8b)

The values of $c_{1}$, b, and $\alpha$ are user defined and the reader is referred to [10] for more details on how to set them.
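The recursion in (5) to (8) can be sketched in a few lines. The sketch below assumes that the residual CCDF $F_{i}^{c}$ is the empirical CCDF minus the already fitted components weighted by their $p_{j}$, which is how the procedure is read here; the function name `fit_exp_mixture`, its error handling and the synthetic test CCDF are assumptions for this example and not part of the paper.

```python
import math


def fit_exp_mixture(ccdf, k, c1, b, alpha):
    """Tail-first recursive fit of a k-component exponential mixture.

    ccdf: callable returning the empirical CCDF F^c(x).
    c1, b, alpha: user-chosen constants with 1 < b < alpha.
    Returns a list of (p_i, lambda_i) pairs, tail component first.
    """
    pairs = []

    def residual(x):
        # Empirical CCDF minus the weighted components fitted so far.
        return ccdf(x) - sum(p * math.exp(-lam * x) for p, lam in pairs)

    for i in range(1, k):  # components 1 .. k-1, Eqs. (6) and (7)
        c = c1 * alpha ** (-(i - 1))
        r_c, r_bc = residual(c), residual(b * c)
        if r_bc <= 0.0 or r_bc >= r_c:
            raise ValueError(f"residual CCDF not usable at component {i}")
        lam = math.log(r_c / r_bc) / ((b - 1.0) * c)
        pairs.append((r_c * math.exp(lam * c), lam))

    # Last component, Eq. (8): the weight is fixed by normalization.
    c_k = c1 * alpha ** (-(k - 1))
    p_k = 1.0 - sum(p for p, _ in pairs)
    r_k = residual(c_k)
    if p_k <= 0.0 or r_k <= 0.0 or r_k >= p_k:
        raise ValueError("last component cannot be fitted")
    pairs.append((p_k, math.log(p_k / r_k) / c_k))
    return pairs


# Synthetic CCDF of a known two-component mixture (not measured data).
def true_ccdf(x):
    return 0.3 * math.exp(-0.1 * x) + 0.7 * math.exp(-5.0 * x)


fitted = fit_exp_mixture(true_ccdf, k=2, c1=20.0, b=2.0, alpha=100.0)
```

With these settings the sketch recovers the generating pairs (0.3, 0.1) and (0.7, 5.0) up to rounding. The `ValueError` branches correspond to the situation in which the tail assumption behind (5) stops holding, so the recursion cannot continue.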

2.3 Log-Normal Distributions Mixture Fitting

In this paper, a mixture of log-normal distributions is used to improve the goodness of fit for cellular channel occupancy compared to a single log-normal distribution. In [11] a mixture of normal distributions is used to fit a specific data set. To deal with the monotonic behaviour of the measured cellular channel occupancy, a log-normal mixture can be used instead of a normal mixture. Pdf number i and the collection of distinct distribution parameters, $\varvec{\Omega }_{lgn}$, of a log-normal mixture are shown in (9a) and (9b) respectively.

$$\begin{aligned} f(x|\varvec{\Theta }_{i}) & = {} \dfrac{1}{ \sqrt{2\pi }\sigma _{i} x}e^{ -\dfrac{(\text{ ln }(x)-\mu _{i})^{2}}{2\sigma _{i}^{2}}} , \end{aligned}$$

(9a)

$$\begin{aligned} \varvec{\Omega }_{lgn} & = {} \begin{pmatrix} p_{1} &{} \mu _{1} &{} \sigma _{1} \\ \vdots &{} \vdots &{} \vdots \\ p_{k} &{} \mu _{k} &{} \sigma _{k}\\ \end{pmatrix}. \end{aligned}$$

(9b)

$\varvec{\Omega }_{lgn}$ can be found using Newton Raphson optimization method by solving the equation

$$\begin{aligned} L(x|\varvec{\Omega }) = 0. \end{aligned}$$

(10)

Starting from an initial guess $\varvec{\Omega }_{lgn}^{(1)}$, the estimate $\varvec{\Omega }_{lgn}^{(i+1)}$ is updated as

$$\begin{aligned} \varvec{\Omega }_{lgn}^{(i+1)} = \varvec{\Omega }_{lgn}^{(i)}-{\mathbf {H}}^{-1}\left(\varvec{\Omega }_{lgn}^{(i)}\right)L\left(x|\varvec{\Omega }_{lgn}^{(i)}\right), \end{aligned}$$

(11)

where ${\mathbf {H}}(\cdot )$ denotes the Hessian matrix. Since the Hessian matrix has to be updated at every iteration, the stopping criterion is the convergence of $\mathbf {H}$.
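Equations (10) and (11) are written compactly, with the scalar $L$ appearing where a vector of the same length as $\varvec{\Omega }_{lgn}$ would be expected. One common reading, given here as an assumption and not as the authors' stated procedure, is that Newton-Raphson is applied to the stationarity condition of the log-likelihood, so that the update uses its gradient:

$$\begin{aligned} \nabla _{\varvec{\Omega }}L(x|\varvec{\Omega }) & = {} {\mathbf {0}},\\ \varvec{\Omega }_{lgn}^{(i+1)} & = {} \varvec{\Omega }_{lgn}^{(i)}-{\mathbf {H}}^{-1}\left(\varvec{\Omega }_{lgn}^{(i)}\right)\nabla _{\varvec{\Omega }}L\left(x|\varvec{\Omega }_{lgn}^{(i)}\right),\\ \left[{\mathbf {H}}\right]_{mn} & = {} \dfrac{\partial ^{2}L}{\partial \Omega _{m}\,\partial \Omega _{n}}. \end{aligned}$$

Under this reading, both sides of the update are vectors with one entry per mixture parameter, and the iteration stops changing once the gradient vanishes.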

2.4 Optimizing the Number of Distributions

To optimize the value of k, the Akaike information criterion (AIC) [12] is used. AIC is a statistical model identification tool used to optimize the model order [12]. It is calculated as the log-likelihood penalized by the number of independent model parameters. AIC is obtained using

$$\begin{aligned} AIC(x;\varvec{\Omega },N) = \underbrace{-2L(x|\varvec{\Omega })} _{\text {log}\,\, \text{likelihood}}+ \underbrace{2N}_{\text {Parameters}\,\, \text{penalty}}, \end{aligned}$$

(12)

where N is the model order, defined as the number of independent model parameters. The optimal model order is found by minimizing the value of AIC in (12). For the exponential mixture distribution, each pair i where $1 \le i \le (k-1)$ represents a single independent parameter, while the last pair $(\lambda _{k}, p_{k})$ is fully dependent on the other pairs. Hence, the exponential distributions mixture has $(N = k-1)$ independent parameters. Accordingly, the optimal model order for the exponential mixture distribution, $k_{AIC}^{exp}$, is found as

$$\begin{aligned} k_{AIC}^{exp}= \underset{k}{\mathrm {argmin}} \left (-2L(x|\varvec{\Omega }_{exp})+2(k-1) \right ). \end{aligned}$$

(13)

For the log-normal distribution mixture, with Newton Raphson method, there are $N = 3k$ independent parameters as all the components of $\varvec{\Omega }_{lgn}$ are independent. Therefore, the optimal model order for log-normal distributions mixture, $k_{AIC}^{lgn}$ is determined as

$$\begin{aligned} k_{AIC}^{lgn}= \underset{k}{\mathrm {argmin}} \Big (-2L(x|\varvec{\Omega }_{lgn})+6k \Big ). \end{aligned}$$

(14)
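The selection rules in (13) and (14) amount to an argmin over candidate values of k. The sketch below assumes the log-likelihood of each fitted mixture is already available; the numbers in `loglik_by_k` are invented purely to show the mechanics and are not the paper's results.

```python
def aic(log_likelihood, n_params):
    """Eq. (12): -2 L + 2 N."""
    return -2.0 * log_likelihood + 2.0 * n_params


def best_order(loglik_by_k, n_params_of):
    """Return (k minimizing AIC, dict of AIC values per k).

    loglik_by_k: dict mapping k to the log-likelihood of the k-component fit.
    n_params_of: callable mapping k to the number of independent parameters.
    """
    scores = {k: aic(ll, n_params_of(k)) for k, ll in loglik_by_k.items()}
    return min(scores, key=scores.get), scores


# Made-up log-likelihood values, for illustration only.
loglik_by_k = {1: -120.0, 2: -60.0, 3: -56.0, 4: -53.5, 5: -53.0}

k_exp, aic_exp = best_order(loglik_by_k, lambda k: k - 1)  # Eq. (13)
k_lgn, aic_lgn = best_order(loglik_by_k, lambda k: 3 * k)  # Eq. (14)
```

Even with identical log-likelihood values, the heavier penalty of 6k selects a smaller order than the penalty of 2(k-1) in this toy case, which illustrates why the two mixtures can end up with different optimal orders.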

3 Measurements

3.1 Measurements Setup

The empirical downlink LTE traffic is obtained through a measurement campaign performed at an indoor location in Kista, Stockholm, Sweden. The measurement location has GPS coordinates of $59^{\circ }24^{\prime }19.13^{\prime \prime }\text{N }$, $17^{\circ }56^{\prime }56.12^{\prime \prime }\text{E }$. The measurement area is densely occupied by offices, with a shopping mall and residential buildings in the surroundings. A Google map of the measurement location is shown in Fig. 2.

For robust measurements, a real time spectrum analyser (RTSA) is used to collect the data. The data is fed to the RTSA through a wideband tunable antenna. Figure 3 exhibits the measurement setup. Since different channels experience different loads at different times, the measurements are treated in time spans of 2 h. Hereafter, the findings for one LTE downlink traffic channel are discussed as a representative case. The results for the other channels and systems are similar, with different parameters. The presented results are for the measurements carried out on a 1.4 MHz channel that lies between 2650.6 and 2652.0 MHz during the period Wednesday, 2013/10/02, 09:00 am to 11:00 am.

3.2 Fitting Results

Before diving into the fitting results, it is important to note that the LTE load in the measurement area changes with time. These changes are reflected in the duty cycle values obtained through a week of measurements, shown in Fig. 4. Even though different loads are experienced at different times, the fitting procedure is the same and the findings are similar, with different parameter values. Hereafter, the results for the period Wednesday, 2013/10/02, 09:00 am to 11:00 am are shown as an example.

Figures 5 and 6 show the empirical distribution and the fitted exponential and log-normal mixtures respectively. Both Figs. 5 and 6 illustrate how the fitted mixtures of exponential or log-normal distributions approach towards the empirical distribution with the change of k. A quantitative evaluation is obtained by means of the log likelihood estimation which is provided in Fig. 7.

As shown in Fig. 5, for lower values of k the exponential mixture fits the tail well but fits the lower values of x poorly. In contrast, increasing k improves the fit in the lower region of x. This is explained as follows. The first pair $(\lambda _1, p_1)$ always characterizes the tail beyond $c_{1}$, so the values greater than $c_1$ are always well fitted. Depending on the obtained values of $(\lambda _1, p_1)$ and the value of k, the remaining pairs $(\lambda _i, p_i)$ are obtained, and the last pair $(\lambda _k, p_k)$ is fully dependent on the previously obtained pairs. Therefore, when k increases, the part that is characterized by $(\lambda _k, p_k)$ shrinks. However, for very large values of k a point is reached where the property expressed in (5) no longer holds, which makes the recursive fitting procedure inapplicable for the remaining pairs. Therefore, there is a crossover point at which the log-likelihood starts to degrade as k increases, as shown in Fig. 7. For the log-normal mixture, the higher the k, the better the fit, as the log-likelihood curve in Fig. 7 shows.

Figure 7 depicts the obtained AIC for both the exponential and the log-normal distribution mixtures as k changes. According to the figure, the optimal model order is 7 for the exponential distributions mixture and 4 for the log-normal distributions mixture. The difference in model order between the two mixtures is explained by the influence of the parameter penalty term. As shown in (13) and (14), the log-normal mixture AIC is penalized more than the exponential mixture AIC. For the same reason, the AIC curve of the exponential distributions mixture follows its log-likelihood curve. On the other hand, for the log-normal distributions mixture the AIC and log-likelihood curves show different tendencies, as the parameter penalty term has a larger impact on the AIC values.

The obtained distinct mixture parameters matrices for the optimal exponential and log-normal mixtures, $\varvec{\Omega }_{exp}$ and $\varvec{\Omega }_{lgn}$ are shown respectively below.

$$\begin{aligned} \varvec{\Omega }_{exp} & = {} \begin{pmatrix} 0.04 &{} 0.04 \\ 0.08 &{} 0.11 \\ 0.20 &{} 0.47 \\ 0.16 &{} 1.75 \\ 0.23 &{} 8.71 \\ 0.12 &{} 54.24 \\ 0.17 &{} 103.85 \\ \end{pmatrix}.\\ \varvec{\Omega }_{lgn} & = {} \begin{pmatrix} 0.33 &{} 0.67 &{} 1.53 \\ 0.21 &{} -6.38 &{} 2.59 \\ 0.40 &{} -2.32 &{} 1.64 \\ 0.06 &{} -9.84 &{} 5.05 \\ \end{pmatrix}. \end{aligned}$$

As explained by (4b), the first column of $\varvec{\Omega }_{exp}$ holds the probabilities of the different exponential distributions, with their corresponding values of $\lambda$ in the second column. Similarly, as in (9b), the probabilities of the log-normal distributions are placed in the first column of $\varvec{\Omega }_{lgn}$, with the corresponding values of the means and the standard deviations in the second and third columns respectively.
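To make the reported matrices easier to use, the following sketch evaluates the CCDF of each fitted mixture from the values above. The helper names are assumptions for this example, and the unit of x is not stated in this section, so the argument is simply taken in the same unit as the fitted durations.

```python
import math

# (p_i, lambda_i) rows of the reported exponential mixture.
OMEGA_EXP = [(0.04, 0.04), (0.08, 0.11), (0.20, 0.47), (0.16, 1.75),
             (0.23, 8.71), (0.12, 54.24), (0.17, 103.85)]

# (p_i, mu_i, sigma_i) rows of the reported log-normal mixture.
OMEGA_LGN = [(0.33, 0.67, 1.53), (0.21, -6.38, 2.59),
             (0.40, -2.32, 1.64), (0.06, -9.84, 5.05)]


def exp_mixture_ccdf(x, omega=OMEGA_EXP):
    """P(X > x) for a mixture of exponentials."""
    return sum(p * math.exp(-lam * x) for p, lam in omega)


def lgn_mixture_ccdf(x, omega=OMEGA_LGN):
    """P(X > x) for a mixture of log-normals, x > 0."""
    return sum(
        p * 0.5 * math.erfc((math.log(x) - mu) / (sigma * math.sqrt(2.0)))
        for p, mu, sigma in omega
    )


def exp_mixture_mean(omega=OMEGA_EXP):
    """Mean of a mixture of exponentials: sum of p_i / lambda_i."""
    return sum(p / lam for p, lam in omega)


weights_exp = sum(p for p, _ in OMEGA_EXP)
weights_lgn = sum(p for p, _, _ in OMEGA_LGN)
```

Both weight columns sum to one, as required by the constraint stated with (1), and the two CCDFs can be compared at any duration of interest, for example `exp_mixture_ccdf(1.0)` against `lgn_mixture_ccdf(1.0)`.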

4 Conclusions

An empirical statistical model for the downlink LTE channel occupancy is introduced in this paper. The model is based on a linear mixture of exponential or log-normal distributions. Such mixtures characterize the downlink LTE channel occupancy better than a single exponential or log-normal distribution. The Akaike information criterion is used to optimize the number of exponential or log-normal distributions composing the mixture. The Akaike information criterion of the log-normal mixture is affected more by the model order than that of the exponential mixture. The model is a general statistical model and can be used for other cellular systems.

References

1. Astely, D., Dahlman, E., Furuskar, A., Jading, Y., Lindstrom, M., & Parkvall, S. (2009). LTE: The evolution of mobile broadband. IEEE Communications Magazine, 47(4), 44–51.
2. Ghosh, A., Ratasuk, R., Mondal, B., Mangalvedhe, N., & Thomas, T. (2010). LTE-advanced: Next-generation wireless broadband technology [invited paper]. IEEE Wireless Communications, 17(3), 10–22.
3. Yavuz, E., & Leung, V. (2007). Modeling channel occupancy times for voice traffic in cellular networks. In IEEE international conference on communications (ICC) (pp. 332–337).
4. Hamid, M., Mohammed, A., & Yang, Z. (2010). On spectrum sharing and dynamic spectrum allocation: MAC layer spectrum sensing in cognitive radio networks. In International conference on communications and mobile computing (CMC) (Vol. 2, pp. 183–187).
5. Hong, D., & Rappaport, S. S. (1986). Traffic model and performance analysis for cellular mobile radio telephone systems with prioritized and nonprioritized handoff procedures. IEEE Transactions on Vehicular Technology, 35(3), 77–92.
6. Guerin, R. (1987). Channel occupancy time distribution in a cellular radio system. IEEE Transactions on Vehicular Technology, 36(3), 89–99.
7. Wellens, M., Riihijärvi, J., & Mähönen, P. (2009). Empirical time and frequency domain models of spectrum use. Physical Communication, 2(1–2), 10–32. Cognitive radio networks: Algorithms and system design. http://www.sciencedirect.com/science/article/pii/S1874490709000299.
8. Yavuz, E., & Leung, V. C. M. (2006). Computationally efficient method to evaluate the performance of guard-channel-based call admission control in cellular networks. IEEE Transactions on Vehicular Technology, 55(4), 1412–1424.
9. Lopez-Benitez, M., & Casadevall, F. (2011). Empirical time-dimension model of spectrum use based on a discrete-time Markov chain with deterministic and stochastic duty cycle models. IEEE Transactions on Vehicular Technology, 60(6), 2519–2533.
10. Feldmann, A., & Whitt, W. (1997). Fitting mixtures of exponentials to long-tail distributions to analyze network performance models. Performance Evaluation, 31, 245–279.
11. Du, J. (2002). Combined algorithms for fitting finite mixture distributions. Master's thesis, McMaster University.
12. Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6), 716–723.

Author information

Authors and Affiliations

Communication Systems Lab, The Royal Institute of Technology (KTH), 16440, Stockholm, Sweden
Mohamed Hamid & Slimane Ben Slimane
University of Gävle, Kungsbacksvägen 47, 80176, Gävle, Sweden
Mohamed Hamid & Niclas Björsell

Corresponding author

Correspondence to Mohamed Hamid.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

About this article

Cite this article

Hamid, M., Björsell, N. & Slimane, S.B. Empirical Statistical Model for LTE Downlink Channel Occupancy. Wireless Pers Commun 96, 855–866 (2017). https://doi.org/10.1007/s11277-017-4205-4

Issue Date: September 2017
