A new method for traffic forecasting in urban wireless communication network
- 228 Downloads
Abstract
With the development of wireless devices and the increase of mobile users, the operator’s focus has shifted from the construction of the communication network to the operation and maintenance of the network. Operators are eager to know the behavior of mobile networks and the real-time experience of users, which requires the using of historical data to accurately predict future network conditions. Big data analysis and computing which is widely adopted can be used as a solution. However, there are still some challenges in data analysis and prediction for mobile network optimization, such as the timeliness and accuracy of the prediction. This paper proposes a traffic analysis and prediction system which is suitable for urban wireless communication networks by combining actual call detail record (CDR) data analysis and multivariate prediction algorithms. Firstly, a spatial-temporal modeling is used for historical traffic data extracting. After that, causality analysis is applied to communication data analysis for the first time. Based on causal analysis, multivariate long short-term memory models are used to predict future data for CDR data. Finally, the prediction algorithm is used to process real data of different scenes in the city to verify the performance of the entire system.
Keywords
CDR Spatial-temporal modeling Causality analysis LSTM multivariate time series forecastingAbbreviations
- AIC
Akaike information criterion
- CDR
Call detail record
- LSTM
Long short-term memory
- MAE
Mean absolute deviation
- RMSE
Root mean squared error
- VARIMA
Vector autoregressive integrated moving average
1 Introduction
- 1.
Avoiding the dissatisfaction of user experience in advance, which is also the main concern of operators.
- 2.
Because the traffic levels of different scenarios in the city are not same, for example, the traffic volume of a station is higher than that of a residential area, which causes the prediction algorithm to be not very adaptable to the whole urban communication network.
In this paper, in order to solve these problems, we extract CDR data through spatial-temporal modeling based on grids of urban communication network. The extracted data would be used for further causality analysis. Causality is mainly reflected in space, which means the causality of traffic variation trend between predicted area and surrounding areas. From the perspective of data analysis, according to the Granger causality test, the data of surrounding areas leading to traffic changes in the prediction area are obtained. Then, the extracted traffic data of adjacent grids which have strong causality, and the greatest impact on the prediction area are used for the final multivariate time series prediction. LSTM is used for multivariate time series prediction.
For urban wireless communication networks, grid partitioning is a common method of spatial-temporal modeling. On this basis, the traffic of each grid is changing in real time, and the trend of variation is different. Therefore, the urban network can be regarded as a dynamic spatial-temporal system. It is the dynamic changes of traffic that the network optimization should be timely adjusted to ensure the user’s experience. The purpose of this method is to forecast the traffic changes in each region and to provide reference for resource allocation and load balancing. Accurate and timely adjustment of a resource allocation strategy and network optimization requires high-accuracy prediction. The purpose of this paper is to improve the accuracy of traffic forecasting. A traffic forecasting system suitable for urban communication network is proposed. Moreover, the effectiveness (short-term prediction with low complexity) and the universality (multi-scenario of cities) of the algorithm are guaranteed.
To the authors’ knowledge, it is the first time that causality analysis has been introduced into the analysis of communication data. In fact, causality is crucial for time series analysis. Causal analysis has always been an important part of economics, because economists are not concerned about how things happen together, but about how the relationship between things is transmitted. In the prediction of multivariate time series, how to select multiple time series is the most noticeable problem in data extraction. Correlation analysis is the most common method for selecting sequences. From the perspective of forecasting, correlation analysis and causality analysis are all possible, but from the perspective of data analysis, the results of causality analysis are more convincing and the results of multivariate time series forecasting will be more accurate through the combination with causality analysis. For example, there are two events, A and B. The results of correlation analysis of the two events are highly correlated. When A leads to B, we use B to fit the model based on previous correlation analysis to predict A. But when A has not happened, using B to predict A is not actually a prediction of the future, but an inference of known events. Causality analysis can make us know which is the cause and which is the result. We find that B cannot predict A well through causality analysis, but B can be predicted more accurately through A. Back to our previous question, for grid-based urban communication networks, changes in each region of the city are related to a certain extent [3]. So, the causality between grids in this system is very strong, which is also the reason why the author introduced causality analysis.
The Granger causality test is the most widely used method in causality analysis, and it has been applied to reveal interdependence structure in multi-variate time series [4]. The Granger causality test is also derived from economics, but it has been gradually used in other fields and has been popular in recent decades. Whether it is economics [5], meteorological science [6], or neuroscience [7], it has a wide range of applications. Moreover, the algorithm has also made some progress on the basis of the original [8].
According to the results of causality analysis, LSTM algorithm is used to predict multiple time series. LSTM algorithm is a special recurrent neural network (RNN) structure. It has long short-term memory (LSTM) units. RNN composed of LSTM units is usually called LSTM network [9]. The difference between LSTM and traditional RNN neural network is that each neuron in LSTM is a memory cell. LSTM links previous data information to the current neuron. Each neuron contains three gates: input gate, forgetting gate, and output gate, which are different from RNN. Using these internal gates, LSTM can solve the problem of long-term data dependence, especially for the prediction of data in a communication network, because the prediction of communication network requires high timeliness and low complexity. Timely fault handling is necessary for network optimization.
According to the characteristics of urban wireless communication network, the authors adopt multivariable prediction based on spatial-temporal model. Multivariable comes from the causality analysis of different grids in the spatial-temporal model. Through the causality test of the previous step, the time series with strong causality is extracted and used for multivariable LSTM prediction. Compared to the LSTM algorithm, multivariable LSTM is embodied in multiple input data, and the output data is the future value of the main prediction time series. Specific introduction of LSTM and its application in this paper will be given in the following chapters. Finally, by comparing the performance with other algorithms and non-causal analysis algorithms, the performance of the prediction system proposed by the author is verified. The algorithm is simulated in three scenarios in the city, which verifies the universal applicability of the algorithm in the urban communication network.
- 1.
A traffic prediction system suitable for urban communication networks is proposed, and it is universal for all urban scenes.
- 2.
Causal analysis for CDR data is proposed for communication network analysis and to enhance the accuracy of multivariate time series prediction.
- 3.
Multivariate time series LSTM prediction method based on spatial-temporal model of communication networks is proposed.
2 Urban communication network prediction system
3 Algorithm simulation
3.1 Data processing
Nowadays, cellular network technology is the most widely used technology in the world. A communication system is composed of several base stations. Mobile devices that receive strong enough signals in the base station area are connected to the network and thus can be used for communication [10]. The design of the network determines the size of each community. The size of micro-cellular community in urban environment is generally 300 m, and some macro-community in rural environment can reach 30 km [11]. All adjacent cells are overlapping, allowing a continuous connection to the network when the mobile equipment is moving. Many adjacent cells are grouped in zones identified by a local area code (LAC) [11]. Operators will keep detailed records of mobile devices in use. These records are called call detail records. CDR data generally includes time stamp, cell number, IMEI (International Mobile Equipment Identity), and time type. This information is highly correlated in spatial-temporal.
In the study of communication networks, urban scenarios are often divided into four categories: office, station, entertainment, and residential areas [13, 14, 15]. In order to improve the universality of the algorithm, we analyze the entertainment area (Quadrilatro della moda) and the office area (Politecnico di Milano). In addition, traffic often increases dramatically in the scenario where major events occur.
3.2 Granger causality test
Causality can be defined by the dependence between variables, that is, the variable as the result is determined by the variable as the cause, and the change of the cause variable causes the change of the result variable. Granger points out that if one variable X is not helpful in predicting another variable Y, then X is not the cause of Y; on the contrary, if X is the cause of Y, two conditions must be satisfied [16]:
Firstly, X should be helpful in predicting Y, that is, in the regression of Y’s past value, adding X’s past value as an independent variable should significantly increase the explanatory power of regression.
Secondly, Y should not be helpful in predicting X. The reason is that if X is helpful in predicting Y and Y is helpful in predicting X, there may be one or more other variables, which are both the cause of X change and the cause of Y change.
Now, people generally call this causality defined from the perspective of prediction Granger causality. Specifically speaking, for the data we extracted, let us assume that X is traffic data in the central area, which is sampled at different time points {X_{1}, X_{2}, X_{3}, …X_{n}}. Where n is the total number of training set samples. Time series Y is traffic data in one of the adjacent regions, from {Y_{1}, Y_{2}, Y_{3}, …Y_{n}}. Now, we use the past of X to predict the future of X. For example, we use X_{1}~X_{n − j} (which is the past value of Y) to predict X_{n − j + 1}~X_{n} (which is the past value of X). In the process of prediction, we produce an error of δ_{1} and then regard this error as the first result we get.
Then, we use the past of X and Y to predict the future of X, such as {X_{1}~X_{n − j}| Y_{1}~Y_{n − j}} to predict X_{n − j + 1}~X_{n}, and an error of δ_{2} is generated in the process of prediction. If δ_{1} is less than δ_{2}, that is to say, the combined prediction error of X and Y is less than the prediction error of X itself, then it must be because Y is helpful to the prediction of X, so the prediction error is reduced. In this case, we call Y Granger cause to X [17].
So, whether variable X is the Granger cause of variable Y is testable. But some processing of data is needed before testing.
Subsequently, unit root test is done for time series data. If the conclusion is that the time series has unit root, then we can be sure that the sequence must not be stationary, and then carry out the subsequent differential processing. On the contrary, subsequent analysis is carried out through AIC criteria.
Our goal is to search lag in the range of 1~n to minimize the value of AIC. And the lag that makes the AIC minimum is the order lag we want. The last step is the normal distribution test and consistency test [18]. Since error obeying normal distribution is a prerequisite for solving regression problems by least square method, the purpose of normal distribution test is to detect whether the residual after regression is obeying normal distribution. If it is not obeyed, the data does not satisfy the precondition of using least squares method and the basis of solving Granger causality.
For consistency test, when the data points of time series are regressed. It is not possible to determine whether the theoretical and actual values obtained by regression come from the same distribution. At this time, consistency test should be adopted. If the conclusion of the consistency test shows that the gap between theoretical value and actual value is small, the regression results are good.
So far, we have completed all the processing steps before causality test. In fact, after such processing and analysis steps, we can do a Granger causality test which is somewhat demanding for data for a complex time series in urban communication network. In causality checking, we proceed step by step [19].
Among them, α_{0} denotes the constant term, P and Q are the maximum lags of Y and X, respectively, ε_{t} is the white noise. Then F-statistics are constructed by sum of residual squares of two regression models which are RSS_{u} and RSS_{r}.
Among them, n is the number of samples. The original hypothesis can be tested by function 3. If F ≥ F_{ɑ} (q, n − p − q − 1), then β_{1}, β_{2}, … β_{q} is significantly not 0. We should reject the hypothesis that X is not the Granger cause of Y; on the contrary, we cannot reject this hypothesis.
Step 2: Exchange the positions of Y and X, and test the original hypothesis in the same way: “Y is not the Granger cause of X change”.
Step 3: To reach the conclusion that “X is the Granger cause of Y,” we must reject the original hypothesis that “X is not the Granger cause of Y” and accept the original hypothesis that “Y is not the Granger cause of X.”
We use this method to analyze the traffic sequence of each region. In this paper, the data are divided into test set and training set. The training data are traffic data from 0:00 to 18:00, and the data from 18:00 to 24:00 are test data. According to the spatial distribution, multiple time series can be obtained. Causality checking is carried out among multiple time series to obtain the causality between traffic in adjacent areas and traffic changes in central areas. It is equivalent to finding out the “cause” of traffic change in the main forecast area and combining the traffic data of the main forecast trend with these causal data to analyze and forecast the future value of the main area.
3.3 LSTM prediction algorithms for multivariate time series
According to the multivariate data obtained from causal analysis, the traditional linear model is difficult to solve the multivariate or multi-input problem, while the neural network such as LSTM is good at dealing with the problem of multiple variables, which makes it helpful to solve the problem of time series prediction. So multivariate LSTM algorithm is used for prediction.
The author fits our LSTM model with keras, a deep learning library [23]. Take the stadium scene data as an example. The first hidden layer defines a LSTM with 50 neurons and a neuron in the output layer used to predict contamination. The time step of input data is 1, which has three characteristics, namely, three input variables, the data of the central region and the data of the two regions which cause the change of the data of the central region after causal analysis. The output variable is the prediction result of traffic volume in the central area. The data are divided into test set and training set as mentioned above.
4 Results and discussion
In order to fully verify the performance of the algorithm, the author compares two other prediction methods, one is correlation-based multivariate LSTM. In this model, the data of different grids in the same scene are correlated with the data of the central area through Pearson correlation coefficient calculation [24]. After that, the highly correlated data are used for subsequent multivariate time series prediction. Another method is the VARIMA (vector autoregressive integrated moving average) model [25], which is often used for prediction. When the VARIMA algorithm is applied, all grid data in the same region are input data, and the output is the prediction of the future value of the central grid data of the region. Both the two multivariate time series prediction algorithms are considered as good choices to improve prediction accuracy.
Error comparison of each algorithm
Error calculation by | RMSE | MAE | ||||
---|---|---|---|---|---|---|
Algorithm in different areas | Quadrilatro Della Moda (office area) | Politecnico di Milano (entertainment area) | Stadio San Siro (stadium with activities) | Quadrilatro Della Moda (office area) | Politecnico di Milano (entertainment area) | Stadio San Siro (stadium with activities) |
LSTM based on causality | 1.7491 | 9.3764 | 56.0866 | 1.4836 | 6.6944 | 36.9817 |
LSTM based on relevance | 1.8073 | 9.5862 | 72.1137 | 1.5028 | 7.6139 | 51.7222 |
VARIMA | 3.3652 | 18.7506 | 128.3342 | 2.7729 | 11.9167 | 72.1137 |
5 Conclusion
Urban wireless communication network is a complex system. To allocate resources reasonably is the key to network optimization. Accurate prediction algorithm is the basis of network optimization. In this paper, the author transformed the problem of network analysis into a time series model through the method of big data analysis and made quantitative analysis of urban communication network while retaining the spatial-temporal characteristics. Firstly, spatial-temporal modeling was introduced through rasterization, and then data extraction was carried out according to this model. In the following data analysis, the Granger causality test was introduced for the first time. Then, multivariate LSTM algorithm was used to predict the traffic. From the simulation results, the addition of causality can improve the accuracy of multivariate time series prediction, and this prediction system is generally applicable to communication networks. Causality analysis can be used to analyze the dynamic variation of urban wireless communication network more effectively, and more accurate prediction results can be obtained, which will considerably assist to the maintenance and management of the network. As we all know, the most important concern of MNOs (mobile network operators) is the real-time changing of network state. In this paper, the proposed analysis and prediction system for urban communication network can not only make operators better understand the real-time changing of urban network, but also help operators manage the network primely. The author’s future work will focus on the specific parameters of the network, which have a large amount of wireless data and complex data types. But only in this case can each user’s experience be better concerned in real-time. Hence, it will be the focus in further studies.
Notes
Acknowledgments
This work described in this paper was supported by the National Science and Technology Major Project: No. 2018ZX03001029-004.
Funding
The funding for the research reported is provided by the National Science and Technology Major Project: No. 2018ZX03001029–004. The funds are mainly used for simulation hardware support.
Availability of data and materials
The datasets supporting the conclusions of this article were collected from reference [12].
Authors’ contributions
WG conceived and designed the study. KZ and SM performed the simulation experiments. KZ and XL wrote the paper. GC and ZS reviewed and edited the manuscript. All authors read and approved the final manuscript.
Authors’ information
Kaisa Zhang received the B.Sc. degree in communication engineering from Hebei University of Science and Technology in 2014. He received MEng degree in communication engineering from the University of York in 2016. He is currently working towards the Ph.D. degree in information and communications engineering, Key Laboratory of Universal Wireless Communications, Ministry of Education, Beijing University of Posts and Telecommunications Beijing, China; His main studying area is big data analysis and processing, self-optimizing and machine learning in communication network.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Z. Wang, S. Zhang, in 2016 IEEE 14th Intl Conf on Dependable, Autonomic & Secure Computing, 14th Intl Conf on Pervasive Intelligence & Computing, 2nd Intl Conf on Big Data Intelligence & Computing & Cyber Science & Technology Congress (DASC/PiCom/DataCom/CyberSciTech). CDR based temporal-spatial analysis of anomalous mobile users. p. 710–714 (2016). https://ieeexplore.ieee.org/document/7588924.
- 2.S.C. Nair, M.S. Elayidom, S. Gopalan, 2017 4th International Conference on Advanced Computing and Communication Systems (ICACCS). Impact of CDR Data Analysis Using Big Data Technologies for the Public: an Analysis. p. 1–6 (2017). https://ieeexplore.ieee.org/document/8014651.
- 3.S. Jiang, J. Ferreira, M.C. Gonzalez, Activity-based human mobility patterns inferred from mobile phone data: a case study of Singapore. IEEE Trans. Big Data 3(2), 208–219 (2017).CrossRefGoogle Scholar
- 4.E. Siggiridou, D. Kugiumtzis, Granger causality in multivariate time series using a time-ordered restricted vector autoregressive model[J]. IEEE Trans. Signal Process. 64(7), 1759–1773 (2016).MathSciNetCrossRefGoogle Scholar
- 5.C. Hiemstra, J. Jones, Testing for linear and nonlinear Granger causality in the stock price-volume relation. J. Financ. 49(5), 1639 (1994).Google Scholar
- 6.A.C. Lozano, H. Li, in Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France. Spatial-temporal causal modeling for climate change attribution. (2009). https://www.researchgate.net/publication/221654446_Spatial-temporal_causal_modeling_for_climate_change_attribution.
- 7.A. Roebroeck, E. Formisano, R. Goebel, Mapping directed influence over the brain using Granger causality and fMRI. Neuroimage 25(1), 230–242 (2005).CrossRefGoogle Scholar
- 8.Y. Chen, G. Rangarajan, J. Feng, M. Ding, Analyzing multiple nonlinear time series with extended Granger causality. Phys. Lett. A. 324(1), 26–35 (2004).Google Scholar
- 9.Y.T. Tsai, Y.R. Zeng, Y.S. Chang, in 2018 IEEE 16th Intl Conf on Dependable, Autonomic and Secure Computing, 16th Intl Conf on Pervasive Intelligence and Computing, 4th Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech). Air pollution forecasting using RNN with LSTM. P. 1074–1079 (2018). https://ieeexplore.ieee.org/document/8512020.
- 10.C.W. Huang, C.T. Chiang, Q. Li, in 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC). A study of deep learning networks on mobile traffic forecasting. p. 1–6 (2017). https://ieeexplore.ieee.org/document/8292737.
- 11.E. Thuillier, L. Moalic, S. Lamrous, A. Caminada, Clustering weekly patterns of human mobility through mobile phone data. IEEE Trans. Mob. Comput. 17(4), 817–830 (2018).Google Scholar
- 12.(2014). [Online]. Available: https://dandelion.eu. Accessed Nov 2018.
- 13.J. Reades, F. Calabrese, A. Sevtsuk, C. Ratti, Cellular census: explorations in urban data collection. IEEE Pervasive Comput. 6(3), 30–38 (2007).Google Scholar
- 14.M. Panda, S.P. Padhy, Traffic analysis and optimization of gsm network. IJSCI Int J Comput Sci Issues 1, 28–31 (2011).Google Scholar
- 15.X. Zuo, Y. Zhang, Detection and analysis of urban area hotspots based on cell phone traffic. JCP 7(7), 1753–1760 (2012).Google Scholar
- 16.A. Seth, Granger causality. Scholarpedia 2(7), 1667 (2007).CrossRefGoogle Scholar
- 17.C.W.J. Granger, Econometrica 37, 424 (1969).CrossRefGoogle Scholar
- 18.F. Xue, G. Fang, X. Yue, E. Zhao, S. Brauth, Y. Tang, Resting-state brain networks revealed by Granger causal connectivity in frogs. Neuroscience, 334, 332–340 (2016).Google Scholar
- 19.M. Dhamala, G. Rangarajan, M. Ding, Estimating Granger causality from Fourier and wavelet transforms of time series data. Phys. Rev. Lett. 100(1), 018701 (2008).CrossRefGoogle Scholar
- 20.D. Yang, H. Chen, Y. Song, Z. Gong, in 2017 IEEE International Conference on Big Knowledge (ICBK). Granger causality for multivariate time series classification. p. 103–110 (2017). https://ieeexplore.ieee.org/document/8023401/.
- 21.Z. Cao, Y. Zhu, Z. Sun, M. Wang, Y. Zheng, P. Xiong, J. Hou, L. Tian, Improving Prediction Accuracy in LSTM Network Model for Aircraft Testing Flight Data. p. 7–12. (IEEE International Conference on Smart Cloud (SmartCloud), 2018). https://ieeexplore.ieee.org/document/8513708.
- 22.Y. Gu, S. Liu, L. He, L. Wang, Research on Failure Prediction Using DBN and LSTM Neural Network. 2018 57th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE) (2018).Google Scholar
- 23.P. Vidnerová, R. Neruda, in Federated Conference Computer Science Information Systems. Evolving KERAS architectures for sensor data analysis (2017).Google Scholar
- 24.C. Thirumalai, C. Raju, G. Saikrishna, M. Senthilkumar, in International Conference on Trends in Electronics and Informatics ICEI. Analysis of global warming in India over maximum temperature using Pearson and Machine learning (2017).Google Scholar
- 25.R.S. Tsay, Multivariate Time Series Analysis and Its Applications[M]// Analysis of Financial Time Series, Second Edition (2000).Google Scholar
Copyright information
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.