1 Introduction

Cloud radio access network (RAN) (C-RAN) is considered to be one of the key technologies to increase efficiency and bring down costs in future mobile networks [1]. In C-RAN, baseband signal processing is offloaded from individual base stations—called remote units (RUs)—to a central unit (CU). This yields many benefits, such as simplified network maintenance, smaller form-factor RUs, efficient use of processing resources through statistical multiplexing at the CU, reduced costs for equipment rooms at base station sites, and spectral efficiency gains from joint processing such as coordinated multi-point (CoMP). On the other hand, it necessitates the deployment of a very demanding fronthaul (FH) network transporting the raw in-phase/quadrature-phase (I/Q) samples from the RUs to the CUs for processing. Commonly, this FH network is implemented based on the Common Public Radio Interface (CPRI) standard [2], which requires data rates of up to 24 Gbps per cell, a round-trip FH latency below 200 μs, low jitter, tight synchronization, and high reliability. These requirements can only be realized with high-capacity fiber or point-to-point wireless links, making the deployment of the FH network very costly, reducing the gains expected from centralization. The most promising approach to reduce the traffic load of the FH interface is the re-evaluation of the so-called functional split [3, 4], i.e., investigating which part of the signal processing can be performed at RUs and which part at CUs. By adopting only partial centralization, the FH requirements can be significantly relaxed while the main centralization benefits persist. These functional splits blur the difference between classical FH and backhaul (BH) networks, calling for converged transport networks that unify BH and FH equipment, and hence reduce deployment as well as operational costs. These networks can be facilitated by the introduction of a next generation fronthaul interface (NGFI) [5].

C-RAN requirements and functional splits have been well investigated for 4G networks [3, 4] and are already considered for standardization by the IEEE 1914 working group [6] and in the CPRI Consortium’s eCPRI standard [2]. However, for 5G networks, the introduction of new radio access technologies (RATs) is currently considered by 3GPP under the term new radio (NR) [7]. These new RATs, which could include technologies such as massive MIMO [8], millimeter wave (mmWave) communication [9], and non-orthogonal waveforms [10], will have a significant impact on the transport network. At the same time, new services, such as the Internet of things (IoT) [11], the Tactile Internet [12], or vehicular communication [13], will add new requirements such as ultra-low latency and extremely high availability. Hence, it is of fundamental importance to design the 5G transport network and the corresponding NGFI in view of the requirements these technologies induce.

Therefore, this article explores key 5G RATs and how their introduction will be reflected in the transport network. For this, we first revisit different functional splits currently under discussion for the NGFI and derive the corresponding peak transport network requirements. We then analyze the most promising 5G RATs to understand their impact on these requirements. For this, we utilize three exemplary configurations of 5G RATs that are aligned with the ongoing standardization as much as possible. To understand the behavior of real network traffic, we utilize live network measurements from a 4G network and extrapolate them to a 5G scenario. From this, we can derive statistical multiplexing gains, which will play a major role in reducing the required transport capacity. Based on these results, we give guidelines on how to design 5G transport networks.

2 5G radio access technologies and their impact on the transport network

2.1 Functional splits

The allocation of functions between the RU and CU, i.e., the functional split, has a major impact on the transport network and the corresponding NGFI requirements regarding data rate, latency, and synchronization. Figure 1 depicts a generic RAN signal processing chain of a mobile network. In principle, the split between RU and CU may be between any of the blocks depicted. In this work, we focus on three functional splits, which are capturing the most relevant trade-offs, denoted as A, B, and CFootnote 1 in Fig. 1. In general, splits higher up in the processing chain offer less centralization gains in terms of RRU size and cooperative processing, while reducing the requirements in terms of fronthaul data rate, latency, and synchronization. In the following, we summarize the most important features of these splits that will be relevant for the rest of the paper. As will be detailed, split A is very similar to the one currently employed in CPRI-based systems, offering full centralization at the price of tight requirements in terms of data rate and latency. Split B offers the benefit of featuring a FH data rate that depends on the load of the air interface, while split C is close to traditional BH, while still offering centralized medium access control (MAC)-layer functionalities like joint scheduling. For more details on the splits and their respective trade-offs, we refer the interested readers to [3, 4].

Fig. 1
figure 1

Functional split options

2.1.1 Split A

Split A resembles the conventional C-RAN setup with a FH interface similar to CPRI. The only difference to CPRI is that we assume that some antenna processing is already performed at the RU, the reason for which will be explained in Section 2.2.3. The data forwarded between RUs and CUs consists of time-domain I/Q samples of constant rate which for this split can be calculated as

$$ {D}_{\mathrm{A}}=2\cdot {N}_{\mathrm{L}}\cdot {f}_{\mathrm{S}}\cdot {N}_{\mathrm{Q},\mathrm{T}}\cdot \gamma, $$
(1)

with N L being the number of analog-to-digital converter (ADC) chains, f S the sampling frequency, N Q,T the resolution of the time domain quantizer, and γ the transport overhead.

For this split, strict latency requirements of around 200 μs are induced by long-term evolution (LTE)’s hybrid automatic repeat request (HARQ) process [1]. However, this is a direct result of the LTE standard. A more fundamental limit to be considered is the channel’s coherence time, as centralized functionalities such as adaptive modulation and coding scheme (MCS) selection, precoding, and scheduling require up-to-date channel information in order to perform adequately. From [14], an approximation of the channel coherence time in seconds can be obtained from

$$ {T}_{\mathrm{C}}\approx \sqrt{\frac{9}{16\uppi}}\frac{c}{\mathrm{v}\cdot {f}_{\mathrm{C}}}, $$
(2)

with c the speed of light in meter per second, v the UE’s speed in meter per second, and f C the carrier frequency in hertz. Note especially that the channel coherence time depends on the carrier frequency, which will play an important role for mmWave communication as discussed later.

In addition to these latency constraints, CPRI defines a synchronization requirement in the form of a delay estimation accuracy of 65 ns. This corresponds approximately to the duration of two samples of LTE, i.e., the delay estimation accuracy, which corresponds to the maximum allowed jitter, can be approximated by

$$ {T}_{\mathrm{J}}\approx \frac{2}{f_{\mathrm{S}}}. $$
(3)

This requirement is motivated by the frequency diversity introduced by the timing offset between multiple antennas [15].

2.1.2 Split B

In split B, the mapping of data symbols to spectral resources is done at the RU. Since data symbols are only exchanged when data is available, the transport capacity is reduced and scales with the instantaneous cell load μ. The data rate for split B can accordingly be calculated as

$$ {D}_{\mathrm{B}}=2\cdot {N}_{\mathrm{L}}\cdot {N}_{\mathrm{SC},\mathrm{act}}\cdot {N}_{\mathrm{Sy}}\cdot {N}_{\mathrm{Q},\mathrm{F}}\cdot {T}_{\mathrm{F}}^{-1}\cdot \mu \cdot \gamma, $$
(4)

with N SC,act being the number of active subcarriers, N Sy the number of symbols per frame, N Q,F the resolution of the frequency domain quantizer, T F the frame duration, and μ the utilization of the subcarriers, i.e., the load. The dependency on the actual load is a strong benefit of this split, as it enables statistical multiplexing, which will be further discussed in Section 3.3.

While split B still needs to meet the same latency constraints as split A, the synchronization on the transport network can potentially be relaxed as the different streams can be aligned at the RU. However, the synchronization between the antenna elements still needs to be ensured.

2.1.3 Split C

In split C, only higher MAC-layer functionalities (e.g., scheduling) are centralized, hence removing the option for joint PHY-layer processing such as joint transmission and reception in CoMP. The decentralization of the HARQ process, however, relaxes the latency requirements, which now depend only on channel coherence times. The transport data rate approximately follows the data rate experienced by the users, i.e., it depends on the used MCSs. In this regard, this functional split is very similar to BH links in current networks. The corresponding data rate can be calculated as

$$ {D}_{\mathrm{C}}={N}_{\mathrm{L}}\cdot {N}_{\mathrm{SC},\mathrm{act}}\cdot {N}_{\mathrm{Sy}}\cdot {R}_{\mathrm{c},\mathrm{MCS}}\cdot { \log}_2{M}_{\mathrm{MCS}}\cdot {T}_{\mathrm{F}}^{-1}\cdot \mu \cdot \gamma, $$
(5)

with R c,MCS being the code rate and M MCS the number modulation symbols. Some of these splits are better suited for certain scenarios than others, e.g., depending on the availability of high capacity fiber or the density of access points. Hence, we foresee that multiple splits will coexist within the same 5G network. Accordingly, a converged network needs to be designed to handle the different traffic types induced by the functional splits, and the different RATs considered for 5G, which are being discussed next.

2.2 5G radio access technologies and use cases

To illustrate the impact of 5G RATs on future transport networks, we utilize three exemplary system configurations. The relevant downlink parameters as well as the resulting requirements in terms of data rate, latency, and synchronization are listed in Table 1 in comparison with a typical LTE system and will be further discussed in the following. Note that the specific 5G parameters are currently under standardization and might change with respect to this work. However, their general impact will made clear to be able to draw important conclusions on the overall network design.

Table 1 Potential parametrization of a 5G radio access network using different frequency bands

2.2.1 Higher carrier frequencies

The adoption of higher carrier frequencies above 6 GHz, including the mmWave bands, is currently widely discussed as a possible enabler for 5G. 3GPP currently considers three bands for evaluation [7]: sub-6 GHz (around 2 GHz or around 4 GHz), around 30 GHz, and around 70 GHz, and, accordingly, we provide exemplary parametrizations for these three bands in Table 1. While higher carrier frequencies are foremost associated with higher bandwidths (see next paragraph), they also lead to lower channel coherence times according to (2). As can be seen from Table 1, this will limit the tolerable FH delay to as little as 30 μs, as centralized precoding and adaptive coding and modulation needs up-to-date channel information in order to function properly. Hence, mmWave carriers will either require a decentralization of these functions, or even lower FH latencies need to be met compared with the ones currently specified in CPRI.

2.2.2 Larger bandwidths

Larger bandwidths are a major factor to enable higher data rates in 5G. This clearly increases the required transport data rates as well, regardless of the split used. The bandwidths considered in this paper (Table 1) are based on the 3GPP considerations [7], assuming that half of the available bandwidth is used for the downlink.

At the same time, the higher bandwidth constrains the delay accuracy in BH/FH due to significantly reduced sample duration, which is directly proportional to the bandwidth. Currently, CPRI has a two-way jitter requirement of 16.276 ns, which is about a quarter of the overall 65-ns time alignment requirement of 2G/3G/4G systems as given in Table 1 and Eq. (3). Looking at the requirements for the higher-bandwidth carriers, it can be observed that timing accuracies down to a few nanoseconds will have to be considered in 5G for centralized baseband processing.

2.2.3 Large antenna arrays

Utilization of large antenna arrays, both in sub-6 GHz and mmWave bands, is expected to be a key component of the envisaged 5G air interface [16]. Massive multi-user MIMO systems benefit from asymptotically orthogonal channels of different users, achieving significantly higher spectral efficiencies. In addition, mmWave systems require very large antenna arrays and corresponding beamforming techniques to overcome unfavorable wireless propagation conditions.

The direct application of CPRI-like FH consisting of I/Q samples for every antenna element (possibly several 100 s) leads to unacceptably high FH data rates. Therefore, only the data corresponding to independent spatial streams should be forwarded, as their number is typically much lower than the number of antennas, thus reducing the transport capacity. As a consequence, data streams have to be mapped to the individual antenna elements at the RUs. While for sub-6 GHz systems, this mapping can be performed in the digital domain using precoders, for mmWave, the performance and power consumption of ADC/DAC chains encourages a partially analog approach, using a so-called hybrid beamforming architecture [17].

In Table 1, a maximum of 16 layers are considered for the 2-GHz carrier, while less layers are assumed for the mmWave carriers.

2.2.4 Higher order modulation

As of release 12 of LTE, modulation schemes up to 256-quadrature amplitude modulation (QAM) are supported on the air interface, and the WiFi standard 802.11ax already considers 1024-QAM. Increasing the modulation order is a straightforward method to increase the spectral efficiency and thus the air interface’s capacity. As these modulation schemes provide an increased peak user data rate, they would also increase the transport data rate required for split C, since it linearly depends on the actual spectral efficiency of the air interface. The transport rates of splits A and B, in contrast, do not directly depend on the spectral efficiency. However, the higher order modulation schemes could ultimately require a higher quantization resolution, both in time and frequency domain, thus increasing the peak transport data rates of split A and B as well. The ADC resolution in Table 1 for the sub-6 GHz system is based on that of LTE systems. Lower resolutions can be expected for mmWave carriers due to the higher sampling rate and the associated power consumption. For frequency-domain samples, as present in split B, it has been observed that a lower resolution can be used [3].

2.2.5 New waveforms and frame structures

Although orthogonal frequency division multiplexing (OFDM) is still considered to be a strong contender for 5G networks, alternative waveforms have been investigated recently. Several multicarrier proposals employing filtering or pulse shaping exhibit better frequency localization properties than the standard OFDM and could be more suitable for use cases requiring flexible spectrum usage or asynchronous access [10]. In LTE systems, roughly 10% of the bandwidth is reserved for reducing adjacent channel interference. This percentage can be significantly reduced by utilizing the new waveforms, increasing the number of active subcarriers (e.g., in Table 1 from 1200 to 1300) and, correspondingly, the transport rates in higher functional splits such as B and C.

These new waveforms are also designed to be more flexible regarding frame and resource grid structure. Accordingly, the transport network will have to cope with varying data rates, which is currently not possible with the static data rate CPRI links.

Finally, the flexible frame structures will also lead to new protocol timings. As previously mentioned, especially the maximum HARQ delay defined for LTE currently limits the tolerable FH delay. If even stricter HARQ timings are introduced for 5G, this could severely limit the applicability of C-RAN architecture for 5G. This again highlights the importance of analyzing the impact of 5G RATs on the transport network already during RAT specification.

Single carrier (SC) waveforms might also play a prominent role in 5G, in particular for machine-type communication (MTC), uplink transmission, and mmWave communications due to better peak-to-average power ratio. In this study, we consider multicarrier modulation for all frequency ranges in order to provide a coherent framework. However, our main conclusions regarding the rate and latency hold for the SC waveforms as well.

2.2.6 Low-latency RAN and transport

A low end-to-end latency is a crucial requirement for the “Tactile Internet” [12]. Latencies as low as 1 ms, including RAN and transport network delays as well as application processing at the server and the user equipment (UE), shall enable a new generation of applications ranging from virtual reality to remote control to factory automation.

Regardless of the functional split, it is clear that the transport network’s share of this latency should be kept to a minimum. While dedicated fiber, as used for CPRI deployments, achieves latencies of a few hundred microseconds (limited by propagation time), in packet-based networks, switching times and queueing significantly impact delay. In addition, the introduction of software defined networking (SDN) [18] in transport equipment (see also Section 4) poses new challenges in this regard. For example, traditional SDN architectures using reactive flow provisioning relay the first packet of each flow to the controller, which might incur unacceptable delays even for low demanding functional splits. Hence, transport SDN architectures based on proactive flow provisioning are required that can still provide the desired reliability and programmability features.

2.2.7 Machine-type communication

Massive MTC will be a major use case in 5G networks, enabling the IoT applications which encompasses a plurality of devices, such as sensors, smart meters, and cars. Although the traffic generated by many IoT applications will be comparatively low, it is very bursty, since few small data packets will be generated irregularly from potentially thousands of devices. Consequently, the total traffic demand on the transport network is expected to be insignificant as compared to, e.g., high-definition video streaming. Thus, utilization-agnostic splits such as split A are highly inefficient for such applications due to their constant transport data rate. In addition, a fixed frame format could be seen as a drawback in a packet-based network, as small packets introduce a large overhead. Instead, a transport network supporting flexible frame formats could better deal with the nature of IoT traffic by, e.g., supporting different levels of packet aggregation.

For other applications such as vehicular communications, very high reliability will be of paramount importance. However, dedicated fiber links as currently used in CPRI-based deployment provide a single point of failure. Instead, a transport network is required in 5G which is able to provide alternative routes in case of outage. For example, ring topologies, which are challenging to implement with dedicated fiber, are quite common in, e.g., Ethernet networks, providing an effective 1 + 1 outage protection.

2.2.8 Network slicing

5G network slicing [16] is a novel architectural concept to enable the diverse set of use cases currently envisioned for 5G. Virtual end-to-end-networks or “slices” are created for each 5G service, which may require a different air interface per slice tailored to a given service, e.g., an “IoT slice” or a “tactile slice.” Each 5G slice could thus benefit from a different functional split according to the service it provides. For example, in [19], an architecture for 5G RANs is proposed where the non-real-time components of the air interface are broken up into functions that can be virtualized and instantiated on a per-slice basis, following the paradigm of network function virtualization (NFV) [20]. Hence, 5G slicing will build on NFV to instantiate per-slice virtual functions, as well as on SDN, to connect the different functions belonging to a given slice. In [21], a cellular architecture combining NFV and SDN is proposed that enables flexible service definition over LTE networks. Similar architectural concepts are likely to be applicable to 5G to define end-to-end network slices. Finally, it is worth highlighting that the impact of adopting SDN/NFV on the security of 5G transport networks deserves special attention and is an area of ongoing research [22].

3 5G transport requirements

3.1 Peak requirements

The peak requirements per cell of the 5G technologies discussed above are summarized at the bottom of Table 1 and are illustrated in Fig. 2. Note that while the parameters of 5G RATs are currently not standardized, the numbers provided are a realistic outlook to the transport requirements if the RATs discussed are indeed implemented. In addition, not all of the technologies might be implemented in the first release of 5G but rather at later stages. Note for example that the data rate given for split C is between 16 and 26 Gbps. Considering that split C is close to the data rates experienced by the user but still includes MAC and transport overhead, these data rates are in line with the capacity of up to 10 Gbps considered for 5G. Looking at the delay accuracy and channel coherence times, we can see that they differ by orders of magnitude depending on the different RATs. This already indicates that future transport networks will have to deal with a large variety of requirements, which will be further discussed in Section 4.

Fig. 2
figure 2

Fronthaul requirements

For the lower functional splits, the data rates range from 35 to almost 200 Gbps per cell. However, these are peak requirements. We have already remarked that a key benefit of splits B and C is that the associated transport data rates scale with the actual utilization of resources on the access link. Thus, when using these splits, dimensioning the transport network according to peak data rates is not efficient in most cases. Instead, the next generation of mobile networks (NGMN) alliance has derived guidelines for dimensioning transport networks based on busy hour utilization, which leverage the statistical multiplexing occurring in practical networks [23]. Hence, it is important to understand the transport requirements introduced by the 5G RATs and functional splits considered under practical network conditions as done in the following.

3.2 Live network measurements

To this effect, we combine the 5G RAT configurations from Table 1 with real downlink measurements from a real-life LTE network. The measurements were collected from 10 LTE sites with C = 33 cells in total, covering an area of about 2.5 km2 in downtown Athens, Greece. Twenty-seven of the 33 cells utilize 20 MHz bandwidth, while the other six cells utilize 10 MHz. The measurements were taken on a 15-min basis for a time period of 15 days. The measurements taken include (for each cell c and 15-min time instance t = (d,m), d being the day and m the 15-min interval):

  • The percentage of utilized physical resource blocks (PRBs) relative to the total number of PRBs, μ(c,t)

  • The MCS distribution, p(MCS i,c,t ), with i = …28 being the index of the 28 MCS utilized in LTE

  • The maximum cell throughput, D max(c,t)

  • The maximum number of UEs per cell, N UE(c,t)

From this, Fig. 3a shows the maximum and average utilization measured, i.e.,

$$ {\mu}_{\max }(t)={ \max}_c\mu \left( c, t\right), $$
(6)

and

$$ {\mu}_{\mathrm{avrg}}(t)=\frac{1}{C}{\displaystyle {\sum}_c}\mu \left( c, t\right), $$
(7)

as well as the MCS distribution p(MCS i ,c,t) of the MCS 0 to 28.

Fig. 3
figure 3

Measured MCS distribution and maximum and average resource utilization for a low network loads and b high network loads

While these measurements are based on a 4G network, we assume that similar traffic patterns will be observed for the mobile broadband use case of 5G, as general user behavior should stay the same.Footnote 2 Furthermore, it can be seen that the average utilization in the measurements is quite low. In order to project a 5G traffic distribution, the load is scaled as described in the following. It is assumed that each user has an average traffic demand per user of D 5G  = 300 Mbps as recommended for the downlink by the NGMN alliance for broadband access in dense areas in [16] and that the spectral efficiency of a 5G network would be approximately five times higher than in a current LTE network. Accordingly, the loads for a high load scenario are defined as

$$ {\mu}^{{\textstyle \hbox{'}}}\left( c, t\right)=\mu \left( c, t\right)\frac{D_{5\mathrm{G}}}{D_{\max }\ \left( c, t\right)/{N}_{\mathrm{UE}}\left( c, t\right)\cdot 5}. $$
(8)

The resulting utilization is shown in Fig. 3b. Thus, we hereafter use the measured utilization μ (“low load”) and scaled utilization μ′ (“high load”) to provide exemplary transport data rate requirements of 5G networks.

In order to dimension the transport network appropriately, we now consider the busy hour traffic following the recommendations introduced by the NGMN alliance in [23]. The busy hour was selected as the hour with the largest utilization sum over all cells, i.e.,

$$ {t}_{\mathrm{busy}}=\underset{m}{\mathrm{argmax}}{\displaystyle \sum_c}{\displaystyle \sum_d}{\displaystyle \sum_m^{m+3}}\mu \left( c, t=\left( d, m\right)\right)\ . $$
(9)

The busy hour was found to be from 12:15 to 13:15 h. From this, a busy hour MCS distribution was calculated as the average MCS distribution in that hour, i.e.,

$$ {p}_{\mathrm{busy}}\left({MCS}_i\right)=\frac{1}{C\cdot D}{\displaystyle \sum_c}{\displaystyle \sum_d} p\left({MCS}_i, c,{t}_{\mathrm{busy}}\right). $$
(10)

The resulting distribution is depicted in Fig. 4a. Note that MCS 10 and 17 are very close to their adjacent MCS in terms of spectral efficiency and are hence not utilized in this network. Accordingly, the figure shows a probability of zero for these two MCS.

Fig. 4
figure 4

Busy hour distributions. a MCS index distribution. b CDF of utilization

All loads observed in the busy hour were used to accumulate a discrete busy hour load distribution as

$$ {p}_{\mathrm{busy}}\left(\mu \in {\displaystyle \underset{d, c}{\cup }}\mu \left( c,{t}_{\mathrm{busy}}\right)\right)=\frac{1}{4\cdot C\cdot D}. $$
(11)

with the factor 4 accounting for the four 15-min traces per hour. The cumulative distribution function (CDF) of the resulting load distributions are illustrated in Fig. 4b.

Using the busy hour utilization and MCS distribution, it is possible to calculate the probability distribution of the data rates of split A, B, and C and for the three different RATs introduced in Table 1. For this, all combinations of occurring loads μ and MCS i are inserted into Eqs. (1), (4), and (5) to obtain the data rates D(μ,MCS i ). Assuming for simplicity that there is no correlation between a certain load and an MCS, the probability of each data rate to occur can then be calculated as

$$ {p}_{\mathrm{busy}}\left( D\left(\mu, {MCS}_i\right)\right)={p}_{\mathrm{busy}}\left(\mu \right)\cdot {p}_{\mathrm{busy}}\left({MCS}_i\right). $$
(12)

To account for the fact that modulation schemes of up to 1024-QAM were assumed instead of the 64-QAM in the actual measurements, the spectral efficiency of each MCS was scaled accordingly by up to 10/6. The resulting complementary CDFs (CCDFs) of the different data rates are depicted in Fig. 5. Note that the data rates of the different RATs are normalized to their peak value (i.e., the values from Table 1) and are hence the same for all three RATs. While the rate is constant for split A, the rates of split B depend on utilization, and those for split C depend both on utilization and on the MCS distribution. We can also see that, e.g., in the low load scenario for split C, the data rate exceeds 30% of the peak rate for only 1% of the time. This fact gives rise to statistical multiplexing which will be discussed in the next section.

Fig. 5
figure 5

CCDF of normalized data rates when employing split A, B, and C

3.3 Statistical multiplexing

Statistical multiplexing can be exploited when two factors are considered: the aggregation of transport streams of several cells and the introduction of a certain outage rate.

First, an outage rate can be considered as the transport network is commonly not dimensioned for peak rates in order to save costs [24]. Instead, it is dimensioned for a certain percentile, e.g., the 95th percentile Q 95, i.e., the offered traffic can be transported without queueing with a probability of 95%:

$$ p\left( D\le {Q}_{95}\right)=0.95. $$
(13)

As an example, consider split C of the low load scenario in Fig. 5: here, the Q 95 is only 17% of the peak data rate Q 100.

Second, when the transport streams of several base stations are aggregated onto one link, the resulting data rate distribution is given as the C-fold convolution of the individual distributions p(D). Accordingly, the percentiles Q(C) of C cells are different from a simply scaled percentile of one cell C Q(1). This combination of outage rate and aggregation of several RUs yields what we call the statistical multiplexing gain. Such gain is illustrated in Fig. 6 for an example of a uniform load distribution. The figure shows the resulting aggregated data rates for the 2-GHz carrier and split B when 1, 2, 4, and 8 cells are aggregated. Note how the uniform distribution becomes increasingly long-tailed; in other words, it is highly unlikely that several cells will exhibit peak rates at the same time instance. In the figure, we also highlight the aggregated peak rates (C Q 100(1)), aggregated 95th percentile without accounting for statistical multiplexing, C Q 95(1), and the 95% percentile of all aggregated cells when considering statistical multiplexing (Q 95(C)) for C = 8 cells. These values illustrate how both the outage percentile and the aggregation of several cells can decrease the required transport data rate on an aggregation link dramatically. As the statistical multiplexing effect cannot be relied on for only one or a few cells, we follow the methodology from [23] and calculate the required capacity as the maximum between the 95th percentile of C cells and the peak capacity required for one cell. We hence define the required data rates and the statistical multiplexing gain as

$$ {R}_{\mathrm{req},\mathrm{nomux}}(C)= C\cdot {Q}_{100}(1)\kern.68em \mathrm{without}\ \mathrm{multiplexing}, $$
(14)
$$ {R}_{\mathrm{req},\mathrm{mux}}(C)= \max \left({Q}_{100}(1),\ {Q}_{95}(C)\right)\ \mathrm{with}\ \mathrm{multiplexing}, $$
(15)
$$ {g}_{\mathrm{mux}}(C)=\frac{R_{\mathrm{req},\mathrm{mux}}(C)}{R_{\mathrm{req},\mathrm{nomux}}(C)}\kern.70em \mathrm{multiplexing}\ \mathrm{gain}. $$
(16)
Fig. 6
figure 6

Convolution of PDFs leading to multiplexing gains for split B and 2-GHz carrier

Figure 7 illustrates the resulting capacity to be deployed in aggregation networks for the different functional splits, for RATs, and for both the high load and low load scenarios. We give the capacity of up to 1000 cells, which we consider to be the approximate order of magnitude for the number of cells in a single large city.Footnote 3 In addition, the corresponding multiplexing gains for the high-load scenario are given in Fig. 8.

Fig. 7
figure 7

Provisioned transport capacity versus number of cells for a low-load scenario and b high-load scenario

Fig. 8
figure 8

Statistical multiplexing gains for high-load scenario

First, note that according to the overall increase in data rates considered for 5G networks, the transport data rates are also expected to increase by order of magnitude and can easily reach the Tbps range. However, the introduction of the lower functional splits B and C can mitigate this effect, and statistical multiplexing can further reduce the required capacity by almost ten times as seen in Fig. 8. The degree of statistical multiplexing gain depends on the number of aggregated cells but shows already high values for a few dozens of cells. In addition, split C, which exhibits the highest variability of per-cell traffic due to its dependence on both the overall load and the utilized MCS, shows the highest gains, while the static rates of split A do not offer any statistical multiplexing. These factors need to be considered in the design of future transport networks in order to make them economically feasible, which will be discussed in the next section.

4 Design guidelines for 5G transport networks

From the analysis and results in the previous sections, important conclusions can be drawn towards the design of 5G transport networks, which will be discussed next. This covers the necessity of converging fronthaul and backhaul networks, the technologies that can enable this convergence, and finally a change in network management towards SDN.

4.1 Transport network convergence

The 5G NGFI capacity requirements derived in Section 3 indicate that full centralization of baseband processing is not reasonable for all air interface technologies. The introduction of bandwidth in the GHz range, as well as large antenna arrays, increase transport data rates beyond what can be supported with current or near-future transport technology at feasible costs. Therefore, functional splits, where the transport rates are proportional to the actual utilization, should be considered for future 5G systems.

We also showed in the previous section how statistical multiplexing can also contribute to greatly reduce the required capacity on aggregation links. We would like to note that the statistical multiplexing gain will also greatly depend on the variance of the transported traffic in addition to the number of aggregated cells. The above analysis included only traces from an urban environment. It can be expected that aggregating more diverse types of cells will increase the variance of the traffic and—correspondingly—the multiplexing gain. Accordingly, future transport networks should aim to aggregate as diverse cells as possible, multiplexing, e.g., small and macro cells or urban and rural ones. Furthermore, recall that the used traces were performed on a 15-min time scale, which basically constitutes an averaging over 15 min. Hence, it can be expected that multiplexing on the timescale of a packet duration would further increase the traffic variance and, subsequently, the statistical multiplexing gain. In addition, the 5G transport network will have to support different traffic types along with varying requirements, not only according to the different splits and air interfaces but also depending on the use cases. For example, while massive MTC applications might require very low data rates and can deal with a certain degree of packet loss, a low-latency and resilient transport network will be required for other applications. Flexible waveforms with variable bandwidth, frame structure, and data rates should also be reflected in the transport network. Flexibility is hence a key aspect for future transport which could be achieved by utilizing packet switching. Since packet-based networks are currently in use for BH traffic, this calls for a convergence of FH and BH networks, transporting different types of traffic over a shared infrastructure, to reduce hardware diversity and to share resources, while simplifying management and deployment. However, currently used technologies will have to be adapted to support packet switching.

4.2 Enabling transport technologies

Dedicated fiber connectivity, currently favored for CPRI-based FH networks, does not offer resource sharing between different fibers or wavelengths. Packet-switched networks based on Ethernet are the most promising alternative as reflected by recent initiatives such as the IEEE 1914 Working Group [6]. Currently, synchronization is still challenging in packet-based networks and jitter introduced by switches and queues could become critical for FH traffic. In this regard, technologies such as Synchronous Ethernet [25] and the Precision Time Protocol (IEEE 1588) [26], which are currently investigated by, e.g., the IEEE 802.1 Time Sensitive Networking Task Group [27], are good candidates to efficiently support FH over Ethernet.

Supporting variable data rates is also a particular challenge at the physical level, due to the inflexible, dedicated links currently utilized. A packet-based flexible optical transport will comprise both passive and active solutions. Passive optical network (PON) solutions will be based on wavelength division multiplexing (WDM)-PONs, while active solutions will adopt more flexible and dynamic WDM technologies such as the time-shared optical network [28], which enable very granular sub-wavelength bandwidth allocation that is a key to efficiently utilize optical bandwidth in the aggregation/metro segment of converged FH/BH networks. Wireless transport technologies are also considered as candidate technologies for future transport networks due to their lower cost and higher flexibility compared to fiber.

Recently, drone-based mobile networks have been proposed in order to provide additional coverage to support temporary high traffic demand or unexpected or critical scenarios [29]. While still in early stages of research, it can already be foreseen that such networks will pose new challenges on the transport network, with both wireless and wired solutions (via a tether, see [30]) having been proposed. However, such aerial networks could also provide new solutions for future transport networks, e.g., by acting as relays or providing additional transport capacity via aerial routes in times of unexpected high transport traffic or failure of ground-based transport equipment.

4.3 Quality of service and management

Current CPRI-based transport networks consider just two types of traffic to be transported: data and control signals. Synchronization is directly supplied via CPRI line rate. Given the variety of potential 5G RATs, applications, and functional splits, many different types of traffic will have to be transported in 5G transport networks, each with their own requirements. While split A traffic will require low latencies, split C traffic can typically cope with much relaxed latencies. On the other hand, Tactile Internet traffic will require low latencies even for split C. To transport these different types of traffic over a unified network with singular requirements based on the strictest use case would clearly be cost-inefficient. A future transport network will hence have to support streams with different qualities of service. Packet-based networking can here also help to facilitate this via packet prioritization. However, this will make the management of the network more challenging, calling for SDN solutions.

SDN [18] is a recent networking paradigm, which separates control and data planes to enhance flexibility and to achieve programmability of network technologies. SDN is a key enabler for converged FH/BH networks in 5G. It will be used to virtualize the transport network in order to support slicing and to allow a flexible deployment of virtual functions in different places of the network, as is required for the support of flexible functional splits. In some cases, a separate out-of-band network for SDN signaling traffic may be too expensive to maintain, or reliability constraints may require a certain degree of distributed control to be kept in the network elements, thus, balancing between distributed and centralized control. Therefore, further research is needed to holistically apply the SDN paradigm to transport networks.

5 Conclusions

5G mobile networks will utilize a diverse set of access technologies, which will increase FH and BH requirements dramatically. Furthermore, multiple RATs with different degrees of centralization as well as novel use cases must be supported at a reasonable cost. The impact of those developments must be taken into account for the design of 5G transport networks and the corresponding next generation fronthaul interface. We showed how new functional splits can avoid a costly increase in necessary transport capacity which, otherwise, would be required by new technologies like mmWave and massive MIMO. By analyzing traffic measurements from a real-life, commercial network, we were able to illustrate that the combination of utilization-dependent functional splits and statistical multiplexing can additionally reduce aggregated transport traffic by up to almost one order of magnitude as compared to today’s CPRI-based networks. However, this will require a more flexible, dynamically configurable transport network, which can transport different types of traffic. To enable the required degree of flexibility while keeping the network manageable, a converged, packet-based, and SDN-enabled transport network will be required to support 5G radio access networks.