Introduction

The exponential growth of wireless network service users worldwide orchestrates the need to deploy novel enabling technologies to satisfy billions of data-hungry applications [1]. In recent times, the emergence of the Internet of Things (IoT) has ushered in new-age internet-enabled smartphones and machine-to-machine communications (M2M) for the growing mobile users [2]. However, the current network infrastructure is already overstretched, and there is a need for novel technologies to improve the existing wireless network architecture [3]. From managing security and privacy challenges, network spectrum issues, traffic spikes, complex network configuration, prohibitive operating costs, network failure, to hardware compatibility issues and more, the ubiquitous fifth-generation (5G) wireless network architecture needs to be enhanced to accommodate these growing concerns optimally [4, 5]. Recently, key enabling technologies for the envisioned beyond 5G and 6G wireless systems have been proposed [6,7,8,9]. Interestingly, these enablers include cell-free massive MIMO (CF-mMIMO) technology [10,11,12], mmWave communication [13,14,15], terahertz communication [16,17,18], quantum communication [19, 20], directional beamforming [21], reconfigurable intelligent surfaces (RIS) [22,23,24], and more.

The concept of CF-mMIMO introduced in [25] presents a promising alternative to guarantee high quality of service (QoS) to all UEs. CF-mMIMO leverages the idea of small-cells (SC), massive MIMO, and user-based joint transmission coordinated multi-point (JT-CoMP) [26] to deal with inter-cell interference [27]. Additionally, CF-mMIMO provides massive macro-diversity to mitigate path loss [28] via minimizing the adverse effects of spatially correlated fading and shadowing [29]. In this case, several ubiquitous access points (APs) with single or multiple antennas jointly serve a smaller number of distributed UEs over the coverage area in time-division duplex (TDD) mode [30]. CF-mMIMO has been described as an embodiment of network MIMO and is regarded as an alternative network MIMO [31]. Compared to the fully distributed SC system and massive cellular MIMO, CF-mMIMO has improved performance under several practical conditions, including but not limited to favorable propagation and channel hardening with spatially well-separated UEs and APs. This results in increased macro-diversity gain from the low distance between UEs and APs [3, 32, 33].

Currently, there is a growing interest in the implementation of sustainable and greener CF-mMIMO systems [34] to boost the energy efficiency (EE) [35, 36] of wireless systems, offset the power consumption cost [37], and minimize the environmental impacts of wireless systems [38]. Several optimization techniques, EE-saving algorithms, and robust power control models [39, 40], such as energy cooperation [41], reconfigurable intelligent surface (RIS) [23, 42, 43], and more, have been explored. Combining CF-mMIMO and simultaneous wireless information and power transfer (SWIPT) technique is considered to drive energy-limited user devices and improve the EE of next-generation wireless networks [44,45,46]. Given that several APs and several users are involved in a CF system [47], the deployment cost and energy consumption may rise, and more energy resources [48, 49] would be required. In order to realize CF-mMIMO in practice, carrier frequency and sampling rate offsets, In-phase/quadrature-phase (I/Q) imbalance, phase noise, and analog-to-digital converter distortions need to be examined [50, 51]. Toward this end, this paper provides an extensive survey on the areas of application of CF-mMIMO in next-generation wireless communication systems. A comprehensive layout of the paper is presented in Fig. 1. Furthermore, different massive MIMO-based solutions, massive cellular MIMO, and network MIMO are examined critically. The design and system configurations, system modeling, application scenarios, potentials, and associated challenges of CF-mMIMO are discussed extensively. Additionally, open research issues, lessons learned, and future research directions are outlined.

Fig. 1
figure 1

A comprehensive layout of the paper

This survey is focused on applying CF-mMIMO in 5G and beyond 5G (B5G) wireless networks. The key highlights of the survey are outlined as follows.

  1. 1.

    We present a background on the evolution of CF-mMIMO in emerging wireless communication systems.

  2. 2.

    We present an overview of cellular massive MIMO, network massive MIMO, and CF massive MIMO, highlighting their areas of application, strengths, limitations, and performance comparison of these architectures with reference to key design metrics, interference management, channel hardening, and SE, among others.

  3. 3.

    We examine the configuration details and system modeling of CF-mMIMO, emphasizing the uplink/downlink (UL/DL) pilot-aided channel estimation, UL/DL training, channel hardening, and outage probability.

  4. 4.

    We highlight key areas of application of CF-mMIMO such as in SWIPT, power control, NOMA, SE, and EE.

  5. 5.

    We present key research findings and current trends in CF-mMIMO, highlighting their focus, coverage, prospects, and limitations.

  6. 6.

    We highlight open research issues, future research directions, and key take-away lessons on CF-mMIMO deployment in 5G and B5G wireless networks.

The rest of this paper is organized as follows. The literature review is presented in the “Related work” section. A comprehensive description of the traditional MIMO architecture, massive cellular MIMO, network MIMO, and CF-mMIMO is presented in the “Overview of massive MIMO systems” section. The “System model of cell-free massive MIMO” section offers a detailed account of the CF-mMIMO system modeling and configuration. The application of CF massive MIMO in channel hardening, NOMA, EE, and more are discussed in the “Areas of application of cell-free massive MIMO” section. Open research issues and lessons learned are highlighted in “Open research issues and lessons learned” section. Finally, the “Conclusions” section gives a concise conclusion to the paper.

Related work

Cell-free massive MIMO has attracted considerable research interest in the past decade, and it is currently regarded as a key 5G and beyond 5G physical layer technology [52,53,54,55]. The authors in [56] provide an in-depth exposition into simulation platforms and insightful schemes for emerging 5G interfaces. An extensive overview of several solutions for 5G infrastructures including, but not limited to massive MIMO, millimeter-wave (mmWave), NOMA, and also the latest achievements on simulator capabilities, are clearly outlined. Also, artificial intelligence (AI)-based discontinuous reception (DRX) technique for greening 5G enabled devices have been proposed in [57]. The proposed mechanism significantly outperforms the conventional long-term evolution (LTE)-DRX technique in efficient energy savings. In recent times, the use of deep learning techniques to perform power control in wireless communication networks has been studied in [58,59,60]. Reference [61] advocate using deep neural networks to perform joint beamforming and interference coordination at mmWave.

Additionally, the authors in [62] consider incorporating channel hardening in CF-mMIMO using stochastic geometry and evaluated the potential constraints to its practical implementation. It suffices that the channel hardening effect is more noticeable in massive cellular MIMO than in CF-mMIMO and depends mainly on the number of antennas per AP and the pathloss exponent of the propagation environment [63]. Fortunately, the authors [64] have shown that significant improvement in channel hardening is achievable with the normalized conjugate beamforming (NCB) precoder compared to the conjugate beamforming (CB) scheme. The works [65,66,67] characterized the coexistence of CF-mMIMO and SWIPT utilizing the Poisson point process (PPP) model.

Furthermore, an insight into the achievable harvested energy, channel variations due to fading and path loss, and UL/DL rates in closed form are considered [68, 69]. The detrimental effects of pilot contamination on the performance of CF-mMIMO are highlighted in [70, [1]. The authors in [53] proposed allocating pilot power for each user in the network to palliate this defect. Interestingly, interference management and joint user association aimed at minimizing cell-edge effects are studied [70]. References [11, 71] have reported novel scalable and distributed algorithms used for initial access and cooperation cluster formation in CF-mMIMO. The authors also proposed scalable signal-to-leakage-and-noise ratio precoding to address the scalability issues in CF-mMIMO. Currently, federated learning (FL) frameworks are introduced in [72, 73], and FL optimization techniques have been presented in [74,75,76]. A detailed account recapitulating the impact of hardware impairments (HI) on the performance of CF-mMIMO is characterized [48]. By employing a hardware scaling law, the impact of HI on APs is shown to vanish asymptotically. The authors in [77] analyzed the UL and DL CF-mMIMO performance under the classical HI model to gain further insights.

Most of the literature provides valuable information on massive MIMO deployment in 5G wireless networks. Though some of these papers present several aspects of CF-mMIMO, there is no detailed study on CF-mMIMO systems that captures entirely cellular massive MIMO, network MIMO, and CF-mMIMO system modeling, architecture, strengths and limitation, and applications in terms of SWIPT, channel hardening, hardware efficiency, power control, NOMA, SE, and EE. To this end, the need for a comprehensive paper covering the aspects mentioned above of CF-mMIMO is vitally important. Therefore, the current paper presents an extensive survey on CF-mMIMO as a candidate enabler for 5G and B5G wireless networks. Specifically, the limitations of some selected literature are outlined, and the contributions of the current paper are highlighted, as presented in Table 1.

Table 1 Limitations of some related works

Overview of massive MIMO systems

The concept of massive MIMO has received considerable attention due to its deployment to meet the demands of wireless capacity and higher data rates [89]. Massive MIMO offers improved spectral and energy efficiency and adopts optimal signal processing schemes [90]. Moreover, massive MIMO can spatially multiplex many user equipment (UE) by using many phase-coherent transmitting/receiving antennas, thus suppressing inter-cell and intra-cell interference [91, 92]. These unique features have revitalized studies leading to the discoveries of several massive MIMO solutions [93]. Massive MIMO technology, empowered with several antennas at each cell site, offers tremendous improvements in the radiated EE, power efficiency, and SE compared to the traditional MIMO systems [94, 95]. Moreover, by operating in either a centralized or distributed fashion, favorable propagation facilitates the realization of near-optimal linear processing [96]. Motivated by the benefits mentioned above, various massive MIMO-based solutions significantly gained traction in academia and industry [97]. A brief discussion on cellular mMIMO, network MIMO, and CF-mMIMO focusing on their architectures, strengths, and applications are presented in the following subsections.

Cellular massive MIMO

Massive MIMO time-division duplex (mMIMO-TDD) systems have been reported to boost the throughput of wireless networks [83, 86]. Since the multiple antennas used in mMIMO-TDD are much smarter, it presents a practical means to outperform partial multiuser MIMO (MU-MIMO) systems. BS antennas could be substantially larger than the number of transmitter terminals. Recently, the attractive features of cellular networks, including exploiting channel reciprocity, especially as more antennas do not necessarily lead to a corresponding increase in the feedback overhead, have been investigated. The traditional cell-size shrinking technique is eliminated via the installation of extra antennas to existing cell sites. Furthermore, UL and DL transmit powers are considerably reduced due to increased antenna aperture and coherent combining. Nonetheless, cellular networks face significant challenges, including estimating the criticality of coherent channels, bandwidth, and interference limitations. Additionally, the substantial cost associated with a large number of transmitting/receive chains and power amplifiers is a major setback. An illustrative description of a typical cellular massive MIMO system is given in Fig. 2. The mobile station (MS) is connected to a central base station (BS) in each cell.

Fig. 2
figure 2

Illustration of a multicell massive MIMO system

Network MIMO

Network MIMO system, which allows for coordination of a set of APs that jointly serve all users in the network, has often been hailed as an exciting alternative to achieving the capacity limit of cellular networks [98, 99]. Network MIMO or multicell MIMO signaling is considered a potential physical layer technique for 5G wireless networks. A plethora of interfering transmitters share user messages in a network MIMO system and enable joint precoding to be performed. Additionally, network MIMO could be referred to as cooperative communications used to improve the interference-limited performance of cellular networks. Specifically, by jointly designing the DL beams to multiplex multiple users spatially, intra-cluster interference can be eliminated. This concept has recently been introduced under a new network structure named CF massive MIMO [3]. It is considered a key enabling technology for 5G and beyond 5G wireless systems. Besides, the cell-edge problem inherent in cellular massive MIMO is eliminated, and all antennas jointly serve all users (UEs). Figure 3 presents the architecture of a typical network MIMO system.

Fig. 3
figure 3

Illustration of network MIMO system

Cell-free massive MIMO

The cellular topology has been the traditional way of covering the subscribers in a given geographical area with wireless network service for many decades. Each BS serves a given set of UEs using highly directional beamforming techniques [94]. This network topology has shown desirable performance gains, spectral efficiency, and energy efficiency [100,101,102]. However, the technology inevitably limits further performance improvements due to inter-cell interference, high QoS variations, and hand-offs [78, 103]. In order to address this problem, a viable option is to eliminate the inherent cell characteristics and take a considerable number of distributed APs densely deployed over a given coverage area to serve a smaller number of UEs optimally [53]. This novel communication architecture is described as cell-free massive MIMO, and it has been identified as a candidate enabling technology for future wireless communication systems [83, 104].

Currently, key disruptive technologies have been deployed to cater to throughput, coverage, EE, and ubiquity requirements of next-generation wireless networks. In particular, having multiple antennas at the APs for several users has been observed as a promising technique to boost the multiplexing gain and enhance the SE in CF massive MIMO [55, 80]. Recently, power domain-centric NOMA integrated with CF-mMIMO emerged as a viable solution to address the conflicting demands on high SE, EE, high reliability with user-fairness, increased connectivity, and reduced latency in 5G wireless networks [105,106,107]. In CF-mMIMO, the number of simultaneously served users can be increased by supporting the users to utilize the same time-frequency resource effectively and invoking superposition-coded transmission and successive interference cancelation (SIC) decoding [108,109,110].

Cell-free massive MIMO leverages a distributed antenna system’s unique features, coordinates beamforming, joint transmission, and scheduling to provide multiuser interference suppression and achieve stronger diversity gains. Moreover, due to the well-designed network topology, CF-mMIMO allows for favorable propagation and high quality of service throughout the coverage area [25, 52, 103, 111]. Compared to conventional cellular networks, some of the fascinating features of CF networks include uniform signal-to-noise ratio (SNR) with smaller variations, improved interference management, increased SNR due to coherent transmission [12], high EE, high SE, low latency, low complex linear processing, minimal power consumption, flexible and cost-efficient deployment, and high reliability, among others [9, 103]. Figure 4 presents a pictorial representation of a CF-mMIMO network, and Table 2 presents a performance comparison among cellular massive MIMO, network MIMO, and CF-mMIMO. According to its performance level, several critical performance metrics are selected, and each metric has been weighted (in percentage), as discussed in Section 5 of the current paper. Last, a pictorial comparison of cellular mMIMO and CF-mMIMO is shown in Fig. 5. For the cellular condition, hundreds and even thousands of BS antennas are selected to serve UEs within disjoint cells, thus achieving considerable throughput and coverage improvement. However, for the CF scenario, a plethora of geographically distributed single APs are chosen to serve a smaller number of UEs. The coverage area is not divided into disjoint cells leading to a CF network where signals from surrounding APs only influence each UE. The distributed AP antennas are connected via a fronthaul network to one or multiple central processing units (CPUs), facilitating effective coordination.

Fig. 4
figure 4

Cell-free massive MIMO configuration

Table 2 Comparison among cellular mMIMO, network MIMO, and CF-mMIMO (metric weighting in percentage)
Fig. 5
figure 5

Comparison of cellular network and cell-free network. Left: cellular network; right: cell-free massive MIMO network

System model of cell-free massive MIMO

The CF-mMIMO network arbitrarily distributed over a wide coverage area operating on a one-time frequency resource is discussed in this framework. Let there be K UEs and M randomly located APs, each equipped with Nap antennas, where Nap≥1. It is often assumed that M ≫ K. Besides, all APs are connected through an unlimited backhaul network to edge-cloud processors, called the CPU. Data-decoding is performed, ensuring that the UEs’ coherent joint transmission and reception in the coverable area are enabled. Remarkably, the pathloss between a user and any AP antenna is unique. The pathloss matrix possesses distinct diagonal elements, and as a result, performance analysis is generally challenging and considerably different from related prior works. TDD protocol with channel reciprocity and a single data stream transmitted per UE is assumed. The communication protocol is usually divided into several phases. These include UL training, UL payload data transmission, DL training, and DL payload data transmission. For the overview of the system model of CF-mMIMO networks captured in this survey, a concise list of relevant mathematical notations used and their meanings are presented in Table 3.

Table 3 Mathematical notations and definitions

First, by taking into consideration a DL CF massive MIMO system, let a set of BSs ß ≜ {1, …, B}, each equipped with M antennas serves a set of UEs К ≜ {1, …, K},each loaded with N antennas. Moreover, let Hb, k ∈ M × Ndenote the UL channel matrix between BS b ∈ ß and UE k ∈ К, while \( {H}_k\triangleq {\left[{H}_{1,k}^T,\dots, {H}_{B,k}^T\right]}^T\in {\mathbb{C}}^{BM\times N} \) represents the global UL channel matrix seen by UE k. More so, let Yb, k ∈ M × 1denote the BS-Specific precoding vector utilized by BS b for UE k, while \( {Y}_k\triangleq {\left[{Y}_{1,k}^T,\dots, {Y}_{B,k}^T\right]}^T\in {\mathbb{C}}^{BM\times 1} \) represents the global precoding vector utilized UE k.

Thus, the received signal at UE k reads as (1)

$$ {p}_k\triangleq {\sum}_{b\in \mathrm{\ss}}{\sum}_{\overline{k}\in K}{H}_{b,k}^H{y}_{b,\overline{k}}{g}_{\overline{k}}+{z}_k\in {\mathbb{C}}^{N\times 1} $$
(1)

where gk~CN(0, 1) denotes the transmit data symbol for UE k, and \( {z}_k\sim CN\ \left(0,{\sigma}_k^2{I}_N\right) \) represents the average AWGN at UE k. When pkis collected, UE k employs the combining vector qk ∈ N × 1. The resulting signal-to-interference-plus-noise ratio (SINR) [118] is given by (2)

$$ {SINR}_K\triangleq \frac{\left|{\sum}_{b\in \mathrm{\ss}}{q}_k^H{H}_{b,k}^H{y}_{b,k}\right|2}{\sum_{\overline{k}\in K\left\{k\right\}}{\left|{\sum}_{b\in \mathrm{\ss}}{q}_k^H{H}_{b,k}^H{y}_{b,\overline{k}}\right|}^2+{\left\Vert {q}_k\right\Vert}^2{\sigma}_k^2} $$
(2)

In addition, the sum rate is expressed as R ≜ ∑k ∈ Klog2(1 + SINRk). Next, the realistic pilot-aided channel state information (CSI) acquired at the BSs, and the UEs are considered.

Uplink pilot-aided channel estimation

Let the effective UL channel vector between UE k and BS b be denoted by db, k ≜ Hb, kqk ∈ M × 1. Likewise, let the pilot assigned to UE k be denoted by lk ∈ ρ × 1, where || lk||² = ρ. In this phase, each UE k jointly transmits its pilot precoded with its combining vector and is expressed as (3)

$$ {W}_k^{UL_{\bar{\mkern6mu}}1}\triangleq {q}_k{l}_k^H\in {\mathbb{C}}^{N\times \rho } $$
(3)

Thus, for each BS b, \( {X}_b^{UL-1} \)is given by (4) and (5)

$$ {X}_b^{UL-1}\triangleq {\sum}_{k\in K}{H}_{b,k}{W}_k^{UL-1}+{Z}_b^{UL-1} $$
(4)
$$ ={\sum}_{k\in K}{h}_{b,k}{l}_k^H+{Z}_b^{UL-1}\in {\mathbb{C}}^{M\times \rho } $$
(5)

where \( {Z}_b^{UL_{\bar{\mkern6mu}}1}\in {\mathbb{C}}^{M\times \uprho} \) denotes the AWGN at BS b having elements distributed as Ϲ (\( 0,{\sigma}_b^2\Big) \). Likewise, the LS estimate of db, k is given by (6) and (7)

$$ {\hat{d}}_{b,k}\triangleq \frac{1}{\rho }{X}_b^{UL-1}{l}_k $$
(6)
$$ ={d}_{b,k}+\frac{1}{\rho }{\sum}_{\overline{k}\in K\backslash \left\{k\right\}}{d}_{b,\overline{k}}{l}_{\overline{k}}^H{l}_k+\frac{1}{\rho }{Z}_b^{UL-1}{l}_k $$
(7)

Downlink pilot-aided channel estimation

Let the effective DL channel vector between all the BSs and UE k be denoted by \( {h}_k\triangleq {\sum}_{b\in \mathrm{\ss}}{H}_{b,k}^H{y}_{b,k}\in {\mathbb{C}}^{N\times 1} \). In this phase, each BS jointly transmits a superposition of pilots after they have been precoded with the corresponding precoding vector and is expressed as (8)

$$ {W}_k^{DL}\triangleq {\sum}_{k\in K}{y}_{b,k}{P}_{\overline{k}}^H\in {\mathbb{C}}^{M\times \uprho} $$
(8)

Thus, for each UE k, \( {X}_k^{DL} \) is given by (9)

$$ {X}_k^{DL}\triangleq {\sum}_{b\in \mathrm{\ss}}{\sum}_{\overline{k}\in K}{H}_{b,k}^H{y}_{b,\overline{k}}{P}_{\overline{k}}^H+{Z}_k^{DL}\in {\mathbb{C}}^{N\times \uprho} $$
(9)

where

\( {Z}_k^{DL}\in {\mathbb{C}}^{N\times \uprho} \)denotes the AWGN at UE k with elements distributed as Ϲ (\( 0,{\sigma}_k^2\Big) \). Likewise, the LS estimate of hkis given by (10) and (11)

$$ {h}_k\triangleq \frac{1}{\rho }{X}_k^{DL}{P}_k $$
(10)
$$ ={h}_k+\frac{1}{\rho }{\sum}_{b\in \mathrm{\ss}}{\sum}_{\overline{k}\in K\left\{k\right\}}{H}_{b,k}^H{y}_{b,\overline{k}}{P}_{\overline{k}}^H{P}_k+\frac{1}{\rho }{Z}_k^{DL}{P}_k $$
(11)

There exist several pilot-based channel estimators, namely LS estimator, minimum mean-squared error (MMSE), element-wise minimum mean-squared error (EW-MMSE), phase-aware minimum mean-squared error (PA-MMSE), and linear MMSE. Table 4 presents a summary of pilot training-based channel estimators [119]. Next, the UL training phase, DL training phase, and outage probability of the communication protocol are considered.

Table 4 Summary of pilot training-based channel estimation schemes

Uplink training

In the uplink training phase, the UEs send UL training pilot sequences to allow channel estimation at the APs. The uplink training phase also applies to the user-centric (UC) massive MIMO architecture. Let τc represent the length of the channel coherence time in discrete-time samples and τs represent the length of the UL training phase, also in discrete-time samples. At all times, τs < τc. The matrix has on its rows the pilot sequences transmitted by the kth UE is denoted by \( {\varnothing}_k\in {C}^{s\times {\tau}_s} \). While the rows of ∅k are assumed to be orthogonal, i.e., \( {\varnothing}_k{\varnothing}_k^D={I}_S \), assuming there is no orthogonality for the pilot sequences directed to other UEs. Of course, by employing orthogonal pilot tout court, a robust system to the effects of pilot contamination is achievable. However, the peak value of the product KS that can be taken in the channel coherence time would be primarily limited. Thus, we define the NAP × τs dimensional matrix Xm by (12)

$$ {X}_m=\sum \limits_{k=1}^K\sqrt{sk}{F}_{k,m}{\varnothing}_k+{Y}_{m,} $$
(12)

where Fk, m = Dk, mLk, Ym denotes the matrix of thermal noise samples. Next, we explore the structure of the LMMSE channel estimator briefly. Defining parameters, xm = vec(Xm), ym = vec(Ym), fk, m = vec(Fk, m), the vectorized model is given as (13)

$$ {x}_m=\sum \limits_{k=1}^K\sqrt{sk}{R}_k{f}_{k,m}+{y}_m $$
(13)

where \( {R}_k={\varnothing}_k^T\otimes I. \)We process xm by a matrix \( {Q}_{k,m}^D \), i.e., \( {\hat{f}}_{k,m}={Q}_{k,m}^D{x}_m \). Thus, the MSE is obtained as (14)

$$ \mathbbm{E}\left[\left\Vert {Q}_{k,m}^D{x}_m-{f}_{k,m}\right\Vert {}^2\right]= tr\left({Q}_{k,m}^D\mathbbm{E}\left[{x}_m{x}_m^D\right]{Q}_{k,m}\right)+\mathbbm{E}\left[\left\Vert {f}_{k,m}\right\Vert {}^2\right]-\mathbbm{E}\left[2\mathfrak{R}\left\{ tr\left({f}_{k,m}^D{Q}_{k,m}^D{x}_m\right)\right\}\right] $$
(14)
$$ = tr\left({Q}_{k,m}^D\left(\sum \limits_{l=1}^K sk{R}_l{R}_l^D+{\delta}^2I\right){Q}_{k,m}\right)+\mathbbm{E}\left[\left\Vert {f}_{k,m}\right\Vert {}^2\right]-\sqrt{sk}\ tr\left({Q}_{k,m}^D{R}_k+{Q}_{k,m}^T{R}_k^{\ast}\right) $$

In this case, the gradient of the MSE concerning the complex matrix \( {Q}_{k,m}^{\ast } \) is assumed to be equal to zero. Solving for Qk, m,the LMMSE estimator is given by (15)

$$ {Q}_{k,m}^{LMMSE}=\sqrt{sk}\left(\sum \limits_{l=1}^K sl{R}_l{R}_l^D+{\delta}^2I\right){\bar{\mkern6mu}}^1\ {R}_k $$
(15)

Downlink training

In this section, the scalable beamforming training scheme is adopted. Let τv, l denote the length of the DL training duration per coherence interval; it suffices that τv, l < τ − τu, l. By exploiting the channel estimates \( \left\{\hat{h}m{k}^{\acute{\mkern6mu}}\right\} \), and beamforming it to all users, the mth AP precodes the pilot sequences \( {\varphi}_{k^{\acute{\mkern6mu}}}\in {\mathbb{C}}^{\tau_{v,l}\times 1},{k}^{\acute{\mkern6mu}}=1,\dots, K. \) Thus, the τv, l × 1 pilot vector wm, l sent from the mth AP is obtained as (16)

$$ {w}_{m,l}=\sqrt{\tau_{v,l}{\sigma}_{v,l}}\sum \limits_{k^{\acute{\mkern6mu}}=1}^K\sqrt{\mathrm{\mbox{\fontencoding{U}\fontfamily{phon}\selectfont\char110}}m{k}^{\acute{\mkern6mu}}}{\hat{h}}^{\ast}\mathrm{m}{k}^{\acute{\mkern6mu}}{\varphi}_{k^{\acute{\mkern6mu}}}, $$
(16)

where σv, l is the normalized transmit SNR per DL pilot symbol and {φk} assume mutual orthonormality i.e. \( {\varphi}_k^D{\varphi}_{k^{\acute{\mkern6mu}}}=0, \)for k´ ≠ k, and ‖φk2 = 1. This requires that τv, l ≥ K. The kth user receives correspondingly, τv, l × 1 pilot vector which is given by (17)

$$ {x}_{vl,k}=\sqrt{\tau_{v,l}{\sigma}_{v,l}}\sum \limits_{k^{\acute{\mkern6mu}}=1}^K{r}_{k{k}^{\varphi_{k^{\acute{\mkern6mu}}}}}+{\boldsymbol{y}}_{vl,k}, $$
(17)

where yvl, k denotes a vector additive noise at the kth user. The effective channel gain rkk, k = 1, …, K is estimated via the processing of the received pilot first by the kth user as (18)

$$ {\overset{\check{} }{x}}_{vl,k}={\varphi}_k^D{\boldsymbol{x}}_{vl,k}=\sqrt{\tau_{v,l}{\sigma}_{v,l}}{r}_{kk}+{\varphi}_k^D{\boldsymbol{y}}_{vl,k}=\sqrt{\tau_{v,l}{\sigma}_{v,l}}{r}_{kk}+{\eta}_{l,k}, $$
(18)

where \( {\eta}_{l,k}\triangleq {\mathit{\mathsf{\varphi}}}_{\mathit{\mathsf{k}}}^{\mathit{\mathsf{D}}}{\boldsymbol{\mathsf{y}}}_{\mathit{\mathsf{vl}},\mathit{\mathsf{k}}}\sim CN\left(0,1\right) \). Given \( {\overset{\check{} }{x}}_{vl,k}, \) linear MMSE estimation of rkkis performed, which is obtained as (19)

$$ {\hat{r}}_{kk}=\mathbbm{E}\left\{{r}_{kk}\right\}+\frac{\sqrt{\tau_{v,l}{\sigma}_{v,l}}\ Var\ \left\{{r}_{kk}\right\}}{\tau_{vl}{\sigma}_{v,l}\ Var\ \left\{{r}_{kk}\right\}+1}\left({\overset{\check{} }{x}}_{vl,k}-\sqrt{\tau_{v,l}{\sigma}_{v,l}}\ \mathbbm{E}\left\{{r}_{kk}\right\}\right) $$
(19)

Outage probability

Obtaining the exact expression for outage probability is quite challenging due to the computational complexity in determining the cumulative distribution function (CDF) of the SINR at the APs [120]. The only exception for massive MIMO systems applies to perfect CSI and identically distributed channels unsatisfied in practice [121, 122]. Therefore, an alternate approach, approximate outage probability, is selected to provide the outage probability of massive MIMO networks where all the APs are collocated. Hence, βmk = βk, γmk = γk, ∀m, k. The outage probability approximation of the kth user is obtained as (20) and (21)

$$ {P}_{out}^K(T)\approx 1-M\sum \limits_{i=1}^{K-1}\frac{\gamma_i^{K-2}}{\prod \limits_{\begin{array}{c}j=1\\ {}j\ne i\end{array}}^{K-1}\left({\gamma}_i-{\gamma}_j\right)}\left[1-\frac{e^{-{c}_2^i}}{c_1^i+1}\right]+\left(M-1\right)\sum \limits_{i=1}^{K-1}\frac{\gamma_i^{K-2}}{\prod \limits_{\begin{array}{c}j=1\\ {}j\ne i\end{array}}^{K-1}\left({\gamma}_i-{\gamma}_j\right)}\left[1-{e}^{-\left({c}_1^i+{c}_2^i\right)}\right] $$
(20)

for \( T\le \frac{\rho_u\left(M-1\right){\gamma}_K}{\left(1+{\rho}_u\left({\beta}_K-{\gamma}_K\right)\right)} \) and

$$ {P}_{out}^K(T)\approx 1-M\sum \limits_{i=1}^{K-1}\frac{\gamma_i^{K-2}}{\prod \limits_{\begin{array}{c}j=1\\ {}j\ne i\end{array}}^{K-1}\left({\gamma}_i-{\gamma}_j\right)}\left[{e}^{-k}-\frac{e^{-{c}_2^i}{e}^{-k\left({c}_1^i+1\right)}}{c_1^i+1}\right]+\left(M-1\right)\sum \limits_{i=1}^{K-1}\frac{\gamma_i^{K-2}}{\prod \limits_{\begin{array}{c}j=1\\ {}j\ne i\end{array}}^{K-1}\left({\gamma}_i-{\gamma}_j\right)}\left[1-{e}^{-\left({c}_1^i+{c}_2^i\right)}\right]U\left({c}_3+{c}_4+{c}_5\right) $$
(21)

for \( T>\frac{\rho_u\left(M-1\right){\gamma}_K}{\left(1+{\rho}_u\left({\beta}_K-{\gamma}_K\right)\right)}, \)

where \( {c}_1^i,{c}_2^i,{c}_3,{c}_4 \) and c5 are given by (22), (23), (24), (25), and (26).

$$ {c}_1^i=\frac{\gamma_K}{T{\gamma}_i} $$
(22)
$$ {c}_2^i=\frac{\left(M-1\right){\gamma}_K}{T{\gamma}_i}-\frac{1+{\rho}_u\left({\beta}_K-{\gamma}_K\right)}{\rho_u{\gamma}_i} $$
(23)
$$ {c}_3=\frac{1}{T}{\gamma}_K^2 $$
(24)
$$ {c}_4=\frac{2}{T}\left(M-1\right){\gamma}_K^2-\frac{1}{\rho_u}{\gamma}_K\left(1+{\rho}_u\left({\beta}_K-{\gamma}_K\right)\right)=\frac{1}{T}\left(M-1\right){\gamma}_K^2+{\gamma}_K\left(\frac{\left(M-1\right){\gamma}_K}{T}-\frac{\left(1+{\rho}_u\left({\beta}_K-{\gamma}_K\right)\right)}{\rho_u}\right) $$
(25)
$$ {c}_5=\left(M-1\right){\gamma}_K\left(\frac{\left(M-1\right){\gamma}_K}{T}-\frac{\left(1+{\rho}_u\left({\beta}_K-{\gamma}_K\right)\right)}{\rho_u}\right). $$
(26)

Areas of application of cell-free massive MIMO

The wireless research community has explored the outstanding features of the mutually beneficial combination between CF structure and massive MIMO technology to enable seamless transfer from theory to practical implementation. Of course, significant progress in signal processing, communication, and optimization algorithms developed has further deepened the range of applications of this technology. Table 5 presents a summary of CF massive MIMO application areas alongside their strengths and limitations. A concise account of the past findings and current research trends based on SWIPT, channel hardening, hardware efficiency, power control, NOMA, SE, and EE for the CF-MIMO are detailed in the following subsections.

Table 5 Areas of application of CF massive MIMO

SWIPT in cell-free massive MIMO

The ultra-high transmission rate of wireless networks has been identified as a significant challenge that decreases the lifetime of battery-powered devices. SWIPT is considered an innovative candidate for the energy-limited environment through energy reclamation. It offers an effective solution to enable a guaranteed energy level and minimize backhaul resources and energy consumption. Compared to traditional massive MIMO, CF massive MIMO presents a new paradigm to boost the performance of SWIPT. A summary of recent advances on SWIPT technology in CF massive MIMO is presented in Table 6.

Table 6 SWIPT technology in cell-free massive MIMO

Additionally, the energy harvest and DL achievable rate for an energy user under a linear scheme are presented. First, for the harvested energy, it is assumed that the harvesting circuitry operates with an efficiency represented as η. With reference to the \( \overset{\sim }{j} \)th user which depicts a typical energy user, the ambient harvested energy \( {E}_{\overset{\sim }{j}} \) during a time slot is obtained as (27)

$$ {E}_{\overset{\sim }{j}}=\eta {T}_E\sum \limits_{l=1}^N\frac{r_{\overset{\sim }{j}l}^{-2{\alpha}_{\overset{\sim }{j}l}}\left(\left|{d}_{\overset{\sim }{j}l}\right|{}^2\right){}^2{P}_{\overset{\sim }{j}l}{P}_P}{\mathbbm{E}\left[\left|{\boldsymbol{a}}_j^D xl\right|{}^2\right]}, $$
(27)

In this case, the noise factor is neglected to owe to its low comparative strength instead of other terms. As a step further, the DL achievable rate for a typical energy user is derived. The average achievable rate \( {R}_{\overset{\sim }{j}} \) can be expressed as (28)

$$ {R}_{\overset{\sim }{j}}=\left(1-\frac{2{T}_E}{T-{T}_p}\right)\mathbbm{E}\left[\log \left(1+{\gamma}_{\overset{\sim }{j}}\right)\right], $$
(28)

where \( {\gamma}_{\overset{\sim }{j}} \) denotes the SNR of the \( \overset{\sim }{j} \)-th user and is expressed as (29)

$$ {\gamma}_{\overset{\sim }{j}}=\frac{\sum \limits_{l=1}^N{r}_{\overset{\sim }{j}l}^{-2{\alpha}_{\overset{\sim }{j}l}}\left(\left|{d}_{\overset{\sim }{j}l}\right|{}^2\right){}^2{P}_{\overset{\sim }{j}l}{P}_P}{\sigma {}^2\mathbbm{E}\left[\left|{\boldsymbol{a}}_j^D xl\right|{}^2\right]}. $$
(29)

Now, \( {R}_{\overset{\sim }{j}} \) can be modeled as (30)

$$ {R}_{\overset{\sim }{j}}=\left(1-\frac{2{T}_E}{T-{T}_p}\right)\ \mathbbm{E}\left[\log \left(1+\frac{E_{\overset{\sim }{j}}}{\sigma {}^2\eta {T}_E}\right)\right] $$
(30)

where \( {E}_{\overset{\sim }{j}} \) is given by (27). Thus, \( {R}_{\overset{\sim }{j}} \) is obtained as (31) after some modifications.

$$ {R}_{\overset{\sim }{j}}=\left(1-\frac{2{T}_E}{T-{T}_p}\right){\int}_{t=0}^{\infty}\mathit{\Pr}\left[{E}_{\overset{\sim }{j}}>\sigma {}^2\eta {T}_E\left({e}^t-1\right)\right] dt. $$
(31)

Further, the distribution of \( {E}_{\overset{\sim }{j}} \) is approximated with the Gamma distribution employing moment matching. The scale parameters which define the Gamma distributions are obtained as \( {k}_h={\left(\mathbbm{E}\left[{E}_{\overset{\sim }{j}}\right]\right)}^2/\mathbbm{VAR}\left[{E}_{\overset{\sim }{j}}\right] \) and \( {\theta}_h=\mathbbm{VAR}\left[{E}_{\overset{\sim }{j}}\right]/\mathbbm{E}\left[{E}_{\overset{\sim }{j}}\right] \). Consequently, (31) can be re-modeled as (32)

$$ {R}_{\overset{\sim }{j}}=\left(1-\frac{2{T}_E}{T-{T}_p}\right){\int}_{t=0}^{\infty}\left(1-{F}_{E_{\overset{\sim }{j}}}\left({\sigma}^2\eta {T}_E\left({e}^t-1\right)\right)\right) dt=\left(1-\frac{2{T}_E}{T-{T}_p}\right){\int}_{t=0}^{\infty}\frac{1}{\Gamma \left({k}_h\right)}\Gamma \left({k}_h,\frac{\sigma^2\eta {T}_E\left({e}^t-1\right)}{\theta_h}\right)\ dt, $$
(32)

which can be simplified numerically.

Channel hardening and favorable propagation in cell-free massive MIMO

Two essential virtues that appear in the regime of hundreds or even thousands of antennas: channel hardening and favorable propagation, are defined and analyzed. By increasing BS antennas, the propagation is assumed to happen through a quasi-deterministic flat-fading equivalent channel. In light of the above, the fading channel behaves as though it was not a fading channel (almost deterministically) [50]. This phenomenon is referred to as channel hardening. Also, when the channel directions of two UEs become spatially orthogonal, the inter-user and intra-cell interference vanishes automatically using a relatively simple signal processing technique called maximum-ratio processing (MRP). This desirable property is termed favorable propagation. The channel hardening effect has been exploited by massive MIMO to decrease the problem of small-scale fading in wireless communication systems to guarantee desirable reliability and low latency. However, channel hardening conditions have also been observed and remain valid for CF massive MIMO systems, with reduced pathloss exponent and increased antenna density. Table 7 presents a summary of recent trends and advances on channel hardening in CF-mMIMO systems.

Table 7 Channel hardening in cell-free massive MIMO

The mathematical representation of channel hardening and favorable propagation in CF massive MIMO is presented. Let djk reflects the propagation channel response between the UE k and AP m. The channel hardening effect is defined as (33)

$$ \frac{\left\Vert {d}_{jk}\right\Vert {}^2}{\mathbbm{E}\left\{\left\Vert {d}_{jk}\right\Vert {}^2\right\}}\to 1\ \mathrm{as}\ \mathrm{the}\ \mathrm{number}\ \mathrm{of}\ \mathrm{antennas}\ {N}_{ap}\to \infty $$
(33)

As a result, the impact of the small-scale fading variations on the communication performance is minimized while the challenge of large-scale fading remains. The degree of channel hardening in CF-mMIMO is dependent mainly on the number of antennas per AP and the geographical AP distribution [12]. The favorable propagation is expressed as (34)

$$ \frac{d_{jk}^D{d}_{j\acute{\mkern6mu}k\acute{\mkern6mu}}}{\sqrt{\mathbbm{E}\left\{\left\Vert {d}_{jk}\right\Vert {}^2\right\}\mathbbm{E}\left\{\left\Vert {d}_{j\acute{\mkern6mu}k\acute{\mkern6mu}}\right\Vert {}^2\right\}}}\to 0\ \mathrm{when}\ {N}_{ap}\to \infty $$
(34)

Hardware impairments and pilot transmission in cell-free massive MIMO systems

Using practical systems with perfect hardware components during production could result in enormous energy consumption and prohibitive hardware costs, which are not satisfied in practice. In short, the closer to ideal a hardware transceiver is, the more costly, bulkier, and energy-hungry it becomes. This bottleneck, a trade-off between the quality of hardware components and cost, is usually considered a breakthrough. Nonetheless, this technique inevitably introduces hardware and channel impairments, including amplifier non-linearities, phase noise, I/Q imbalance, and ADC distortions into the system. Analyzing CF massive MIMO under the effects of HI has gained significant research interest. Table 8 presents a summary of progress made in this regard. Additionally, mathematical models of the hardware impairment and pilot transmission are presented.

Table 8 Cell-free massive MIMO systems with hardware impairments

Hardware impairment model

In order to model the combined effect of hardware distortions, the transmitted/received signal is assumed to be distorted by an additive Gaussian noise [92]. The distorted signal is expressed as (35)

$$ {w}_i=\sqrt{\xi }w+{z}_i, $$
(35)

where w refers to the input signal to the non-ideal hardware. Moreover, ξi ∈ [0, 1], i = {t, r} reflects the hardware quality coefficient. The distortion noise is obtained as (36) and is independent of the input signal w.

$$ {z}_i\sim CN\left(0,\left(1-{\xi}_i\right)\mathbbm{E}\left\{{\left|w\right|}^2\right\}\right) $$
(36)

The term \( \mathbbm{E}\left\{\left|w\right|{}^2\right\}=\mathbbm{E}\left\{\left|{w}_i\right|{}^2\right\} \) indicates the equivalence of the variance of both the input and output signals for the non-ideal hardware device. Besides, the hardware quality is obtained through ξi ∈ [0, 1], where ξi = 1 and ξi = 0 denotes perfect and useless hardware, respectively.

Pilot transmission

In this context, it is assumed that τ-length orthogonal pilots represented as ψk ∈ τ × 1 are assigned to UEs, where τ = K ≤ T, and \( {\psi}_k^D{\psi}_{k^{\acute{\mkern6mu}}}=\delta \left(k-{k}^{\acute{\mkern6mu}}\right) \)for k, k´ ∈ {1, 2, …, K}. As a result, the pilot signal \( \sqrt{\tau {P}_p}{\psi}_k \) is transmitted by UEk, and the signal modeled as (37) is received by the mth AP

$$ {\boldsymbol{x}}_{p,m}=\sqrt{\xi_r}\sum \limits_{k=1}^K{h}_{mk}\left(\sqrt{\tau {P}_p{\xi}_t}{\psi}_k+{z}_{t,k}\right)+{z}_{r,m}+{\boldsymbol{n}}_m $$
(37)

where Pp refers to the pilot power, zt, k expressed as (38) accounts for the distortion caused by a non-ideal hardware device, zr, m expressed as (39) denotes the distortion caused by hardware impairment at the mth AP and nm represented as (40) indicates the additive noise [92, 137]

$$ {z}_{t,k}\sim CN\left(0,{P}_p\left(1-{\xi}_t\right){\boldsymbol{I}}_{\tau}\right), $$
(38)
$$ {z}_{r,m}\left|\left\{{h}_{mk}\right\}\right.\sim CN\left(0,{P}_p\left(1-{\xi}_r\right){\sum}_{k=1}^K{\left|{h}_{mk}\right|}^2{\boldsymbol{I}}_{\tau}\right), $$
(39)
$$ {\boldsymbol{n}}_m\sim CN\left(0,N{\boldsymbol{I}}_{\tau}\right). $$
(40)

Power control in cell-free massive MIMO

Power control has appeared as a key feature and one of the most impacting algorithms in mobile networks. It entails an intelligent selection of transmitter power output to improve the overall performance of wireless systems. The power control techniques find handy applications in CF massive MIMO to limit the generated interference, minimize pilot contamination, maximize the power of the desired received signal, and provide a more uniform QoS to the UEs. Table 9 presents a survey of various areas of application of power control in CF-mMIMO systems. Furthermore, Table 10 shows a more concise account of power control/algorithms (alternative optimization, successive convex approximation (SCA), GP, bisection, SOCP, fractional, and ML-based techniques) used in solving specific utility optimization problems [119].

Table 9 Power control in cell-free massive MIMO
Table 10 Power control optimization techniques

Additionally, an insight into different power control policies is highlighted. In particular, the equal transmits power policy, equal receive power policy, and inverse leakage policy are considered. In all policies, it is assumed that \( {G}_{ik}={f}_k^D{V}_i{f}_k \).

  1. (1)

    Equal Transmit Power Policy: In this policy, the same power is transmitted to each UE i, taking into consideration the power constraints and can be expressed as (41)

$$ {p}_i\sum \limits_{j=1}^M\left|{f}_{ji}\right|{}^2=C $$
(41)

where C denotes a constant chosen to satisfy the power constraints. The maximum value of C is obtained as (42)

$$ C={\left[\underset{1\le m\le M}{\max}\sum \limits_{k=1}^K\frac{\left|{f}_{mk}\right|{}^2}{\sum \limits_{j=1}^M\left|{f}_{jk}\right|{}^2}\right]}^{-1} $$
(42)

Thus, the total power pi transmitted to any UE i, for i = 1, …, K, can be written as (43)

$$ {p}_i=\frac{1}{\sum \limits_{j=1}^M\left|{f}_{ji}\right|{}^2}\ {\left[\underset{1\le m\le M}{\max}\sum \limits_{k=1}^K\frac{\left|{f}_{mk}\right|{}^2}{\sum \limits_{j=1}^M\left|{f}_{jk}\right|{}^2}\right]}^{-1} $$
(43)
  1. (2)

    Equal Receive Power Policy: This policy ensures that the average received signal power that is conditioned on \( \hat{D} \) remains the same for all UE by selecting power scaling factors. Therefore, piGkk = C,for i = 1, …, K. The constant C is also selected to satisfy the power constraints. After some mathematical analysis, pi can be obtained as (44)

$$ {p}_i=\frac{1}{G_{ii}}\ {\left[\underset{1\le m\le M}{\max}\sum \limits_{k=1}^K\frac{\left|{f}_{mk}\right|{}^2}{G_{kk}}\right]}^{-1} $$
(44)
  1. (3)

    Inverse Leakage Policy: This policy allocates lower powers to UEs to minimize the interference in the system, resulting in higher leakage to other UEs. In this policy, it is assumed that pi is inversely proportional to \( \sum \limits_{k=1,k\ne i}^K{G}_{ki} \). Solving for pi yields (45)

$$ {p}_i=\frac{1}{\sum \limits_{k=1,k\ne i}^K{G}_{ki}}\ {\left[\underset{1\le m\le M}{\max}\sum \limits_{k=1}^K\frac{\left|{f}_{mk}\right|{}^2}{\sum \limits_{j=1,j\ne k}^K{G}_{jk}}\right]}^{-1} $$
(45)

NOMA-aided cell-free massive MIMO

Currently, there is a growing interest in implementing NOMA in 5G and beyond 5G systems due to its attractive SE gains and potential to support low latency massive connectivity. NOMA technique, a paradigm shift from the OMA scheme, primarily seeks to allocate non-orthogonal resources to users to manage interference. Considering the enormous benefits of CF-mMIMO and NOMA, integrating these two techniques is envisioned to reap further gains. Recent research efforts on this hybrid combination are summarized in Table 11.

Table 11 NOMA-aided cell-free massive MIMO

The channel model of a NOMA-aided CF massive MIMO system is presented. The DL channel between the kth user and the mth AP in the nth cluster, where k ∈ {1, …, K}, m ∈ {1, …, M} and n ∈ {1, …, N} can be expressed as (46)

$$ {h}_{mnk}={\zeta}_{mnk}^{1/2}{\tilde{h}}_{mnk}, $$
(46)

where ζmnk accounts for the large-scale fading, which changes at a prolonged rate [162]. Besides, the circularly symmetric Gaussian assigned with zero mean unit variance can be written as (47)

$$ {\tilde{h}}_{mnk}\sim CN\left(0,1\right) $$
(47)

Spectral efficiency of cell-free massive MIMO

The need to improve the SE by network service providers owing to an ever-increasing number of users and high rate expectations in B5G wireless networks cannot be overemphasized. Interestingly, massive and dense antennae deployment has been at the forefront, and thanks to its ability to upgrade the BS hardware rather than the deployment of new BS sites. Compared to conventional massive MIMO systems, CF has a great potential of substantially improving the SE by employing additional antennas at users, power allocation, and the receiver filter coefficient design. Table 12 presents certain areas of application of CF massive MIMO in improving the SE.

Table 12 Spectral efficiency of cell-free massive MIMO

Additionally, a closed-form expression for the UL SE of CF massive MIMO with the LS estimator is presented. In this context, spatially correlated Rayleigh fading channels are taken into consideration. The received signal ru, k at the CPU is expressed as (48)

$$ {r}_{u,k}=\sum \limits_{m=1}^M{\hat{h}}_{mk}^D{\boldsymbol{x}}_{u,m}=\sqrt{\rho}\sum \limits_{k^{\acute{\mkern6mu}}=1}^K\sum \limits_{m=1}^M\sqrt{{\mathrm{\mbox{\fontencoding{U}\fontfamily{phon}\selectfont\char110}}k}^{\acute{\mkern6mu}}}{\hat{h}}_{mk}^D{h}_{m{k}^{\acute{\mkern6mu}}}{v}_{k^{\acute{\mkern6mu}}}+\sum \limits_{m=1}^M{\hat{h}}_{mk}^D{\boldsymbol{y}}_{u,m}. $$
(48)

where vk depicts the desired signal detected by ru, k. The received signal in (48) can be further expressed as (49)

$$ {r}_{u,k}={DS}_k.{v}_k+{BU}_k.{v}_k+\sum \limits_{k^{\acute{\mkern6mu}}\ne k}^K{UI}_{k{k}^{\acute{\mkern6mu}}}.{v}_{k^{\acute{\mkern6mu}}}+{NI}_k, $$
(49)

where the desired signal (DS) is given by (50), the beamforming uncertainty gain (BU) is given by (51), the multiuser interference (UI) is given by (52), and the noise interference (NI) is given by (53).

$$ {DS}_k\triangleq \sqrt{\rho \mathrm{\mbox{\fontencoding{U}\fontfamily{phon}\selectfont\char110}}k}\mathbbm{E}\left\{\sum \limits_{m=1}^M{\hat{h}}_{mk}^D{h}_{mk}\right\} $$
(50)
$$ {BU}_k\triangleq \sqrt{\rho \mathrm{\mbox{\fontencoding{U}\fontfamily{phon}\selectfont\char110}}k}\left(\sum \limits_{m=1}^M{\hat{h}}_{mk}^D{h}_{mk}-\mathbbm{E}\left\{\sum \limits_{m=1}^M{\hat{h}}_{mk}^D{h}_{mk}\right\}\right) $$
(51)
$$ {UI}_{k{k}^{\acute{\mkern6mu}}}\triangleq \sqrt{\rho}\sum \limits_{m=1}^M\sqrt{{\mathrm{\mbox{\fontencoding{U}\fontfamily{phon}\selectfont\char110}}k}^{\acute{\mkern6mu}}}{\hat{h}}_{mk}^D{h}_{m{k}^{\acute{\mkern6mu}}} $$
(52)
$$ {NI}_k\triangleq \sum \limits_{m=1}^M{\hat{h}}_{mk}^D{\boldsymbol{y}}_{u,m}\kern0.5em $$
(53)

The UL SINR of the kth user is given by (54)

$$ {SINR}_k=\frac{\left|{DS}_k\right|{}^2}{\mathbbm{E}\left\{\left|{BU}_k\right|{}^2\right\}+\sum \limits_{k^{\acute{\mkern6mu}}\ne k}^K\mathbbm{E}\left\{\left|{UI}_{k{k}^{\acute{\mkern6mu}}}\right|{}^2\right\}+\mathbbm{E}\left\{\left|{NI}_k\right|{}^2\right\}} $$
(54)

Thus, the SE is obtained as (55)

$$ {SE}_k^{cf}=\left(1-\frac{\tau_p}{\tau_c}\right){\mathit{\log}}_2\left(1+\frac{\rho \mathrm{\mbox{\fontencoding{U}\fontfamily{phon}\selectfont\char110}}k\left|\sum \limits_{m=1}^M{\beta}_{mk}\boldsymbol{tr}\left({\boldsymbol{R}}_{mk}\right)\right|{}^2}{\rho \sum \limits_{k^{\acute{\mkern6mu}}=1}^K\mathrm{\mbox{\fontencoding{U}\fontfamily{phon}\selectfont\char110}}{k}^{\acute{\mkern6mu}}\sum \limits_{m=1}^M{\beta}_{mk}{\beta}_{m{k}^{\acute{\mkern6mu}}}\boldsymbol{tr}\left({\boldsymbol{R}}_{mk}{\boldsymbol{R}}_{m{k}^{\acute{\mkern6mu}}}\right)+\frac{1}{\tau_p}\sum \limits_{k^{\acute{\mkern6mu}}=1}^K\mathrm{\mbox{\fontencoding{U}\fontfamily{phon}\selectfont\char110}}{k}^{\acute{\mkern6mu}}\sum \limits_{m=1}^M{\beta}_{m{k}^{\acute{\mkern6mu}}}\boldsymbol{tr}\left({\boldsymbol{R}}_{m{k}^{\acute{\mkern6mu}}}\right)+\sum \limits_{m=1}^M{\beta}_{mk}\boldsymbol{tr}\left({\boldsymbol{R}}_{mk}\right)+\frac{MN}{\rho {\tau}_P}}\right) $$
(55)

Energy efficiency of cell-free massive MIMO

While state-of-the-art technologies are consistently developed to address data, QoS, and capacity demands, issues related to environmental concerns and high-power consumption are escalating rapidly. EE has become an essential criterion in designing future wireless networks, and its importance has been realized even more. The cellular configuration can deliver higher EE, and it is widely recognized as naturally energy-efficient. As a step further, the EE of CF-mMIMO has been characterized by researchers worldwide, and several sophisticated optimization techniques and comprehensive power models have been developed accordingly. Table 13 presents a summary of research efforts and future research directions on the EE of CF-mMIMO.

Table 13 Energy efficiency of cell-free massive MIMO systems

Furthermore, the total EE model of a CF massive MIMO network is presented. From correspondence [168], the total power consumption can be written as (56)

$$ {P}_{\mathrm{total}}={P}_A+{P}_C, $$
(56)

where PA represents the power amplifier’s power consumption and PC represents the circuit power consumption. The power consumption PA and PC is obtained as (57) and (58), respectively

$$ {P}_A=\frac{1}{\zeta}\rho {N}_0\sum \limits_{k=1}^K{v}_k, $$
(57)
$$ {P}_C=M{P}_f+K{P}_U+\sum \limits_{k=1}^K{P}_{b,m}, $$
(58)

where ζ reflects the power amplifiers efficiency at each user, Pf denotes the fixed power consumption at each AP, PU accounts for the power required to run circuit components at each user, and Pb, m denotes the backhaul power consumption from the mth AP to the CPU and is expressed in the following [169, 170]

$$ {P}_{b,m}={P}_{BT}\frac{R_{b,m}}{C_{b,m}} $$
(59)

where PBT accounts for the total power to operate the backhaul traffic at full capacity, Rb refers to the backhaul rate between the mth AP and the CPU, and Cb reflects the capacity of the backhaul link between the mth AP and the CPU. More precisely, the backhaul rate Rb can be expressed as (60)

$$ {R}_{b,m}=\frac{2\ K\ {\tau}_f{\sigma}_m}{T_c}, $$
(60)

where σm represents the quantization bits of the mth AP. Thus, the total EE is obtained as (61)

$$ {E}_e\ \left({v}_k,{\boldsymbol{U}}_k,\sigma \right)=\frac{B\ S\ \left({v}_k,{\boldsymbol{U}}_k,\sigma \right)}{P_{total}}\ \left(\mathrm{bit}/\mathrm{Joule}\right) $$
(61)

Open research issues and lessons learned

The concept of CF massive MIMO has received considerable research efforts to understand its essential features and immense benefits. While significant progress has been made, many open questions, unresolved practical, and various deployment challenges persist, requiring substantial research efforts before realizing its promised gains. Table 14 presents an outline of some research directions in HI, signal detection, EE, channel estimation, pilot contamination, and security and privacy issues. As a step further, we highlight vital lessons learned for future research exploration in the area of CF massive MIMO.

Table 14 Open research issues in cell-free massive MIMO

Lessons learned

In this section, a comprehensive summary of lessons learned from this survey is presented. The summary covers CF massive MIMO, mmWave, Terahertz, EE, NOMA, security and privacy, HI, and SWIPT, as outlined briefly.

Lesson one

The current wireless network infrastructure is faced with unprecedented demand for ultra-reliability and higher data rates. CF massive MIMO, where many APs are densely deployed across the entire coverage area linked to a CPU, has been proposed to address this problem. However, there are still issues to contend with in network management, hardware design, and practical implementation of CF-mMIMO.

Lesson two

By a careful analysis of the current 5G wireless systems, employing the highly congested microwave frequency band spanning from about 300 MHz to 6 GHz alongside the use of large-scale antenna arrays may not be sufficient to satisfy the demands for increased throughput, ubiquitous QoS, and ubiquitous connectivity. The so-called mmWave frequency band, though a promising physical layer solution to the envisioned beyond 5G wireless communication network, is susceptible to high molecular absorption. This lends the terahertz band a natural candidate for 6G wireless networks. However, distance is a major problem in terahertz communication. However, the combined use of an ultra-dense wireless network, CF-mMIMO, and the less-congested terahertz band is a crucial enabler to much larger bandwidths and a basic pillar to sustain the SE and EE. Terahertz-enabled CF-mMIMO will be a key enabler for next-generation wireless communication systems.

Lesson three

Compared to earlier generations of wireless communication networks, the total power consumption of 5G and beyond-5G systems have increased greatly. This is primarily due to the higher density of APs, larger bandwidths, and larger antenna numbers, resulting in increased environmental and economic concerns. Thus, EE has become a critical requirement in the design of emerging wireless networks. In order to address this issue, highly comprehensive power models, efficient energy management strategies, and more sophisticated optimization techniques are required. Specifically, these systems may be enabled with energy harvesting and energy exchange capabilities. The grid source will be incorporated to compensate for the random and intermittent nature of the harvested energy owing to uncontrollable environmental conditions. Also, the possibility of turning off inactive APs is another useful technique to improve the overall EE of CF massive MIMO systems.

Lesson four

An indispensable candidate technique for next-generation wireless standards is NOMA. The power domain-based concept represents a paradigm shift from OMA, which is fast approaching its fundamental SE limit. An interplay of the distinctive benefits of NOMA and CF massive MIMO is expected to substantially boost the system’s performance with respect to bandwidth efficiency, spectral and energy efficiency, massive connectivity with low latency, and concurrent transmission from multiple users. Nonetheless, the additional hardware complexity due to error propagation and SIC processing is undesirable in practice. In order to tackle the trade-off between complexity and performance (sum rate), high-quality optimization techniques are required.

Lesson five

From the inception of wireless communication networks, security and privacy threats have been a real concern for network operators. With the deployment of 5G and the emerging beyond 5G systems to support billions of connected devices and drive high user mobility, threats to availability and integrity of networks, besides the growing concerns for user privacy, will become more visible. The security threat is envisaged to be greater than ever. There is a need for advanced cryptographic schemes to provide a robust security architecture in wireless networks to protect user privacy. More needs to be done in blockchain technology and quantum communication to tackle the confidentiality and privacy threats posed to future wireless networks.

Lesson six

Deployment of CF massive MIMO depends on multiple antenna elements, which increase the system complexity, energy consumption, and hardware design cost. By using low-cost components, the hardware imperfections increase, which is detrimental to the overall system’s performance. These imperfections consisting of phase noise, I/Q imbalance, amplifier non-linearities, and more are referred to as HIs. There is a need for an optimal trade-off between the quality of the transceiver hardware design and cost. Additionally, there is a need for suitable compensation algorithms to minimize the effects of HI.

Lesson seven

Compared to traditional energy storage methods, SWIPT is considered a potential solution to ease the transmission rate and minimize the energy consumed by battery-powered devices. However, the exponential growth in the number of connected devices coupled with the issues posed by mmWave communication in 5G-aided SWIPT networks presents an entirely new challenge. Therefore, there is a need to find proper allocation schemes to optimize the rate-energy trade-off between information retention and energy allocation.

Conclusions

Cell-free massive MIMO has been proposed as a novel architecture to address the ever-increasing demands for high SE, coverage probability, green output, and uniformly distributed throughput for all network users. In the ubiquitous 5G and the envisioned beyond 5G wireless communication systems, CF-mMIMO enables the deployment of dense APs over a wide network area to communicate with several UEs cooperatively. Motivated by the distinctive benefits of CF-mMIMO, this paper attempted to give a concise survey of the design, application scenarios, potentials, and deployment challenges of this disruptive technology. The system model of CF-mMIMO, covering the UL/DL pilot-aided channel estimation, UL/DL training, and channel hardening, is discussed elaborately. Additionally, the performance characteristics of CF-mMIMO using key design metrics like EE, channel hardening, SE, and more, are discussed extensively. Furthermore, the viable application areas of CF-mMIMO are outlined, and an up-to-date review of key findings and current research trends in CF-mMIMO is presented. Finally, open research issues and key take-away lessons are drawn from the survey to explore this exciting area of wireless communications systems deeply.