Downlink channel estimation for millimeter wave communication combining low-rank and sparse structure characteristics

The acquisition of channel state information (CSI) is essential in millimeter wave (mmWave) multiple-input multiple-output (MIMO) systems. The mmWave channel exhibits sparse scattering characteristics and a meaningful low-rank structure, which can be exploited simultaneously to reduce the complexity of channel estimation. Most existing works recover the low-rank structure of the channel using the nuclear norm. However, solving the nuclear norm-based convex problem often yields a suboptimal solution of the rank minimization problem, which degrades the accuracy of channel estimation. Previous contributions also recover the channel with an over-complete dictionary, assuming that the mmWave channel can be sparsely represented under some dictionary; an over-complete dictionary, however, may increase the computational complexity. To address these problems, we propose a channel estimation framework based on non-convex low-rank approximation and dictionary learning that exploits the joint low-rank and sparse representations of wireless channels. We replace the widely used nuclear norm with a non-convex low-rank approximation and design a dictionary learning algorithm based on channel feature classification with a deep neural network (DNN). Simulation results show that the proposed scheme outperforms the conventional dictionary learning algorithm, the Bayesian framework algorithm, and compressed sensing-based algorithms.


Introduction
With the rapidly increasing demand for high-speed wireless communication, massive multiple-input multiple-output (MIMO) systems have attracted extensive attention in academia and industry due to their outstanding ability to improve system capacity and spectrum utilization [1,2]. MIMO technology has been widely used in advanced communication standards, such as IEEE 802.11ac [3], IEEE 802.16m [4], and 3GPP Long Term Evolution networks [5,6]. Owing to the extremely high attenuation and serious signal absorption in the mmWave frequency bands, mmWave communication systems employ large antenna arrays at the base station. Obtaining accurate channel state information is a prerequisite for optimal system performance. For CSI acquisition, Time Division Duplex (TDD) mode takes advantage of the reciprocity of the uplink and downlink channels. In Frequency Division Duplex (FDD) mode, where channel reciprocity no longer holds, the base station sends a downlink pilot signal; the mobile station receives and detects the pilot signal and then feeds back CSI to the base station. With traditional channel estimation methods, the length of the pilot sequence must be proportional to the number of base station antennas, which makes it difficult to complete channel estimation within the coherence time. Moreover, the uplink feedback load is high. Therefore, traditional methods are impractical for channel estimation in massive MIMO systems.
However, recent advances in sparse representation and compressed sensing have inspired new approaches, e.g., distributed compressive sensing (DCS) [7], matrix completion [8], and deep neural networks (DNN) [9], which aim to reduce the heavy pilot load and the high uplink feedback quantization overhead. In Rao and Lau [7], a distributed compressed sensing channel estimation algorithm based on the joint sparse structure of channels in a multi-user environment was proposed to reduce pilot load and uplink feedback overhead. Z. Gao [10] proposed a structured compressed sensing framework for joint spatial and temporal channel estimation that exploits the common sparse support set of MIMO channels to reduce pilot load. However, this method increases the computational load of the terminal, and the signal recovery probability of its subspace tracking algorithm needs to be improved. In Fang et al. [11], the authors proposed an adaptive channel estimation and feedback framework based on spatial common sparsity for FDD massive MIMO. Based on the low-rank characteristics of massive MIMO channels, Sun et al. [12] investigated an FDD downlink channel estimation method that exploits the low-rank or near-low-rank structure of the channel covariance matrix. However, computing the channel covariance matrix is too costly for practical multi-user cell scenarios. In [2], FDD downlink channel estimation was performed by exploiting the low-rank characteristics of the massive MIMO channel with iterative optimization and deep learning methods.
However, the above methods have several major problems: (1) Only the sparse or the low-rank characteristics of the channel are utilized, whereas joint channel estimation with both sparse and low-rank characteristics can further reduce the training and feedback overhead. (2) In many practical scenarios, the channel state information is sparse only under a suitable basis. Therefore, the above sparsity-based algorithms lose estimation accuracy in some practical communication scenarios.
Exploiting channel sparsity in the angular domain, Wei [13] introduced a learned dictionary in place of a predefined dictionary to estimate the downlink channel of an FDD massive MIMO system and obtained a better sparse representation of the channel. However, the learned dictionary in this method comes from actual channel measurements, and the experimental procedure is complex, which makes it unsuitable for industrial information transmission systems with strict real-time requirements.
In addition to their sparse scattering nature, mmWave channels may exhibit a low-rank structure. The rank minimization problem is challenging to solve. Thus, the rank function is usually replaced by the convex nuclear norm, leading to a relaxed convex formulation of the channel estimation problem [14][15][16][17][18][19]. However, the channel recovered with the convex nuclear norm is usually suboptimal because the nuclear norm is a loose approximation of the rank function [16]. In Lu and Tang [16], the authors proposed a non-convex surrogate of the $\ell_0$ norm to approximate the rank function, which is solved by an iteratively reweighted nuclear norm algorithm.
In this study, a novel channel estimation framework is proposed that utilizes the joint low-rank and block-sparse features of mmWave channels in the angle domain together with a sparse representation based on a deep learning network. For the low-rank characteristics of the channel, a non-convex method is used for low-rank approximation, and an iterative optimization algorithm is designed for it. In addition, we present a deep learning network-based dictionary learning method in which the dictionary is learned from the output of a deep learning network that extracts key characteristics of mmWave channel measurements. The learned dictionary adapts specifically to the key characteristics of the cell and promotes a more efficient and robust sparse representation of the channel, which in turn improves the performance of channel estimation.
Notations: the $\ell_0$ norm is the number of non-zero elements in a vector. The $\ell_1$ norm is the sum of the absolute values of the elements of the vector.

System model
We consider a single-cell massive MIMO system working in FDD mode. The BS is equipped with N antennas, and each mobile station employs M antennas. To realize downlink channel estimation, the base station sends a training pilot sequence to the mobile station, and the mobile station feeds back CSI to the base station. The pilot signal received at the mobile station at the jth time slot ($j = 1, 2, \cdots, T_d$) is expressed as
$$y_j = x_j^{T} h + n_j, \tag{1}$$
and stacking the $T_d$ time slots gives
$$y = Xh + n, \tag{2}$$
where $h \in \mathbb{C}^{N \times 1}$ is the downlink channel response vector between the BS and the mobile user, $x_j \in \mathbb{C}^{N \times 1}$ is the pilot vector of the jth slot and forms the jth row of $X \in \mathbb{C}^{T_d \times N}$, and $n \in \mathbb{C}^{T_d \times 1}$ is the noise vector at the receiver such that $n \sim \mathcal{CN}(0, I)$; $y$ and $n$ are the aggregated signal and noise, respectively. Using a conventional channel estimation method such as least squares, the channel estimate is given by
$$\hat{h}_{LS} = X^{\dagger} y, \tag{3}$$
where $X^{\dagger}$ is the Moore-Penrose pseudoinverse. Accurate acquisition of channel state information by the conventional method requires $T_d \ge N$; that is, the length of the pilot sequence must be greater than or equal to the number of antennas. For a massive MIMO system, N is large, which makes the conventional algorithm infeasible: the training period becomes so long that channel estimation cannot be completed within the channel coherence time. In addition, the terminal needs to feed back the channel state information to the base station, which requires a feedback load proportional to the dimension of the channel matrix.
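The pilot-length requirement can be checked numerically. The following is a minimal sketch, assuming a noise-free stacked model y = Xh with i.i.d. complex Gaussian pilots; the function names, the random pilots, and the small dimension N = 16 are illustrative assumptions and are not taken from the paper.

```python
import numpy as np

def ls_channel_estimate(X, y):
    """Least-squares estimate: Moore-Penrose pseudoinverse of X applied to y."""
    return np.linalg.pinv(X) @ y

def relative_error(T_d, h, rng):
    """Relative LS error with T_d noise-free pilot slots (assumed Gaussian pilots)."""
    N = h.shape[0]
    X = (rng.standard_normal((T_d, N)) + 1j * rng.standard_normal((T_d, N))) / np.sqrt(2)
    y = X @ h  # noise is omitted so that only the effect of T_d is visible
    h_hat = ls_channel_estimate(X, y)
    return np.linalg.norm(h_hat - h) / np.linalg.norm(h)

rng = np.random.default_rng(0)
N = 16  # illustrative antenna count, far smaller than a massive MIMO array
h = (rng.standard_normal(N) + 1j * rng.standard_normal(N)) / np.sqrt(2)

err_full = relative_error(N, h, rng)        # T_d = N: X is square and invertible
err_short = relative_error(N // 2, h, rng)  # T_d < N: underdetermined system
print(f"T_d = N   -> relative error {err_full:.2e}")
print(f"T_d = N/2 -> relative error {err_short:.2e}")
```

With T_d < N the pseudoinverse returns only the minimum-norm solution, so the part of h outside the row space of X is lost; this is the gap that the sparse and low-rank priors discussed later are meant to close.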
To complete downlink channel estimation with limited training overhead, channel estimation based on compressed sensing has attracted much attention. In the compressed sensing framework, as long as the original signal is sparse in some basis, fewer measurements ($T_d < N$) suffice to represent a high-dimensional signal (in this paper, high-dimensional signals refer to massive MIMO channels).
Assume there exists a matrix $D \in \mathbb{C}^{N \times M}$ ($N \ge M$) such that $h = D\beta$, where the vector $\beta \in \mathbb{C}^{M \times 1}$ is sparse, that is, $S = \|\beta\|_0 \le N$, and $\|n\|_2 \le \varepsilon$. Therefore, the downlink channel estimation problem can be converted into the following mathematical problem:
$$y = XD\beta + n. \tag{4}$$
Given y, X, and D, if we are able to solve for β, the channel state information is obtained as $\hat{h} = D\beta$. In other words, if the channel is sparse under a suitable dictionary, the high-dimensional channel can be recovered. Exploiting this sparsity, channel estimation can be converted into the following mathematical problem:
$$\min_{\beta} \|\beta\|_0 \quad \text{s.t.} \quad \|y - XD\beta\|_2 \le \varepsilon.$$
According to existing theory, when $T_d \ge cS\log(N/S)$ (c is a constant) is satisfied, the $\ell_1$ norm can replace the $\ell_0$ norm as a relaxation, and β can still be recovered with high precision. Therefore, accurate channel estimation can be realized with a number of training symbols proportional to the channel sparsity, and the training period length is no longer required to be proportional to the number of base station antennas. However, (4) is an underdetermined equation if we intend to employ a small number of training samples $T_d < N$. The system then has an infinite number of solutions for β, and a unique solution must be selected through sparsity constraints. Therefore, it is necessary to study the constraint conditions of minimum sparsity.
The mmWave channel can be characterized in matrix form by the following geometric model:
$$H = \sum_{l=1}^{L} \alpha_l\, a_{BS}(\theta_l)\, a_{MS}^{H}(\phi_l),$$
where L is the number of paths; $\alpha_l$ is the complex gain associated with the lth path; $\theta_l \in [0, 2\pi]$ and $\phi_l \in [0, 2\pi]$ are the associated azimuth AoA and azimuth AoD, respectively; and $a_{BS} \in \mathbb{C}^{N}$ and $a_{MS} \in \mathbb{C}^{M}$ are the array response vectors associated with the BS and the mobile station, respectively. Assume that a uniform linear array (ULA) is employed. The steering vectors at the BS and the MS can then be expressed as
$$a_{BS}(\theta) = \frac{1}{\sqrt{N}}\left[1, e^{j\frac{2\pi}{\lambda} d \sin\theta}, \ldots, e^{j(N-1)\frac{2\pi}{\lambda} d \sin\theta}\right]^{T}, \qquad a_{MS}(\phi) = \frac{1}{\sqrt{M}}\left[1, e^{j\frac{2\pi}{\lambda} d \sin\phi}, \ldots, e^{j(M-1)\frac{2\pi}{\lambda} d \sin\phi}\right]^{T},$$
where λ is the signal wavelength and d is the distance between adjacent antenna elements.
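To make the geometric model concrete, the sketch below builds ULA steering vectors and sums L rank-one path contributions. It is an illustration under stated assumptions: half-wavelength spacing, unit-norm steering vectors, the BS vector paired with the AoA as in the text, and hypothetical gains and angles chosen only for the example.

```python
import numpy as np

def ula_steering(n_antennas, angle, d_over_lambda=0.5):
    """Unit-norm ULA steering vector for an angle in radians.

    d_over_lambda is the element spacing in wavelengths (0.5 is assumed here).
    """
    k = np.arange(n_antennas)
    phase = 2j * np.pi * d_over_lambda * k * np.sin(angle)
    return np.exp(phase) / np.sqrt(n_antennas)

def geometric_channel(N, M, gains, aoas, aods):
    """Sum of L rank-one terms alpha_l * a_BS(theta_l) * a_MS(phi_l)^H."""
    H = np.zeros((N, M), dtype=complex)
    for alpha, theta, phi in zip(gains, aoas, aods):
        H += alpha * np.outer(ula_steering(N, theta), ula_steering(M, phi).conj())
    return H

# Hypothetical three-path example (values are not from the paper).
H = geometric_channel(
    N=32, M=8,
    gains=[1.0, 0.5j, -0.8],
    aoas=[0.2, 0.9, -0.7],
    aods=[0.1, -0.5, 1.1],
)
rank_H = np.linalg.matrix_rank(H)
print(H.shape, rank_H)
```

Because each path contributes one rank-one term, the rank of H cannot exceed the number of paths L, which is the origin of the low-rank structure exploited later in the paper.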
Block-sparse and low-rank features of mmWave channels

Block-sparse property
To formulate channel estimation as a sparse signal recovery problem, we first express the channel in a beamspace MIMO representation as follows:
$$H = A_{BS} H_v A_{MS}^{H}, \tag{7}$$
where $A_{BS} \in \mathbb{C}^{N \times N_1}$ is an over-complete matrix ($N_1 \ge N$) with each column a steering vector parameterized by a pre-discretized AoA, $A_{MS} \in \mathbb{C}^{M \times N_2}$ is an over-complete matrix ($N_2 \ge M$) with each column a steering vector parameterized by a pre-discretized AoD, and $H_v \in \mathbb{C}^{N_1 \times N_2}$ is a sparse matrix with L non-zero entries corresponding to the channel path gains $\{\alpha_l\}$. Substituting (7) into the received signal model and vectorizing expresses each measurement as a linear function of $\tilde{h} \triangleq \mathrm{vec}(H_v)$ through a Kronecker product of the two dictionaries, where ⊗ denotes the Kronecker product and $(\cdot)^{*}$ the complex conjugate. Collecting all measurements {y(t)} and stacking them into a vector $y \triangleq [y_1 \cdots y_{T_d}]^{T}$, we arrive at a linear measurement model in $\tilde{h}$. Since $\tilde{h}$ is sparse, channel estimation becomes a sparse signal recovery problem, and $\tilde{h}$ can be estimated with sparse signal recovery algorithms.
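The vectorization step rests on the identity vec(A X B) = (Bᵀ ⊗ A) vec(X). The short check below applies it to the assumed beamspace form H = A_BS H_v A_MS^H, for which Bᵀ is the complex conjugate of A_MS. Random matrices stand in for the steering dictionaries and all dimensions are illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)
N, M, N1, N2 = 4, 3, 6, 5  # small illustrative sizes with N1 >= N and N2 >= M

def crandn(*shape):
    """Complex Gaussian matrix used as a stand-in for a dictionary."""
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

def vec(Z):
    """Column-stacking vectorization."""
    return Z.flatten(order="F")

A_BS = crandn(N, N1)
A_MS = crandn(M, N2)
H_v = crandn(N1, N2)

H = A_BS @ H_v @ A_MS.conj().T

lhs = vec(H)
rhs = np.kron(A_MS.conj(), A_BS) @ vec(H_v)  # (A_MS^* kron A_BS) vec(H_v)
identity_holds = np.allclose(lhs, rhs)
print(identity_holds)
```

The Kronecker matrix has N·M rows and N1·N2 columns, so the unknown vec(H_v) is longer than the vectorized channel; sparsity of H_v is what makes its recovery well posed.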

Low-rank property
Due to the spatial correlation and the asymmetric angular spreads over different domains, mmWave channels may exhibit a meaningful low-rank structure that can be utilized to reduce the sample complexity. Such a low-rank structure can be observed in recent real-world mmWave channel measurements [20].

Overall framework of our proposed algorithm
The overall framework of the proposed algorithm is shown in Fig. 1. The algorithm is divided into three stages: (1) Preprocessing: based on the joint low-rank and sparse characteristics of massive MIMO channels, a mathematical model for channel state acquisition is established, and a non-convex weighted low-rank approximation replaces the traditional nuclear norm low-rank approximation model. (2) Construction of the incomplete learned dictionary: channel measurements are obtained from a specific cell as the training data set, the training data are converted from one-dimensional data to two-dimensional images to construct channel response samples, and a deep neural network is built to identify the transmission features of the mmWave channels. (3) Channel recovery: the block-sparse compressed sensing method [20] is used to obtain the channel state information and complete high-precision CSI estimation for the FDD downlink of massive MIMO.

Channel estimation algorithm combining low-rank approximation and dictionary learning
In this section, we divide the proposed framework into two stages. In the first stage, we treat the received downlink pilot signals in (2) as a low-rank matrix completion problem.
In the second stage, we measure channel responses in a specific cell and construct an incomplete learned dictionary that adapts well to the key features of mmWave channels.

Low-rank approximation based on non-convex theory
The received model in (2) can be formulated as sampling from a low-rank matrix, where $Y \triangleq \sqrt{\rho}\, H^{T} X$ is a low-rank matrix with $\mathrm{rank}(Y) = L$ and $Y_{ij}$ represents the (i,j)th element of the matrix Y. Since mmWave channels exhibit a low-rank structure, the matrix Y satisfies the low-rank property as well [19]. Therefore, under joint low-rank and sparse constraints, the channel state acquisition model can be expressed as the three-term objective of formula (12). The first term of formula (12) is the nuclear norm approximation of the low-rank matrix; i.e., the sum of the singular values of the matrix. The second term is the $\ell_1$ norm of the sparse component, which enforces sparsity. The third term is a Frobenius norm, which ensures consistency with the observed signals. Equation (12) is a convex optimization problem and can be solved effectively by various methods. However, its solution is usually suboptimal because the nuclear norm is only a loose approximation of the rank function.
Among norm relaxation functions, nonconvex penalty functions yield more accurate solutions than traditional convex norm relaxation. The nonconvex penalty is defined through a function η(t) that is monotonically non-decreasing and nonconvex on the real field ℝ, with η(0) = 0. The rank of a matrix equals the $\ell_0$ norm of its singular value vector, and the nuclear norm of a matrix equals the $\ell_1$ norm of its singular value vector, so the nonconvex penalty function for vectors extends naturally to the singular value vector of a matrix, which gives the nonconvex weighted nuclear norm of the matrix.
Definition: The nonconvex weighted nuclear norm of any matrix X is defined as the sum of the singular values of the matrix after the nonconvex mapping, that is, $\sum_{i=1}^{n} g(\sigma_i^{X})$, where $n \le m$ and $\sigma_i^{X}$ is the ith singular value of the matrix X. Here g(t) is a non-convex penalty function acting on the singular values of the matrix, so the real function g(t) only needs to satisfy two conditions: g(0) = 0, and g(t) is a monotonically non-decreasing non-convex function on the interval [0,1].
Based on the above theory, in this section we propose a non-convex algorithm for approximating the channel rank function, which converts the downlink channel estimation of Eq. (12) into the problem of Eq. (13). As Eq. (13) shows, the nuclear norm in Eq. (12) is replaced by a non-convex function as an approximation of the low-rank matrix, and the third term is a loss function that constrains the estimation error. Eq. (13) involves continuous variables together with discrete variables, combined with nonconvex functions. The weighted nuclear norm low-rank approximation with a nonconvex function is difficult because it cannot be solved directly. Thus, we use the Alternating Direction Method of Multipliers (ADMM) to address the problem.
In the ADMM algorithm, formula (13) is solved by fixing one variable while solving for the other. Accordingly, when the dictionary matrix D is fixed, the acquisition in Eq. (13) can be regarded as a matrix completion problem, and the low-rank approximation of the received signals Y is converted into Eq. (14). The function g(t) is non-convex on its interval and satisfies Eq. (15). The kth iterate of the matrix Y is denoted $Y^{k}$, and $\sigma_i^{k}$ is the ith singular value of $Y^{k}$. Since g is a nonconvex function, it is difficult to solve Eq. (14) directly. Equation (15) gives a linear approximation of the non-convex function, so Eq. (14) can be solved iteratively through Eq. (16), where $\omega_i^{k}$ is the sub-gradient of g(t) on the interval [0, +∞). To solve Eq. (16) more easily, we use a first-order Taylor expansion of the function in (16) at the matrix $K \in \mathbb{R}^{M \times N}$, given by Eq. (17). The iteration (16) is thereby converted into Eq. (18). Equation (18) is still a nonconvex optimization problem, but according to the following theorem, it has an analytical solution.
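The reweighting step can be illustrated with a concrete penalty. The paper does not specify g(t) in this section, so the sketch below assumes the log penalty g(t) = log(1 + t/γ), a common choice that satisfies g(0) = 0 and is non-decreasing; the function names and the value of γ are hypothetical.

```python
import numpy as np

def log_penalty(t, gamma=1.0):
    """Assumed penalty g(t) = log(1 + t / gamma); g(0) = 0 and g is non-decreasing."""
    return np.log1p(np.asarray(t, dtype=float) / gamma)

def reweighting(singular_values, gamma=1.0):
    """Derivative of the assumed penalty at each singular value: 1 / (gamma + sigma)."""
    return 1.0 / (gamma + np.asarray(singular_values, dtype=float))

sigma = np.array([5.0, 3.0, 1.0, 0.0])  # singular values sorted in descending order
w = reweighting(sigma)
print(w)
```

Large singular values receive small weights and are penalized less, while small ones are pushed toward zero. Because the singular values are sorted in descending order, the resulting weights are in ascending order, which is the ordering required by the thresholding theorem that follows.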
As was proved in Menglu [14]: for any λ > 0 and $Y \in \mathbb{R}^{m \times n}$, if $0 \le w_1 \le w_2 \le \cdots \le w_n$, then the optimal solution of the minimization problem
$$\min_{X}\; \lambda \sum_{i=1}^{n} w_i\, \sigma_i(X) + \frac{1}{2}\|X - Y\|_F^2$$
can be obtained by the weighted singular value thresholding operator, denoted as $X^{*} = U S_{\lambda w}(\Sigma) V^{T}$, where $Y = U\Sigma V^{T}$ is the singular value decomposition of the matrix Y and the thresholded singular values are expressed as $S_{\lambda w}(\Sigma) = \mathrm{diag}\{(\Sigma_{ii} - \lambda w_i)_{+}\}$. Based on the above derivation, we develop Algorithm 1 to separate and extract the low-rank received signal matrix Y, which is shown in Table 1.
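The closed-form solution can be sketched in a few lines. The code assumes the objective λ Σ w_i σ_i(X) + ½‖X − Y‖_F² with non-descending weights, so each singular value of Y is shrunk by λ·w_i and clipped at zero; the function name and the test matrix are illustrative.

```python
import numpy as np

def weighted_svt(Y, weights, lam):
    """Weighted singular value thresholding.

    Shrinks the i-th singular value of Y by lam * weights[i] and clips at zero.
    The weights are assumed to be non-descending, matching the theorem's condition.
    """
    U, s, Vh = np.linalg.svd(Y, full_matrices=False)
    s_shrunk = np.maximum(s - lam * np.asarray(weights, dtype=float), 0.0)
    return (U * s_shrunk) @ Vh  # scales column i of U by s_shrunk[i]

# Illustrative input with known singular values 5, 3, 1.
Y = np.diag([5.0, 3.0, 1.0])
X_star = weighted_svt(Y, weights=[0.0, 1.0, 2.0], lam=1.0)
print(np.round(X_star, 6))
```

In this example the singular values 5, 3, 1 become 5, 2, 0, so the result has rank 2: the dominant component is left untouched while the weakest one is removed entirely, which a uniform nuclear norm threshold cannot do.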

Channel measurements extraction utilizing DNN
Conventionally, an over-complete DFT dictionary is used for sparse channel recovery. However, a predesigned fixed dictionary cannot adapt to the channel structure of the cell's propagation environment. To reduce system complexity in the dictionary training stage, a better approach is to design an incomplete dictionary that is specific to the transmission environment and suited to the channel characteristics. Ding Yacong et al. proposed a channel estimation method based on a learned dictionary for sparse channel recovery in Liu et al. [17]. Unlike the pre-designed dictionary method, that method learns a dictionary from channel measurements. In the learning process, the sparse representation is optimized, and the sparse vector of the channel is extracted through sparsity constraints. Channel characteristics at different locations in the cell are measured indefinitely, and the channel measurement vectors are used to train an adaptive over-complete dictionary. However, this scheme carries out channel measurements randomly and indefinitely and does not select typical channel positions, which may lose channel state information that indicates important structural features of the channels, decrease the accuracy of dictionary training, and increase the redundancy of channel measurements.
Therefore, we adopt a DNN-based channel measurement method to classify and measure the key features of downlink mmWave channels. On the one hand, the measurement load of the system is reduced. On the other hand, this method can extract most channel structure characteristics and thus construct an incomplete dictionary suited to the link characteristics. This part corresponds to the second stage of the overall framework of the algorithm. The specific implementation process is shown in Fig. 2.
We classify path loss and other features using the DNN. First, we set up non-line-of-sight (NLOS) and line-of-sight (LOS) propagation scenes in the actual transmission environment. Second, we use the DNN to classify channel measurement vectors in terms of path loss, multipath component energy, delay, phase and angle domains, and other characteristics. During training, links with similar parameters are placed in the same group, and group-based channel measurements are obtained.
The channel measurements are sampled with sampling interval $T_S$, and a modulus operation is applied to the sampled signals to obtain the samples $g_m$. As time evolves, the signal $g_m$ tends to zero. Therefore, only $M_2$ ($M_2 < N$) samples are required to construct the input set X of the DNN. The DNN includes three hidden layers with 1024 hidden units each and an output layer with 64 units. The channel measurement matrix x under different transmission environments is obtained from the input training samples. The dimension of the input unit is equal to the dimension of the sparse feature vector. In this paper, the deep neural network adopts a feedforward structure: the input layer takes a 1845-dimensional feature vector, the output layer performs classification, and the channel feature vectors form an incomplete dictionary.
To improve the convergence speed of the network, the cross-entropy loss function is used, with Sigmoid as the activation function of the hidden units. The hidden layers are initialized with the He initialization method. The deep neural network has about 4M adjustable parameters. Stochastic gradient descent with the Nesterov accelerated gradient (NAG) method is used to optimize the parameters. When the output-layer result differs substantially from the preset result, the error is fed back from the hidden layers toward the input layer to modify the weight of each unit. The gradient is calculated by backpropagation based on the minimum mean square error; that is, the mean square error between the channel measurement samples and the estimated channel matrix is taken as the loss function. The basic objective function is $J_{MSE}(W, b)$, where W and b represent the weight and bias matrices to be trained for the estimated channel state information, M is the dimension of the training matrix, and U is the input feature vector. The objective function $J_{MSE}(W, b)$ aims to minimize the mean square error between the channel estimate and the channel measurement. The three hidden layers use 20% dropout to suppress over-fitting during training. The wireless channel measurement matrix is taken as the input signal of the deep learning network.
The output of the lth hidden layer is obtained by applying the activation function to a weighted sum of the outputs of the (l − 1)th layer, in which $\omega_{ij}^{l-1}$ is the weight between the ith node in the (l − 1)th layer and the jth node in the lth layer, $H_i^{l-1}$ is the output of the ith node in the (l − 1)th layer, J is the number of nodes per layer, and f(·) is the activation function of the hidden layer. The parameters are updated with the gradient descent method, in which λ is the learning rate, the momentum ω is used to accelerate the convergence of the algorithm, and K is the number of hidden layers. The process of DNN-based incomplete dictionary learning is summarized as follows: Step 1. Construct the measurement data set to be classified.
Algorithm 1 (Table 1), iteration steps: Initialization: iteration count k = 0, λ, $w_i^{k} = 0$, i = 1, 2, ⋯, n. While not converged: solve Eq. (18) to obtain the solution $Y^{k+1}$; calculate the sub-gradient of g(t).
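The stated architecture and optimizer can be sketched as follows. This is a minimal illustration, assuming fully connected layers of sizes 1845, 1024, 1024, 1024, and 64, a linear output layer, and the standard Nesterov update; dropout and the actual training loop are omitted, and every function name is hypothetical.

```python
import numpy as np

def count_parameters(layer_sizes):
    """Weights plus biases of a fully connected network."""
    return sum(n_in * n_out + n_out
               for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:]))

def he_init(layer_sizes, rng):
    """He initialization: zero-mean Gaussian weights with variance 2 / fan_in."""
    return [(rng.standard_normal((n_in, n_out)) * np.sqrt(2.0 / n_in), np.zeros(n_out))
            for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:])]

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, params):
    """Sigmoid hidden layers followed by a linear output layer (an assumption)."""
    h = x
    for W, b in params[:-1]:
        h = sigmoid(h @ W + b)
    W, b = params[-1]
    return h @ W + b

def nag_step(w, v, grad_fn, lr=0.1, momentum=0.9):
    """One Nesterov accelerated gradient step: gradient taken at the look-ahead point."""
    lookahead = w + momentum * v
    v_new = momentum * v - lr * grad_fn(lookahead)
    return w + v_new, v_new

layer_sizes = [1845, 1024, 1024, 1024, 64]
n_params = count_parameters(layer_sizes)

rng = np.random.default_rng(0)
params = he_init(layer_sizes, rng)
out = forward(rng.standard_normal((2, 1845)), params)  # batch of two feature vectors

# NAG on a toy quadratic 0.5 * ||w - target||^2, whose gradient is w - target.
target = np.array([1.0, -2.0])
w, v = np.zeros(2), np.zeros(2)
for _ in range(300):
    w, v = nag_step(w, v, lambda p: p - target)

print(n_params, out.shape, w)
```

Under these assumptions the network has 4,055,104 trainable parameters, which agrees with the figure of about 4M quoted in the text.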
After the dictionary is obtained, the channel matrix is sparsely recovered in the angular domain. The sparse recovery is implemented with the block-sparse compressed sensing method.
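As a rough illustration of sparse recovery with a given dictionary, the sketch below uses plain orthogonal matching pursuit (OMP). This is a simpler stand-in and not the block-sparse method used in the paper. The effective sensing matrix A (pilot matrix times dictionary) is assumed to be a partial DFT matrix, and the sparse vector is hypothetical.

```python
import numpy as np

def omp(A, y, sparsity):
    """Orthogonal matching pursuit: greedily select `sparsity` columns of A."""
    residual = y.astype(complex)
    support = []
    coef = np.zeros(0, dtype=complex)
    for _ in range(sparsity):
        correlations = np.abs(A.conj().T @ residual)
        if support:
            correlations[support] = 0.0  # never pick the same atom twice
        support.append(int(np.argmax(correlations)))
        # Least-squares fit of y on the atoms selected so far.
        coef, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
        residual = y - A[:, support] @ coef
    beta = np.zeros(A.shape[1], dtype=complex)
    beta[support] = coef
    return beta, sorted(support)

# Assumed setup: 32 measurements of a 64-atom partial DFT dictionary.
n_atoms, n_meas = 64, 32
rows, cols = np.arange(n_meas), np.arange(n_atoms)
A = np.exp(-2j * np.pi * np.outer(rows, cols) / n_atoms) / np.sqrt(n_meas)

beta_true = np.zeros(n_atoms, dtype=complex)
beta_true[0], beta_true[32] = 2.0, 1.0  # two active atoms (hypothetical)
y = A @ beta_true                       # noise-free measurements

beta_hat, support = omp(A, y, sparsity=2)
print(support)
```

With half as many measurements as atoms, the two active atoms are still identified in this noise-free example. A block-sparse solver would additionally exploit the clustering of non-zero entries in the angular domain.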

Simulation results
We now present simulation results to illustrate the performance of the proposed non-convex dictionary learning method. We compare our method with the following algorithms: Dictionary Learning based Channel Model (DLCM) [2], Joint Sparse and Low-rank Bayesian Learning (SLAB) [15], and compressed sensing-based sparse channel estimation (CSSCE). We consider a scenario where the BS and the MS employ uniform linear arrays with N = 128 and M = 1 antennas, respectively. The distance between neighboring antenna elements is assumed to be half the wavelength of the signal. The mmWave channel is assumed to follow the geometric channel model with L = 6 clusters. The mean AoAs and AoDs of these six clusters are denoted $\theta_l$ and $\phi_l$, and the relative AoA and AoD shifts are uniformly generated within the angular spreads $(\theta_l - \delta_\theta/2, \theta_l + \delta_\theta/2)$ and $(\phi_l - \delta_\phi/2, \phi_l + \delta_\phi/2)$. Suppose the base station is equipped with a uniform rectangular array of antennas and the terminal is equipped with a single antenna. The wireless channel model uses the NLOS (non-line-of-sight) UMI-Street Canyon (Urban Microcellular Channel) scenario with a carrier frequency of 28 GHz.
The performance is evaluated via one metric, namely, the normalized mean square error (NMSE). The NMSE is calculated as
$$\mathrm{NMSE} = \frac{\|\hat{H} - H\|_F^2}{\|H\|_F^2},$$
where $\hat{H}$ denotes the estimate of the true channel H. Figure 3a and b illustrate the NMSE performance versus channel matrix rank for all algorithms at SNR = 5 dB and 15 dB, respectively. As can be seen from the figure, the DLCM and CSSCE algorithms are not sensitive to changes in the matrix rank, and compared with SLAB, the performance of the proposed algorithm is significantly improved in the low-rank case. This is because DLCM and CSSCE do not consider the low-rank property of the channel, while the non-convex method is superior to the SLAB algorithm, which uses the nuclear norm approximation. Although the channel has a low rank in MIMO scenarios, traditional algorithms do not make full use of this feature. The algorithm in this paper makes full use of the low-rank characteristics of the channel. In Fig. 3a, when the channel matrix rank is less than 15, the performance of the CSSCE and SLAB algorithms is not as good as that of the proposed algorithm. This is because, from the perspective of least squares, CSSCE has the best performance, but the algorithm is sensitive to noise and performs poorly at low signal-to-noise ratio, while the SLAB algorithm is based on the Bayesian posterior mean, so its robustness to noise is better than that of the least squares algorithm. Therefore, SLAB is superior to CSSCE when the SNR is low and the rank of the matrix is large. When the bit energy signal-to-noise ratio EbNo is increased to 15 dB, the result is similar, as shown in Fig. 3b.
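For reference, the metric can be computed as in the sketch below, assuming the common definition of NMSE as the squared Frobenius norm of the error divided by that of the true channel; the dB conversion and the toy channel are illustrative additions.

```python
import numpy as np

def nmse(H_true, H_est):
    """Normalized mean square error: ||H_est - H_true||_F^2 / ||H_true||_F^2."""
    return np.linalg.norm(H_est - H_true) ** 2 / np.linalg.norm(H_true) ** 2

def nmse_db(H_true, H_est):
    """NMSE on a decibel scale, as commonly plotted."""
    return 10.0 * np.log10(nmse(H_true, H_est))

rng = np.random.default_rng(0)
H = rng.standard_normal((128, 1)) + 1j * rng.standard_normal((128, 1))
H_est = 0.9 * H  # toy estimate with a 10% amplitude error

print(nmse(H, H_est), nmse_db(H, H_est))
```

A 10% amplitude error gives an NMSE of 0.01, that is, about -20 dB, which helps in reading the vertical axes of the NMSE figures.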
Figure 4 shows the normalized mean square error versus EbNo for the four algorithms. The algorithm proposed in this paper outperforms DLCM, SLAB, and CSSCE for the following reasons: (1) The DLCM algorithm carries out random large-scale measurements of the channels in the cell. Thus, DLCM cannot accurately capture all link characteristics and has a large data storage load. The method proposed in this paper classifies the channel scenes and extracts the key features of the measurements according to the channel categories to form an incomplete dictionary. Therefore, compared with DLCM, the proposed algorithm achieves higher channel estimation accuracy; that is, its normalized mean square error is lower than that of DLCM at the same bit energy signal-to-noise ratio.
(2) SLAB adopts the nuclear norm method for the low-rank approximation of the channel. However, this method is not optimal. The nonconvex approximation algorithm used in this paper provides an optimal solution for low-rank approximation. Therefore, compared with SLAB, the main features of the channel can be extracted more accurately. (3) The CSSCE algorithm exploits only the sparse characteristics of the channel and ignores the low-rank characteristics of the channel in the angle domain. Therefore, compared with the algorithm proposed in this paper, it cannot accurately capture the underlying structural characteristics of the channel.
Figure 5 compares the NMSE performance of the four algorithms at the base station for different numbers of pilots. In our experiment, EbNo is set to 10 dB. The normalized mean square error decreases as the number of training pilots increases. Compared with the other two methods, SLAB and the algorithm proposed in this paper reduce the number of downlink training pilots. This is because both make use of the low-rank characteristics of the channel, which further constrains the sparse representation of the channel and reduces the effective dimension and training load of the downlink channel compared with DLCM and CSSCE. Since the optimal solution can be obtained through nonconvex low-rank approximation, the algorithm in this paper performs low-rank approximation better. Therefore, for the same NMSE performance, the number of training sequences required by the proposed algorithm is smaller than that required by SLAB.
As can be seen from Fig. 5, to achieve the same NMSE performance, the algorithm proposed in this paper requires the shortest downlink pilot sequence. In theory, the longer the pilot sequence is, the longer channel estimation takes and the more accurate the channel state information is. However, a longer pilot sequence also occupies more frequency band resources, resulting in lower frequency band utilization. Therefore, the frequency band utilization of the proposed algorithm is higher than that of the other three methods, and accurate channel estimation results can be obtained with relatively few pilot sequences.

Conclusion
For massive MIMO FDD systems, this paper proposes a channel estimation framework that exploits the joint low-rank and sparse characteristics of channels. In the proposed scheme, the base station sends a downlink pilot sequence to the mobile station. The mobile station quantizes the received downlink pilot signal and feeds it back over the uplink to the base station, where the channel estimation algorithm is performed. The innovation lies in using a non-convex algorithm instead of the traditional nuclear norm to approximate the rank of the wireless channel, and in learning the sparse coefficients through a non-preset dictionary. In this method, the construction of the dictionary is independent of the transmitted signal. The channel acquisition matrix is obtained from multiple channel impulse response measurement experiments. The key characteristics of the channel are then extracted by the DNN to form an incomplete dictionary, and the channel state is obtained by sparse representation. Simulation experiments verify the superiority of the proposed method in terms of NMSE and number of pilots.

Fig. 1 Block diagram of the proposed method

Fig. 2 Incomplete dictionary based on DNN

Table 1
Downlink received signal extraction algorithm based on non-convex low-rank approximation theory and the alternating direction (ALM) method
Algorithm 1: Estimation of downlink received pilot signals based on non-convex low-rank approximation
Input: the measurements Y, the matrices $A_{BS}$ and $A_{MS}$, pilot sequences X.
Output: Y, which represents a massive MIMO channel model.
Step 2. The DNN training algorithm is mainly divided into two stages: forward propagation and backpropagation.
Step 3. At the end of training, the trained DNN performs feature extraction on a massive MIMO channel to identify channel features. The dictionary learning algorithm proposed in this paper is shown in Table 2.

Table 2
Incomplete dictionary learning algorithm
Construct the incomplete dictionary based on the output of the DNN.