[PDF] Deep Learning based Joint Precoder Design and Antenna Selection for Partially Connected Hybrid Massive MIMO Systems

Abstract

Efficient resource allocation with hybrid precoder design is essential for massive MIMO systems operating in millimeter wave (mmW) domain. Owing to a higher energy efficiency and a lower complexity of a partially connected hybrid architecture, in this letter, we propose a joint deep convolutional neural network (CNN) based scheme for precoder design and antenna selection of a partially connected massive MIMO hybrid system. Precoder design and antenna selection is formulated as a regression and classification problem, respectively, for CNN. The channel data is fed to the first CNN network which outputs a subset of selected antennas having the optimal spectral efficiency. This subset is again fed to the second CNN to obtain the block diagonal precoder for a partially connected architecture. Simulation results verifies the superiority of CNN based approach over conventional iterative and alternating minimization (alt-min) algorithms. Moreover, the proposed scheme is computationally efficient and is not very sensitive to channel irregularities.

Full PDF

11 Deep Learning based Joint Precoder Design andAntenna Selection for Partially Connected HybridMassive MIMO Systems

Salman Khalid, Waqas bin Abbas, Farhan Khalid,

Member, IEEE

Abstract —Efﬁcient resource allocation with hybrid precoderdesign is essential for massive MIMO systems operating inmillimeter wave (mmW) domain. Owing to a higher energyefﬁciency and a lower complexity of a partially connected hybridarchitecture, in this letter, we propose a joint deep convolutionalneural network (CNN) based scheme for precoder design andantenna selection of a partially connected massive MIMO hybridsystem. Precoder design and antenna selection is formulated as aregression and classiﬁcation problem, respectively, for CNN. Thechannel data is fed to the ﬁrst CNN network which outputs asubset of selected antennas having the optimal spectral efﬁciency.This subset is again fed to the second CNN to obtain theblock diagonal precoder for a partially connected architecture.Simulation results veriﬁes the superiority of CNN based approachover conventional iterative and alternating minimization (alt-min)algorithms. Moreover, the proposed scheme is computationallyefﬁcient and is not very sensitive to channel irregularities.

Index terms —

Millimeter Wave Communication, MassiveMIMO, CNN, Partially Connected Hybrid PrecoderI. I

NTRODUCTION

Researchers are exploring the millimeter (mmW) and ter-ahertz domain to meet the ever increasing demand of widerbandwidth and higher data rates [1]. The propagation environ-ment at such higher frequencies is suffered by severe path loss,scattering and penetration losses. Massive MIMO architecturewith precoding/beamforming gain is utilized to compensatefor the propagation losses [2]. Researchers are tilted towardsthe hybrid (analog coupled with digital) beamforming archi-tecture since it provides the gains of digital processing withlower power consumption [3][4]. Existing literature proposesmany techniques for both fully connected (where each RFchain is connected to every antenna) and partially connected(where each RF chain is connected to a subset of antennas)hybrid architectures [3]-[7]. Authors in [5] have utilized theorthogonal matching pursuit (OMP), a greedy algorithm, forthe computation of analog and digital precoders for a fullyconnected hybrid architecture by utilizing the array responsesof the transmitter and the receiver. [6] proposes the iterativesuccessive interference cancellation (SIC) algorithm for com-putation of hybrid precoder for an energy efﬁcient partiallyconnected hybrid architecture. Authors in [7] have proposedthe manifold optimization (MO) and the phase extraction(PE) based alternating minimization (alt-min) techniques to

Salman Khalid (corresponding author, email: [email protected]),W. bin Abbas and Farhan Khalid are with the National University of Computerand Emerging Sciences (NUCES), Islamabad, Pakistan. compute the hybrid precoder for a fully connected architec-ture and the semi deﬁnite relaxation (SDR) based techniquefor a partially connected architecture. Alt-min approachesexplores the linkage between optimal and hybrid precodersto estimate the digital and analog precoder. The applicationof evolutionary algorithms for evaluation of hybrid precodersis demonstrated in [8][9]. For Massive MIMO systems, theresource allocation in terms of active antennas selection iscritical to ensure high energy efﬁciency. The spectral efﬁciencygain becomes constant beyond a certain number of antennas,hence to optimize the hardware, antennas experiencing goodchannel conditions should be selected. For antenna selectionproblem, authors in [10] and [11] have applied an iterativeevolutionary and estimation of distribution based algorithm.The above mentioned iterative, greedy and alt-min techniquesregarding antenna selection and precoding have drawbacks interms of computational time and achieving optimal solution interms of spectral efﬁciency.All above works on antenna selection and precoding givesthe sub optimal solution with considerable computationalcomplexity despite applying various optimization strategiesand selection criteria. Recently, the machine learning basedconvolutional neural network (CNN) methods have gainedinterest of researchers to solve the optimization problemsrelated to wireless communication. Well trained CNNs haveability to deduce features from a given set of observations withhigh efﬁciency and at a very low complexity as compared toconventional techniques. CNNs ﬁnd its applications for solvingproblems such as channel estimation [12], interference coor-dination, beam management [13] and analog beam selection[14]. Very recently precoder design problem is also formulatedand solved using CNN [15]-[19]. However, all the research forprecoder and combiner design is limited to a fully connectedhybrid architecture and do not consider the partially connectedhybrid architecture which has proven ability of being energyefﬁcient and bears reduced complexity [6].Keeping in view the low latency, low power consumptionand high energy efﬁciency requirements of future 5G and B5Gcommunication networks, in this letter, we propose a CNNbased joint antenna selection and precoder design for a par-tially connected hybrid structure. Two separate CNNs i.e., clas-siﬁcation and regression are trained for an antenna selectionand a precoding problem, respectively. Channel realizationsadded with noise are used to train the networks and estimatethe optimum antenna subset and precoder weights. The inputof the channel matrix is fed to the ﬁrst stage CNN deployed a r X i v : . [ c s . I T ] F e b for an antenna selection. The reduced subset is further fedto second stage CNN which outputs the optimum analogprecoder. The training of both CNNs is performed ofﬂine,hence, all the computational overhead for the data generationand training is not present during online prediction, where thehybrid precoders prediction and antennas classiﬁcation is doneby only feeding the channel matrix to the network.II. H YBRID M ASSIVE

MIMO S

YSTEM M ODEL

For joint estimation of precoder weights and antenna selec-tion, in this letter, we have considered a hybrid architecture in apartially connected conﬁguration i.e, each RF chain energizesonly a subset of antennas M = N T / N RFT . We have consideredthe base station (BS) equipped with N T transmit antennas and N RFT

RF chains to transmit N S data streams. The user isconsidered to be equipped with N R receive antennas wherethe selection is performed to determine N r best antennas.The analog and baseband precoder at the transmiter arerepresented as F RF ∈ C N T × N RFT and F BB ∈ C N RFT × N S respectively. Therefore, the signal transmitted through BS isrepresented as x = F RF F BB s , where s is the N S × transmitted symbol vector. The analog precoder F RF is ablock diagonal matrix realized by phase shifters having equalmagnitude with variable phases and satisfying the powerconstraint (cid:107) F RF F BB (cid:107) F ≤ N RFT . The received signal y atthe user having N R antennas is expressed as y = (cid:112) P av HF RF F BB s + n (1)The average received power is P av , n ( CN (0 , σ ) ) is i.i.dcomplex Gaussian noise. H denotes the N R × N T full arraychannel between the transmitter and receiver. The clusteredgeometric Saleh-Valenzuela model [3] representing the lowrank mmW channel is used in this letter. H = (cid:115)(cid:18) N T N R (cid:15)K (cid:19) K (cid:88) k =0 η l a R ( µ k ) a HT ( θ k ) (2)where K is the number of paths, (cid:15) is the pathloss, the pathgain linked with the k th path is η k , the corresponding spatialsignatures of the receiver and the transmitter are a R and a T ,respectively, and µ k and θ k are the angle of arrival (AoA)and the angle of departure (AoD) of the k th path, respectively.Finally, the full array (without any antenna selection) spectralefﬁciency is deﬁned as R = log( I N R + ρN s HF RF F BB F HBB F HRF H H ) (3)III. J OINT A NTENNA S ELECTION AND H YBRID P RECODER

Our ﬁrst goal is to determine a subset of N r best antennasout of total available N R receive antennas. After the antennaselection, the RF and baseband precoders are determined usingreduced dimensions. The joint solution of antenna selectionand precoding have to satisfy following condition max Q , F RF , F BB log( I N r + ρN s H sel F RF F BB F HBB F HRF H Hsel ) (4) Here H sel = QH is a reduced dimension ( N r × N T )channel matrix obtained by performing antenna selection. Q is a ( N r × N R ) selection matrix with entries either or representing the antenna index. A. Antenna Selection

For antenna selection, picking N r antennas out of N R yields Q A = (cid:0) N R N r (cid:1) possible combinations. Hence, selectinga subset of antennas becomes a CNN classiﬁcation problemwith Q A classes. Let q A th antenna subset conﬁguration with q A ∈ Q A = { , ...., Q A } is selected, than the received signalvector with q A th selected subset with H q A ( N r × N T ) beingcorresponding channel matrix is expressed as y q A = (cid:112) P av H q A F RF F BB s + n q A (5)Similarly, the spectral efﬁciency with q A th selected subsetis expressed as R = log( I N r + ρN s H q A F RF F BB F HBB F HRF H Hq A ) (6)Note that R is dependent on q A through H q A . By maximiz-ing R for all combinations of antenna selection conﬁgurations,the best antenna subset is expressed as q A = arg max q A ∈Q A R ( q A ) (7) B. Partially Connected Hybrid Precoder Design

Let H q A is the reduced dimensions channel matrix obtainedafter performing antenna selection than the hybrid precoderproblem is deﬁned as max F RF , F BB log( I N R + ρN s H q A F RF F BB F HBB F HRF H Hq A ) (8)We have considered a partially connected hybrid structurewhere each RF chain is connected to M = N T / N RFT numberof antennas. This implies that the structure of the RF precoder F RF must be a block diagonal with f RF i being the precodingvector for the i th RF chain only having M non zero elementsand is expressed as F RF =  f RF · · ·

00 f RF · · · ... ... . . . ... · · · f RF Nrf  (9)Hybrid precoder has to meet two constraints; C1: All non-zero elements of F RF must have the same amplitude and, C2:Meet the total power constraint (cid:107) F RF F BB (cid:107) F ≤ N RFT . For thecase of hybrid precoder, based on the proof [5], the Euclideandistance between the optimal unconstrained precoder and thehybrid precoder should be minimized. In other words, thehybrid precoder design problem is rewritten as arg min F RF , F BB (cid:107) F opt − F RF F BB (cid:107) F (10) Fig. 1. Joint CNN Architecture for Antenna Selection and Precoding

The optimal solution for the above mentioned optimizationproblem can be obtained using the singular value decom-position performed on the channel matrix, which can beused to generate the labels for output layer of regressionCNN network. Even with the memory-friendly approaches,it is computationally complex to enumerate over all possibleantenna selection subsets in real time. Also the optimal designof hybrid precoders requires iterations and extensive computa-tions. In order to address this issue, we have formulated a deepCNN based solution where the networks are trained ofﬂine andperform computations for antenna subset conﬁguration andoptimal precoder. Afterwards, the trained network can simplybe deployed as a classiﬁcation and regression network to selectantennas and estimate hybrid precoders.IV. CNN T

RAINING AND D ATASET G ENERATION

Our proposed deep neural network consists of two separateCNNs (Fig. 1) to perform antenna selection and precoding.The input to the ﬁrst CNN AS is the full dimension channelmatrix which selects the best antenna subset q A . The secondCNN RF accepts the reduced dimensions channel matrix withonly selected rows corresponding to the selected antennas andestimates the RF precoder at its output. For both architectures,the training data is generated using channel realizations whichare further assigned with corresponding output class/label forantenna selection and hyrid precoding.Let the input data X be a N R × N T × c = 3 channels.The ﬁrst channel of input is the absolute value of imperfectchannel matrix ˜ H whereas the real and imaginary part ofchannel matrix are stored in second and third channels, re-spectively. For data generation, N different channel matrixrealizations are generated. Afterwards for each realization, L noisy channel matrices are created with synthetic noise whichis added element wise. Hence, the total size of training inputdata becomes N R × N T × × N L . In order to obtain the outputlabels, for antenna selection CNN RF the best antenna subset isselected and afterwards for second CNN RF , F RF is obtainedby performing SVD operation on reduced dimension channelrealizations. Hence, the input/output pairs are established. Thetraining process of both CNNs is identical but with differentinput dimensions.The input sizes of CNN AS and CNN RF is N R × N T × N r × N T ×

3, respectively. Each CNN is composed of 14 layers.Input layer being the ﬁrst layer of corresponding input datasize. The convolutional layers are second, fourth and sixthwith 64 ﬁlters of dimensions 2 ×

2. Eighth and eleventh layers are fully connected layers with 512 nodes. The tenth andthirteenth layers are dropout layers with 50% probability. TheRELU activation function is utilized. Finally, the output layerof CNN AS is a classiﬁcation layer with softmax function tooutput the antenna subset class which gives maximum spectralefﬁciency and output layer of CNN RF is a regression layer ofdimensions N T × F RF . After estimating the non zeros elements of F RF , theblock diagonal structure is obtained by appending zeros atappropriate locations. CNN RF is used to predict the F RF andthe F BB is obtained using equivalent channel approach.V. N UMERICAL S IMULATIONS AND R ESULTS

In this section, we evaluate the performance of our proposedapproach. The performance of CNN based hybrid precoderis evaluated against state of the art SDR alt-min and SICalgorithms. Uniform planner array with N T = 36 or 144 and N R = 16 for the transmitter and receiver respectively, aregenerated. For antenna selection, the N r is kept as 8. TheRF chains at the transmitter N RFT and at the receiver N RFR are kept as 4. For CNNs, the training data is generated for N = L = 100 realizations. The proposed network is trained usingMATLAB as a simulation environment. SGD algorithm is usedfor network parameters with learning rate of 0.005 and mini-batch size 500 with 200 epochs. The cross entropy function isused as the loss function. During the training phase, 30% and70% of all data is divided into validation and training datasets,respectively. Finally the validation data is used to verify theperformance of the proposed architecture in the simulationsfor 100 Monte Carlo trials.Fig. 2 and Fig. 3 with N T as 36 and 144 respectively,shows the spectral efﬁciency for different algorithms withdeep antenna selection (DAS). The N TRF and N S are con-sidered equal and set to 4, the N R is set as 16 and DASCNN network selects N r = 8 antennas. After performingDAS, the hybrid precoders are determined. CNN RF is usedto determine the F RF whereas the F BB is obtained usingequivalent channel approach. The CNN based hybrid precoderis outperforming the SDR alt-min and SIC algorithms. TheCNN based hybrid precoder is efﬁciently predicting the RFprecoder which contributes towards maximization of spectralefﬁciency. To evaluate the antenna selection technique, theperformance of CNN based antenna selection is compared withrandom antenna selection (RAS) applied with precoding. It isevident that spectral efﬁciency using RAS algorithm is trailingbehind DAS algorithm irrespective of precoding technique. -20 -15 -10 -5 0 5 10 15 20 SNR, [dB] S pe c t r a l E ff i c i en cy [ b i t s / s / H z ] DAS + OPTRAS + OPTDAS + DHBRAS + DHBDAS + SDR Alt-MinRAS + SDR Alt-MinDAS + SICRAS + SIC

Fig. 2. Spectral Efﬁciency with N T =36, N R =16, N r =8, N TRF =4 -20 -15 -10 -5 0 5 10 15 20 SNR, [dB] S pe c t r a l E ff i c i en cy [ b i t s / s / H z ] DAS + OPTRAS + OPTDAS + DHBRAS + DHBDAS + SDR Alt-MinRAS + SDR Alt-MinDAS + SICRAS + SIC

Fig. 3. Spectral Efﬁciency with N T =144, N R =16, N r =8, N TRF =4 The computation time of CNN based precoder, SDR alt-minand SIC algorithms are also computed for N T = 144. The CNNbased precoder only required 0.01s, SDR alt-min requires 1.6sand SIC based precoder requires 0.02s for computations. TheCNN based hybrid precoder is outperforming all algorithmsin terms of spectral efﬁciency. Hence, both the computationalefﬁciency and spectral efﬁciency of the proposed CNN basedhybrid precoder is established.VI. C ONCLUSIONS

This letter presents the CNN based solution for joint hybridprecoder design of a partially connected mmW massive MIMO system with antenna selection. The proposed novel techniqueoutperforms the existing algorithms for a partially connectedhybrid precoder design both in terms of spectral efﬁciencyand computational complexity. Moreover the antenna selectionenables efﬁcient resource allocation for systems with largeantenna arrays. R

EFERENCES[1] K. V. Mishra, M. R. Bhavani Shankar, V. Koivunen, B. Ottersten, andS. A. Vorobyov, ”Toward millimeter wave joint radar-communications:A signal processing perspective,”

IEEE Signal Processing Magazine ,vol. 36, no. 5, pp. 100-114, 2019.[2] R. W. Heath, N. Gonzalez-Prelcic, S. Rangan, W. Roh, and A. M.Sayeed, ”An overview of signal processing techniques for millimeterwave MIMO systems,”

IEEE J. Sel. Topics Signal Process. , vol. 10,no. 3, pp. 436-453, Apr. 2016.[3] A. Alkhateeb, et al., ”Channel estimation and hybrid precoding formillimeter wave cellular systems,”

IEEE J. Sel. Topics Signal Process. ,vol. 8, no. 5, pp. 831-846, Oct. 2014.[4] A. Alkhateeb, et al., ”Limited Feedback Hybrid Precoding for Multi-User Millimeter Wave Systems,” in

IEEE Transactions on WirelessCommunications , vol. 14, no. 11, pp. 6481-6494, Nov. 2015[5] O. El Ayach, et al., ”Spatially sparse precoding in millimeter waveMIMO systems,”

IEEE Trans. Wireless Commun. , vol. 13, no. 3, pp.1499-1513, Mar. 2014.[6] X. Gao, et al., ”Energy-efﬁcient hybrid analog and digital precodingfor mmWave MIMO systems with large antenna arrays,”

IEEE J. Sel.Areas Commun. , vol. 34, no. 4, pp. 998-1009, Apr. 2016[7] X. Yu, J.-C. Shen, J. Zhang, and K. B. Letaief, ”Alternating mini-mization algorithms for hybrid precoding in millimeter wave MIMOsystems,”

IEEE J. Sel. Topics Signal Process. , vol. 10, no. 3, pp. 485-500, Apr. 2016.[8] O. Alluhaibi, Q.Z. Ahmed, J. Wang, H. Zhu, ”Hybrid digital-to-analog precoding design for mm-wave systems”,

IEEE InternationalConference on Communications (ICC) , Paris, 2017, pp. 1-6.[9] Khalid, S.; Abbas, W.B.; Kim, H.S.; Niaz, M.T. ”Evolutionary Algo-rithm Based Capacity Maximization of 5G/B5G Hybrid Pre-CodingSystems”.

Sensors 2020 , 20, 5338.[10] Salman Khalid, Rashid Mehmood, Waqas bin Abbas, Farhan Khalid,Muhammad Naeem, ”Joint transmit antenna selection and precoding formillimeter wave massive MIMO systems”,

Physical Communication ,vol 42, Oct. 2020.[11] Taneja, A., Saluja, N., ”Linear Precoding with User and TransmitAntenna Selection”’.

Wireless Pers Commun , 109, 1631-1644 (2019)[12] H. Ye, G. Y. Li, and B. Juang, ”Power of deep learning for channelestimation and signal detection in OFDM systems,”

IEEE WirelessCommunications Letters , vol. 7, no. 1, pp. 114-117, 2018.[13] P. Zhou, X. Fang, X. Wang, Y. Long, R. He, and X. Han, ”DeepLearning-Based Beam Management and Interference Coordination inDense mmWave Networks,”

IEEE Transactions on Vehicular Technol-ogy , vol. 68, pp. 592-603, Jan 2019.[14] Y. Long, Z. Chen, J. Fang, and C. Tellambura, ”Data-driven-based ana-log beam selection for hybrid beamforming under mm-Wave channels,”

IEEE Journal of Selected Topics in Signal Processing , vol. 12, no. 2,pp. 340-352, 2018.[15] X. Li and A. Alkhateeb, ”Deep Learning for Direct Hybrid Precoding inMillimeter Wave Massive MIMO Systems,” 53rd Asilomar Conferenceon Signals, Systems, and Computers, Paciﬁc Grove, CA, USA, 2019,pp. 800-805[16] H. Huang, Y. Song, J. Yang, G. Gui, and F. Adachi, ”Deep-learningbased millimeter-wave massive MIMO for hybrid precoding,”

IEEETrans. Veh. Technol. , vol. 68, no. 3, pp. 3027-3032, Mar. 2019.[17] A. M. Elbir, ”CNN-Based Precoder and Combiner Design in mmWaveMIMO Systems,” in

IEEE Communications Letters , vol. 23, no. 7, pp.1240-1243, July 2019[18] A. M. Elbir and K. V. Mishra, ”Joint Antenna Selection and HybridBeamformer Design Using Unquantized and Quantized Deep LearningNetworks,” in

IEEE Transactions on Wireless Communications , vol.19, no. 3, pp. 1677-1688, March 2020[19] X. Bao, W. Feng, J. Zheng and J. Li, ”Deep CNN and Equivalent Chan-nel Based Hybrid Precoding for mmWave Massive MIMO Systems,”in