Home > Published Issues > 2025 > Volume 14, No. 1, January 2025 >
IJEETC 2025 Vol.14(1): 13-22
doi: 10.18178/ijeetc.14.1.13-22

Deep Deterministic Policy Gradient-Based Spectral Efficiency for Massive MIMO Communication System

Nasaruddin Nasaruddin*, Afzal Risky, Yunida Yunida, and Ramzi Adriman
Department of Electrical and Computer Engineering, Universitas Syiah Kuala, Banda Aceh, Indonesia
Email: nasaruddin@usk.ac.id (N.N.), afzal.r@mhs.usk.ac.id (A.R.), yunida@usk.ac.id (Y.Y.), ramzi.adriman@usk.ac.id (R.A.)
*Corresponding author

Manuscript received July 25, 2024; revised November 6, 2024; accepted December 5, 2024

Abstract—Massive MIMO is a new configuration of MIMO technology that uses many antennas up to the order of hundreds to serve tens or hundreds of User Equipment (UE) at the same time and frequency. Massive MIMO technology is one of the spatial diversity techniques used to increase the Spectral Efficiency (SE) of current Fifth-Generation (5G) communication systems. Massive MIMO has a high complexity in signal processing because it serves a large amount of user traffic at the same time. Therefore, this paper proposes using the Deep Deterministic Policy Gradient (DDPG), a deep learning method that significantly improves both SE and runtime performance, making it highly effective for large-scale wireless communication systems. In the simulation, we are modeling a massive MIMO system with multiple Access Points (APs) and User Equipment (UEs). We are training the channel using the proposed DDPG model. Then, we analyze each end-user path’s Signal-to-Interference plus Noise Ratio (SINR) and compare it with conventional massive MIMO (without deep learning). In addition, the complexity of the proposed DDPG model in terms of runtime is analyzed and compared with the Convex Optimization Algorithm (CVX). The simulation results indicate that the performance of the massive MIMO system is improved with the proposed DDPG model. It achieves an optimal and higher spectral efficiency (SE) of 85% compared to not using the DDPG method. Additionally, it achieves an average Signalto- Interference-plus-Noise Ratio (SINR) of 19.54 dB, while the conventional method only provides an average SINR of 15.32 dB. Furthermore, the proposed DDPG model has a lower complexity with a runtime ratio of 1:8000 compared to the CVX algorithm for the same number of epochs.

 
Index Terms—Convex optimization, Deep Deterministic Policy Gradient (DDPG), deep learning, massive MIMO, spectral efficiency

Cite: Nasaruddin Nasaruddin, Afzal Risky, Yunida Yunida, and Ramzi Adriman, "Deep Deterministic Policy Gradient-Based Spectral Efficiency for Massive MIMO Communication System," International Journal of Electrical and Electronic Engineering & Telecommunications, Vol. 14, No. 1, pp. 13-22, 2025. doi: 10.18178/ijeetc.14.1.13-22

Copyright © 2025 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.