Deep Reinforcement Learning Based Throughput Maximization Scheme for D2D Users Underlaying NOMA-Enabled Cellular Network

Vishnoi, Vineet; Malik, Praveen Kumar; Budhiraja, Ishan; Yadav, Ashima

doi:10.1007/978-3-030-95502-1_25

Vineet Vishnoi¹⁰,
Praveen Kumar Malik¹¹,
Ishan Budhiraja¹² &
…
Ashima Yadav¹²

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1528))

Included in the following conference series:

International Advanced Computing Conference

696 Accesses
4 Citations

Abstract

Device-to-Device (D2D) communication is a potential technology that efficiently reuses spectrum resources with CMUs in a fifth-generation (5G) underlay and even beyond the network. It improves network capacity and spectral efficiency at the cost of co-channel interference. Moreover, massive connectivity has not been fully exploited for efficient spectral efficiency usage in the existing solutions. To resolve the aforementioned issues, we combine non-orthogonal multiple access (NOMA) approaches with cellular mobile users (CMUs) in order to improve their throughput while preserving the signal-to-interference noise ratio (SINR) offered by CMUs and D2D mobile pairs (DMPs). The problem of power allocation is formulated as mixed-integer non-linear programming, which is then transformed to machine learning using the markov decision process (MDP). Then, a deep reinforcement learning (DRL) approach is proposed for solving the continuous optimisation problem in a centralised fashion. Furthermore, to achieve better performance and a faster convergence rate, the higher proximal policy optimization (PPO) scheme is employed. Numerical results reveal that the proposed algorithm outperformed state-of-the-art schemes in terms of throughput.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A deep reinforcement learning-based D2D spectrum allocation underlaying a cellular network

Article 30 May 2024

Multi-agent reinforcement learning based joint uplink–downlink subcarrier assignment and power allocation for D2D underlay networks

Article 09 November 2022

Resource Allocation for Multi-service NOMA System Based on Deep Reinforcement Learning

References

Shafi, M., et al.: 5G: a tutorial overview of standards, trials, challenges, deployment, and practice. IEEE J. Sel. Areas Commun. 35(6), 1201–1221 (2017)
Article Google Scholar
Budhiraja, I., et al.: A systematic review on NOMA variants for 5G and beyond. IEEE Access 9, 85573–85644 (2021)
Article Google Scholar
Budhiraja, I., et al.: ISHU: interference reduction scheme for D2D mobile groups using uplink NOMA. IEEE Trans. Mob. Comput. (2021)
Google Scholar
Goodfellow, I., et al.: Deep Learning. MIT Press, Cambridge (2016)
Google Scholar
Luong, N.C., et al.: Applications of deep reinforcement learning in communications and networking: a survey. IEEE Commun. Surv. Tutor. 21(4), 3133–3174 (2019)
Article Google Scholar
Budhiraja, I., Kumar, N., Tyagi, S., Tanwar, S., Han, Z.: An energy efficient scheme for WPCN-NOMA based device-to-device communication. IEEE Trans. Veh. Technol. 70(11), 11935–11948 (2021)
Google Scholar
Nguyen, K.K., et al.: Non-cooperative energy efficient power allocation game in D2D communication: a multi-agent deep reinforcement learning approach. IEEE Access 7, 100480–100490 (2019)
Article Google Scholar
Bi, Z., et al.: Deep reinforcement learning based power allocation for D2D network. In: 2020 IEEE 91st Vehicular Technology Conference (VTC2020-Spring), pp. 1–5, March 2020
Google Scholar
Ji, Z., et al.: Power optimization in device-to-device communications: a deep reinforcement learning approach with dynamic reward. IEEE Wirel. Commun. Lett. 10(3), 508–511 (2021)
Article Google Scholar
Zhang, T., et al.: Energy-efficient mode selection and resource allocation for D2D-enabled heterogeneous networks: a deep reinforcement learning approach. IEEE Trans. Wirel. Commun. 20(2), 1175–1187 (2021)
Article Google Scholar
Chen, M., et al.: Continuous incentive mechanism for D2D content sharing: a deep reinforcement learning approach. In: 2020 IEEE International Conference on Communications Workshops (ICC Workshops), pp. 1–6, June 2020
Google Scholar
Tan, J., et al.: Deep reinforcement learning for joint channel selection and power control in D2D networks. IEEE Trans. Wirel. Commun. 20(2), 1363–1378 (2021)
Google Scholar
Budhiraja, I., et al.: Deep-reinforcement-learning-based proportional fair scheduling control scheme for underlay D2D communication. IEEE Internet Things J. 8(5), 3143–3156 (2021)
Article Google Scholar
Tang, J., et al.: Energy minimization in D2D-assisted cache-enabled Internet of Things: a deep reinforcement learning approach. IEEE Trans. Ind. Inf. 16(8), 5412–5423 (2020)
Article Google Scholar
Mnih, V., et al.: Asynchronous methods for deep reinforcement learning. In: International Conference on Machine Learning, pp. 1928–1937, June 2016
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electronics and Electrical Engineering, Lovely Professional University, Phagwara, Punjab, India
Vineet Vishnoi
Lovely Professional University, Phagwara, Punjab, India
Praveen Kumar Malik
Bennett University, Greater Noida, Uttar Pradesh, India
Ishan Budhiraja & Ashima Yadav

Authors

Vineet Vishnoi
View author publications
You can also search for this author in PubMed Google Scholar
Praveen Kumar Malik
View author publications
You can also search for this author in PubMed Google Scholar
Ishan Budhiraja
View author publications
You can also search for this author in PubMed Google Scholar
Ashima Yadav
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vineet Vishnoi .

Editor information

Editors and Affiliations

Bennett University, Greater Noida, India
Deepak Garg
Missouri University of Science and Technology, Rolla, CA, USA
Sarangapani Jagannathan
Model Institute of Engineering and Technology, Kot Bhalwal, Jammu and Kashmir, India
Ankur Gupta
University of Malta, Msida, Malta
Lalit Garg
Bennett University, Greater Noida, India
Suneet Gupta

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vishnoi, V., Malik, P.K., Budhiraja, I., Yadav, A. (2022). Deep Reinforcement Learning Based Throughput Maximization Scheme for D2D Users Underlaying NOMA-Enabled Cellular Network. In: Garg, D., Jagannathan, S., Gupta, A., Garg, L., Gupta, S. (eds) Advanced Computing. IACC 2021. Communications in Computer and Information Science, vol 1528. Springer, Cham. https://doi.org/10.1007/978-3-030-95502-1_25

Download citation

DOI: https://doi.org/10.1007/978-3-030-95502-1_25
Published: 08 February 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-95501-4
Online ISBN: 978-3-030-95502-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Deep Reinforcement Learning Based Throughput Maximization Scheme for D2D Users Underlaying NOMA-Enabled Cellular Network

Abstract

Access this chapter

Similar content being viewed by others

A deep reinforcement learning-based D2D spectrum allocation underlaying a cellular network

Multi-agent reinforcement learning based joint uplink–downlink subcarrier assignment and power allocation for D2D underlay networks

Resource Allocation for Multi-service NOMA System Based on Deep Reinforcement Learning

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Deep Reinforcement Learning Based Throughput Maximization Scheme for D2D Users Underlaying NOMA-Enabled Cellular Network

Abstract

Access this chapter

Similar content being viewed by others

A deep reinforcement learning-based D2D spectrum allocation underlaying a cellular network

Multi-agent reinforcement learning based joint uplink–downlink subcarrier assignment and power allocation for D2D underlay networks

Resource Allocation for Multi-service NOMA System Based on Deep Reinforcement Learning

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation