Deep reinforcement learning for practical phase shift optimization in RIS-assisted networks over short packet communications

Hashemi, Ramin; Ali, Samad; Taghavi, Ehsan Moeen; Mahmood, Nurul Huda; Latva-aho, Matti

Deep reinforcement learning for practical phase shift optimization in RIS-assisted networks over short packet communications

Hashemi, Ramin; Ali, Samad; Taghavi, Ehsan Moeen; Mahmood, Nurul Huda; Latva-aho, Matti (2022-07-08)

Avaa tiedosto

nbnfi-fe2022082956575.pdf (733.1Kt)

nbnfi-fe2022082956575_meta.xml (41.38Kt)

nbnfi-fe2022082956575_solr.xml (36.96Kt)

Lataukset:

URL:

10.1109/EuCNC/6GSummit54941.2022.9815804

Hashemi, Ramin

Ali, Samad

Taghavi, Ehsan Moeen

Mahmood, Nurul Huda

Latva-aho, Matti

Institute of Electrical and Electronics Engineers

08.07.2022

R. Hashemi, S. Ali, E. M. Taghavi, N. H. Mahmood and M. Latva-Aho, "Deep Reinforcement Learning for Practical Phase Shift Optimization in RIS-assisted Networks over Short Packet Communications," 2022 Joint European Conference on Networks and Communications & 6G Summit (EuCNC/6G Summit), 2022, pp. 518-523, doi: 10.1109/EuCNC/6GSummit54941.2022.9815804.

https://rightsstatements.org/vocab/InC/1.0/
© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
https://rightsstatements.org/vocab/InC/1.0/

doi:https://doi.org/10.1109/eucnc/6gsummit54941.2022.9815804

Näytä kaikki kuvailutiedot

Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi-fe2022082956575

Tiivistelmä

Abstract

We study the practical phase shift design in a non-ideal reconfigurable intelligent surface (RIS)-aided ultra-reliable and low-latency communication (URLLC) system under finite blocklength (FBL) regime by leveraging a novel deep reinforcement learning (DRL) algorithm named as twin-delayed deep deterministic policy gradient (TD3). First, assuming industrial automation system with multiple actuators, the signal-to-interference-plus-noise ratio (SINR) and achievable rate in FBL regime are identified for each actuator in terms of the phase shift configuration matrix at the RIS. The channel state information (CSI) variations due to feedback delay are also considered that result in channel coefficients’ obsolescence. Then, the problem framework is proposed where the objective is to maximize the total achievable FBL rate in all ACs, subject to the practical phase shift constraint at the RIS elements. Since the problem is intractable to solve using conventional optimization methods, we resort to employing an actor-critic policy gradient DRL algorithm based on TD3, which relies on interacting RIS with FA environment by taking actions which are the phase shifts at the RIS elements, to maximize the expected observed reward, which is defined as the total FBL rate. The numerical results show that optimizing the practical phase shifts in the RIS via the proposed TD3 method is highly beneficial to improve the network total FBL rate in comparison with typical DRL methods.

Kokoelmat

Avoin saatavuus [32038]