Millimeter wave communications with an intelligent reflector: performance optimization and distributional reinforcement learning

Zhang, Qianqian; Saad, Walid; Bennis, Mehdi

Zhang, Qianqian; Saad, Walid; Bennis, Mehdi (2021-09-03)

URL:

https://doi.org/10.1109/twc.2021.3107520

Institute of Electrical and Electronics Engineers

03.09.2021

Q. Zhang, W. Saad and M. Bennis, "Millimeter Wave Communications With an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning," in IEEE Transactions on Wireless Communications, vol. 21, no. 3, pp. 1836-1850, March 2022, doi: 10.1109/TWC.2021.3107520

https://rightsstatements.org/vocab/InC/1.0/
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

The permanent address of the publication is
https://urn.fi/URN:NBN:fi-fe2022083156895

Abstract

This paper proposes a novel framework to optimize the downlink multi-user communication of a millimeter wave base station (BS) that is assisted by a reconfigurable intelligent reflector (IR). In particular, a channel estimation approach is developed to measure the channel state information (CSI) in real time. First, for a perfect CSI scenario, the precoding transmission of the BS and the reflection coefficient of the IR are jointly optimized, via an iterative approach, so as to maximize the sum of downlink rates towards multiple users. Next, in the imperfect CSI scenario, a distributional reinforcement learning (DRL) approach is proposed to learn the optimal IR reflection and maximize the expectation of downlink capacity. To model the probability distribution of the transmission rate, a learning algorithm based on quantile regression (QR) is developed, and the proposed QR-DRL method is proved to converge to a stable distribution of the downlink transmission rate. Simulation results show that, in the error-free CSI scenario, the proposed approach yields an increase of over 30% in the downlink sum-rate compared with a fixed IR reflection scheme, and a 2-fold increase compared with a direct transmission scheme. Simulation results also show that deploying more IR elements significantly improves the downlink sum-rate. However, as the number of IR elements increases, more time is required for channel estimation, and the IR-aided transmission rate grows more slowly. Furthermore, under limited knowledge of CSI, simulation results show that the proposed QR-DRL method, which learns a full distribution of the downlink rate, yields better prediction accuracy and improves the downlink rate by 10% for online deployments, compared with a Q-learning baseline.
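The abstract does not give the QR-DRL update itself, so the following is only a minimal sketch of the general quantile regression idea it names: a small set of quantile estimates is nudged toward observed rates with a stochastic subgradient step on the pinball (quantile) loss, and the expected rate is approximated by averaging those estimates. The function names, the midpoint quantile fractions, the learning rate, and the Gaussian "rate" samples are all assumptions made for illustration, not the authors' algorithm or data.

```python
import random


def quantile_midpoints(n):
    """Quantile fractions tau_i = (2i + 1) / (2n) for i = 0..n-1 (an assumed choice)."""
    return [(2 * i + 1) / (2 * n) for i in range(n)]


def qr_update(quantiles, taus, sample, lr):
    """One stochastic subgradient step on the pinball (quantile) loss.

    Each estimate theta_i moves up by lr * tau_i when the sample lies above it,
    and down by lr * (1 - tau_i) otherwise.
    """
    return [
        theta + lr * (tau if sample > theta else tau - 1.0)
        for theta, tau in zip(quantiles, taus)
    ]


def expected_value(quantiles):
    """Mean of the quantile estimates, used here to approximate the expected rate."""
    return sum(quantiles) / len(quantiles)


if __name__ == "__main__":
    random.seed(0)
    taus = quantile_midpoints(5)
    theta = [0.0] * 5
    for _ in range(20000):
        # Hypothetical noisy rate observation in arbitrary units, not data from the paper.
        rate = random.gauss(10.0, 2.0)
        theta = qr_update(theta, taus, rate, lr=0.01)
    print("quantile estimates:", theta)
    print("approximate expected rate:", expected_value(theta))
```

In a full distributional reinforcement learning setting, one such set of quantile estimates would typically be kept per state-action pair and the sample would be a bootstrapped return instead of a direct observation; the sketch leaves that out to keep the update rule visible.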
