Age of information-aware radio resource management in vehicular networks : a proactive deep reinforcement learning perspective

Chen, Xianfu; Wu, Celimuge; Chen, Tao; Zhang, Honggang; Liu, Zhi; Zhang, Yan; Bennis, Mehdi

Age of information-aware radio resource management in vehicular networks : a proactive deep reinforcement learning perspective

Chen, Xianfu; Wu, Celimuge; Chen, Tao; Zhang, Honggang; Liu, Zhi; Zhang, Yan; Bennis, Mehdi (2020-01-09)

Avaa tiedosto

nbnfi-fe202002034244.pdf (6.789Mt)

nbnfi-fe202002034244_meta.xml (38.36Kt)

nbnfi-fe202002034244_solr.xml (38.93Kt)

Lataukset:

URL:

https://doi.org/10.1109/TWC.2019.2963667

Chen, Xianfu

Wu, Celimuge

Chen, Tao

Zhang, Honggang

Liu, Zhi

Zhang, Yan

Bennis, Mehdi

Institute of Electrical and Electronics Engineers

09.01.2020

X. Chen et al., "Age of Information Aware Radio Resource Management in Vehicular Networks: A Proactive Deep Reinforcement Learning Perspective," in IEEE Transactions on Wireless Communications, vol. 19, no. 4, pp. 2268-2281, April 2020. doi: 10.1109/TWC.2019.2963667

https://rightsstatements.org/vocab/InC/1.0/
© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
https://rightsstatements.org/vocab/InC/1.0/

doi:https://doi.org/10.1109/TWC.2019.2963667

Näytä kaikki kuvailutiedot

Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi-fe202002034244

Tiivistelmä

Abstract

In this paper, we investigate the problem of age of information (AoI)-aware radio resource management for expected long-term performance optimization in a Manhattan grid vehicle-to-vehicle network. With the observation of global network state at each scheduling slot, the roadside unit (RSU) allocates the frequency bands and schedules packet transmissions for all vehicle user equipment-pairs (VUE-pairs). We model the stochastic decision-making procedure as a discrete-time single-agent Markov decision process (MDP). The technical challenges in solving the optimal control policy originate from high spatial mobility and temporally varying traffic information arrivals of the VUE-pairs. To make the problem solving tractable, we first decompose the original MDP into a series of per-VUE-pair MDPs. Then we propose a proactive algorithm based on long short-term memory and deep reinforcement learning techniques to address the partial observability and the curse of high dimensionality in local network state space faced by each VUE-pair. With the proposed algorithm, the RSU makes the optimal frequency band allocation and packet scheduling decision at each scheduling slot in a decentralized way in accordance with the partial observations of the global network state at the VUE-pairs. Numerical experiments validate the theoretical analysis and demonstrate the significant performance improvements from the proposed algorithm.

Kokoelmat

Avoin saatavuus [31928]