Average AoI minimization in an HARQ-based status update system under random arrivals

Sadeghi Vilni, Saeid; Moltafet, Mohammad; Leinonen, Markus; Codreanu, Marian

Sadeghi Vilni, Saeid; Moltafet, Mohammad; Leinonen, Markus; Codreanu, Marian (2022-12-13)

URL:

https://doi.org/10.1109/IoTaIS56727.2022.9975894

Sadeghi Vilni, Saeid

Moltafet, Mohammad

Leinonen, Markus

Codreanu, Marian

Institute of Electrical and Electronics Engineers

13.12.2022

S. S. Vilni, M. Moltafet, M. Leinonen and M. Codreanu, "Average AoI Minimization in an HARQ-based Status Update System under Random Arrivals," 2022 IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS), Bali, Indonesia, 2022, pp. 160-165, doi: 10.1109/IoTaIS56727.2022.9975894.

https://rightsstatements.org/vocab/InC/1.0/
© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

doi:https://doi.org/10.1109/IoTaIS56727.2022.9975894


The permanent address of the publication is
https://urn.fi/URN:NBN:fi-fe2023021026823

Abstract

We consider a status update system consisting of one source, one buffer-aided transmitter, and one receiver. The source randomly generates status update packets, and the transmitter sends the packets to the receiver over an unreliable channel using a hybrid automatic repeat request (HARQ) protocol. The system holds two packets: one packet in the buffer, which stores the last generated packet, and one packet currently under service in the transmitter. At each time slot, the transmitter decides whether to stay idle, transmit the last generated packet, or retransmit the packet currently under service. We aim to find the optimal action at each slot to minimize the average age of information (AoI) of the source under a constraint on the average number of transmissions. We model the problem as a constrained Markov decision process (CMDP) problem and solve it for both known and unknown environments as follows. First, we use the Lagrangian approach to transform the CMDP problem into an MDP problem, which is solved with relative value iteration (RVI) for the known environment and with a deep Q-learning (DQL) algorithm for the unknown environment. Second, we use the Lyapunov method to transform the CMDP problem into an MDP problem, which is solved with a DQL algorithm for the unknown environment. Simulation results assess the effectiveness of the proposed approaches.
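
The Lagrangian step described in the abstract can be illustrated with a small sketch. The code below is not the authors' model or implementation: it is a toy version built only from the abstract, and every numerical choice is an assumption made for illustration (the age cap N, the attempt cap R, the arrival probability LAM, and the decoding probability in p_success). For a fixed multiplier mu, the per-slot cost is taken to be the receiver AoI plus mu for every transmission, and relative value iteration is run on the resulting unconstrained MDP.

```python
import itertools

N, R, LAM = 5, 2, 0.5      # age cap, max tracked attempts, arrival probability (all assumed)
IDLE, NEW, RETX = 0, 1, 2  # the three actions named in the abstract


def p_success(attempt):
    """Assumed HARQ decoding probability; it grows with each retransmission."""
    return 1.0 - 0.5 ** attempt


def cap(age):
    return min(age, N)


def actions(state):
    """NEW needs a buffered packet, RETX needs a packet under service."""
    _, b, s, _ = state
    return [IDLE] + ([NEW] if b else []) + ([RETX] if s else [])


def transitions(state, action):
    """Return [(probability, next_state)] for one slot of the toy model."""
    # d: receiver AoI, b: buffer age (0 = empty), s: in-service age (0 = none), r: attempts
    d, b, s, r = state
    if action == NEW:       # the buffered packet moves into service
        s, b, attempt = b, 0, 1
    elif action == RETX:
        attempt = min(r + 1, R)
    if action == IDLE:
        channel = [(1.0, cap(d + 1), cap(s + 1) if s else 0, r)]
    else:
        ok = p_success(attempt)
        channel = [(ok, cap(min(d, s) + 1), 0, 0),              # delivered
                   (1.0 - ok, cap(d + 1), cap(s + 1), attempt)]  # decoding failed
    aged_b = cap(b + 1) if b else 0
    out = []
    for p, d2, s2, r2 in channel:
        out.append((p * LAM, (d2, 1, s2, r2)))              # arrival overwrites the buffer
        out.append((p * (1.0 - LAM), (d2, aged_b, s2, r2)))  # no arrival
    return out


STATES = list(itertools.product(range(1, N + 1), range(N + 1),
                                range(N + 1), range(R + 1)))


def rvi(mu, tol=1e-6, max_iter=5000):
    """Relative value iteration for the cost AoI + mu * 1{transmission}."""
    model = {x: {a: transitions(x, a) for a in actions(x)} for x in STATES}
    h = dict.fromkeys(STATES, 0.0)
    ref = STATES[0]  # reference state used to normalize the relative values
    for _ in range(max_iter):
        q = {x: {a: x[0] + mu * (a != IDLE) + sum(p * h[y] for p, y in nxt)
                 for a, nxt in model[x].items()} for x in STATES}
        th = {x: min(q[x].values()) for x in STATES}
        diff = [th[x] - h[x] for x in STATES]
        h = {x: th[x] - th[ref] for x in STATES}
        if max(diff) - min(diff) < tol:  # span of the Bellman residual
            break
    gain = 0.5 * (max(diff) + min(diff))  # estimate of the average Lagrangian cost
    policy = {x: min(q[x], key=q[x].get) for x in STATES}
    return gain, policy
```

In a sketch like this, rvi(0.0) corresponds to ignoring the transmission constraint, while a large mu makes transmissions expensive and pushes the policy toward idling. Solving the constrained problem would additionally require searching over mu (for example by bisection) until the average number of transmissions meets the budget; that outer loop is omitted here.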

Collections

Open access [31987]