D2D assisted Q-learning random access for NOMA-based MTC networks

da Silva, Matheus V.; Montejo-Sánchez, Samuel; Souza, Richard Demo; Alves, Hirley; Abrão, Taufik

D2D assisted Q-learning random access for NOMA-based MTC networks

da Silva, Matheus V.; Montejo-Sánchez, Samuel; Souza, Richard Demo; Alves, Hirley; Abrão, Taufik (2022-03-16)

Avaa tiedosto

nbnfi-fe2022042630431.pdf (1.783Mt)

nbnfi-fe2022042630431_meta.xml (37.94Kt)

nbnfi-fe2022042630431_solr.xml (38.52Kt)

Lataukset:

URL:

https://doi.org/10.1109/ACCESS.2022.3160156

da Silva, Matheus V.

Montejo-Sánchez, Samuel

Souza, Richard Demo

Alves, Hirley

Abrão, Taufik

Institute of Electrical and Electronics Engineers

16.03.2022

M. V. da Silva, S. Montejo-Sánchez, R. D. Souza, H. Alves and T. Abrão, "D2D Assisted Q-Learning Random Access for NOMA-Based MTC Networks," in IEEE Access, vol. 10, pp. 30694-30706, 2022, doi: 10.1109/ACCESS.2022.3160156

https://creativecommons.org/licenses/by/4.0/
© The Author(s) 2022. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
https://creativecommons.org/licenses/by/4.0/

doi:https://doi.org/10.1109/ACCESS.2022.3160156

Näytä kaikki kuvailutiedot

Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi-fe2022042630431

Tiivistelmä

Abstract

Machine-type communications (MTC) should account for half the connections to the internet by 2030. The use case massive MTC (mMTC) allows for applications to connect a massive number of low-power and low-complexity devices, leading to challenges in resource allocation. Not only that, mMTC networks suffer under rigid random access schemes due to mMTC ultra-dense nature resulting in poor performance. In this sense, this paper proposes a Q -Learning-based random access method for massive machine-type communications, with device clustering and non-orthogonal multiple access (NOMA). The traditional NOMA implementation increases spectral efficiency, but at the same time, demands a larger Q -Table, thus slowing down convergence, which is known to be a highly detrimental effect on massive networks. We use pre-clustering through short-range device-to-device technology to mitigate this drawback, allowing devices to operate with a smaller Q -Table. Furthermore, the previous selection of partner devices allows us to implement a full-feedback-based reward mechanism so that clusters avoid time slots already successfully allocated. Additionally, to cope with the negative impact of system overload, we propose an adaptive frame size algorithm to run in the base station (BS). It allows adjusting the frame size to the network load, preventing idle slots in an underloaded scenario, and providing extra slots when the network is overloaded. The results show the great benefits in terms of throughput of the proposed method. In addition, the impact of the use of clustering and the size of the clusters, as well as the frame size adaptation, are analyzed.

Kokoelmat

Avoin saatavuus [31872]

Ellei muuten mainita, aineiston lisenssi on https://creativecommons.org/licenses/by/4.0/