Paper Title

DRL-Based QoS-Aware Resource Allocation Scheme for Coexistence of Licensed and Unlicensed Users in LTE and Beyond

Authors

Boroujerdi, Mahdi Nouri; Akbari, Mohammad; Joda, Roghayeh; Maddah-Ali, Mohammad Ali; Khalaj, Babak Hossein

Abstract

In this paper, we employ deep reinforcement learning to develop a novel radio resource allocation and packet scheduling scheme for different Quality of Service (QoS) requirements, applicable to LTE-Advanced and 5G networks. In addition, given the scarcity of spectrum in sub-6 GHz bands, the proposed algorithm dynamically allocates resource blocks (RBs) to licensed users in a way that largely preserves the continuity of unallocated RBs. This improves the efficiency of communication among unlicensed entities by increasing the chance of uninterrupted communication and reducing coordination overhead. The optimization problem is formulated as a Markov Decision Process (MDP) that observes the entire queue of demands, where failing to meet QoS constraints penalizes the objective with a multiplicative factor. Furthermore, a notion of continuity for unallocated resources is included as an additive term in the objective function. Considering the variations in both channel coefficients and user requests, we utilize a deep reinforcement learning algorithm as an online and numerically efficient approach to solve the MDP. Numerical results show that the proposed method achieves higher average spectral efficiency, while accounting for delay budget and packet loss ratio, compared to the conventional greedy min-delay and max-throughput schemes, in which a fixed part of the spectrum is forced to be vacant for unlicensed entities.
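The abstract describes the shape of the objective (a multiplicative penalty for unmet QoS constraints plus an additive continuity term) but gives no exact formula. The following is a minimal Python sketch of one reward with that shape, not the authors' definition. Everything in it is an assumption made for illustration: the function names, the exponential form of the penalty, the `penalty` and `continuity_weight` values, and the use of the longest run of free RBs as the continuity measure.

```python
from typing import Sequence


def longest_free_run(allocated: Sequence[bool]) -> int:
    """Length of the longest run of consecutive unallocated RBs.

    Hypothetical continuity measure; the abstract does not specify one.
    """
    best = run = 0
    for used in allocated:
        run = 0 if used else run + 1
        best = max(best, run)
    return best


def reward(
    spectral_efficiency: float,
    qos_violations: int,
    allocated: Sequence[bool],
    penalty: float = 0.5,            # assumed per-violation multiplicative factor
    continuity_weight: float = 0.1,  # assumed weight of the additive term
) -> float:
    """Illustrative per-step reward for one RB allocation.

    Multiplicative part: each unmet QoS constraint scales the spectral
    efficiency by `penalty`. Additive part: the fraction of the band
    covered by the longest contiguous block of unallocated RBs.
    """
    if not allocated:
        raise ValueError("allocated must contain at least one RB")
    qos_factor = penalty ** qos_violations
    continuity = longest_free_run(allocated) / len(allocated)
    return spectral_efficiency * qos_factor + continuity_weight * continuity
```

Under these assumptions, two allocations with the same spectral efficiency and no QoS violations differ only in the continuity term, so an agent maximizing this reward would prefer the allocation that leaves the free RBs in one contiguous block.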
