论文标题
动态运行通过约束学习,用于以服务为导向的车辆网络
Dynamic RAN Slicing for Service-Oriented Vehicular Networks via Constrained Learning
论文作者
论文摘要
在本文中,我们研究了具有不同服务质量(QoS)要求的车辆互联网服务(IOV)服务的无线电访问网络(RAN)切片问题,其中在共同的路边网络基础架构上构建了多个逻辑分离的切片。动态RAN切片框架被提交给动态分配无线电频谱和计算资源,并为切片分配计算工作负载。为了获得最佳的切片政策,以适应车辆交通密度的时空动力学,我们首先制定了一个约束的切片问题,其目标是最大程度地减少长期系统成本。由于决策之间复杂的耦合约束,传统的加强学习(RL)算法无法直接解决这个问题。因此,我们将问题分解为资源分配子问题和工作负载分布子问题,并提出了一种两层约束的RL算法,命名为资源分配和工作负载分布(RAWS)来解决它们。具体而言,外层首先通过RL算法进行资源分配决策,然后内层通过优化子例程来使工作负载分配决策。广泛的痕量驱动模拟表明,与基准相比,RAWS有效地降低了系统成本,同时满足了QoS要求的高可能性。
In this paper, we investigate a radio access network (RAN) slicing problem for Internet of vehicles (IoV) services with different quality of service (QoS) requirements, in which multiple logically-isolated slices are constructed on a common roadside network infrastructure. A dynamic RAN slicing framework is presented to dynamically allocate radio spectrum and computing resource, and distribute computation workloads for the slices. To obtain an optimal RAN slicing policy for accommodating the spatial-temporal dynamics of vehicle traffic density, we first formulate a constrained RAN slicing problem with the objective to minimize long-term system cost. This problem cannot be directly solved by traditional reinforcement learning (RL) algorithms due to complicated coupled constraints among decisions. Therefore, we decouple the problem into a resource allocation subproblem and a workload distribution subproblem, and propose a two-layer constrained RL algorithm, named Resource Allocation and Workload diStribution (RAWS) to solve them. Specifically, an outer layer first makes the resource allocation decision via an RL algorithm, and then an inner layer makes the workload distribution decision via an optimization subroutine. Extensive trace-driven simulations show that the RAWS effectively reduces the system cost while satisfying QoS requirements with a high probability, as compared with benchmarks.