基于学习的基于学习的联合自我优化方法，用于5G Hetnets中的模糊逻辑移交算法

论文标题

基于学习的基于学习的联合自我优化方法，用于5G Hetnets中的模糊逻辑移交算法

Reinforcement Learning-Based Joint Self-Optimisation Method for the Fuzzy Logic Handover Algorithm in 5G HetNets

论文作者

Liu, Qianyu, Kwong, Chiew Foong, Wei, Sun, Zhou, Sijia, Li, Lincan

论文摘要

5G异质网络（HETNET）可以通过在4G宏系统中部署大量的小型基站（BSS）来为用户提供更高的网络覆盖范围和系统容量。但是，小型BS的大规模部署显着增加了网络维护和优化的复杂性和工作量。当前的交换（HO）触发机制A3事件仅用于宏系统中的移动性管理。直接实现5G螺旋中的A3可能会降低用户移动性鲁棒性。这项研究是由自组织网络（SON）的概念（SON）的动机开发了一种自我优化的触发机制，可以实现自动化网络维护并增强5G-HETNET的用户移动性鲁棒性。所提出的方法将减法聚类和Q学习框架的优势集成到基于常规模糊逻辑的HO算法（FLHA）中。首先采用减法聚类来生成FLHA的会员功能（MF），以启用具有自配置功能的FLHA。随后，Q-学习被用来从环境中学习最佳HO策略，作为模糊规则，使FLHA具有自动化功能。具有儿子功能的FLHA还克服了常规FLHA的局限性，必须严重依赖于专业体验来设计。仿真结果表明，提出的自我优化的FLHA可以有效地为FLHA生成MF和模糊规则。通过与常规触发机制进行比较，所提出的方法可以将HO，Ping-Pong HO和HO失败率降低约91％，49％和97.5％，同时将网络吞吐量和延迟提高8％和35％。

5G heterogeneous networks (HetNets) can provide higher network coverage and system capacity to the user by deploying massive small base stations (BSs) within the 4G macro system. However, the large-scale deployment of small BSs significantly increases the complexity and workload of network maintenance and optimisation. The current handover (HO) triggering mechanism A3 event was designed only for mobility management in the macro system. Directly implementing A3 in 5G-HetNets may degrade the user mobility robustness. Motivated by the concept of self-organisation networks (SON), this study developed a self-optimised triggering mechanism to enable automated network maintenance and enhance user mobility robustness in 5G-HetNets. The proposed method integrates the advantages of subtractive clustering and Q-learning frameworks into the conventional fuzzy logic-based HO algorithm (FLHA). Subtractive clustering is first adopted to generate a membership function (MF) for the FLHA to enable FLHA with the self-configuration feature. Subsequently, Q-learning is utilised to learn the optimal HO policy from the environment as fuzzy rules that empower the FLHA with a self-optimisation function. The FLHA with SON functionality also overcomes the limitations of the conventional FLHA that must rely heavily on professional experience to design. The simulation results show that the proposed self-optimised FLHA can effectively generate MF and fuzzy rules for the FLHA. By comparing with conventional triggering mechanisms, the proposed approach can decrease the HO, ping-pong HO, and HO failure ratios by approximately 91%, 49%, and 97.5% while improving network throughput and latency by 8% and 35%, respectively.

下载PDF全文

下载文献需遵守相关版权规定

论文标题