用于以用户为中心的不含单元的MIMO网络的基于POMDP的交接

论文标题

用于以用户为中心的不含单元的MIMO网络的基于POMDP的交接

POMDP-based Handoffs for User-Centric Cell-Free MIMO Networks

论文作者

Ammar, Hussein A., Adve, Raviraj, Shahbazpanahi, Shahram, Boudreau, Gary, Srinivas, Kothapalli Venkata

论文摘要

我们建议通过部分可观察到的马尔可夫决策过程（POMDP）在不含用户中心的大型MIMO网络中控制交接（HOS），其状态空间代表大规模淡出的离散版本（LSF）和代表用户与接入点的关联决策的大型动作空间的离散版本。我们提出的公式解释了渠道状态的时间进化和部分可观察性。这使我们可以在执行HO决策时考虑将来的奖励，从而获得强大的HO政策。为了减轻解决POMDP的高复杂性，我们通过将POMDP公式分解为子问题，每个人都单独解决，我们遵循了分裂和诱导的方法。然后，最佳解决子问题的策略和候选访问点群集用于在特定时间范围内执行HOS。我们通过确定何时使用HO策略来控制HOS的数量。我们的仿真结果表明，与基于时间触发的LSF HOS相比，我们提出的解决方案将HOS降低了47％，与基于数据率阈值触发的LSF HOS相比，HOS降低了70％。通过增加POMDP的时间范围，可以进一步降低该量。

We propose to control handoffs (HOs) in user-centric cell-free massive MIMO networks through a partially observable Markov decision process (POMDP) with the state space representing the discrete versions of the large-scale fading (LSF) and the action space representing the association decisions of the user with the access points. Our proposed formulation accounts for the temporal evolution and the partial observability of the channel states. This allows us to consider future rewards when performing HO decisions, and hence obtain a robust HO policy. To alleviate the high complexity of solving our POMDP, we follow a divide-and-conquer approach by breaking down the POMDP formulation into sub-problems, each solved individually. Then, the policy and the candidate cluster of access points for the best solved sub-problem is used to perform HOs within a specific time horizon. We control the number of HOs by determining when to use the HO policy. Our simulation results show that our proposed solution reduces HOs by 47% compared to time-triggered LSF-based HOs and by 70% compared to data rate threshold-triggered LSF-based HOs. This amount can be further reduced through increasing the time horizon of the POMDP.

下载PDF全文

下载文献需遵守相关版权规定

论文标题