论文标题
SOBA:会话最佳基于MDP的网络友好建议
SOBA: Session optimal MDP-based network friendly recommendations
论文作者
论文摘要
通过CDN或网络边缘上的缓存内容已巩固,作为提高网络成本并为用户提供更好的流媒体体验的一种手段。此外,将用户推向低成本内容,最近已经获得了势头,以提高网络性能的策略。我们专注于网络友好建议(NFR)的最佳政策设计问题。我们偏离了最近的建模尝试,并提出了马尔可夫决策过程(MDP)配方。 MDP提供了一个统一的框架,该框架可以对具有随机会话长度的用户建模。事实证明,许多最先进的方法可以作为我们MDP配方的子案例施放。此外,该方法为模型用户提供了灵活性,这些用户对收到的建议的质量进行了反应。在绩效方面,对于用户,我们在理论上表现出了任意数量的内容,并且对实际痕迹进行了广泛的验证,MDP方法在会话成本中均超过了近视算法,并且提供了建议的质量。最后,即使与针对特定子案例的最佳最新算法相比,我们的MDP框架的效率明显更高,使执行时间的加快倍数为10倍,并通过内容目录和建议批次尺寸享受更好的扩展。
Caching content over CDNs or at the network edge has been solidified as a means to improve network cost and offer better streaming experience to users. Furthermore, nudging the users towards low-cost content has recently gained momentum as a strategy to boost network performance. We focus on the problem of optimal policy design for Network Friendly Recommendations (NFR). We depart from recent modeling attempts, and propose a Markov Decision Process (MDP) formulation. MDPs offer a unified framework that can model a user with random session length. As it turns out, many state-of-the-art approaches can be cast as subcases of our MDP formulation. Moreover, the approach offers flexibility to model users who are reactive to the quality of the received recommendations. In terms of performance, for users consuming an arbitrary number of contents in sequence, we show theoretically and using extensive validation over real traces that the MDP approach outperforms myopic algorithms both in session cost as well as in offered recommendation quality. Finally, even compared to optimal state-of-art algorithms targeting specific subcases, our MDP framework is significantly more efficient, speeding the execution time by a factor of 10, and enjoying better scaling with the content catalog and recommendation batch sizes.