Paper Title

Unbounded Dynamic Programming via the Q-Transform

Authors

Qingyin Ma, John Stachurski, Alexis Akira Toda

Abstract

We propose a new approach to solving dynamic decision problems with unbounded rewards based on the transformations used in Q-learning. In our case, the objective of the transform is to convert an unbounded dynamic program into a bounded one. The approach is general enough to handle problems for which existing methods struggle, and yet simple relative to other techniques and accessible for applied work. We show by example that many common decision problems satisfy our conditions.
