论文标题
数据驱动的仿射系统的最佳控制:线性编程角度
Data-Driven Optimal Control of Affine Systems: A Linear Programming Perspective
论文作者
论文摘要
在这封信中,我们在数据驱动的线性编程中讨论了仿射系统最佳控制的问题。首先,我们引入了一个统一的框架,以供价值函数,Q功能和放松的贝尔曼操作员的固定点表征。然后,在无模型设置中,我们展示了如何从一个小但充足的数据集中综合和估计贝尔曼的不平等现象。为了确保探索丰富性,我们完成了Willem的基本引理到仿射系统的扩展。
In this letter, we discuss the problem of optimal control for affine systems in the context of data-driven linear programming. First, we introduce a unified framework for the fixed point characterization of the value function, Q-function and relaxed Bellman operators. Then, in a model-free setting, we show how to synthesize and estimate Bellman inequalities from a small but sufficiently rich dataset. To guarantee exploration richness, we complete the extension of Willem's fundamental lemma to affine systems.