论文标题
通过增强学习优化空中风能
Optimizing Airborne Wind Energy with Reinforcement Learning
论文作者
论文摘要
机载风能是一种轻巧的技术,可以使用风筝和滑翔机等机载设备从风中提取电源,在这种设备上可以动态控制机翼方向,以最大程度地发挥性能。湍流空气动力学的动态复杂性使该优化问题无法通过常规方法(例如经典控制理论),这些方法依赖于手头动力学系统的准确且可拖延的分析模型。在这里,我们建议通过加强学习来攻击这个问题,这种技术(通过与环境的反复进行反复互动)学会将观察结果与有利可图的行动联系起来,而无需对系统的先验知识。我们表明,在模拟环境中,增强学习可以找到一种控制风筝的有效方法,以便它可以长途拖动车辆。我们使用的算法是基于一小部分直观观察,其物理透明的解释允许将大约最佳策略描述为简单的操纵指令列表。
Airborne Wind Energy is a lightweight technology that allows power extraction from the wind using airborne devices such as kites and gliders, where the airfoil orientation can be dynamically controlled in order to maximize performance. The dynamical complexity of turbulent aerodynamics makes this optimization problem unapproachable by conventional methods such as classical control theory, which rely on accurate and tractable analytical models of the dynamical system at hand. Here we propose to attack this problem through Reinforcement Learning, a technique that -- by repeated trial-and-error interactions with the environment -- learns to associate observations with profitable actions without requiring prior knowledge of the system. We show that in a simulated environment Reinforcement Learning finds an efficient way to control a kite so that it can tow a vehicle for long distances. The algorithm we use is based on a small set of intuitive observations and its physically transparent interpretation allows to describe the approximately optimal strategy as a simple list of manoeuvring instructions.