多尺度增强学习，用于发现非线性时空传输系统的减少订单闭合模型

论文标题

多尺度增强学习，用于发现非线性时空传输系统的减少订单闭合模型

Variational multiscale reinforcement learning for discovering reduced order closure models of nonlinear spatiotemporal transport systems

论文作者

San, Omer, Pawar, Suraj, Rasheed, Adil

论文摘要

在许多科学应用的计算建模和仿真中，一个核心挑战是由于基本的高度非线性多尺度相互作用而实现其粗粒表示的稳健而准确的封闭。这些封闭模型在许多非线性时空系统中很常见，可以考虑降低订单表示，包括流体中的许多运输现象。以前的数据驱动的封闭建模工作主要集中在使用高保真模拟数据的监督学习方法上。另一方面，增强学习（RL）是一种强大但相对未知的方法，在时空扩展系统中。在这项研究中，我们提出了一个模块化的动态闭合建模和发现框架，以稳定基于二次非线性的许多非线性时空动力学系统中可能出现的Galerkin投影降低订单模型。但是，创建强大的RL代理的关键要素是引入可行的奖励函数，该奖励函数可以由RL模型和高保真仿真数据之间的任何差异指标构成。首先，我们引入了一个多模式RL（MMRL），以发现模式依赖的闭合策略，该策略利用高保真数据来奖励我们的RL代理。然后，我们制定了一种变异多尺度RL（VMRL）方法来发现闭合模型，而无需在设计奖励功能时访问高保真数据。具体而言，我们的主要创新是利用变异多形式主义来量化Galerkin Systems中模态相互作用之间的差异。我们模拟粘性汉堡方程的结果表明，所提出的VMRL方法会导致稳健而准确的闭合参数化，并且可能有可能用于发现复杂动力学系统的规模感知闭合模型。

A central challenge in the computational modeling and simulation of a multitude of science applications is to achieve robust and accurate closures for their coarse-grained representations due to underlying highly nonlinear multiscale interactions. These closure models are common in many nonlinear spatiotemporal systems to account for losses due to reduced order representations, including many transport phenomena in fluids. Previous data-driven closure modeling efforts have mostly focused on supervised learning approaches using high fidelity simulation data. On the other hand, reinforcement learning (RL) is a powerful yet relatively uncharted method in spatiotemporally extended systems. In this study, we put forth a modular dynamic closure modeling and discovery framework to stabilize the Galerkin projection based reduced order models that may arise in many nonlinear spatiotemporal dynamical systems with quadratic nonlinearity. However, a key element in creating a robust RL agent is to introduce a feasible reward function, which can be constituted of any difference metrics between the RL model and high fidelity simulation data. First, we introduce a multi-modal RL (MMRL) to discover mode-dependant closure policies that utilize the high fidelity data in rewarding our RL agent. We then formulate a variational multiscale RL (VMRL) approach to discover closure models without requiring access to the high fidelity data in designing the reward function. Specifically, our chief innovation is to leverage variational multiscale formalism to quantify the difference between modal interactions in Galerkin systems. Our results in simulating the viscous Burgers equation indicate that the proposed VMRL method leads to robust and accurate closure parameterizations, and it may potentially be used to discover scale-aware closure models for complex dynamical systems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题