基于多目标的基于模型的强化学习，用于传染病控制

论文标题

基于多目标的基于模型的强化学习，用于传染病控制

Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control

论文作者

Wan, Runzhe, Zhang, Xinyu, Song, Rui

论文摘要

新型冠状病毒（Covid-19）等严重的传染病对公共卫生构成了巨大威胁。严格的控制措施，例如关闭学校和全职命令，同时产生重大影响，也带来了巨大的经济损失。面对一种新兴的传染病，对决策者来说，一个关键的问题是如何使权衡并及时实施适当的干预措施，鉴于巨大的不确定性。在这项工作中，我们提出了一个基于多目标的基于模型的增强学习框架，以促进数据驱动的决策并最大程度地降低整体长期成本。具体而言，在每个决策点上，首先将贝叶斯流行病学模型作为环境模型学习，然后将基于模型的多目标计划算法用于查找一组帕托托最佳策略。该框架与每个策略的预测频段相结合，为决策者提供了实时决策支持工具。该应用程序是通过19日在中国传播的。

Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health. Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses. In the face of an emerging infectious disease, a crucial question for policymakers is how to make the trade-off and implement the appropriate interventions timely given the huge uncertainty. In this work, we propose a Multi-Objective Model-based Reinforcement Learning framework to facilitate data-driven decision-making and minimize the overall long-term cost. Specifically, at each decision point, a Bayesian epidemiological model is first learned as the environment model, and then the proposed model-based multi-objective planning algorithm is applied to find a set of Pareto-optimal policies. This framework, combined with the prediction bands for each policy, provides a real-time decision support tool for policymakers. The application is demonstrated with the spread of COVID-19 in China.

下载PDF全文

下载文献需遵守相关版权规定

论文标题