Paper Title
Inferring and Conveying Intentionality: Beyond Numerical Rewards to Logical Intentions
Authors
Abstract
Shared intentionality is a critical component in developing conscious AI agents capable of collaboration, self-reflection, deliberation, and reasoning. We formulate inference of shared intentionality as an inverse reinforcement learning problem with logical reward specifications. We show how the approach can infer task descriptions from demonstrations. We also extend our approach to actively convey intentionality. We demonstrate the approach on a simple grid-world example.