深度神经网络中的捷径学习

论文标题

深度神经网络中的捷径学习

Shortcut Learning in Deep Neural Networks

论文作者

Geirhos, Robert, Jacobsen, Jörn-Henrik, Michaelis, Claudio, Zemel, Richard, Brendel, Wieland, Bethge, Matthias, Wichmann, Felix A.

论文摘要

深度学习引发了当前人工智能的崛起，并且是当今机器智能的主力。许多成功案例已经迅速传播到整个科学，工业和社会上，但是它的局限性直到最近才开始焦点。从这个角度来看，我们试图提炼多少深度学习问题可以看作是同一基本问题的不同症状：快捷方式学习。快捷方式是在标准基准上表现良好的决策规则，但未能转移到更具挑战性的测试条件（例如现实世界情景）上。相关问题在比较心理学，教育和语言学中是已知的，这表明快捷方式学习可能是学习系统，生物学和人工的共同特征。基于这些观察结果，我们为模型解释和基准测试开发了一组建议，强调了机器学习的最新进展，以提高从实验室到现实世界应用程序的鲁棒性和可传递性。

Deep learning has triggered the current rise of artificial intelligence and is the workhorse of today's machine intelligence. Numerous success stories have rapidly spread all over science, industry and society, but its limitations have only recently come into focus. In this perspective we seek to distill how many of deep learning's problems can be seen as different symptoms of the same underlying problem: shortcut learning. Shortcuts are decision rules that perform well on standard benchmarks but fail to transfer to more challenging testing conditions, such as real-world scenarios. Related issues are known in Comparative Psychology, Education and Linguistics, suggesting that shortcut learning may be a common characteristic of learning systems, biological and artificial alike. Based on these observations, we develop a set of recommendations for model interpretation and benchmarking, highlighting recent advances in machine learning to improve robustness and transferability from the lab to real-world applications.

下载PDF全文

下载文献需遵守相关版权规定

论文标题