深度学习中当前的多任务优化方法是否有帮助？

论文标题

深度学习中当前的多任务优化方法是否有帮助？

Do Current Multi-Task Optimization Methods in Deep Learning Even Help?

论文作者

Xin, Derrick, Ghorbani, Behrooz, Garg, Ankush, Firat, Orhan, Gilmer, Justin

论文摘要

最近的研究提出了一系列针对深度任务模型的专业优化算法。通常认为这些多任务优化（MTO）方法产生的解决方案优于仅通过优化任务损失的加权平均值而获得的解决方案。在本文中，我们对各种语言和视觉任务进行大规模实验，以检查这些主张的经验有效性。我们表明，尽管这些算法的设计和计算复杂性增加了，但MTO方法并未产生超出传统优化方法可实现的性能的任何改进。我们重点介绍了替代性策略，这些策略始终如一地提高性能概况，并指出可能导致次优效果的常见训练陷阱。最后，我们概述了可靠地评估MTO算法的性能并讨论潜在解决方案的挑战。

Recent research has proposed a series of specialized optimization algorithms for deep multi-task models. It is often claimed that these multi-task optimization (MTO) methods yield solutions that are superior to the ones found by simply optimizing a weighted average of the task losses. In this paper, we perform large-scale experiments on a variety of language and vision tasks to examine the empirical validity of these claims. We show that, despite the added design and computational complexity of these algorithms, MTO methods do not yield any performance improvements beyond what is achievable via traditional optimization approaches. We highlight alternative strategies that consistently yield improvements to the performance profile and point out common training pitfalls that might cause suboptimal results. Finally, we outline challenges in reliably evaluating the performance of MTO algorithms and discuss potential solutions.

下载PDF全文

下载文献需遵守相关版权规定

论文标题