论文标题
通过有限尺寸的神经网络可证明解决背包问题的良好解决方案
Provably Good Solutions to the Knapsack Problem via Neural Networks of Bounded Size
论文作者
论文摘要
对神经网络表现的令人满意和严格的数学理解的发展是人工智能的主要挑战。在这种背景下,我们通过经典的NP-Hard Knapsack问题的示例来研究神经网络的表达能力。我们的主要贡献是一类经常性的神经网络(RNN),其具有校准的线性单元,这些单元迭代应用于背包实例的每个项目,从而计算最佳或可证明的良好解决方案值。我们表明,根据最佳背包解决方案的利润,深度为四和宽度的RNN足以找到最佳的背包解决方案。我们还证明了RNN的大小和计算的背包解决方案的质量之间的以下权衡:对于由$ n $项目组成的背包实例,五个深度和宽度$ w $的rnn计算至少$ 1- \ MATHCAL {O}(n^2/\ sqrt {w} w} w} w})的价值的解决方案。我们的结果基于背包问题的经典动态编程公式,以及对Knapsack问题的众所周知的完全多项式时间近似方案的核心核心。经过精心进行的计算研究在定性上支持我们的理论规模界限。最后,我们指出,我们的结果可以推广到许多其他组合优化问题,这些问题吸收了动态编程解决方案方法,例如各种最短路径问题,最长的常见子序列问题以及旅行销售人员的问题。
The development of a satisfying and rigorous mathematical understanding of the performance of neural networks is a major challenge in artificial intelligence. Against this background, we study the expressive power of neural networks through the example of the classical NP-hard Knapsack Problem. Our main contribution is a class of recurrent neural networks (RNNs) with rectified linear units that are iteratively applied to each item of a Knapsack instance and thereby compute optimal or provably good solution values. We show that an RNN of depth four and width depending quadratically on the profit of an optimum Knapsack solution is sufficient to find optimum Knapsack solutions. We also prove the following tradeoff between the size of an RNN and the quality of the computed Knapsack solution: for Knapsack instances consisting of $n$ items, an RNN of depth five and width $w$ computes a solution of value at least $1-\mathcal{O}(n^2/\sqrt{w})$ times the optimum solution value. Our results build upon a classical dynamic programming formulation of the Knapsack Problem as well as a careful rounding of profit values that are also at the core of the well-known fully polynomial-time approximation scheme for the Knapsack Problem. A carefully conducted computational study qualitatively supports our theoretical size bounds. Finally, we point out that our results can be generalized to many other combinatorial optimization problems that admit dynamic programming solution methods, such as various Shortest Path Problems, the Longest Common Subsequence Problem, and the Traveling Salesperson Problem.