在线性时间内学习在现实世界图上解决组合优化问题

论文标题

在线性时间内学习在现实世界图上解决组合优化问题

Learning to Solve Combinatorial Optimization Problems on Real-World Graphs in Linear Time

论文作者

Drori, Iddo, Kharkar, Anant, Sickinger, William R., Kates, Brandon, Ma, Qiang, Ge, Suwen, Dolev, Eden, Dietrich, Brenda, Williamson, David P., Udell, Madeleine

论文摘要

对于图形问题的组合优化算法通常是针对每个新问题的重新设计的，专家对问题结构进行了仔细的关注。在这项工作中，我们开发了一个新的框架，以在图表上解决任何组合优化问题，这些问题可以由状态，动作和奖励定义为单个玩家游戏，包括最小跨越树，最短路径，旅行推销员问题和车辆路由问题，而没有专家知识。我们的方法在未标记的训练集上使用强化学习训练图形神经网络。然后，受过训练的网络在线性运行时间中将近似解决方案输出到新的图实例。相比之下，针对图表上NP问题量身定制的先前近似算法或启发式方法通常至少具有二次运行时间。我们证明了我们的方法在接近1的最佳差距的多项式和NP硬性问题上的适用性，并证明我们的方法能够很好地概括：（i）从小图上的训练到大图上的测试；（ii）从一种类型的随机图训练到在另一种类型的随机图上进行测试；（iii）从随机图上的训练到在现实世界图上运行。

Combinatorial optimization algorithms for graph problems are usually designed afresh for each new problem with careful attention by an expert to the problem structure. In this work, we develop a new framework to solve any combinatorial optimization problem over graphs that can be formulated as a single player game defined by states, actions, and rewards, including minimum spanning tree, shortest paths, traveling salesman problem, and vehicle routing problem, without expert knowledge. Our method trains a graph neural network using reinforcement learning on an unlabeled training set of graphs. The trained network then outputs approximate solutions to new graph instances in linear running time. In contrast, previous approximation algorithms or heuristics tailored to NP-hard problems on graphs generally have at least quadratic running time. We demonstrate the applicability of our approach on both polynomial and NP-hard problems with optimality gaps close to 1, and show that our method is able to generalize well: (i) from training on small graphs to testing on large graphs; (ii) from training on random graphs of one type to testing on random graphs of another type; and (iii) from training on random graphs to running on real world graphs.

下载PDF全文

下载文献需遵守相关版权规定

论文标题