论文标题
学习通过基于图的模仿学习来优化置换流店调度
Learning to Optimize Permutation Flow Shop Scheduling via Graph-based Imitation Learning
论文作者
论文摘要
旨在寻找工作最佳排列的置换流店调度(PFSS)广泛用于制造系统中。在解决大规模PFSS问题时,传统优化算法(例如启发式方法)几乎无法满足解决方案准确性和计算效率的需求,因此基于学习的方法最近引起了更多的关注。一些工作试图通过加强学习方法来解决问题,这些方法在训练过程中遇到了缓慢的收敛问题,并且在解决方案方面仍然不够准确。为此,我们建议通过专家驱动的模仿学习来训练模型,从而更准确,准确地加速收敛。此外,为了提取输入作业的更好特征表示,我们将图形结构纳入编码器。广泛的实验表明,我们提出的模型获得了重大的促进性,并在多达1000个工作岗位的大规模问题中提供了出色的概括性。与最先进的增强学习方法相比,我们的模型网络参数仅减少到其37%的%,而我们模型对专家解决方案的解决方案差距从6.8 \%\%降低至1.3 \%。该代码可在:\ url {https://github.com/longkangli/pfss-il}中获得。
The permutation flow shop scheduling (PFSS), aiming at finding the optimal permutation of jobs, is widely used in manufacturing systems. When solving large-scale PFSS problems, traditional optimization algorithms such as heuristics could hardly meet the demands of both solution accuracy and computational efficiency, thus learning-based methods have recently garnered more attention. Some work attempts to solve the problems by reinforcement learning methods, which suffer from slow convergence issues during training and are still not accurate enough regarding the solutions. To that end, we propose to train the model via expert-driven imitation learning, which accelerates convergence more stably and accurately. Moreover, in order to extract better feature representations of input jobs, we incorporate the graph structure as the encoder. The extensive experiments reveal that our proposed model obtains significant promotion and presents excellent generalizability in large-scale problems with up to 1000 jobs. Compared to the state-of-the-art reinforcement learning method, our model's network parameters are reduced to only 37\% of theirs, and the solution gap of our model towards the expert solutions decreases from 6.8\% to 1.3\% on average. The code is available at: \url{https://github.com/longkangli/PFSS-IL}.