Learning to search feasible and infeasible regions of routing problems with flexible neural k-opt
In this paper, we present Neural k-Opt (NeuOpt), a novel learning-to-search (L2S) solver for routing problems. It learns to perform flexible k-opt exchanges based on a tailored action factorization method and a customized recurrent dual-stream decoder. As a pioneering work to circumvent the pure fea...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8399 https://ink.library.smu.edu.sg/context/sis_research/article/9402/viewcontent/2310.18264.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9402 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-94022024-01-09T03:51:26Z Learning to search feasible and infeasible regions of routing problems with flexible neural k-opt MA, Yining CAO, Zhiguang CHEE, Yew Meng In this paper, we present Neural k-Opt (NeuOpt), a novel learning-to-search (L2S) solver for routing problems. It learns to perform flexible k-opt exchanges based on a tailored action factorization method and a customized recurrent dual-stream decoder. As a pioneering work to circumvent the pure feasibility masking scheme and enable the autonomous exploration of both feasible and infeasible regions, we then propose the Guided Infeasible Region Exploration (GIRE) scheme, which supplements the NeuOpt policy network with feasibility-related features and leverages reward shaping to steer reinforcement learning more effectively. Besides, we further equip NeuOpt with dynamic data augmentations during inference for more diverse searches. Extensive experiments on the Traveling Salesman Problem (TSP) and Capacitated Vehicle Routing Problem (CVRP) demonstrate that our NeuOpt not only significantly outstrips existing (masking-based) L2S solvers, but also showcases superiority over the learning-to-construct (L2C) and learning-to-predict (L2P) solvers. Notably, we offer fresh perspectives on how neural solvers could efficiently handle VRP constraints, against masking-based feasibility representation. 2023-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8399 https://ink.library.smu.edu.sg/context/sis_research/article/9402/viewcontent/2310.18264.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems |
spellingShingle |
Databases and Information Systems MA, Yining CAO, Zhiguang CHEE, Yew Meng Learning to search feasible and infeasible regions of routing problems with flexible neural k-opt |
description |
In this paper, we present Neural k-Opt (NeuOpt), a novel learning-to-search (L2S) solver for routing problems. It learns to perform flexible k-opt exchanges based on a tailored action factorization method and a customized recurrent dual-stream decoder. As a pioneering work to circumvent the pure feasibility masking scheme and enable the autonomous exploration of both feasible and infeasible regions, we then propose the Guided Infeasible Region Exploration (GIRE) scheme, which supplements the NeuOpt policy network with feasibility-related features and leverages reward shaping to steer reinforcement learning more effectively. Besides, we further equip NeuOpt with dynamic data augmentations during inference for more diverse searches. Extensive experiments on the Traveling Salesman Problem (TSP) and Capacitated Vehicle Routing Problem (CVRP) demonstrate that our NeuOpt not only significantly outstrips existing (masking-based) L2S solvers, but also showcases superiority over the learning-to-construct (L2C) and learning-to-predict (L2P) solvers. Notably, we offer fresh perspectives on how neural solvers could efficiently handle VRP constraints, against masking-based feasibility representation. |
format |
text |
author |
MA, Yining CAO, Zhiguang CHEE, Yew Meng |
author_facet |
MA, Yining CAO, Zhiguang CHEE, Yew Meng |
author_sort |
MA, Yining |
title |
Learning to search feasible and infeasible regions of routing problems with flexible neural k-opt |
title_short |
Learning to search feasible and infeasible regions of routing problems with flexible neural k-opt |
title_full |
Learning to search feasible and infeasible regions of routing problems with flexible neural k-opt |
title_fullStr |
Learning to search feasible and infeasible regions of routing problems with flexible neural k-opt |
title_full_unstemmed |
Learning to search feasible and infeasible regions of routing problems with flexible neural k-opt |
title_sort |
learning to search feasible and infeasible regions of routing problems with flexible neural k-opt |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2023 |
url |
https://ink.library.smu.edu.sg/sis_research/8399 https://ink.library.smu.edu.sg/context/sis_research/article/9402/viewcontent/2310.18264.pdf |
_version_ |
1787590768999792640 |