Imitation improvement learning for large-scale capacitated vehicle routing problems
Recent works using deep reinforcement learning (RL) to solve routing problems such as the capacitated vehicle routing problem (CVRP) have focused on improvement learning-based methods, which involve improving a given solution until it becomes near-optimal. Although adequate solutions can be achieved...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8025 https://ink.library.smu.edu.sg/context/sis_research/article/9028/viewcontent/main__2_.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9028 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-90282023-08-11T08:30:44Z Imitation improvement learning for large-scale capacitated vehicle routing problems BUI, The Viet MAI, Tien Recent works using deep reinforcement learning (RL) to solve routing problems such as the capacitated vehicle routing problem (CVRP) have focused on improvement learning-based methods, which involve improving a given solution until it becomes near-optimal. Although adequate solutions can be achieved for small problem instances, their efficiency degrades for large-scale ones. In this work, we propose a newimprovement learning-based framework based on imitation learning where classical heuristics serve as experts to encourage the policy model to mimic and produce similar or better solutions. Moreover, to improve scalability, we propose Clockwise Clustering, a novel augmented framework for decomposing large-scale CVRP into subproblems by clustering sequentially nodes in clockwise order, and then learningto solve them simultaneously. Our approaches enhance state-of-the-art CVRP solvers while attaining competitive solution quality on several well-known datasets, including real-world instances with sizes up to 30,000 nodes. Our best methods are able to achieve new state-of-the-art results for several largeinstances and generalize to a wide range of CVRP variants and solvers. We also contribute new datasets and results to test the generalizability of our deep RL algorithms. 2023-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8025 info:doi/10.1609/icaps.v33i1.27236 https://ink.library.smu.edu.sg/context/sis_research/article/9028/viewcontent/main__2_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University capacitated vehicle routing problem Clockwise clustering imitation learning improvement learning reinforcement learning Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering Transportation |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
capacitated vehicle routing problem Clockwise clustering imitation learning improvement learning reinforcement learning Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering Transportation |
spellingShingle |
capacitated vehicle routing problem Clockwise clustering imitation learning improvement learning reinforcement learning Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering Transportation BUI, The Viet MAI, Tien Imitation improvement learning for large-scale capacitated vehicle routing problems |
description |
Recent works using deep reinforcement learning (RL) to solve routing problems such as the capacitated vehicle routing problem (CVRP) have focused on improvement learning-based methods, which involve improving a given solution until it becomes near-optimal. Although adequate solutions can be achieved for small problem instances, their efficiency degrades for large-scale ones. In this work, we propose a newimprovement learning-based framework based on imitation learning where classical heuristics serve as experts to encourage the policy model to mimic and produce similar or better solutions. Moreover, to improve scalability, we propose Clockwise Clustering, a novel augmented framework for decomposing large-scale CVRP into subproblems by clustering sequentially nodes in clockwise order, and then learningto solve them simultaneously. Our approaches enhance state-of-the-art CVRP solvers while attaining competitive solution quality on several well-known datasets, including real-world instances with sizes up to 30,000 nodes. Our best methods are able to achieve new state-of-the-art results for several largeinstances and generalize to a wide range of CVRP variants and solvers. We also contribute new datasets and results to test the generalizability of our deep RL algorithms. |
format |
text |
author |
BUI, The Viet MAI, Tien |
author_facet |
BUI, The Viet MAI, Tien |
author_sort |
BUI, The Viet |
title |
Imitation improvement learning for large-scale capacitated vehicle routing problems |
title_short |
Imitation improvement learning for large-scale capacitated vehicle routing problems |
title_full |
Imitation improvement learning for large-scale capacitated vehicle routing problems |
title_fullStr |
Imitation improvement learning for large-scale capacitated vehicle routing problems |
title_full_unstemmed |
Imitation improvement learning for large-scale capacitated vehicle routing problems |
title_sort |
imitation improvement learning for large-scale capacitated vehicle routing problems |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2023 |
url |
https://ink.library.smu.edu.sg/sis_research/8025 https://ink.library.smu.edu.sg/context/sis_research/article/9028/viewcontent/main__2_.pdf |
_version_ |
1779156863368036352 |