Imitation improvement learning for large-scale capacitated vehicle routing problems

Recent works using deep reinforcement learning (RL) to solve routing problems such as the capacitated vehicle routing problem (CVRP) have focused on improvement learning-based methods, which involve improving a given solution until it becomes near-optimal. Although adequate solutions can be achieved...

Full description

Saved in:
Bibliographic Details
Main Authors: BUI, The Viet, MAI, Tien
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8025
https://ink.library.smu.edu.sg/context/sis_research/article/9028/viewcontent/main__2_.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9028
record_format dspace
spelling sg-smu-ink.sis_research-90282023-08-11T08:30:44Z Imitation improvement learning for large-scale capacitated vehicle routing problems BUI, The Viet MAI, Tien Recent works using deep reinforcement learning (RL) to solve routing problems such as the capacitated vehicle routing problem (CVRP) have focused on improvement learning-based methods, which involve improving a given solution until it becomes near-optimal. Although adequate solutions can be achieved for small problem instances, their efficiency degrades for large-scale ones. In this work, we propose a newimprovement learning-based framework based on imitation learning where classical heuristics serve as experts to encourage the policy model to mimic and produce similar or better solutions. Moreover, to improve scalability, we propose Clockwise Clustering, a novel augmented framework for decomposing large-scale CVRP into subproblems by clustering sequentially nodes in clockwise order, and then learningto solve them simultaneously. Our approaches enhance state-of-the-art CVRP solvers while attaining competitive solution quality on several well-known datasets, including real-world instances with sizes up to 30,000 nodes. Our best methods are able to achieve new state-of-the-art results for several largeinstances and generalize to a wide range of CVRP variants and solvers. We also contribute new datasets and results to test the generalizability of our deep RL algorithms. 2023-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8025 info:doi/10.1609/icaps.v33i1.27236 https://ink.library.smu.edu.sg/context/sis_research/article/9028/viewcontent/main__2_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University capacitated vehicle routing problem Clockwise clustering imitation learning improvement learning reinforcement learning Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering Transportation
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic capacitated vehicle routing problem
Clockwise clustering
imitation learning
improvement learning
reinforcement learning
Artificial Intelligence and Robotics
Operations Research, Systems Engineering and Industrial Engineering
Transportation
spellingShingle capacitated vehicle routing problem
Clockwise clustering
imitation learning
improvement learning
reinforcement learning
Artificial Intelligence and Robotics
Operations Research, Systems Engineering and Industrial Engineering
Transportation
BUI, The Viet
MAI, Tien
Imitation improvement learning for large-scale capacitated vehicle routing problems
description Recent works using deep reinforcement learning (RL) to solve routing problems such as the capacitated vehicle routing problem (CVRP) have focused on improvement learning-based methods, which involve improving a given solution until it becomes near-optimal. Although adequate solutions can be achieved for small problem instances, their efficiency degrades for large-scale ones. In this work, we propose a newimprovement learning-based framework based on imitation learning where classical heuristics serve as experts to encourage the policy model to mimic and produce similar or better solutions. Moreover, to improve scalability, we propose Clockwise Clustering, a novel augmented framework for decomposing large-scale CVRP into subproblems by clustering sequentially nodes in clockwise order, and then learningto solve them simultaneously. Our approaches enhance state-of-the-art CVRP solvers while attaining competitive solution quality on several well-known datasets, including real-world instances with sizes up to 30,000 nodes. Our best methods are able to achieve new state-of-the-art results for several largeinstances and generalize to a wide range of CVRP variants and solvers. We also contribute new datasets and results to test the generalizability of our deep RL algorithms.
format text
author BUI, The Viet
MAI, Tien
author_facet BUI, The Viet
MAI, Tien
author_sort BUI, The Viet
title Imitation improvement learning for large-scale capacitated vehicle routing problems
title_short Imitation improvement learning for large-scale capacitated vehicle routing problems
title_full Imitation improvement learning for large-scale capacitated vehicle routing problems
title_fullStr Imitation improvement learning for large-scale capacitated vehicle routing problems
title_full_unstemmed Imitation improvement learning for large-scale capacitated vehicle routing problems
title_sort imitation improvement learning for large-scale capacitated vehicle routing problems
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8025
https://ink.library.smu.edu.sg/context/sis_research/article/9028/viewcontent/main__2_.pdf
_version_ 1779156863368036352