Learning improvement heuristics for solving routing problems

Recent studies in using deep learning to solve routing problems focus on construction heuristics, the solutions of which are still far from optimality. Improvement heuristics have great potential to narrow this gap by iteratively refining a solution. However, classic improvement heuristics are all g...

Full description

Saved in:
Bibliographic Details
Main Authors: WU, Yaoxin, SONG, Wen, CAO, Zhiguang, ZHANG, Jie, LIM, Andrew
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8129
https://ink.library.smu.edu.sg/context/sis_research/article/9132/viewcontent/1912.05784.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9132
record_format dspace
spelling sg-smu-ink.sis_research-91322023-09-14T08:32:58Z Learning improvement heuristics for solving routing problems WU, Yaoxin SONG, Wen CAO, Zhiguang ZHANG, Jie LIM, Andrew Recent studies in using deep learning to solve routing problems focus on construction heuristics, the solutions of which are still far from optimality. Improvement heuristics have great potential to narrow this gap by iteratively refining a solution. However, classic improvement heuristics are all guided by hand-crafted rules which may limit their performance. In this paper, we propose a deep reinforcement learning framework to learn the improvement heuristics for routing problems. We design a self-attention based deep architecture as the policy network to guide the selection of next solution. We apply our method to two important routing problems, i.e. travelling salesman problem (TSP) and capacitated vehicle routing problem (CVRP). Experiments show that our method outperforms state-of-theart deep learning based approaches. The learned policies are more effective than the traditional hand-crafted ones, and can be further enhanced by simple diversifying strategies. Moreover, the policies generalize well to different problem sizes, initial solutions and even real-world dataset. 2022-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8129 info:doi/10.1109/TNNLS.2021.3068828 https://ink.library.smu.edu.sg/context/sis_research/article/9132/viewcontent/1912.05784.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Routing Heuristic algorithms Vehicle routing Traveling salesman problems Training Task analysis Search problems OS and Networks
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Routing
Heuristic algorithms
Vehicle routing
Traveling salesman problems
Training
Task analysis
Search problems
OS and Networks
spellingShingle Routing
Heuristic algorithms
Vehicle routing
Traveling salesman problems
Training
Task analysis
Search problems
OS and Networks
WU, Yaoxin
SONG, Wen
CAO, Zhiguang
ZHANG, Jie
LIM, Andrew
Learning improvement heuristics for solving routing problems
description Recent studies in using deep learning to solve routing problems focus on construction heuristics, the solutions of which are still far from optimality. Improvement heuristics have great potential to narrow this gap by iteratively refining a solution. However, classic improvement heuristics are all guided by hand-crafted rules which may limit their performance. In this paper, we propose a deep reinforcement learning framework to learn the improvement heuristics for routing problems. We design a self-attention based deep architecture as the policy network to guide the selection of next solution. We apply our method to two important routing problems, i.e. travelling salesman problem (TSP) and capacitated vehicle routing problem (CVRP). Experiments show that our method outperforms state-of-theart deep learning based approaches. The learned policies are more effective than the traditional hand-crafted ones, and can be further enhanced by simple diversifying strategies. Moreover, the policies generalize well to different problem sizes, initial solutions and even real-world dataset.
format text
author WU, Yaoxin
SONG, Wen
CAO, Zhiguang
ZHANG, Jie
LIM, Andrew
author_facet WU, Yaoxin
SONG, Wen
CAO, Zhiguang
ZHANG, Jie
LIM, Andrew
author_sort WU, Yaoxin
title Learning improvement heuristics for solving routing problems
title_short Learning improvement heuristics for solving routing problems
title_full Learning improvement heuristics for solving routing problems
title_fullStr Learning improvement heuristics for solving routing problems
title_full_unstemmed Learning improvement heuristics for solving routing problems
title_sort learning improvement heuristics for solving routing problems
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/8129
https://ink.library.smu.edu.sg/context/sis_research/article/9132/viewcontent/1912.05784.pdf
_version_ 1779157174861168640