Heterogeneous attentions for solving pickup and delivery problem via deep reinforcement learning
Recently, there is an emerging trend to apply deep reinforcement learning to solve the vehicle routing problem (VRP), where a learnt policy governs the selection of next node for visiting. However, existing methods could not handle well the pairing and precedence relationships in the pickup and deli...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2022
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8127 https://ink.library.smu.edu.sg/context/sis_research/article/9130/viewcontent/2110.02634.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9130 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-91302023-09-14T08:34:12Z Heterogeneous attentions for solving pickup and delivery problem via deep reinforcement learning LI, Jingwen XIN, Liang CAO, Zhiguang LIM, Andrew SONG, Wen ZHANG, Jie Recently, there is an emerging trend to apply deep reinforcement learning to solve the vehicle routing problem (VRP), where a learnt policy governs the selection of next node for visiting. However, existing methods could not handle well the pairing and precedence relationships in the pickup and delivery problem (PDP), which is a representative variant of VRP. To address this challenging issue, we leverage a novel neural network integrated with a heterogeneous attention mechanism to empower the policy in deep reinforcement learning to automatically select the nodes. In particular, the heterogeneous attention mechanism specifically prescribes attentions for each role of the nodes while taking into account the precedence constraint, i.e., the pickup node must precede the pairing delivery node. Further integrated with a masking scheme, the learnt policy is expected to find higher-quality solutions for solving PDP. Extensive experimental results show that our method outperforms the state-of-the-art heuristic and deep learning model, respectively, and generalizes well to different distributions and problem sizes. 2022-03-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8127 info:doi/10.1109/TITS.2021.3056120 https://ink.library.smu.edu.sg/context/sis_research/article/9130/viewcontent/2110.02634.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Reinforcement learning Routing Peer-to-peer computing Heuristic algorithms Deep learning Decoding Decision making Heterogeneous attention deep reinforcement learning pickup and delivery problem Artificial Intelligence and Robotics Transportation |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Reinforcement learning Routing Peer-to-peer computing Heuristic algorithms Deep learning Decoding Decision making Heterogeneous attention deep reinforcement learning pickup and delivery problem Artificial Intelligence and Robotics Transportation |
spellingShingle |
Reinforcement learning Routing Peer-to-peer computing Heuristic algorithms Deep learning Decoding Decision making Heterogeneous attention deep reinforcement learning pickup and delivery problem Artificial Intelligence and Robotics Transportation LI, Jingwen XIN, Liang CAO, Zhiguang LIM, Andrew SONG, Wen ZHANG, Jie Heterogeneous attentions for solving pickup and delivery problem via deep reinforcement learning |
description |
Recently, there is an emerging trend to apply deep reinforcement learning to solve the vehicle routing problem (VRP), where a learnt policy governs the selection of next node for visiting. However, existing methods could not handle well the pairing and precedence relationships in the pickup and delivery problem (PDP), which is a representative variant of VRP. To address this challenging issue, we leverage a novel neural network integrated with a heterogeneous attention mechanism to empower the policy in deep reinforcement learning to automatically select the nodes. In particular, the heterogeneous attention mechanism specifically prescribes attentions for each role of the nodes while taking into account the precedence constraint, i.e., the pickup node must precede the pairing delivery node. Further integrated with a masking scheme, the learnt policy is expected to find higher-quality solutions for solving PDP. Extensive experimental results show that our method outperforms the state-of-the-art heuristic and deep learning model, respectively, and generalizes well to different distributions and problem sizes. |
format |
text |
author |
LI, Jingwen XIN, Liang CAO, Zhiguang LIM, Andrew SONG, Wen ZHANG, Jie |
author_facet |
LI, Jingwen XIN, Liang CAO, Zhiguang LIM, Andrew SONG, Wen ZHANG, Jie |
author_sort |
LI, Jingwen |
title |
Heterogeneous attentions for solving pickup and delivery problem via deep reinforcement learning |
title_short |
Heterogeneous attentions for solving pickup and delivery problem via deep reinforcement learning |
title_full |
Heterogeneous attentions for solving pickup and delivery problem via deep reinforcement learning |
title_fullStr |
Heterogeneous attentions for solving pickup and delivery problem via deep reinforcement learning |
title_full_unstemmed |
Heterogeneous attentions for solving pickup and delivery problem via deep reinforcement learning |
title_sort |
heterogeneous attentions for solving pickup and delivery problem via deep reinforcement learning |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2022 |
url |
https://ink.library.smu.edu.sg/sis_research/8127 https://ink.library.smu.edu.sg/context/sis_research/article/9130/viewcontent/2110.02634.pdf |
_version_ |
1779157162398842880 |