Maximizing the probability of arriving on time: A practical q-learning method
The stochastic shortest path problem is of crucial importance for the development of sustainable transportation systems. Existing methods based on the probability tail model seek for the path that maximizes the probability of arriving at the destination before a deadline. However, they suffer from l...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2017
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8131 https://ink.library.smu.edu.sg/context/sis_research/article/9134/viewcontent/11170_Article_Text_14698_1_2_20201228.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9134 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-91342023-09-14T08:32:04Z Maximizing the probability of arriving on time: A practical q-learning method CAO, Zhiguang GUO, Hongliang ZHANG, Jie OLIEHOEK, Frans FASTENRATH, Ulrich The stochastic shortest path problem is of crucial importance for the development of sustainable transportation systems. Existing methods based on the probability tail model seek for the path that maximizes the probability of arriving at the destination before a deadline. However, they suffer from low accuracy and/or high computational cost. We design a novel Q-learning method where the converged Q-values have the practical meaning as the actual probabilities of arriving on time so as to improve accuracy. By further adopting dynamic neural networks to learn the value function, our method can scale well to large road networks with arbitrary deadlines. Experimental results on real road networks demonstrate the significant advantages of our method over other counterparts. 2017-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8131 info:doi/10.1609/aaai.v31i1.11170 https://ink.library.smu.edu.sg/context/sis_research/article/9134/viewcontent/11170_Article_Text_14698_1_2_20201228.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems |
spellingShingle |
Databases and Information Systems CAO, Zhiguang GUO, Hongliang ZHANG, Jie OLIEHOEK, Frans FASTENRATH, Ulrich Maximizing the probability of arriving on time: A practical q-learning method |
description |
The stochastic shortest path problem is of crucial importance for the development of sustainable transportation systems. Existing methods based on the probability tail model seek for the path that maximizes the probability of arriving at the destination before a deadline. However, they suffer from low accuracy and/or high computational cost. We design a novel Q-learning method where the converged Q-values have the practical meaning as the actual probabilities of arriving on time so as to improve accuracy. By further adopting dynamic neural networks to learn the value function, our method can scale well to large road networks with arbitrary deadlines. Experimental results on real road networks demonstrate the significant advantages of our method over other counterparts. |
format |
text |
author |
CAO, Zhiguang GUO, Hongliang ZHANG, Jie OLIEHOEK, Frans FASTENRATH, Ulrich |
author_facet |
CAO, Zhiguang GUO, Hongliang ZHANG, Jie OLIEHOEK, Frans FASTENRATH, Ulrich |
author_sort |
CAO, Zhiguang |
title |
Maximizing the probability of arriving on time: A practical q-learning method |
title_short |
Maximizing the probability of arriving on time: A practical q-learning method |
title_full |
Maximizing the probability of arriving on time: A practical q-learning method |
title_fullStr |
Maximizing the probability of arriving on time: A practical q-learning method |
title_full_unstemmed |
Maximizing the probability of arriving on time: A practical q-learning method |
title_sort |
maximizing the probability of arriving on time: a practical q-learning method |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2017 |
url |
https://ink.library.smu.edu.sg/sis_research/8131 https://ink.library.smu.edu.sg/context/sis_research/article/9134/viewcontent/11170_Article_Text_14698_1_2_20201228.pdf |
_version_ |
1779157175728340992 |