Maximizing the probability of arriving on time: A practical q-learning method

The stochastic shortest path problem is of crucial importance for the development of sustainable transportation systems. Existing methods based on the probability tail model seek for the path that maximizes the probability of arriving at the destination before a deadline. However, they suffer from l...

Full description

Saved in:
Bibliographic Details
Main Authors: CAO, Zhiguang, GUO, Hongliang, ZHANG, Jie, OLIEHOEK, Frans, FASTENRATH, Ulrich
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2017
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8131
https://ink.library.smu.edu.sg/context/sis_research/article/9134/viewcontent/11170_Article_Text_14698_1_2_20201228.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9134
record_format dspace
spelling sg-smu-ink.sis_research-91342023-09-14T08:32:04Z Maximizing the probability of arriving on time: A practical q-learning method CAO, Zhiguang GUO, Hongliang ZHANG, Jie OLIEHOEK, Frans FASTENRATH, Ulrich The stochastic shortest path problem is of crucial importance for the development of sustainable transportation systems. Existing methods based on the probability tail model seek for the path that maximizes the probability of arriving at the destination before a deadline. However, they suffer from low accuracy and/or high computational cost. We design a novel Q-learning method where the converged Q-values have the practical meaning as the actual probabilities of arriving on time so as to improve accuracy. By further adopting dynamic neural networks to learn the value function, our method can scale well to large road networks with arbitrary deadlines. Experimental results on real road networks demonstrate the significant advantages of our method over other counterparts. 2017-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8131 info:doi/10.1609/aaai.v31i1.11170 https://ink.library.smu.edu.sg/context/sis_research/article/9134/viewcontent/11170_Article_Text_14698_1_2_20201228.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
spellingShingle Databases and Information Systems
CAO, Zhiguang
GUO, Hongliang
ZHANG, Jie
OLIEHOEK, Frans
FASTENRATH, Ulrich
Maximizing the probability of arriving on time: A practical q-learning method
description The stochastic shortest path problem is of crucial importance for the development of sustainable transportation systems. Existing methods based on the probability tail model seek for the path that maximizes the probability of arriving at the destination before a deadline. However, they suffer from low accuracy and/or high computational cost. We design a novel Q-learning method where the converged Q-values have the practical meaning as the actual probabilities of arriving on time so as to improve accuracy. By further adopting dynamic neural networks to learn the value function, our method can scale well to large road networks with arbitrary deadlines. Experimental results on real road networks demonstrate the significant advantages of our method over other counterparts.
format text
author CAO, Zhiguang
GUO, Hongliang
ZHANG, Jie
OLIEHOEK, Frans
FASTENRATH, Ulrich
author_facet CAO, Zhiguang
GUO, Hongliang
ZHANG, Jie
OLIEHOEK, Frans
FASTENRATH, Ulrich
author_sort CAO, Zhiguang
title Maximizing the probability of arriving on time: A practical q-learning method
title_short Maximizing the probability of arriving on time: A practical q-learning method
title_full Maximizing the probability of arriving on time: A practical q-learning method
title_fullStr Maximizing the probability of arriving on time: A practical q-learning method
title_full_unstemmed Maximizing the probability of arriving on time: A practical q-learning method
title_sort maximizing the probability of arriving on time: a practical q-learning method
publisher Institutional Knowledge at Singapore Management University
publishDate 2017
url https://ink.library.smu.edu.sg/sis_research/8131
https://ink.library.smu.edu.sg/context/sis_research/article/9134/viewcontent/11170_Article_Text_14698_1_2_20201228.pdf
_version_ 1779157175728340992