Maximizing the probability of arriving on time: A practical q-learning method

The stochastic shortest path problem is of crucial importance for the development of sustainable transportation systems. Existing methods based on the probability tail model seek for the path that maximizes the probability of arriving at the destination before a deadline. However, they suffer from l...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون:	CAO, Zhiguang, GUO, Hongliang, ZHANG, Jie, OLIEHOEK, Frans, FASTENRATH, Ulrich
التنسيق:	text
اللغة:	English
منشور في:	Institutional Knowledge at Singapore Management University 2017
الموضوعات:	Databases and Information Systems
الوصول للمادة أونلاين:	https://ink.library.smu.edu.sg/sis_research/8131 https://ink.library.smu.edu.sg/context/sis_research/article/9134/viewcontent/11170_Article_Text_14698_1_2_20201228.pdf
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Singapore Management University
اللغة:	English

الوصف
الملخص:	The stochastic shortest path problem is of crucial importance for the development of sustainable transportation systems. Existing methods based on the probability tail model seek for the path that maximizes the probability of arriving at the destination before a deadline. However, they suffer from low accuracy and/or high computational cost. We design a novel Q-learning method where the converged Q-values have the practical meaning as the actual probabilities of arriving on time so as to improve accuracy. By further adopting dynamic neural networks to learn the value function, our method can scale well to large road networks with arbitrary deadlines. Experimental results on real road networks demonstrate the significant advantages of our method over other counterparts.

Maximizing the probability of arriving on time: A practical q-learning method

مواد مشابهة