Toward personalized decision making for autonomous vehicles: a constrained multi-objective reinforcement learning technique
Reinforcement learning promises to provide a state-of-the-art solution to the decision making problem of autonomous driving. Nonetheless, numerous real-world decision making problems involve balancing multiple conflicting or competing objectives. In addition, passengers may typically prefer to explo...
Saved in:
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/171196 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-171196 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1711962023-10-17T04:43:56Z Toward personalized decision making for autonomous vehicles: a constrained multi-objective reinforcement learning technique He, Xiangkun Lv, Chen School of Mechanical and Aerospace Engineering Engineering::Mechanical engineering Autonomous Vehicle Personalized Decision Making Reinforcement learning promises to provide a state-of-the-art solution to the decision making problem of autonomous driving. Nonetheless, numerous real-world decision making problems involve balancing multiple conflicting or competing objectives. In addition, passengers may typically prefer to explore diversified driving modes through their specific preferences (i.e., relative importance of different objectives). Taking into account these demands, traditional reinforcement learning algorithms with applications in personalized self-driving vehicles remain challenging. Consequently, here we present a novel constrained multi-objective reinforcement learning technique for personalized decision making in autonomous driving, with the goal of learning a single model for Pareto optimal policies across the space of all possible user preferences. Specifically, a nonlinear constraint incorporating a user-specified preference and a vectorized action–value function is introduced to ensure both diversity in learned decision behaviors and efficient alignment between the user-specified preference and the corresponding optimal policy. Additionally, a constrained multi-objective actor–critic approach is advanced to approximate the Pareto optimal policies for any user-specified preferences while adhering to the nonlinear constraint. Finally, the proposed personalized decision making scheme for autonomous driving is assessed in a highway on-ramp merging scenario with dynamic traffic flows. The results demonstrate the effectiveness of our method by comparing it with classical and state-of-the-art baselines. Agency for Science, Technology and Research (A*STAR) Nanyang Technological University This work was supported in part by A*STAR AME Young Individual Research Grant (No. A2084c0156), the MTC Individual Research Grants (No. M22K2c0079), the ANR-NRF joint grant (No.NRF2021-NRF-ANR003 HM Science), and SUG-NAP Grant of Nanyang Technological University, Singapore. 2023-10-17T04:43:56Z 2023-10-17T04:43:56Z 2023 Journal Article He, X. & Lv, C. (2023). Toward personalized decision making for autonomous vehicles: a constrained multi-objective reinforcement learning technique. Transportation Research Part C: Emerging Technologies, 156, 104352-. https://dx.doi.org/10.1016/j.trc.2023.104352 0968-090X https://hdl.handle.net/10356/171196 10.1016/j.trc.2023.104352 2-s2.0-85171561567 156 104352 en A2084c0156 M22K2c0079 NRF2021-NRF-ANR003 HM Science Transportation Research Part C: Emerging Technologies © 2023 Elsevier Ltd. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Mechanical engineering Autonomous Vehicle Personalized Decision Making |
spellingShingle |
Engineering::Mechanical engineering Autonomous Vehicle Personalized Decision Making He, Xiangkun Lv, Chen Toward personalized decision making for autonomous vehicles: a constrained multi-objective reinforcement learning technique |
description |
Reinforcement learning promises to provide a state-of-the-art solution to the decision making problem of autonomous driving. Nonetheless, numerous real-world decision making problems involve balancing multiple conflicting or competing objectives. In addition, passengers may typically prefer to explore diversified driving modes through their specific preferences (i.e., relative importance of different objectives). Taking into account these demands, traditional reinforcement learning algorithms with applications in personalized self-driving vehicles remain challenging. Consequently, here we present a novel constrained multi-objective reinforcement learning technique for personalized decision making in autonomous driving, with the goal of learning a single model for Pareto optimal policies across the space of all possible user preferences. Specifically, a nonlinear constraint incorporating a user-specified preference and a vectorized action–value function is introduced to ensure both diversity in learned decision behaviors and efficient alignment between the user-specified preference and the corresponding optimal policy. Additionally, a constrained multi-objective actor–critic approach is advanced to approximate the Pareto optimal policies for any user-specified preferences while adhering to the nonlinear constraint. Finally, the proposed personalized decision making scheme for autonomous driving is assessed in a highway on-ramp merging scenario with dynamic traffic flows. The results demonstrate the effectiveness of our method by comparing it with classical and state-of-the-art baselines. |
author2 |
School of Mechanical and Aerospace Engineering |
author_facet |
School of Mechanical and Aerospace Engineering He, Xiangkun Lv, Chen |
format |
Article |
author |
He, Xiangkun Lv, Chen |
author_sort |
He, Xiangkun |
title |
Toward personalized decision making for autonomous vehicles: a constrained multi-objective reinforcement learning technique |
title_short |
Toward personalized decision making for autonomous vehicles: a constrained multi-objective reinforcement learning technique |
title_full |
Toward personalized decision making for autonomous vehicles: a constrained multi-objective reinforcement learning technique |
title_fullStr |
Toward personalized decision making for autonomous vehicles: a constrained multi-objective reinforcement learning technique |
title_full_unstemmed |
Toward personalized decision making for autonomous vehicles: a constrained multi-objective reinforcement learning technique |
title_sort |
toward personalized decision making for autonomous vehicles: a constrained multi-objective reinforcement learning technique |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/171196 |
_version_ |
1781793680246439936 |