Interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization

Interacting with the surrounding road users is crucial for autonomous vehicles (AV). However, the inherent multimodality and uncertainties associated with traffic participants (TP) pose challenges in AVs' prediction and decision-making (PnD). A primary challenge is adapting predictors trained o...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhang, Yiran, Lou, Shanhe, Hang, Peng, Huang, Wenhui, Yang, Lie, Lv, Chen
Other Authors: School of Mechanical and Aerospace Engineering
Format: Article
Language:English
Published: 2025
Subjects:
Online Access:https://hdl.handle.net/10356/182434
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-182434
record_format dspace
spelling sg-ntu-dr.10356-1824342025-02-03T02:21:06Z Interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization Zhang, Yiran Lou, Shanhe Hang, Peng Huang, Wenhui Yang, Lie Lv, Chen School of Mechanical and Aerospace Engineering Engineering Trajectory prediction Autonomous driving Interacting with the surrounding road users is crucial for autonomous vehicles (AV). However, the inherent multimodality and uncertainties associated with traffic participants (TP) pose challenges in AVs' prediction and decision-making (PnD). A primary challenge is adapting predictors trained on static offline datasets to the dynamic, diverse data streams encountered in reality. Secondly, utilizing one single forecast trajectory with the highest probability for decision-making contains potential risks as it neglects that even a small probability represents a subset of TP behaviors. Based on the existing prediction backbone, we propose an online learning approach incorporating pseudo-labels inferred from partial feedback as compensation for conventional methodologies, considering both the commonsense and personalization facets of driving. Drawing inspiration from the second law of thermodynamics, we propose to minimize microscopic traffic entropy as an additional objective in decision-making. This objective aims to reduce the chaos of traffic scenes, thus achieving more predictable future interactions and, conversely, making future decisions easier. Through real-time human-in-the-loop experiments, we quantifiably and comparably reveal that adopting one single trajectory without online learning in PnD is risky. However, this reliability is verified to be significantly improved by our proposed techniques, and the efficacy is further analyzed in a subsequent qualitative study. A static experiment transferring the prediction algorithm trained exclusively on Argoverse 2 to datasets including NGSIM, HighD, RounD, and NuScenes is also conducted, demonstrating that the proposed correction can effectively mitigate the gap between the datasets and real-world scenarios. Agency for Science, Technology and Research (A*STAR) Ministry of Education (MOE) National Research Foundation (NRF) This work was supported in part by the Agency for Science, Technology and Research (A*STAR), Singapore, under the MTC Individual Research under Grant M22K2c0079, in part by the ANR-NRF Joint under Grant NRF2021-NRF-ANR003 HM Science, and in part by the Ministry of Education (MOE), Singapore, under the Tier 2 under Grant MOE-T2EP50222-0002. 2025-02-03T02:21:06Z 2025-02-03T02:21:06Z 2024 Journal Article Zhang, Y., Lou, S., Hang, P., Huang, W., Yang, L. & Lv, C. (2024). Interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization. IEEE Transactions On Intelligent Transportation Systems, 25(11), 17718-17732. https://dx.doi.org/10.1109/TITS.2024.3419003 1524-9050 https://hdl.handle.net/10356/182434 10.1109/TITS.2024.3419003 2-s2.0-85208709421 11 25 17718 17732 en M22K2c0079 NRF2021-NRF-ANR003 HM Science MOE-T2EP50222-0002 IEEE Transactions on Intelligent Transportation Systems © 2024 IEEE. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering
Trajectory prediction
Autonomous driving
spellingShingle Engineering
Trajectory prediction
Autonomous driving
Zhang, Yiran
Lou, Shanhe
Hang, Peng
Huang, Wenhui
Yang, Lie
Lv, Chen
Interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization
description Interacting with the surrounding road users is crucial for autonomous vehicles (AV). However, the inherent multimodality and uncertainties associated with traffic participants (TP) pose challenges in AVs' prediction and decision-making (PnD). A primary challenge is adapting predictors trained on static offline datasets to the dynamic, diverse data streams encountered in reality. Secondly, utilizing one single forecast trajectory with the highest probability for decision-making contains potential risks as it neglects that even a small probability represents a subset of TP behaviors. Based on the existing prediction backbone, we propose an online learning approach incorporating pseudo-labels inferred from partial feedback as compensation for conventional methodologies, considering both the commonsense and personalization facets of driving. Drawing inspiration from the second law of thermodynamics, we propose to minimize microscopic traffic entropy as an additional objective in decision-making. This objective aims to reduce the chaos of traffic scenes, thus achieving more predictable future interactions and, conversely, making future decisions easier. Through real-time human-in-the-loop experiments, we quantifiably and comparably reveal that adopting one single trajectory without online learning in PnD is risky. However, this reliability is verified to be significantly improved by our proposed techniques, and the efficacy is further analyzed in a subsequent qualitative study. A static experiment transferring the prediction algorithm trained exclusively on Argoverse 2 to datasets including NGSIM, HighD, RounD, and NuScenes is also conducted, demonstrating that the proposed correction can effectively mitigate the gap between the datasets and real-world scenarios.
author2 School of Mechanical and Aerospace Engineering
author_facet School of Mechanical and Aerospace Engineering
Zhang, Yiran
Lou, Shanhe
Hang, Peng
Huang, Wenhui
Yang, Lie
Lv, Chen
format Article
author Zhang, Yiran
Lou, Shanhe
Hang, Peng
Huang, Wenhui
Yang, Lie
Lv, Chen
author_sort Zhang, Yiran
title Interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization
title_short Interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization
title_full Interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization
title_fullStr Interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization
title_full_unstemmed Interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization
title_sort interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization
publishDate 2025
url https://hdl.handle.net/10356/182434
_version_ 1823108740573822976