Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road

Early risk diagnosis and driving anomaly detection from vehicle stream are of great benefit to a range of advanced solutions for Smart Road and crash prevention, although there are intrinsic challenges, especially the lack of ground truth and the definition of multiple risk exposures. This study proposes...

Bibliographic Details
Main Authors: Shi, Xiupeng, Wong, Yiik Diew, Chai, Chen, Li, Michael Zhi Feng, Chen, Tianyi, Zeng, Zeng
Other Authors: School of Civil and Environmental Engineering
Format: Article
Language: English
Published: 2022
Subjects: Engineering::Civil engineering; Automatic Clustering; Unsupervised Feature Selection
Online Access: https://hdl.handle.net/10356/162965
Institution: Nanyang Technological University
id sg-ntu-dr.10356-162965
record_format dspace
spelling sg-ntu-dr.10356-162965
2023-05-19T07:31:19Z
Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road
Shi, Xiupeng
Wong, Yiik Diew
Chai, Chen
Li, Michael Zhi Feng
Chen, Tianyi
Zeng, Zeng
School of Civil and Environmental Engineering
Nanyang Business School
Engineering::Civil engineering
Automatic Clustering
Unsupervised Feature Selection
Early risk diagnosis and driving anomaly detection from vehicle stream are of great benefit to a range of advanced solutions for Smart Road and crash prevention, although there are intrinsic challenges, especially the lack of ground truth and the definition of multiple risk exposures. This study proposes a domain-specific automatic clustering method (termed AutoCluster) to self-learn the optimal models for unsupervised risk assessment, which integrates the key steps of clustering into an auto-optimisable pipeline, including feature selection, algorithm selection, and hyperparameter auto-tuning. Firstly, based on surrogate conflict measures, a series of risk indicator features is constructed to represent temporal-spatial and kinematical risk exposures. Then, we develop an unsupervised feature selection method to identify the useful features by elimination-based model reliance importance (EMRI). Secondly, we propose the balanced Silhouette Index (bSI) to evaluate the internal quality of imbalanced clustering. A loss function is designed that considers clustering performance in terms of internal quality, inter-cluster variation, and model stability. Thirdly, based on Bayesian optimisation, algorithm selection and hyperparameter tuning are self-learned to generate the best clustering results. Herein, NGSIM vehicle trajectory data is used for test-bedding. Findings show that AutoCluster is reliable and promising for diagnosing multiple distinct risk levels inherent to generalised driving behaviour. We also delve into aspects of risk clustering such as algorithm heterogeneity, Silhouette analysis, and hierarchical clustering flows. Meanwhile, AutoCluster is also a method for unsupervised data labelling and indicator threshold calibration. Furthermore, AutoCluster is useful for tackling the challenges of imbalanced clustering without ground truth or a priori knowledge.
This work was supported in part by the National Key Research and Development Program of China under Grant 2018YFB1600502, in part by the Chinese National Science Foundation under Grant 61803283, in part by the Shanghai Municipal Education Commission and Shanghai Education Development Foundation under the “Chen Guang” Project (18CG17), and in part by the Shanghai Municipal Science and Technology Major Project (2021SHZDZX0100).
2022-11-14T02:25:14Z
2022-11-14T02:25:14Z
2022
Journal Article
Shi, X., Wong, Y. D., Chai, C., Li, M. Z. F., Chen, T. & Zeng, Z. (2022). Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road. IEEE Transactions on Intelligent Transportation Systems, 23(10), 17451-17465. https://dx.doi.org/10.1109/TITS.2022.3166838
1524-9050
https://hdl.handle.net/10356/162965
10.1109/TITS.2022.3166838
2-s2.0-85129426078
10
23
17451
17465
en
IEEE Transactions on Intelligent Transportation Systems
© 2022 IEEE. All rights reserved.
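The abstract names a balanced Silhouette Index (bSI) for imbalanced clustering but does not define it in this record. The sketch below is therefore not the authors' formula. It illustrates one plausible reading, assumed here purely for illustration: averaging the per-cluster mean silhouettes so that a small "risky" cluster counts as much as a large "normal" one. The function names (`silhouette_values`, `mean_silhouette`, `macro_silhouette`) and the toy data are hypothetical.

```python
import math
from collections import defaultdict


def silhouette_values(points, labels):
    """Return the standard silhouette s(i) = (b - a) / max(a, b) for each point."""
    clusters = defaultdict(list)
    for idx, lab in enumerate(labels):
        clusters[lab].append(idx)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    values = []
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            values.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other members of the same cluster
        a = sum(math.dist(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(math.dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        values.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return values


def mean_silhouette(points, labels):
    """Plain silhouette index: every point weighs the same."""
    values = silhouette_values(points, labels)
    return sum(values) / len(values)


def macro_silhouette(points, labels):
    """ASSUMED 'balanced' variant: every cluster weighs the same."""
    per_cluster = defaultdict(list)
    for s, lab in zip(silhouette_values(points, labels), labels):
        per_cluster[lab].append(s)
    return sum(sum(v) / len(v) for v in per_cluster.values()) / len(per_cluster)


# Made-up toy data: a large, tight group and a small, loose group.
points = [(0.1 * k, 0.0) for k in range(10)] + [(3.0, 0.0), (5.0, 0.0)]
labels = [0] * 10 + [1] * 2

print("point-averaged silhouette:  ", round(mean_silhouette(points, labels), 3))
print("cluster-averaged silhouette:", round(macro_silhouette(points, labels), 3))
```

On this toy data the point-averaged score is dominated by the large cluster, while the cluster-averaged score is pulled down by the poorly separated small cluster. That is the kind of sensitivity an imbalance-aware index is meant to provide.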
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Civil engineering
Automatic Clustering
Unsupervised Feature Selection
author2 School of Civil and Environmental Engineering
format Article
author Shi, Xiupeng
Wong, Yiik Diew
Chai, Chen
Li, Michael Zhi Feng
Chen, Tianyi
Zeng, Zeng
author_sort Shi, Xiupeng
title Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road
publishDate 2022
url https://hdl.handle.net/10356/162965
_version_ 1772827363454746624