Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road
Early risk diagnosis and driving anomaly detection from vehicle stream are of great benefits in a range of advanced solutions towards Smart Road and crash prevention, although there are intrinsic challenges, especially lack of ground truth, definition of multiple risk exposures. This study proposes...
Saved in:
Main Authors: | , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/162965 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-162965 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1629652023-05-19T07:31:19Z Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road Shi, Xiupeng Wong, Yiik Diew Chai, Chen Li, Michael Zhi Feng Chen, Tianyi Zeng, Zeng School of Civil and Environmental Engineering Nanyang Business School Engineering::Civil engineering Automatic Clustering Unsupervised Feature Selection Early risk diagnosis and driving anomaly detection from vehicle stream are of great benefits in a range of advanced solutions towards Smart Road and crash prevention, although there are intrinsic challenges, especially lack of ground truth, definition of multiple risk exposures. This study proposes a domain-specific automatic clustering (termed AutoCluster) to self-learn the optimal models for unsupervised risk assessment, which integrates key steps of clustering into an auto-optimisable pipeline, including feature and algorithm selection, hyperparameter auto-tuning. Firstly, based on surrogate conflict measures, a series of risk indicator features are constructed to represent temporal-spatial and kinematical risk exposures. Then, we develop an unsupervised feature selection method to identify the useful features by elimination-based model reliance importance (EMRI). Secondly, we propose balanced Silhouette Index (bSI) to evaluate the internal quality of imbalanced clustering. A loss function is designed that considers the clustering performance in terms of internal quality, inter-cluster variation, and model stability. Thirdly, based on Bayesian optimisation, the algorithm auto-selection and hyperparameter auto-tuning are self-learned to generate the best clustering results. Herein, NGSIM vehicle trajectory data is used for test-bedding. Findings show that AutoCluster is reliable and promising to diagnose multiple distinct risk levels inherent to generalised driving behaviour. We also delve into risk clustering, such as, algorithms heterogeneity, Silhouette analysis, hierarchical clustering flows, etc. Meanwhile, the AutoCluster is also a method for unsupervised data labelling and indicator threshold calibration. Furthermore, AutoCluster is useful to tackle the challenges in imbalanced clustering without ground truth or a priori knowledge. This work was supported in part by the National Key Research and Development Program of China under Grant 2018YFB1600502, in part by the Chinese National Science Foundation under Grant 61803283, in part by the Shanghai Municipal Education Commission and Shanghai Education Development Foundation under the “Chen Guang” Project (18CG17), and in part by the Shanghai Municipal Science and Technology Major Project (2021SHZDZX0100). 2022-11-14T02:25:14Z 2022-11-14T02:25:14Z 2022 Journal Article Shi, X., Wong, Y. D., Chai, C., Li, M. Z. F., Chen, T. & Zeng, Z. (2022). Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road. IEEE Transactions On Intelligent Transportation Systems, 23(10), 17451-17465. https://dx.doi.org/10.1109/TITS.2022.3166838 1524-9050 https://hdl.handle.net/10356/162965 10.1109/TITS.2022.3166838 2-s2.0-85129426078 10 23 17451 17465 en IEEE Transactions on Intelligent Transportation Systems © 2022 IEEE. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Civil engineering Automatic Clustering Unsupervised Feature Selection |
spellingShingle |
Engineering::Civil engineering Automatic Clustering Unsupervised Feature Selection Shi, Xiupeng Wong, Yiik Diew Chai, Chen Li, Michael Zhi Feng Chen, Tianyi Zeng, Zeng Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road |
description |
Early risk diagnosis and driving anomaly detection from vehicle stream are of great benefits in a range of advanced solutions towards Smart Road and crash prevention, although there are intrinsic challenges, especially lack of ground truth, definition of multiple risk exposures. This study proposes a domain-specific automatic clustering (termed AutoCluster) to self-learn the optimal models for unsupervised risk assessment, which integrates key steps of clustering into an auto-optimisable pipeline, including feature and algorithm selection, hyperparameter auto-tuning. Firstly, based on surrogate conflict measures, a series of risk indicator features are constructed to represent temporal-spatial and kinematical risk exposures. Then, we develop an unsupervised feature selection method to identify the useful features by elimination-based model reliance importance (EMRI). Secondly, we propose balanced Silhouette Index (bSI) to evaluate the internal quality of imbalanced clustering. A loss function is designed that considers the clustering performance in terms of internal quality, inter-cluster variation, and model stability. Thirdly, based on Bayesian optimisation, the algorithm auto-selection and hyperparameter auto-tuning are self-learned to generate the best clustering results. Herein, NGSIM vehicle trajectory data is used for test-bedding. Findings show that AutoCluster is reliable and promising to diagnose multiple distinct risk levels inherent to generalised driving behaviour. We also delve into risk clustering, such as, algorithms heterogeneity, Silhouette analysis, hierarchical clustering flows, etc. Meanwhile, the AutoCluster is also a method for unsupervised data labelling and indicator threshold calibration. Furthermore, AutoCluster is useful to tackle the challenges in imbalanced clustering without ground truth or a priori knowledge. |
author2 |
School of Civil and Environmental Engineering |
author_facet |
School of Civil and Environmental Engineering Shi, Xiupeng Wong, Yiik Diew Chai, Chen Li, Michael Zhi Feng Chen, Tianyi Zeng, Zeng |
format |
Article |
author |
Shi, Xiupeng Wong, Yiik Diew Chai, Chen Li, Michael Zhi Feng Chen, Tianyi Zeng, Zeng |
author_sort |
Shi, Xiupeng |
title |
Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road |
title_short |
Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road |
title_full |
Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road |
title_fullStr |
Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road |
title_full_unstemmed |
Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road |
title_sort |
automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/162965 |
_version_ |
1772827363454746624 |