Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset
In this paper, we introduce a variant of the density peaks clustering (DPC) approach for discovering demand hot spots from a low-frequency, low-quality taxi fleet operational dataset. From the literature, the DPC approach mainly uses density peaks as features to discover potential cluster centers, a...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2015
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/2930 https://ink.library.smu.edu.sg/context/sis_research/article/3930/viewcontent/10.1109_ITSC.2015.297.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-3930 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-39302019-11-18T06:19:55Z Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset LIU, Dongchang CHENG, Shih-Fen YANG, Yiping In this paper, we introduce a variant of the density peaks clustering (DPC) approach for discovering demand hot spots from a low-frequency, low-quality taxi fleet operational dataset. From the literature, the DPC approach mainly uses density peaks as features to discover potential cluster centers, and this requires distances between all pairs of data points to be calculated. This implies that the DPC approach can only be applied to cases with relatively small numbers of data points. For the domain of urban taxi operations that we are interested in, we could have millions of demand points per day, and calculating all-pair distances between all demand points would be practically impossible, thus making DPC approach not applicable. To address this issue, we project all points to a density image and execute our variant of the DPC algorithm on the processed image. Experiment results show that our proposed DPC variant could get similar results as original DPC, yet with much shorter execution time and lower memory consumption. By running our DPC variant on a real-world dataset collected in Singapore, we show that there are indeed recurrent demand hot spots within the central business district that are not covered by the current taxi stand design. Our approach could be of use to both taxi fleet operator and traffic planners in guiding drivers and setting up taxi stands. 2015-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2930 info:doi/10.1109/ITSC.2015.297 https://ink.library.smu.edu.sg/context/sis_research/article/3930/viewcontent/10.1109_ITSC.2015.297.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Artificial Intelligence and Robotics |
spellingShingle |
Artificial Intelligence and Robotics LIU, Dongchang CHENG, Shih-Fen YANG, Yiping Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset |
description |
In this paper, we introduce a variant of the density peaks clustering (DPC) approach for discovering demand hot spots from a low-frequency, low-quality taxi fleet operational dataset. From the literature, the DPC approach mainly uses density peaks as features to discover potential cluster centers, and this requires distances between all pairs of data points to be calculated. This implies that the DPC approach can only be applied to cases with relatively small numbers of data points. For the domain of urban taxi operations that we are interested in, we could have millions of demand points per day, and calculating all-pair distances between all demand points would be practically impossible, thus making DPC approach not applicable. To address this issue, we project all points to a density image and execute our variant of the DPC algorithm on the processed image. Experiment results show that our proposed DPC variant could get similar results as original DPC, yet with much shorter execution time and lower memory consumption. By running our DPC variant on a real-world dataset collected in Singapore, we show that there are indeed recurrent demand hot spots within the central business district that are not covered by the current taxi stand design. Our approach could be of use to both taxi fleet operator and traffic planners in guiding drivers and setting up taxi stands. |
format |
text |
author |
LIU, Dongchang CHENG, Shih-Fen YANG, Yiping |
author_facet |
LIU, Dongchang CHENG, Shih-Fen YANG, Yiping |
author_sort |
LIU, Dongchang |
title |
Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset |
title_short |
Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset |
title_full |
Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset |
title_fullStr |
Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset |
title_full_unstemmed |
Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset |
title_sort |
density peaks clustering approach for discovering demand hot spots in city-scale taxi fleet dataset |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2015 |
url |
https://ink.library.smu.edu.sg/sis_research/2930 https://ink.library.smu.edu.sg/context/sis_research/article/3930/viewcontent/10.1109_ITSC.2015.297.pdf |
_version_ |
1770572757495447552 |