Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset

In this paper, we introduce a variant of the density peaks clustering (DPC) approach for discovering demand hot spots from a low-frequency, low-quality taxi fleet operational dataset. From the literature, the DPC approach mainly uses density peaks as features to discover potential cluster centers, a...

Full description

Saved in:
Bibliographic Details
Main Authors: LIU, Dongchang, CHENG, Shih-Fen, YANG, Yiping
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2015
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/2930
https://ink.library.smu.edu.sg/context/sis_research/article/3930/viewcontent/10.1109_ITSC.2015.297.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-3930
record_format dspace
spelling sg-smu-ink.sis_research-39302019-11-18T06:19:55Z Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset LIU, Dongchang CHENG, Shih-Fen YANG, Yiping In this paper, we introduce a variant of the density peaks clustering (DPC) approach for discovering demand hot spots from a low-frequency, low-quality taxi fleet operational dataset. From the literature, the DPC approach mainly uses density peaks as features to discover potential cluster centers, and this requires distances between all pairs of data points to be calculated. This implies that the DPC approach can only be applied to cases with relatively small numbers of data points. For the domain of urban taxi operations that we are interested in, we could have millions of demand points per day, and calculating all-pair distances between all demand points would be practically impossible, thus making DPC approach not applicable. To address this issue, we project all points to a density image and execute our variant of the DPC algorithm on the processed image. Experiment results show that our proposed DPC variant could get similar results as original DPC, yet with much shorter execution time and lower memory consumption. By running our DPC variant on a real-world dataset collected in Singapore, we show that there are indeed recurrent demand hot spots within the central business district that are not covered by the current taxi stand design. Our approach could be of use to both taxi fleet operator and traffic planners in guiding drivers and setting up taxi stands. 2015-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2930 info:doi/10.1109/ITSC.2015.297 https://ink.library.smu.edu.sg/context/sis_research/article/3930/viewcontent/10.1109_ITSC.2015.297.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Artificial Intelligence and Robotics
spellingShingle Artificial Intelligence and Robotics
LIU, Dongchang
CHENG, Shih-Fen
YANG, Yiping
Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset
description In this paper, we introduce a variant of the density peaks clustering (DPC) approach for discovering demand hot spots from a low-frequency, low-quality taxi fleet operational dataset. From the literature, the DPC approach mainly uses density peaks as features to discover potential cluster centers, and this requires distances between all pairs of data points to be calculated. This implies that the DPC approach can only be applied to cases with relatively small numbers of data points. For the domain of urban taxi operations that we are interested in, we could have millions of demand points per day, and calculating all-pair distances between all demand points would be practically impossible, thus making DPC approach not applicable. To address this issue, we project all points to a density image and execute our variant of the DPC algorithm on the processed image. Experiment results show that our proposed DPC variant could get similar results as original DPC, yet with much shorter execution time and lower memory consumption. By running our DPC variant on a real-world dataset collected in Singapore, we show that there are indeed recurrent demand hot spots within the central business district that are not covered by the current taxi stand design. Our approach could be of use to both taxi fleet operator and traffic planners in guiding drivers and setting up taxi stands.
format text
author LIU, Dongchang
CHENG, Shih-Fen
YANG, Yiping
author_facet LIU, Dongchang
CHENG, Shih-Fen
YANG, Yiping
author_sort LIU, Dongchang
title Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset
title_short Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset
title_full Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset
title_fullStr Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset
title_full_unstemmed Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset
title_sort density peaks clustering approach for discovering demand hot spots in city-scale taxi fleet dataset
publisher Institutional Knowledge at Singapore Management University
publishDate 2015
url https://ink.library.smu.edu.sg/sis_research/2930
https://ink.library.smu.edu.sg/context/sis_research/article/3930/viewcontent/10.1109_ITSC.2015.297.pdf
_version_ 1770572757495447552