Urban region representation learning with human trajectories: a multi-view approach incorporating transition, spatial, and temporal perspectives

Mining latent information from human trajectories for understanding our cities has been persistent endeavors in urban studies and spatial information science. Many previous studies relied on manually crafted features and followed a supervised learning pipeline for a particular task, e.g. land use cl...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhang, Yu, Huang, Weiming, Yao, Yao, Gao, Song, Cui, Lizhen, Yan, Zhongmin
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181780
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181780
record_format dspace
spelling sg-ntu-dr.10356-1817802024-12-20T15:37:41Z Urban region representation learning with human trajectories: a multi-view approach incorporating transition, spatial, and temporal perspectives Zhang, Yu Huang, Weiming Yao, Yao Gao, Song Cui, Lizhen Yan, Zhongmin School of Computer Science and Engineering Computer and Information Science Urban region embedding Human trajectories Mining latent information from human trajectories for understanding our cities has been persistent endeavors in urban studies and spatial information science. Many previous studies relied on manually crafted features and followed a supervised learning pipeline for a particular task, e.g. land use classification. However, such methods often overlook some types of latent information and the commonalities between varying urban sensing tasks, making the features engineered for one specific task sometimes not useful in other tasks. To tackle the limitations, we propose a multi-view trajectory embedding (MTE) approach to learn the features of urban regions (region representations) in an unsupervised manner, which does not rely on a specific task and thus can be generalized to varying urban sensing tasks. Specifically, MTE incorporates three salient information views carried by human trajectories, i.e. transition, spatial, and temporal views. We utilize skip-gram to model human transition patterns exhibited from massive amounts of human trajectories, where long-range dependency is meaningful. Subsequently, we leverage unsupervised graph representation learning to model spatial adjacency and temporal pattern similarities, where short-range dependency is favorable. We perform extensive experiments on three downstream tasks, i.e. land use classification, population density estimation, and house price prediction. The results indicate that MTE considerably outperforms a series of competitive baselines in all three tasks, and different information views have varying levels of effectiveness in particular downstream tasks, e.g. the temporal view is more effective than the spatial view in land use classification, while it is the opposite in house price prediction. Published version This work was funded in part by the National Natural Science Foundation of China [No. 42101421 and 92367202]; the Knut and Alice Wallenberg Foundation [No. KAW 2019.0550]; the “CUG Scholar” Scientific Research Funds at China University of Geosciences (Wuhan) [No. 2022034]; a grant from State Key Laboratory of Resources and Environmental Information System. 2024-12-17T06:23:26Z 2024-12-17T06:23:26Z 2024 Journal Article Zhang, Y., Huang, W., Yao, Y., Gao, S., Cui, L. & Yan, Z. (2024). Urban region representation learning with human trajectories: a multi-view approach incorporating transition, spatial, and temporal perspectives. GIScience and Remote Sensing, 61(1), 2387392-. https://dx.doi.org/10.1080/15481603.2024.2387392 1548-1603 https://hdl.handle.net/10356/181780 10.1080/15481603.2024.2387392 2-s2.0-85203257647 1 61 2387392 en GIScience and Remote Sensing © 2024 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Urban region embedding
Human trajectories
spellingShingle Computer and Information Science
Urban region embedding
Human trajectories
Zhang, Yu
Huang, Weiming
Yao, Yao
Gao, Song
Cui, Lizhen
Yan, Zhongmin
Urban region representation learning with human trajectories: a multi-view approach incorporating transition, spatial, and temporal perspectives
description Mining latent information from human trajectories for understanding our cities has been persistent endeavors in urban studies and spatial information science. Many previous studies relied on manually crafted features and followed a supervised learning pipeline for a particular task, e.g. land use classification. However, such methods often overlook some types of latent information and the commonalities between varying urban sensing tasks, making the features engineered for one specific task sometimes not useful in other tasks. To tackle the limitations, we propose a multi-view trajectory embedding (MTE) approach to learn the features of urban regions (region representations) in an unsupervised manner, which does not rely on a specific task and thus can be generalized to varying urban sensing tasks. Specifically, MTE incorporates three salient information views carried by human trajectories, i.e. transition, spatial, and temporal views. We utilize skip-gram to model human transition patterns exhibited from massive amounts of human trajectories, where long-range dependency is meaningful. Subsequently, we leverage unsupervised graph representation learning to model spatial adjacency and temporal pattern similarities, where short-range dependency is favorable. We perform extensive experiments on three downstream tasks, i.e. land use classification, population density estimation, and house price prediction. The results indicate that MTE considerably outperforms a series of competitive baselines in all three tasks, and different information views have varying levels of effectiveness in particular downstream tasks, e.g. the temporal view is more effective than the spatial view in land use classification, while it is the opposite in house price prediction.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Zhang, Yu
Huang, Weiming
Yao, Yao
Gao, Song
Cui, Lizhen
Yan, Zhongmin
format Article
author Zhang, Yu
Huang, Weiming
Yao, Yao
Gao, Song
Cui, Lizhen
Yan, Zhongmin
author_sort Zhang, Yu
title Urban region representation learning with human trajectories: a multi-view approach incorporating transition, spatial, and temporal perspectives
title_short Urban region representation learning with human trajectories: a multi-view approach incorporating transition, spatial, and temporal perspectives
title_full Urban region representation learning with human trajectories: a multi-view approach incorporating transition, spatial, and temporal perspectives
title_fullStr Urban region representation learning with human trajectories: a multi-view approach incorporating transition, spatial, and temporal perspectives
title_full_unstemmed Urban region representation learning with human trajectories: a multi-view approach incorporating transition, spatial, and temporal perspectives
title_sort urban region representation learning with human trajectories: a multi-view approach incorporating transition, spatial, and temporal perspectives
publishDate 2024
url https://hdl.handle.net/10356/181780
_version_ 1819113079759175680