Exploiting contextual information for fine-grained tweet geolocation

The problem of fine-grained tweet geolocation is to link tweets to their posting venues. We solve this in a learning to rank framework by ranking candidate venues given a test tweet. The problem is challenging as tweets are short and the vast majority are non-geocoded, meaning information is sparse...

Full description

Saved in:
Bibliographic Details
Main Authors: CHONG, Wen Haw, LIM, Ee Peng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2017
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3656
https://ink.library.smu.edu.sg/context/sis_research/article/4658/viewcontent/17._May01_2017___Exploiting_Contextual_Information_for_Fine_grained_Tweet_Geolocation__ICWSM17_.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4658
record_format dspace
spelling sg-smu-ink.sis_research-46582021-03-26T07:21:43Z Exploiting contextual information for fine-grained tweet geolocation CHONG, Wen Haw LIM, Ee Peng The problem of fine-grained tweet geolocation is to link tweets to their posting venues. We solve this in a learning to rank framework by ranking candidate venues given a test tweet. The problem is challenging as tweets are short and the vast majority are non-geocoded, meaning information is sparse for building models. Nonetheless, although only a small fraction of tweets are geocoded, we find that they are posted by a substantial proportion of users. Essentially, such users have location history data. Along with tweet posting time, these serve as additional contextual information for geolocation. In designing our geolocation models, we also utilize the properties of (1) spatial focus where users are more likely to visit venues near each other and (2) spatial homophily where venues near each other tend to share more similar tweet content, compared to venues further apart. Our proposed model significantly outperforms the content-only approaches. 2017-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3656 https://ink.library.smu.edu.sg/context/sis_research/article/4658/viewcontent/17._May01_2017___Exploiting_Contextual_Information_for_Fine_grained_Tweet_Geolocation__ICWSM17_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Building model Contextual information Fine grained Geolocations Homophily Learning to rank Location history Databases and Information Systems Social Media
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Building model
Contextual information
Fine grained
Geolocations
Homophily
Learning to rank
Location history
Databases and Information Systems
Social Media
spellingShingle Building model
Contextual information
Fine grained
Geolocations
Homophily
Learning to rank
Location history
Databases and Information Systems
Social Media
CHONG, Wen Haw
LIM, Ee Peng
Exploiting contextual information for fine-grained tweet geolocation
description The problem of fine-grained tweet geolocation is to link tweets to their posting venues. We solve this in a learning to rank framework by ranking candidate venues given a test tweet. The problem is challenging as tweets are short and the vast majority are non-geocoded, meaning information is sparse for building models. Nonetheless, although only a small fraction of tweets are geocoded, we find that they are posted by a substantial proportion of users. Essentially, such users have location history data. Along with tweet posting time, these serve as additional contextual information for geolocation. In designing our geolocation models, we also utilize the properties of (1) spatial focus where users are more likely to visit venues near each other and (2) spatial homophily where venues near each other tend to share more similar tweet content, compared to venues further apart. Our proposed model significantly outperforms the content-only approaches.
format text
author CHONG, Wen Haw
LIM, Ee Peng
author_facet CHONG, Wen Haw
LIM, Ee Peng
author_sort CHONG, Wen Haw
title Exploiting contextual information for fine-grained tweet geolocation
title_short Exploiting contextual information for fine-grained tweet geolocation
title_full Exploiting contextual information for fine-grained tweet geolocation
title_fullStr Exploiting contextual information for fine-grained tweet geolocation
title_full_unstemmed Exploiting contextual information for fine-grained tweet geolocation
title_sort exploiting contextual information for fine-grained tweet geolocation
publisher Institutional Knowledge at Singapore Management University
publishDate 2017
url https://ink.library.smu.edu.sg/sis_research/3656
https://ink.library.smu.edu.sg/context/sis_research/article/4658/viewcontent/17._May01_2017___Exploiting_Contextual_Information_for_Fine_grained_Tweet_Geolocation__ICWSM17_.pdf
_version_ 1770573404511928320