Exploiting contextual information for fine-grained tweet geolocation
The problem of fine-grained tweet geolocation is to link tweets to their posting venues. We solve this in a learning to rank framework by ranking candidate venues given a test tweet. The problem is challenging as tweets are short and the vast majority are non-geocoded, meaning information is sparse...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2017
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/3656 https://ink.library.smu.edu.sg/context/sis_research/article/4658/viewcontent/17._May01_2017___Exploiting_Contextual_Information_for_Fine_grained_Tweet_Geolocation__ICWSM17_.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-4658 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-46582021-03-26T07:21:43Z Exploiting contextual information for fine-grained tweet geolocation CHONG, Wen Haw LIM, Ee Peng The problem of fine-grained tweet geolocation is to link tweets to their posting venues. We solve this in a learning to rank framework by ranking candidate venues given a test tweet. The problem is challenging as tweets are short and the vast majority are non-geocoded, meaning information is sparse for building models. Nonetheless, although only a small fraction of tweets are geocoded, we find that they are posted by a substantial proportion of users. Essentially, such users have location history data. Along with tweet posting time, these serve as additional contextual information for geolocation. In designing our geolocation models, we also utilize the properties of (1) spatial focus where users are more likely to visit venues near each other and (2) spatial homophily where venues near each other tend to share more similar tweet content, compared to venues further apart. Our proposed model significantly outperforms the content-only approaches. 2017-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3656 https://ink.library.smu.edu.sg/context/sis_research/article/4658/viewcontent/17._May01_2017___Exploiting_Contextual_Information_for_Fine_grained_Tweet_Geolocation__ICWSM17_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Building model Contextual information Fine grained Geolocations Homophily Learning to rank Location history Databases and Information Systems Social Media |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Building model Contextual information Fine grained Geolocations Homophily Learning to rank Location history Databases and Information Systems Social Media |
spellingShingle |
Building model Contextual information Fine grained Geolocations Homophily Learning to rank Location history Databases and Information Systems Social Media CHONG, Wen Haw LIM, Ee Peng Exploiting contextual information for fine-grained tweet geolocation |
description |
The problem of fine-grained tweet geolocation is to link tweets to their posting venues. We solve this in a learning to rank framework by ranking candidate venues given a test tweet. The problem is challenging as tweets are short and the vast majority are non-geocoded, meaning information is sparse for building models. Nonetheless, although only a small fraction of tweets are geocoded, we find that they are posted by a substantial proportion of users. Essentially, such users have location history data. Along with tweet posting time, these serve as additional contextual information for geolocation. In designing our geolocation models, we also utilize the properties of (1) spatial focus where users are more likely to visit venues near each other and (2) spatial homophily where venues near each other tend to share more similar tweet content, compared to venues further apart. Our proposed model significantly outperforms the content-only approaches. |
format |
text |
author |
CHONG, Wen Haw LIM, Ee Peng |
author_facet |
CHONG, Wen Haw LIM, Ee Peng |
author_sort |
CHONG, Wen Haw |
title |
Exploiting contextual information for fine-grained tweet geolocation |
title_short |
Exploiting contextual information for fine-grained tweet geolocation |
title_full |
Exploiting contextual information for fine-grained tweet geolocation |
title_fullStr |
Exploiting contextual information for fine-grained tweet geolocation |
title_full_unstemmed |
Exploiting contextual information for fine-grained tweet geolocation |
title_sort |
exploiting contextual information for fine-grained tweet geolocation |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2017 |
url |
https://ink.library.smu.edu.sg/sis_research/3656 https://ink.library.smu.edu.sg/context/sis_research/article/4658/viewcontent/17._May01_2017___Exploiting_Contextual_Information_for_Fine_grained_Tweet_Geolocation__ICWSM17_.pdf |
_version_ |
1770573404511928320 |