Tweet Geolocation: Leveraging location, user and peer signals

Which venue is a tweet posted from? We referred this as fine-grained geolocation. To solve this problem effectively, we develop novel techniques to exploit each posting user's content history. This is motivated by our finding that most users do not share their visitation history, but have ample...

Full description

Saved in:
Bibliographic Details
Main Authors: CHONG, Wen-Haw, LIM, Ee Peng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2017
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3956
https://ink.library.smu.edu.sg/context/sis_research/article/4958/viewcontent/TweetGeolocation_2017_afv.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4958
record_format dspace
spelling sg-smu-ink.sis_research-49582018-12-07T05:11:13Z Tweet Geolocation: Leveraging location, user and peer signals CHONG, Wen-Haw LIM, Ee Peng Which venue is a tweet posted from? We referred this as fine-grained geolocation. To solve this problem effectively, we develop novel techniques to exploit each posting user's content history. This is motivated by our finding that most users do not share their visitation history, but have ample content history from tweet posts. We formulate fine-grained geolocation as a ranking problem whereby given a test tweet, we rank candidate venues. We propose several models that leverage on three types of signals from locations, users and peers. Firstly, the location signals are words that are indicative of venues. We propose a location-indicative weighting scheme to capture this. Next we exploit user signals from each user's content history to enrich the very limited content of their tweets which have been targeted for geolocation. The intuition is that the user's other tweets may have been from the test venue or related venues, thus providing informative words. In this regard, we propose query expansion as the enrichment approach. Finally, we exploit the signals from peer users who have similar content history and thus potentially similar visitation behavior as the users of the test tweets. This suggests collaborative filtering where visitation information is propagated via content similarities. We proposed several models incorporating different combinations of the three signals. Our experiments show that the best model incorporates all three signals. It performs 6% to 40% better than the baselines depending on the metric and dataset. 2017-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3956 info:doi/10.1145/3132847.3132906 https://ink.library.smu.edu.sg/context/sis_research/article/4958/viewcontent/TweetGeolocation_2017_afv.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Query expansion Tweet geolocation Collaborative filtering Databases and Information Systems Social Media
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Query expansion
Tweet geolocation
Collaborative filtering
Databases and Information Systems
Social Media
spellingShingle Query expansion
Tweet geolocation
Collaborative filtering
Databases and Information Systems
Social Media
CHONG, Wen-Haw
LIM, Ee Peng
Tweet Geolocation: Leveraging location, user and peer signals
description Which venue is a tweet posted from? We referred this as fine-grained geolocation. To solve this problem effectively, we develop novel techniques to exploit each posting user's content history. This is motivated by our finding that most users do not share their visitation history, but have ample content history from tweet posts. We formulate fine-grained geolocation as a ranking problem whereby given a test tweet, we rank candidate venues. We propose several models that leverage on three types of signals from locations, users and peers. Firstly, the location signals are words that are indicative of venues. We propose a location-indicative weighting scheme to capture this. Next we exploit user signals from each user's content history to enrich the very limited content of their tweets which have been targeted for geolocation. The intuition is that the user's other tweets may have been from the test venue or related venues, thus providing informative words. In this regard, we propose query expansion as the enrichment approach. Finally, we exploit the signals from peer users who have similar content history and thus potentially similar visitation behavior as the users of the test tweets. This suggests collaborative filtering where visitation information is propagated via content similarities. We proposed several models incorporating different combinations of the three signals. Our experiments show that the best model incorporates all three signals. It performs 6% to 40% better than the baselines depending on the metric and dataset.
format text
author CHONG, Wen-Haw
LIM, Ee Peng
author_facet CHONG, Wen-Haw
LIM, Ee Peng
author_sort CHONG, Wen-Haw
title Tweet Geolocation: Leveraging location, user and peer signals
title_short Tweet Geolocation: Leveraging location, user and peer signals
title_full Tweet Geolocation: Leveraging location, user and peer signals
title_fullStr Tweet Geolocation: Leveraging location, user and peer signals
title_full_unstemmed Tweet Geolocation: Leveraging location, user and peer signals
title_sort tweet geolocation: leveraging location, user and peer signals
publisher Institutional Knowledge at Singapore Management University
publishDate 2017
url https://ink.library.smu.edu.sg/sis_research/3956
https://ink.library.smu.edu.sg/context/sis_research/article/4958/viewcontent/TweetGeolocation_2017_afv.pdf
_version_ 1770574057302917120