Mapping of aliases to points of interest

In 2006, the social networking messaging service was created and allowed users to publish short messages of 140 characters. Through these short messages, also known as tweets, it allows users to publish their thoughts, opinions, daily activities and other valuable information. As users sometimes exp...

Full description

Saved in:
Bibliographic Details
Main Author: Ng, Jun Hao
Other Authors: Cong Gao
Format: Final Year Project
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/10356/74300
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-74300
record_format dspace
spelling sg-ntu-dr.10356-743002023-03-03T20:41:55Z Mapping of aliases to points of interest Ng, Jun Hao Cong Gao School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering In 2006, the social networking messaging service was created and allowed users to publish short messages of 140 characters. Through these short messages, also known as tweets, it allows users to publish their thoughts, opinions, daily activities and other valuable information. As users sometimes explicitly and implicitly reveal their location through the tweets, we can extract valuable information such as a user’s current location or locations they have been at a very fine granularity. Being able to recognise such location mentions and then mapping it to a well-defined location is a huge challenge. Tweets are often short, containing only 12-16 words on average, ungrammatical and informal in nature and locations are often mentioned with acronyms, incomplete names and slang terms. In this solution, it will first identify potential location aliases (variations of location entity mention) found within a Tweet itself using grammatical features through a trained classifier. In order to overcome the lack of context in Tweet, I have utilised the Google search engine to enrich the Tweet itself and post-process the potential aliases identified. Moreover, the solution will be able to map a specific location entity (which will be referred to as Point of Interest[POI]) with an alias. Finally, a list of POIs and aliases for each tweet will be outputted. In the evaluation section, you will be able to find the performance of the proposed solution. Bachelor of Engineering (Computer Science) 2018-05-15T03:34:52Z 2018-05-15T03:34:52Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74300 en Nanyang Technological University 44 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Ng, Jun Hao
Mapping of aliases to points of interest
description In 2006, the social networking messaging service was created and allowed users to publish short messages of 140 characters. Through these short messages, also known as tweets, it allows users to publish their thoughts, opinions, daily activities and other valuable information. As users sometimes explicitly and implicitly reveal their location through the tweets, we can extract valuable information such as a user’s current location or locations they have been at a very fine granularity. Being able to recognise such location mentions and then mapping it to a well-defined location is a huge challenge. Tweets are often short, containing only 12-16 words on average, ungrammatical and informal in nature and locations are often mentioned with acronyms, incomplete names and slang terms. In this solution, it will first identify potential location aliases (variations of location entity mention) found within a Tweet itself using grammatical features through a trained classifier. In order to overcome the lack of context in Tweet, I have utilised the Google search engine to enrich the Tweet itself and post-process the potential aliases identified. Moreover, the solution will be able to map a specific location entity (which will be referred to as Point of Interest[POI]) with an alias. Finally, a list of POIs and aliases for each tweet will be outputted. In the evaluation section, you will be able to find the performance of the proposed solution.
author2 Cong Gao
author_facet Cong Gao
Ng, Jun Hao
format Final Year Project
author Ng, Jun Hao
author_sort Ng, Jun Hao
title Mapping of aliases to points of interest
title_short Mapping of aliases to points of interest
title_full Mapping of aliases to points of interest
title_fullStr Mapping of aliases to points of interest
title_full_unstemmed Mapping of aliases to points of interest
title_sort mapping of aliases to points of interest
publishDate 2018
url http://hdl.handle.net/10356/74300
_version_ 1759856696257150976