Mapping of aliases to points of interest
In 2006, the social networking messaging service was created and allowed users to publish short messages of 140 characters. Through these short messages, also known as tweets, it allows users to publish their thoughts, opinions, daily activities and other valuable information. As users sometimes exp...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2018
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/74300 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-74300 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-743002023-03-03T20:41:55Z Mapping of aliases to points of interest Ng, Jun Hao Cong Gao School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering In 2006, the social networking messaging service was created and allowed users to publish short messages of 140 characters. Through these short messages, also known as tweets, it allows users to publish their thoughts, opinions, daily activities and other valuable information. As users sometimes explicitly and implicitly reveal their location through the tweets, we can extract valuable information such as a user’s current location or locations they have been at a very fine granularity. Being able to recognise such location mentions and then mapping it to a well-defined location is a huge challenge. Tweets are often short, containing only 12-16 words on average, ungrammatical and informal in nature and locations are often mentioned with acronyms, incomplete names and slang terms. In this solution, it will first identify potential location aliases (variations of location entity mention) found within a Tweet itself using grammatical features through a trained classifier. In order to overcome the lack of context in Tweet, I have utilised the Google search engine to enrich the Tweet itself and post-process the potential aliases identified. Moreover, the solution will be able to map a specific location entity (which will be referred to as Point of Interest[POI]) with an alias. Finally, a list of POIs and aliases for each tweet will be outputted. In the evaluation section, you will be able to find the performance of the proposed solution. Bachelor of Engineering (Computer Science) 2018-05-15T03:34:52Z 2018-05-15T03:34:52Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74300 en Nanyang Technological University 44 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering |
spellingShingle |
DRNTU::Engineering::Computer science and engineering Ng, Jun Hao Mapping of aliases to points of interest |
description |
In 2006, the social networking messaging service was created and allowed users to publish short messages of 140 characters. Through these short messages, also known as tweets, it allows users to publish their thoughts, opinions, daily activities and other valuable information. As users sometimes explicitly and implicitly reveal their location through the tweets, we can extract valuable information such as a user’s current location or locations they have been at a very fine granularity. Being able to recognise such location mentions and then mapping it to a well-defined location is a huge challenge. Tweets are often short, containing only 12-16 words on average, ungrammatical and informal in nature and locations are often mentioned with acronyms, incomplete names and slang terms. In this solution, it will first identify potential location aliases (variations of location entity mention) found within a Tweet itself using grammatical features through a trained classifier. In order to overcome the lack of context in Tweet, I have utilised the Google search engine to enrich the Tweet itself and post-process the potential aliases identified. Moreover, the solution will be able to map a specific location entity (which will be referred to as Point of Interest[POI]) with an alias. Finally, a list of POIs and aliases for each tweet will be outputted. In the evaluation section, you will be able to find the performance of the proposed
solution. |
author2 |
Cong Gao |
author_facet |
Cong Gao Ng, Jun Hao |
format |
Final Year Project |
author |
Ng, Jun Hao |
author_sort |
Ng, Jun Hao |
title |
Mapping of aliases to points of interest |
title_short |
Mapping of aliases to points of interest |
title_full |
Mapping of aliases to points of interest |
title_fullStr |
Mapping of aliases to points of interest |
title_full_unstemmed |
Mapping of aliases to points of interest |
title_sort |
mapping of aliases to points of interest |
publishDate |
2018 |
url |
http://hdl.handle.net/10356/74300 |
_version_ |
1759856696257150976 |