A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection

Due to the huge volume and linguistic variation of data shared online, accurate detection of the sentiment of a message (polarity detection) can no longer rely on human assessors or through simple lexicon keyword matching. This paper presents a semi-supervised approach in constructing essential tool...

Full description

Saved in:
Bibliographic Details
Main Authors: LO, Siaw Ling, CAMBRIA, Erik, CHIONG, Raymond, CORNFORTH, David
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4872
https://ink.library.smu.edu.sg/context/sis_research/article/5875/viewcontent/A_multilingual___PV.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5875
record_format dspace
spelling sg-smu-ink.sis_research-58752020-02-13T08:50:32Z A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection LO, Siaw Ling CAMBRIA, Erik CHIONG, Raymond CORNFORTH, David Due to the huge volume and linguistic variation of data shared online, accurate detection of the sentiment of a message (polarity detection) can no longer rely on human assessors or through simple lexicon keyword matching. This paper presents a semi-supervised approach in constructing essential toolkits for analysing the polarity of a localised scarce-resource language, Singlish (Singaporean English). Corpus-based bootstrapping using a multilingual, multifaceted lexicon was applied to construct an annotated testing dataset, while unsupervised methods such as lexicon polarity detection, frequent item extraction through association rules and latent semantic analysis were used to identify the polarity of Singlish n-grams before human assessment was done to isolate misleading terms and remove concept ambiguity. The findings suggest that this multilingual approach outshines polarity analysis using only the English language. In addition, a hybrid combination of the Support Vector Machine and a proposed Singlish Polarity Detection algorithm, which incorporates unigram and n-gram Singlish sentic patterns with other multilingual polarity sentic patterns such as negation and adversative, is able to outperform other approaches in comparison. The promising results of a pooled testing dataset generated from the vast amount of unannotated Singlish data clearly show that our multilingual Singlish sentic pattern approach has the potential to be adopted in real-world polarity detection. 2016-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4872 info:doi/10.1016/j.knosys.2016.04.024 https://ink.library.smu.edu.sg/context/sis_research/article/5875/viewcontent/A_multilingual___PV.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Sentic computing Polarity detection Semi-supervised Singlish Twitter Computer Engineering Digital Communications and Networking
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Sentic computing
Polarity detection
Semi-supervised
Singlish
Twitter
Computer Engineering
Digital Communications and Networking
spellingShingle Sentic computing
Polarity detection
Semi-supervised
Singlish
Twitter
Computer Engineering
Digital Communications and Networking
LO, Siaw Ling
CAMBRIA, Erik
CHIONG, Raymond
CORNFORTH, David
A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection
description Due to the huge volume and linguistic variation of data shared online, accurate detection of the sentiment of a message (polarity detection) can no longer rely on human assessors or through simple lexicon keyword matching. This paper presents a semi-supervised approach in constructing essential toolkits for analysing the polarity of a localised scarce-resource language, Singlish (Singaporean English). Corpus-based bootstrapping using a multilingual, multifaceted lexicon was applied to construct an annotated testing dataset, while unsupervised methods such as lexicon polarity detection, frequent item extraction through association rules and latent semantic analysis were used to identify the polarity of Singlish n-grams before human assessment was done to isolate misleading terms and remove concept ambiguity. The findings suggest that this multilingual approach outshines polarity analysis using only the English language. In addition, a hybrid combination of the Support Vector Machine and a proposed Singlish Polarity Detection algorithm, which incorporates unigram and n-gram Singlish sentic patterns with other multilingual polarity sentic patterns such as negation and adversative, is able to outperform other approaches in comparison. The promising results of a pooled testing dataset generated from the vast amount of unannotated Singlish data clearly show that our multilingual Singlish sentic pattern approach has the potential to be adopted in real-world polarity detection.
format text
author LO, Siaw Ling
CAMBRIA, Erik
CHIONG, Raymond
CORNFORTH, David
author_facet LO, Siaw Ling
CAMBRIA, Erik
CHIONG, Raymond
CORNFORTH, David
author_sort LO, Siaw Ling
title A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection
title_short A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection
title_full A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection
title_fullStr A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection
title_full_unstemmed A multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection
title_sort multilingual semi-supervised approach in deriving singlish sentic patterns for polarity detection
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/4872
https://ink.library.smu.edu.sg/context/sis_research/article/5875/viewcontent/A_multilingual___PV.pdf
_version_ 1770575080164687872