Searching for the X-Factor: Exploring corpus subjectivity for word embeddings

We explore the notion of subjectivity, and hypothesize that word embeddings learnt from input corpora of varying levels of subjectivity behave differently on natural language processing tasks such as classifying a sentence by sentiment, subjectivity, or topic. Through systematic comparative analyses...

Full description

Saved in:
Bibliographic Details
Main Authors: TKACHENKO, Maksim, CHIA, Chong Cher, LAUW, Hady W.
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2018
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4229
https://ink.library.smu.edu.sg/context/sis_research/article/5232/viewcontent/acl2018.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5232
record_format dspace
spelling sg-smu-ink.sis_research-52322020-03-24T05:36:19Z Searching for the X-Factor: Exploring corpus subjectivity for word embeddings TKACHENKO, Maksim CHIA, Chong Cher LAUW, Hady W. We explore the notion of subjectivity, and hypothesize that word embeddings learnt from input corpora of varying levels of subjectivity behave differently on natural language processing tasks such as classifying a sentence by sentiment, subjectivity, or topic. Through systematic comparative analyses, we establish this to be the case indeed. Moreover, based on the discovery of the outsized role that sentiment words play on subjectivity-sensitive tasks such as sentiment classification, we develop a novel word embedding SentiVec which is infused with sentiment information from a lexical resource, and is shown to outperform baselines on such tasks. 2018-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4229 info:doi/10.18653/v1/P18-1112 https://ink.library.smu.edu.sg/context/sis_research/article/5232/viewcontent/acl2018.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computational linguistics Embeddings Natural language processing systems Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Computational linguistics
Embeddings
Natural language processing systems
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Computational linguistics
Embeddings
Natural language processing systems
Databases and Information Systems
Numerical Analysis and Scientific Computing
TKACHENKO, Maksim
CHIA, Chong Cher
LAUW, Hady W.
Searching for the X-Factor: Exploring corpus subjectivity for word embeddings
description We explore the notion of subjectivity, and hypothesize that word embeddings learnt from input corpora of varying levels of subjectivity behave differently on natural language processing tasks such as classifying a sentence by sentiment, subjectivity, or topic. Through systematic comparative analyses, we establish this to be the case indeed. Moreover, based on the discovery of the outsized role that sentiment words play on subjectivity-sensitive tasks such as sentiment classification, we develop a novel word embedding SentiVec which is infused with sentiment information from a lexical resource, and is shown to outperform baselines on such tasks.
format text
author TKACHENKO, Maksim
CHIA, Chong Cher
LAUW, Hady W.
author_facet TKACHENKO, Maksim
CHIA, Chong Cher
LAUW, Hady W.
author_sort TKACHENKO, Maksim
title Searching for the X-Factor: Exploring corpus subjectivity for word embeddings
title_short Searching for the X-Factor: Exploring corpus subjectivity for word embeddings
title_full Searching for the X-Factor: Exploring corpus subjectivity for word embeddings
title_fullStr Searching for the X-Factor: Exploring corpus subjectivity for word embeddings
title_full_unstemmed Searching for the X-Factor: Exploring corpus subjectivity for word embeddings
title_sort searching for the x-factor: exploring corpus subjectivity for word embeddings
publisher Institutional Knowledge at Singapore Management University
publishDate 2018
url https://ink.library.smu.edu.sg/sis_research/4229
https://ink.library.smu.edu.sg/context/sis_research/article/5232/viewcontent/acl2018.pdf
_version_ 1770574494606295040