Searching for the X-Factor: Exploring corpus subjectivity for word embeddings
We explore the notion of subjectivity, and hypothesize that word embeddings learnt from input corpora of varying levels of subjectivity behave differently on natural language processing tasks such as classifying a sentence by sentiment, subjectivity, or topic. Through systematic comparative analyses...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2018
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/4229 https://ink.library.smu.edu.sg/context/sis_research/article/5232/viewcontent/acl2018.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-5232 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-52322020-03-24T05:36:19Z Searching for the X-Factor: Exploring corpus subjectivity for word embeddings TKACHENKO, Maksim CHIA, Chong Cher LAUW, Hady W. We explore the notion of subjectivity, and hypothesize that word embeddings learnt from input corpora of varying levels of subjectivity behave differently on natural language processing tasks such as classifying a sentence by sentiment, subjectivity, or topic. Through systematic comparative analyses, we establish this to be the case indeed. Moreover, based on the discovery of the outsized role that sentiment words play on subjectivity-sensitive tasks such as sentiment classification, we develop a novel word embedding SentiVec which is infused with sentiment information from a lexical resource, and is shown to outperform baselines on such tasks. 2018-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4229 info:doi/10.18653/v1/P18-1112 https://ink.library.smu.edu.sg/context/sis_research/article/5232/viewcontent/acl2018.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computational linguistics Embeddings Natural language processing systems Databases and Information Systems Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Computational linguistics Embeddings Natural language processing systems Databases and Information Systems Numerical Analysis and Scientific Computing |
spellingShingle |
Computational linguistics Embeddings Natural language processing systems Databases and Information Systems Numerical Analysis and Scientific Computing TKACHENKO, Maksim CHIA, Chong Cher LAUW, Hady W. Searching for the X-Factor: Exploring corpus subjectivity for word embeddings |
description |
We explore the notion of subjectivity, and hypothesize that word embeddings learnt from input corpora of varying levels of subjectivity behave differently on natural language processing tasks such as classifying a sentence by sentiment, subjectivity, or topic. Through systematic comparative analyses, we establish this to be the case indeed. Moreover, based on the discovery of the outsized role that sentiment words play on subjectivity-sensitive tasks such as sentiment classification, we develop a novel word embedding SentiVec which is infused with sentiment information from a lexical resource, and is shown to outperform baselines on such tasks. |
format |
text |
author |
TKACHENKO, Maksim CHIA, Chong Cher LAUW, Hady W. |
author_facet |
TKACHENKO, Maksim CHIA, Chong Cher LAUW, Hady W. |
author_sort |
TKACHENKO, Maksim |
title |
Searching for the X-Factor: Exploring corpus subjectivity for word embeddings |
title_short |
Searching for the X-Factor: Exploring corpus subjectivity for word embeddings |
title_full |
Searching for the X-Factor: Exploring corpus subjectivity for word embeddings |
title_fullStr |
Searching for the X-Factor: Exploring corpus subjectivity for word embeddings |
title_full_unstemmed |
Searching for the X-Factor: Exploring corpus subjectivity for word embeddings |
title_sort |
searching for the x-factor: exploring corpus subjectivity for word embeddings |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2018 |
url |
https://ink.library.smu.edu.sg/sis_research/4229 https://ink.library.smu.edu.sg/context/sis_research/article/5232/viewcontent/acl2018.pdf |
_version_ |
1770574494606295040 |