Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning

The Chinese pronunciation system offers two characteristics that distinguish it from other languages: deep phonemic orthography and intonation variations. In this paper, we hypothesize that these two important properties can play a major role in Chinese sentiment analysis. In particular, we propose...

Full description

Saved in:
Bibliographic Details
Main Authors: Peng, Haiyun, Ma, Yukun, Poria, Soujanya, Li, Yang, Cambria, Erik
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/160266
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-160266
record_format dspace
spelling sg-ntu-dr.10356-1602662022-07-18T07:38:29Z Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning Peng, Haiyun Ma, Yukun Poria, Soujanya Li, Yang Cambria, Erik School of Computer Science and Engineering Engineering::Computer science and engineering Sentiment Analysis Multilingual Sentiment Analysis The Chinese pronunciation system offers two characteristics that distinguish it from other languages: deep phonemic orthography and intonation variations. In this paper, we hypothesize that these two important properties can play a major role in Chinese sentiment analysis. In particular, we propose two effective features to encode phonetic information and, hence, fuse it with textual information. With this hypothesis, we propose Disambiguate Intonation for Sentiment Analysis (DISA), a network that we develop based on the principles of reinforcement learning. DISA disambiguates intonations for each Chinese character (pinyin) and, hence, learns precise phonetic representations. We also fuse phonetic features with textual and visual features to further improve performance. Experimental results on five different Chinese sentiment analysis datasets show that the inclusion of phonetic features significantly and consistently improves the performance of textual and visual representations and surpasses the state-of-the-art Chinese character-level representations. 2022-07-18T07:38:29Z 2022-07-18T07:38:29Z 2021 Journal Article Peng, H., Ma, Y., Poria, S., Li, Y. & Cambria, E. (2021). Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning. Information Fusion, 70, 88-99. https://dx.doi.org/10.1016/j.inffus.2021.01.005 1566-2535 https://hdl.handle.net/10356/160266 10.1016/j.inffus.2021.01.005 2-s2.0-85099510729 70 88 99 en Information Fusion © 2021 Elsevier B.V. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Sentiment Analysis
Multilingual Sentiment Analysis
spellingShingle Engineering::Computer science and engineering
Sentiment Analysis
Multilingual Sentiment Analysis
Peng, Haiyun
Ma, Yukun
Poria, Soujanya
Li, Yang
Cambria, Erik
Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning
description The Chinese pronunciation system offers two characteristics that distinguish it from other languages: deep phonemic orthography and intonation variations. In this paper, we hypothesize that these two important properties can play a major role in Chinese sentiment analysis. In particular, we propose two effective features to encode phonetic information and, hence, fuse it with textual information. With this hypothesis, we propose Disambiguate Intonation for Sentiment Analysis (DISA), a network that we develop based on the principles of reinforcement learning. DISA disambiguates intonations for each Chinese character (pinyin) and, hence, learns precise phonetic representations. We also fuse phonetic features with textual and visual features to further improve performance. Experimental results on five different Chinese sentiment analysis datasets show that the inclusion of phonetic features significantly and consistently improves the performance of textual and visual representations and surpasses the state-of-the-art Chinese character-level representations.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Peng, Haiyun
Ma, Yukun
Poria, Soujanya
Li, Yang
Cambria, Erik
format Article
author Peng, Haiyun
Ma, Yukun
Poria, Soujanya
Li, Yang
Cambria, Erik
author_sort Peng, Haiyun
title Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning
title_short Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning
title_full Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning
title_fullStr Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning
title_full_unstemmed Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning
title_sort phonetic-enriched text representation for chinese sentiment analysis with reinforcement learning
publishDate 2022
url https://hdl.handle.net/10356/160266
_version_ 1738844916053704704