CrowdTC: Crowd-powered learning for text classification

Text classification is a fundamental task in content analysis. Nowadays, deep learning has demonstrated promising performance in text classification compared with shallow models. However, almost all the existing models do not take advantage of the wisdom of human beings to help text classification....

Full description

Saved in:
Bibliographic Details
Main Authors: YANG, Keyu, Gao, Yunjun, LIANG, Lei, BIAN, Song, CHEN, Lu, ZHENG, Baihua
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7149
https://ink.library.smu.edu.sg/context/sis_research/article/8152/viewcontent/CrowdTC_pv.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8152
record_format dspace
spelling sg-smu-ink.sis_research-81522022-06-21T01:32:48Z CrowdTC: Crowd-powered learning for text classification YANG, Keyu Gao, Yunjun LIANG, Lei BIAN, Song CHEN, Lu ZHENG, Baihua Text classification is a fundamental task in content analysis. Nowadays, deep learning has demonstrated promising performance in text classification compared with shallow models. However, almost all the existing models do not take advantage of the wisdom of human beings to help text classification. Human beings are more intelligent and capable than machine learning models in terms of understanding and capturing the implicit semantic information from text. In this article, we try to take guidance from human beings to classify text. We propose Crowd-powered learning for Text Classification (CrowdTC for short). We design and post the questions on a crowdsourcing platform to extract keywords in text. Sampling and clustering techniques are utilized to reduce the cost of crowdsourcing. Also, we present an attention-based neural network and a hybrid neural network to incorporate the extracted keywords as human guidance into deep neural networks. Extensive experiments on public datasets confirm that CrowdTC improves the text classification accuracy of neural networks by using the crowd-powered keyword guidance. 2022-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7149 info:doi/10.1145/3457216 https://ink.library.smu.edu.sg/context/sis_research/article/8152/viewcontent/CrowdTC_pv.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Text classification crowdsourcing keyword extraction neural networks Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Text classification
crowdsourcing
keyword extraction
neural networks
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Text classification
crowdsourcing
keyword extraction
neural networks
Databases and Information Systems
Numerical Analysis and Scientific Computing
YANG, Keyu
Gao, Yunjun
LIANG, Lei
BIAN, Song
CHEN, Lu
ZHENG, Baihua
CrowdTC: Crowd-powered learning for text classification
description Text classification is a fundamental task in content analysis. Nowadays, deep learning has demonstrated promising performance in text classification compared with shallow models. However, almost all the existing models do not take advantage of the wisdom of human beings to help text classification. Human beings are more intelligent and capable than machine learning models in terms of understanding and capturing the implicit semantic information from text. In this article, we try to take guidance from human beings to classify text. We propose Crowd-powered learning for Text Classification (CrowdTC for short). We design and post the questions on a crowdsourcing platform to extract keywords in text. Sampling and clustering techniques are utilized to reduce the cost of crowdsourcing. Also, we present an attention-based neural network and a hybrid neural network to incorporate the extracted keywords as human guidance into deep neural networks. Extensive experiments on public datasets confirm that CrowdTC improves the text classification accuracy of neural networks by using the crowd-powered keyword guidance.
format text
author YANG, Keyu
Gao, Yunjun
LIANG, Lei
BIAN, Song
CHEN, Lu
ZHENG, Baihua
author_facet YANG, Keyu
Gao, Yunjun
LIANG, Lei
BIAN, Song
CHEN, Lu
ZHENG, Baihua
author_sort YANG, Keyu
title CrowdTC: Crowd-powered learning for text classification
title_short CrowdTC: Crowd-powered learning for text classification
title_full CrowdTC: Crowd-powered learning for text classification
title_fullStr CrowdTC: Crowd-powered learning for text classification
title_full_unstemmed CrowdTC: Crowd-powered learning for text classification
title_sort crowdtc: crowd-powered learning for text classification
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/7149
https://ink.library.smu.edu.sg/context/sis_research/article/8152/viewcontent/CrowdTC_pv.pdf
_version_ 1770576232369356800