Ordinal text quantification
In recent years there has been a growing interest in text quantification, a supervised learning task where the goal is to accurately estimate, in an unlabelled set of items, the prevalence (or "relative frequency") of each class c in a predefined set C. Text quantification has several appl...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2016
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/4569 https://ink.library.smu.edu.sg/context/sis_research/article/5572/viewcontent/p937_da_san_martino.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-5572 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-55722019-12-26T08:21:05Z Ordinal text quantification MARTINO, Giovanni Da San GAO, Wei SEBASTIANI, Fabrizio In recent years there has been a growing interest in text quantification, a supervised learning task where the goal is to accurately estimate, in an unlabelled set of items, the prevalence (or "relative frequency") of each class c in a predefined set C. Text quantification has several applications, and is a dominant concern in fields such as market research, the social sciences, political science, and epidemiology. In this paper we tackle, for the first time, the problem of ordinal text quantification, defined as the task of performing text quantification when a total order is defined on the set of classes; estimating the prevalence of "five stars" reviews in a set of reviews of a given product, and monitoring this prevalence across time, is an example application. We present OQT, a novel tree-based OQ algorithm, and discuss experimental results obtained on a dataset of tweets classified according to sentiment strength. 2016-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4569 info:doi/10.1145/2911451.2914749 https://ink.library.smu.edu.sg/context/sis_research/article/5572/viewcontent/p937_da_san_martino.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems |
spellingShingle |
Databases and Information Systems MARTINO, Giovanni Da San GAO, Wei SEBASTIANI, Fabrizio Ordinal text quantification |
description |
In recent years there has been a growing interest in text quantification, a supervised learning task where the goal is to accurately estimate, in an unlabelled set of items, the prevalence (or "relative frequency") of each class c in a predefined set C. Text quantification has several applications, and is a dominant concern in fields such as market research, the social sciences, political science, and epidemiology. In this paper we tackle, for the first time, the problem of ordinal text quantification, defined as the task of performing text quantification when a total order is defined on the set of classes; estimating the prevalence of "five stars" reviews in a set of reviews of a given product, and monitoring this prevalence across time, is an example application. We present OQT, a novel tree-based OQ algorithm, and discuss experimental results obtained on a dataset of tweets classified according to sentiment strength. |
format |
text |
author |
MARTINO, Giovanni Da San GAO, Wei SEBASTIANI, Fabrizio |
author_facet |
MARTINO, Giovanni Da San GAO, Wei SEBASTIANI, Fabrizio |
author_sort |
MARTINO, Giovanni Da San |
title |
Ordinal text quantification |
title_short |
Ordinal text quantification |
title_full |
Ordinal text quantification |
title_fullStr |
Ordinal text quantification |
title_full_unstemmed |
Ordinal text quantification |
title_sort |
ordinal text quantification |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2016 |
url |
https://ink.library.smu.edu.sg/sis_research/4569 https://ink.library.smu.edu.sg/context/sis_research/article/5572/viewcontent/p937_da_san_martino.pdf |
_version_ |
1770574916954882048 |