What makes categories difficult to classify?

In this paper, we try to predict which category will be less accurately classified compared with other categories in a classification task that involves multiple categories. The categories with poor predicted performance will be identified before any classifiers are trained and additional steps can...

Full description

Saved in:
Bibliographic Details
Main Authors: SUN, Aixin, LIM, Ee Peng, LIU, Ying
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2009
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/488
https://ink.library.smu.edu.sg/context/sis_research/article/1487/viewcontent/What_makes_categories_difficult_to_classify_a_stud.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1487
record_format dspace
spelling sg-smu-ink.sis_research-14872018-06-25T07:44:38Z What makes categories difficult to classify? SUN, Aixin LIM, Ee Peng LIU, Ying In this paper, we try to predict which category will be less accurately classified compared with other categories in a classification task that involves multiple categories. The categories with poor predicted performance will be identified before any classifiers are trained and additional steps can be taken to address the predicted poor accuracies of these categories. Inspired by the work on query performance prediction in ad-hoc retrieval, we propose to predict classification performance using two measures, namely, category size and category coherence. Our experiments on 20-Newsgroup and Reuters-21578 datasets show that the Spearman rank correlation coefficient between the predicted rank of classification performance and the expected classification accuracy is as high as 0.9. 2009-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/488 info:doi/10.1145/1645953.1646258 https://ink.library.smu.edu.sg/context/sis_research/article/1487/viewcontent/What_makes_categories_difficult_to_classify_a_stud.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Classification performance prediction Text classification Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Classification performance prediction
Text classification
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Classification performance prediction
Text classification
Databases and Information Systems
Numerical Analysis and Scientific Computing
SUN, Aixin
LIM, Ee Peng
LIU, Ying
What makes categories difficult to classify?
description In this paper, we try to predict which category will be less accurately classified compared with other categories in a classification task that involves multiple categories. The categories with poor predicted performance will be identified before any classifiers are trained and additional steps can be taken to address the predicted poor accuracies of these categories. Inspired by the work on query performance prediction in ad-hoc retrieval, we propose to predict classification performance using two measures, namely, category size and category coherence. Our experiments on 20-Newsgroup and Reuters-21578 datasets show that the Spearman rank correlation coefficient between the predicted rank of classification performance and the expected classification accuracy is as high as 0.9.
format text
author SUN, Aixin
LIM, Ee Peng
LIU, Ying
author_facet SUN, Aixin
LIM, Ee Peng
LIU, Ying
author_sort SUN, Aixin
title What makes categories difficult to classify?
title_short What makes categories difficult to classify?
title_full What makes categories difficult to classify?
title_fullStr What makes categories difficult to classify?
title_full_unstemmed What makes categories difficult to classify?
title_sort what makes categories difficult to classify?
publisher Institutional Knowledge at Singapore Management University
publishDate 2009
url https://ink.library.smu.edu.sg/sis_research/488
https://ink.library.smu.edu.sg/context/sis_research/article/1487/viewcontent/What_makes_categories_difficult_to_classify_a_stud.pdf
_version_ 1770570440495857664