What makes categories difficult to classify?
In this paper, we try to predict which category will be less accurately classified compared with other categories in a classification task that involves multiple categories. The categories with poor predicted performance will be identified before any classifiers are trained and additional steps can...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2009
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/488 https://ink.library.smu.edu.sg/context/sis_research/article/1487/viewcontent/What_makes_categories_difficult_to_classify_a_stud.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-1487 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-14872018-06-25T07:44:38Z What makes categories difficult to classify? SUN, Aixin LIM, Ee Peng LIU, Ying In this paper, we try to predict which category will be less accurately classified compared with other categories in a classification task that involves multiple categories. The categories with poor predicted performance will be identified before any classifiers are trained and additional steps can be taken to address the predicted poor accuracies of these categories. Inspired by the work on query performance prediction in ad-hoc retrieval, we propose to predict classification performance using two measures, namely, category size and category coherence. Our experiments on 20-Newsgroup and Reuters-21578 datasets show that the Spearman rank correlation coefficient between the predicted rank of classification performance and the expected classification accuracy is as high as 0.9. 2009-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/488 info:doi/10.1145/1645953.1646258 https://ink.library.smu.edu.sg/context/sis_research/article/1487/viewcontent/What_makes_categories_difficult_to_classify_a_stud.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Classification performance prediction Text classification Databases and Information Systems Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Classification performance prediction Text classification Databases and Information Systems Numerical Analysis and Scientific Computing |
spellingShingle |
Classification performance prediction Text classification Databases and Information Systems Numerical Analysis and Scientific Computing SUN, Aixin LIM, Ee Peng LIU, Ying What makes categories difficult to classify? |
description |
In this paper, we try to predict which category will be less accurately classified compared with other categories in a classification task that involves multiple categories. The categories with poor predicted performance will be identified before any classifiers are trained and additional steps can be taken to address the predicted poor accuracies of these categories. Inspired by the work on query performance prediction in ad-hoc retrieval, we propose to predict classification performance using two measures, namely, category size and category coherence. Our experiments on 20-Newsgroup and Reuters-21578 datasets show that the Spearman rank correlation coefficient between the predicted rank of classification performance and the expected classification accuracy is as high as 0.9. |
format |
text |
author |
SUN, Aixin LIM, Ee Peng LIU, Ying |
author_facet |
SUN, Aixin LIM, Ee Peng LIU, Ying |
author_sort |
SUN, Aixin |
title |
What makes categories difficult to classify? |
title_short |
What makes categories difficult to classify? |
title_full |
What makes categories difficult to classify? |
title_fullStr |
What makes categories difficult to classify? |
title_full_unstemmed |
What makes categories difficult to classify? |
title_sort |
what makes categories difficult to classify? |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2009 |
url |
https://ink.library.smu.edu.sg/sis_research/488 https://ink.library.smu.edu.sg/context/sis_research/article/1487/viewcontent/What_makes_categories_difficult_to_classify_a_stud.pdf |
_version_ |
1770570440495857664 |