Efficient multi-class selective sampling on graphs
A graph-based multi-class classification problem is typically converted into a collection of binary classification tasks via the one-vs.-all strategy, and then tackled by applying proper binary classification algorithms. Unlike the one-vs.-all strategy, we suggest a unified framework which operates...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2016
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/3441 https://ink.library.smu.edu.sg/context/sis_research/article/4442/viewcontent/Efficient_multi_class_selective_sampling_on_graphs.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-4442 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-44422017-12-01T09:49:32Z Efficient multi-class selective sampling on graphs YANG, Peng ZHAO, Peilin HAI, Zhen LIU, Wei HOI, Steven C. H., LI, Xiao-Li A graph-based multi-class classification problem is typically converted into a collection of binary classification tasks via the one-vs.-all strategy, and then tackled by applying proper binary classification algorithms. Unlike the one-vs.-all strategy, we suggest a unified framework which operates directly on the multi-class problem without reducing it to a collection of binary tasks. Moreover, this framework makes active learning practically feasible for multi-class problems, while the one-vs.-all strategy cannot. Specifically, we employ a novel randomized query technique to prioritize the informative instances. This query technique based on the hybrid criterion of "margin" and "uncertainty" can achieve a comparable mistake bound with its fully supervised counterpart. To take full advantage of correctly predicted labels discarded in traditional conservative algorithms, we propose an aggressive selective sampling algorithm that can update the model even if no error occurs. Thanks to the aggressive updating strategy, the aggressive algorithm attains a lower mistake bound than its conservative competitors in expectation. Encouraging experimental results on real-world graph databases show that the proposed technique by querying an extremely small ratio of labels is able to accomplish better classification accuracy. 2016-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3441 https://ink.library.smu.edu.sg/context/sis_research/article/4442/viewcontent/Efficient_multi_class_selective_sampling_on_graphs.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Selective sampling Active learning Databases and Information Systems Software Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Selective sampling Active learning Databases and Information Systems Software Engineering |
spellingShingle |
Selective sampling Active learning Databases and Information Systems Software Engineering YANG, Peng ZHAO, Peilin HAI, Zhen LIU, Wei HOI, Steven C. H., LI, Xiao-Li Efficient multi-class selective sampling on graphs |
description |
A graph-based multi-class classification problem is typically converted into a collection of binary classification tasks via the one-vs.-all strategy, and then tackled by applying proper binary classification algorithms. Unlike the one-vs.-all strategy, we suggest a unified framework which operates directly on the multi-class problem without reducing it to a collection of binary tasks. Moreover, this framework makes active learning practically feasible for multi-class problems, while the one-vs.-all strategy cannot. Specifically, we employ a novel randomized query technique to prioritize the informative instances. This query technique based on the hybrid criterion of "margin" and "uncertainty" can achieve a comparable mistake bound with its fully supervised counterpart. To take full advantage of correctly predicted labels discarded in traditional conservative algorithms, we propose an aggressive selective sampling algorithm that can update the model even if no error occurs. Thanks to the aggressive updating strategy, the aggressive algorithm attains a lower mistake bound than its conservative competitors in expectation. Encouraging experimental results on real-world graph databases show that the proposed technique by querying an extremely small ratio of labels is able to accomplish better classification accuracy. |
format |
text |
author |
YANG, Peng ZHAO, Peilin HAI, Zhen LIU, Wei HOI, Steven C. H., LI, Xiao-Li |
author_facet |
YANG, Peng ZHAO, Peilin HAI, Zhen LIU, Wei HOI, Steven C. H., LI, Xiao-Li |
author_sort |
YANG, Peng |
title |
Efficient multi-class selective sampling on graphs |
title_short |
Efficient multi-class selective sampling on graphs |
title_full |
Efficient multi-class selective sampling on graphs |
title_fullStr |
Efficient multi-class selective sampling on graphs |
title_full_unstemmed |
Efficient multi-class selective sampling on graphs |
title_sort |
efficient multi-class selective sampling on graphs |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2016 |
url |
https://ink.library.smu.edu.sg/sis_research/3441 https://ink.library.smu.edu.sg/context/sis_research/article/4442/viewcontent/Efficient_multi_class_selective_sampling_on_graphs.pdf |
_version_ |
1770573204007419904 |