Efficient multi-class selective sampling on graphs

A graph-based multi-class classification problem is typically converted into a collection of binary classification tasks via the one-vs.-all strategy, and then tackled by applying proper binary classification algorithms. Unlike the one-vs.-all strategy, we suggest a unified framework which operates...

Full description

Saved in:
Bibliographic Details
Main Authors: YANG, Peng, ZHAO, Peilin, HAI, Zhen, LIU, Wei, HOI, Steven C. H., LI, Xiao-Li
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3441
https://ink.library.smu.edu.sg/context/sis_research/article/4442/viewcontent/Efficient_multi_class_selective_sampling_on_graphs.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4442
record_format dspace
spelling sg-smu-ink.sis_research-44422017-12-01T09:49:32Z Efficient multi-class selective sampling on graphs YANG, Peng ZHAO, Peilin HAI, Zhen LIU, Wei HOI, Steven C. H., LI, Xiao-Li A graph-based multi-class classification problem is typically converted into a collection of binary classification tasks via the one-vs.-all strategy, and then tackled by applying proper binary classification algorithms. Unlike the one-vs.-all strategy, we suggest a unified framework which operates directly on the multi-class problem without reducing it to a collection of binary tasks. Moreover, this framework makes active learning practically feasible for multi-class problems, while the one-vs.-all strategy cannot. Specifically, we employ a novel randomized query technique to prioritize the informative instances. This query technique based on the hybrid criterion of "margin" and "uncertainty" can achieve a comparable mistake bound with its fully supervised counterpart. To take full advantage of correctly predicted labels discarded in traditional conservative algorithms, we propose an aggressive selective sampling algorithm that can update the model even if no error occurs. Thanks to the aggressive updating strategy, the aggressive algorithm attains a lower mistake bound than its conservative competitors in expectation. Encouraging experimental results on real-world graph databases show that the proposed technique by querying an extremely small ratio of labels is able to accomplish better classification accuracy. 2016-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3441 https://ink.library.smu.edu.sg/context/sis_research/article/4442/viewcontent/Efficient_multi_class_selective_sampling_on_graphs.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Selective sampling Active learning Databases and Information Systems Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Selective sampling
Active learning
Databases and Information Systems
Software Engineering
spellingShingle Selective sampling
Active learning
Databases and Information Systems
Software Engineering
YANG, Peng
ZHAO, Peilin
HAI, Zhen
LIU, Wei
HOI, Steven C. H.,
LI, Xiao-Li
Efficient multi-class selective sampling on graphs
description A graph-based multi-class classification problem is typically converted into a collection of binary classification tasks via the one-vs.-all strategy, and then tackled by applying proper binary classification algorithms. Unlike the one-vs.-all strategy, we suggest a unified framework which operates directly on the multi-class problem without reducing it to a collection of binary tasks. Moreover, this framework makes active learning practically feasible for multi-class problems, while the one-vs.-all strategy cannot. Specifically, we employ a novel randomized query technique to prioritize the informative instances. This query technique based on the hybrid criterion of "margin" and "uncertainty" can achieve a comparable mistake bound with its fully supervised counterpart. To take full advantage of correctly predicted labels discarded in traditional conservative algorithms, we propose an aggressive selective sampling algorithm that can update the model even if no error occurs. Thanks to the aggressive updating strategy, the aggressive algorithm attains a lower mistake bound than its conservative competitors in expectation. Encouraging experimental results on real-world graph databases show that the proposed technique by querying an extremely small ratio of labels is able to accomplish better classification accuracy.
format text
author YANG, Peng
ZHAO, Peilin
HAI, Zhen
LIU, Wei
HOI, Steven C. H.,
LI, Xiao-Li
author_facet YANG, Peng
ZHAO, Peilin
HAI, Zhen
LIU, Wei
HOI, Steven C. H.,
LI, Xiao-Li
author_sort YANG, Peng
title Efficient multi-class selective sampling on graphs
title_short Efficient multi-class selective sampling on graphs
title_full Efficient multi-class selective sampling on graphs
title_fullStr Efficient multi-class selective sampling on graphs
title_full_unstemmed Efficient multi-class selective sampling on graphs
title_sort efficient multi-class selective sampling on graphs
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/3441
https://ink.library.smu.edu.sg/context/sis_research/article/4442/viewcontent/Efficient_multi_class_selective_sampling_on_graphs.pdf
_version_ 1770573204007419904