Efficient multi-class selective sampling on graphs

A graph-based multi-class classification problem is typically converted into a collection of binary classification tasks via the one-vs.-all strategy, and then tackled by applying proper binary classification algorithms. Unlike the one-vs.-all strategy, we suggest a unified framework which operates...

Full description

Saved in:

Bibliographic Details
Main Authors:	YANG, Peng, ZHAO, Peilin, HAI, Zhen, LIU, Wei, HOI, Steven C. H., LI, Xiao-Li
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2016
Subjects:	Selective sampling Active learning Databases and Information Systems Software Engineering
Online Access:	https://ink.library.smu.edu.sg/sis_research/3441 https://ink.library.smu.edu.sg/context/sis_research/article/4442/viewcontent/Efficient_multi_class_selective_sampling_on_graphs.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-4442
record_format	dspace
spelling	sg-smu-ink.sis_research-44422017-12-01T09:49:32Z Efficient multi-class selective sampling on graphs YANG, Peng ZHAO, Peilin HAI, Zhen LIU, Wei HOI, Steven C. H., LI, Xiao-Li A graph-based multi-class classification problem is typically converted into a collection of binary classification tasks via the one-vs.-all strategy, and then tackled by applying proper binary classification algorithms. Unlike the one-vs.-all strategy, we suggest a unified framework which operates directly on the multi-class problem without reducing it to a collection of binary tasks. Moreover, this framework makes active learning practically feasible for multi-class problems, while the one-vs.-all strategy cannot. Specifically, we employ a novel randomized query technique to prioritize the informative instances. This query technique based on the hybrid criterion of "margin" and "uncertainty" can achieve a comparable mistake bound with its fully supervised counterpart. To take full advantage of correctly predicted labels discarded in traditional conservative algorithms, we propose an aggressive selective sampling algorithm that can update the model even if no error occurs. Thanks to the aggressive updating strategy, the aggressive algorithm attains a lower mistake bound than its conservative competitors in expectation. Encouraging experimental results on real-world graph databases show that the proposed technique by querying an extremely small ratio of labels is able to accomplish better classification accuracy. 2016-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3441 https://ink.library.smu.edu.sg/context/sis_research/article/4442/viewcontent/Efficient_multi_class_selective_sampling_on_graphs.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Selective sampling Active learning Databases and Information Systems Software Engineering
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Selective sampling Active learning Databases and Information Systems Software Engineering
spellingShingle	Selective sampling Active learning Databases and Information Systems Software Engineering YANG, Peng ZHAO, Peilin HAI, Zhen LIU, Wei HOI, Steven C. H., LI, Xiao-Li Efficient multi-class selective sampling on graphs
description	A graph-based multi-class classification problem is typically converted into a collection of binary classification tasks via the one-vs.-all strategy, and then tackled by applying proper binary classification algorithms. Unlike the one-vs.-all strategy, we suggest a unified framework which operates directly on the multi-class problem without reducing it to a collection of binary tasks. Moreover, this framework makes active learning practically feasible for multi-class problems, while the one-vs.-all strategy cannot. Specifically, we employ a novel randomized query technique to prioritize the informative instances. This query technique based on the hybrid criterion of "margin" and "uncertainty" can achieve a comparable mistake bound with its fully supervised counterpart. To take full advantage of correctly predicted labels discarded in traditional conservative algorithms, we propose an aggressive selective sampling algorithm that can update the model even if no error occurs. Thanks to the aggressive updating strategy, the aggressive algorithm attains a lower mistake bound than its conservative competitors in expectation. Encouraging experimental results on real-world graph databases show that the proposed technique by querying an extremely small ratio of labels is able to accomplish better classification accuracy.
format	text
author	YANG, Peng ZHAO, Peilin HAI, Zhen LIU, Wei HOI, Steven C. H., LI, Xiao-Li
author_facet	YANG, Peng ZHAO, Peilin HAI, Zhen LIU, Wei HOI, Steven C. H., LI, Xiao-Li
author_sort	YANG, Peng
title	Efficient multi-class selective sampling on graphs
title_short	Efficient multi-class selective sampling on graphs
title_full	Efficient multi-class selective sampling on graphs
title_fullStr	Efficient multi-class selective sampling on graphs
title_full_unstemmed	Efficient multi-class selective sampling on graphs
title_sort	efficient multi-class selective sampling on graphs
publisher	Institutional Knowledge at Singapore Management University
publishDate	2016
url	https://ink.library.smu.edu.sg/sis_research/3441 https://ink.library.smu.edu.sg/context/sis_research/article/4442/viewcontent/Efficient_multi_class_selective_sampling_on_graphs.pdf
_version_	1770573204007419904

Efficient multi-class selective sampling on graphs

Similar Items