Initialization independent clustering with actively self-training method

The results of traditional clustering methods are usually unreliable as there is not any guidance from the data labels, while the class labels can be predicted more reliable by the semisupervised learning if the labels of partial data are given. In this paper, we propose an actively self-training cl...

Full description

Saved in:
Bibliographic Details
Main Authors: Nie, Feiping, Xu, Dong, Li, Xuelong
Other Authors: School of Computer Engineering
Format: Article
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/96337
http://hdl.handle.net/10220/11431
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-96337
record_format dspace
spelling sg-ntu-dr.10356-963372020-05-28T07:17:59Z Initialization independent clustering with actively self-training method Nie, Feiping Xu, Dong Li, Xuelong School of Computer Engineering DRNTU::Engineering::Computer science and engineering The results of traditional clustering methods are usually unreliable as there is not any guidance from the data labels, while the class labels can be predicted more reliable by the semisupervised learning if the labels of partial data are given. In this paper, we propose an actively self-training clustering method, in which the samples are actively selected as training set to minimize an estimated Bayes error, and then explore semisupervised learning to perform clustering. Traditional graph-based semisupervised learning methods are not convenient to estimate the Bayes error; we develop a specific regularization framework on graph to perform semisupervised learning, in which the Bayes error can be effectively estimated. In addition, the proposed clustering algorithm can be readily applied in a semisupervised setting with partial class labels. Experimental results on toy data and real-world data sets demonstrate the effectiveness of the proposed clustering method on the unsupervised and the semisupervised setting. It is worthy noting that the proposed clustering method is free of initialization, while traditional clustering methods are usually dependent on initialization. 2013-07-15T06:51:57Z 2019-12-06T19:29:14Z 2013-07-15T06:51:57Z 2019-12-06T19:29:14Z 2011 2011 Journal Article Nie, F., Xu, D, & Li, X. (2012). Initialization Independent Clustering With Actively Self-Training Method. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 42(1), 17-27. 1083-4419 https://hdl.handle.net/10356/96337 http://hdl.handle.net/10220/11431 10.1109/TSMCB.2011.2161607 en IEEE transactions on systems, man, and cybernetics, part b (cybernetics) © 2011 IEEE.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Nie, Feiping
Xu, Dong
Li, Xuelong
Initialization independent clustering with actively self-training method
description The results of traditional clustering methods are usually unreliable as there is not any guidance from the data labels, while the class labels can be predicted more reliable by the semisupervised learning if the labels of partial data are given. In this paper, we propose an actively self-training clustering method, in which the samples are actively selected as training set to minimize an estimated Bayes error, and then explore semisupervised learning to perform clustering. Traditional graph-based semisupervised learning methods are not convenient to estimate the Bayes error; we develop a specific regularization framework on graph to perform semisupervised learning, in which the Bayes error can be effectively estimated. In addition, the proposed clustering algorithm can be readily applied in a semisupervised setting with partial class labels. Experimental results on toy data and real-world data sets demonstrate the effectiveness of the proposed clustering method on the unsupervised and the semisupervised setting. It is worthy noting that the proposed clustering method is free of initialization, while traditional clustering methods are usually dependent on initialization.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Nie, Feiping
Xu, Dong
Li, Xuelong
format Article
author Nie, Feiping
Xu, Dong
Li, Xuelong
author_sort Nie, Feiping
title Initialization independent clustering with actively self-training method
title_short Initialization independent clustering with actively self-training method
title_full Initialization independent clustering with actively self-training method
title_fullStr Initialization independent clustering with actively self-training method
title_full_unstemmed Initialization independent clustering with actively self-training method
title_sort initialization independent clustering with actively self-training method
publishDate 2013
url https://hdl.handle.net/10356/96337
http://hdl.handle.net/10220/11431
_version_ 1681056518076628992