Salience-aware adaptive resonance theory for large-scale sparse data clustering

Sparse data is known to pose challenges to cluster analysis, as the similarity between data tends to be ill-posed in the high-dimensional Hilbert space. Solutions in the literature typically extend either k-means or spectral clustering with additional steps on representation learning and/or feature...

Full description

Saved in:

Bibliographic Details
Main Authors:	MENG, Lei, TAN, Ah-hwee, MIAO, Chunyan
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2019
Subjects:	Adaptive resonance theory Clustering Sparse data Subspace learning Feature weighting Parameter adaptation Databases and Information Systems OS and Networks Software Engineering
Online Access:	https://ink.library.smu.edu.sg/sis_research/5240 https://ink.library.smu.edu.sg/context/sis_research/article/6243/viewcontent/1_s2.0_S0893608019302758_main.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-6243
record_format	dspace
spelling	sg-smu-ink.sis_research-62432020-07-23T18:24:42Z Salience-aware adaptive resonance theory for large-scale sparse data clustering MENG, Lei TAN, Ah-hwee MIAO, Chunyan Sparse data is known to pose challenges to cluster analysis, as the similarity between data tends to be ill-posed in the high-dimensional Hilbert space. Solutions in the literature typically extend either k-means or spectral clustering with additional steps on representation learning and/or feature weighting. However, adding these usually introduces new parameters and increases computational cost, thus inevitably lowering the robustness of these algorithms when handling massive ill-represented data. To alleviate these issues, this paper presents a class of self-organizing neural networks, called the salience-aware adaptive resonance theory (SA-ART) model. SA-ART extends Fuzzy ART with measures for cluster-wise salient feature modeling. Specifically, two strategies, i.e. cluster space matching and salience feature weighting, are incorporated to alleviate the side-effect of noisy features incurred by high dimensionality. Additionally, cluster weights are bounded by the statistical means and minimums of the samples therein, making the learning rate also self-adaptable. Notably, SA-ART allows clusters to have their own sets of self-adaptable parameters. It has the same time complexity of Fuzzy ART and does not introduce additional hyperparameters that profile cluster properties. Comparative experiments have been conducted on the ImageNet and BlogCatalog datasets, which are large-scale and include sparsely-represented data. The results show that, SA-ART achieves 51.8% and 18.2% improvement over Fuzzy ART, respectively. While both have a similar time cost, SA-ART converges faster and can reach a better local minimum. In addition, SA-ART consistently outperforms six other state-of-the-art algorithms in terms of precision and F1 score. More importantly, it is much faster and exhibits stronger robustness to large and complex data. 2019-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5240 info:doi/10.1016/j.neunet.2019.09.014 https://ink.library.smu.edu.sg/context/sis_research/article/6243/viewcontent/1_s2.0_S0893608019302758_main.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Adaptive resonance theory Clustering Sparse data Subspace learning Feature weighting Parameter adaptation Databases and Information Systems OS and Networks Software Engineering
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Adaptive resonance theory Clustering Sparse data Subspace learning Feature weighting Parameter adaptation Databases and Information Systems OS and Networks Software Engineering
spellingShingle	Adaptive resonance theory Clustering Sparse data Subspace learning Feature weighting Parameter adaptation Databases and Information Systems OS and Networks Software Engineering MENG, Lei TAN, Ah-hwee MIAO, Chunyan Salience-aware adaptive resonance theory for large-scale sparse data clustering
description	Sparse data is known to pose challenges to cluster analysis, as the similarity between data tends to be ill-posed in the high-dimensional Hilbert space. Solutions in the literature typically extend either k-means or spectral clustering with additional steps on representation learning and/or feature weighting. However, adding these usually introduces new parameters and increases computational cost, thus inevitably lowering the robustness of these algorithms when handling massive ill-represented data. To alleviate these issues, this paper presents a class of self-organizing neural networks, called the salience-aware adaptive resonance theory (SA-ART) model. SA-ART extends Fuzzy ART with measures for cluster-wise salient feature modeling. Specifically, two strategies, i.e. cluster space matching and salience feature weighting, are incorporated to alleviate the side-effect of noisy features incurred by high dimensionality. Additionally, cluster weights are bounded by the statistical means and minimums of the samples therein, making the learning rate also self-adaptable. Notably, SA-ART allows clusters to have their own sets of self-adaptable parameters. It has the same time complexity of Fuzzy ART and does not introduce additional hyperparameters that profile cluster properties. Comparative experiments have been conducted on the ImageNet and BlogCatalog datasets, which are large-scale and include sparsely-represented data. The results show that, SA-ART achieves 51.8% and 18.2% improvement over Fuzzy ART, respectively. While both have a similar time cost, SA-ART converges faster and can reach a better local minimum. In addition, SA-ART consistently outperforms six other state-of-the-art algorithms in terms of precision and F1 score. More importantly, it is much faster and exhibits stronger robustness to large and complex data.
format	text
author	MENG, Lei TAN, Ah-hwee MIAO, Chunyan
author_facet	MENG, Lei TAN, Ah-hwee MIAO, Chunyan
author_sort	MENG, Lei
title	Salience-aware adaptive resonance theory for large-scale sparse data clustering
title_short	Salience-aware adaptive resonance theory for large-scale sparse data clustering
title_full	Salience-aware adaptive resonance theory for large-scale sparse data clustering
title_fullStr	Salience-aware adaptive resonance theory for large-scale sparse data clustering
title_full_unstemmed	Salience-aware adaptive resonance theory for large-scale sparse data clustering
title_sort	salience-aware adaptive resonance theory for large-scale sparse data clustering
publisher	Institutional Knowledge at Singapore Management University
publishDate	2019
url	https://ink.library.smu.edu.sg/sis_research/5240 https://ink.library.smu.edu.sg/context/sis_research/article/6243/viewcontent/1_s2.0_S0893608019302758_main.pdf
_version_	1770575346328928256

Salience-aware adaptive resonance theory for large-scale sparse data clustering

Similar Items