Performance Evaluation of Regular Decomposition and Benchmark Clustering Methods

Bibliographic Details
Main Authors: Haryo, Laura; Pulungan, Reza
Format: Other NonPeerReviewed
Published: Communications in Computer and Information Science 2022
Online Access: https://repository.ugm.ac.id/284263/
https://link.springer.com/chapter/10.1007/978-981-19-8069-5_12
Summary: This study compares three benchmark clustering methods (mini batch k-means, DBSCAN, and spectral clustering) with regular decomposition (RD), a new method developed for large graph data. RD is first adapted so that it is applicable to numerical data without graph structure by changing the input into a distance matrix and the output into cluster labels. The results indicate that mini batch k-means has the best overall performance in terms of accuracy, time, and space consumption. RD and spectral clustering achieve a competitive adjusted Rand index (ARI), although their time and space consumption is considerable and can reach 2 and 30 times greater than that of mini batch k-means when applied to the artificial datasets. DBSCAN, on the other hand, produces an ARI as low as 0% in most default cases, but its ARI rises to as much as 100% in almost all experiments on the artificial datasets after the parameters are varied. DBSCAN's accuracy, time, and space consumption, however, are still worse than those of mini batch k-means.
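
The summary mentions two building blocks: a distance matrix as the input to RD, and the adjusted Rand index as the accuracy measure. The sketch below shows one way to compute them using only the Python standard library. It is not the authors' implementation and does not include RD or any of the benchmark clustering methods. The function names `distance_matrix` and `adjusted_rand_index`, the Euclidean metric, and the toy points are assumptions made for illustration.

```python
from collections import Counter
from math import comb, dist


def distance_matrix(points):
    """Return the full pairwise Euclidean distance matrix as nested lists.

    Assumption: Euclidean distance; the record does not state the metric.
    """
    return [[dist(p, q) for q in points] for p in points]


def adjusted_rand_index(labels_true, labels_pred):
    """Compute the ARI from the contingency table of two labelings."""
    if len(labels_true) != len(labels_pred):
        raise ValueError("labelings must have the same length")
    n = len(labels_true)
    total_pairs = comb(n, 2)
    if total_pairs == 0:
        return 1.0  # fewer than two items: labelings trivially agree

    # Pairs of items that share a cluster in both labelings, in the
    # reference labeling only, and in the predicted labeling only.
    cells = Counter(zip(labels_true, labels_pred))
    sum_cells = sum(comb(c, 2) for c in cells.values())
    sum_rows = sum(comb(c, 2) for c in Counter(labels_true).values())
    sum_cols = sum(comb(c, 2) for c in Counter(labels_pred).values())

    expected = sum_rows * sum_cols / total_pairs  # agreement expected by chance
    max_index = (sum_rows + sum_cols) / 2
    if max_index == expected:
        return 1.0  # degenerate case, e.g. both labelings are one cluster
    return (sum_cells - expected) / (max_index - expected)


# Hypothetical toy data: two well-separated groups of two points each.
points = [(0, 0), (0, 1), (5, 5), (5, 6)]
truth = [0, 0, 1, 1]

D = distance_matrix(points)
print(D[0][1])                                     # distance between the first two points
print(adjusted_rand_index(truth, [1, 1, 0, 0]))    # same partition, labels swapped
print(adjusted_rand_index(truth, [0, 0, 0, 0]))    # every point in a single cluster
```

Because the ARI is corrected for chance, it does not depend on how clusters are numbered, and a labeling that places every point in one cluster scores 0 against a two-cluster reference.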