On mitigating hard clusters for face clustering

Face clustering is a promising way to scale up face recognition systems using large-scale unlabeled face images. It remains challenging to identify small or sparse face image clusters that we call hard clusters, which is caused by the heterogeneity, i.e., high variations in size and sparsity, of the...

Full description

Saved in:

Bibliographic Details
Main Authors:	CHEN, Yingjie, ZHONG, Huasong, CHEN, Chong, SHEN, Chen, HUANG, Jianqiang, WANG, Tao, LIANG, Yun, Qianru SUN
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2022
Subjects:	face clustering unsupervised learning density estimation Databases and Information Systems Graphics and Human Computer Interfaces
Online Access:	https://ink.library.smu.edu.sg/sis_research/7512 https://ink.library.smu.edu.sg/context/sis_research/article/8515/viewcontent/ECCV2022_FaceClustering.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

Description
Summary:	Face clustering is a promising way to scale up face recognition systems using large-scale unlabeled face images. It remains challenging to identify small or sparse face image clusters that we call hard clusters, which is caused by the heterogeneity, i.e., high variations in size and sparsity, of the clusters. Consequently, the conventional way of using a uniform threshold (to identify clusters) often leads to a terrible misclassification for the samples that should belong to hard clusters. We tackle this problem by leveraging the neighborhood information of samples and inferring the cluster memberships (of samples) in a probabilistic way. We introduce two novel modules, Neighborhood-Diffusion-based Density (NDDe) and Transition-Probability-based Distance (TPDi), based on which we can simply apply the standard Density Peak Clustering algorithm with a uniform threshold. Our experiments on multiple benchmarks show that each module contributes to the final performance of our method, and by incorporating them into other advanced face clustering methods, these two modules can boost the performance of these methods to a new state-of-the-art.

On mitigating hard clusters for face clustering

Similar Items