Unsupervised celebrity face naming in web videos

This paper investigates the problem of celebrity face naming in unconstrained videos with user-provided metadata. Instead of relying on accurate face labels for supervised learning, a rich set of relationships automatically derived from video content and knowledge from image domain and social cues i...

Full description

Saved in:
Bibliographic Details
Main Authors: PANG, Lei, NGO, Chong-wah
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2015
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6361
https://ink.library.smu.edu.sg/context/sis_research/article/7364/viewcontent/07078858.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7364
record_format dspace
spelling sg-smu-ink.sis_research-73642021-11-23T02:54:40Z Unsupervised celebrity face naming in web videos PANG, Lei NGO, Chong-wah This paper investigates the problem of celebrity face naming in unconstrained videos with user-provided metadata. Instead of relying on accurate face labels for supervised learning, a rich set of relationships automatically derived from video content and knowledge from image domain and social cues is leveraged for unsupervised face labeling. The relationships refer to the appearances of faces under different spatio-temporal contexts and their visual similarities. The knowledge includes Web images weakly tagged with celebrity names and the celebrity social networks. The relationships and knowledge are elegantly encoded using conditional random field (CRF) for label inference. Two versions of face annotation are considered: within-video and between-video face labeling. The former addresses the problem of incomplete and noisy labels in metadata, where null assignment of names is allowed-a problem seldom been considered in the literature. The latter further rectifies the errors in metadata, specifically to correct false labels and annotate faces with missing names in the metadata of a video, by considering a group of socially connected videos for joint label inference. Experimental results on a large archive of Web videos show the robustness of the proposed approach in dealing with the problems of missing and false labels, leading to higher accuracy in face labeling than several existing approaches but with minor degradation in speed efficiency. 2015-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6361 info:doi/10.1109/TMM.2015.2419452 https://ink.library.smu.edu.sg/context/sis_research/article/7364/viewcontent/07078858.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Celebrity face naming social network unconstrained web videos unsupervised Computer Sciences Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Celebrity face naming
social network
unconstrained web videos
unsupervised
Computer Sciences
Graphics and Human Computer Interfaces
spellingShingle Celebrity face naming
social network
unconstrained web videos
unsupervised
Computer Sciences
Graphics and Human Computer Interfaces
PANG, Lei
NGO, Chong-wah
Unsupervised celebrity face naming in web videos
description This paper investigates the problem of celebrity face naming in unconstrained videos with user-provided metadata. Instead of relying on accurate face labels for supervised learning, a rich set of relationships automatically derived from video content and knowledge from image domain and social cues is leveraged for unsupervised face labeling. The relationships refer to the appearances of faces under different spatio-temporal contexts and their visual similarities. The knowledge includes Web images weakly tagged with celebrity names and the celebrity social networks. The relationships and knowledge are elegantly encoded using conditional random field (CRF) for label inference. Two versions of face annotation are considered: within-video and between-video face labeling. The former addresses the problem of incomplete and noisy labels in metadata, where null assignment of names is allowed-a problem seldom been considered in the literature. The latter further rectifies the errors in metadata, specifically to correct false labels and annotate faces with missing names in the metadata of a video, by considering a group of socially connected videos for joint label inference. Experimental results on a large archive of Web videos show the robustness of the proposed approach in dealing with the problems of missing and false labels, leading to higher accuracy in face labeling than several existing approaches but with minor degradation in speed efficiency.
format text
author PANG, Lei
NGO, Chong-wah
author_facet PANG, Lei
NGO, Chong-wah
author_sort PANG, Lei
title Unsupervised celebrity face naming in web videos
title_short Unsupervised celebrity face naming in web videos
title_full Unsupervised celebrity face naming in web videos
title_fullStr Unsupervised celebrity face naming in web videos
title_full_unstemmed Unsupervised celebrity face naming in web videos
title_sort unsupervised celebrity face naming in web videos
publisher Institutional Knowledge at Singapore Management University
publishDate 2015
url https://ink.library.smu.edu.sg/sis_research/6361
https://ink.library.smu.edu.sg/context/sis_research/article/7364/viewcontent/07078858.pdf
_version_ 1770575941860327424