Learning to name faces: A multimodal learning scheme for search-based face annotation

Automated face annotation aims to automatically detect human faces from a photo and further name the faces with the corresponding human names. In this paper, we tackle this open problem by investigating a search-based face annotation (SBFA) paradigm for mining large amounts of web facial images free...

Full description

Saved in:
Bibliographic Details
Main Authors: WANG, Dayong, HOI, Steven C. H., WU, Pengcheng, ZHU, Jianke, HE, Ying, MIAO, Chunyan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2013
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/2336
https://ink.library.smu.edu.sg/context/sis_research/article/3336/viewcontent/Learniing_to_Name_Faces_A_Multimodal_Learning_Scheme_for_Search_Based_Face_Annotation.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-3336
record_format dspace
spelling sg-smu-ink.sis_research-33362020-04-01T02:58:06Z Learning to name faces: A multimodal learning scheme for search-based face annotation WANG, Dayong HOI, Steven C. H. WU, Pengcheng ZHU, Jianke HE, Ying MIAO, Chunyan Automated face annotation aims to automatically detect human faces from a photo and further name the faces with the corresponding human names. In this paper, we tackle this open problem by investigating a search-based face annotation (SBFA) paradigm for mining large amounts of web facial images freely available on the WWW. Given a query facial image for annotation, the idea of SBFA is to first search for top-n similar facial images from a web facial image database and then exploit these top-ranked similar facial images and their weak labels for naming the query facial image. To fully mine those information, this paper proposes a novel framework of Learning to Name Faces (L2NF) – a unified multimodal learning approach for search-based face annotation, which consists of the following major components: (i) we enhance the weak labels of top-ranked similar images by exploiting the “label smoothness" assumption; (ii) we construct the multimodal representations of a facial image by extracting different types of features; (iii) we optimize the distance measure for each type of features using distance metric learning techniques; and finally (iv) we learn the optimal combination of multiple modalities for annotation through a learning to rank scheme. We conduct a set of extensive empirical studies on two real-world facial image databases, in which encouraging results show that the proposed algorithms significantly boost the naming accuracy of search-based face annotation task. 2013-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2336 info:doi/10.1145/2484028.2484040 https://ink.library.smu.edu.sg/context/sis_research/article/3336/viewcontent/Learniing_to_Name_Faces_A_Multimodal_Learning_Scheme_for_Search_Based_Face_Annotation.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University web facial images auto face annotation supervised learning Computer Sciences Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic web facial images
auto face annotation
supervised learning
Computer Sciences
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle web facial images
auto face annotation
supervised learning
Computer Sciences
Databases and Information Systems
Numerical Analysis and Scientific Computing
WANG, Dayong
HOI, Steven C. H.
WU, Pengcheng
ZHU, Jianke
HE, Ying
MIAO, Chunyan
Learning to name faces: A multimodal learning scheme for search-based face annotation
description Automated face annotation aims to automatically detect human faces from a photo and further name the faces with the corresponding human names. In this paper, we tackle this open problem by investigating a search-based face annotation (SBFA) paradigm for mining large amounts of web facial images freely available on the WWW. Given a query facial image for annotation, the idea of SBFA is to first search for top-n similar facial images from a web facial image database and then exploit these top-ranked similar facial images and their weak labels for naming the query facial image. To fully mine those information, this paper proposes a novel framework of Learning to Name Faces (L2NF) – a unified multimodal learning approach for search-based face annotation, which consists of the following major components: (i) we enhance the weak labels of top-ranked similar images by exploiting the “label smoothness" assumption; (ii) we construct the multimodal representations of a facial image by extracting different types of features; (iii) we optimize the distance measure for each type of features using distance metric learning techniques; and finally (iv) we learn the optimal combination of multiple modalities for annotation through a learning to rank scheme. We conduct a set of extensive empirical studies on two real-world facial image databases, in which encouraging results show that the proposed algorithms significantly boost the naming accuracy of search-based face annotation task.
format text
author WANG, Dayong
HOI, Steven C. H.
WU, Pengcheng
ZHU, Jianke
HE, Ying
MIAO, Chunyan
author_facet WANG, Dayong
HOI, Steven C. H.
WU, Pengcheng
ZHU, Jianke
HE, Ying
MIAO, Chunyan
author_sort WANG, Dayong
title Learning to name faces: A multimodal learning scheme for search-based face annotation
title_short Learning to name faces: A multimodal learning scheme for search-based face annotation
title_full Learning to name faces: A multimodal learning scheme for search-based face annotation
title_fullStr Learning to name faces: A multimodal learning scheme for search-based face annotation
title_full_unstemmed Learning to name faces: A multimodal learning scheme for search-based face annotation
title_sort learning to name faces: a multimodal learning scheme for search-based face annotation
publisher Institutional Knowledge at Singapore Management University
publishDate 2013
url https://ink.library.smu.edu.sg/sis_research/2336
https://ink.library.smu.edu.sg/context/sis_research/article/3336/viewcontent/Learniing_to_Name_Faces_A_Multimodal_Learning_Scheme_for_Search_Based_Face_Annotation.pdf
_version_ 1770572101315461120