Learning to name faces: A multimodal learning scheme for search-based face annotation
Automated face annotation aims to automatically detect human faces from a photo and further name the faces with the corresponding human names. In this paper, we tackle this open problem by investigating a search-based face annotation (SBFA) paradigm for mining large amounts of web facial images free...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2013
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/2336 https://ink.library.smu.edu.sg/context/sis_research/article/3336/viewcontent/Learniing_to_Name_Faces_A_Multimodal_Learning_Scheme_for_Search_Based_Face_Annotation.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-3336 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-33362020-04-01T02:58:06Z Learning to name faces: A multimodal learning scheme for search-based face annotation WANG, Dayong HOI, Steven C. H. WU, Pengcheng ZHU, Jianke HE, Ying MIAO, Chunyan Automated face annotation aims to automatically detect human faces from a photo and further name the faces with the corresponding human names. In this paper, we tackle this open problem by investigating a search-based face annotation (SBFA) paradigm for mining large amounts of web facial images freely available on the WWW. Given a query facial image for annotation, the idea of SBFA is to first search for top-n similar facial images from a web facial image database and then exploit these top-ranked similar facial images and their weak labels for naming the query facial image. To fully mine those information, this paper proposes a novel framework of Learning to Name Faces (L2NF) – a unified multimodal learning approach for search-based face annotation, which consists of the following major components: (i) we enhance the weak labels of top-ranked similar images by exploiting the “label smoothness" assumption; (ii) we construct the multimodal representations of a facial image by extracting different types of features; (iii) we optimize the distance measure for each type of features using distance metric learning techniques; and finally (iv) we learn the optimal combination of multiple modalities for annotation through a learning to rank scheme. We conduct a set of extensive empirical studies on two real-world facial image databases, in which encouraging results show that the proposed algorithms significantly boost the naming accuracy of search-based face annotation task. 2013-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2336 info:doi/10.1145/2484028.2484040 https://ink.library.smu.edu.sg/context/sis_research/article/3336/viewcontent/Learniing_to_Name_Faces_A_Multimodal_Learning_Scheme_for_Search_Based_Face_Annotation.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University web facial images auto face annotation supervised learning Computer Sciences Databases and Information Systems Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
web facial images auto face annotation supervised learning Computer Sciences Databases and Information Systems Numerical Analysis and Scientific Computing |
spellingShingle |
web facial images auto face annotation supervised learning Computer Sciences Databases and Information Systems Numerical Analysis and Scientific Computing WANG, Dayong HOI, Steven C. H. WU, Pengcheng ZHU, Jianke HE, Ying MIAO, Chunyan Learning to name faces: A multimodal learning scheme for search-based face annotation |
description |
Automated face annotation aims to automatically detect human faces from a photo and further name the faces with the corresponding human names. In this paper, we tackle this open problem by investigating a search-based face annotation (SBFA) paradigm for mining large amounts of web facial images freely available on the WWW. Given a query facial image for annotation, the idea of SBFA is to first search for top-n similar facial images from a web facial image database and then exploit these top-ranked similar facial images and their weak labels for naming the query facial image. To fully mine those information, this paper proposes a novel framework of Learning to Name Faces (L2NF) – a unified multimodal learning approach for search-based face annotation, which consists of the following major components: (i) we enhance the weak labels of top-ranked similar images by exploiting the “label smoothness" assumption; (ii) we construct the multimodal representations of a facial image by extracting different types of features; (iii) we optimize the distance measure for each type of features using distance metric learning techniques; and finally (iv) we learn the optimal combination of multiple modalities for annotation through a learning to rank scheme. We conduct a set of extensive empirical studies on two real-world facial image databases, in which encouraging results show that the proposed algorithms significantly boost the naming accuracy of search-based face annotation task. |
format |
text |
author |
WANG, Dayong HOI, Steven C. H. WU, Pengcheng ZHU, Jianke HE, Ying MIAO, Chunyan |
author_facet |
WANG, Dayong HOI, Steven C. H. WU, Pengcheng ZHU, Jianke HE, Ying MIAO, Chunyan |
author_sort |
WANG, Dayong |
title |
Learning to name faces: A multimodal learning scheme for search-based face annotation |
title_short |
Learning to name faces: A multimodal learning scheme for search-based face annotation |
title_full |
Learning to name faces: A multimodal learning scheme for search-based face annotation |
title_fullStr |
Learning to name faces: A multimodal learning scheme for search-based face annotation |
title_full_unstemmed |
Learning to name faces: A multimodal learning scheme for search-based face annotation |
title_sort |
learning to name faces: a multimodal learning scheme for search-based face annotation |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2013 |
url |
https://ink.library.smu.edu.sg/sis_research/2336 https://ink.library.smu.edu.sg/context/sis_research/article/3336/viewcontent/Learniing_to_Name_Faces_A_Multimodal_Learning_Scheme_for_Search_Based_Face_Annotation.pdf |
_version_ |
1770572101315461120 |