Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach

Recent work has demonstrated the effectiveness of domain adaptation methods for computer vision applications. In this work, we propose a new multiple source domain adaptation method called Domain Selection Machine (DSM) for event recognition in consumer videos by leveraging a large number of loosely...

Full description

Saved in:

Bibliographic Details
Main Authors:	Duan, Lixin, Xu, Dong, Chang, Shih-Fu
Other Authors:	School of Computer Engineering
Format:	Conference or Workshop Item
Language:	English
Published:	2013
Subjects:	DRNTU::Engineering::Computer science and engineering
Online Access:	https://hdl.handle.net/10356/98517 http://hdl.handle.net/10220/12479
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-98517
record_format	dspace
spelling	sg-ntu-dr.10356-985172020-05-28T07:17:41Z Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach Duan, Lixin Xu, Dong Chang, Shih-Fu School of Computer Engineering IEEE Conference on Computer Vision and Pattern Recognition (2012 : Providence, Rhode Island, US) DRNTU::Engineering::Computer science and engineering Recent work has demonstrated the effectiveness of domain adaptation methods for computer vision applications. In this work, we propose a new multiple source domain adaptation method called Domain Selection Machine (DSM) for event recognition in consumer videos by leveraging a large number of loosely labeled web images from different sources (e.g., Flickr.com and Photosig.com), in which there are no labeled consumer videos. Specifically, we first train a set of SVM classifiers (referred to as source classifiers) by using the SIFT features of web images from different source domains. We propose a new parametric target decision function to effectively integrate the static SIFT features from web images/video keyframes and the spacetime (ST) features from consumer videos. In order to select the most relevant source domains, we further introduce a new data-dependent regularizer into the objective of Support Vector Regression (SVR) using the ϵ-insensitive loss, which enforces the target classifier shares similar decision values on the unlabeled consumer videos with the selected source classifiers. Moreover, we develop an alternating optimization algorithm to iteratively solve the target decision function and a domain selection vector which indicates the most relevant source domains. Extensive experiments on three real-world datasets demonstrate the effectiveness of our proposed method DSM over the state-of-the-art by a performance gain up to 46.41%. 2013-07-29T07:13:45Z 2019-12-06T19:56:25Z 2013-07-29T07:13:45Z 2019-12-06T19:56:25Z 2012 2012 Conference Paper Duan, L., Xu, D., & Chang, S.-F. (2012). Exploiting web images for event recognition in consumer videos: A multiple source domain adaptation approach. 2012 IEEE Conference on Computer Vision and Pattern Recognition. https://hdl.handle.net/10356/98517 http://hdl.handle.net/10220/12479 10.1109/CVPR.2012.6247819 en © 2012 IEEE.
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering
spellingShingle	DRNTU::Engineering::Computer science and engineering Duan, Lixin Xu, Dong Chang, Shih-Fu Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
description	Recent work has demonstrated the effectiveness of domain adaptation methods for computer vision applications. In this work, we propose a new multiple source domain adaptation method called Domain Selection Machine (DSM) for event recognition in consumer videos by leveraging a large number of loosely labeled web images from different sources (e.g., Flickr.com and Photosig.com), in which there are no labeled consumer videos. Specifically, we first train a set of SVM classifiers (referred to as source classifiers) by using the SIFT features of web images from different source domains. We propose a new parametric target decision function to effectively integrate the static SIFT features from web images/video keyframes and the spacetime (ST) features from consumer videos. In order to select the most relevant source domains, we further introduce a new data-dependent regularizer into the objective of Support Vector Regression (SVR) using the ϵ-insensitive loss, which enforces the target classifier shares similar decision values on the unlabeled consumer videos with the selected source classifiers. Moreover, we develop an alternating optimization algorithm to iteratively solve the target decision function and a domain selection vector which indicates the most relevant source domains. Extensive experiments on three real-world datasets demonstrate the effectiveness of our proposed method DSM over the state-of-the-art by a performance gain up to 46.41%.
author2	School of Computer Engineering
author_facet	School of Computer Engineering Duan, Lixin Xu, Dong Chang, Shih-Fu
format	Conference or Workshop Item
author	Duan, Lixin Xu, Dong Chang, Shih-Fu
author_sort	Duan, Lixin
title	Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
title_short	Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
title_full	Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
title_fullStr	Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
title_full_unstemmed	Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
title_sort	exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
publishDate	2013
url	https://hdl.handle.net/10356/98517 http://hdl.handle.net/10220/12479
_version_	1681056536889131008

Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach

Similar Items