Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach

Recent work has demonstrated the effectiveness of domain adaptation methods for computer vision applications. In this work, we propose a new multiple source domain adaptation method called Domain Selection Machine (DSM) for event recognition in consumer videos by leveraging a large number of loosely...

Full description

Saved in:
Bibliographic Details
Main Authors: Duan, Lixin, Xu, Dong, Chang, Shih-Fu
Other Authors: School of Computer Engineering
Format: Conference or Workshop Item
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/98517
http://hdl.handle.net/10220/12479
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-98517
record_format dspace
spelling sg-ntu-dr.10356-985172020-05-28T07:17:41Z Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach Duan, Lixin Xu, Dong Chang, Shih-Fu School of Computer Engineering IEEE Conference on Computer Vision and Pattern Recognition (2012 : Providence, Rhode Island, US) DRNTU::Engineering::Computer science and engineering Recent work has demonstrated the effectiveness of domain adaptation methods for computer vision applications. In this work, we propose a new multiple source domain adaptation method called Domain Selection Machine (DSM) for event recognition in consumer videos by leveraging a large number of loosely labeled web images from different sources (e.g., Flickr.com and Photosig.com), in which there are no labeled consumer videos. Specifically, we first train a set of SVM classifiers (referred to as source classifiers) by using the SIFT features of web images from different source domains. We propose a new parametric target decision function to effectively integrate the static SIFT features from web images/video keyframes and the spacetime (ST) features from consumer videos. In order to select the most relevant source domains, we further introduce a new data-dependent regularizer into the objective of Support Vector Regression (SVR) using the ϵ-insensitive loss, which enforces the target classifier shares similar decision values on the unlabeled consumer videos with the selected source classifiers. Moreover, we develop an alternating optimization algorithm to iteratively solve the target decision function and a domain selection vector which indicates the most relevant source domains. Extensive experiments on three real-world datasets demonstrate the effectiveness of our proposed method DSM over the state-of-the-art by a performance gain up to 46.41%. 2013-07-29T07:13:45Z 2019-12-06T19:56:25Z 2013-07-29T07:13:45Z 2019-12-06T19:56:25Z 2012 2012 Conference Paper Duan, L., Xu, D., & Chang, S.-F. (2012). Exploiting web images for event recognition in consumer videos: A multiple source domain adaptation approach. 2012 IEEE Conference on Computer Vision and Pattern Recognition. https://hdl.handle.net/10356/98517 http://hdl.handle.net/10220/12479 10.1109/CVPR.2012.6247819 en © 2012 IEEE.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Duan, Lixin
Xu, Dong
Chang, Shih-Fu
Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
description Recent work has demonstrated the effectiveness of domain adaptation methods for computer vision applications. In this work, we propose a new multiple source domain adaptation method called Domain Selection Machine (DSM) for event recognition in consumer videos by leveraging a large number of loosely labeled web images from different sources (e.g., Flickr.com and Photosig.com), in which there are no labeled consumer videos. Specifically, we first train a set of SVM classifiers (referred to as source classifiers) by using the SIFT features of web images from different source domains. We propose a new parametric target decision function to effectively integrate the static SIFT features from web images/video keyframes and the spacetime (ST) features from consumer videos. In order to select the most relevant source domains, we further introduce a new data-dependent regularizer into the objective of Support Vector Regression (SVR) using the ϵ-insensitive loss, which enforces the target classifier shares similar decision values on the unlabeled consumer videos with the selected source classifiers. Moreover, we develop an alternating optimization algorithm to iteratively solve the target decision function and a domain selection vector which indicates the most relevant source domains. Extensive experiments on three real-world datasets demonstrate the effectiveness of our proposed method DSM over the state-of-the-art by a performance gain up to 46.41%.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Duan, Lixin
Xu, Dong
Chang, Shih-Fu
format Conference or Workshop Item
author Duan, Lixin
Xu, Dong
Chang, Shih-Fu
author_sort Duan, Lixin
title Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
title_short Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
title_full Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
title_fullStr Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
title_full_unstemmed Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
title_sort exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
publishDate 2013
url https://hdl.handle.net/10356/98517
http://hdl.handle.net/10220/12479
_version_ 1681056536889131008