Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach
Recent work has demonstrated the effectiveness of domain adaptation methods for computer vision applications. In this work, we propose a new multiple source domain adaptation method called Domain Selection Machine (DSM) for event recognition in consumer videos by leveraging a large number of loosely...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/98517 http://hdl.handle.net/10220/12479 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-98517 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-985172020-05-28T07:17:41Z Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach Duan, Lixin Xu, Dong Chang, Shih-Fu School of Computer Engineering IEEE Conference on Computer Vision and Pattern Recognition (2012 : Providence, Rhode Island, US) DRNTU::Engineering::Computer science and engineering Recent work has demonstrated the effectiveness of domain adaptation methods for computer vision applications. In this work, we propose a new multiple source domain adaptation method called Domain Selection Machine (DSM) for event recognition in consumer videos by leveraging a large number of loosely labeled web images from different sources (e.g., Flickr.com and Photosig.com), in which there are no labeled consumer videos. Specifically, we first train a set of SVM classifiers (referred to as source classifiers) by using the SIFT features of web images from different source domains. We propose a new parametric target decision function to effectively integrate the static SIFT features from web images/video keyframes and the spacetime (ST) features from consumer videos. In order to select the most relevant source domains, we further introduce a new data-dependent regularizer into the objective of Support Vector Regression (SVR) using the ϵ-insensitive loss, which enforces the target classifier shares similar decision values on the unlabeled consumer videos with the selected source classifiers. Moreover, we develop an alternating optimization algorithm to iteratively solve the target decision function and a domain selection vector which indicates the most relevant source domains. Extensive experiments on three real-world datasets demonstrate the effectiveness of our proposed method DSM over the state-of-the-art by a performance gain up to 46.41%. 2013-07-29T07:13:45Z 2019-12-06T19:56:25Z 2013-07-29T07:13:45Z 2019-12-06T19:56:25Z 2012 2012 Conference Paper Duan, L., Xu, D., & Chang, S.-F. (2012). Exploiting web images for event recognition in consumer videos: A multiple source domain adaptation approach. 2012 IEEE Conference on Computer Vision and Pattern Recognition. https://hdl.handle.net/10356/98517 http://hdl.handle.net/10220/12479 10.1109/CVPR.2012.6247819 en © 2012 IEEE. |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering |
spellingShingle |
DRNTU::Engineering::Computer science and engineering Duan, Lixin Xu, Dong Chang, Shih-Fu Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach |
description |
Recent work has demonstrated the effectiveness of domain adaptation methods for computer vision applications. In this work, we propose a new multiple source domain adaptation method called Domain Selection Machine (DSM) for event recognition in consumer videos by leveraging a large number of loosely labeled web images from different sources (e.g., Flickr.com and Photosig.com), in which there are no labeled consumer videos. Specifically, we first train a set of SVM classifiers (referred to as source classifiers) by using the SIFT features of web images from different source domains. We propose a new parametric target decision function to effectively integrate the static SIFT features from web images/video keyframes and the spacetime (ST) features from consumer videos. In order to select the most relevant source domains, we further introduce a new data-dependent regularizer into the objective of Support Vector Regression (SVR) using the ϵ-insensitive loss, which enforces the target classifier shares similar decision values on the unlabeled consumer videos with the selected source classifiers. Moreover, we develop an alternating optimization algorithm to iteratively solve the target decision function and a domain selection vector which indicates the most relevant source domains. Extensive experiments on three real-world datasets demonstrate the effectiveness of our proposed method DSM over the state-of-the-art by a performance gain up to 46.41%. |
author2 |
School of Computer Engineering |
author_facet |
School of Computer Engineering Duan, Lixin Xu, Dong Chang, Shih-Fu |
format |
Conference or Workshop Item |
author |
Duan, Lixin Xu, Dong Chang, Shih-Fu |
author_sort |
Duan, Lixin |
title |
Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach |
title_short |
Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach |
title_full |
Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach |
title_fullStr |
Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach |
title_full_unstemmed |
Exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach |
title_sort |
exploiting web images for event recognition in consumer videos : a multiple source domain adaptation approach |
publishDate |
2013 |
url |
https://hdl.handle.net/10356/98517 http://hdl.handle.net/10220/12479 |
_version_ |
1681056536889131008 |