GMFAD: Towards generalized visual recognition via multilayer feature alignment and disentanglement

The deep learning based approaches which have been repeatedly proven to bring benefits to visual recognition tasks usually make a strong assumption that the training and test data are drawn from similar feature spaces and distributions. However, such an assumption may not always hold in various prac...

Full description

Saved in:

Bibliographic Details
Main Authors:	Li, Haoliang, Wang, Shiqi, Wan, Renjie, Kot, Alex Chichung
Other Authors:	School of Electrical and Electronic Engineering
Format:	Article
Language:	English
Published:	2022
Subjects:	Engineering::Electrical and electronic engineering Engineering::Computer science and engineering Generalization Capability Covariance Matrix
Online Access:	https://hdl.handle.net/10356/161899
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-161899
record_format	dspace
spelling	sg-ntu-dr.10356-1618992022-09-26T00:57:31Z GMFAD: Towards generalized visual recognition via multilayer feature alignment and disentanglement Li, Haoliang Wang, Shiqi Wan, Renjie Kot, Alex Chichung School of Electrical and Electronic Engineering Rapid-Rich Object Search (ROSE) Lab Engineering::Electrical and electronic engineering Engineering::Computer science and engineering Generalization Capability Covariance Matrix The deep learning based approaches which have been repeatedly proven to bring benefits to visual recognition tasks usually make a strong assumption that the training and test data are drawn from similar feature spaces and distributions. However, such an assumption may not always hold in various practical application scenarios on visual recognition tasks. Inspired by the hierarchical organization of deep feature representation that progressively leads to more abstract features at higher layers of representations, we propose to tackle this problem with a novel feature learning framework, which is called GMFAD, with better generalization capability in a multilayer perceptron manner. We first learn feature representations at the shallow layer where shareable underlying factors among domains (e.g., a subset of which could be relevant for each particular domain) can be explored. In particular, we propose to align the domain divergence between domain pair(s) by considering both inter-dimension and inter-sample correlations, which have been largely ignored by many cross-domain visual recognition methods. Subsequently, to learn more abstract information which could further benefit transferability, we propose to conduct feature disentanglement at the deep feature layer. Extensive experiments based on different visual recognition tasks demonstrate that our proposed framework can learn better transferable feature representation compared with state-of-the-art baselines. Nanyang Technological University This research was supported in part by the Wallenberg-NTU Presidential Postdoctoral Fellowship, the NTU-PKU Joint Research Institute, a collaboration between the Nanyang Technological University and Peking University that is sponsored by a donation from the Ng Teng Fong Charitable Foundation, the Science and Technology Foundation of Guangzhou Huangpu Development District under Grant 2017GH22 and 201902 010028, and Sino-Singapore International Joint Research Institute (Project No. 206-A017023 and 206-A018001). 2022-09-26T00:57:31Z 2022-09-26T00:57:31Z 2020 Journal Article Li, H., Wang, S., Wan, R. & Kot, A. C. (2020). GMFAD: Towards generalized visual recognition via multilayer feature alignment and disentanglement. IEEE Transactions On Pattern Analysis and Machine Intelligence, 44(3), 1289-1303. https://dx.doi.org/10.1109/TPAMI.2020.3020554 0162-8828 https://hdl.handle.net/10356/161899 10.1109/TPAMI.2020.3020554 32870783 2-s2.0-85124055653 3 44 1289 1303 en 206-A017023 206-A018001 IEEE Transactions on Pattern Analysis and Machine Intelligence © 2020 IEEE. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering Engineering::Computer science and engineering Generalization Capability Covariance Matrix
spellingShingle	Engineering::Electrical and electronic engineering Engineering::Computer science and engineering Generalization Capability Covariance Matrix Li, Haoliang Wang, Shiqi Wan, Renjie Kot, Alex Chichung GMFAD: Towards generalized visual recognition via multilayer feature alignment and disentanglement
description	The deep learning based approaches which have been repeatedly proven to bring benefits to visual recognition tasks usually make a strong assumption that the training and test data are drawn from similar feature spaces and distributions. However, such an assumption may not always hold in various practical application scenarios on visual recognition tasks. Inspired by the hierarchical organization of deep feature representation that progressively leads to more abstract features at higher layers of representations, we propose to tackle this problem with a novel feature learning framework, which is called GMFAD, with better generalization capability in a multilayer perceptron manner. We first learn feature representations at the shallow layer where shareable underlying factors among domains (e.g., a subset of which could be relevant for each particular domain) can be explored. In particular, we propose to align the domain divergence between domain pair(s) by considering both inter-dimension and inter-sample correlations, which have been largely ignored by many cross-domain visual recognition methods. Subsequently, to learn more abstract information which could further benefit transferability, we propose to conduct feature disentanglement at the deep feature layer. Extensive experiments based on different visual recognition tasks demonstrate that our proposed framework can learn better transferable feature representation compared with state-of-the-art baselines.
author2	School of Electrical and Electronic Engineering
author_facet	School of Electrical and Electronic Engineering Li, Haoliang Wang, Shiqi Wan, Renjie Kot, Alex Chichung
format	Article
author	Li, Haoliang Wang, Shiqi Wan, Renjie Kot, Alex Chichung
author_sort	Li, Haoliang
title	GMFAD: Towards generalized visual recognition via multilayer feature alignment and disentanglement
title_short	GMFAD: Towards generalized visual recognition via multilayer feature alignment and disentanglement
title_full	GMFAD: Towards generalized visual recognition via multilayer feature alignment and disentanglement
title_fullStr	GMFAD: Towards generalized visual recognition via multilayer feature alignment and disentanglement
title_full_unstemmed	GMFAD: Towards generalized visual recognition via multilayer feature alignment and disentanglement
title_sort	gmfad: towards generalized visual recognition via multilayer feature alignment and disentanglement
publishDate	2022
url	https://hdl.handle.net/10356/161899
_version_	1745574653731012608

GMFAD: Towards generalized visual recognition via multilayer feature alignment and disentanglement

Similar Items