Disentangling multi-view representations beyond inductive bias

Multi-view (or -modality) representation learning aims to understand the relationships between different view representations. Existing methods disentangle multi-view representations into consistent and view-specific representations by introducing strong inductive biases, which can limit their gener...

Full description

Saved in:
Bibliographic Details
Main Authors: KE, Guanzhou, YU, Yang, CHAO, Guoqing, WANG, Xiaoli, XU, Chenyang, HE, Shengfeng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8420
https://ink.library.smu.edu.sg/context/sis_research/article/9423/viewcontent/Disentangling_Multi_view_Representations_Beyond_Inductive_Bias.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9423
record_format dspace
spelling sg-smu-ink.sis_research-94232024-01-09T03:31:18Z Disentangling multi-view representations beyond inductive bias KE, Guanzhou YU, Yang CHAO, Guoqing WANG, Xiaoli XU, Chenyang HE, Shengfeng Multi-view (or -modality) representation learning aims to understand the relationships between different view representations. Existing methods disentangle multi-view representations into consistent and view-specific representations by introducing strong inductive biases, which can limit their generalization ability. In this paper, we propose a novel multi-view representation disentangling method that aims to go beyond inductive biases, ensuring both interpretability and generalizability of the resulting representations. Our method is based on the observation that discovering multi-view consistency in advance can determine the disentangling information boundary, leading to a decoupled learning objective. We also found that the consistency can be easily extracted by maximizing the transformation invariance and clustering consistency between views. These observations drive us to propose a two-stage framework. In the first stage, we obtain multi-view consistency by training a consistent encoder to produce semantically-consistent representations across views as well as their corresponding pseudo-labels. In the second stage, we disentangle specificity from comprehensive representations by minimizing the upper bound of mutual information between consistent and comprehensive representations. Finally, we reconstruct the original data by concatenating pseudo-labels and view-specific representations. Our experiments on four multi-view datasets demonstrate that our proposed method outperforms 12 comparison methods in terms of clustering and classification performance. The visualization results also show that the extracted consistency and specificity are compact and interpretable. 2023-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8420 info:doi/10.1145/3581783.3611794 https://ink.library.smu.edu.sg/context/sis_research/article/9423/viewcontent/Disentangling_Multi_view_Representations_Beyond_Inductive_Bias.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Multi-view representation learning Disentangled representation Consistency and specificity Databases and Information Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Multi-view representation learning
Disentangled representation
Consistency and specificity
Databases and Information Systems
Graphics and Human Computer Interfaces
spellingShingle Multi-view representation learning
Disentangled representation
Consistency and specificity
Databases and Information Systems
Graphics and Human Computer Interfaces
KE, Guanzhou
YU, Yang
CHAO, Guoqing
WANG, Xiaoli
XU, Chenyang
HE, Shengfeng
Disentangling multi-view representations beyond inductive bias
description Multi-view (or -modality) representation learning aims to understand the relationships between different view representations. Existing methods disentangle multi-view representations into consistent and view-specific representations by introducing strong inductive biases, which can limit their generalization ability. In this paper, we propose a novel multi-view representation disentangling method that aims to go beyond inductive biases, ensuring both interpretability and generalizability of the resulting representations. Our method is based on the observation that discovering multi-view consistency in advance can determine the disentangling information boundary, leading to a decoupled learning objective. We also found that the consistency can be easily extracted by maximizing the transformation invariance and clustering consistency between views. These observations drive us to propose a two-stage framework. In the first stage, we obtain multi-view consistency by training a consistent encoder to produce semantically-consistent representations across views as well as their corresponding pseudo-labels. In the second stage, we disentangle specificity from comprehensive representations by minimizing the upper bound of mutual information between consistent and comprehensive representations. Finally, we reconstruct the original data by concatenating pseudo-labels and view-specific representations. Our experiments on four multi-view datasets demonstrate that our proposed method outperforms 12 comparison methods in terms of clustering and classification performance. The visualization results also show that the extracted consistency and specificity are compact and interpretable.
format text
author KE, Guanzhou
YU, Yang
CHAO, Guoqing
WANG, Xiaoli
XU, Chenyang
HE, Shengfeng
author_facet KE, Guanzhou
YU, Yang
CHAO, Guoqing
WANG, Xiaoli
XU, Chenyang
HE, Shengfeng
author_sort KE, Guanzhou
title Disentangling multi-view representations beyond inductive bias
title_short Disentangling multi-view representations beyond inductive bias
title_full Disentangling multi-view representations beyond inductive bias
title_fullStr Disentangling multi-view representations beyond inductive bias
title_full_unstemmed Disentangling multi-view representations beyond inductive bias
title_sort disentangling multi-view representations beyond inductive bias
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8420
https://ink.library.smu.edu.sg/context/sis_research/article/9423/viewcontent/Disentangling_Multi_view_Representations_Beyond_Inductive_Bias.pdf
_version_ 1787590772535590912