Disentangling multi-view representations beyond inductive bias
Multi-view (or -modality) representation learning aims to understand the relationships between different view representations. Existing methods disentangle multi-view representations into consistent and view-specific representations by introducing strong inductive biases, which can limit their gener...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8420 https://ink.library.smu.edu.sg/context/sis_research/article/9423/viewcontent/Disentangling_Multi_view_Representations_Beyond_Inductive_Bias.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9423 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-94232024-01-09T03:31:18Z Disentangling multi-view representations beyond inductive bias KE, Guanzhou YU, Yang CHAO, Guoqing WANG, Xiaoli XU, Chenyang HE, Shengfeng Multi-view (or -modality) representation learning aims to understand the relationships between different view representations. Existing methods disentangle multi-view representations into consistent and view-specific representations by introducing strong inductive biases, which can limit their generalization ability. In this paper, we propose a novel multi-view representation disentangling method that aims to go beyond inductive biases, ensuring both interpretability and generalizability of the resulting representations. Our method is based on the observation that discovering multi-view consistency in advance can determine the disentangling information boundary, leading to a decoupled learning objective. We also found that the consistency can be easily extracted by maximizing the transformation invariance and clustering consistency between views. These observations drive us to propose a two-stage framework. In the first stage, we obtain multi-view consistency by training a consistent encoder to produce semantically-consistent representations across views as well as their corresponding pseudo-labels. In the second stage, we disentangle specificity from comprehensive representations by minimizing the upper bound of mutual information between consistent and comprehensive representations. Finally, we reconstruct the original data by concatenating pseudo-labels and view-specific representations. Our experiments on four multi-view datasets demonstrate that our proposed method outperforms 12 comparison methods in terms of clustering and classification performance. The visualization results also show that the extracted consistency and specificity are compact and interpretable. 2023-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8420 info:doi/10.1145/3581783.3611794 https://ink.library.smu.edu.sg/context/sis_research/article/9423/viewcontent/Disentangling_Multi_view_Representations_Beyond_Inductive_Bias.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Multi-view representation learning Disentangled representation Consistency and specificity Databases and Information Systems Graphics and Human Computer Interfaces |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Multi-view representation learning Disentangled representation Consistency and specificity Databases and Information Systems Graphics and Human Computer Interfaces |
spellingShingle |
Multi-view representation learning Disentangled representation Consistency and specificity Databases and Information Systems Graphics and Human Computer Interfaces KE, Guanzhou YU, Yang CHAO, Guoqing WANG, Xiaoli XU, Chenyang HE, Shengfeng Disentangling multi-view representations beyond inductive bias |
description |
Multi-view (or -modality) representation learning aims to understand the relationships between different view representations. Existing methods disentangle multi-view representations into consistent and view-specific representations by introducing strong inductive biases, which can limit their generalization ability. In this paper, we propose a novel multi-view representation disentangling method that aims to go beyond inductive biases, ensuring both interpretability and generalizability of the resulting representations. Our method is based on the observation that discovering multi-view consistency in advance can determine the disentangling information boundary, leading to a decoupled learning objective. We also found that the consistency can be easily extracted by maximizing the transformation invariance and clustering consistency between views. These observations drive us to propose a two-stage framework. In the first stage, we obtain multi-view consistency by training a consistent encoder to produce semantically-consistent representations across views as well as their corresponding pseudo-labels. In the second stage, we disentangle specificity from comprehensive representations by minimizing the upper bound of mutual information between consistent and comprehensive representations. Finally, we reconstruct the original data by concatenating pseudo-labels and view-specific representations. Our experiments on four multi-view datasets demonstrate that our proposed method outperforms 12 comparison methods in terms of clustering and classification performance. The visualization results also show that the extracted consistency and specificity are compact and interpretable. |
format |
text |
author |
KE, Guanzhou YU, Yang CHAO, Guoqing WANG, Xiaoli XU, Chenyang HE, Shengfeng |
author_facet |
KE, Guanzhou YU, Yang CHAO, Guoqing WANG, Xiaoli XU, Chenyang HE, Shengfeng |
author_sort |
KE, Guanzhou |
title |
Disentangling multi-view representations beyond inductive bias |
title_short |
Disentangling multi-view representations beyond inductive bias |
title_full |
Disentangling multi-view representations beyond inductive bias |
title_fullStr |
Disentangling multi-view representations beyond inductive bias |
title_full_unstemmed |
Disentangling multi-view representations beyond inductive bias |
title_sort |
disentangling multi-view representations beyond inductive bias |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2023 |
url |
https://ink.library.smu.edu.sg/sis_research/8420 https://ink.library.smu.edu.sg/context/sis_research/article/9423/viewcontent/Disentangling_Multi_view_Representations_Beyond_Inductive_Bias.pdf |
_version_ |
1787590772535590912 |