Information-theoretic multi-view domain adaptation: A theoretical and empirical study

Multi-view learning aims to improve classification performance by leveraging the consistency among different views of data. The incorporation of multiple views was paid little attention in the studies of domain adaptation, where the view consistency based on source data is largely violated in the ta...

Full description

Saved in:
Bibliographic Details
Main Authors: YANG, Pei, GAO, Wei
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2014
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4548
https://ink.library.smu.edu.sg/context/sis_research/article/5551/viewcontent/10870_Article_Text_20275_1_10_20180216.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5551
record_format dspace
spelling sg-smu-ink.sis_research-55512019-12-26T09:04:31Z Information-theoretic multi-view domain adaptation: A theoretical and empirical study YANG, Pei GAO, Wei Multi-view learning aims to improve classification performance by leveraging the consistency among different views of data. The incorporation of multiple views was paid little attention in the studies of domain adaptation, where the view consistency based on source data is largely violated in the target domain due to the distribution gap between different domain data. In this paper, we leverage multiple views for cross-domain document classification. The central idea is to strengthen the views' consistency on target data by identifying the associations of domain-specific features from different domains. We present an Information-theoretic Multi-view Adaptation Model (IMAM) using a multi-way clustering scheme, where word and link clusters can draw together seemingly unrelated features across domains, which boosts the consistency between document clusterings that are based on the respective word and link views. Moreover, we demonstrate that IMAM can always find the document clustering with the minimal disagreement rate to the overlap of view-based clusterings. We provide both theoretical and empirical justifications of the proposed method. Our experiments show that IMAM significantly outperforms traditional multi-view algorithm co-training, the co-training-based adaptation algorithm CODA, the single-view transfer model CoCC and the large-margin-based multi-view transfer model MVTL-LM. 2014-03-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4548 info:doi/10.1613/jair.4190 https://ink.library.smu.edu.sg/context/sis_research/article/5551/viewcontent/10870_Article_Text_20275_1_10_20180216.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
spellingShingle Databases and Information Systems
YANG, Pei
GAO, Wei
Information-theoretic multi-view domain adaptation: A theoretical and empirical study
description Multi-view learning aims to improve classification performance by leveraging the consistency among different views of data. The incorporation of multiple views was paid little attention in the studies of domain adaptation, where the view consistency based on source data is largely violated in the target domain due to the distribution gap between different domain data. In this paper, we leverage multiple views for cross-domain document classification. The central idea is to strengthen the views' consistency on target data by identifying the associations of domain-specific features from different domains. We present an Information-theoretic Multi-view Adaptation Model (IMAM) using a multi-way clustering scheme, where word and link clusters can draw together seemingly unrelated features across domains, which boosts the consistency between document clusterings that are based on the respective word and link views. Moreover, we demonstrate that IMAM can always find the document clustering with the minimal disagreement rate to the overlap of view-based clusterings. We provide both theoretical and empirical justifications of the proposed method. Our experiments show that IMAM significantly outperforms traditional multi-view algorithm co-training, the co-training-based adaptation algorithm CODA, the single-view transfer model CoCC and the large-margin-based multi-view transfer model MVTL-LM.
format text
author YANG, Pei
GAO, Wei
author_facet YANG, Pei
GAO, Wei
author_sort YANG, Pei
title Information-theoretic multi-view domain adaptation: A theoretical and empirical study
title_short Information-theoretic multi-view domain adaptation: A theoretical and empirical study
title_full Information-theoretic multi-view domain adaptation: A theoretical and empirical study
title_fullStr Information-theoretic multi-view domain adaptation: A theoretical and empirical study
title_full_unstemmed Information-theoretic multi-view domain adaptation: A theoretical and empirical study
title_sort information-theoretic multi-view domain adaptation: a theoretical and empirical study
publisher Institutional Knowledge at Singapore Management University
publishDate 2014
url https://ink.library.smu.edu.sg/sis_research/4548
https://ink.library.smu.edu.sg/context/sis_research/article/5551/viewcontent/10870_Article_Text_20275_1_10_20180216.pdf
_version_ 1770574911088099328