Combating Negative Transfer From Predictive Distribution Differences

Domain adaptation (DA), which leverages labeled data from related source domains, comes in handy when the label information of the target domain is scarce or unavailable. However, as the source data do not come from the same origin as that of the target domain, the predictive distributions of the so...


Bibliographic Details
Main Authors: Seah, Chun-Wei, Ong, Yew-Soon, Tsang, Ivor W.
Other Authors: School of Computer Engineering
Format: Article
Language: English
Published: 2016
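Subjects: Domain adaptation (DA); logistic regression (LR); negative transfer; predictive distribution matching (PDM); support vector machines (SVMs)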
Online Access: https://hdl.handle.net/10356/81712
http://hdl.handle.net/10220/39657
Institution: Nanyang Technological University
id sg-ntu-dr.10356-81712
record_format dspace
spelling sg-ntu-dr.10356-817122020-05-28T07:17:22Z Combating Negative Transfer From Predictive Distribution Differences Seah, Chun-Wei Ong, Yew-Soon Tsang, Ivor W. School of Computer Engineering Domain adaptation (DA); logistic regression (LR); negative transfer; predictive distribution matching (PDM); support vector machines (SVMs) Domain adaptation (DA), which leverages labeled data from related source domains, comes in handy when the label information of the target domain is scarce or unavailable. However, as the source data do not come from the same origin as that of the target domain, the predictive distributions of the source and target domains are likely to differ in reality. At the extreme, the predictive distributions of the source domains can differ completely from that of the target domain. In such case, using the learned source classifier to assist in the prediction of target data can result in prediction performance that is poorer than that with the omission of the source data. This phenomenon is established as negative transfer with impact known to be more severe in the multiclass context. To combat negative transfer due to differing predictive distributions across domains, we first introduce the notion of positive transferability for the assessment of synergy between the source and target domains in their prediction models, and we also propose a criterion to measure the positive transferability between sample pairs of different domains in terms of their prediction distributions. With the new measure, a predictive distribution matching (PDM) regularizer and a PDM framework learn the target classifier by favoring source data with large positive transferability while inferring the labels of target unlabeled data. 
Extensive experiments are conducted to validate the performance efficacy of the proposed PDM framework using several commonly used multidomain benchmark data sets, including Sentiment, Reuters, and Newsgroup, in the context of both binary-class and multiclass domains. Subsequently, the PDM framework is put to work on a real-world scenario pertaining to water cluster molecule identification. The experimental results illustrate the adverse impact of negative transfer on several state-of-the-art DA methods, whereas the proposed framework exhibits excellent and robust predictive performances. ASTAR (Agency for Sci., Tech. and Research, S’pore) Accepted version 2016-01-11T08:43:01Z 2019-12-06T14:36:41Z 2016-01-11T08:43:01Z 2019-12-06T14:36:41Z 2012 Journal Article Seah, C.-W., Ong, Y.-S., & Tsang, I. W. (2013). Combating Negative Transfer From Predictive Distribution Differences. IEEE Transactions on Cybernetics, 43(4), 1153-1165. 2168-2267 https://hdl.handle.net/10356/81712 http://hdl.handle.net/10220/39657 10.1109/TSMCB.2012.2225102 en IEEE Transactions on Cybernetics © 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/TSMCB.2012.2225102]. 13 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Domain adaptation (DA); logistic regression (LR); negative transfer; predictive distribution matching (PDM); support vector machines (SVMs)
description Domain adaptation (DA), which leverages labeled data from related source domains, comes in handy when the label information of the target domain is scarce or unavailable. However, as the source data do not come from the same origin as that of the target domain, the predictive distributions of the source and target domains are likely to differ in reality. At the extreme, the predictive distributions of the source domains can differ completely from that of the target domain. In such case, using the learned source classifier to assist in the prediction of target data can result in prediction performance that is poorer than that with the omission of the source data. This phenomenon is established as negative transfer with impact known to be more severe in the multiclass context. To combat negative transfer due to differing predictive distributions across domains, we first introduce the notion of positive transferability for the assessment of synergy between the source and target domains in their prediction models, and we also propose a criterion to measure the positive transferability between sample pairs of different domains in terms of their prediction distributions. With the new measure, a predictive distribution matching (PDM) regularizer and a PDM framework learn the target classifier by favoring source data with large positive transferability while inferring the labels of target unlabeled data. Extensive experiments are conducted to validate the performance efficacy of the proposed PDM framework using several commonly used multidomain benchmark data sets, including Sentiment, Reuters, and Newsgroup, in the context of both binary-class and multiclass domains. Subsequently, the PDM framework is put to work on a real-world scenario pertaining to water cluster molecule identification. 
The experimental results illustrate the adverse impact of negative transfer on several state-of-the-art DA methods, whereas the proposed framework exhibits excellent and robust predictive performances.
author2 School of Computer Engineering
format Article
author Seah, Chun-Wei
Ong, Yew-Soon
Tsang, Ivor W.
author_sort Seah, Chun-Wei
title Combating Negative Transfer From Predictive Distribution Differences