Semi-supervised federated heterogeneous transfer learning

Federated learning (FL) is a privacy-preserving paradigm that collaboratively train machine learning models with distributed data stored in different silos without exposing sensitive information. Different from most existing FL approaches requiring data from different parties share either the same f...

Full description

Saved in:

Bibliographic Details
Main Authors:	Feng, Siwei, Li, Boyang, Yu, Han, Liu, Yang, Yang, Qiang
Other Authors:	School of Computer Science and Engineering
Format:	Article
Language:	English
Published:	2022
Subjects:	Engineering::Computer science and engineering Federated Transfer Learning Data Privacy Preservation
Online Access:	https://hdl.handle.net/10356/163377
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

Description
Summary:	Federated learning (FL) is a privacy-preserving paradigm that collaboratively train machine learning models with distributed data stored in different silos without exposing sensitive information. Different from most existing FL approaches requiring data from different parties share either the same feature space or sample ID space, federated transfer learning (FTL), which is a recently proposed FL concept, is designed for situations where data from different parties differ not only in samples but also in feature space. However, like most traditional FL approaches, FTL methods also suffer from issues caused by insufficiency of overlapping data. In this paper, we propose a novel FTL framework referred to as Semi-Supervised Federated Heterogeneous Transfer Learning (SFHTL) to leverage on the unlabeled non-overlapping samples to reduce model overfitting as a result of insufficient overlapping training samples in FL scenarios. Unlike existing FTL approaches, SFHTL makes use of non-overlapping samples from all parties to expand the training set for each party to improve local model performance. Through extensive experimental evaluation based on real-world datasets, we demonstrate significant advantages of SFHTL over state-of-the-art approaches.

Semi-supervised federated heterogeneous transfer learning

Similar Items