Is aggregation the only choice? Federated learning via layer-wise model recombination

Although Federated Learning (FL) enables global model training Xiaofei Xie xfxie@smu.edu.sg Singapore Management University Singapore, Singapore Xian Wei xwei@sei.ecnu.edu.cn East China Normal University Shanghai, China Mingsong Chen∗ mschen@sei.ecnu.edu.cn East China Normal University Shanghai, Chi...

Full description

Saved in:

Bibliographic Details
Main Authors:	HU, Ming, YUE, Zhihao, XIE, Xiaofei, CHEN, Cheng Chen
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2024
Subjects:	Federated learning Model recombination Non-IID Generalization Artificial Intelligence and Robotics Databases and Information Systems
Online Access:	https://ink.library.smu.edu.sg/sis_research/9507 https://ink.library.smu.edu.sg/context/sis_research/article/10507/viewcontent/Is_Aggregation_the_Only_Choice__Federated_Learning_via_Layer_wise_Model_Recombination.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-10507
record_format	dspace
spelling	sg-smu-ink.sis_research-105072024-11-15T07:44:48Z Is aggregation the only choice? Federated learning via layer-wise model recombination HU, Ming YUE, Zhihao XIE, Xiaofei CHEN, Cheng Chen Although Federated Learning (FL) enables global model training Xiaofei Xie xfxie@smu.edu.sg Singapore Management University Singapore, Singapore Xian Wei xwei@sei.ecnu.edu.cn East China Normal University Shanghai, China Mingsong Chen∗ mschen@sei.ecnu.edu.cn East China Normal University Shanghai, China • Computing methodologies → Distributed artificial intelligence. across clients without compromising their raw data, due to the unevenly distributed data among clients, existing Federated Averaging (FedAvg)-based methods suffer from the problem of low inference performance. Specifically, different data distributions among clients lead to various optimization directions of local models. Aggregating local models usually results in a low-generalized global model, which performs worse on most of the clients. To address the above issue, inspired by the observation from a geometric perspective that a well-generalized solution is located in a flat area rather than a sharp area, we propose a novel and heuristic FL paradigm named FedMR (Federated Model Recombination). The goal of FedMR is to guide the recombined models to be trained towards a flat area. Unlike conventional FedAvg-based methods, in FedMR, the cloud server recombines collected local models by shuffling each layer of them to generate multiple recombined models for local training on clients rather than an aggregated global model. Since the area of the f lat area is larger than the sharp area, when local models are located in different areas, recombined models have a higher probability of locating in a flat area. When all recombined models are located in the same flat area, they are optimized towards the same direction. Wetheoretically analyze the convergence of model recombination. Experimental results show that, compared with state-of-the-art FL methods, FedMR can significantly improve the inference accuracy without exposing the privacy of each client. 2024-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9507 info:doi/10.1145/3637528.3671722 https://ink.library.smu.edu.sg/context/sis_research/article/10507/viewcontent/Is_Aggregation_the_Only_Choice__Federated_Learning_via_Layer_wise_Model_Recombination.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Federated learning Model recombination Non-IID Generalization Artificial Intelligence and Robotics Databases and Information Systems
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Federated learning Model recombination Non-IID Generalization Artificial Intelligence and Robotics Databases and Information Systems
spellingShingle	Federated learning Model recombination Non-IID Generalization Artificial Intelligence and Robotics Databases and Information Systems HU, Ming YUE, Zhihao XIE, Xiaofei CHEN, Cheng Chen Is aggregation the only choice? Federated learning via layer-wise model recombination
description	Although Federated Learning (FL) enables global model training Xiaofei Xie xfxie@smu.edu.sg Singapore Management University Singapore, Singapore Xian Wei xwei@sei.ecnu.edu.cn East China Normal University Shanghai, China Mingsong Chen∗ mschen@sei.ecnu.edu.cn East China Normal University Shanghai, China • Computing methodologies → Distributed artificial intelligence. across clients without compromising their raw data, due to the unevenly distributed data among clients, existing Federated Averaging (FedAvg)-based methods suffer from the problem of low inference performance. Specifically, different data distributions among clients lead to various optimization directions of local models. Aggregating local models usually results in a low-generalized global model, which performs worse on most of the clients. To address the above issue, inspired by the observation from a geometric perspective that a well-generalized solution is located in a flat area rather than a sharp area, we propose a novel and heuristic FL paradigm named FedMR (Federated Model Recombination). The goal of FedMR is to guide the recombined models to be trained towards a flat area. Unlike conventional FedAvg-based methods, in FedMR, the cloud server recombines collected local models by shuffling each layer of them to generate multiple recombined models for local training on clients rather than an aggregated global model. Since the area of the f lat area is larger than the sharp area, when local models are located in different areas, recombined models have a higher probability of locating in a flat area. When all recombined models are located in the same flat area, they are optimized towards the same direction. Wetheoretically analyze the convergence of model recombination. Experimental results show that, compared with state-of-the-art FL methods, FedMR can significantly improve the inference accuracy without exposing the privacy of each client.
format	text
author	HU, Ming YUE, Zhihao XIE, Xiaofei CHEN, Cheng Chen
author_facet	HU, Ming YUE, Zhihao XIE, Xiaofei CHEN, Cheng Chen
author_sort	HU, Ming
title	Is aggregation the only choice? Federated learning via layer-wise model recombination
title_short	Is aggregation the only choice? Federated learning via layer-wise model recombination
title_full	Is aggregation the only choice? Federated learning via layer-wise model recombination
title_fullStr	Is aggregation the only choice? Federated learning via layer-wise model recombination
title_full_unstemmed	Is aggregation the only choice? Federated learning via layer-wise model recombination
title_sort	is aggregation the only choice? federated learning via layer-wise model recombination
publisher	Institutional Knowledge at Singapore Management University
publishDate	2024
url	https://ink.library.smu.edu.sg/sis_research/9507 https://ink.library.smu.edu.sg/context/sis_research/article/10507/viewcontent/Is_Aggregation_the_Only_Choice__Federated_Learning_via_Layer_wise_Model_Recombination.pdf
_version_	1816859116041142272

Is aggregation the only choice? Federated learning via layer-wise model recombination

Similar Items