dc.contributor.author | Sun, Yuchang | |
dc.contributor.author | Kountouris, Marios | |
dc.contributor.author | Zhang, Jun | |
dc.date.accessioned | 2025-04-07T07:30:04Z | |
dc.date.available | 2025-04-07T07:30:04Z | |
dc.date.issued | 2024-11-28 | |
dc.identifier.citation | Y. Sun, M. Kountouris, and J. Zhang, “How to collaborate: Towards maximizing the generalization performance in cross-silo federated learning,” accepted for publication in IEEE Trans. Mobile Comput. Available: https://doi.org/10.48550/arXiv.2401.13236 | es_ES |
dc.identifier.uri | https://hdl.handle.net/10481/103477 | |
dc.description | The work of J. Zhang was supported by the Hong Kong Research Grants Council under the Areas of Excellence scheme grant AoE/E-601/22-R and the NSFC/RGC Collaborative Research Scheme grant CRS HKUST603/22. The work of M. Kountouris was supported by the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Programme (Grant Agreement No. 101003431). | es_ES |
dc.description.abstract | Federated learning (FL) has attracted considerable attention as a privacy-preserving distributed learning framework. In this work, we focus on cross-silo FL, where clients become the model owners after training and are concerned only with the model’s generalization performance on their local data. Due to data heterogeneity, asking all clients to join a single FL training process may degrade model performance. To investigate the effectiveness of collaboration, we first derive a generalization bound for each client when collaborating with others or when training independently. We show that a client’s generalization performance can be improved by collaborating with other clients that have more training data and similar data distributions. This analysis allows us to formulate a client utility maximization problem that partitions clients into multiple collaborating groups. A hierarchical clustering-based collaborative training (HCCT) scheme is then proposed, which does not require fixing the number of groups in advance. We further analyze the convergence of HCCT for general non-convex loss functions, which reveals the effect of data similarity among clients. Extensive simulations show that HCCT achieves better generalization performance than baseline schemes, and that it degenerates to independent training and conventional FL in specific scenarios. | es_ES |
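To make the abstract's grouping idea concrete, below is a minimal, hypothetical Python sketch: clients are greedily merged into collaboration groups whenever pooling more training data outweighs the distribution mismatch, with no number of groups fixed in advance. The utility proxy (a 1/sqrt(n) sample-size error term minus an average pairwise distribution distance) and all names are illustrative assumptions, not the paper's actual HCCT algorithm or bound.

import numpy as np

def group_utility(group, dists, sizes):
    # Illustrative proxy (not the paper's bound): more pooled data helps
    # (~ -1/sqrt(n) error term), distribution mismatch within the group hurts.
    n_total = sum(sizes[i] for i in group)
    penalty = 0.0
    if len(group) > 1:
        penalty = float(np.mean([dists[i, j] for i in group for j in group if i < j]))
    return -1.0 / np.sqrt(n_total) - penalty

def hierarchical_grouping(dists, sizes):
    # Greedy agglomerative merging: repeatedly merge the pair of groups whose
    # merge most improves the (heuristic) total utility; stop when no merge helps.
    groups = [[i] for i in range(len(sizes))]
    while True:
        best_gain, best_pair = 0.0, None
        for a in range(len(groups)):
            for b in range(a + 1, len(groups)):
                gain = (group_utility(groups[a] + groups[b], dists, sizes)
                        - group_utility(groups[a], dists, sizes)
                        - group_utility(groups[b], dists, sizes))
                if gain > best_gain:
                    best_gain, best_pair = gain, (a, b)
        if best_pair is None:
            return groups
        a, b = best_pair
        merged = groups[a] + groups[b]
        groups = [g for k, g in enumerate(groups) if k not in (a, b)]
        groups.append(merged)

# Toy usage: 6 clients with random label distributions and sample counts.
rng = np.random.default_rng(0)
p = rng.dirichlet(np.ones(3), size=6)           # per-client label distributions
sizes = rng.integers(50, 500, size=6)           # per-client sample counts
dists = np.linalg.norm(p[:, None, :] - p[None, :, :], axis=-1)
print(hierarchical_grouping(dists, sizes))

In this heuristic, clients with similar label distributions and small datasets tend to be grouped together, while a client whose distribution is far from everyone else's may remain alone, mirroring how the scheme can degenerate to independent training or, when all clients are similar, to a single conventional FL group.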
dc.description.sponsorship | Hong Kong Research Grants Council AoE/E-601/22-R | es_ES |
dc.description.sponsorship | NSFC/RGC Collaborative Research Scheme CRS HKUST603/22 | es_ES |
dc.description.sponsorship | European Research Council (ERC) | es_ES |
dc.description.sponsorship | European Union’s Horizon 2020 Research and Innovation Programme No. 101003431 | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | IEEE | es_ES |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Federated Learning | es_ES |
dc.subject | Generalization | es_ES |
dc.subject | Collaboration pattern | es_ES |
dc.subject | Hierarchical clustering | es_ES |
dc.title | How to Collaborate: Towards Maximizing the Generalization Performance in Cross-Silo Federated Learning | es_ES |
dc.type | journal article | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/H2020/101003431 | es_ES |
dc.rights.accessRights | open access | es_ES |
dc.identifier.doi | 10.48550/arXiv.2401.13236 | |
dc.type.hasVersion | SMUR | es_ES |