Advocating for the Silent: Enhancing Federated Generalization for Non-Participating Clients

Wu, Zheshun; Xu, Zenglin; Zeng, Dun; Wang, Qifan; Liu, Jie

Computer Science > Machine Learning

arXiv:2310.07171 (cs)

[Submitted on 11 Oct 2023 (v1), last revised 11 Dec 2024 (this version, v7)]

Title:Advocating for the Silent: Enhancing Federated Generalization for Non-Participating Clients

Authors:Zheshun Wu, Zenglin Xu, Dun Zeng, Qifan Wang, Jie Liu

View PDF HTML (experimental)

Abstract:Federated Learning (FL) has surged in prominence due to its capability of collaborative model training without direct data sharing. However, the vast disparity in local data distributions among clients, often termed the Non-Independent Identically Distributed (Non-IID) challenge, poses a significant hurdle to FL's generalization efficacy. The scenario becomes even more complex when not all clients participate in the training process, a common occurrence due to unstable network connections or limited computational capacities. This can greatly complicate the assessment of the trained models' generalization abilities. While a plethora of recent studies has centered on the generalization gap pertaining to unseen data from participating clients with diverse distributions, the distinction between the training distributions of participating clients and the testing distributions of non-participating ones has been largely overlooked. In response, our paper unveils an information-theoretic generalization framework for FL. Specifically, it quantifies generalization errors by evaluating the information entropy of local distributions and discerning discrepancies across these distributions. Inspired by our deduced generalization bounds, we introduce a weighted aggregation approach and a duo of client selection strategies. These innovations are designed to strengthen FL's ability to generalize and thus ensure that trained models perform better on non-participating clients by incorporating a more diverse range of client data distributions. Our extensive empirical evaluations reaffirm the potency of our proposed methods, aligning seamlessly with our theoretical construct.

Comments:	This manuscript is the accepted version for TNNLS
Subjects:	Machine Learning (cs.LG); Information Theory (cs.IT)
Cite as:	arXiv:2310.07171 [cs.LG]
	(or arXiv:2310.07171v7 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.07171

Submission history

From: Zheshun Wu [view email]
[v1] Wed, 11 Oct 2023 03:39:56 UTC (256 KB)
[v2] Thu, 12 Oct 2023 02:27:12 UTC (256 KB)
[v3] Fri, 13 Oct 2023 00:35:33 UTC (256 KB)
[v4] Sun, 3 Mar 2024 07:58:38 UTC (1,230 KB)
[v5] Tue, 30 Jul 2024 05:07:01 UTC (1,032 KB)
[v6] Thu, 10 Oct 2024 13:58:08 UTC (1,229 KB)
[v7] Wed, 11 Dec 2024 01:17:25 UTC (2,462 KB)

Computer Science > Machine Learning

Title:Advocating for the Silent: Enhancing Federated Generalization for Non-Participating Clients

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Advocating for the Silent: Enhancing Federated Generalization for Non-Participating Clients

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators