ProvFL: Client-Driven Interpretability of Global Model Predictions in Federated Learning

Gill, Waris; Anwar, Ali; Gulzar, Muhammad Ali

arXiv:2312.13632v1 (cs)

[Submitted on 21 Dec 2023 (this version), latest version 17 Jan 2025 (v4)]

Authors: Waris Gill (1), Ali Anwar (2), Muhammad Ali Gulzar (1) ((1) Virginia Tech, (2) University of Minnesota Twin Cities)

Abstract: Federated Learning (FL) trains a collaborative machine learning model by aggregating multiple privately trained clients' models over several training rounds. Such a long, continuous sequence of model aggregations makes it difficult to reason about the origin and composition of the resulting global model. Whether the global model performs well or exhibits a fault, understanding its origin is important for debugging, interpretability, and explainability in federated learning. FL application developers often ask: (1) which clients contributed to a global model, and (2) if a global model predicts a label, which clients are responsible for it?
We introduce neuron provenance, a fine-grained lineage-capturing mechanism that tracks the flow of information between the individual participating clients in FL and the final global model. We operationalize this concept in ProvFL, which functions on two key principles. First, recognizing that statically monitoring every neuron of every client's model is ineffective and noisy because individual neurons are uninterpretable, ProvFL dynamically isolates influential and sensitive neurons in the global model, significantly reducing the search space. Second, because multiple clients' models are fused in each round to form a global model, tracking each client's contribution becomes challenging. ProvFL leverages the invertible nature of fusion algorithms to precisely isolate each client's contribution derived from the selected neurons. When asked to localize the clients responsible for a given behavior (i.e., prediction) of the global model, ProvFL successfully localizes them with an average provenance accuracy of 97%. Additionally, ProvFL outperforms the state-of-the-art FL fault localization approach by an average margin of 50%.
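
To make the second principle concrete, the following is a minimal sketch, not the authors' implementation. It assumes the fusion algorithm is sample-weighted averaging (FedAvg-style) over a single linear layer, in which case the global neuron's pre-activation decomposes exactly into one term per client. The names `fedavg`, `matvec`, and `client_contributions` are hypothetical and introduced only for illustration; the abstract does not specify how ProvFL selects neurons or handles nonlinear layers.

```python
from typing import List

Matrix = List[List[float]]


def fedavg(client_weights: List[Matrix], num_samples: List[int]) -> Matrix:
    """Fuse client weight matrices by sample-weighted averaging (assumed fusion rule)."""
    total = sum(num_samples)
    rows, cols = len(client_weights[0]), len(client_weights[0][0])
    return [
        [
            sum(n / total * w[r][c] for w, n in zip(client_weights, num_samples))
            for c in range(cols)
        ]
        for r in range(rows)
    ]


def matvec(w: Matrix, x: List[float]) -> List[float]:
    """Pre-activations of a linear layer: one dot product per output neuron."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def client_contributions(
    client_weights: List[Matrix], num_samples: List[int], x: List[float], neuron: int
) -> List[float]:
    """Split one global neuron's pre-activation into per-client terms.

    Because the averaging is linear, the global output for `neuron` equals
    the sum over clients of (n_k / total) * (W_k x)[neuron].
    """
    total = sum(num_samples)
    return [
        n / total * matvec(w, x)[neuron]
        for w, n in zip(client_weights, num_samples)
    ]


if __name__ == "__main__":
    # Two hypothetical clients, each with a 2x2 layer, holding 1 and 3 samples.
    clients = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 2.0], [2.0, 0.0]]]
    samples = [1, 3]
    x = [1.0, 2.0]

    z = matvec(fedavg(clients, samples), x)
    predicted = z.index(max(z))  # neuron behind the predicted label
    contribs = client_contributions(clients, samples, x, predicted)
    print("global outputs:", z)
    print("per-client contributions to predicted neuron:", contribs)
```

In this toy setting the per-client terms sum back to the global neuron's output, so ranking clients by their term gives a simple attribution for the predicted label. A real model would need this decomposition applied layer by layer and restricted to the influential neurons the abstract describes.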

Comments:	22 pages. For access to the source code used in this study, please contact the authors directly
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Software Engineering (cs.SE)
Cite as:	arXiv:2312.13632 [cs.LG]
	(or arXiv:2312.13632v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2312.13632

Submission history

From: Waris Gill
[v1] Thu, 21 Dec 2023 07:48:54 UTC (807 KB)
[v2] Tue, 13 Aug 2024 17:57:07 UTC (7,377 KB)
[v3] Tue, 12 Nov 2024 00:12:39 UTC (7,378 KB)
[v4] Fri, 17 Jan 2025 06:09:13 UTC (7,403 KB)
