Skip to main content

Showing 1–50 of 107 results for author: Lane, N D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.03004  [pdf, ps, other

    cs.CL cs.MA

    CLUES: Collaborative High-Quality Data Selection for LLMs via Training Dynamics

    Authors: Wanru Zhao, Hongxiang Fan, Shell Xu Hu, Wangchunshu Zhou, Bofan Chen, Nicholas D. Lane

    Abstract: Recent research has highlighted the importance of data quality in scaling large language models (LLMs). However, automated data quality control faces unique challenges in collaborative settings where sharing is not allowed directly between data silos. To tackle this issue, this paper proposes a novel data quality control technique based on the notion of data influence on the training dynamics of L… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: NeurIPS 2024

  2. arXiv:2507.03003  [pdf, ps, other

    cs.CL

    Breaking Physical and Linguistic Borders: Multilingual Federated Prompt Tuning for Low-Resource Languages

    Authors: Wanru Zhao, Yihong Chen, Royson Lee, Xinchi Qiu, Yan Gao, Hongxiang Fan, Nicholas D. Lane

    Abstract: Pre-trained large language models (LLMs) have become a cornerstone of modern natural language processing, with their capabilities extending across a wide range of applications and languages. However, the fine-tuning of multilingual LLMs, especially for low-resource languages, faces significant challenges arising from data-sharing restrictions (the physical border) and inherent linguistic differenc… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: ICLR 2024

  3. arXiv:2506.14387  [pdf, ps, other

    cs.AI

    Don't Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning

    Authors: William F. Shen, Xinchi Qiu, Nicola Cancedda, Nicholas D. Lane

    Abstract: Existing work on mitigating catastrophic forgetting in large language model (LLM) fine-tuning has primarily focused on preserving specific data or tasks, while critically overlooking the degradation of essential capabilities instilled through safety alignment, particularly the model's ability to faithfully express ignorance. In this work, we show that this capability is significantly degraded duri… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

  4. arXiv:2506.04203  [pdf, ps, other

    cs.DC

    Cascadia: A Cascade Serving System for Large Language Models

    Authors: Youhe Jiang, Fangcheng Fu, Wanru Zhao, Stephan Rabanser, Nicholas D. Lane, Binhang Yuan

    Abstract: Recent advances in large language models (LLMs) have intensified the need to deliver both rapid responses and high-quality answers. More powerful models yield better results but incur higher inference latency, whereas smaller models are faster yet less capable. Recent work proposes balancing this latency-quality trade-off using model cascades, which route simpler queries to smaller models and more… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  5. arXiv:2506.02961  [pdf, ps, other

    cs.CL

    FlowerTune: A Cross-Domain Benchmark for Federated Fine-Tuning of Large Language Models

    Authors: Yan Gao, Massimo Roberto Scamarcia, Javier Fernandez-Marques, Mohammad Naseri, Chong Shen Ng, Dimitris Stripelis, Zexi Li, Tao Shen, Jiamu Bai, Daoyuan Chen, Zikai Zhang, Rui Hu, InSeo Song, Lee KangYoon, Hong Jia, Ting Dang, Junyan Wang, Zheyuan Liu, Daniel Janes Beutel, Lingjuan Lyu, Nicholas D. Lane

    Abstract: Large Language Models (LLMs) have achieved state-of-the-art results across diverse domains, yet their development remains reliant on vast amounts of publicly available data, raising concerns about data scarcity and the lack of access to domain-specific, sensitive information. Federated Learning (FL) presents a compelling framework to address these challenges by enabling decentralized fine-tuning o… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  6. arXiv:2505.22549  [pdf, other

    cs.LG

    DES-LOC: Desynced Low Communication Adaptive Optimizers for Training Foundation Models

    Authors: Alex Iacob, Lorenzo Sani, Mher Safaryan, Paris Giampouras, Samuel Horváth, Andrej Jovanovic, Meghdad Kurmanji, Preslav Aleksandrov, William F. Shen, Xinchi Qiu, Nicholas D. Lane

    Abstract: Scaling foundation model training with Distributed Data Parallel (DDP) methods is bandwidth-limited. Existing infrequent communication methods like Local SGD were designed to synchronize only model parameters and cannot be trivially applied to adaptive optimizers due to additional optimizer states. Current approaches extending Local SGD either lack convergence guarantees or require synchronizing a… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: Keywords: Distributed Training, Foundation Models, Large Language Models, Optimizers, Communication Efficiency, Federated Learning, Distributed Systems, Optimization Theory, Scaling, Robustness. Preprint, under review at NeurIPS

  7. arXiv:2505.19855  [pdf, ps, other

    cs.LG

    Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning?

    Authors: Zexi Li, Xiangzhu Wang, William F. Shen, Meghdad Kurmanji, Xinchi Qiu, Dongqi Cai, Chao Wu, Nicholas D. Lane

    Abstract: Large language Model (LLM) unlearning, i.e., selectively removing information from LLMs, is vital for responsible model deployment. Differently, LLM knowledge editing aims to modify LLM knowledge instead of removing it. Though editing and unlearning seem to be two distinct tasks, we find there is a tight connection between them. In this paper, we conceptualize unlearning as a special case of editi… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Preprint

  8. arXiv:2504.05153  [pdf, other

    cs.LG

    SparsyFed: Sparse Adaptive Federated Training

    Authors: Adriano Guastella, Lorenzo Sani, Alex Iacob, Alessio Mora, Paolo Bellavista, Nicholas D. Lane

    Abstract: Sparse training is often adopted in cross-device federated learning (FL) environments where constrained devices collaboratively train a machine learning model on private data by exchanging pseudo-gradients across heterogeneous networks. Although sparse training methods can reduce communication overhead and computational burden in FL, they are often not used in practice for the following key reason… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: Published as a conference paper at ICLR 2025

  9. arXiv:2502.12430  [pdf, ps, other

    cs.LG cs.AI

    Bridge the Gaps between Machine Unlearning and AI Regulation

    Authors: Bill Marino, Meghdad Kurmanji, Nicholas D. Lane

    Abstract: The "right to be forgotten" and the data privacy laws that encode it have motivated machine unlearning since its earliest days. Now, an inbound wave of artificial intelligence regulations - like the European Union's Artificial Intelligence Act (AIA) - potentially offer important new use cases for machine unlearning. However, this position paper argues, this opportunity will only be realized if res… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  10. arXiv:2502.07218  [pdf, other

    cs.LG cs.AI

    LUNAR: LLM Unlearning via Neural Activation Redirection

    Authors: William F. Shen, Xinchi Qiu, Meghdad Kurmanji, Alex Iacob, Lorenzo Sani, Yihong Chen, Nicola Cancedda, Nicholas D. Lane

    Abstract: Large Language Models (LLMs) benefit from training on ever larger amounts of textual data, but as a result, they increasingly incur the risk of leaking private information. The ability to selectively remove knowledge from LLMs is, therefore, a highly desirable capability. In this paper, we propose LUNAR, a novel unlearning methodology grounded in the Linear Representation Hypothesis. LUNAR operate… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  11. arXiv:2501.04000  [pdf, other

    cs.LG cs.HC

    A Survey on Federated Learning in Human Sensing

    Authors: Mohan Li, Martin Gjoreski, Pietro Barbiero, Gašper Slapničar, Mitja Luštrek, Nicholas D. Lane, Marc Langheinrich

    Abstract: Human Sensing, a field that leverages technology to monitor human activities, psycho-physiological states, and interactions with the environment, enhances our understanding of human behavior and drives the development of advanced services that improve overall quality of life. However, its reliance on detailed and often privacy-sensitive data as the basis for its machine learning (ML) models raises… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  12. arXiv:2411.17831  [pdf, other

    cs.LG cs.CV cs.DC

    Rapid Distributed Fine-tuning of a Segmentation Model Onboard Satellites

    Authors: Meghan Plumridge, Rasmus Maråk, Chiara Ceccobello, Pablo Gómez, Gabriele Meoni, Filip Svoboda, Nicholas D. Lane

    Abstract: Segmentation of Earth observation (EO) satellite data is critical for natural hazard analysis and disaster response. However, processing EO data at ground stations introduces delays due to data transmission bottlenecks and communication windows. Using segmentation models capable of near-real-time data analysis onboard satellites can therefore improve response times. This study presents a proof-of-… ▽ More

    Submitted 26 November, 2024; originally announced November 2024.

    Comments: Accepted at the Sixth IEEE International Conference on Image Processing Applications and Systems (IPAS) 2025

  13. arXiv:2411.02908  [pdf, other

    cs.LG cs.DC

    Photon: Federated LLM Pre-Training

    Authors: Lorenzo Sani, Alex Iacob, Zeyu Cao, Royson Lee, Bill Marino, Yan Gao, Dongqi Cai, Zexi Li, Wanru Zhao, Xinchi Qiu, Nicholas D. Lane

    Abstract: Scaling large language models (LLMs) demands extensive data and computing resources, which are traditionally constrained to data centers by the high-bandwidth requirements of distributed training. Low-bandwidth methods like federated learning (FL) could enable collaborative training of larger models across weakly-connected GPUs if they can effectively be used for pre-training. To achieve this, we… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: 13 pages, 9 appendix pages, 10 figures, 3 algorithms, 8 tables

  14. arXiv:2410.05021  [pdf, other

    cs.LG cs.CL

    DEPT: Decoupled Embeddings for Pre-training Language Models

    Authors: Alex Iacob, Lorenzo Sani, Meghdad Kurmanji, William F. Shen, Xinchi Qiu, Dongqi Cai, Yan Gao, Nicholas D. Lane

    Abstract: Language Model pre-training uses broad data mixtures to enhance performance across domains and languages. However, training on such heterogeneous text corpora requires extensive and expensive efforts. Since these data sources vary significantly in lexical, syntactic, and semantic aspects, they cause negative interference or the ``curse of multilinguality''. To address these challenges we propose a… ▽ More

    Submitted 7 April, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: Published as a conference paper at ICLR 2025

  15. arXiv:2409.15790  [pdf, other

    cs.CL cs.AI cs.LG

    Small Language Models: Survey, Measurements, and Insights

    Authors: Zhenyan Lu, Xiang Li, Dongqi Cai, Rongjie Yi, Fangming Liu, Xiwen Zhang, Nicholas D. Lane, Mengwei Xu

    Abstract: Small language models (SLMs), despite their widespread adoption in modern smart devices, have received significantly less academic attention compared to their large language model (LLM) counterparts, which are predominantly deployed in data centers and cloud environments. While researchers continue to improve the capabilities of LLMs in the pursuit of artificial general intelligence, SLM research… ▽ More

    Submitted 26 February, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

  16. arXiv:2407.00031  [pdf, other

    cs.DC cs.SE

    Supercharging Federated Learning with Flower and NVIDIA FLARE

    Authors: Holger R. Roth, Daniel J. Beutel, Yan Cheng, Javier Fernandez Marques, Heng Pan, Chester Chen, Zhihong Zhang, Yuhong Wen, Sean Yang, Isaac, Yang, Yuan-Ting Hsieh, Ziyue Xu, Daguang Xu, Nicholas D. Lane, Andrew Feng

    Abstract: Several open-source systems, such as Flower and NVIDIA FLARE, have been developed in recent years while focusing on different aspects of federated learning (FL). Flower is dedicated to implementing a cohesive approach to FL, analytics, and evaluation. Over time, Flower has cultivated extensive strategies and algorithms tailored for FL application development, fostering a vibrant FL community in re… ▽ More

    Submitted 22 July, 2024; v1 submitted 21 May, 2024; originally announced July 2024.

    Comments: Added a figure comparing running a Flower application natively or within FLARE

  17. arXiv:2406.16810  [pdf, other

    cs.LG cs.AI cs.CL

    How Data Inter-connectivity Shapes LLMs Unlearning: A Structural Unlearning Perspective

    Authors: Xinchi Qiu, William F. Shen, Yihong Chen, Meghdad Kurmanji, Nicola Cancedda, Pontus Stenetorp, Nicholas D. Lane

    Abstract: While unlearning knowledge from large language models (LLMs) is receiving increasing attention, one important aspect remains unexplored. Existing approaches and benchmarks assume data points to-be-forgotten are independent, ignoring their inter-connectivity - a fundamental characteristic of real-world data structures. In this paper, we propose PISTOL, a method for compiling structural datasets. PI… ▽ More

    Submitted 10 March, 2025; v1 submitted 24 June, 2024; originally announced June 2024.

  18. arXiv:2406.14758  [pdf, ps, other

    cs.AI

    Compliance Cards: Automated EU AI Act Compliance Analyses amidst a Complex AI Supply Chain

    Authors: Bill Marino, Yaqub Chaudhary, Yulu Pi, Rui-Jie Yew, Preslav Aleksandrov, Carwyn Rahman, William F. Shen, Isaac Robinson, Nicholas D. Lane

    Abstract: As the AI supply chain grows more complex, AI systems and models are increasingly likely to incorporate multiple internally- or externally-sourced components such as datasets and (pre-trained) models. In such cases, determining whether or not the aggregate AI system or model complies with the EU AI Act (AIA) requires a multi-step process in which compliance-related information about both the AI sy… ▽ More

    Submitted 12 September, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  19. arXiv:2405.20882  [pdf, other

    cs.LG

    Sheaf HyperNetworks for Personalized Federated Learning

    Authors: Bao Nguyen, Lorenzo Sani, Xinchi Qiu, Pietro Liò, Nicholas D. Lane

    Abstract: Graph hypernetworks (GHNs), constructed by combining graph neural networks (GNNs) with hypernetworks (HNs), leverage relational data across various domains such as neural architecture search, molecular property prediction and federated learning. Despite GNNs and HNs being individually successful, we show that GHNs present problems compromising their performance, such as over-smoothing and heteroph… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 25 pages, 12 figures, 7 tables, pre-print under review

  20. arXiv:2405.14791  [pdf, other

    cs.LG cs.CV cs.DC

    Recurrent Early Exits for Federated Learning with Heterogeneous Clients

    Authors: Royson Lee, Javier Fernandez-Marques, Shell Xu Hu, Da Li, Stefanos Laskaridis, Łukasz Dudziak, Timothy Hospedales, Ferenc Huszár, Nicholas D. Lane

    Abstract: Federated learning (FL) has enabled distributed learning of a model across multiple clients in a privacy-preserving manner. One of the main challenges of FL is to accommodate clients with varying hardware capacities; clients have differing compute and memory requirements. To tackle this challenge, recent state-of-the-art approaches leverage the use of early exits. Nonetheless, these approaches fal… ▽ More

    Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted at the 41st International Conference on Machine Learning (ICML 2024)

  21. arXiv:2405.14446  [pdf, other

    cs.LG cs.AI cs.CL cs.DC

    Worldwide Federated Training of Language Models

    Authors: Alex Iacob, Lorenzo Sani, Bill Marino, Preslav Aleksandrov, William F. Shen, Nicholas Donald Lane

    Abstract: The reliance of language model training on massive amounts of computation and vast datasets scraped from potentially low-quality, copyrighted, or sensitive data has come into question practically, legally, and ethically. Federated learning provides a plausible alternative by enabling previously untapped data to be voluntarily gathered from collaborating organizations. However, when scaled globally… ▽ More

    Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 19 pages, 8 figures, Under Review

    ACM Class: I.2.7

  22. arXiv:2405.10853  [pdf, other

    cs.LG cs.AI cs.DC

    The Future of Large Language Model Pre-training is Federated

    Authors: Lorenzo Sani, Alex Iacob, Zeyu Cao, Bill Marino, Yan Gao, Tomas Paulik, Wanru Zhao, William F. Shen, Preslav Aleksandrov, Xinchi Qiu, Nicholas D. Lane

    Abstract: Generative pre-trained large language models (LLMs) have demonstrated impressive performance over a wide range of tasks, thanks to the unprecedented amount of data they have been trained on. As established scaling laws indicate, LLMs' future performance improvement depends on the amount of computing and data sources they can leverage for pre-training. Federated learning (FL) has the potential to u… ▽ More

    Submitted 14 October, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: 24 pages, 15 figures, pre-print

  23. arXiv:2404.16891  [pdf, other

    cs.CR cs.AI cs.CL cs.CY

    Attacks on Third-Party APIs of Large Language Models

    Authors: Wanru Zhao, Vidit Khazanchi, Haodi Xing, Xuanli He, Qiongkai Xu, Nicholas Donald Lane

    Abstract: Large language model (LLM) services have recently begun offering a plugin ecosystem to interact with third-party API services. This innovation enhances the capabilities of LLMs, but it also introduces risks, as these plugins developed by various third parties cannot be easily trusted. This paper proposes a new attacking framework to examine security and safety vulnerabilities within LLM platforms… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: ICLR 2024 Workshop on Secure and Trustworthy Large Language Models

  24. arXiv:2404.00411  [pdf, other

    physics.ao-ph cs.LG

    Aardvark weather: end-to-end data-driven weather forecasting

    Authors: Anna Vaughan, Stratis Markou, Will Tebbutt, James Requeima, Wessel P. Bruinsma, Tom R. Andersson, Michael Herzog, Nicholas D. Lane, Matthew Chantry, J. Scott Hosking, Richard E. Turner

    Abstract: Weather forecasting is critical for a range of human activities including transportation, agriculture, industry, as well as the safety of the general public. Machine learning models have the potential to transform the complex weather prediction pipeline, but current approaches still rely on numerical weather prediction (NWP) systems, limiting forecast speed and accuracy. Here we demonstrate that a… ▽ More

    Submitted 13 July, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

  25. arXiv:2403.04529  [pdf, other

    cs.LG cs.AI cs.DC

    Enhancing Data Quality in Federated Fine-Tuning of Foundation Models

    Authors: Wanru Zhao, Yaxin Du, Nicholas Donald Lane, Siheng Chen, Yanfeng Wang

    Abstract: In the current landscape of foundation model training, there is a significant reliance on public domain data, which is nearing exhaustion according to recent research. To further scale up, it is crucial to incorporate collaboration among multiple specialized and high-quality private domain data sources. However, the challenge of training models locally without sharing private data presents numerou… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted at ICLR 2024 Workshop on Navigating and Addressing Data Problems for Foundation Models (DPFM)

  26. arXiv:2402.18949  [pdf, other

    cs.LG

    FedGuCci: Making Local Models More Connected in Landscape for Federated Learning

    Authors: Zexi Li, Jie Lin, Zhiqi Li, Didi Zhu, Tao Shen, Tao Lin, Chao Wu, Nicholas D. Lane

    Abstract: Federated learning (FL) involves multiple heterogeneous clients collaboratively training a global model via iterative local updates and model fusion. The generalization of FL's global model has a large gap compared with centralized training, which is its bottleneck for broader applications. In this paper, we study and improve FL's generalization through a fundamental ``connectivity'' perspective,… ▽ More

    Submitted 25 May, 2025; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: SIGKDD 2025

  27. arXiv:2402.10191  [pdf, other

    cs.LG

    FedAnchor: Enhancing Federated Semi-Supervised Learning with Label Contrastive Loss for Unlabeled Clients

    Authors: Xinchi Qiu, Yan Gao, Lorenzo Sani, Heng Pan, Wanru Zhao, Pedro P. B. Gusmao, Mina Alibeigi, Alex Iacob, Nicholas D. Lane

    Abstract: Federated learning (FL) is a distributed learning paradigm that facilitates collaborative training of a shared global model across devices while keeping data localized. The deployment of FL in numerous real-world applications faces delays, primarily due to the prevalent reliance on supervised tasks. Generating detailed labels at edge devices, if feasible, is demanding, given resource constraints a… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  28. arXiv:2402.05968  [pdf, other

    cs.LG cs.AI cs.CY cs.DC

    Federated Learning Priorities Under the European Union Artificial Intelligence Act

    Authors: Herbert Woisetschläger, Alexander Erben, Bill Marino, Shiqiang Wang, Nicholas D. Lane, Ruben Mayer, Hans-Arno Jacobsen

    Abstract: The age of AI regulation is upon us, with the European Union Artificial Intelligence Act (AI Act) leading the way. Our key inquiry is how this will affect Federated Learning (FL), whose starting point of prioritizing data privacy while performing ML fundamentally differs from that of centralized learning. We believe the AI Act and future regulations could be the missing catalyst that pushes FL tow… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    ACM Class: I.2; I.2.11; K.5

  29. arXiv:2311.18451  [pdf, other

    cs.LG

    How Much Is Hidden in the NAS Benchmarks? Few-Shot Adaptation of a NAS Predictor

    Authors: Hrushikesh Loya, Łukasz Dudziak, Abhinav Mehrotra, Royson Lee, Javier Fernandez-Marques, Nicholas D. Lane, Hongkai Wen

    Abstract: Neural architecture search has proven to be a powerful approach to designing and refining neural networks, often boosting their performance and efficiency over manually-designed variations, but comes with computational overhead. While there has been a considerable amount of research focused on lowering the cost of NAS for mainstream tasks, such as image classification, a lot of those improvements… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  30. arXiv:2310.11096  [pdf, other

    cs.DC cs.AR cs.LG

    Sparse-DySta: Sparsity-Aware Dynamic and Static Scheduling for Sparse Multi-DNN Workloads

    Authors: Hongxiang Fan, Stylianos I. Venieris, Alexandros Kouris, Nicholas D. Lane

    Abstract: Running multiple deep neural networks (DNNs) in parallel has become an emerging workload in both edge devices, such as mobile phones where multiple tasks serve a single user for daily activities, and data centers, where various requests are raised from millions of users, as seen with large language models. To reduce the costly computational and memory requirements of these workloads, various effic… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Paper accepted by MICRO'23

  31. arXiv:2310.02420  [pdf, other

    cs.LG cs.CV cs.DC

    FedL2P: Federated Learning to Personalize

    Authors: Royson Lee, Minyoung Kim, Da Li, Xinchi Qiu, Timothy Hospedales, Ferenc Huszár, Nicholas D. Lane

    Abstract: Federated learning (FL) research has made progress in developing algorithms for distributed learning of global models, as well as algorithms for local personalization of those common models to the specifics of each client's local data distribution. However, different FL problems may require different personalization strategies, and it may not even be possible to define an effective one-size-fits-a… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  32. arXiv:2307.13412  [pdf, other

    cs.LG cs.AR cs.CV

    Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation

    Authors: Stylianos I. Venieris, Javier Fernandez-Marques, Nicholas D. Lane

    Abstract: The unprecedented accuracy of convolutional neural networks (CNNs) across a broad range of AI tasks has led to their widespread deployment in mobile and embedded settings. In a pursuit for high-performance and energy-efficient inference, significant research effort has been invested in the design of FPGA-based CNN accelerators. In this context, single computation engines constitute a popular appro… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted at ACM TODAES, 2023. arXiv admin note: substantial text overlap with arXiv:2103.05600

  33. arXiv:2307.09988  [pdf, other

    cs.LG cs.CV

    TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the Data-Scarce Edge

    Authors: Young D. Kwon, Rui Li, Stylianos I. Venieris, Jagmohan Chauhan, Nicholas D. Lane, Cecilia Mascolo

    Abstract: On-device training is essential for user personalisation and privacy. With the pervasiveness of IoT devices and microcontroller units (MCUs), this task becomes more challenging due to the constrained memory and compute resources, and the limited availability of labelled user data. Nonetheless, prior works neglect the data scarcity issue, require excessively long training time (e.g. a few hours), o… ▽ More

    Submitted 10 June, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted by ICML 2024

  34. arXiv:2307.07393  [pdf, other

    cs.CV

    L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated Self-Supervised Visual Representation Learning

    Authors: Yasar Abbas Ur Rehman, Yan Gao, Pedro Porto Buarque de Gusmão, Mina Alibeigi, Jiajun Shen, Nicholas D. Lane

    Abstract: The ubiquity of camera-enabled devices has led to large amounts of unlabeled image data being produced at the edge. The integration of self-supervised learning (SSL) and federated learning (FL) into one coherent system can potentially offer data privacy guarantees while also advancing the quality and robustness of the learned visual representations without needing to move data around. However, cli… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  35. arXiv:2307.06933  [pdf, other

    cs.LG cs.AI cs.DC

    FDAPT: Federated Domain-adaptive Pre-training for Language Models

    Authors: Lekang Jiang, Filip Svoboda, Nicholas D. Lane

    Abstract: Foundation models (FMs) have shown prominent success in a wide range of tasks. Their applicability to specific domain-task pairings relies on the availability of, both, high-quality data and significant computational resources. These challenges are not new to the field and, indeed, Federated Learning (FL) has been shown to be a promising solution in similar setups. This paper tackles the specific… ▽ More

    Submitted 9 November, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted at International Workshop on Federated Learning in the Age of Foundation Models in Conjunction with NeurIPS 2023

  36. arXiv:2306.17453  [pdf, other

    cs.DC

    Pollen: High-throughput Federated Learning Simulation via Resource-Aware Client Placement

    Authors: Lorenzo Sani, Pedro Porto Buarque de Gusmão, Alex Iacob, Wanru Zhao, Xinchi Qiu, Yan Gao, Javier Fernandez-Marques, Nicholas Donald Lane

    Abstract: Federated Learning (FL) is a privacy-focused machine learning paradigm that collaboratively trains models directly on edge devices. Simulation plays an essential role in FL adoption, helping develop novel aggregation and client sampling strategies. However, current simulators cannot emulate large-scale systems in a time-efficient manner, which limits their utility and casts doubts on generalizabil… ▽ More

    Submitted 20 May, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: 22 pages, 22 figures, 9 tables, under review

  37. arXiv:2306.04040  [pdf, other

    cs.LG cs.AI cs.CR

    FedVal: Different good or different bad in federated learning

    Authors: Viktor Valadi, Xinchi Qiu, Pedro Porto Buarque de Gusmão, Nicholas D. Lane, Mina Alibeigi

    Abstract: Federated learning (FL) systems are susceptible to attacks from malicious actors who might attempt to corrupt the training model through various poisoning attacks. FL also poses new challenges in addressing group bias, such as ensuring fair performance for different demographic groups. Traditional methods used to address such biases require centralized access to the data, which FL systems do not h… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: To appear in the proceedings of the USENIX Security Symposium 2023

  38. arXiv:2305.18334  [pdf, other

    cs.AR cs.LG

    PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration

    Authors: Ahmed F. AbouElhamayed, Angela Cui, Javier Fernandez-Marques, Nicholas D. Lane, Mohamed S. Abdelfattah

    Abstract: Conventional multiply-accumulate (MAC) operations have long dominated computation time for deep neural networks (DNNs), espcially convolutional neural networks (CNNs). Recently, product quantization (PQ) has been applied to these workloads, replacing MACs with memory lookups to pre-computed dot products. To better understand the efficiency tradeoffs of product-quantized DNNs (PQ-DNNs), we create a… ▽ More

    Submitted 28 March, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: ACM Transactions on Reconfigurable Technology and Systems (TRETS) - FCCM 2024 Journal Track

  39. arXiv:2305.16794  [pdf, other

    cs.CR cs.LG

    Secure Vertical Federated Learning Under Unreliable Connectivity

    Authors: Xinchi Qiu, Heng Pan, Wanru Zhao, Yan Gao, Pedro P. B. Gusmao, William F. Shen, Chenyang Ma, Nicholas D. Lane

    Abstract: Most work in privacy-preserving federated learning (FL) has focused on horizontally partitioned datasets where clients hold the same features and train complete client-level models independently. However, individual data points are often scattered across different institutions, known as clients, in vertical FL (VFL) settings. Addressing this category of FL necessitates the exchange of intermediate… ▽ More

    Submitted 17 February, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Generalised extension from our previous work: arXiv:2305.11236

  40. arXiv:2305.12134  [pdf, other

    cs.LG cs.AI

    Privacy in Multimodal Federated Human Activity Recognition

    Authors: Alex Iacob, Pedro P. B. Gusmão, Nicholas D. Lane, Armand K. Koupai, Mohammud J. Bocus, Raúl Santos-Rodríguez, Robert J. Piechocki, Ryan McConville

    Abstract: Human Activity Recognition (HAR) training data is often privacy-sensitive or held by non-cooperative entities. Federated Learning (FL) addresses such concerns by training ML models on edge clients. This work studies the impact of privacy in federated HAR at a user, environment, and sensor level. We show that the performance of FL for HAR depends on the assumed privacy level of the FL system and pr… ▽ More

    Submitted 2 June, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: In 3rd On-Device Intelligence Workshop at MLSys 2023, 8 pages

  41. arXiv:2305.11236  [pdf, other

    cs.LG cs.AI cs.CR

    Efficient Vertical Federated Learning with Secure Aggregation

    Authors: Xinchi Qiu, Heng Pan, Wanru Zhao, Chenyang Ma, Pedro Porto Buarque de Gusmão, Nicholas D. Lane

    Abstract: The majority of work in privacy-preserving federated learning (FL) has been focusing on horizontally partitioned datasets where clients share the same sets of features and can train complete models independently. However, in many interesting problems, such as financial fraud detection and disease detection, individual data points are scattered across different clients/organizations in vertical fed… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Federated Learning Systems (FLSys) Workshop @ MLSys 2023

  42. Can Fair Federated Learning reduce the need for Personalisation?

    Authors: Alex Iacob, Pedro P. B. Gusmão, Nicholas D. Lane

    Abstract: Federated Learning (FL) enables training ML models on edge clients without sharing data. However, the federated model's performance on local data varies, disincentivising the participation of clients who benefit little from FL. Fair FL reduces accuracy disparity by focusing on clients with higher losses while personalisation locally fine-tunes the model. Personalisation provides a participation in… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: In 3rd Workshop on Machine Learning and Systems (EuroMLSys 2023), 9 pages

  43. arXiv:2304.07537  [pdf, other

    cs.LG cs.AI cs.DC cs.DS

    Gradient-less Federated Gradient Boosting Trees with Learnable Learning Rates

    Authors: Chenyang Ma, Xinchi Qiu, Daniel J. Beutel, Nicholas D. Lane

    Abstract: The privacy-sensitive nature of decentralized datasets and the robustness of eXtreme Gradient Boosting (XGBoost) on tabular data raise the needs to train XGBoost in the context of federated learning (FL). Existing works on federated XGBoost in the horizontal setting rely on the sharing of gradients, which induce per-node level communication frequency and serious privacy concerns. To alleviate thes… ▽ More

    Submitted 24 March, 2024; v1 submitted 15 April, 2023; originally announced April 2023.

    Comments: Accepted at the 3rd ACM Workshop on Machine Learning and Systems (EuroMLSys), May 8th 2023, Rome, Italy

  44. A Federated Learning Benchmark for Drug-Target Interaction

    Authors: Gianluca Mittone, Filip Svoboda, Marco Aldinucci, Nicholas D. Lane, Pietro Lio

    Abstract: Aggregating pharmaceutical data in the drug-target interaction (DTI) domain has the potential to deliver life-saving breakthroughs. It is, however, notoriously difficult due to regulatory constraints and commercial interests. This work proposes the application of federated learning, which we argue to be reconcilable with the industry's constraints, as it does not require sharing of any information… ▽ More

    Submitted 18 October, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: This paper is the accepted version of ACM copyrighted material published at the WWW'23 conference

    Journal ref: In Companion Proceedings of the ACM Web Conference 2023 (pp. 1177-1181)

  45. NAWQ-SR: A Hybrid-Precision NPU Engine for Efficient On-Device Super-Resolution

    Authors: Stylianos I. Venieris, Mario Almeida, Royson Lee, Nicholas D. Lane

    Abstract: In recent years, image and video delivery systems have begun integrating deep learning super-resolution (SR) approaches, leveraging their unprecedented visual enhancement capabilities while reducing reliance on networking conditions. Nevertheless, deploying these solutions on mobile devices still remains an active challenge as SR models are excessively demanding with respect to workload and memory… ▽ More

    Submitted 14 March, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Accepted for publication at the IEEE Transactions on Mobile Computing (TMC), 2023

  46. arXiv:2212.07886  [pdf, other

    cs.CV

    Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation

    Authors: Royson Lee, Rui Li, Stylianos I. Venieris, Timothy Hospedales, Ferenc Huszár, Nicholas D. Lane

    Abstract: Recent image degradation estimation methods have enabled single-image super-resolution (SR) approaches to better upsample real-world images. Among these methods, explicit kernel estimation approaches have demonstrated unprecedented performance at handling unknown degradations. Nonetheless, a number of limitations constrain their efficacy when used by downstream SR models. Specifically, this family… ▽ More

    Submitted 30 October, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Preprint: Accepted at the 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

  47. arXiv:2211.17246  [pdf, other

    cs.LG

    Pex: Memory-efficient Microcontroller Deep Learning through Partial Execution

    Authors: Edgar Liberis, Nicholas D. Lane

    Abstract: Embedded and IoT devices, largely powered by microcontroller units (MCUs), could be made more intelligent by leveraging on-device deep learning. One of the main challenges of neural network inference on an MCU is the extremely limited amount of read-write on-chip memory (SRAM, < 512 kB). SRAM is consumed by the neural network layer (operator) input and output buffers, which, traditionally, must be… ▽ More

    Submitted 24 January, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

  48. arXiv:2210.10514  [pdf, other

    cs.LG

    The Future of Consumer Edge-AI Computing

    Authors: Stefanos Laskaridis, Stylianos I. Venieris, Alexandros Kouris, Rui Li, Nicholas D. Lane

    Abstract: In the last decade, Deep Learning has rapidly infiltrated the consumer end, mainly thanks to hardware acceleration across devices. However, as we look towards the future, it is evident that isolated hardware will be insufficient. Increasingly complex AI tasks demand shared resources, cross-device collaboration, and multiple data types, all without compromising user privacy or quality of experience… ▽ More

    Submitted 18 June, 2024; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: Extended version of accepted paper at IEEE Pervasive Computing

  49. arXiv:2210.07271  [pdf, other

    cs.LG

    BLOX: Macro Neural Architecture Search Benchmark and Algorithms

    Authors: Thomas Chun Pong Chau, Łukasz Dudziak, Hongkai Wen, Nicholas Donald Lane, Mohamed S Abdelfattah

    Abstract: Neural architecture search (NAS) has been successfully used to design numerous high-performance neural networks. However, NAS is typically compute-intensive, so most existing approaches restrict the search to decide the operations and topological structure of a single block only, then the same block is stacked repeatedly to form an end-to-end model. Although such an approach reduces the size of se… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Published in the Proceedings of the 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks

  50. arXiv:2209.15575  [pdf, other

    cs.SD cs.LG eess.AS

    Match to Win: Analysing Sequences Lengths for Efficient Self-supervised Learning in Speech and Audio

    Authors: Yan Gao, Javier Fernandez-Marques, Titouan Parcollet, Pedro P. B. de Gusmao, Nicholas D. Lane

    Abstract: Self-supervised learning (SSL) has proven vital in speech and audio-related applications. The paradigm trains a general model on unlabeled data that can later be used to solve specific downstream tasks. This type of model is costly to train as it requires manipulating long input sequences that can only be handled by powerful centralised servers. Surprisingly, despite many attempts to increase trai… ▽ More

    Submitted 22 November, 2022; v1 submitted 30 September, 2022; originally announced September 2022.