Showing 1–6 of 6 results for author: Emerson, D B

Searching in archive cs.
  1. arXiv:2506.09200

    cs.LG cs.CL

    FedRAG: A Framework for Fine-Tuning Retrieval-Augmented Generation Systems

    Authors: Val Andrei Fajardo, David B. Emerson, Amandeep Singh, Veronica Chatrath, Marcelo Lotif, Ravi Theja, Alex Cheung, Izuki Matsuba

    Abstract: Retrieval-augmented generation (RAG) systems have been shown to be effective in addressing many of the drawbacks of relying solely on the parametric memory of large language models. Recent work has demonstrated that RAG systems can be improved via fine-tuning of their retriever and generator models. In this work, we introduce FedRAG, a framework for fine-tuning RAG systems across centralized and f…

    Submitted 12 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: 9 pages, 4 figures, 2 tables. Accepted for the CODEML Workshop at ICML 2025. Framework code available at https://github.com/VectorInstitute/fed-rag

  2. arXiv:2505.07525

    cs.LG

    Adaptive Latent-Space Constraints in Personalized FL

    Authors: Sana Ayromlou, D. B. Emerson

    Abstract: Federated learning (FL) has become an effective and widely used approach to training deep learning models on decentralized datasets held by distinct clients. FL also strengthens both security and privacy protections for training data. Common challenges associated with statistical heterogeneity between distributed datasets have spurred significant interest in personalized FL (pFL) methods, where mo…

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 14 Pages, 1 Algorithm, 3 Figures, 3 Tables

    MSC Class: 68T07

    ACM Class: I.2.0; I.2.11; I.2.6

  3. arXiv:2505.00216

    cs.LG cs.AI cs.GT

    Online Federation For Mixtures of Proprietary Agents with Black-Box Encoders

    Authors: Xuwei Yang, Fatemeh Tavakoli, David B. Emerson, Anastasis Kratsios

    Abstract: Most industry-standard generative AIs and feature encoders are proprietary, offering only black-box access: their outputs are observable, but their internal parameters and architectures remain hidden from the end-user. This black-box access is especially limiting when constructing mixture-of-expert type ensemble models since the user cannot optimize each proprietary AI's internal parameters. Our p…

    Submitted 30 April, 2025; originally announced May 2025.

    Comments: 47 pages, 16 figures, 7 tables

    MSC Class: 68T05; 68T07; 91A80

    ACM Class: I.2.1; I.2.11; G.1.6

  4. arXiv:2404.03471

    cs.CL cs.CY cs.LG

    The Impact of Unstated Norms in Bias Analysis of Language Models

    Authors: Farnaz Kohankhaki, D. B. Emerson, Jacob-Junqi Tian, Laleh Seyyed-Kalantari, Faiza Khan Khattak

    Abstract: Bias in large language models (LLMs) has many forms, from overt discrimination to implicit stereotypes. Counterfactual bias evaluation is a widely used approach to quantifying bias and often relies on template-based probes that explicitly state group membership. It measures whether the outcome of a task performed by an LLM is invariant to a change in group membership. In this work, we find that te…

    Submitted 27 February, 2025; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 15 Pages, 4 Figures, 4 Tables

    MSC Class: 68T50

  5. arXiv:2309.16825

    cs.LG

    A Comprehensive View of Personalized Federated Learning on Heterogeneous Clinical Datasets

    Authors: Fatemeh Tavakoli, D. B. Emerson, Sana Ayromlou, John Jewell, Amrit Krishnan, Yuchong Zhang, Amol Verma, Fahad Razak

    Abstract: Federated learning (FL) is increasingly being recognized as a key approach to overcoming the data silos that so frequently obstruct the training and deployment of machine-learning models in clinical settings. This work contributes to a growing body of FL research specifically focused on clinical applications along three important directions. First, we expand the FLamby benchmark (du Terrail et al.…

    Submitted 4 July, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: 34 pages, 4 figures, 12 tables, 1 algorithm. The update includes a significant number of new experiments, a new format, and additional results

    MSC Class: 68T07

  6. arXiv:2308.00071

    cs.CL cs.AI cs.CY cs.LG

    On The Role of Reasoning in the Identification of Subtle Stereotypes in Natural Language

    Authors: Jacob-Junqi Tian, Omkar Dige, D. B. Emerson, Faiza Khan Khattak

    Abstract: Large language models (LLMs) are trained on vast, uncurated datasets that contain various forms of biases and language reinforcing harmful stereotypes that may be subsequently inherited by the models themselves. Therefore, it is essential to examine and address biases in language models, integrating fairness into their development to ensure that these models do not perpetuate social biases. In thi…

    Submitted 28 September, 2024; v1 submitted 24 July, 2023; originally announced August 2023.

    Comments: 15 pages, 11 Figures, 3 Tables

    MSC Class: 68T50