Skip to main content

Showing 1–11 of 11 results for author: Ribero, M

.
  1. arXiv:2410.06881  [pdf, other

    cs.CR

    Privately Counting Partially Ordered Data

    Authors: Matthew Joseph, Mónica Ribero, Alexander Yu

    Abstract: We consider differentially private counting when each data point consists of $d$ bits satisfying a partial order. Our main technical contribution is a problem-specific $K$-norm mechanism that runs in time $O(d^2)$. Experiments show that, depending on the partial order in question, our solution dominates existing pure differentially private mechanisms, and can reduce their error by an order of magn… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  2. arXiv:2407.05237  [pdf, other

    cs.LG cs.CR cs.DS math.OC stat.ML

    Privacy of the last iterate in cyclically-sampled DP-SGD on nonconvex composite losses

    Authors: Weiwei Kong, Mónica Ribero

    Abstract: Differentially-private stochastic gradient descent (DP-SGD) is a family of iterative machine learning training algorithms that privatize gradients to generate a sequence of differentially-private (DP) model parameters. It is also the standard tool used to train DP models in practice, even though most users are only interested in protecting the privacy of the final model. Tight DP accounting for th… ▽ More

    Submitted 10 February, 2025; v1 submitted 6 July, 2024; originally announced July 2024.

    MSC Class: 65K10 (Primary); 60G15; 68P27 ACM Class: G.3; G.1.6

  3. arXiv:2310.03104  [pdf, other

    cs.LG cs.CR

    Differentially Private Optimization for Non-Decomposable Objective Functions

    Authors: Weiwei Kong, Andrés Muñoz Medina, Mónica Ribero

    Abstract: Unsupervised pre-training is a common step in developing computer vision models and large language models. In this setting, the absence of labels requires the use of similarity-based loss functions, such as contrastive loss, that favor minimizing the distance between similar inputs and maximizing the distance between distinct inputs. As privacy concerns mount, training these models using different… ▽ More

    Submitted 20 February, 2025; v1 submitted 4 October, 2023; originally announced October 2023.

  4. arXiv:2307.05608  [pdf, other

    cs.CR

    DP-Auditorium: a Large Scale Library for Auditing Differential Privacy

    Authors: William Kong, Andrés Muñoz Medina, Mónica Ribero, Umar Syed

    Abstract: New regulations and increased awareness of data privacy have led to the deployment of new and more efficient differentially private mechanisms across public institutions and industries. Ensuring the correctness of these mechanisms is therefore crucial to ensure the proper protection of data. However, since differential privacy is a property of the mechanism itself, and not of an individual output,… ▽ More

    Submitted 18 December, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

  5. arXiv:2208.07353  [pdf, other

    cs.LG cs.CR

    Easy Differentially Private Linear Regression

    Authors: Kareem Amin, Matthew Joseph, Mónica Ribero, Sergei Vassilvitskii

    Abstract: Linear regression is a fundamental tool for statistical analysis. This has motivated the development of linear regression methods that also satisfy differential privacy and thus guarantee that the learned model reveals little about any one data point used to construct it. However, existing differentially private solutions assume that the end user can easily specify good data bounds and hyperparame… ▽ More

    Submitted 16 March, 2023; v1 submitted 15 August, 2022; originally announced August 2022.

    Comments: This version corresponds to the camera-ready at ICLR 2023

  6. Federated Learning Under Intermittent Client Availability and Time-Varying Communication Constraints

    Authors: Monica Ribero, Haris Vikalo, Gustavo De Veciana

    Abstract: Federated learning systems facilitate training of global models in settings where potentially heterogeneous data is distributed across a large number of clients. Such systems operate in settings with intermittent client availability and/or time-varying communication constraints. As a result, the global models trained by federated learning systems may be biased towards clients with higher availabil… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

  7. arXiv:2204.04977  [pdf, other

    cs.CL cs.AI

    Regularization-based Pruning of Irrelevant Weights in Deep Neural Architectures

    Authors: Giovanni Bonetta, Matteo Ribero, Rossella Cancelliere

    Abstract: Deep neural networks exploiting millions of parameters are nowadays the norm in deep learning applications. This is a potential issue because of the great amount of computational resources needed for training, and of the possible loss of generalization performance of overparametrized networks. We propose in this paper a method for learning sparse neural topologies via a regularization technique wh… ▽ More

    Submitted 28 October, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  8. arXiv:2201.12333  [pdf, other

    cs.CR

    A Joint Exponential Mechanism For Differentially Private Top-$k$

    Authors: Jennifer Gillenwater, Matthew Joseph, Andrés Muñoz Medina, Mónica Ribero

    Abstract: We present a differentially private algorithm for releasing the sequence of $k$ elements with the highest counts from a data domain of $d$ elements. The algorithm is a "joint" instance of the exponential mechanism, and its output space consists of all $O(d^k)$ length-$k$ sequences. Our main contribution is a method to sample this exponential mechanism in time $O(dk\log(k) + d\log(d))$ and space… ▽ More

    Submitted 30 August, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

  9. arXiv:2008.06570  [pdf, ps, other

    cs.LG stat.ML

    Fast Dimension Independent Private AdaGrad on Publicly Estimated Subspaces

    Authors: Peter Kairouz, Mónica Ribero, Keith Rush, Abhradeep Thakurta

    Abstract: We revisit the problem of empirical risk minimziation (ERM) with differential privacy. We show that noisy AdaGrad, given appropriate knowledge and conditions on the subspace from which gradients can be drawn, achieves a regret comparable to traditional AdaGrad plus a well-controlled term due to noise. We show a convergence rate of $O(\text{Tr}(G_T)/T)$, where $G_T$ captures the geometry of the gra… ▽ More

    Submitted 30 January, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

  10. arXiv:2007.15197  [pdf, other

    cs.LG stat.ML

    Communication-Efficient Federated Learning via Optimal Client Sampling

    Authors: Monica Ribero, Haris Vikalo

    Abstract: Federated learning (FL) ameliorates privacy concerns in settings where a central server coordinates learning from data distributed across many clients. The clients train locally and communicate the models they learn to the server; aggregation of local models requires frequent communication of large amounts of information between the clients and the central server. We propose a novel, simple and ef… ▽ More

    Submitted 14 October, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

  11. arXiv:2003.00602  [pdf, other

    cs.IR cs.CR cs.LG stat.ML

    Federating Recommendations Using Differentially Private Prototypes

    Authors: Mónica Ribero, Jette Henderson, Sinead Williamson, Haris Vikalo

    Abstract: Machine learning methods allow us to make recommendations to users in applications across fields including entertainment, dating, and commerce, by exploiting similarities in users' interaction patterns. However, in domains that demand protection of personally sensitive data, such as medicine or banking, how can we learn such a model without accessing the sensitive data, and without inadvertently l… ▽ More

    Submitted 1 March, 2020; originally announced March 2020.