Skip to main content

Showing 1–21 of 21 results for author: Medina, A M

.
  1. arXiv:2406.02797  [pdf, other

    cs.LG cs.CR

    Auditing Privacy Mechanisms via Label Inference Attacks

    Authors: Róbert István Busa-Fekete, Travis Dick, Claudio Gentile, Andrés Muñoz Medina, Adam Smith, Marika Swanberg

    Abstract: We propose reconstruction advantage measures to audit label privatization mechanisms. A reconstruction advantage measure quantifies the increase in an attacker's ability to infer the true label of an unlabeled example when provided with a private version of the labels in a dataset (e.g., aggregate of labels from different users or noisy labels output by randomized response), compared to an attacke… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2310.03104  [pdf, other

    cs.LG cs.CR

    Differentially Private Optimization for Non-Decomposable Objective Functions

    Authors: Weiwei Kong, Andrés Muñoz Medina, Mónica Ribero

    Abstract: Unsupervised pre-training is a common step in developing computer vision models and large language models. In this setting, the absence of labels requires the use of similarity-based loss functions, such as contrastive loss, that favor minimizing the distance between similar inputs and maximizing the distance between distinct inputs. As privacy concerns mount, training these models using different… ▽ More

    Submitted 20 February, 2025; v1 submitted 4 October, 2023; originally announced October 2023.

  3. arXiv:2307.05608  [pdf, other

    cs.CR

    DP-Auditorium: a Large Scale Library for Auditing Differential Privacy

    Authors: William Kong, Andrés Muñoz Medina, Mónica Ribero, Umar Syed

    Abstract: New regulations and increased awareness of data privacy have led to the deployment of new and more efficient differentially private mechanisms across public institutions and industries. Ensuring the correctness of these mechanisms is therefore crucial to ensure the proper protection of data. However, since differential privacy is a property of the mechanism itself, and not of an individual output,… ▽ More

    Submitted 18 December, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

  4. arXiv:2305.07751  [pdf, other

    cs.LG cs.CR cs.IT math.ST

    Private and Communication-Efficient Algorithms for Entropy Estimation

    Authors: Gecia Bravo-Hermsdorff, Róbert Busa-Fekete, Mohammad Ghavamzadeh, Andres Muñoz Medina, Umar Syed

    Abstract: Modern statistical estimation is often performed in a distributed setting where each sample belongs to a single user who shares their data with a central server. Users are typically concerned with preserving the privacy of their samples, and also with minimizing the amount of data they must transmit to the server. We give improved private and communication-efficient algorithms for estimating sever… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: Originally published at the 36th Conference on Neural Information Processing Systems (NeurIPS 2022). This version corrects some errors in the original version

  5. arXiv:2304.07210  [pdf, other

    cs.CR cs.LG

    Measuring Re-identification Risk

    Authors: CJ Carey, Travis Dick, Alessandro Epasto, Adel Javanmard, Josh Karlin, Shankar Kumar, Andres Munoz Medina, Vahab Mirrokni, Gabriel Henrique Nunes, Sergei Vassilvitskii, Peilin Zhong

    Abstract: Compact user representations (such as embeddings) form the backbone of personalization services. In this work, we present a new theoretical framework to measure re-identification risk in such user representations. Our framework, based on hypothesis testing, formally bounds the probability that an attacker may be able to obtain the identity of a user from their representation. As an application, we… ▽ More

    Submitted 31 July, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

  6. arXiv:2302.03115  [pdf, other

    cs.LG stat.ML

    Easy Learning from Label Proportions

    Authors: Robert Istvan Busa-Fekete, Heejin Choi, Travis Dick, Claudio Gentile, Andres Munoz medina

    Abstract: We consider the problem of Learning from Label Proportions (LLP), a weakly supervised classification setup where instances are grouped into "bags", and only the frequency of class labels at each bag is available. Albeit, the objective of the learner is to achieve low task loss at an individual instance level. Here we propose Easyllp: a flexible and simple-to-implement debiasing approach based on a… ▽ More

    Submitted 13 February, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  7. arXiv:2301.05605  [pdf, ps, other

    cs.DS

    Differentially Private Continual Releases of Streaming Frequency Moment Estimations

    Authors: Alessandro Epasto, Jieming Mao, Andres Munoz Medina, Vahab Mirrokni, Sergei Vassilvitskii, Peilin Zhong

    Abstract: The streaming model of computation is a popular approach for working with large-scale data. In this setting, there is a stream of items and the goal is to compute the desired quantities (usually data statistics) while making a single pass through the stream and using as little space as possible. Motivated by the importance of data privacy, we develop differentially private streaming algorithms u… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

  8. arXiv:2207.06358  [pdf, other

    cs.CR cs.LG

    Smooth Anonymity for Sparse Graphs

    Authors: Alessandro Epasto, Hossein Esfandiari, Vahab Mirrokni, Andres Munoz Medina

    Abstract: When working with user data providing well-defined privacy guarantees is paramount. In this work, we aim to manipulate and share an entire sparse dataset with a third party privately. In fact, differential privacy has emerged as the gold standard of privacy, however, when it comes to sharing sparse datasets, e.g. sparse networks, as one of our main results, we prove that \emph{any} differentially… ▽ More

    Submitted 14 May, 2024; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: WWW 2024 Short Paper

  9. arXiv:2201.12333  [pdf, other

    cs.CR

    A Joint Exponential Mechanism For Differentially Private Top-$k$

    Authors: Jennifer Gillenwater, Matthew Joseph, Andrés Muñoz Medina, Mónica Ribero

    Abstract: We present a differentially private algorithm for releasing the sequence of $k$ elements with the highest counts from a data domain of $d$ elements. The algorithm is a "joint" instance of the exponential mechanism, and its output space consists of all $O(d^k)$ length-$k$ sequences. Our main contribution is a method to sample this exponential mechanism in time $O(dk\log(k) + d\log(d))$ and space… ▽ More

    Submitted 30 August, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

  10. arXiv:2201.12306  [pdf, other

    cs.DS cs.CR cs.CY cs.DB stat.CO

    Statistical anonymity: Quantifying reidentification risks without reidentifying users

    Authors: Gecia Bravo-Hermsdorff, Robert Busa-Fekete, Lee M. Gunderson, Andrés Munõz Medina, Umar Syed

    Abstract: Data anonymization is an approach to privacy-preserving data release aimed at preventing participants reidentification, and it is an important alternative to differential privacy in applications that cannot tolerate noisy data. Existing algorithms for enforcing $k$-anonymity in the released data assume that the curator performing the anonymization has complete access to the original data. Reasons… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

  11. arXiv:2010.04235  [pdf, other

    cs.CR cs.LG

    Duff: A Dataset-Distance-Based Utility Function Family for the Exponential Mechanism

    Authors: Andrés Muñoz Medina, Jenny Gillenwater

    Abstract: We propose and analyze a general-purpose dataset-distance-based utility function family, Duff, for differential privacy's exponential mechanism. Given a particular dataset and a statistic (e.g., median, mode), this function family assigns utility to a possible output o based on the number of individuals whose data would have to be added to or removed from the dataset in order for the statistic to… ▽ More

    Submitted 21 January, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

  12. arXiv:2007.01181  [pdf, other

    cs.LG cs.CR stat.ML

    Private Optimization Without Constraint Violations

    Authors: Andrés Muñoz Medina, Umar Syed, Sergei Vassilvitskii, Ellen Vitercik

    Abstract: We study the problem of differentially private optimization with linear constraints when the right-hand-side of the constraints depends on private data. This type of problem appears in many applications, especially resource allocation. Previous research provided solutions that retained privacy but sometimes violated the constraints. In many settings, however, the constraints cannot be violated und… ▽ More

    Submitted 3 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

  13. arXiv:2004.11225  [pdf, other

    astro-ph.SR astro-ph.EP

    Spectroscopic Orbits of Eleven Nearby, Mid-to-Late M Dwarf Binaries

    Authors: Jennifer G. Winters, Jonathan M. Irwin, David Charbonneau, David W. Latham, Amber M. Medina, Jessica Mink, Gilbert A. Esquerdo, Perry Berlind, Michael L. Calkins, Zachory K. Berta-Thompson

    Abstract: We present the spectroscopic orbits of eleven nearby, mid-to-late M dwarf binary systems in a variety of configurations: two single-lined binaries (SB1s), seven double-lined binaries (SB2s), one double-lined triple (ST2), and one triple-lined triple (ST3). Eight of these orbits are the first published for these systems, while five are newly identified multiples. We obtained multi-epoch, high-resol… ▽ More

    Submitted 27 April, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: Accepted for publication in AJ. Full table of RVs available upon request before publication. Corrected uncertainty on LHS 1817 mass ratio & T_eff

  14. arXiv:1802.05315  [pdf, other

    cs.LG stat.ML

    Online Learning for Non-Stationary A/B Tests

    Authors: Andrés Muñoz Medina, Sergei Vassilvitskii, Dong Yin

    Abstract: The rollout of new versions of a feature in modern applications is a manual multi-stage process, as the feature is released to ever larger groups of users, while its performance is carefully monitored. This kind of A/B testing is ubiquitous, but suboptimal, as the monitoring requires heavy human intervention, is not guaranteed to capture consistent, but short-term fluctuations in performance, and… ▽ More

    Submitted 27 May, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

  15. Raspberry Pi and Arduino Uno Working together as a Basic Meteorological Station

    Authors: José Rafael Cortés León, Ricardo Francisco Martínez-González, Anilú Miranda Medina, Luis Alberto Peralta-Pelaez

    Abstract: The present paper describes a novel Raspberry Pi and Arduino UNO architecture used as a meteorological station. One of the advantages of the proposed architecture is the huge quantity of sensors developed for its usage; practically one can find them for any application, and weather sensing is not an exception. The principle followed is to configure Raspberry as a collector for measures obtained fr… ▽ More

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: 8 pages and 5 figures

    Journal ref: International Journal of Computer Science & Information Technology (IJCSIT) Vol 9, No 5, October 2017

  16. arXiv:1706.04732  [pdf, other

    cs.LG cs.GT

    Revenue Optimization with Approximate Bid Predictions

    Authors: Andrés Muñoz Medina, Sergei Vassilvitskii

    Abstract: In the context of advertising auctions, finding good reserve prices is a notoriously challenging learning problem. This is due to the heterogeneity of ad opportunity types and the non-convexity of the objective function. In this work, we show how to reduce reserve price optimization to the standard setting of prediction under squared loss, a well understood problem in the learning community. We fu… ▽ More

    Submitted 6 November, 2017; v1 submitted 15 June, 2017; originally announced June 2017.

    Comments: Accepted to NIPS 2017

  17. arXiv:1506.02719  [pdf, other

    cs.LG cs.GT

    Non-parametric Revenue Optimization for Generalized Second Price Auctions

    Authors: Mehryar Mohri, Andres Munoz Medina

    Abstract: We present an extensive analysis of the key problem of learning optimal reserve prices for generalized second price auctions. We describe two algorithms for this task: one based on density estimation, and a novel algorithm benefiting from solid theoretical guarantees and with a very favorable running-time complexity of $O(n S \log (n S))$, where $n$ is the sample size and $S$ the number of slots.… ▽ More

    Submitted 8 June, 2015; originally announced June 2015.

    Comments: To be published in Proceedings of UAI 2015

  18. arXiv:1411.6305  [pdf, other

    cs.LG

    Revenue Optimization in Posted-Price Auctions with Strategic Buyers

    Authors: Mehryar Mohri, Andres Muñoz Medina

    Abstract: We study revenue optimization learning algorithms for posted-price auctions with strategic buyers. We analyze a very broad family of monotone regret minimization algorithms for this problem, which includes the previously best known algorithm, and show that no algorithm in that family admits a strategic regret more favorable than $Ω(\sqrt{T})$. We then introduce a new algorithm that achieves a stra… ▽ More

    Submitted 23 November, 2014; originally announced November 2014.

    Comments: At NIPS 2014

  19. arXiv:1405.1503  [pdf, other

    cs.LG

    Adaptation Algorithm and Theory Based on Generalized Discrepancy

    Authors: Corinna Cortes, Mehryar Mohri, Andres Muñoz Medina

    Abstract: We present a new algorithm for domain adaptation improving upon a discrepancy minimization algorithm previously shown to outperform a number of algorithms for this task. Unlike many previous algorithms for domain adaptation, our algorithm does not consist of a fixed reweighting of the losses over the training sample. We show that our algorithm benefits from a solid theoretical foundation and more… ▽ More

    Submitted 20 February, 2015; v1 submitted 7 May, 2014; originally announced May 2014.

  20. arXiv:1310.5665  [pdf, other

    cs.LG

    Learning Theory and Algorithms for Revenue Optimization in Second-Price Auctions with Reserve

    Authors: Mehryar Mohri, Andres Muñoz Medina

    Abstract: Second-price auctions with reserve play a critical role for modern search engine and popular online sites since the revenue of these companies often directly de- pends on the outcome of such auctions. The choice of the reserve price is the main mechanism through which the auction revenue can be influenced in these electronic markets. We cast the problem of selecting the reserve price to optimize r… ▽ More

    Submitted 2 December, 2014; v1 submitted 21 October, 2013; originally announced October 2013.

    Comments: Accepted at ICML 2014

  21. arXiv:1205.4343  [pdf, other

    cs.LG stat.ML

    New Analysis and Algorithm for Learning with Drifting Distributions

    Authors: Mehryar Mohri, Andres Munoz Medina

    Abstract: We present a new analysis of the problem of learning with drifting distributions in the batch setting using the notion of discrepancy. We prove learning bounds based on the Rademacher complexity of the hypothesis set and the discrepancy of distributions both for a drifting PAC scenario and a tracking scenario. Our bounds are always tighter and in some cases substantially improve upon previous ones… ▽ More

    Submitted 25 August, 2012; v1 submitted 19 May, 2012; originally announced May 2012.

    Comments: 15 pages, 2 figures to be published in volume 7568 of the Lecture Notes in Computer Science series