Showing 1–12 of 12 results for author: Pillutla, K

Searching in archive math.
  1. arXiv:2310.13863  [pdf, other]

    stat.ML cs.LG math.OC

    Distributionally Robust Optimization with Bias and Variance Reduction

    Authors: Ronak Mehta, Vincent Roulet, Krishna Pillutla, Zaid Harchaoui

    Abstract: We consider the distributionally robust optimization (DRO) problem with a spectral risk-based uncertainty set and an $f$-divergence penalty. This formulation includes common risk-sensitive learning objectives such as regularized conditional value-at-risk (CVaR) and average top-$k$ loss. We present Prospect, a stochastic gradient-based algorithm that only requires tuning a single learning rate hyperparame…

    Submitted 20 October, 2023; originally announced October 2023.

  2. arXiv:2310.06771  [pdf, other]

    cs.LG cs.AI cs.CR math.OC

    Correlated Noise Provably Beats Independent Noise for Differentially Private Learning

    Authors: Christopher A. Choquette-Choo, Krishnamurthy Dvijotham, Krishna Pillutla, Arun Ganesh, Thomas Steinke, Abhradeep Thakurta

    Abstract: Differentially private learning algorithms inject noise into the learning process. While the most common private learning algorithm, DP-SGD, adds independent Gaussian noise in each iteration, recent work on matrix factorization mechanisms has shown empirically that introducing correlations in the noise can greatly improve their utility. We characterize the asymptotic learning utility for any choic…

    Submitted 7 May, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Christopher A. Choquette-Choo, Krishnamurthy Dvijotham, and Krishna Pillutla contributed equally

    Journal ref: ICLR 2024

  3. arXiv:2305.18447  [pdf, other]

    cs.LG cs.CR cs.IT math.ST

    Unleashing the Power of Randomization in Auditing Differentially Private ML

    Authors: Krishna Pillutla, Galen Andrew, Peter Kairouz, H. Brendan McMahan, Alina Oprea, Sewoong Oh

    Abstract: We present a rigorous methodology for auditing differentially private machine learning algorithms by adding multiple carefully designed examples called canaries. We take a first principles approach based on three key components. First, we introduce Lifted Differential Privacy (LiDP) that expands the definition of differential privacy to handle randomized datasets. This gives us the freedom to desi…

    Submitted 28 May, 2023; originally announced May 2023.

  4. arXiv:2305.10634  [pdf, other]

    math.OC cs.LG

    Modified Gauss-Newton Algorithms under Noise

    Authors: Krishna Pillutla, Vincent Roulet, Sham Kakade, Zaid Harchaoui

    Abstract: Gauss-Newton methods and their stochastic version have been widely used in machine learning and signal processing. Their nonsmooth counterparts, modified Gauss-Newton or prox-linear algorithms, can lead to contrasting outcomes when compared to gradient descent in large-scale statistical settings. We explore the contrasting performance of these two classes of algorithms in theory on a stylized stat…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: IEEE SSP 2023

  5. arXiv:2212.05149  [pdf, other]

    stat.ML cs.LG math.OC

    Stochastic Optimization for Spectral Risk Measures

    Authors: Ronak Mehta, Vincent Roulet, Krishna Pillutla, Lang Liu, Zaid Harchaoui

    Abstract: Spectral risk objectives - also called $L$-risks - allow for learning systems to interpolate between optimizing average-case performance (as in empirical risk minimization) and worst-case performance on a task. We develop stochastic algorithms to optimize these quantities by characterizing their subdifferential and addressing challenges such as biasedness of subgradient estimates and non-smoothnes…

    Submitted 9 December, 2022; originally announced December 2022.

  6. arXiv:2212.04014  [pdf, other]

    stat.ML cs.LG math.ST

    Statistical and Computational Guarantees for Influence Diagnostics

    Authors: Jillian Fisher, Lang Liu, Krishna Pillutla, Yejin Choi, Zaid Harchaoui

    Abstract: Influence diagnostics such as influence functions and approximate maximum influence perturbations are popular in machine learning and in AI domain applications. Influence diagnostics are powerful statistical tools to identify influential datapoints or subsets of datapoints. We establish finite-sample statistical bounds, as well as computational complexity bounds, for influence functions and approx…

    Submitted 19 September, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: For AISTATS 2023. Software see https://github.com/jfisher52/influence_theory

  7. arXiv:2204.03809  [pdf, other]

    cs.LG cs.DC math.OC

    Federated Learning with Partial Model Personalization

    Authors: Krishna Pillutla, Kshitiz Malik, Abdelrahman Mohamed, Michael Rabbat, Maziar Sanjabi, Lin Xiao

    Abstract: We consider two federated learning algorithms for training partially personalized models, where the shared and personal parameters are updated either simultaneously or alternately on the devices. Both algorithms have been proposed in the literature, but their convergence properties are not fully understood, especially for the alternating variant. We provide convergence analyses of both algorithms…

    Submitted 15 August, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Journal ref: ICML 2022: 17716-17758

  8. arXiv:2201.00508  [pdf, other]

    math.OC

    Superquantiles at Work: Machine Learning Applications and Efficient Subgradient Computation

    Authors: Yassine Laguel, Krishna Pillutla, Jérôme Malick, Zaid Harchaoui

    Abstract: R. Tyrrell Rockafellar and collaborators introduced, in a series of works, new regression modeling methods based on the notion of superquantile (or conditional value-at-risk). These methods have been influential in economics, finance, management science, and operations research in general. Recently, they have been the subject of a renewed interest in machine learning, to address issues of distribut…

    Submitted 3 January, 2022; originally announced January 2022.

  9. arXiv:2112.09429  [pdf, other]

    cs.LG math.OC stat.ML

    Federated Learning with Superquantile Aggregation for Heterogeneous Data

    Authors: Krishna Pillutla, Yassine Laguel, Jérôme Malick, Zaid Harchaoui

    Abstract: We present a federated learning framework that is designed to robustly deliver good predictive performance across individual clients with heterogeneous data. The proposed approach hinges upon a superquantile-based learning objective that captures the tail statistics of the error distribution over heterogeneous clients. We present a stochastic training algorithm that interleaves differentially priv…

    Submitted 6 December, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: Machine Learning Journal, Special Issue on Safe and Fair Machine Learning (To appear)

    Journal ref: Machine Learning (2023): 1-68

  10. arXiv:2002.11223  [pdf, other]

    stat.ML cs.DC cs.LG math.OC

    Device Heterogeneity in Federated Learning: A Superquantile Approach

    Authors: Yassine Laguel, Krishna Pillutla, Jérôme Malick, Zaid Harchaoui

    Abstract: We propose a federated learning framework to handle heterogeneous client devices which do not conform to the population data distribution. The approach hinges upon a parameterized superquantile-based objective, where the parameter ranges over levels of conformity. We present an optimization algorithm and establish its convergence to a stationary point. We show how to practically implement it using…

    Submitted 25 February, 2020; originally announced February 2020.

    Journal ref: Machine Learning (2023): 1-68

  11. arXiv:1902.03228  [pdf, other]

    stat.ML cs.LG math.OC

    A Smoother Way to Train Structured Prediction Models

    Authors: Krishna Pillutla, Vincent Roulet, Sham M. Kakade, Zaid Harchaoui

    Abstract: We present a framework to train a structured prediction model by performing smoothing on the inference algorithm it builds upon. Smoothing overcomes the non-smoothness inherent to the maximum margin structured prediction objective, and paves the way for the use of fast primal gradient-based optimization algorithms. We illustrate the proposed framework by developing a novel primal incremental optim…

    Submitted 8 February, 2019; originally announced February 2019.

    Comments: Short version appeared in Neural Information Processing Systems (NeurIPS) 2018

  12. arXiv:1710.09430  [pdf, ps, other]

    stat.ML cs.LG math.OC

    A Markov Chain Theory Approach to Characterizing the Minimax Optimality of Stochastic Gradient Descent (for Least Squares)

    Authors: Prateek Jain, Sham M. Kakade, Rahul Kidambi, Praneeth Netrapalli, Venkata Krishna Pillutla, Aaron Sidford

    Abstract: This work provides a simplified proof of the statistical minimax optimality of (iterate averaged) stochastic gradient descent (SGD), for the special case of least squares. This result is obtained by analyzing SGD as a stochastic process and by sharply characterizing the stationary covariance matrix of this process. The finite rate optimality characterization captures the constant factors and addre…

    Submitted 21 July, 2018; v1 submitted 25 October, 2017; originally announced October 2017.

    Comments: Lemma 1 has been updated in v2