Skip to main content

Showing 1–39 of 39 results for author: Udell, M

Searching in archive math. Search in all archives.
.
  1. arXiv:2505.23081  [pdf, ps, other

    math.OC cs.LG stat.ML

    Gradient Methods with Online Scaling Part I. Theoretical Foundations

    Authors: Wenzhi Gao, Ya-Chi Chu, Yinyu Ye, Madeleine Udell

    Abstract: This paper establishes the theoretical foundations of the online scaled gradient methods (OSGM), a framework that utilizes online learning to adapt stepsizes and provably accelerate first-order methods. OSGM quantifies the effectiveness of a stepsize by a feedback function motivated from a convergence measure and uses the feedback to adjust the stepsize through an online learning algorithm. Conseq… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Extension of arXiv:2411.01803 and arXiv:2502.11229

  2. arXiv:2505.13723  [pdf, ps, other

    cs.LG math.OC stat.ML

    Turbocharging Gaussian Process Inference with Approximate Sketch-and-Project

    Authors: Pratik Rathore, Zachary Frangella, Sachin Garg, Shaghayegh Fazliani, Michał Dereziński, Madeleine Udell

    Abstract: Gaussian processes (GPs) play an essential role in biostatistics, scientific machine learning, and Bayesian optimization for their ability to provide probabilistic predictions and model uncertainty. However, GP inference struggles to scale to large datasets (which are common in modern applications), since it requires the solution of a linear system whose size scales quadratically with the number o… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 28 pages, 6 figures, 2 tables

  3. arXiv:2502.16380  [pdf, ps, other

    cs.LG cs.AI math.OC

    Understanding Fixed Predictions via Confined Regions

    Authors: Connor Lawless, Tsui-Wei Weng, Berk Ustun, Madeleine Udell

    Abstract: Machine learning models can assign fixed predictions that preclude individuals from changing their outcome. Existing approaches to audit fixed predictions do so on a pointwise basis, which requires access to an existing dataset of individuals and may fail to anticipate fixed predictions in out-of-sample data. This work presents a new paradigm to identify fixed predictions by finding confined regio… ▽ More

    Submitted 8 July, 2025; v1 submitted 22 February, 2025; originally announced February 2025.

  4. arXiv:2502.11229  [pdf, other

    math.OC cs.LG

    Provable and Practical Online Learning Rate Adaptation with Hypergradient Descent

    Authors: Ya-Chi Chu, Wenzhi Gao, Yinyu Ye, Madeleine Udell

    Abstract: This paper investigates the convergence properties of the hypergradient descent method (HDM), a 25-year-old heuristic originally proposed for adaptive stepsize selection in stochastic first-order methods. We provide the first rigorous convergence analysis of HDM using the online learning framework of [Gao24] and apply this analysis to develop new state-of-the-art adaptive gradient methods with emp… ▽ More

    Submitted 16 March, 2025; v1 submitted 16 February, 2025; originally announced February 2025.

  5. arXiv:2501.04972  [pdf, ps, other

    math.OC

    Algebraic characterization of equivalence between optimization algorithms

    Authors: Laurent Lessard, Madeleine Udell

    Abstract: When are two algorithms the same? How can we be sure a recently proposed algorithm is novel, and not a minor twist on an existing method? In this paper, we present a framework for reasoning about equivalence between a broad class of iterative algorithms, with a focus on algorithms designed for convex optimization. We propose several notions of what it means for two algorithms to be equivalent, and… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: This paper generalizes and provides new analysis and examples compared to arxiv:2105.04684

  6. arXiv:2411.16015  [pdf, other

    math.OC

    When Does Primal Interior Point Method Beat Primal-dual in Linear Optimization?

    Authors: Wenzhi Gao, Huikang Liu, Yinyu Ye, Madeleine Udell

    Abstract: The primal-dual interior point method (IPM) is widely regarded as the most efficient IPM variant for linear optimization. In this paper, we demonstrate that the improved stability of the pure primal IPM can allow speedups relative to a primal-dual solver, particularly as the IPM approaches convergence. The stability of the primal scaling matrix makes it possible to accelerate each primal IPM step… ▽ More

    Submitted 24 November, 2024; originally announced November 2024.

  7. arXiv:2411.01803  [pdf, other

    math.OC cs.LG

    Gradient Methods with Online Scaling

    Authors: Wenzhi Gao, Ya-Chi Chu, Yinyu Ye, Madeleine Udell

    Abstract: We introduce a framework to accelerate the convergence of gradient-based methods with online learning. The framework learns to scale the gradient at each iteration through an online learning algorithm and provably accelerates gradient-based methods asymptotically. In contrast with previous literature, where convergence is established based on worst-case analysis, our framework provides a strong co… ▽ More

    Submitted 5 November, 2024; v1 submitted 4 November, 2024; originally announced November 2024.

  8. arXiv:2407.10070  [pdf, other

    cs.LG math.OC stat.ML

    Have ASkotch: A Neat Solution for Large-scale Kernel Ridge Regression

    Authors: Pratik Rathore, Zachary Frangella, Jiaming Yang, Michał Dereziński, Madeleine Udell

    Abstract: Kernel ridge regression (KRR) is a fundamental computational tool, appearing in problems that range from computational chemistry to health analytics, with a particular interest due to its starring role in Gaussian process regression. However, full KRR solvers are challenging to scale to large datasets: both direct (i.e., Cholesky decomposition) and iterative methods (i.e., PCG) incur prohibitive c… ▽ More

    Submitted 21 February, 2025; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: 64 pages (including appendices), 16 figures, 5 tables

    MSC Class: 65F10; 68W20; 90C06

  9. arXiv:2404.14524  [pdf, other

    math.OC

    Randomized Nyström Preconditioned Interior Point-Proximal Method of Multipliers

    Authors: Ya-Chi Chu, Luiz-Rafael Santos, Madeleine Udell

    Abstract: We present a new algorithm for convex separable quadratic programming (QP) called Nys-IP-PMM, a regularized interior-point solver that uses low-rank structure to accelerate solution of the Newton system. The algorithm combines the interior point proximal method of multipliers (IP-PMM) with the randomized Nyström preconditioned conjugate gradient method as the inner linear system solver. Our algori… ▽ More

    Submitted 13 January, 2025; v1 submitted 22 April, 2024; originally announced April 2024.

    MSC Class: 90C06; 90C20; 90C51; 65F08

  10. arXiv:2402.01868  [pdf, other

    cs.LG math.OC stat.ML

    Challenges in Training PINNs: A Loss Landscape Perspective

    Authors: Pratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu, Madeleine Udell

    Abstract: This paper explores challenges in training Physics-Informed Neural Networks (PINNs), emphasizing the role of the loss landscape in the training process. We examine difficulties in minimizing the PINN loss function, particularly due to ill-conditioning caused by differential operators in the residual term. We compare gradient-based optimizers Adam, L-BFGS, and their combination Adam+L-BFGS, showing… ▽ More

    Submitted 3 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: ICML 2024 Oral; 33 pages (including appendices), 10 figures, 3 tables

  11. arXiv:2312.15594  [pdf, other

    math.NA math.OC

    Scalable Approximate Optimal Diagonal Preconditioning

    Authors: Wenzhi Gao, Zhaonan Qu, Madeleine Udell, Yinyu Ye

    Abstract: We consider the problem of finding the optimal diagonal preconditioner for a positive definite matrix. Although this problem has been shown to be solvable and various methods have been proposed, none of the existing approaches are scalable to matrices of large dimension, or when access is limited to black-box matrix-vector products, thereby significantly limiting their practical application. In vi… ▽ More

    Submitted 5 November, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

  12. arXiv:2310.08333  [pdf, other

    math.OC

    GeNIOS: an (almost) second-order operator-splitting solver for large-scale convex optimization

    Authors: Theo Diamandis, Zachary Frangella, Shipu Zhao, Bartolomeo Stellato, Madeleine Udell

    Abstract: We introduce the GEneralized Newton Inexact Operator Splitting solver (GeNIOS) for large-scale convex optimization. GeNIOS speeds up ADMM by approximately solving approximate subproblems: it uses a second-order approximation to the most challenging ADMM subproblem and solves it inexactly with a fast randomized solver. Despite these approximations, GeNIOS retains the convergence rate of classic ADM… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  13. arXiv:2309.02014  [pdf, other

    math.OC cs.LG

    PROMISE: Preconditioned Stochastic Optimization Methods by Incorporating Scalable Curvature Estimates

    Authors: Zachary Frangella, Pratik Rathore, Shipu Zhao, Madeleine Udell

    Abstract: This paper introduces PROMISE ($\textbf{Pr}$econditioned Stochastic $\textbf{O}$ptimization $\textbf{M}$ethods by $\textbf{I}$ncorporating $\textbf{S}$calable Curvature $\textbf{E}$stimates), a suite of sketching-based preconditioned stochastic gradient algorithms for solving large-scale convex optimization problems arising in machine learning. PROMISE includes preconditioned versions of SVRG, SAG… ▽ More

    Submitted 13 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 52 pages, 9 Figures

  14. arXiv:2302.03863  [pdf, ps, other

    math.OC

    On the (linear) convergence of Generalized Newton Inexact ADMM

    Authors: Zachary Frangella, Theo Diamandis, Bartolomeo Stellato, Madeleine Udell

    Abstract: This paper presents GeNI-ADMM, a framework for large-scale composite convex optimization that facilitates theoretical analysis of both existing and new approximate ADMM schemes. GeNI-ADMM encompasses any ADMM algorithm that solves a first- or second-order approximation to the ADMM subproblem inexactly. GeNI-ADMM exhibits the usual $\mathcal{O} (1/t)$-convergence rate under standard hypotheses and… ▽ More

    Submitted 18 June, 2025; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 31 pages, 4 figures, 2 tables

  15. arXiv:2211.08597  [pdf, other

    math.OC cs.LG

    SketchySGD: Reliable Stochastic Optimization via Randomized Curvature Estimates

    Authors: Zachary Frangella, Pratik Rathore, Shipu Zhao, Madeleine Udell

    Abstract: SketchySGD improves upon existing stochastic gradient methods in machine learning by using randomized low-rank approximations to the subsampled Hessian and by introducing an automated stepsize that works well across a wide range of convex machine learning problems. We show theoretically that SketchySGD with a fixed stepsize converges linearly to a small ball around the optimum. Further, in the ill… ▽ More

    Submitted 20 February, 2024; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 65 pages, 43 figures, 8 tables

  16. arXiv:2202.11599  [pdf, other

    math.OC

    NysADMM: faster composite convex optimization via low-rank approximation

    Authors: Shipu Zhao, Zachary Frangella, Madeleine Udell

    Abstract: This paper develops a scalable new algorithm, called NysADMM, to minimize a smooth convex loss function with a convex regularizer. NysADMM accelerates the inexact Alternating Direction Method of Multipliers (ADMM) by constructing a preconditioner for the ADMM subproblem from a randomized low-rank Nyström approximation. NysADMM comes with strong theoretical guarantees: it solves the ADMM subproblem… ▽ More

    Submitted 2 July, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

  17. arXiv:2110.02820  [pdf, other

    math.NA

    Randomized Nyström Preconditioning

    Authors: Zachary Frangella, Joel A. Tropp, Madeleine Udell

    Abstract: This paper introduces the Nyström PCG algorithm for solving a symmetric positive-definite linear system. The algorithm applies the randomized Nyström method to form a low-rank approximation of the matrix, which leads to an efficient preconditioner that can be deployed with the conjugate gradient algorithm. Theoretical analysis shows that preconditioned system has constant condition number as soon… ▽ More

    Submitted 17 December, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: 37 pages, 3 figures

    MSC Class: 65F08; 68W20; 65F55; 65F22

  18. arXiv:2105.04684  [pdf, other

    math.OC

    An automatic system to detect equivalence between iterative algorithms

    Authors: Shipu Zhao, Laurent Lessard, Madeleine Udell

    Abstract: When are two algorithms the same? How can we be sure a recently proposed algorithm is novel, and not a minor twist on an existing method? In this paper, we present a framework for reasoning about equivalence between a broad class of iterative algorithms, with a focus on algorithms designed for convex optimization. We propose several notions of what it means for two algorithms to be equivalent, and… ▽ More

    Submitted 9 January, 2025; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: This paper documents a software system for identifying equivalence between optimization algorithms. The analysis in this paper has been improved in arxiv:2501.04972

  19. arXiv:2105.00105  [pdf, other

    math.NA cs.LG math.OC

    Tensor Random Projection for Low Memory Dimension Reduction

    Authors: Yiming Sun, Yang Guo, Joel A. Tropp, Madeleine Udell

    Abstract: Random projections reduce the dimension of a set of vectors while preserving structural information, such as distances between vectors in the set. This paper proposes a novel use of row-product random matrices in random projection, where we call it Tensor Random Projection (TRP). It requires substantially less memory than existing dimension reduction maps. The TRP map is formed as the Khatri-Rao p… ▽ More

    Submitted 30 April, 2021; originally announced May 2021.

    Comments: In NeurIPS Workshop on Relational Representation Learning (2018)

  20. arXiv:2101.00323  [pdf, other

    stat.ML cs.LG math.OC

    TenIPS: Inverse Propensity Sampling for Tensor Completion

    Authors: Chengrun Yang, Lijun Ding, Ziyang Wu, Madeleine Udell

    Abstract: Tensors are widely used to represent multiway arrays of data. The recovery of missing entries in a tensor has been extensively studied, generally under the assumption that entries are missing completely at random (MCAR). However, in most practical settings, observations are missing not at random (MNAR): the probability that a given entry is observed (also called the propensity) may depend on other… ▽ More

    Submitted 22 April, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: AISTATS 2021

  21. arXiv:2012.00183  [pdf, other

    math.OC

    A Strict Complementarity Approach to Error Bound and Sensitivity of Solution of Conic Programs

    Authors: Lijun Ding, Madeleine Udell

    Abstract: In this paper, we provide an elementary, geometric, and unified framework to analyze conic programs that we call the strict complementarity approach. This framework allows us to establish error bounds and quantify the sensitivity of the solution. The framework uses three classical ideas from convex geometry and linear algebra: linear regularity of convex sets, facial reduction, and orthogonal deco… ▽ More

    Submitted 16 September, 2022; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: 23 pages, 2 figures. In this revision, we added an approach based on conic decomposition. See Section D for details

  22. arXiv:2006.16142  [pdf, other

    math.OC cs.LG

    $k$FW: A Frank-Wolfe style algorithm with stronger subproblem oracles

    Authors: Lijun Ding, Jicong Fan, Madeleine Udell

    Abstract: This paper proposes a new variant of Frank-Wolfe (FW), called $k$FW. Standard FW suffers from slow convergence: iterates often zig-zag as update directions oscillate around extreme points of the constraint set. The new variant, $k$FW, overcomes this problem by using two stronger subproblem oracles in each iteration. The first is a $k$ linear optimization oracle ($k$LOO) that computes the $k$ best… ▽ More

    Submitted 15 November, 2021; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: 20 pages main text, 10 figures

  23. arXiv:2002.10673  [pdf, other

    math.OC cs.LG stat.ML

    On the simplicity and conditioning of low rank semidefinite programs

    Authors: Lijun Ding, Madeleine Udell

    Abstract: Low rank matrix recovery problems appear widely in statistics, combinatorics, and imaging. One celebrated method for solving these problems is to formulate and solve a semidefinite program (SDP). It is often known that the exact solution to the SDP with perfect data recovers the solution to the original low rank matrix recovery problem. It is more challenging to show that an approximate solution t… ▽ More

    Submitted 22 July, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: 24 pages, 1 figure, and 1 table

  24. arXiv:1912.02949  [pdf, other

    math.OC math.CO

    Scalable Semidefinite Programming

    Authors: Alp Yurtsever, Joel A. Tropp, Olivier Fercoq, Madeleine Udell, Volkan Cevher

    Abstract: Semidefinite programming (SDP) is a powerful framework from convex optimization that has striking potential for data science applications. This paper develops a provably correct randomized algorithm for solving large, weakly constrained SDP problems by economizing on the storage and arithmetic costs. Numerical evidence shows that the method is effective for a range of applications, including relax… ▽ More

    Submitted 25 March, 2021; v1 submitted 5 December, 2019; originally announced December 2019.

    MSC Class: 90C22; 65K05 (Primary); 65F99 (Secondary)

    Journal ref: SIAM Journal on Mathematics of Data Science, vol. 3, num. 1, pp. 171-200, Feb. 2021

  25. arXiv:1904.10951  [pdf, other

    math.NA cs.LG

    Low-Rank Tucker Approximation of a Tensor From Streaming Data

    Authors: Yiming Sun, Yang Guo, Charlene Luo, Joel Tropp, Madeleine Udell

    Abstract: This paper describes a new algorithm for computing a low-Tucker-rank approximation of a tensor. The method applies a randomized linear map to the tensor to obtain a sketch that captures the important directions within each mode, as well as the interactions among the modes. The sketch can be extracted from streaming or distributed data or with a single pass over the tensor, and it uses storage prop… ▽ More

    Submitted 30 April, 2021; v1 submitted 24 April, 2019; originally announced April 2019.

    Comments: Appendix includes supplement from published version

    Journal ref: SIAM Journal on Mathematics of Data Science 2.4 (2020): 1123-1150

  26. arXiv:1902.08651  [pdf, other

    math.NA

    Streaming Low-Rank Matrix Approximation with an Application to Scientific Simulation

    Authors: Joel A. Tropp, Alp Yurtsever, Madeleine Udell, Volkan Cevher

    Abstract: This paper argues that randomized linear sketching is a natural tool for on-the-fly compression of data matrices that arise from large-scale scientific simulations and data collection. The technical contribution consists in a new algorithm for constructing an accurate low-rank approximation of a matrix from streaming data. This method is accompanied by an a priori analysis that allows the user to… ▽ More

    Submitted 22 February, 2019; originally announced February 2019.

    Comments: 70 pages, 33 figures

    MSC Class: 65F30; 68W20

  27. arXiv:1902.03373  [pdf, other

    math.OC cs.LG

    An Optimal-Storage Approach to Semidefinite Programming using Approximate Complementarity

    Authors: Lijun Ding, Alp Yurtsever, Volkan Cevher, Joel A. Tropp, Madeleine Udell

    Abstract: This paper develops a new storage-optimal algorithm that provably solves generic semidefinite programs (SDPs) in standard form. This method is particularly effective for weakly constrained SDPs. The key idea is to formulate an approximate complementarity principle: Given an approximate solution to the dual SDP, the primal SDP has an approximate solution whose range is contained in the eigenspace w… ▽ More

    Submitted 17 June, 2020; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: 35 pages, 24 pages of main text, 17 figures

  28. arXiv:1808.05274  [pdf, other

    math.OC stat.ML

    Frank-Wolfe Style Algorithms for Large Scale Optimization

    Authors: Lijun Ding, Madeleine Udell

    Abstract: We introduce a few variants on Frank-Wolfe style algorithms suitable for large scale optimization. We show how to modify the standard Frank-Wolfe algorithm using stochastic gradients, approximate subproblem solutions, and sketched decision variables in order to scale to enormous problems while preserving (up to constants) the optimal convergence rate $\mathcal{O}(\frac{1}{k})$.

    Submitted 15 August, 2018; originally announced August 2018.

    Comments: 28 pages, 5 figures, a chapter of the book "Large-Scale and Distributed Optimization", Springer's Lecture Notes in Mathematics Series, volume 2227, https://www.springer.com/us/book/9783319974774

  29. arXiv:1807.07531  [pdf, other

    math.OC

    Limited Memory Kelley's Method Converges for Composite Convex and Submodular Objectives

    Authors: Song Zhou, Swati Gupta, Madeleine Udell

    Abstract: The original simplicial method (OSM), a variant of the classic Kelley's cutting plane method, has been shown to converge to the minimizer of a composite convex and submodular objective, though no rate of convergence for this method was known. Moreover, OSM is required to solve subproblems in each iteration whose size grows linearly in the number of iterations. We propose a limited memory version o… ▽ More

    Submitted 17 December, 2018; v1 submitted 19 July, 2018; originally announced July 2018.

    Comments: 15 pages, 3 figures with 12 sub-figures

    MSC Class: 90C25; 90C27; 90C30

  30. arXiv:1706.05736  [pdf, other

    math.NA cs.DS stat.ML

    Fixed-Rank Approximation of a Positive-Semidefinite Matrix from Streaming Data

    Authors: Joel A. Tropp, Alp Yurtsever, Madeleine Udell, Volkan Cevher

    Abstract: Several important applications, such as streaming PCA and semidefinite programming, involve a large-scale positive-semidefinite (psd) matrix that is presented as a sequence of linear updates. Because of storage limitations, it may only be possible to retain a sketch of the psd matrix. This paper develops a new algorithm for fixed-rank psd approximation from a sketch. The approach combines the Nyst… ▽ More

    Submitted 18 June, 2017; originally announced June 2017.

  31. arXiv:1702.06838  [pdf, other

    math.OC stat.ML

    Sketchy Decisions: Convex Low-Rank Matrix Optimization with Optimal Storage

    Authors: Alp Yurtsever, Madeleine Udell, Joel A. Tropp, Volkan Cevher

    Abstract: This paper concerns a fundamental class of convex matrix optimization problems. It presents the first algorithm that uses optimal storage and provably computes a low-rank approximation of a solution. In particular, when all solutions have low rank, the algorithm converges to a solution. This algorithm, SketchyCGM, modifies a standard convex optimization scheme, the conditional gradient method, to… ▽ More

    Submitted 22 February, 2017; originally announced February 2017.

  32. arXiv:1610.05604  [pdf, other

    stat.ML math.OC stat.ME

    Dynamic Assortment Personalization in High Dimensions

    Authors: Nathan Kallus, Madeleine Udell

    Abstract: We study the problem of dynamic assortment personalization with large, heterogeneous populations and wide arrays of products, and demonstrate the importance of structural priors for effective, efficient large-scale personalization. Assortment personalization is the problem of choosing, for each individual (type), a best assortment of products, ads, or other offerings (items) so as to maximize reve… ▽ More

    Submitted 2 May, 2019; v1 submitted 18 October, 2016; originally announced October 2016.

  33. arXiv:1609.03285  [pdf, other

    math.OC

    Disciplined Multi-Convex Programming

    Authors: Xinyue Shen, Steven Diamond, Madeleine Udell, Yuantao Gu, Stephen Boyd

    Abstract: A multi-convex optimization problem is one in which the variables can be partitioned into sets over which the problem is convex when the other variables are fixed. Multi-convex problems are generally solved approximately using variations on alternating or cyclic minimization. Multi-convex problems arise in many applications, such as nonnegative matrix factorization, generalized low rank models, an… ▽ More

    Submitted 8 October, 2016; v1 submitted 12 September, 2016; originally announced September 2016.

  34. arXiv:1609.00048  [pdf, other

    math.NA cs.DS stat.CO stat.ML

    Practical sketching algorithms for low-rank matrix approximation

    Authors: Joel A. Tropp, Alp Yurtsever, Madeleine Udell, Volkan Cevher

    Abstract: This paper describes a suite of algorithms for constructing low-rank approximations of an input matrix from a random linear image of the matrix, called a sketch. These methods can preserve structural properties of the input matrix, such as positive-semidefiniteness, and they can produce approximations with a user-specified rank. The algorithms are simple, accurate, numerically stable, and provably… ▽ More

    Submitted 2 January, 2018; v1 submitted 31 August, 2016; originally announced September 2016.

    MSC Class: Primary 65F30; Secondary 68W20

    Journal ref: SIAM J. Matrix Analysis and Applications, Vol. 38, num. 4, pp. 1454-1485, Dec. 2017

  35. arXiv:1606.02338  [pdf, other

    math.OC

    The Sound of APALM Clapping: Faster Nonsmooth Nonconvex Optimization with Stochastic Asynchronous PALM

    Authors: Damek Davis, Brent Edmunds, Madeleine Udell

    Abstract: We introduce the Stochastic Asynchronous Proximal Alternating Linearized Minimization (SAPALM) method, a block coordinate stochastic proximal-gradient method for solving nonconvex, nonsmooth optimization problems. SAPALM is the first asynchronous parallel optimization method that provably converges on a large class of nonconvex, nonsmooth problems. We prove that SAPALM matches the best known rates… ▽ More

    Submitted 7 June, 2016; originally announced June 2016.

    MSC Class: 90C15; 65K05; 90C26; 90C30

  36. arXiv:1509.05113  [pdf, other

    stat.ML cs.LG math.OC

    Revealed Preference at Scale: Learning Personalized Preferences from Assortment Choices

    Authors: Nathan Kallus, Madeleine Udell

    Abstract: We consider the problem of learning the preferences of a heterogeneous population by observing choices from an assortment of products, ads, or other offerings. Our observation model takes a form common in assortment planning applications: each arriving customer is offered an assortment consisting of a subset of all possible offerings; we observe only the assortment and the customer's single choice… ▽ More

    Submitted 7 June, 2016; v1 submitted 16 September, 2015; originally announced September 2015.

  37. arXiv:1410.4821  [pdf, ps, other

    math.OC cs.MS stat.ML

    Convex Optimization in Julia

    Authors: Madeleine Udell, Karanveer Mohan, David Zeng, Jenny Hong, Steven Diamond, Stephen Boyd

    Abstract: This paper describes Convex, a convex optimization modeling framework in Julia. Convex translates problems from a user-friendly functional language into an abstract syntax tree describing the problem. This concise representation of the global structure of the problem allows Convex to infer whether the problem complies with the rules of disciplined convex programming (DCP), and to pass the problem… ▽ More

    Submitted 17 October, 2014; originally announced October 2014.

    Comments: To appear in Proceedings of the Workshop on High Performance Technical Computing in Dynamic Languages (HPTCDL) 2014

  38. arXiv:1410.4158  [pdf, ps, other

    math.OC

    Bounding Duality Gap for Separable Problems with Linear Constraints

    Authors: Madeleine Udell, Stephen Boyd

    Abstract: We consider the problem of minimizing a sum of non-convex functions over a compact domain, subject to linear inequality and equality constraints. Approximate solutions can be found by solving a convexified version of the problem, in which each function in the objective is replaced by its convex envelope. We propose a randomized algorithm to solve the convexified problem which finds an $ε$-suboptim… ▽ More

    Submitted 8 January, 2016; v1 submitted 15 October, 2014; originally announced October 2014.

  39. arXiv:1410.0342  [pdf, other

    stat.ML cs.LG math.OC

    Generalized Low Rank Models

    Authors: Madeleine Udell, Corinne Horn, Reza Zadeh, Stephen Boyd

    Abstract: Principal components analysis (PCA) is a well-known technique for approximating a tabular data set by a low rank matrix. Here, we extend the idea of PCA to handle arbitrary data sets consisting of numerical, Boolean, categorical, ordinal, and other data types. This framework encompasses many well known techniques in data analysis, such as nonnegative matrix factorization, matrix completion, sparse… ▽ More

    Submitted 5 May, 2015; v1 submitted 1 October, 2014; originally announced October 2014.

    Comments: 84 pages, 19 figures