Skip to main content

Showing 1–44 of 44 results for author: Udell, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.23081  [pdf, ps, other

    math.OC cs.LG stat.ML

    Gradient Methods with Online Scaling Part I. Theoretical Foundations

    Authors: Wenzhi Gao, Ya-Chi Chu, Yinyu Ye, Madeleine Udell

    Abstract: This paper establishes the theoretical foundations of the online scaled gradient methods (OSGM), a framework that utilizes online learning to adapt stepsizes and provably accelerate first-order methods. OSGM quantifies the effectiveness of a stepsize by a feedback function motivated from a convergence measure and uses the feedback to adjust the stepsize through an online learning algorithm. Conseq… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Extension of arXiv:2411.01803 and arXiv:2502.11229

  2. arXiv:2505.13723  [pdf, ps, other

    cs.LG math.OC stat.ML

    Turbocharging Gaussian Process Inference with Approximate Sketch-and-Project

    Authors: Pratik Rathore, Zachary Frangella, Sachin Garg, Shaghayegh Fazliani, Michał Dereziński, Madeleine Udell

    Abstract: Gaussian processes (GPs) play an essential role in biostatistics, scientific machine learning, and Bayesian optimization for their ability to provide probabilistic predictions and model uncertainty. However, GP inference struggles to scale to large datasets (which are common in modern applications), since it requires the solution of a linear system whose size scales quadratically with the number o… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 28 pages, 6 figures, 2 tables

  3. arXiv:2503.07642  [pdf, other

    cs.LG stat.ML

    dnamite: A Python Package for Neural Additive Models

    Authors: Mike Van Ness, Madeleine Udell

    Abstract: Additive models offer accurate and interpretable predictions for tabular data, a critical tool for statistical modeling. Recent advances in Neural Additive Models (NAMs) allow these models to handle complex machine learning tasks, including feature selection and survival analysis, on large-scale data. This paper introduces dnamite, a Python package that implements NAMs for these advanced applicati… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

  4. arXiv:2501.15941  [pdf, other

    stat.ML cs.LG

    SAPPHIRE: Preconditioned Stochastic Variance Reduction for Faster Large-Scale Statistical Learning

    Authors: Jingruo Sun, Zachary Frangella, Madeleine Udell

    Abstract: Regularized empirical risk minimization (rERM) has become important in data-intensive fields such as genomics and advertising, with stochastic gradient methods typically used to solve the largest problems. However, ill-conditioned objectives and non-smooth regularizers undermine the performance of traditional stochastic gradient methods, leading to slow convergence and significant computational co… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  5. arXiv:2411.05923  [pdf, other

    cs.LG stat.ML

    DNAMite: Interpretable Calibrated Survival Analysis with Discretized Additive Models

    Authors: Mike Van Ness, Billy Block, Madeleine Udell

    Abstract: Survival analysis is a classic problem in statistics with important applications in healthcare. Most machine learning models for survival analysis are black-box models, limiting their use in healthcare settings where interpretability is paramount. More recently, glass-box machine learning models have been introduced for survival analysis, with both strong predictive performance and interpretabilit… ▽ More

    Submitted 8 November, 2024; originally announced November 2024.

  6. arXiv:2407.10070  [pdf, other

    cs.LG math.OC stat.ML

    Have ASkotch: A Neat Solution for Large-scale Kernel Ridge Regression

    Authors: Pratik Rathore, Zachary Frangella, Jiaming Yang, Michał Dereziński, Madeleine Udell

    Abstract: Kernel ridge regression (KRR) is a fundamental computational tool, appearing in problems that range from computational chemistry to health analytics, with a particular interest due to its starring role in Gaussian process regression. However, full KRR solvers are challenging to scale to large datasets: both direct (i.e., Cholesky decomposition) and iterative methods (i.e., PCG) incur prohibitive c… ▽ More

    Submitted 21 February, 2025; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: 64 pages (including appendices), 16 figures, 5 tables

    MSC Class: 65F10; 68W20; 90C06

  7. arXiv:2404.14689  [pdf, other

    cs.LG stat.ML

    Interpretable Prediction and Feature Selection for Survival Analysis

    Authors: Mike Van Ness, Madeleine Udell

    Abstract: Survival analysis is widely used as a technique to model time-to-event data when some data is censored, particularly in healthcare for predicting future patient risk. In such settings, survival models must be both accurate and interpretable so that users (such as doctors) can trust the model and understand model predictions. While most literature focuses on discrimination, interpretability is equa… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  8. arXiv:2402.01868  [pdf, other

    cs.LG math.OC stat.ML

    Challenges in Training PINNs: A Loss Landscape Perspective

    Authors: Pratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu, Madeleine Udell

    Abstract: This paper explores challenges in training Physics-Informed Neural Networks (PINNs), emphasizing the role of the loss landscape in the training process. We examine difficulties in minimizing the PINN loss function, particularly due to ill-conditioning caused by differential operators in the residual term. We compare gradient-based optimizers Adam, L-BFGS, and their combination Adam+L-BFGS, showing… ▽ More

    Submitted 3 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: ICML 2024 Oral; 33 pages (including appendices), 10 figures, 3 tables

  9. arXiv:2310.15472  [pdf, other

    cs.LG stat.ML

    Interpretable Survival Analysis for Heart Failure Risk Prediction

    Authors: Mike Van Ness, Tomas Bosschieter, Natasha Din, Andrew Ambrosy, Alexander Sandhu, Madeleine Udell

    Abstract: Survival analysis, or time-to-event analysis, is an important and widespread problem in healthcare research. Medical research has traditionally relied on Cox models for survival analysis, due to their simplicity and interpretability. Cox models assume a log-linear hazard function as well as proportional hazards over time, and can perform poorly when these assumptions fail. Newer survival models ba… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  10. arXiv:2211.09259  [pdf, other

    cs.LG cs.AI stat.ML

    The Missing Indicator Method: From Low to High Dimensions

    Authors: Mike Van Ness, Tomas M. Bosschieter, Roberto Halpin-Gregorio, Madeleine Udell

    Abstract: Missing data is common in applied data science, particularly for tabular data sets found in healthcare, social sciences, and natural sciences. Most supervised learning methods only work on complete data, thus requiring preprocessing such as missing value imputation to work on incomplete data sets. However, imputation alone does not encode useful information about the missing values themselves. For… ▽ More

    Submitted 3 February, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Journal ref: SIGKDD 2023 Proceedings p5004 5015

  11. arXiv:2210.06673  [pdf, other

    stat.ME

    Probabilistic Missing Value Imputation for Mixed Categorical and Ordered Data

    Authors: Yuxuan Zhao, Alex Townsend, Madeleine Udell

    Abstract: Many real-world datasets contain missing entries and mixed data types including categorical and ordered (e.g. continuous and ordinal) variables. Imputing the missing entries is necessary, since many data analysis pipelines require complete data, but this is challenging especially for mixed data. This paper proposes a probabilistic imputation method using an extended Gaussian copula model that supp… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted by NeurIPS 2022

  12. arXiv:2207.03935  [pdf, other

    stat.ML cs.LG

    ControlBurn: Nonlinear Feature Selection with Sparse Tree Ensembles

    Authors: Brian Liu, Miaolan Xie, Haoyue Yang, Madeleine Udell

    Abstract: ControlBurn is a Python package to construct feature-sparse tree ensembles that support nonlinear feature selection and interpretable machine learning. The algorithms in this package first build large tree ensembles that prioritize basis functions with few features and then select a feature-sparse subset of these basis functions using a weighted lasso optimization criterion. The package includes v… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: 22 pages

  13. arXiv:2204.07615  [pdf, other

    cs.LG stat.ML

    TabNAS: Rejection Sampling for Neural Architecture Search on Tabular Datasets

    Authors: Chengrun Yang, Gabriel Bender, Hanxiao Liu, Pieter-Jan Kindermans, Madeleine Udell, Yifeng Lu, Quoc Le, Da Huang

    Abstract: The best neural architecture for a given machine learning problem depends on many factors: not only the complexity and structure of the dataset, but also on resource constraints including latency, compute, energy consumption, etc. Neural architecture search (NAS) for tabular datasets is an important but under-explored problem. Previous NAS algorithms designed for image search spaces incorporate re… ▽ More

    Submitted 20 October, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: NeurIPS 2022; 30 pages, 15 figures, 7 tables

  14. arXiv:2203.05089  [pdf, other

    stat.ME

    gcimpute: A Package for Missing Data Imputation

    Authors: Yuxuan Zhao, Madeleine Udell

    Abstract: This article introduces the Python package gcimpute for missing data imputation. gcimpute can impute missing data with many different variable types, including continuous, binary, ordinal, count, and truncated values, by modeling data as samples from a Gaussian copula model. This semiparametric model learns the marginal distribution of each variable to match the empirical distribution, yet describ… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

  15. arXiv:2201.03668  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Towards Group Robustness in the presence of Partial Group Labels

    Authors: Vishnu Suresh Lokhande, Kihyuk Sohn, Jinsung Yoon, Madeleine Udell, Chen-Yu Lee, Tomas Pfister

    Abstract: Learning invariant representations is an important requirement when training machine learning models that are driven by spurious correlations in the datasets. These spurious correlations, between input samples and the target labels, wrongly direct the neural network predictions resulting in poor performance on certain groups, especially the minority groups. Robust training against these spurious c… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

  16. arXiv:2107.09194  [pdf, other

    stat.ML cs.LG stat.ME

    Can we globally optimize cross-validation loss? Quasiconvexity in ridge regression

    Authors: William T. Stephenson, Zachary Frangella, Madeleine Udell, Tamara Broderick

    Abstract: Models like LASSO and ridge regression are extensively used in practice due to their interpretability, ease of use, and strong theoretical guarantees. Cross-validation (CV) is widely used for hyperparameter tuning in these models, but do practical optimization methods minimize the true out-of-sample loss? A recent line of research promises to show that the optimum of the CV loss matches the optimu… ▽ More

    Submitted 1 November, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: Published in NeurIPS 2021

  17. ControlBurn: Feature Selection by Sparse Forests

    Authors: Brian Liu, Miaolan Xie, Madeleine Udell

    Abstract: Tree ensembles distribute feature importance evenly amongst groups of correlated features. The average feature ranking of the correlated group is suppressed, which reduces interpretability and complicates feature selection. In this paper we present ControlBurn, a feature selection algorithm that uses a weighted LASSO-based feature selection method to prune unnecessary features from tree ensembles,… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 15 pages

  18. arXiv:2101.00323  [pdf, other

    stat.ML cs.LG math.OC

    TenIPS: Inverse Propensity Sampling for Tensor Completion

    Authors: Chengrun Yang, Lijun Ding, Ziyang Wu, Madeleine Udell

    Abstract: Tensors are widely used to represent multiway arrays of data. The recovery of missing entries in a tensor has been extensively studied, generally under the assumption that entries are missing completely at random (MCAR). However, in most practical settings, observations are missing not at random (MNAR): the probability that a given entry is observed (also called the propensity) may depend on other… ▽ More

    Submitted 22 April, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: AISTATS 2021

  19. arXiv:2009.12326  [pdf, other

    cs.LG stat.ML

    Online Missing Value Imputation and Change Point Detection with the Gaussian Copula

    Authors: Yuxuan Zhao, Eric Landgrebe, Eliot Shekhtman, Madeleine Udell

    Abstract: Missing value imputation is crucial for real-world data science workflows. Imputation is harder in the online setting, as it requires the imputation method itself to be able to evolve over time. For practical applications, imputation algorithms should produce imputations that match the true data distribution, handle data of mixed types, including ordinal, boolean, and continuous variables, and sca… ▽ More

    Submitted 15 December, 2021; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: Accepted by AAAI 2022

  20. arXiv:2008.10547  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Approximate Cross-Validation with Low-Rank Data in High Dimensions

    Authors: William T. Stephenson, Madeleine Udell, Tamara Broderick

    Abstract: Many recent advances in machine learning are driven by a challenging trifecta: large data size $N$; high dimensions; and expensive algorithms. In this setting, cross-validation (CV) serves as an important tool for model assessment. Recent advances in approximate cross validation (ACV) provide accurate approximations to CV with only a single model fit, avoiding traditional CV's requirement for repe… ▽ More

    Submitted 1 November, 2022; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: Published in NeurIPS 2020

    Journal ref: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  21. arXiv:2006.10829  [pdf, other

    stat.ML cs.LG

    Matrix Completion with Quantified Uncertainty through Low Rank Gaussian Copula

    Authors: Yuxuan Zhao, Madeleine Udell

    Abstract: Modern large scale datasets are often plagued with missing entries. For tabular data with missing values, a flurry of imputation algorithms solve for a complete matrix which minimizes some penalized reconstruction error. However, almost none of them can estimate the uncertainty of its imputations. This paper proposes a probabilistic and scalable framework for missing value imputation with quantifi… ▽ More

    Submitted 18 January, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Accepted by NeurIPS 2020

  22. arXiv:2006.04216  [pdf, other

    cs.LG cs.AI stat.ML

    Efficient AutoML Pipeline Search with Matrix and Tensor Factorization

    Authors: Chengrun Yang, Jicong Fan, Ziyang Wu, Madeleine Udell

    Abstract: Data scientists seeking a good supervised learning model on a new dataset have many choices to make: they must preprocess the data, select features, possibly reduce the dimension, select an estimation algorithm, and choose hyperparameters for each of these pipeline components. With new pipeline components comes a combinatorial explosion in the number of choices! In this work, we design a new AutoM… ▽ More

    Submitted 7 June, 2020; originally announced June 2020.

    Comments: This is an extended version of AutoML Pipeline Selection: Efficiently Navigating the Combinatorial Space (DOI: 10.1145/3394486.3403197) at KDD 2020

  23. arXiv:2006.03750  [pdf, other

    cs.LG stat.ML

    Learning to Solve Combinatorial Optimization Problems on Real-World Graphs in Linear Time

    Authors: Iddo Drori, Anant Kharkar, William R. Sickinger, Brandon Kates, Qiang Ma, Suwen Ge, Eden Dolev, Brenda Dietrich, David P. Williamson, Madeleine Udell

    Abstract: Combinatorial optimization algorithms for graph problems are usually designed afresh for each new problem with careful attention by an expert to the problem structure. In this work, we develop a new framework to solve any combinatorial optimization problem over graphs that can be formulated as a single player game defined by states, actions, and rewards, including minimum spanning tree, shortest p… ▽ More

    Submitted 11 June, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

  24. Robust Non-Linear Matrix Factorization for Dictionary Learning, Denoising, and Clustering

    Authors: Jicong Fan, Chengrun Yang, Madeleine Udell

    Abstract: Low dimensional nonlinear structure abounds in datasets across computer vision and machine learning. Kernelized matrix factorization techniques have recently been proposed to learn these nonlinear structures for denoising, classification, dictionary learning, and missing data imputation, by observing that the image of the matrix in a sufficiently large feature space is low-rank. However, these non… ▽ More

    Submitted 2 December, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Journal ref: IEEE Transactions on Signal Processing 69, 1755-1770 (2021)

  25. arXiv:2002.10673  [pdf, other

    math.OC cs.LG stat.ML

    On the simplicity and conditioning of low rank semidefinite programs

    Authors: Lijun Ding, Madeleine Udell

    Abstract: Low rank matrix recovery problems appear widely in statistics, combinatorics, and imaging. One celebrated method for solving these problems is to formulate and solve a semidefinite program (SDP). It is often known that the exact solution to the SDP with perfect data recovers the solution to the original low rank matrix recovery problem. It is more challenging to show that an approximate solution t… ▽ More

    Submitted 22 July, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: 24 pages, 1 figure, and 1 table

  26. arXiv:2002.08934  [pdf, ps, other

    cs.LG stat.ML

    Online high rank matrix completion

    Authors: Jicong Fan, Madeleine Udell

    Abstract: Recent advances in matrix completion enable data imputation in full-rank matrices by exploiting low dimensional (nonlinear) latent structure. In this paper, we develop a new model for high rank matrix completion (HRMC), together with batch and online methods to fit the model and out-of-sample extension to complete new data. The method works by (implicitly) mapping the data into a high dimensional… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: The paper was published by the proceedings of IEEE CVPR 2019

  27. arXiv:1912.06989  [pdf, ps, other

    cs.LG stat.ML

    Polynomial Matrix Completion for Missing Data Imputation and Transductive Learning

    Authors: Jicong Fan, Yuqian Zhang, Madeleine Udell

    Abstract: This paper develops new methods to recover the missing entries of a high-rank or even full-rank matrix when the intrinsic dimension of the data is low compared to the ambient dimension. Specifically, we assume that the columns of a matrix are generated by polynomials acting on a low-dimensional intrinsic variable, and wish to recover the missing entries under this assumption. We show that we can i… ▽ More

    Submitted 15 December, 2019; originally announced December 2019.

    Comments: Accepted by AAAI 2020. The supplementary material is at https://github.com/jicongfan/Supplementary-material-of-conference-papers/blob/master/supp_PMC_AAAI2020.pdf

  28. arXiv:1911.05774  [pdf, ps, other

    cs.LG stat.ML

    Factor Group-Sparse Regularization for Efficient Low-Rank Matrix Recovery

    Authors: Jicong Fan, Lijun Ding, Yudong Chen, Madeleine Udell

    Abstract: This paper develops a new class of nonconvex regularizers for low-rank matrix recovery. Many regularizers are motivated as convex relaxations of the matrix rank function. Our new factor group-sparse regularizers are motivated as a relaxation of the number of nonzero columns in a factorization of the matrix. These nonconvex regularizers are sharper than the nuclear norm; indeed, we show they are re… ▽ More

    Submitted 18 November, 2019; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: Accepted by NeurIPS 2019. The supplementary material is at https://github.com/jicongfan/Supplementary-material-of-conference-papers

  29. arXiv:1910.12845  [pdf, other

    stat.ME

    Missing Value Imputation for Mixed Data via Gaussian Copula

    Authors: Yuxuan Zhao, Madeleine Udell

    Abstract: Missing data imputation forms the first critical step of many data analysis pipelines. The challenge is greatest for mixed data sets, including real, Boolean, and ordinal data, where standard techniques for imputation fail basic sanity checks: for example, the imputed values may not follow the same distributions as the data. This paper proposes a new semiparametric algorithm to impute missing valu… ▽ More

    Submitted 15 June, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: Accepted by KDD 2020

  30. arXiv:1910.03698  [pdf, other

    cs.LG cs.CL stat.ML

    AutoML using Metadata Language Embeddings

    Authors: Iddo Drori, Lu Liu, Yi Nian, Sharath C. Koorathota, Jie S. Li, Antonio Khalil Moretti, Juliana Freire, Madeleine Udell

    Abstract: As a human choosing a supervised learning algorithm, it is natural to begin by reading a text description of the dataset and documentation for the algorithms you might use. We demonstrate that the same idea improves the performance of automated machine learning methods. We use language embeddings from modern NLP to improve state-of-the-art AutoML systems by augmenting their recommendations with ve… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Journal ref: NeurIPS Workshop on Meta-Learning, 2019

  31. arXiv:1904.12991  [pdf, other

    cs.LG cs.AI stat.ML

    "Why Should You Trust My Explanation?" Understanding Uncertainty in LIME Explanations

    Authors: Yujia Zhang, Kuangyan Song, Yiming Sun, Sarah Tan, Madeleine Udell

    Abstract: Methods for interpreting machine learning black-box models increase the outcomes' transparency and in turn generates insight into the reliability and fairness of the algorithms. However, the interpretations themselves could contain significant uncertainty that undermines the trust in the outcomes and raises concern about the model's reliability. Focusing on the method "Local Interpretable Model-ag… ▽ More

    Submitted 4 June, 2019; v1 submitted 29 April, 2019; originally announced April 2019.

  32. arXiv:1904.03257  [pdf, ps, other

    cs.LG cs.DB cs.DC cs.SE stat.ML

    MLSys: The New Frontier of Machine Learning Systems

    Authors: Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Jennifer Chayes, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim Hazelwood , et al. (44 additional authors not shown)

    Abstract: Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a ne… ▽ More

    Submitted 1 December, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

  33. Fairness Under Unawareness: Assessing Disparity When Protected Class Is Unobserved

    Authors: Jiahao Chen, Nathan Kallus, Xiaojie Mao, Geoffry Svacha, Madeleine Udell

    Abstract: Assessing the fairness of a decision making system with respect to a protected class, such as gender or race, is challenging when class membership labels are unavailable. Probabilistic models for predicting the protected class based on observable proxies, such as surname and geolocation for race, are sometimes used to impute these missing labels for compliance assessments. Empirically, these metho… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: 13 pages, 11 figures, FAT*' 19: Conference on Fairness, Accountability, and Transparency (FAT*' 19), January 29-31, 2019, Atlanta, GA, USA

  34. arXiv:1808.05274  [pdf, other

    math.OC stat.ML

    Frank-Wolfe Style Algorithms for Large Scale Optimization

    Authors: Lijun Ding, Madeleine Udell

    Abstract: We introduce a few variants on Frank-Wolfe style algorithms suitable for large scale optimization. We show how to modify the standard Frank-Wolfe algorithm using stochastic gradients, approximate subproblem solutions, and sketched decision variables in order to scale to enormous problems while preserving (up to constants) the optimal convergence rate $\mathcal{O}(\frac{1}{k})$.

    Submitted 15 August, 2018; originally announced August 2018.

    Comments: 28 pages, 5 figures, a chapter of the book "Large-Scale and Distributed Optimization", Springer's Lecture Notes in Mathematics Series, volume 2227, https://www.springer.com/us/book/9783319974774

  35. arXiv:1808.03233  [pdf, other

    cs.LG cs.AI stat.ML

    OBOE: Collaborative Filtering for AutoML Model Selection

    Authors: Chengrun Yang, Yuji Akimoto, Dae Won Kim, Madeleine Udell

    Abstract: Algorithm selection and hyperparameter tuning remain two of the most challenging tasks in machine learning. Automated machine learning (AutoML) seeks to automate these tasks to enable widespread use of machine learning by non-experts. This paper introduces OBOE, a collaborative filtering method for time-constrained model selection and hyperparameter tuning. OBOE forms a matrix of the cross-validat… ▽ More

    Submitted 20 May, 2019; v1 submitted 9 August, 2018; originally announced August 2018.

  36. arXiv:1806.00811  [pdf, other

    stat.ML cs.LG

    Causal Inference with Noisy and Missing Covariates via Matrix Factorization

    Authors: Nathan Kallus, Xiaojie Mao, Madeleine Udell

    Abstract: Valid causal inference in observational studies often requires controlling for confounders. However, in practice measurements of confounders may be noisy, and can lead to biased estimates of causal effects. We show that we can reduce the bias caused by measurement noise using a large number of noisy measurements of the underlying confounders. We propose the use of matrix factorization to infer the… ▽ More

    Submitted 3 June, 2018; originally announced June 2018.

    Comments: 26 pages, 5 figures

  37. arXiv:1706.05736  [pdf, other

    math.NA cs.DS stat.ML

    Fixed-Rank Approximation of a Positive-Semidefinite Matrix from Streaming Data

    Authors: Joel A. Tropp, Alp Yurtsever, Madeleine Udell, Volkan Cevher

    Abstract: Several important applications, such as streaming PCA and semidefinite programming, involve a large-scale positive-semidefinite (psd) matrix that is presented as a sequence of linear updates. Because of storage limitations, it may only be possible to retain a sketch of the psd matrix. This paper develops a new algorithm for fixed-rank psd approximation from a sketch. The approach combines the Nyst… ▽ More

    Submitted 18 June, 2017; originally announced June 2017.

  38. arXiv:1705.07474  [pdf, other

    cs.LG stat.ML

    Why are Big Data Matrices Approximately Low Rank?

    Authors: Madeleine Udell, Alex Townsend

    Abstract: Matrices of (approximate) low rank are pervasive in data science, appearing in recommender systems, movie preferences, topic models, medical records, and genomics. While there is a vast literature on how to exploit low rank structure in these datasets, there is less attention on explaining why the low rank structure appears in the first place. Here, we explain the effectiveness of low rank models… ▽ More

    Submitted 29 May, 2018; v1 submitted 21 May, 2017; originally announced May 2017.

  39. arXiv:1702.06838  [pdf, other

    math.OC stat.ML

    Sketchy Decisions: Convex Low-Rank Matrix Optimization with Optimal Storage

    Authors: Alp Yurtsever, Madeleine Udell, Joel A. Tropp, Volkan Cevher

    Abstract: This paper concerns a fundamental class of convex matrix optimization problems. It presents the first algorithm that uses optimal storage and provably computes a low-rank approximation of a solution. In particular, when all solutions have low rank, the algorithm converges to a solution. This algorithm, SketchyCGM, modifies a standard convex optimization scheme, the conditional gradient method, to… ▽ More

    Submitted 22 February, 2017; originally announced February 2017.

  40. arXiv:1610.05604  [pdf, other

    stat.ML math.OC stat.ME

    Dynamic Assortment Personalization in High Dimensions

    Authors: Nathan Kallus, Madeleine Udell

    Abstract: We study the problem of dynamic assortment personalization with large, heterogeneous populations and wide arrays of products, and demonstrate the importance of structural priors for effective, efficient large-scale personalization. Assortment personalization is the problem of choosing, for each individual (type), a best assortment of products, ads, or other offerings (items) so as to maximize reve… ▽ More

    Submitted 2 May, 2019; v1 submitted 18 October, 2016; originally announced October 2016.

  41. arXiv:1609.00048  [pdf, other

    math.NA cs.DS stat.CO stat.ML

    Practical sketching algorithms for low-rank matrix approximation

    Authors: Joel A. Tropp, Alp Yurtsever, Madeleine Udell, Volkan Cevher

    Abstract: This paper describes a suite of algorithms for constructing low-rank approximations of an input matrix from a random linear image of the matrix, called a sketch. These methods can preserve structural properties of the input matrix, such as positive-semidefiniteness, and they can produce approximations with a user-specified rank. The algorithms are simple, accurate, numerically stable, and provably… ▽ More

    Submitted 2 January, 2018; v1 submitted 31 August, 2016; originally announced September 2016.

    MSC Class: Primary 65F30; Secondary 68W20

    Journal ref: SIAM J. Matrix Analysis and Applications, Vol. 38, num. 4, pp. 1454-1485, Dec. 2017

  42. arXiv:1509.05113  [pdf, other

    stat.ML cs.LG math.OC

    Revealed Preference at Scale: Learning Personalized Preferences from Assortment Choices

    Authors: Nathan Kallus, Madeleine Udell

    Abstract: We consider the problem of learning the preferences of a heterogeneous population by observing choices from an assortment of products, ads, or other offerings. Our observation model takes a form common in assortment planning applications: each arriving customer is offered an assortment consisting of a subset of all possible offerings; we observe only the assortment and the customer's single choice… ▽ More

    Submitted 7 June, 2016; v1 submitted 16 September, 2015; originally announced September 2015.

  43. arXiv:1410.4821  [pdf, ps, other

    math.OC cs.MS stat.ML

    Convex Optimization in Julia

    Authors: Madeleine Udell, Karanveer Mohan, David Zeng, Jenny Hong, Steven Diamond, Stephen Boyd

    Abstract: This paper describes Convex, a convex optimization modeling framework in Julia. Convex translates problems from a user-friendly functional language into an abstract syntax tree describing the problem. This concise representation of the global structure of the problem allows Convex to infer whether the problem complies with the rules of disciplined convex programming (DCP), and to pass the problem… ▽ More

    Submitted 17 October, 2014; originally announced October 2014.

    Comments: To appear in Proceedings of the Workshop on High Performance Technical Computing in Dynamic Languages (HPTCDL) 2014

  44. arXiv:1410.0342  [pdf, other

    stat.ML cs.LG math.OC

    Generalized Low Rank Models

    Authors: Madeleine Udell, Corinne Horn, Reza Zadeh, Stephen Boyd

    Abstract: Principal components analysis (PCA) is a well-known technique for approximating a tabular data set by a low rank matrix. Here, we extend the idea of PCA to handle arbitrary data sets consisting of numerical, Boolean, categorical, ordinal, and other data types. This framework encompasses many well known techniques in data analysis, such as nonnegative matrix factorization, matrix completion, sparse… ▽ More

    Submitted 5 May, 2015; v1 submitted 1 October, 2014; originally announced October 2014.

    Comments: 84 pages, 19 figures