Skip to main content

Showing 1–17 of 17 results for author: Pauphilet, J

.
  1. arXiv:2501.02942  [pdf, other

    math.OC cs.LG math.PR

    Improved Approximation Algorithms for Low-Rank Problems Using Semidefinite Optimization

    Authors: Ryan Cory-Wright, Jean Pauphilet

    Abstract: Inspired by the impact of the Goemans-Williamson algorithm on combinatorial optimization, we construct an analogous relax-then-sample strategy for low-rank optimization problems. First, for orthogonally constrained quadratic optimization problems, we derive a semidefinite relaxation and a randomized rounding scheme, which obtains provably near-optimal solutions, mimicking the blueprint from Goeman… ▽ More

    Submitted 6 January, 2025; originally announced January 2025.

    Comments: 30 pages, 5 figures, plus references and appendices

  2. arXiv:2402.01543  [pdf, other

    cs.LG stat.ML

    Adaptive Optimization for Prediction with Missing Data

    Authors: Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet

    Abstract: When training predictive models on data with missing entries, the most widely used and versatile approach is a pipeline technique where we first impute missing entries and then compute predictions. In this paper, we view prediction with missing data as a two-stage adaptive optimization problem and propose a new class of models, adaptive linear regression models, where the regression coefficients a… ▽ More

    Submitted 24 February, 2025; v1 submitted 2 February, 2024; originally announced February 2024.

  3. arXiv:2305.15629  [pdf, other

    cs.LG cs.AI

    Patient Outcome Predictions Improve Operations at a Large Hospital Network

    Authors: Liangyuan Na, Kimberly Villalobos Carballo, Jean Pauphilet, Ali Haddad-Sisakht, Daniel Kombert, Melissa Boisjoli-Langlois, Andrew Castiglione, Maram Khalifa, Pooja Hebbal, Barry Stein, Dimitris Bertsimas

    Abstract: Problem definition: Access to accurate predictions of patients' outcomes can enhance medical staff's decision-making, which ultimately benefits all stakeholders in the hospitals. A large hospital network in the US has been collaborating with academics and consultants to predict short-term and long-term outcomes for all inpatients across their seven hospitals. Methodology/results: We develop machin… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 41 pages, 13 figures

  4. arXiv:2305.12292  [pdf, other

    cs.LG math.OC stat.ML

    Disjunctive Branch-And-Bound for Certifiably Optimal Low-Rank Matrix Completion

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Sean Lo, Jean Pauphilet

    Abstract: Low-rank matrix completion consists of computing a matrix of minimal complexity that recovers a given set of observations as accurately as possible. Unfortunately, existing methods for matrix completion are heuristics that, while highly scalable and often identifying high-quality solutions, do not possess any optimality guarantees. We reexamine matrix completion with an optimality-oriented eye. We… ▽ More

    Submitted 7 May, 2025; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: Updated version with new numerics showcasing scalability up to n=2500

  5. A Stochastic Benders Decomposition Scheme for Large-Scale Stochastic Network Design

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet, Periklis Petridis

    Abstract: Network design problems involve constructing edges in a transportation or supply chain network to minimize construction and daily operational costs. We study a stochastic version where operational costs are uncertain due to fluctuating demand and estimated as a sample average from historical data. This problem is computationally challenging, and instances with as few as 100 nodes often cannot be s… ▽ More

    Submitted 29 April, 2024; v1 submitted 14 March, 2023; originally announced March 2023.

    Journal ref: INFORMS Journal on Computing, 2024

  6. arXiv:2209.14790  [pdf, other

    math.OC cs.LG math.ST stat.ML

    Sparse PCA With Multiple Components

    Authors: Ryan Cory-Wright, Jean Pauphilet

    Abstract: Sparse Principal Component Analysis (sPCA) is a cardinal technique for obtaining combinations of features, or principal components (PCs), that explain the variance of high-dimensional datasets in an interpretable manner. This involves solving a sparsity and orthogonality constrained convex maximization problem, which is extremely computationally challenging. Most existing works address sparse PCA… ▽ More

    Submitted 21 March, 2025; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Updated version with improved algorithmics and a new section containing a generalization of the Gershgorin circle theorem; comments or suggestions welcome

  7. The Best Decisions Are Not the Best Advice: Making Adherence-Aware Recommendations

    Authors: Julien Grand-Clément, Jean Pauphilet

    Abstract: Many high-stake decisions follow an expert-in-loop structure in that a human operator receives recommendations from an algorithm but is the ultimate decision maker. Hence, the algorithm's recommendation may differ from the actual decision implemented in practice. However, most algorithmic recommendations are obtained by solving an optimization problem that assumes recommendations will be perfectly… ▽ More

    Submitted 9 December, 2023; v1 submitted 5 September, 2022; originally announced September 2022.

    Journal ref: Management Science, 2024

  8. Robust and Heterogenous Odds Ratio: Estimating Price Sensitivity for Unbought Items

    Authors: Jean Pauphilet

    Abstract: Problem definition: Mining for heterogeneous responses to an intervention is a crucial step for data-driven operations, for instance to personalize treatment or pricing. We investigate how to estimate price sensitivity from transaction-level data. In causal inference terms, we estimate heterogeneous treatment effects when (a) the response to treatment (here, whether a customer buys a product) is b… ▽ More

    Submitted 13 May, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: Manufacturing & Service Operations Management 0(0) (2022)

    MSC Class: 62D20; 62D10; 90B50

    Journal ref: Manufacturing & Service Operations Management 26(1):11-27 (2022)

  9. arXiv:2105.05947  [pdf, other

    math.OC cs.LG stat.ML

    A new perspective on low-rank optimization

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: A key question in many low-rank problems throughout optimization, machine learning, and statistics is to characterize the convex hulls of simple low-rank sets and judiciously apply these convex hulls to obtain strong yet computationally tractable convex relaxations. We invoke the matrix perspective function - the matrix analog of the perspective function - and characterize explicitly the convex hu… ▽ More

    Submitted 2 March, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: Major revision submitted to Mathematical Programming

    Journal ref: Mathematical Programming 202: 47--92, 2023

  10. arXiv:2104.03158  [pdf, other

    stat.ML cs.LG

    Simple Imputation Rules for Prediction with Missing Data: Contrasting Theoretical Guarantees with Empirical Performance

    Authors: Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet

    Abstract: Missing data is a common issue in real-world datasets. This paper studies the performance of impute-then-regress pipelines by contrasting theoretical and empirical evidence. We establish the asymptotic consistency of such pipelines for a broad family of imputation methods. While common sense suggests that a `good' imputation method produces datasets that are plausible, we show, on the contrary, th… ▽ More

    Submitted 2 February, 2024; v1 submitted 7 April, 2021; originally announced April 2021.

    Journal ref: Transactions on Machine Learning Research, 2024

  11. arXiv:2009.10395  [pdf, other

    math.OC cs.LG stat.ML

    Mixed-Projection Conic Optimization: A New Paradigm for Modeling Rank Constraints

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: We propose a framework for modeling and solving low-rank optimization problems to certifiable optimality. We introduce symmetric projection matrices that satisfy $Y^2=Y$, the matrix analog of binary variables that satisfy $z^2=z$, to model rank constraints. By leveraging regularization and strong duality, we prove that this modeling paradigm yields tractable convex optimization problems over the n… ▽ More

    Submitted 2 April, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: major revision submitted to Operations Research

    Journal ref: Operations Research, Articles in Advance 2021

  12. arXiv:2006.16509  [pdf, other

    stat.AP math.OC q-bio.PE stat.ML

    From predictions to prescriptions: A data-driven response to COVID-19

    Authors: Dimitris Bertsimas, Léonard Boussioux, Ryan Cory Wright, Arthur Delarue, Vassilis Digalakis Jr., Alexandre Jacquillat, Driss Lahlou Kitane, Galit Lukin, Michael Lingzhi Li, Luca Mingardi, Omid Nohadani, Agni Orfanoudaki, Theodore Papalexopoulos, Ivan Paskov, Jean Pauphilet, Omar Skali Lami, Bartolomeo Stellato, Hamza Tazi Bouardi, Kimberly Villalobos Carballo, Holly Wiberg, Cynthia Zeng

    Abstract: The COVID-19 pandemic has created unprecedented challenges worldwide. Strained healthcare providers make difficult decisions on patient triage, treatment and care management on a daily basis. Policy makers have imposed social distancing measures to slow the disease, at a steep economic price. We design analytical tools to support these decisions and combat the pandemic. Specifically, we propose a… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: Submitted to PNAS

  13. arXiv:2005.05195  [pdf, other

    math.OC cs.LG math.ST stat.CO

    Solving Large-Scale Sparse PCA to Certifiable (Near) Optimality

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: Sparse principal component analysis (PCA) is a popular dimensionality reduction technique for obtaining principal components which are linear combinations of a small subset of the original features. Existing approaches cannot supply certifiably optimal principal components with more than $p=100s$ of variables. By reformulating sparse PCA as a convex mixed-integer semidefinite optimization problem,… ▽ More

    Submitted 25 August, 2021; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: Revision submitted to JMLR

    Journal ref: Journal of Machine Learning Research 23(13):1-35, 2022

  14. arXiv:1907.02109  [pdf, other

    math.OC cs.LG stat.ML

    A unified approach to mixed-integer optimization problems with logical constraints

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: We propose a unified framework to address a family of classical mixed-integer optimization problems with logically constrained decision variables, including network design, facility location, unit commitment, sparse portfolio selection, binary quadratic optimization, sparse principal analysis and sparse learning problems. These problems exhibit logical relationships between continuous and discrete… ▽ More

    Submitted 25 January, 2021; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: Revised version (including title change). The old title was "A unified approach to mixed-integer optimization: Nonlinear formulations and scalable algorithms"

  15. arXiv:1906.10283  [pdf, other

    stat.ML cs.LG math.OC

    Certifiably Optimal Sparse Inverse Covariance Estimation

    Authors: Dimitris Bertsimas, Jourdain Lamperski, Jean Pauphilet

    Abstract: We consider the maximum likelihood estimation of sparse inverse covariance matrices. We demonstrate that current heuristic approaches primarily encourage robustness, instead of the desired sparsity. We give a novel approach that solves the cardinality constrained likelihood problem to certifiable optimality. The approach uses techniques from mixed-integer optimization and convex optimization, and… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

    MSC Class: 90C11; 90C22; 62H12

    Journal ref: Mathematical Programming 184 (2020) 491-530

  16. Sparse Regression: Scalable algorithms and empirical performance

    Authors: Dimitris Bertsimas, Jean Pauphilet, Bart Van Parys

    Abstract: In this paper, we review state-of-the-art methods for feature selection in statistics with an application-oriented eye. Indeed, sparsity is a valuable property and the profusion of research on the topic might have provided little guidance to practitioners. We demonstrate empirically how noise and correlation impact both the accuracy - the number of correct features selected - and the false detecti… ▽ More

    Submitted 28 February, 2020; v1 submitted 18 February, 2019; originally announced February 2019.

    Journal ref: Statistical Science 35-4 (2020) 555-578

  17. Sparse Classification: a scalable discrete optimization perspective

    Authors: Dimitris Bertsimas, Jean Pauphilet, Bart Van Parys

    Abstract: We formulate the sparse classification problem of $n$ samples with $p$ features as a binary convex optimization problem and propose a cutting-plane algorithm to solve it exactly. For sparse logistic regression and sparse SVM, our algorithm finds optimal solutions for $n$ and $p$ in the $10,000$s within minutes. On synthetic data our algorithm achieves perfect support recovery in the large sample r… ▽ More

    Submitted 30 June, 2020; v1 submitted 3 October, 2017; originally announced October 2017.

    Journal ref: Machine Learning 110, 3177-3209 (2021)