
Showing 1–17 of 17 results for author: Gunluk, O

  1. arXiv:2411.18581  [pdf, ps, other]

    cs.DS cs.DM quant-ph

    Parallel Token Swapping for Qubit Routing

    Authors: Ishan Bansal, Oktay Günlük, Richard Shapley

    Abstract: In this paper we study a combinatorial reconfiguration problem that involves finding an optimal sequence of swaps to move an initial configuration of tokens that are placed on the vertices of a graph to a final desired one. This problem arises as a crucial step in reducing the depth of a quantum circuit when compiling a quantum algorithm. We provide the first known constant factor approximation al…

    Submitted 27 November, 2024; originally announced November 2024.

  2. arXiv:2409.02963  [pdf, other]

    math.OC cs.CY cs.LG

    Fair Minimum Representation Clustering via Integer Programming

    Authors: Connor Lawless, Oktay Gunluk

    Abstract: Clustering is an unsupervised learning task that aims to partition data into a set of clusters. In many applications, these clusters correspond to real-world constructs (e.g., electoral districts, playlists, TV channels) whose benefit can only be attained by groups when they reach a minimum level of representation (e.g., 50\% to elect their desired candidate). In this paper, we study the k-means a…

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: arXiv admin note: text overlap with arXiv:2302.03151

  3. arXiv:2401.10738  [pdf, ps, other]

    cs.DS

    Warehouse Problem with Multiple Vendors and Generalized Complementarity Constraints

    Authors: Ishan Bansal, Oktay Günlük

    Abstract: We study the warehouse problem, arising in the area of inventory management and production planning. Here, a merchant wants to decide an optimal trading policy that computes quantities of a single commodity to purchase, store and sell during each time period of a finite discrete time horizon. Motivated by recent applications in energy markets, we extend the models by Wolsey and Yaman (2018) and Ba…

    Submitted 19 January, 2024; originally announced January 2024.

  4. arXiv:2302.12136  [pdf, ps, other]

    cs.DS

    Warehouse Problem with Bounds, Fixed Costs and Complementarity Constraints

    Authors: Ishan Bansal, Oktay Günlük

    Abstract: This paper studies an open question in the warehouse problem where a merchant trading a commodity tries to find an optimal inventory-trading policy to decide on purchase and sale quantities during a fixed time horizon in order to maximize their total pay-off, making use of fluctuations in sale and cost prices. We provide the first known polynomial-time algorithms for the case when there are fixed…

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: Version 1 of full paper

  5. arXiv:2302.03151  [pdf, other]

    cs.LG math.OC

    Fair Minimum Representation Clustering

    Authors: Connor Lawless, Oktay Gunluk

    Abstract: Clustering is an unsupervised learning task that aims to partition data into a set of clusters. In many applications, these clusters correspond to real-world constructs (e.g. electoral districts) whose benefit can only be attained by groups when they reach a minimum level of representation (e.g. 50\% to elect their desired candidate). This paper considers the problem of performing k-means clusteri…

    Submitted 8 February, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  6. arXiv:2211.03997  [pdf, other]

    math.OC cs.DS

    Online Decision Making with Nonconvex Local and Convex Global Constraints

    Authors: Rui Chen, Oktay Gunluk, Andrea Lodi, Guanyi Wang

    Abstract: We study the online decision making problem (ODMP) as a natural generalization of online linear programming. In ODMP, a single decision maker undertakes a sequence of decisions over $T$ time steps. At each time step, the decision maker makes a locally feasible decision based on information available up to that point. The objective is to maximize the accumulated reward while satisfying some convex…

    Submitted 28 June, 2024; v1 submitted 7 November, 2022; originally announced November 2022.

  7. arXiv:2210.08798  [pdf, other]

    cs.LG math.OC

    Cluster Explanation via Polyhedral Descriptions

    Authors: Connor Lawless, Oktay Gunluk

    Abstract: Clustering is an unsupervised learning problem that aims to partition unlabelled data points into groups with similar features. Traditional clustering algorithms provide limited insight into the groups they find as their main focus is accuracy and not the interpretability of the group assignments. This has spurred a recent line of work on explainable machine learning for clustering. In this paper…

    Submitted 17 October, 2022; originally announced October 2022.

  8. arXiv:2111.08466  [pdf, other]

    cs.LG cs.AI math.OC

    Interpretable and Fair Boolean Rule Sets via Column Generation

    Authors: Connor Lawless, Sanjeeb Dash, Oktay Gunluk, Dennis Wei

    Abstract: This paper considers the learning of Boolean rules in disjunctive normal form (DNF, OR-of-ANDs, equivalent to decision rule sets) as an interpretable model for classification. An integer program is formulated to optimally trade classification accuracy for rule simplicity. We also consider the fairness setting and extend the formulation to include explicit constraints on two different measures of c…

    Submitted 18 September, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2107.01325, arXiv:1805.09901

    Journal ref: Journal of Machine Learning Research 2023 Volume 24, Number 229, Pages 1-50

  9. arXiv:2107.01325  [pdf, other]

    cs.LG cs.AI cs.CY

    Fair Decision Rules for Binary Classification

    Authors: Connor Lawless, Oktay Gunluk

    Abstract: In recent years, machine learning has begun automating decision making in fields as varied as college admissions, credit lending, and criminal sentencing. The socially sensitive nature of some of these applications together with increasing regulatory constraints has created a need for algorithms that are both fair and interpretable. In this paper we consider the problem of building Boolean…

    Submitted 2 July, 2021; originally announced July 2021.

  10. arXiv:2106.13434  [pdf, other]

    math.OC cs.DM cs.LG

    Binary Matrix Factorisation and Completion via Integer Programming

    Authors: Reka A. Kovacs, Oktay Gunluk, Raphael A. Hauser

    Abstract: Binary matrix factorisation is an essential tool for identifying discrete patterns in binary data. In this paper we consider the rank-k binary matrix factorisation problem (k-BMF) under Boolean arithmetic: we are given an n x m binary matrix X with possibly missing entries and need to find two binary matrices A and B of dimension n x k and k x m respectively, which minimise the distance between X…

    Submitted 3 August, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

  11. arXiv:2011.04457  [pdf, other]

    math.OC cs.DM cs.LG

    Binary Matrix Factorisation via Column Generation

    Authors: Reka A. Kovacs, Oktay Gunluk, Raphael A. Hauser

    Abstract: Identifying discrete patterns in binary data is an important dimensionality reduction tool in machine learning and data mining. In this paper, we consider the problem of low-rank binary matrix factorisation (BMF) under Boolean arithmetic. Due to the hardness of this problem, most previous attempts rely on heuristic techniques. We formulate the problem as a mixed integer linear program and use a la…

    Submitted 3 August, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: final version as published by AAAI 2021, plus Appendix

  12. arXiv:2006.14084  [pdf, other]

    cs.LG stat.ML

    Multilabel Classification by Hierarchical Partitioning and Data-dependent Grouping

    Authors: Shashanka Ubaru, Sanjeeb Dash, Arya Mazumdar, Oktay Gunluk

    Abstract: In modern multilabel classification problems, each data instance belongs to a small number of classes from a large set of classes. In other words, these problems involve learning very sparse binary label vectors. Moreover, in large-scale problems, the labels typically have certain (unknown) hierarchy. In this paper we exploit the sparsity of label vectors and the hierarchical structure to embed th…

    Submitted 31 October, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

    Journal ref: Neural Information Processing Systems (NeurIPS), 2020

  13. arXiv:1906.01761  [pdf, other]

    cs.LG math.OC stat.ML

    Generalized Linear Rule Models

    Authors: Dennis Wei, Sanjeeb Dash, Tian Gao, Oktay Günlük

    Abstract: This paper considers generalized linear models using rule-based features, also referred to as rule ensembles, for regression and probabilistic classification. Rules facilitate model interpretation while also capturing nonlinear dependences and interactions. Our problem formulation accordingly trades off rule set complexity and prediction accuracy. Column generation is used to optimize over an expo…

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: Published in the Proceedings of the 36th International Conference on Machine Learning (ICML), PMLR 97:6687-6696, 2019. 17 pages, 7 figures

  14. arXiv:1805.09901  [pdf, other]

    cs.AI

    Boolean Decision Rules via Column Generation

    Authors: Sanjeeb Dash, Oktay Günlük, Dennis Wei

    Abstract: This paper considers the learning of Boolean rules in either disjunctive normal form (DNF, OR-of-ANDs, equivalent to decision rule sets) or conjunctive normal form (CNF, AND-of-ORs) as an interpretable model for classification. An integer program is formulated to optimally trade classification accuracy for rule simplicity. Column generation (CG) is used to efficiently search over an exponential nu…

    Submitted 5 August, 2020; v1 submitted 24 May, 2018; originally announced May 2018.

  15. arXiv:1805.03682  [pdf, other]

    math.OC cs.DS eess.SY math.DS

    Robust-to-Dynamics Optimization

    Authors: Amir Ali Ahmadi, Oktay Gunluk

    Abstract: A robust-to-dynamics optimization (RDO) problem is an optimization problem specified by two pieces of input: (i) a mathematical program (an objective function $f:\mathbb{R}^n\rightarrow\mathbb{R}$ and a feasible set $\Omega\subseteq\mathbb{R}^n$), and (ii) a dynamical system (a map $g:\mathbb{R}^n\rightarrow\mathbb{R}^n$). Its goal is to minimize $f$ over the set $\mathcal{S}\subseteq\Omega$ of initial cond…

    Submitted 22 November, 2023; v1 submitted 9 May, 2018; originally announced May 2018.

    Comments: Major revision

  16. arXiv:1803.04825  [pdf, other]

    cs.LG cs.DM stat.ML

    Low-Rank Boolean Matrix Approximation by Integer Programming

    Authors: Reka Kovacs, Oktay Gunluk, Raphael Hauser

    Abstract: Low-rank approximations of data matrices are an important dimensionality reduction tool in machine learning and regression analysis. We consider the case of categorical variables, where it can be formulated as the problem of finding low-rank approximations to Boolean matrices. In this paper we give what is to the best of our knowledge the first integer programming formulation that relies on only p…

    Submitted 13 March, 2018; originally announced March 2018.

  17. arXiv:1612.03225  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Optimal Generalized Decision Trees via Integer Programming

    Authors: Oktay Gunluk, Jayant Kalagnanam, Minhan Li, Matt Menickelly, Katya Scheinberg

    Abstract: Decision trees have been a very popular class of predictive models for decades due to their interpretability and good performance on categorical features. However, they are not always robust and tend to overfit the data. Additionally, if allowed to grow large, they lose interpretability. In this paper, we present a mixed integer programming formulation to construct optimal decision trees of a pres…

    Submitted 13 August, 2019; v1 submitted 9 December, 2016; originally announced December 2016.

    MSC Class: 90C10