-
Plausible Counterfactual Explanations of Recommendations
Authors:
Jakub Černý,
Jiří Němeček,
Ivan Dovica,
Jakub Mareček
Abstract:
Explanations play a variety of roles in various recommender systems, from a legally mandated afterthought, through an integral element of user experience, to a key to persuasiveness. A natural and useful form of an explanation is the Counterfactual Explanation (CE). We present a method for generating highly plausible CEs in recommender systems and evaluate it both numerically and with a user study.
Submitted 10 July, 2025;
originally announced July 2025.
-
Benchmarking Stochastic Approximation Algorithms for Fairness-Constrained Training of Deep Neural Networks
Authors:
Andrii Kliachkin,
Jana Lepšová,
Gilles Bareilles,
Jakub Mareček
Abstract:
The ability to train Deep Neural Networks (DNNs) with constraints is instrumental in improving the fairness of modern machine-learning models. Many algorithms have been analysed in recent years, and yet there is no standard, widely accepted method for the constrained training of DNNs. In this paper, we provide a challenging benchmark of real-world large-scale fairness-constrained learning tasks, built on top of the US Census (Folktables). We point out the theoretical challenges of such tasks and review the main approaches in stochastic approximation algorithms. Finally, we demonstrate the use of the benchmark by implementing and comparing three recently proposed, but as-of-yet unimplemented, algorithms both in terms of optimization performance, and fairness improvement. We release the code of the benchmark as a Python package at https://github.com/humancompatible/train.
Submitted 5 July, 2025;
originally announced July 2025.
-
A Feedback Control Framework for Incentivised Suburban Parking Utilisation and Urban Core Traffic Relief
Authors:
Abdul Baseer Satti,
James Saunderson,
Wynita Griggs,
S. M. Nawazish Ali,
Nameer Al Khafaf,
Saman Ahmadi,
Mahdi Jalili,
Jakub Marecek,
Robert Shorten
Abstract:
Urban traffic congestion, exacerbated by inefficient parking management and cruising for parking, significantly hampers mobility and sustainability in smart cities. Drivers often face delays searching for parking spaces, influenced by factors such as accessibility, cost, distance, and available services such as charging facilities in the case of electric vehicles. These inefficiencies contribute to increased urban congestion, fuel consumption, and environmental impact. Addressing these challenges, this paper proposes a feedback control incentivisation-based system that aims to better distribute vehicles between city and suburban parking facilities offering park-and-charge/-ride services. Individual driver behaviours are captured via discrete choice models incorporating factors of importance to parking location choice among drivers, such as distance to work, public transport connectivity, charging infrastructure availability, and amount of incentive offered; and are regulated through principles of ergodic control theory. The proposed framework is applied to an electric vehicle park-and-charge/-ride problem, and demonstrates how predictable long-term behaviour of the system can be guaranteed.
Submitted 8 May, 2025;
originally announced May 2025.
-
Undecidable problems associated with variational quantum algorithms
Authors:
Georgios Korpas,
Vyacheslav Kungurtsev,
Jakub Mareček
Abstract:
Variational Quantum Algorithms (VQAs), such as the Variational Quantum Eigensolver (VQE) and the Quantum Approximate Optimization Algorithm (QAOA), are widely studied as candidates for near-term quantum advantage. Recent work has shown that training VQAs is NP-hard in general. In this paper, we present a conditional result suggesting that the training of VQAs is undecidable, even in idealized, noiseless settings. We reduce the decision version of the digitized VQA training problem, in which circuit parameters are drawn from a discrete set, to the question of whether a universal Diophantine equation (UDE) has a root. This reduction relies on encoding the UDE into the structure of a variational quantum circuit via the matrix exponentials. The central step involves establishing a correspondence between the objective function of the VQA and a known UDE of 58 variables and degree 4. Our main result is conditional on a natural conjecture: that a certain system of structured complex polynomial equations, arising from the inner product of a VQA circuit output and a fixed observable, has at least one solution. We argue this conjecture is plausible based on dimension-counting arguments (degrees of freedom in the Hamiltonians, state vector, and observable), and the generic solvability of such systems in algebraic geometry over the complex numbers. Under this assumption, we suggest that deciding whether a digitized VQA achieves a given energy threshold is undecidable. This links the limitations of variational quantum algorithms to foundational questions in mathematics and logic, extending the known landscape of quantum computational hardness to include uncomputability. Additionally, we establish an unconditional undecidability result for VQA convergence in open quantum systems.
Submitted 31 March, 2025;
originally announced March 2025.
-
ExMAG: Learning of Maximally Ancestral Graphs
Authors:
Petr Ryšavý,
Pavel Rytíř,
Xiaoyu He,
Georgios Korpas,
Jakub Mareček
Abstract:
In mixed graphs, there are both directed and undirected edges. An extension of acyclicity to this mixed-graph setting is known as maximally ancestral graphs. This extension is of considerable interest in causal learning in the presence of confounders. There, directed edges represent a clear direction of causality, while undirected edges represent confounding. We propose a score-based branch-and-cut algorithm for learning maximally ancestral graphs. The algorithm produces more accurate results than state-of-the-art methods, while being faster to run on small and medium-sized synthetic instances.
Submitted 22 May, 2025; v1 submitted 11 March, 2025;
originally announced March 2025.
-
On the Random Schrödinger Equation and Geometric Quantum Control
Authors:
Rufus Lawrence,
Aleš Wodecki,
Johannes Aspman,
Jakub Mareček
Abstract:
We introduce the random Schrödinger equation, with a noise term given by a random Hermitian matrix, as a means to model noisy quantum systems. We derive bounds on the error of the synthesised unitary in terms of bounds on the norm of the noise, and show that for certain noise processes these bounds are tight. We then show that in certain situations, minimising the error is equivalent to finding a geodesic on SU(n) with respect to a Riemannian metric encoding the coupling between the control pulse and the noise process. Our work thus extends the series of seminal papers by Nielsen et al. on the geometry of quantum gate complexity.
Submitted 31 March, 2025; v1 submitted 6 March, 2025;
originally announced March 2025.
-
Sample Complexity of Bias Detection with Subsampled Point-to-Subspace Distances
Authors:
German Martinez Matilla,
Jakub Marecek
Abstract:
Sample complexity of bias estimation is a lower bound on the runtime of any bias detection method. Many regulatory frameworks require the bias to be tested for all subgroups, whose number grows exponentially with the number of protected attributes. Unless one wishes to run a bias detection with a doubly-exponential run-time, one should like to have polynomial complexity of bias detection for a single subgroup. At the same time, the reference data may be based on surveys, and thus come with non-trivial uncertainty.
Here, we reformulate bias detection as a point-to-subspace problem on the space of measures and show that, for supremum norm, it can be subsampled efficiently. In particular, our probabilistically approximately correct (PAC) results are corroborated by tests on well-known instances.
Submitted 4 February, 2025;
originally announced February 2025.
-
Bias Detection via Maximum Subgroup Discrepancy
Authors:
Jiří Němeček,
Mark Kozdoba,
Illia Kryvoviaz,
Tomáš Pevný,
Jakub Mareček
Abstract:
Bias evaluation is fundamental to trustworthy AI, both in terms of checking data quality and in terms of checking the outputs of AI systems. In testing data quality, for example, one may study the distance of a given dataset, viewed as a distribution, to a given ground-truth reference dataset. However, classical metrics, such as the Total Variation and the Wasserstein distances, are known to have high sample complexities and, therefore, may fail to provide a meaningful distinction in many practical scenarios.
In this paper, we propose a new notion of distance, the Maximum Subgroup Discrepancy (MSD). In this metric, two distributions are close if, roughly, discrepancies are low for all feature subgroups. While the number of subgroups may be exponential, we show that the sample complexity is linear in the number of features, thus making it feasible for practical applications. Moreover, we provide a practical algorithm for evaluating the distance based on Mixed-integer optimization (MIO). We also note that the proposed distance is easily interpretable, thus providing clearer paths to fixing the biases once they have been identified. Finally, we describe a natural general bias detection framework, termed MSDD distances, and show that MSD aligns well with this framework. We empirically evaluate MSD by comparing it with other metrics and by demonstrating the above properties of MSD on real-world datasets.
Submitted 11 June, 2025; v1 submitted 4 February, 2025;
originally announced February 2025.
-
Identifiability of Autonomous and Controlled Open Quantum Systems
Authors:
Waqas Parvaiz,
Johannes Aspman,
Ales Wodecki,
Georgios Korpas,
Jakub Marecek
Abstract:
Open quantum systems are a rich area of research in the intersection of quantum mechanics and stochastic analysis. By considering a variety of master equations, we unify multiple views of autonomous and controlled open quantum systems and, through considering their measurement dynamics, connect them to classical linear and bilinear system identification theory. This allows us to formulate corresponding notions of quantum state identifiability for these systems which, in particular, applies to quantum state tomography, providing conditions under which the probed quantum system is reconstructible. Interestingly, the dynamical representation of the system lends itself to considering two types of identifiability: the full master equation recovery and the recovery of the corresponding system matrices of the linear and bilinear systems. Both of these concepts are discussed in detail, and conditions under which reconstruction is possible are given. We set the groundwork for a number of constructive approaches to the identification of open quantum systems.
Submitted 19 February, 2025; v1 submitted 9 January, 2025;
originally announced January 2025.
-
Power System Steady-State Estimation Revisited
Authors:
Pavel Rytir,
Ales Wodecki,
Martin Malachov,
Pavel Baxant,
Premysl Vorac,
Miloslava Chladova,
Jakub Marecek
Abstract:
In power system steady-state estimation (PSSE), one needs to consider (1) the need for robust statistics, (2) the nonconvex transmission constraints, and (3) the fast-varying nature of the inputs, with the corresponding need to track optimal trajectories as closely as possible. In combination, these challenges have not yet been considered. In this paper, we address all three challenges. The need for robustness (1) is addressed by using an approach based on the so-called Huber model. The non-convexity (2) of the problem, which results in first-order methods failing to find global minima, is dealt with by applying global methods. One of these methods is based on a mixed-integer quadratic formulation, which provides results several orders of magnitude better than conventional gradient descent. Lastly, the trajectory tracking (3) is discussed by showing under which conditions the trajectory tracking of the SDP relaxations has meaning.
Submitted 6 January, 2025;
originally announced January 2025.
-
ExDBN: Exact learning of Dynamic Bayesian Networks
Authors:
Pavel Rytir,
Ales Wodecki,
Georgios Korpas,
Jakub Marecek
Abstract:
Causal learning from data has received much attention in recent years. One way of capturing causal relationships is by utilizing Bayesian networks. There, one recovers a weighted directed acyclic graph, in which random variables are represented by vertices, and the weights associated with each edge represent the strengths of the causal relationships between them. This concept is extended to capture dynamic effects by introducing a dependency on past data, which may be captured by the structural equation model, which is utilized in the present contribution to formulate a score-based learning approach. A mixed-integer quadratic program is formulated and an algorithmic solution proposed, in which the pre-generation of exponentially many acyclicity constraints is avoided by utilizing the so-called branch-and-cut ("lazy constraint") method. Comparing the novel approach to the state of the art, we show that the proposed approach turns out to produce excellent results when applied to small and medium-sized synthetic instances of up to 25 time-series. Lastly, two interesting applications in bio-science and finance, to which the method is directly applied, further stress the opportunities in developing highly accurate, globally convergent solvers that can handle modest instances.
Submitted 22 October, 2024; v1 submitted 21 October, 2024;
originally announced October 2024.
-
Topological quantum compilation of two-qubit gates
Authors:
Phillip C. Burke,
Christos Aravanis,
Johannes Aspman,
Jakub Mareček,
Jiří Vala
Abstract:
We investigate the topological quantum compilation of two-qubit operations within a system of Fibonacci anyons. Our primary goal is to generate gates that are approximately leakage-free and equivalent to the controlled-NOT (CNOT) gate up to single-qubit operations. These gates belong to the local equivalence class [CNOT]. Additionally, we explore which local equivalence classes of two-qubit operations can be naturally generated by braiding Fibonacci anyons. We discovered that most of the generated classes are located near the edges of the Weyl chamber representation of two-qubit gates, specifically between the local equivalence classes of the identity [1] and [CNOT], and between those of the double-controlled-NOT [DCNOT] and [SWAP]. Furthermore, we found a numerically exact implementation of a local equivalent of the SWAP gate using a sequence of only nine elements from the Fibonacci braiding gate set.
Submitted 13 August, 2024;
originally announced August 2024.
-
Empirical Bayes for Dynamic Bayesian Networks Using Generalized Variational Inference
Authors:
Vyacheslav Kungurtsev,
Apaar,
Aarya Khandelwal,
Parth Sandeep Rastogi,
Bapi Chatterjee,
Jakub Mareček
Abstract:
In this work, we demonstrate the Empirical Bayes approach to learning a Dynamic Bayesian Network. By starting with several point estimates of structure and weights, we can use a data-driven prior to subsequently obtain a model to quantify uncertainty. This approach uses a recent development of Generalized Variational Inference, and indicates the potential of sampling the uncertainty of a mixture of DAG structures as well as a parameter posterior.
Submitted 28 June, 2024; v1 submitted 25 June, 2024;
originally announced June 2024.
-
ExDAG: Exact learning of DAGs
Authors:
Pavel Rytíř,
Aleš Wodecki,
Jakub Mareček
Abstract:
There has been a growing interest in causal learning in recent years. Commonly used representations of causal structures, including Bayesian networks and structural equation models (SEM), take the form of directed acyclic graphs (DAGs). We provide a novel mixed-integer quadratic programming formulation and associated algorithm that identifies DAGs on up to 50 vertices, where these are identifiable. We call this method ExDAG, which stands for Exact learning of DAGs. Although there is a superexponential number of constraints that prevent the formation of cycles, the algorithm adds constraints violated by solutions found, rather than imposing all constraints in each continuous-valued relaxation. Our empirical results show that ExDAG outperforms local state-of-the-art solvers in terms of precision and outperforms state-of-the-art global solvers with respect to scaling, when considering Gaussian noise. We also provide validation with respect to other noise distributions.
Submitted 21 June, 2024;
originally announced June 2024.
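The lazy addition of cycle constraints described in the ExDAG abstract can be illustrated with a minimal sketch of the separation step alone. This is not the authors' implementation: the function names `find_cycle` and `violated_cycle_constraint` are hypothetical, the mixed-integer solver is omitted entirely, and the candidate solution is assumed to be given as a set of selected directed edges. If the candidate contains a cycle C, the constraint "at most |C| - 1 of the edges of C may be selected" is violated and could be added before re-solving.

```python
from typing import Dict, List, Optional, Set, Tuple

Edge = Tuple[int, int]


def find_cycle(num_vertices: int, edges: Set[Edge]) -> Optional[List[int]]:
    """Return the vertices of one directed cycle, or None if the graph is acyclic."""
    adjacency: Dict[int, List[int]] = {v: [] for v in range(num_vertices)}
    for u, v in sorted(edges):
        adjacency[u].append(v)

    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on the current DFS path, finished
    colour = [WHITE] * num_vertices
    path: List[int] = []

    def visit(u: int) -> Optional[List[int]]:
        colour[u] = GREY
        path.append(u)
        for v in adjacency[u]:
            if colour[v] == GREY:
                # Back edge: the cycle is the part of the path from v onwards.
                return path[path.index(v):]
            if colour[v] == WHITE:
                found = visit(v)
                if found is not None:
                    return found
        path.pop()
        colour[u] = BLACK
        return None

    for start in range(num_vertices):
        if colour[start] == WHITE:
            found = visit(start)
            if found is not None:
                return found
    return None


def violated_cycle_constraint(num_vertices: int, edges: Set[Edge]) -> Optional[List[Edge]]:
    """Return the edges of one cycle in the candidate solution, if any.

    The corresponding constraint would read: the sum of the selection
    indicators of these edges is at most len(cycle_edges) - 1.
    """
    cycle = find_cycle(num_vertices, edges)
    if cycle is None:
        return None
    return [(cycle[i], cycle[(i + 1) % len(cycle)]) for i in range(len(cycle))]
```

In a branch-and-cut loop, a routine of this kind would be called on each candidate solution, and the solver would continue only with the constraints found so far, which is why the full superexponential family never needs to be generated up front.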
-
Causal Learning in Biomedical Applications: A Benchmark
Authors:
Petr Ryšavý,
Xiaoyu He,
Jakub Mareček
Abstract:
Learning causal relationships between a set of variables is a challenging problem in computer science. Many existing artificial benchmark datasets are based on sampling from causal models and thus contain residual information that $R^2$-sortability can identify. Here, we present a benchmark for methods in causal learning using time series. The presented dataset is not $R^2$-sortable and is based on a real-world scenario of the Krebs cycle that is used in cells to release energy. We provide four scenarios of learning, including short and long time series, and provide guidance so that testing is unified between possible users.
Submitted 16 September, 2024; v1 submitted 21 June, 2024;
originally announced June 2024.
-
Reserve Provision from Electric Vehicles: Aggregate Boundaries and Stochastic Model Predictive Control
Authors:
Jacob Thrän,
Jakub Mareček,
Robert N. Shorten,
Timothy C. Green
Abstract:
Controlled charging of electric vehicles (EVs) is a major potential source of flexibility to facilitate the integration of variable renewable energy and reduce the need for stationary energy storage. To offer system services from EVs, fleet aggregators must address the uncertainty of individual driving and charging behaviour. This paper introduces a means of forecasting the service volume available from EVs by considering several EV batteries as one conceptual battery with aggregate power and energy boundaries. Aggregation avoids the difficult prediction of individual driving behaviour. The predictability of the boundaries is demonstrated using a multiple linear regression model which achieves a normalised root mean square error of 20%-40% for a fleet of 1,000 EVs. A two-stage stochastic model predictive control algorithm is used to schedule reserve services on a day-ahead basis, addressing risk trade-offs by including Conditional Value-at-Risk in the objective function. A case study with 1.2 million domestic EV charge records from Great Britain illustrates that increasing fleet size improves prediction accuracy, thereby increasing reserve revenues and decreasing an aggregator's operational costs. For fleet sizes of 400 or above, cost reductions plateau at 60% compared to uncontrolled charging, with an average of 1.8 kW of reserve provided per vehicle.
Submitted 17 February, 2025; v1 submitted 11 June, 2024;
originally announced June 2024.
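The Conditional Value-at-Risk term mentioned in the abstract above can be illustrated on a finite sample of scenario losses. The sketch below is an assumption-laden illustration, not the paper's model: the function name `empirical_cvar` is hypothetical, scenarios are assumed equally likely, and the standard Rockafellar-Uryasev representation is evaluated at an empirical alpha-quantile.

```python
import math
from typing import Sequence


def empirical_cvar(losses: Sequence[float], alpha: float) -> float:
    """Empirical CVaR at level alpha for equally likely scenario losses.

    Evaluates t + sum(max(loss - t, 0)) / ((1 - alpha) * n) at t equal to
    an empirical alpha-quantile (the Value-at-Risk), which minimises the
    expression over t.
    """
    if not losses:
        raise ValueError("at least one scenario loss is required")
    if not 0.0 <= alpha < 1.0:
        raise ValueError("alpha must lie in [0, 1)")
    ordered = sorted(losses)
    n = len(ordered)
    # Index of the lower empirical alpha-quantile (Value-at-Risk).
    var_index = max(math.ceil(alpha * n) - 1, 0)
    value_at_risk = ordered[var_index]
    excess = sum(max(loss - value_at_risk, 0.0) for loss in ordered)
    return value_at_risk + excess / ((1.0 - alpha) * n)
```

With alpha = 0 the function reduces to the mean loss; as alpha grows, it averages over an increasingly small tail of the worst scenarios, which is how a risk-averse scheduler can trade expected cost against tail outcomes.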
-
Fairness in AI: challenges in bridging the gap between algorithms and law
Authors:
Giorgos Giannopoulos,
Maria Psalla,
Loukas Kavouras,
Dimitris Sacharidis,
Jakub Marecek,
German M Matilla,
Ioannis Emiris
Abstract:
In this paper, we examine algorithmic fairness from the perspective of law, aiming to identify best practices and strategies for the specification and adoption of fairness definitions and algorithms in real-world systems and use cases. We start by providing a brief introduction to current anti-discrimination law in the European Union and the United States and discussing the concepts of bias and fairness from a legal and ethical viewpoint. We then proceed by presenting a set of algorithmic fairness definitions by example, aiming to communicate their objectives to non-technical audiences. Then, we introduce a set of core criteria that need to be taken into account when selecting a specific fairness definition for real-world use case applications. Finally, we enumerate a set of key considerations and best practices for the design and employment of fairness methods in real-world AI applications.
Submitted 30 April, 2024;
originally announced April 2024.
-
Robust Quantum Gate Complexity: Foundations
Authors:
Johannes Aspman,
Vyacheslav Kungurtsev,
Jakub Marecek
Abstract:
Optimal control of closed quantum systems is a well-studied, geometrically elegant set of computational theory and techniques that have proven pivotal in the implementation and understanding of quantum computers. The design of a circuit itself corresponds to an optimal control problem of choosing the appropriate set of gates (which appear as control operands) in order to steer a qubit from an initial, easily prepared state to one that is informative to the user in some sense, e.g., an oracle whose evaluation is part of the circuit. However, contemporary devices are known to be noisy, and it is not certain that a circuit will behave as intended. Yet, although the computational tools exist in broader optimal control theory, robustness of adequate operation of a quantum control system with respect to uncertainty and errors has not yet been broadly studied in the literature. In this paper, we propose a new approach inspired by closed quantum optimal control and its connection to geometric interpretations. To this end, we present the appropriate problem definitions of robustness in the context of quantum control, focusing on its broader implications for gate complexity.
Submitted 26 April, 2024; v1 submitted 24 April, 2024;
originally announced April 2024.
-
Fairness in Ranking: Robustness through Randomization without the Protected Attribute
Authors:
Andrii Kliachkin,
Eleni Psaroudaki,
Jakub Marecek,
Dimitris Fotakis
Abstract:
There has been great interest in fairness in machine learning, especially in relation to classification problems. In ranking-related problems, such as in online advertising, recommender systems, and HR automation, much work on fairness remains to be done. Two complications arise: first, the protected attribute may not be available in many applications. Second, there are multiple measures of fairness of rankings, and optimization-based methods utilizing a single measure of fairness of rankings may produce rankings that are unfair with respect to other measures. In this work, we propose a randomized method for post-processing rankings, which does not require the availability of the protected attribute. In an extensive numerical study, we show the robustness of our methods with respect to P-Fairness and effectiveness with respect to Normalized Discounted Cumulative Gain (NDCG) from the baseline ranking, improving on previously proposed methods.
Submitted 28 March, 2024;
originally announced March 2024.
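For readers unfamiliar with the effectiveness measure named in the abstract above, NDCG can be computed as in the following minimal sketch. It assumes one common variant (linear gain, base-2 logarithmic discount, normalisation by the ideal ordering of the same items); the paper may use a different variant or normalise against the baseline ranking, and the function names are hypothetical.

```python
import math
from typing import Sequence


def dcg(relevances: Sequence[float]) -> float:
    """Discounted cumulative gain of relevance scores listed in ranked order."""
    # The item at 0-based position p is discounted by log2(p + 2),
    # so the top-ranked item has discount log2(2) = 1.
    return sum(rel / math.log2(pos + 2) for pos, rel in enumerate(relevances))


def ndcg(relevances: Sequence[float]) -> float:
    """DCG of the given ranking divided by the DCG of the ideal ranking."""
    ideal = dcg(sorted(relevances, reverse=True))
    if ideal <= 0.0:
        return 0.0  # No relevant items: define NDCG as 0 to avoid dividing by zero.
    return dcg(relevances) / ideal
```

A post-processed ranking that moves relevant items down the list lowers NDCG, so the measure quantifies how much effectiveness a fairness-motivated reordering gives up.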
-
Spectral Methods for Quantum Optimal Control: Artificial Boundary Conditions
Authors:
Ales Wodecki,
Jakub Marecek,
Vyacheslav Kungurtsev,
Pavel Eichler,
Georgios Korpas,
Philip Intallura
Abstract:
The problem of quantum state preparation is one of the main challenges in achieving the quantum advantage. Furthermore, classically, for multi-level problems, our ability to solve the corresponding quantum optimal control problems is rather limited. The ability of the latter to feed into the former may result in significant progress in quantum computing. To address this challenge, we propose a formulation of quantum optimal control that makes use of artificial boundary conditions for the Schrödinger equation in combination with spectral methods. The resulting formulations are well suited for investigating periodic potentials and lend themselves to direct numerical treatment using conventional methods for bounded domains.
Submitted 21 March, 2024;
originally announced March 2024.
-
Learning quantum Hamiltonians at any temperature in polynomial time with Chebyshev and bit complexity
Authors:
Ales Wodecki,
Jakub Marecek
Abstract:
We consider the problem of learning local quantum Hamiltonians given copies of their Gibbs state at a known inverse temperature, following Haah et al. [2108.04842] and Bakshi et al. [arXiv:2310.02243]. Our main technical contribution is a new flat polynomial approximation of the exponential function based on the Chebyshev expansion, which enables the formulation of learning quantum Hamiltonians as a polynomial optimization problem. This, in turn, can benefit from the use of moment/SOS relaxations, whose polynomial bit complexity requires careful analysis [O'Donnell, ITCS 2017]. Finally, we show that learning a $k$-local Hamiltonian, whose dual interaction graph is of bounded degree, runs in polynomial time under mild assumptions.
Submitted 8 February, 2024;
originally announced February 2024.
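The abstract above builds on a Chebyshev expansion of the exponential function. The sketch below shows only plain Chebyshev interpolation of exp on [-1, 1], evaluated with the Clenshaw recurrence; it is not the paper's "flat" approximation, and the function names and the choice of interval are assumptions made for illustration.

```python
import math


def chebyshev_coefficients(f, degree):
    """Coefficients c_k of sum_k c_k T_k(x) interpolating f at Chebyshev nodes."""
    n = degree + 1
    nodes = [math.cos(math.pi * (j + 0.5) / n) for j in range(n)]
    values = [f(x) for x in nodes]
    coeffs = []
    for k in range(n):
        s = sum(v * math.cos(math.pi * k * (j + 0.5) / n)
                for j, v in enumerate(values))
        coeffs.append(2.0 * s / n)
    coeffs[0] /= 2.0  # the constant term carries a factor 1/2
    return coeffs


def chebyshev_eval(coeffs, x):
    """Evaluate sum_k c_k T_k(x) with the Clenshaw recurrence."""
    b1 = b2 = 0.0
    for c in reversed(coeffs[1:]):
        b1, b2 = c + 2.0 * x * b1 - b2, b1
    return coeffs[0] + x * b1 - b2
```

For a smooth function such as exp, the interpolation error on [-1, 1] decays very quickly with the degree, which is what makes low-degree polynomial surrogates attractive in a polynomial optimization formulation.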
-
Truss topology design under harmonic loads: Peak power minimization with semidefinite programming
Authors:
Shenyuan Ma,
Jakub Marecek,
Vyacheslav Kungurtsev,
Marek Tyburec
Abstract:
Designing lightweight yet stiff structures that can withstand vibrations is a crucial task in structural optimization. Here, we present a novel framework for truss topology optimization under undamped harmonic oscillations. Our approach minimizes the peak power of the structure under harmonic loads, overcoming the limitations of single-frequency and in-phase assumptions found in previous methods. For this, we leverage the concept of semidefinite representable (SDr) functions, demonstrating that while compliance readily conforms to an SDr representation, peak power requires a derivation based on the non-negativity of trigonometric functions. Finally, we introduce convex relaxations for the minimization problem and provide promising computational results.
Submitted 19 February, 2025; v1 submitted 29 January, 2024;
originally announced January 2024.
-
The Effects of Transmission-Rights Pricing on Multi-Stage Electricity Markets
Authors:
Erwann de Belloy de Saint-Lienard,
Jakub Marecek,
Vyacheslav Kungurtsev
Abstract:
Cross-border transmission infrastructure is pivotal in balancing modern power systems, but requires fair allocation of cross-border transmission capacity, possibly via fair pricing thereof. This requirement can be implemented using multi-stage market mechanisms for Physical Transmission Rights (PTRs). We analyse the related dynamics, and show that a prisoner's dilemma arises. Understanding these dynamics enables the development of novel market-settlement mechanisms to enhance market efficiency and incentivize renewable energy use.
Submitted 28 January, 2024;
originally announced January 2024.
-
Generating Likely Counterfactuals Using Sum-Product Networks
Authors:
Jiri Nemecek,
Tomas Pevny,
Jakub Marecek
Abstract:
The need to explain decisions made by AI systems is driven by both recent regulation and user demand. The decisions are often explainable only post hoc. In counterfactual explanations, one may ask what constitutes the best counterfactual explanation. Clearly, multiple criteria must be taken into account, although "distance from the sample" is a key criterion. Recent methods that consider the plausibility of a counterfactual seem to sacrifice this original objective. Here, we present a system that provides high-likelihood explanations that are, at the same time, close and sparse. We show that the search for the most likely explanations satisfying many common desiderata for counterfactual explanations can be modeled using Mixed-Integer Optimization (MIO). We use a Sum-Product Network (SPN) to estimate the likelihood of a counterfactual. To achieve that, we propose an MIO formulation of an SPN, which can be of independent interest. The source code with examples is available at https://github.com/Epanemu/LiCE.
Submitted 11 June, 2025; v1 submitted 25 January, 2024;
originally announced January 2024.
-
Scheduling a Multi-Product Pipeline: A Discretized MILP Formulation
Authors:
Ales Wodecki,
Pavel Rytir,
Vyacheslav Kungurtsev,
Jakub Marecek
Abstract:
Multi-product pipelines are a highly efficient means of transporting liquids. Traditionally used to transport petroleum, its products and derivatives, they are now being repurposed to transport liquified natural gas admixed with hydrogen of various colors. We propose a novel mixed-integer linear programming (MILP) formulation, which optimizes efficiency while satisfying a wide range of real-world constraints developed to meet the needs of the Czech national pipeline operator CEPRO. We provide tests on well-known synthetic (path-graph) networks and demonstrate the formulation's scaling properties using open-source and commercial MILP solvers.
Submitted 18 December, 2023;
originally announced December 2023.
-
Challenges and Opportunities in Quantum Optimization
Authors:
Amira Abbas,
Andris Ambainis,
Brandon Augustino,
Andreas Bärtschi,
Harry Buhrman,
Carleton Coffrin,
Giorgio Cortiana,
Vedran Dunjko,
Daniel J. Egger,
Bruce G. Elmegreen,
Nicola Franco,
Filippo Fratini,
Bryce Fuller,
Julien Gacon,
Constantin Gonciulea,
Sander Gribling,
Swati Gupta,
Stuart Hadfield,
Raoul Heese,
Gerhard Kircher,
Thomas Kleinert,
Thorsten Koch,
Georgios Korpas,
Steve Lenk,
Jakub Marecek,
et al. (21 additional authors not shown)
Abstract:
Recent advances in quantum computers are demonstrating the ability to solve problems at a scale beyond brute force classical simulation. As such, a widespread interest in quantum algorithms has developed in many areas, with optimization being one of the most pronounced domains. Across computer science and physics, there are a number of different approaches for major classes of optimization problems, such as combinatorial optimization, convex optimization, non-convex optimization, and stochastic extensions. This work draws on multiple approaches to study quantum optimization. Provably exact versus heuristic settings are first explained using computational complexity theory - highlighting where quantum advantage is possible in each context. Then, the core building blocks for quantum optimization algorithms are outlined to subsequently define prominent problem classes and identify key open questions that, if answered, will advance the field. The effects of scaling relevant problems on noisy quantum devices are also outlined in detail, alongside meaningful benchmarking problems. We underscore the importance of benchmarking by proposing clear metrics to conduct appropriate comparisons with classical optimization techniques. Lastly, we highlight two domains - finance and sustainability - as rich sources of optimization problems that could be used to benchmark, and eventually validate, the potential real-world impact of quantum optimization.
Submitted 17 November, 2024; v1 submitted 4 December, 2023;
originally announced December 2023.
-
Parallel variational quantum algorithms with gradient-informed restart to speed up optimisation in the presence of barren plateaus
Authors:
Daniel Mastropietro,
Georgios Korpas,
Vyacheslav Kungurtsev,
Jakub Marecek
Abstract:
Inspired by the Fleming-Viot stochastic process, we propose a parallel implementation of variational quantum algorithms with the aim of reducing the time spent by the algorithm in barren plateaus, where optimization direction is unclear. In the Fleming-Viot tradition, parallel searches are called particles. In the proposed approach, the search by a Fleming-Viot particle is stopped when it encounters a region where the gradient is too small or noisy, suggesting a barren plateau area. The stopped particle continues the search after being regenerated at another location of the parameter space, potentially taking the exploration away from barren plateaus. We first analyze the behavior of the Fleming-Viot particles from a theoretical standpoint. We show that, when simulated annealing optimizers are used as particles, the Fleming-Viot system is expected to find the global optimum faster than a single simulated annealing optimizer, with a relative efficiency that increases proportionally to the percentage of barren plateaus in the domain. This result is corroborated by numerical experiments carried out on synthetic problems as well as on instances of the Max-Cut problem, which show that our method performs better than plain simulated annealing when large barren plateaus are present in the domain.
Submitted 15 December, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
Piecewise Polynomial Regression of Tame Functions via Integer Programming
Authors:
Gilles Bareilles,
Johannes Aspman,
Jiri Nemecek,
Jakub Marecek
Abstract:
Tame functions are a class of nonsmooth, nonconvex functions, which feature in a wide range of applications: functions encountered in the training of deep neural networks with all common activations, value functions of mixed-integer programs, or wave functions of small molecules. We consider approximating tame functions with piecewise polynomial functions. We bound the quality of approximation of a tame function by a piecewise polynomial function with a given number of segments on any full-dimensional cube. We also present the first mixed-integer programming formulation of piecewise polynomial regression. Together, these can be used to estimate tame functions. We demonstrate promising computational results.
Submitted 4 June, 2024; v1 submitted 22 November, 2023;
originally announced November 2023.
-
Joint Problems in Learning Multiple Dynamical Systems
Authors:
Mengjia Niu,
Xiaoyu He,
Petr Ryšavý,
Quan Zhou,
Jakub Marecek
Abstract:
Clustering of time series is a well-studied problem, with applications ranging from quantitative, personalized models of metabolism obtained from metabolite concentrations to state discrimination in quantum information theory. We consider a variant, where given a set of trajectories and a number of parts, we jointly partition the set of trajectories and learn linear dynamical system (LDS) models for each part, so as to minimize the maximum error across all the models. We present globally convergent methods and EM heuristics, accompanied by promising computational results. The key highlight of this method is that it does not require a predefined hidden state dimension but instead provides an upper bound. Additionally, it offers guidance for determining regularization in the system identification.
Submitted 5 May, 2025; v1 submitted 3 November, 2023;
originally announced November 2023.
-
Group-blind optimal transport to group parity and its constrained variants
Authors:
Quan Zhou,
Jakub Marecek
Abstract:
Fairness holds a pivotal role in the realm of machine learning, particularly when it comes to addressing groups categorised by protected attributes, e.g., gender, race. Prevailing algorithms in fair learning predominantly hinge on accessibility or estimations of these protected attributes, at least in the training process. We design a single group-blind projection map that aligns the feature distributions of both groups in the source data, achieving (demographic) group parity, without requiring values of the protected attribute for individual samples in the computation of the map or in its use. Instead, our approach utilises the feature distributions of the privileged and unprivileged groups in a broader population and the essential assumption that the source data are an unbiased representation of the population. We present numerical results on synthetic data and real data.
Submitted 8 November, 2024; v1 submitted 17 October, 2023;
originally announced October 2023.
-
Taming Binarized Neural Networks and Mixed-Integer Programs
Authors:
Johannes Aspman,
Georgios Korpas,
Jakub Marecek
Abstract:
There has been a great deal of recent interest in binarized neural networks, especially because of their explainability. At the same time, automatic differentiation algorithms such as backpropagation fail for binarized neural networks, which limits their applicability. By reformulating the problem of training binarized neural networks as a subadditive dual of a mixed-integer program, we show that binarized neural networks admit a tame representation. This, in turn, makes it possible to use the framework of Bolte et al. for implicit differentiation, which offers the possibility for practical implementation of backpropagation in the context of binarized neural networks.
This approach could also be used for a broader class of mixed-integer programs, beyond the training of binarized neural networks, as encountered in symbolic approaches to AI and beyond.
Submitted 20 December, 2023; v1 submitted 5 October, 2023;
originally announced October 2023.
-
Improving the Validity of Decision Trees as Explanations
Authors:
Jiri Nemecek,
Tomas Pevny,
Jakub Marecek
Abstract:
In classification and forecasting with tabular data, one often utilizes tree-based models. Those can be competitive with deep neural networks on tabular data and, under some conditions, explainable. The explainability depends on the depth of the tree and the accuracy in each leaf of the tree. We point out that decision trees containing leaves with unbalanced accuracy can provide misleading explanations. Low-accuracy leaves give less valid explanations, which could be interpreted as unfairness among subgroups utilizing these explanations. Here, we train a shallow tree with the objective of minimizing the maximum misclassification error across all leaf nodes. The shallow tree provides a global explanation, while the overall statistical performance of the shallow tree can become comparable to state-of-the-art methods (e.g., well-tuned XGBoost) by extending the leaves with further models.
Submitted 4 June, 2024; v1 submitted 11 June, 2023;
originally announced June 2023.
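The objective described in the abstract above, the maximum misclassification error across all leaves, can be contrasted with the usual overall error rate. The sketch below only evaluates that quantity for a given assignment of samples to leaves; it does not train a tree, and the function names and input layout (parallel lists of leaf ids, true labels, and predicted labels) are assumptions.

```python
from collections import defaultdict


def leaf_error_rates(leaf_ids, y_true, y_pred):
    """Misclassification rate within each leaf."""
    totals = defaultdict(int)
    errors = defaultdict(int)
    for leaf, truth, pred in zip(leaf_ids, y_true, y_pred):
        totals[leaf] += 1
        errors[leaf] += int(truth != pred)
    return {leaf: errors[leaf] / totals[leaf] for leaf in totals}


def max_leaf_error(leaf_ids, y_true, y_pred):
    """Worst-case (maximum) misclassification rate over all leaves."""
    return max(leaf_error_rates(leaf_ids, y_true, y_pred).values())
```

With two leaves whose error rates are 0.5 and 0.25, the overall error can look moderate while the worst leaf is wrong half of the time, which is the imbalance the abstract points to.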
-
Predictability and Fairness in Load Aggregation with Deadband
Authors:
F. V. Difonzo,
M. Roubalik,
J. Marecek
Abstract:
Virtual power plants and load aggregation are becoming increasingly common. There, one regulates the aggregate power output of an ensemble of distributed energy resources (DERs). Marecek et al. [Automatica, Volume 147, January 2023, 110743, arXiv:2110.03001] recently suggested that long-term averages of prices or incentives offered should exist and be independent of the initial states of the operators of the DER, the aggregator, and the power grid. This can be seen as predictability, which underlies fairness. Unfortunately, the existence of such averages cannot be guaranteed with many traditional regulators, including the proportional-integral (PI) regulator with or without deadband. Here, we consider the effects of losses in the alternating current model and the deadband in the controller. This yields a non-linear dynamical system (due to the non-linear losses) exhibiting discontinuities (due to the deadband). We show that Filippov invariant measures enable reasoning about predictability and fairness while considering non-linearity of the alternating-current model and deadband.
Submitted 9 October, 2024; v1 submitted 28 May, 2023;
originally announced May 2023.
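The discontinuity mentioned in the abstract above comes from the deadband in the proportional-integral (PI) regulator. A minimal discrete-time sketch of such a regulator is given below; it is a generic textbook form, not the model analysed in the paper, and the function name, the gains `kp` and `ki`, and the step `dt` are assumptions.

```python
def pi_deadband_step(error, integral, kp, ki, deadband, dt=1.0):
    """One step of a PI regulator that ignores errors inside the deadband.

    Returns the control output and the updated integral state.
    """
    # Inside the deadband the regulator sees zero error, so the integral
    # state is frozen; outside it, the full error is used. This jump at
    # |error| == deadband is the source of the discontinuity.
    effective = 0.0 if abs(error) <= deadband else error
    integral = integral + effective * dt
    output = kp * effective + ki * integral
    return output, integral
```

Small errors leave the output pinned to the stored integral term, while an error just outside the deadband changes the output abruptly.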
-
Hybrid Methods in Polynomial Optimisation
Authors:
Johannes Aspman,
Gilles Bareilles,
Vyacheslav Kungurtsev,
Jakub Marecek,
Martin Takáč
Abstract:
The Moment/Sum-of-squares hierarchy provides a way to compute the global minimizers of polynomial optimization problems (POP), at the cost of solving a sequence of increasingly large semidefinite programs (SDPs). We consider large-scale POPs, for which interior-point methods are no longer able to solve the resulting SDPs. We propose an algorithm that combines a first-order method for solving the SDP relaxation, and a second-order method on a non-convex problem obtained from the POP. The switch from the first to the second-order method is based on a quantitative criterion, whose satisfaction ensures that Newton's method converges quadratically from its first iteration. This criterion leverages the point-estimation theory of Smale and the active-set identification. We illustrate the methodology to obtain global minimizers of large-scale optimal power flow problems.
Submitted 12 September, 2023; v1 submitted 25 May, 2023;
originally announced May 2023.
-
A Survey of Quantum Alternatives to Randomized Algorithms: Monte Carlo Integration and Beyond
Authors:
Philip Intallura,
Georgios Korpas,
Sudeepto Chakraborty,
Vyacheslav Kungurtsev,
Jakub Marecek
Abstract:
Monte Carlo sampling is a powerful toolbox of algorithmic techniques widely used for a number of applications wherein some noisy quantity, or summary statistic thereof, is sought to be estimated. In this paper, we survey the literature for implementing Monte Carlo procedures using quantum circuits, focusing on the potential to obtain a quantum advantage in the computational speed of these procedures. We revisit the quantum algorithms that could replace classical Monte Carlo and then consider both the existing quantum algorithms and the potential quantum realizations that include adaptive enhancements as alternatives to the classical procedure.
Submitted 8 March, 2023;
originally announced March 2023.
-
Iterated Function Systems: A Comprehensive Survey
Authors:
Ramen Ghosh,
Jakub Marecek
Abstract:
We provide an overview of iterated function systems (IFS), where randomly chosen state-to-state maps are applied iteratively to a state. We aim to summarize the state of the art and, where possible, identify fundamental challenges and opportunities for further research.
Submitted 26 November, 2022;
originally announced November 2022.
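The definition in the abstract above, randomly chosen state-to-state maps applied iteratively to a state, can be sketched in a few lines. The helper name `iterate_ifs`, its parameters, and the two example maps (which generate the middle-thirds Cantor set) are assumptions chosen for illustration and do not come from the survey.

```python
import random


def iterate_ifs(maps, probabilities, x0, steps, seed=0):
    """Apply randomly chosen maps to a state and return the trajectory."""
    rng = random.Random(seed)  # seeded for reproducibility
    state = x0
    trajectory = [state]
    for _ in range(steps):
        chosen = rng.choices(maps, weights=probabilities, k=1)[0]
        state = chosen(state)
        trajectory.append(state)
    return trajectory


# Two contractions of [0, 1]; their attractor is the Cantor set.
cantor_maps = [lambda x: x / 3.0, lambda x: x / 3.0 + 2.0 / 3.0]
```

Calling `iterate_ifs(cantor_maps, [0.5, 0.5], 0.5, 1000)` yields a trajectory that, after the first step, never visits the open middle third of the unit interval.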
-
Time-Varying Multi-Objective Optimization: Tradeoff Regret Bounds
Authors:
Allahkaram Shafiei,
Jakub Marecek
Abstract:
Multi-objective optimization studies the process of seeking multiple competing desiderata in some operation. Solution techniques highlight marginal tradeoffs associated with weighing one objective over others. In this paper, we consider time-varying multi-objective optimization, in which the objectives are parametrized by a continuously varying parameter and a prescribed computational budget is available at each time instant to algorithmically adjust the decision variables to accommodate the changes. We prove regret bounds indicating the relative guarantees on performance for the competing objectives.
Submitted 19 April, 2025; v1 submitted 17 November, 2022;
originally announced November 2022.
-
Statistical static timing analysis via modern optimization lens: I. Histogram-based approach
Authors:
Adam Bosak,
Dmytro Mishagli,
Jakub Marecek
Abstract:
Statistical static timing analysis (SSTA) is studied from the point of view of mathematical optimization. We present two formulations of the problem of finding the critical path delay distribution that were not known before: (i) a formulation of the SSTA problem using Binary-Integer Programming and (ii) a practical formulation using Geometric Programming. For simplicity, we use histogram approximation of the distributions. Scalability of the approaches is studied and possible generalizations are discussed.
Submitted 7 September, 2023; v1 submitted 5 November, 2022;
originally announced November 2022.
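For context on the histogram approximation mentioned in the abstract above: with independent delays on a common bin grid, the delay of two gates in series is the convolution of their histograms, and the arrival time at a gate with two inputs is the maximum, whose CDF is the product of the input CDFs. The sketch below computes these two operations directly; it is not the Binary-Integer or Geometric Programming formulation of the paper, and the function names, the independence assumption, and the convention that bin 0 is the smallest delay are all assumptions.

```python
from itertools import accumulate


def sum_pmf(p, q):
    """Histogram of X + Y for independent X ~ p, Y ~ q (discrete convolution)."""
    out = [0.0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def max_pmf(p, q):
    """Histogram of max(X, Y) for independent X ~ p, Y ~ q."""
    n = max(len(p), len(q))
    # Pad the shorter histogram with empty high-delay bins.
    p = list(p) + [0.0] * (n - len(p))
    q = list(q) + [0.0] * (n - len(q))
    # CDF of the maximum is the product of the two CDFs.
    cdf = [a * b for a, b in zip(accumulate(p), accumulate(q))]
    return [cdf[0]] + [cdf[k] - cdf[k - 1] for k in range(1, n)]
```

Propagating these two operations through a timing graph in topological order gives a histogram of the critical path delay under the stated independence assumption.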
-
Optimal Power Flow Pursuit in the Alternating Current Model
Authors:
Jie Liu,
Antonio Bellon,
Andrea Simonetto,
Martin Takac,
Jakub Marecek
Abstract:
Transmission-constrained problems in power systems can be cast as polynomial optimization problems whose coefficients vary over time. We consider the complications therein and suggest several approaches. On the example of the alternating-current optimal power flows (ACOPFs), we illustrate one of the approaches in detail. For the time-varying ACOPF, we provide an upper bound for the difference between the optimal cost for a relaxation using the most recent data and the current approximate optimal cost generated by our algorithm. This bound is a function of the properties of the instance and the rate of change of the coefficients over time. Moreover, we also bound the number of floating-point operations to perform between two subsequent updates to ensure a bounded error.
Submitted 22 September, 2023; v1 submitted 5 November, 2022;
originally announced November 2022.
-
Time-Varying Semidefinite Programming: Path Following a Burer-Monteiro Factorization
Authors:
Antonio Bellon,
Mareike Dressler,
Vyacheslav Kungurtsev,
Jakub Marecek,
André Uschmajew
Abstract:
We present an online algorithm for time-varying semidefinite programs (TV-SDPs), based on the tracking of the solution trajectory of a low-rank matrix factorization, also known as the Burer-Monteiro factorization, in a path-following procedure. There, a predictor-corrector algorithm solves a sequence of linearized systems. This requires the introduction of a horizontal space constraint to ensure the local injectivity of the low-rank factorization. The method produces a sequence of approximate solutions for the original TV-SDP problem, for which we show that they stay close to the optimal solution path if properly initialized. Numerical experiments for a time-varying max-cut SDP relaxation demonstrate the computational advantages of the proposed method for tracking TV-SDPs in terms of runtime compared to off-the-shelf interior point methods.
Submitted 9 January, 2024; v1 submitted 15 October, 2022;
originally announced October 2022.
-
Transpiling Quantum Circuits using the Pentagon Equation
Authors:
Christos Aravanis,
Georgios Korpas,
Jakub Marecek
Abstract:
We consider the application of the pentagon equation in the context of quantum circuit compression. We show that if solutions to the pentagon equation are found, one can transpile a circuit involving non-Heisenberg-type interactions to a circuit involving only Heisenberg-type interactions while, in parallel, reducing the depth of a circuit. In this context, we consider a model of non-local two-qubit operations of Zhang et al. (the $A$ gate), and show that for certain parameters it is a solution of the pentagon equation.
Submitted 28 September, 2022;
originally announced September 2022.
-
On Unique Ergodicity Of Coupled AIMD Flows
Authors:
Pietro Ferraro,
Jia Yuan Yu,
Ramen Ghosh,
Syed Eqbal Alam,
Jakub Marecek,
Fabian Wirth,
Robert Shorten
Abstract:
The AIMD algorithm, which underpins the Transmission Control Protocol (TCP) for transporting data packets in communication networks, is perhaps the most successful control algorithm ever deployed. Recently, its use has been extended beyond communication networks, and successful applications of the AIMD algorithm have been reported in transportation, energy, and mathematical biology. A very recent development in the use of AIMD is its application in solving large-scale optimization and distributed control problems without the need for inter-agent communication. In this context, an interesting problem arises when multiple AIMD networks are coupled in some sense (usually through a nonlinearity). The purpose of this note is to prove that such systems in certain settings inherit the ergodic properties of individual AIMD networks. This result has important consequences for the convergence of the aforementioned optimization algorithms. The arguments in the paper also correct conceptual and technical errors in [1].
Submitted 27 September, 2022;
originally announced September 2022.
-
Iteration Complexity of Variational Quantum Algorithms
Authors:
Vyacheslav Kungurtsev,
Georgios Korpas,
Jakub Marecek,
Elton Yechao Zhu
Abstract:
There has been much recent interest in near-term applications of quantum computers, i.e., using quantum circuits that have short decoherence times due to hardware limitations. Variational quantum algorithms (VQA), wherein an optimization algorithm implemented on a classical computer evaluates a parametrized quantum circuit as an objective function, are a leading framework in this space. An enormous breadth of algorithms in this framework have been proposed for solving a range of problems in machine learning, forecasting, applied physics, and combinatorial optimization, among others.
In this paper, we analyze the iteration complexity of VQA, that is, the number of steps that VQA requires until its iterates satisfy a surrogate measure of optimality. We argue that although VQA procedures incorporate algorithms that can, in the idealized case, be modeled as classic procedures in the optimization literature, the particular nature of noise in near-term devices invalidates the claim of applicability of off-the-shelf analyses of these algorithms. Specifically, noise makes the evaluations of the objective function via quantum circuits biased. Commonly used optimization procedures, such as SPSA and the parameter shift rule, can thus be seen as derivative-free optimization algorithms with biased function evaluations, for which there are currently no iteration complexity guarantees in the literature. We derive the missing guarantees and find that the rate of convergence is unaffected. However, the level of bias contributes unfavorably to both the constant therein, and the asymptotic distance to stationarity, i.e., the more bias, the farther one is guaranteed, at best, to reach a stationary point of the VQA objective.
Submitted 8 September, 2024; v1 submitted 21 September, 2022;
originally announced September 2022.
-
Globally Optimal Quantum Control
Authors:
Denys I. Bondar,
Kurt Jacobs,
Georgios Korpas,
Jakub Marecek,
Jiri Vala
Abstract:
Optimization methods for constrained quantum control problems power quantum technologies. Such control problems are notoriously difficult because they are non-convex and plagued with local extrema. Current optimization methods must be repeated many times to find good solutions, each time requiring many simulations of the system. Here we present Quantum Control via Polynomial Optimization (QCPOp), a method that eliminates this problem by directly finding globally optimal solutions. This remarkable ability is due to global optimization methods recently developed for polynomial functions. We demonstrate the tremendous improvement over current state-of-the-art methods using a number of non-trivial examples. Global optimization also allows QCPOp to find the simplest control solutions. Since QCPOp is able to reveal the optimum performance of quantum control with high confidence, we expect that it will not only enhance the utility of such control but provide a key tool for determining the limits of quantum technologies.
Submitted 10 March, 2023; v1 submitted 13 September, 2022;
originally announced September 2022.
-
Fairness in Forecasting of Observations of Linear Dynamical Systems
Authors:
Quan Zhou,
Jakub Marecek,
Robert N. Shorten
Abstract:
In machine learning, training data often capture the behaviour of multiple subgroups of some underlying human population. This behaviour can often be modelled as observations of an unknown dynamical system with an unobserved state. When the training data for the subgroups are not controlled carefully, however, under-representation bias arises. To counter under-representation bias, we introduce two natural notions of fairness in time-series forecasting problems: subgroup fairness and instantaneous fairness. These notions extend predictive parity to the learning of dynamical systems. We also show globally convergent methods for the fairness-constrained learning problems using hierarchies of convexifications of non-commutative polynomial optimisation problems. We also show that by exploiting sparsity in the convexifications, we can reduce the run time of our methods considerably. Our empirical results on a biased data set motivated by insurance applications and the well-known COMPAS data set demonstrate the efficacy of our methods.
Submitted 15 May, 2023; v1 submitted 12 September, 2022;
originally announced September 2022.
-
Closed-Loop View of the Regulation of AI: Equal Impact across Repeated Interactions
Authors:
Quan Zhou,
Ramen Ghosh,
Robert Shorten,
Jakub Marecek
Abstract:
There has been much recent interest in the regulation of AI. We argue for a view based on civil-rights legislation, built on the notions of equal treatment and equal impact. In a closed-loop view of the AI system and its users, the equal treatment concerns one pass through the loop. Equal impact, in our view, concerns the long-run average behaviour across repeated interactions. In order to establish the existence of the average and its properties, one needs to study the ergodic properties of the closed-loop and its unique stationary measure.
Submitted 25 February, 2024; v1 submitted 3 September, 2022;
originally announced September 2022.
-
Herd Routes: A Preventative IoT-Based System for Improving Female Pedestrian Safety on City Streets
Authors:
Madeleine Woodburn,
Wynita M. Griggs,
Jakub Marecek,
Robert N. Shorten
Abstract:
Over two thirds of women of all ages in the UK have experienced some form of sexual harassment in a public space. Recent tragic incidents involving female pedestrians have highlighted some of the personal safety issues that women still face in cities today. Many popular location-based safety applications exist as a result; however, these applications tend to take a reactive approach, where action is taken only after an incident has occurred. This paper proposes a preventative approach to the problem by creating safer public environments through societal incentivisation. The proposed system, called "Herd Routes", improves the safety of female pedestrians by generating busier pedestrian routes as a result of route incentivisation. A novel application of distributed ledgers is proposed to provide security and trust, a record of system users' locations and IDs, and a platform for token exchange. A proof-of-concept was developed using the simulation package SUMO (Simulation of Urban Mobility), and a smartphone app was built in Android Studio so that pedestrian Hardware-in-the-Loop testing could be carried out to validate the technical feasibility and desirability of the system. With positive results from the initial testing of the proof-of-concept, further development could significantly contribute towards creating safer pedestrian routes through cities, and tackle the societal change that is required to improve female pedestrian safety in the long term.
Submitted 11 July, 2022;
originally announced July 2022.
-
Stochastic Langevin Differential Inclusions with Applications to Machine Learning
Authors:
Fabio V. Difonzo,
Vyacheslav Kungurtsev,
Jakub Marecek
Abstract:
Stochastic differential equations of Langevin-diffusion form have received significant attention, thanks to their foundational role in both Bayesian sampling algorithms and optimization in machine learning. In the latter, they serve as a conceptual model of the stochastic gradient flow in training over-parameterized models. However, the literature typically assumes smoothness of the potential, whose gradient is the drift term. Nevertheless, there are many problems for which the potential function is not continuously differentiable, and hence the drift is not Lipschitz continuous everywhere. This is exemplified by robust losses and Rectified Linear Units in regression problems. In this paper, we show some foundational results regarding the flow and asymptotic properties of Langevin-type Stochastic Differential Inclusions under assumptions appropriate to the machine-learning settings. In particular, we show strong existence of the solution, as well as an asymptotic minimization of the canonical free-energy functional.
Submitted 12 May, 2024; v1 submitted 23 June, 2022;
originally announced June 2022.
-
An adversarially robust data-market for spatial, crowd-sourced data
Authors:
Aida Manzano Kharman,
Christian Jursitzky,
Quan Zhou,
Pietro Ferraro,
Jakub Marecek,
Pierre Pinson,
Robert Shorten
Abstract:
We describe an architecture for a decentralised data market for applications in which agents are incentivised to collaborate to crowd-source their data. The architecture is designed to reward data that furthers the market's collective goal, and distributes reward fairly to all those that contribute with their data. We show that the architecture is resilient to Sybil, wormhole, and data poisoning attacks. In order to evaluate the resilience of the architecture, we characterise its breakdown points for various adversarial threat models in an automotive use case.
Submitted 17 October, 2023; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Polynomial Matrix Inequalities within Tame Geometry
Authors:
Christos Aravanis,
Johannes Aspman,
Georgios Korpas,
Jakub Marecek
Abstract:
Polynomial matrix inequalities can be solved using hierarchies of convex relaxations, pioneered by Henrion and Lasserre. In some cases, this might not be practical, and one may need to resort to methods with local convergence guarantees, whose development has so far been rather ad hoc. In this paper, we explore several alternative approaches to the problem, with non-trivial guarantees available using results from tame geometry.
Submitted 8 June, 2022;
originally announced June 2022.