Showing 1–45 of 45 results for author: Gabor, T

Searching in archive cs.
  1. arXiv:2504.14645

    cs.LG cs.AI

    Surrogate Fitness Metrics for Interpretable Reinforcement Learning

    Authors: Philipp Altmann, Céline Davignon, Maximilian Zorn, Fabian Ritz, Claudia Linnhoff-Popien, Thomas Gabor

    Abstract: We employ an evolutionary optimization framework that perturbs initial states to generate informative and diverse policy demonstrations. A joint surrogate fitness function guides the optimization by combining local diversity, behavioral certainty, and global population diversity. To assess demonstration quality, we apply a set of evaluation metrics, including the reward-based optimality gap, fidel…

    Submitted 20 April, 2025; originally announced April 2025.

    Comments: 30 pages, 7 figures, under review

  2. arXiv:2504.06413

    quant-ph cs.AI

    Evaluating Mutation Techniques in Genetic Algorithm-Based Quantum Circuit Synthesis

    Authors: Michael Kölle, Tom Bintener, Maximilian Zorn, Gerhard Stenzel, Leo Sünkel, Thomas Gabor, Claudia Linnhoff-Popien

    Abstract: Quantum computing leverages the unique properties of qubits and quantum parallelism to solve problems intractable for classical systems, offering unparalleled computational potential. However, the optimization of quantum circuits remains critical, especially for noisy intermediate-scale quantum (NISQ) devices with limited qubits and high error rates. Genetic algorithms (GAs) provide a promising ap…

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: Accepted at GECCO 2025

  3. arXiv:2412.07686

    cs.RO cs.AI cs.LG

    Optimizing Sensor Redundancy in Sequential Decision-Making Problems

    Authors: Jonas Nüßlein, Maximilian Zorn, Fabian Ritz, Jonas Stein, Gerhard Stenzel, Julian Schönberger, Thomas Gabor, Claudia Linnhoff-Popien

    Abstract: Reinforcement Learning (RL) policies are designed to predict actions based on current observations to maximize cumulative future rewards. In real-world applications (i.e., non-simulated environments), sensors are essential for measuring the current state and providing the observations on which RL policies rely to make decisions. A significant challenge in deploying RL policies in real-world scenar…

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: Accepted at ICAART conference 2025

  4. arXiv:2411.04658

    cs.NE

    Finding Strong Lottery Ticket Networks with Genetic Algorithms

    Authors: Philipp Altmann, Julian Schönberger, Maximilian Zorn, Thomas Gabor

    Abstract: According to the Strong Lottery Ticket Hypothesis, every sufficiently large neural network with randomly initialized weights contains a sub-network which - still with its random weights - already performs as well for a given task as the trained super-network. We present the first approach based on a genetic algorithm to find such strong lottery ticket sub-networks without training or otherwise com…

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: 12 pages, 7 figures, 5 tables, accepted for publication at the 16th International Joint Conference on Computational Intelligence (IJCCI 2024)

  5. arXiv:2408.04514

    cs.MA

    Emergence in Multi-Agent Systems: A Safety Perspective

    Authors: Philipp Altmann, Julian Schönberger, Steffen Illium, Maximilian Zorn, Fabian Ritz, Tom Haider, Simon Burton, Thomas Gabor

    Abstract: Emergent effects can arise in multi-agent systems (MAS) where execution is decentralized and reliant on local information. These effects may range from minor deviations in behavior to catastrophic system failures. To formally define these effects, we identify misalignments between the global inherent specification (the true specification) and its local approximation (such as the configuration of d…

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: 18 pages, 3 figures, accepted for publication at the International Symposium on Leveraging Applications of Formal Methods (ISoLA 2024)

  6. arXiv:2408.01187

    quant-ph cs.AI cs.LG

    Optimizing Variational Quantum Circuits Using Metaheuristic Strategies in Reinforcement Learning

    Authors: Michael Kölle, Daniel Seidl, Maximilian Zorn, Philipp Altmann, Jonas Stein, Thomas Gabor

    Abstract: Quantum Reinforcement Learning (QRL) offers potential advantages over classical Reinforcement Learning, such as compact state space representation and faster convergence in certain scenarios. However, practical benefits require further validation. QRL faces challenges like flat solution landscapes, where traditional gradient-based methods are inefficient, necessitating the use of gradient-free alg…

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: Accepted at QCE24 - QCRL24 Workshop

  7. arXiv:2405.12354

    quant-ph cs.AI cs.LG

    A Study on Optimization Techniques for Variational Quantum Circuits in Reinforcement Learning

    Authors: Michael Kölle, Timo Witter, Tobias Rohe, Gerhard Stenzel, Philipp Altmann, Thomas Gabor

    Abstract: Quantum Computing aims to streamline machine learning, making it more effective with fewer trainable parameters. This reduction of parameters can speed up the learning process and reduce the use of computational resources. However, in the current phase of quantum computing development, known as the noisy intermediate-scale quantum era (NISQ), learning is difficult due to a limited number of qubits…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Accepted at QSW 2024

  8. arXiv:2405.09272

    quant-ph cs.ET

    Using an Evolutionary Algorithm to Create (MAX)-3SAT QUBOs

    Authors: Sebastian Zielinski, Maximilian Zorn, Thomas Gabor, Sebastian Feld, Claudia Linnhoff-Popien

    Abstract: A common way of solving satisfiability instances with quantum methods is to transform these instances into instances of QUBO, which in itself is a potentially difficult and expensive task. State-of-the-art transformations from MAX-3SAT to QUBO currently work by mapping clauses of a 3SAT formula associated with the MAX-3SAT instance to an instance of QUBO and combining the resulting QUBOs into a si…

    Submitted 15 May, 2024; originally announced May 2024.

  9. arXiv:2404.09213

    quant-ph cs.LG

    Qandle: Accelerating State Vector Simulation Using Gate-Matrix Caching and Circuit Splitting

    Authors: Gerhard Stenzel, Sebastian Zielinski, Michael Kölle, Philipp Altmann, Jonas Nüßlein, Thomas Gabor

    Abstract: To address the computational complexity associated with state-vector simulation for quantum circuits, we propose a combination of advanced techniques to accelerate circuit execution. Quantum gate matrix caching reduces the overhead of repeated applications of the Kronecker product when applying a gate matrix to the state vector by storing decomposed partial matrices for each gate. Circuit splittin…

    Submitted 14 April, 2024; originally announced April 2024.

  10. arXiv:2404.03359

    cs.LG cs.AI cs.NE

    REACT: Revealing Evolutionary Action Consequence Trajectories for Interpretable Reinforcement Learning

    Authors: Philipp Altmann, Céline Davignon, Maximilian Zorn, Fabian Ritz, Claudia Linnhoff-Popien, Thomas Gabor

    Abstract: To enhance the interpretability of Reinforcement Learning (RL), we propose Revealing Evolutionary Action Consequence Trajectories (REACT). In contrast to the prevalent practice of validating RL models based on their optimal behavior learned during training, we posit that considering a range of edge-case trajectories provides a more comprehensive understanding of their inherent behavior. To induce…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 12 pages, 12 figures

  11. arXiv:2312.11337

    quant-ph cs.LG

    Challenges for Reinforcement Learning in Quantum Circuit Design

    Authors: Philipp Altmann, Jonas Stein, Michael Kölle, Adelina Bärligea, Thomas Gabor, Thomy Phan, Sebastian Feld, Claudia Linnhoff-Popien

    Abstract: Quantum computing (QC) in the current NISQ era is still limited in size and precision. Hybrid applications mitigating those shortcomings are prevalent to gain early insight and advantages. Hybrid quantum machine learning (QML) comprises both the application of QC to improve machine learning (ML) and ML to improve QC architectures. This work considers the latter, leveraging reinforcement learning (…

    Submitted 8 August, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 11 pages, 4 figures, accepted for publication at the 2024 IEEE International Conference on Quantum Computing and Engineering (QCE)

  12. arXiv:2311.15966

    quant-ph cs.ET cs.LG eess.IV

    Towards Transfer Learning for Large-Scale Image Classification Using Annealing-based Quantum Boltzmann Machines

    Authors: Daniëlle Schuman, Leo Sünkel, Philipp Altmann, Jonas Stein, Christoph Roch, Thomas Gabor, Claudia Linnhoff-Popien

    Abstract: Quantum Transfer Learning (QTL) recently gained popularity as a hybrid quantum-classical approach for image classification tasks by efficiently combining the feature extraction capabilities of large Convolutional Neural Networks with the potential benefits of Quantum Machine Learning (QML). Existing approaches, however, only utilize gate-based Variational Quantum Circuits for the quantum part of t…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 7 pages, 3 figures (5 if counting subfigures), 1 table. To be published in the proceedings of the 2023 IEEE International Conference on Quantum Computing and Engineering (QCE)

  13. Adaptation of Tongue Ultrasound-Based Silent Speech Interfaces Using Spatial Transformer Networks

    Authors: László Tóth, Amin Honarmandi Shandiz, Gábor Gosztolya, Csapó Tamás Gábor

    Abstract: Thanks to the latest deep learning algorithms, silent speech interfaces (SSI) are now able to synthesize intelligible speech from articulatory movement data under certain conditions. However, the resulting models are rather speaker-specific, making a quick switch between users troublesome. Even for the same speaker, these models perform poorly cross-session, i.e. after dismounting and re-mounting…

    Submitted 17 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: 5 pages, 3 figures, 3 tables

    Journal ref: the Proceedings of Interspeech 2023

  14. arXiv:2302.03536

    quant-ph cs.ET

    Solving (Max) 3-SAT via Quadratic Unconstrained Binary Optimization

    Authors: Jonas Nüßlein, Sebastian Zielinski, Thomas Gabor, Claudia Linnhoff-Popien, Sebastian Feld

    Abstract: We introduce a novel approach to translate arbitrary 3-SAT instances to Quadratic Unconstrained Binary Optimization (QUBO) as they are used by quantum annealing (QA) or the quantum approximate optimization algorithm (QAOA). Our approach requires fewer couplings and fewer physical qubits than the current state-of-the-art, which results in higher solution quality. We verified the practical applicabi…

    Submitted 7 February, 2023; originally announced February 2023.

  15. arXiv:2301.07421

    cs.LG

    DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training

    Authors: Philipp Altmann, Thomy Phan, Fabian Ritz, Thomas Gabor, Claudia Linnhoff-Popien

    Abstract: We propose discriminative reward co-training (DIRECT) as an extension to deep reinforcement learning algorithms. Building upon the concept of self-imitation learning (SIL), we introduce an imitation buffer to store beneficial trajectories generated by the policy determined by their return. A discriminator network is trained concurrently to the policy to distinguish between trajectories generated b…

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: 9 pages, 10 figures, under review

    ACM Class: I.2.6

  16. arXiv:2301.01649

    cs.MA

    Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability

    Authors: Thomy Phan, Fabian Ritz, Philipp Altmann, Maximilian Zorn, Jonas Nüßlein, Michael Kölle, Thomas Gabor, Claudia Linnhoff-Popien

    Abstract: Stochastic partial observability poses a major challenge for decentralized coordination in multi-agent reinforcement learning but is largely neglected in state-of-the-art research due to a strong focus on state-based centralized training for decentralized execution (CTDE) and benchmarks that lack sufficient stochasticity like StarCraft Multi-Agent Challenge (SMAC). In this paper, we propose Attent…

    Submitted 27 December, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: Accepted to ICML 2023

  17. Empirical Analysis of Limits for Memory Distance in Recurrent Neural Networks

    Authors: Steffen Illium, Thore Schillman, Robert Müller, Thomas Gabor, Claudia Linnhoff-Popien

    Abstract: Common to all different kinds of recurrent neural networks (RNNs) is the intention to model relations between data points through time. When there is no immediate relationship between subsequent data points (e.g., when the data points are generated at random), we show that RNNs are still able to remember a few data points back into the sequence by memorizing them by heart using standard backp…

    Submitted 20 December, 2022; originally announced December 2022.

  18. arXiv:2212.10078

    cs.NE cs.LG

    Constructing Organism Networks from Collaborative Self-Replicators

    Authors: Steffen Illium, Maximilian Zorn, Cristian Lenta, Michael Kölle, Claudia Linnhoff-Popien, Thomas Gabor

    Abstract: We introduce organism networks, which function like a single neural network but are composed of several neural particle networks; while each particle network fulfils the role of a single weight application within the organism network, it is also trained to self-replicate its own weights. As organism networks feature vastly more parameters than simpler architectures, we perform our initial experime…

    Submitted 27 February, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: 2023-02-27 fixed one typo in NN formula

  19. Capturing Dependencies within Machine Learning via a Formal Process Model

    Authors: Fabian Ritz, Thomy Phan, Andreas Sedlmeier, Philipp Altmann, Jan Wieghardt, Reiner Schmid, Horst Sauer, Cornel Klein, Claudia Linnhoff-Popien, Thomas Gabor

    Abstract: The development of Machine Learning (ML) models is more than just a special case of software development (SD): ML models acquire properties and fulfill requirements even without direct human interaction in a seemingly uncontrollable manner. Nonetheless, the underlying processes can be described in a formal way. We define a comprehensive SD process model for ML that encompasses most tasks and artif…

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: 10 pages, 5 figures, draft; the final version will appear in the proceedings of the International Symposium on Leveraging Applications of Formal Methods (ISoLA) 2022

    Journal ref: ISoLA 2022: Leveraging Applications of Formal Methods, Verification and Validation. Adaptation and Learning. pp 249-265

  20. arXiv:2206.12510

    cs.LG

    Black Box Optimization Using QUBO and the Cross Entropy Method

    Authors: Jonas Nüßlein, Christoph Roch, Thomas Gabor, Jonas Stein, Claudia Linnhoff-Popien, Sebastian Feld

    Abstract: Black-box optimization (BBO) can be used to optimize functions whose analytic form is unknown. A common approach to realising BBO is to learn a surrogate model which approximates the target black-box function which can then be solved via white-box optimization methods. In this paper, we present our approach BOX-QUBO, where the surrogate model is a QUBO matrix. However, unlike in previous state-of-…

    Submitted 9 February, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

  21. arXiv:2206.05827

    cs.LG

    Case-Based Inverse Reinforcement Learning Using Temporal Coherence

    Authors: Jonas Nüßlein, Steffen Illium, Robert Müller, Thomas Gabor, Claudia Linnhoff-Popien

    Abstract: Providing expert trajectories in the context of Imitation Learning is often expensive and time-consuming. The goal must therefore be to create algorithms which require as little expert data as possible. In this paper we present an algorithm that imitates the higher-level strategy of the expert rather than just imitating the expert on action level, which we hypothesize requires less expert data and…

    Submitted 12 June, 2022; originally announced June 2022.

    Comments: accepted at ICCBR

  22. Algorithmic QUBO Formulations for k-SAT and Hamiltonian Cycles

    Authors: Jonas Nüßlein, Thomas Gabor, Claudia Linnhoff-Popien, Sebastian Feld

    Abstract: Quadratic unconstrained binary optimization (QUBO) can be seen as a generic language for optimization problems. QUBOs attract particular attention since they can be solved with quantum hardware, like quantum annealers or quantum gate computers running QAOA. In this paper, we present two novel QUBO formulations for $k$-SAT and Hamiltonian Cycles that scale significantly better than existing approac…

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: Accepted at GECCO 2022

  23. arXiv:2109.10617

    cs.AI cs.LG cs.NE

    Solving Large Steiner Tree Problems in Graphs for Cost-Efficient Fiber-To-The-Home Network Expansion

    Authors: Tobias Müller, Kyrill Schmid, Daniëlle Schuman, Thomas Gabor, Markus Friedrich, Marc Geitz

    Abstract: The expansion of Fiber-To-The-Home (FTTH) networks creates high costs due to expensive excavation procedures. Optimizing the planning process and minimizing the cost of the earth excavation work therefore lead to large savings. Mathematically, the FTTH network problem can be described as a minimum Steiner Tree problem. Even though the Steiner Tree problem has already been investigated intensively…

    Submitted 24 November, 2021; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: Accepted at ICAART 2022, 10 pages, 18 figures

  24. SAT-MARL: Specification Aware Training in Multi-Agent Reinforcement Learning

    Authors: Fabian Ritz, Thomy Phan, Robert Müller, Thomas Gabor, Andreas Sedlmeier, Marc Zeller, Jan Wieghardt, Reiner Schmid, Horst Sauer, Cornel Klein, Claudia Linnhoff-Popien

    Abstract: A characteristic of reinforcement learning is the ability to develop unforeseen strategies when solving problems. While such strategies sometimes yield superior performance, they may also result in undesired or even dangerous behavior. In industrial scenarios, a system's behavior also needs to be predictable and lie within defined ranges. To enable the agents to learn (how) to align with a given s…

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: 9 pages, 5 figures; accepted as a full paper at ICAART 2021 (http://www.icaart.org/)

    Journal ref: Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, 28-37, 2021

  25. arXiv:2004.14036

    quant-ph cs.LG

    Insights on Training Neural Networks for QUBO Tasks

    Authors: Thomas Gabor, Sebastian Feld, Hila Safi, Thomy Phan, Claudia Linnhoff-Popien

    Abstract: Current hardware limitations restrict the potential when solving quadratic unconstrained binary optimization (QUBO) problems via the quantum approximate optimization algorithm (QAOA) or quantum annealing (QA). Thus, we consider training neural networks in this context. We first discuss QUBO problems that originate from translated instances of the traveling salesman problem (TSP): Analyzing this re…

    Submitted 29 April, 2020; originally announced April 2020.

    Comments: 6 pages, 5 figures, accepted at the 1st International Workshop on Quantum Software Engineering (Q-SE 2020) at ICSE 2020 and to be published in the corresponding proceedings

  26. arXiv:2004.14035

    quant-ph cs.AI

    The Holy Grail of Quantum Artificial Intelligence: Major Challenges in Accelerating the Machine Learning Pipeline

    Authors: Thomas Gabor, Leo Sünkel, Fabian Ritz, Thomy Phan, Lenz Belzner, Christoph Roch, Sebastian Feld, Claudia Linnhoff-Popien

    Abstract: We discuss the synergetic connection between quantum computing and artificial intelligence. After surveying current approaches to quantum artificial intelligence and relating them to a formal model for machine learning processes, we deduce four major challenges for the future of quantum artificial intelligence: (i) Replace iterative training with faster quantum algorithms, (ii) distill the experie…

    Submitted 29 April, 2020; originally announced April 2020.

    Comments: 6 pages, 4 figures, accepted at the 1st International Workshop on Quantum Software Engineering (Q-SE 2020) at ICSE 2020 and to be published in the corresponding proceedings

  27. arXiv:2001.00496

    cs.LG cs.AI stat.ML

    Uncertainty-Based Out-of-Distribution Classification in Deep Reinforcement Learning

    Authors: Andreas Sedlmeier, Thomas Gabor, Thomy Phan, Lenz Belzner, Claudia Linnhoff-Popien

    Abstract: Robustness to out-of-distribution (OOD) data is an important goal in building reliable machine learning systems. Especially in autonomous systems, wrong predictions for OOD inputs can cause safety critical situations. As a first step towards a solution, we consider the problem of detecting such data in a value-based deep reinforcement learning (RL) setting. Modelling this problem as a one-class cl…

    Submitted 31 December, 2019; originally announced January 2020.

    Comments: arXiv admin note: text overlap with arXiv:1901.02219

    Journal ref: Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, 2020, ISBN 978-989-758-395-7, pages 522-529

  28. arXiv:1912.06032

    quant-ph cs.LG

    Integration and Evaluation of Quantum Accelerators for Data-Driven User Functions

    Authors: Thomas Hubregtsen, Christoph Segler, Josef Pichlmeier, Aritra Sarkar, Thomas Gabor, Koen Bertels

    Abstract: Quantum computers hold great promise for accelerating computationally challenging algorithms on noisy intermediate-scale quantum (NISQ) devices in the upcoming years. Much attention of the current research is directed to algorithmic research on artificial data that is disconnected from live systems, such as optimization of systems or training of learning algorithms. In this paper we investigate th…

    Submitted 25 January, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: 6 pages, accepted to ISQED 2020

  29. arXiv:1908.02880

    cs.NE

    Benchmarking Surrogate-Assisted Genetic Recommender Systems

    Authors: Thomas Gabor, Philipp Altmann

    Abstract: We propose a new approach for building recommender systems by adapting surrogate-assisted interactive genetic algorithms. A pool of user-evaluated items is used to construct an approximative model which serves as a surrogate fitness function in a genetic algorithm for optimizing new suggestions. The surrogate is used to recommend new items to the user, which are then evaluated according to the use…

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: Proceedings of the Genetic and Evolutionary Computation Conference Companion. ACM, 2019

  30. arXiv:1907.05861

    cs.AI

    Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning

    Authors: Thomy Phan, Thomas Gabor, Robert Müller, Christoph Roch, Claudia Linnhoff-Popien

    Abstract: We propose Stable Yet Memory Bounded Open-Loop (SYMBOL) planning, a general memory bounded approach to partially observable open-loop planning. SYMBOL maintains an adaptive stack of Thompson Sampling bandits, whose size is bounded by the planning horizon and can be automatically adapted according to the underlying domain without any prior domain knowledge beyond a generative model. We empirically…

    Submitted 27 December, 2023; v1 submitted 11 July, 2019; originally announced July 2019.

    Comments: Accepted to IJCAI 2019. arXiv admin note: substantial text overlap with arXiv:1905.04020

  31. arXiv:1905.04077

    cs.MA cs.AI

    Emergent Escape-based Flocking Behavior using Multi-Agent Reinforcement Learning

    Authors: Carsten Hahn, Thomy Phan, Thomas Gabor, Lenz Belzner, Claudia Linnhoff-Popien

    Abstract: In nature, flocking or swarm behavior is observed in many species as it has beneficial properties like reducing the probability of being caught by a predator. In this paper, we propose SELFish (Swarm Emergent Learning Fish), an approach with multiple autonomous agents which can freely move in a continuous space with the objective to avoid being caught by a present predator. The predator has the pr…

    Submitted 10 May, 2019; originally announced May 2019.

    Comments: Accepted at ALIFE 2019

  32. arXiv:1903.06454

    cs.GT

    A Quantum Annealing Algorithm for Finding Pure Nash Equilibria in Graphical Games

    Authors: Christoph Roch, Thomy Phan, Sebastian Feld, Robert Müller, Thomas Gabor, Claudia Linnhoff-Popien

    Abstract: We introduce Q-Nash, a quantum annealing algorithm for the NP-complete problem of finding pure Nash equilibria in graphical games. The algorithm consists of two phases. The first phase determines all combinations of best response strategies for each player using classical computation. The second phase finds pure Nash equilibria using a quantum annealing device by mapping the computed combinations t…

    Submitted 20 August, 2020; v1 submitted 15 March, 2019; originally announced March 2019.

  33. arXiv:1902.04703

    cs.ET cs.CC quant-ph

    Assessing Solution Quality of 3SAT on a Quantum Annealing Platform

    Authors: Thomas Gabor, Sebastian Zielinski, Sebastian Feld, Christoph Roch, Christian Seidel, Florian Neukart, Isabella Galter, Wolfgang Mauerer, Claudia Linnhoff-Popien

    Abstract: When solving propositional logic satisfiability (specifically 3SAT) using quantum annealing, we analyze the effect the difficulty of different instances of the problem has on the quality of the answer returned by the quantum annealer. A high-quality response from the annealer in this case is defined by a high percentage of correct solutions among the returned answers. We show that the phase transi…

    Submitted 12 February, 2019; originally announced February 2019.

    Comments: 13 pages, published at QTOP 2019

  34. Adapting Quality Assurance to Adaptive Systems: The Scenario Coevolution Paradigm

    Authors: Thomas Gabor, Marie Kiermeier, Andreas Sedlmeier, Bernhard Kempter, Cornel Klein, Horst Sauer, Reiner Schmid, Jan Wieghardt

    Abstract: From formal and practical analysis, we identify new challenges that self-adaptive systems pose to the process of quality assurance. When tackling these, the effort spent on various tasks in the process of software engineering is naturally re-distributed. We claim that all steps related to testing need to become self-adaptive to match the capabilities of the self-adaptive system-under-test. Otherwi…

    Submitted 12 February, 2019; originally announced February 2019.

    Comments: 17 pages, published at ISOLA 2018

    Journal ref: International Symposium on Leveraging Applications of Formal Methods (ISOLA). Springer, 2018

  35. arXiv:1901.08761

    cs.AI cs.MA

    Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies

    Authors: Thomy Phan, Kyrill Schmid, Lenz Belzner, Thomas Gabor, Sebastian Feld, Claudia Linnhoff-Popien

    Abstract: Decision making in multi-agent systems (MAS) is a great challenge due to enormous state and joint action spaces as well as uncertainty, making centralized control generally infeasible. Decentralized control offers better scalability and robustness but requires mechanisms to coordinate on joint tasks and to avoid conflicts. Common approaches to learn decentralized policies for cooperative MAS suffe…

    Submitted 25 January, 2019; originally announced January 2019.

  36. arXiv:1901.02219

    cs.LG cs.AI stat.ML

    Uncertainty-Based Out-of-Distribution Detection in Deep Reinforcement Learning

    Authors: Andreas Sedlmeier, Thomas Gabor, Thomy Phan, Lenz Belzner, Claudia Linnhoff-Popien

    Abstract: We consider the problem of detecting out-of-distribution (OOD) samples in deep reinforcement learning. In a value based reinforcement learning setting, we propose to use uncertainty estimation techniques directly on the agent's value estimating neural network to detect OOD samples. The focus of our work lies in analyzing the suitability of approximate Bayesian inference methods and related ensembl…

    Submitted 8 January, 2019; originally announced January 2019.

  37. Preparing for the Unexpected: Diversity Improves Planning Resilience in Evolutionary Algorithms

    Authors: Thomas Gabor, Lenz Belzner, Thomy Phan, Kyrill Schmid

    Abstract: As automatic optimization techniques find their way into industrial applications, the behavior of many complex systems is determined by some form of planner picking the right actions to optimize a given objective function. In many cases, the mapping of plans to objective reward may change due to unforeseen events or circumstances in the real world. In those cases, the planner usually needs some ad…

    Submitted 29 October, 2018; originally announced October 2018.

    Comments: ICAC, 2018, Trento

  38. Inheritance-Based Diversity Measures for Explicit Convergence Control in Evolutionary Algorithms

    Authors: Thomas Gabor, Lenz Belzner, Claudia Linnhoff-Popien

    Abstract: Diversity is an important factor in evolutionary algorithms to prevent premature convergence towards a single local optimum. In order to maintain diversity throughout the process of evolution, various means exist in literature. We analyze approaches to diversity that (a) have an explicit and quantifiable influence on fitness at the individual level and (b) require no (or very little) additional do…

    Submitted 29 October, 2018; originally announced October 2018.

    Comments: GECCO '18: Genetic and Evolutionary Computation Conference, 2018, Kyoto, Japan

  39. arXiv:1804.10781

    cs.MA

    The Sharer's Dilemma in Collective Adaptive Systems of Self-Interested Agents

    Authors: Lenz Belzner, Kyrill Schmid, Thomy Phan, Thomas Gabor, Martin Wirsing

    Abstract: In collective adaptive systems (CAS), adaptation can be implemented by optimization with respect to utility. Agents in a CAS may be self-interested, while their utilities may depend on other agents' choices. Independent optimization of agent utilities may yield poor individual and global reward due to locally interfering individual preferences. Joint optimization may scale poorly, and is impossible if agents…

    Submitted 28 April, 2018; originally announced April 2018.

  40. arXiv:1804.06311

    cs.MA

    Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation

    Authors: Thomy Phan, Lenz Belzner, Thomas Gabor, Kyrill Schmid

    Abstract: Making decisions is a great challenge in distributed autonomous environments due to enormous state spaces and uncertainty. Many online planning algorithms rely on statistical sampling to avoid searching the whole state space, while still being able to make acceptable decisions. However, planning often has to be performed under strict computational constraints making online planning in multi-agent…

    Submitted 27 December, 2023; v1 submitted 17 April, 2018; originally announced April 2018.

    Comments: Accepted to AAMAS 2018

  41. Genealogical Distance as a Diversity Estimate in Evolutionary Algorithms

    Authors: Thomas Gabor, Lenz Belzner

    Abstract: The evolutionary edit distance between two individuals in a population, i.e., the amount of applications of any genetic operator it would take the evolutionary process to generate one individual starting from the other, seems like a promising estimate for the diversity between said individuals. We introduce genealogical diversity, i.e., estimating two individuals' degree of relatedness by analyzin…

    Submitted 27 April, 2017; originally announced April 2017.

    Comments: Measuring and Promoting Diversity in Evolutionary Algorithms @ GECCO 2017

  42. QoS-Aware Multi-Armed Bandits

    Authors: Lenz Belzner, Thomas Gabor

    Abstract: Motivated by runtime verification of QoS requirements in self-adaptive and self-organizing systems that are able to reconfigure their structure and behavior in response to runtime data, we propose a QoS-aware variant of Thompson sampling for multi-armed bandits. It is applicable in settings where QoS satisfaction of an arm has to be ensured with high confidence efficiently, rather than finding the…

    Submitted 28 February, 2017; originally announced March 2017.

    Comments: Accepted at IEEE Workshop on Quality Assurance for Self-adaptive Self-organising Systems, FAS* 2016

  43. arXiv:1702.08726  [pdf, other]

    cs.SE cs.AI eess.SY

    Stacked Thompson Bandits

    Authors: Lenz Belzner, Thomas Gabor

    Abstract: We introduce Stacked Thompson Bandits (STB) for efficiently generating plans that are likely to satisfy a given bounded temporal logic requirement. STB uses a simulation for evaluation of plans, and takes a Bayesian approach to using the resulting information to guide its search. In particular, we show that stacking multiarmed bandits and using Thompson sampling to guide the action selection proce…

    Submitted 28 February, 2017; originally announced February 2017.

    Comments: Accepted at SEsCPS @ ICSE 2017

  44. arXiv:1702.08725  [pdf, other]

    cs.SE cs.AI

    Bayesian Verification under Model Uncertainty

    Authors: Lenz Belzner, Thomas Gabor

    Abstract: Machine learning enables systems to build and update domain models based on runtime observations. In this paper, we study statistical model checking and runtime verification for systems with this ability. Two challenges arise: (1) Models built from limited runtime data yield uncertainty to be dealt with. (2) There is no definition of satisfaction w.r.t. uncertain hypotheses. We propose such a defi…

    Submitted 28 February, 2017; originally announced February 2017.

    Comments: Accepted at SEsCPS @ ICSE 2017

  45. arXiv:1702.07544  [pdf, other]

    cs.MA eess.SY

    Scalable Multiagent Coordination with Distributed Online Open Loop Planning

    Authors: Lenz Belzner, Thomas Gabor

    Abstract: We propose distributed online open loop planning (DOOLP), a general framework for online multiagent coordination and decision making under uncertainty. DOOLP is based on online heuristic search in the space defined by a generative model of the domain dynamics, which is exploited by agents to simulate and evaluate the consequences of their potential choices. We also propose distributed online Tho…

    Submitted 24 February, 2017; originally announced February 2017.