-
Operating two exchange-only qubits in parallel
Authors:
Mateusz T. Mądzik,
Florian Luthi,
Gian Giacomo Guerreschi,
Fahd A. Mohiyaddin,
Felix Borjans,
Jason D. Chadwick,
Matthew J. Curry,
Joshua Ziegler,
Sarah Atanasov,
Peter L. Bavdaz,
Elliot J. Connors,
J. Corrigan,
H. Ekmel Ercan,
Robert Flory,
Hubert C. George,
Benjamin Harpt,
Eric Henry,
Mohammad M. Islam,
Nader Khammassi,
Daniel Keith,
Lester F. Lampert,
Todor M. Mladenov,
Randy W. Morris,
Aditi Nethwewala,
Samuel Neyens
, et al. (16 additional authors not shown)
Abstract:
Semiconductors are among the most promising platforms to implement large-scale quantum computers, as advanced manufacturing techniques allow fabrication of large quantum dot arrays. Various qubit encodings can be used to store and manipulate quantum information on these quantum dot arrays. Regardless of qubit encoding, precise control over the exchange interaction between electrons confined in qua…
▽ More
Semiconductors are among the most promising platforms to implement large-scale quantum computers, as advanced manufacturing techniques allow fabrication of large quantum dot arrays. Various qubit encodings can be used to store and manipulate quantum information on these quantum dot arrays. Regardless of qubit encoding, precise control over the exchange interaction between electrons confined in quantum dots in the array is critical. Furthermore, it is necessary to execute high-fidelity quantum operations concurrently to make full use of the limited coherence of individual qubits. Here, we demonstrate the parallel operation of two exchange-only qubits, consisting of six quantum dots in a linear arrangement. Using randomized benchmarking techniques, we show that issuing pulses on the five barrier gates to modulate exchange interactions in a maximally parallel way maintains the quality of qubit control relative to sequential operation. The techniques developed to perform parallel exchange pulses can be readily adapted to other quantum-dot based encodings. Moreover, we show the first experimental demonstrations of an iSWAP gate and of a charge-locking Pauli spin blockade readout method. The results are validated using cross-entropy benchmarking, a technique useful for performance characterization of larger quantum computing systems; here it is used for the first time on a quantum system based on semiconductor technology.
△ Less
Submitted 1 April, 2025;
originally announced April 2025.
-
Report on the Workshop on Simulations for Information Access (Sim4IA 2024) at SIGIR 2024
Authors:
Timo Breuer,
Christin Katharina Kreutz,
Norbert Fuhr,
Krisztian Balog,
Philipp Schaer,
Nolwenn Bernard,
Ingo Frommholz,
Marcel Gohsen,
Kaixin Ji,
Gareth J. F. Jones,
Jüri Keller,
Jiqun Liu,
Martin Mladenov,
Gabriella Pasi,
Johanne Trippas,
Xi Wang,
Saber Zerhoudi,
ChengXiang Zhai
Abstract:
This paper is a report of the Workshop on Simulations for Information Access (Sim4IA) workshop at SIGIR 2024. The workshop had two keynotes, a panel discussion, nine lightning talks, and two breakout sessions. Key takeaways were user simulation's importance in academia and industry, the possible bridging of online and offline evaluation, and the issues of organizing a companion shared task around…
▽ More
This paper is a report of the Workshop on Simulations for Information Access (Sim4IA) workshop at SIGIR 2024. The workshop had two keynotes, a panel discussion, nine lightning talks, and two breakout sessions. Key takeaways were user simulation's importance in academia and industry, the possible bridging of online and offline evaluation, and the issues of organizing a companion shared task around user simulations for information access. We report on how we organized the workshop, provide a brief overview of what happened at the workshop, and summarize the main topics and findings of the workshop and future work.
△ Less
Submitted 26 September, 2024;
originally announced September 2024.
-
Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
Authors:
Chih-Wei Hsu,
Martin Mladenov,
Ofer Meshi,
James Pine,
Hubert Pham,
Shane Li,
Xujian Liang,
Anton Polishko,
Li Yang,
Ben Scheetz,
Craig Boutilier
Abstract:
Evaluation of policies in recommender systems typically involves A/B testing using live experiments on real users to assess a new policy's impact on relevant metrics. This ``gold standard'' comes at a high cost, however, in terms of cycle time, user cost, and potential user retention. In developing policies for ``onboarding'' new users, these costs can be especially problematic, since on-boarding…
▽ More
Evaluation of policies in recommender systems typically involves A/B testing using live experiments on real users to assess a new policy's impact on relevant metrics. This ``gold standard'' comes at a high cost, however, in terms of cycle time, user cost, and potential user retention. In developing policies for ``onboarding'' new users, these costs can be especially problematic, since on-boarding occurs only once. In this work, we describe a simulation methodology used to augment (and reduce) the use of live experiments. We illustrate its deployment for the evaluation of ``preference elicitation'' algorithms used to onboard new users of the YouTube Music platform. By developing counterfactually robust user behavior models, and a simulation service that couples such models with production infrastructure, we are able to test new algorithms in a way that reliably predicts their performance on key metrics when deployed live. We describe our domain, our simulation models and platform, results of experiments and deployment, and suggest future steps needed to further realistic simulation as a powerful complement to live experiments.
△ Less
Submitted 25 September, 2024;
originally announced September 2024.
-
Demystifying Embedding Spaces using Large Language Models
Authors:
Guy Tennenholtz,
Yinlam Chow,
Chih-Wei Hsu,
Jihwan Jeong,
Lior Shani,
Azamat Tulepbergenov,
Deepak Ramachandran,
Martin Mladenov,
Craig Boutilier
Abstract:
Embeddings have become a pivotal means to represent complex, multi-faceted information about entities, concepts, and relationships in a condensed and useful format. Nevertheless, they often preclude direct interpretation. While downstream tasks make use of these compressed representations, meaningful interpretation usually requires visualization using dimensionality reduction or specialized machin…
▽ More
Embeddings have become a pivotal means to represent complex, multi-faceted information about entities, concepts, and relationships in a condensed and useful format. Nevertheless, they often preclude direct interpretation. While downstream tasks make use of these compressed representations, meaningful interpretation usually requires visualization using dimensionality reduction or specialized machine learning interpretability methods. This paper addresses the challenge of making such embeddings more interpretable and broadly useful, by employing Large Language Models (LLMs) to directly interact with embeddings -- transforming abstract vectors into understandable narratives. By injecting embeddings into LLMs, we enable querying and exploration of complex embedding data. We demonstrate our approach on a variety of diverse tasks, including: enhancing concept activation vectors (CAVs), communicating novel embedded entities, and decoding user preferences in recommender systems. Our work couples the immense information potential of embeddings with the interpretative power of LLMs.
△ Less
Submitted 13 March, 2024; v1 submitted 6 October, 2023;
originally announced October 2023.
-
Modeling Recommender Ecosystems: Research Challenges at the Intersection of Mechanism Design, Reinforcement Learning and Generative Models
Authors:
Craig Boutilier,
Martin Mladenov,
Guy Tennenholtz
Abstract:
Modern recommender systems lie at the heart of complex ecosystems that couple the behavior of users, content providers, advertisers, and other actors. Despite this, the focus of the majority of recommender research -- and most practical recommenders of any import -- is on the local, myopic optimization of the recommendations made to individual users. This comes at a significant cost to the long-te…
▽ More
Modern recommender systems lie at the heart of complex ecosystems that couple the behavior of users, content providers, advertisers, and other actors. Despite this, the focus of the majority of recommender research -- and most practical recommenders of any import -- is on the local, myopic optimization of the recommendations made to individual users. This comes at a significant cost to the long-term utility that recommenders could generate for its users. We argue that explicitly modeling the incentives and behaviors of all actors in the system -- and the interactions among them induced by the recommender's policy -- is strictly necessary if one is to maximize the value the system brings to these actors and improve overall ecosystem "health". Doing so requires: optimization over long horizons using techniques such as reinforcement learning; making inevitable tradeoffs in the utility that can be generated for different actors using the methods of social choice; reducing information asymmetry, while accounting for incentives and strategic behavior, using the tools of mechanism design; better modeling of both user and item-provider behaviors by incorporating notions from behavioral economics and psychology; and exploiting recent advances in generative and foundation models to make these mechanisms interpretable and actionable. We propose a conceptual framework that encompasses these elements, and articulate a number of research challenges that emerge at the intersection of these different disciplines.
△ Less
Submitted 21 September, 2023; v1 submitted 7 September, 2023;
originally announced September 2023.
-
Content Prompting: Modeling Content Provider Dynamics to Improve User Welfare in Recommender Ecosystems
Authors:
Siddharth Prasad,
Martin Mladenov,
Craig Boutilier
Abstract:
Users derive value from a recommender system (RS) only to the extent that it is able to surface content (or items) that meet their needs/preferences. While RSs often have a comprehensive view of user preferences across the entire user base, content providers, by contrast, generally have only a local view of the preferences of users that have interacted with their content. This limits a provider's…
▽ More
Users derive value from a recommender system (RS) only to the extent that it is able to surface content (or items) that meet their needs/preferences. While RSs often have a comprehensive view of user preferences across the entire user base, content providers, by contrast, generally have only a local view of the preferences of users that have interacted with their content. This limits a provider's ability to offer new content to best serve the broader population. In this work, we tackle this information asymmetry with content prompting policies. A content prompt is a hint or suggestion to a provider to make available novel content for which the RS predicts unmet user demand. A prompting policy is a sequence of such prompts that is responsive to the dynamics of a provider's beliefs, skills and incentives. We aim to determine a joint prompting policy that induces a set of providers to make content available that optimizes user social welfare in equilibrium, while respecting the incentives of the providers themselves. Our contributions include: (i) an abstract model of the RS ecosystem, including content provider behaviors, that supports such prompting; (ii) the design and theoretical analysis of sequential prompting policies for individual providers; (iii) a mixed integer programming formulation for optimal joint prompting using path planning in content space; and (iv) simple, proof-of-concept experiments illustrating how such policies improve ecosystem health and user welfare.
△ Less
Submitted 2 September, 2023;
originally announced September 2023.
-
Ranking with Popularity Bias: User Welfare under Self-Amplification Dynamics
Authors:
Guy Tennenholtz,
Martin Mladenov,
Nadav Merlis,
Robert L. Axtell,
Craig Boutilier
Abstract:
While popularity bias is recognized to play a crucial role in recommmender (and other ranking-based) systems, detailed analysis of its impact on collective user welfare has largely been lacking. We propose and theoretically analyze a general mechanism, rooted in many of the models proposed in the literature, by which item popularity, item quality, and position bias jointly impact user choice. We f…
▽ More
While popularity bias is recognized to play a crucial role in recommmender (and other ranking-based) systems, detailed analysis of its impact on collective user welfare has largely been lacking. We propose and theoretically analyze a general mechanism, rooted in many of the models proposed in the literature, by which item popularity, item quality, and position bias jointly impact user choice. We focus on a standard setting in which user utility is largely driven by item quality, and a recommender attempts to estimate it given user behavior. Formulating the problem as a non-stationary contextual bandit, we study the ability of a recommender policy to maximize user welfare under this model. We highlight the importance of exploration, not to eliminate popularity bias, but to mitigate its negative impact on welfare. We first show that naive popularity-biased recommenders induce linear regret by conflating item quality and popularity. More generally, we show that, even in linear settings, identifiability of item quality may not be possible due to the confounding effects of popularity bias. However, under sufficient variability assumptions, we develop an efficient optimistic algorithm and prove efficient regret guarantees w.r.t. user welfare. We complement our analysis with several simulation studies, which demonstrate the negative impact of popularity bias on the performance of several natural recommender policies.
△ Less
Submitted 1 November, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Reinforcement Learning with History-Dependent Dynamic Contexts
Authors:
Guy Tennenholtz,
Nadav Merlis,
Lior Shani,
Martin Mladenov,
Craig Boutilier
Abstract:
We introduce Dynamic Contextual Markov Decision Processes (DCMDPs), a novel reinforcement learning framework for history-dependent environments that generalizes the contextual MDP framework to handle non-Markov environments, where contexts change over time. We consider special cases of the model, with a focus on logistic DCMDPs, which break the exponential dependence on history length by leveragin…
▽ More
We introduce Dynamic Contextual Markov Decision Processes (DCMDPs), a novel reinforcement learning framework for history-dependent environments that generalizes the contextual MDP framework to handle non-Markov environments, where contexts change over time. We consider special cases of the model, with a focus on logistic DCMDPs, which break the exponential dependence on history length by leveraging aggregation functions to determine context transitions. This special structure allows us to derive an upper-confidence-bound style algorithm for which we establish regret bounds. Motivated by our theoretical results, we introduce a practical model-based algorithm for logistic DCMDPs that plans in a latent space and uses optimism over history-dependent features. We demonstrate the efficacy of our approach on a recommendation task (using MovieLens data) where user behavior dynamics evolve in response to recommendations.
△ Less
Submitted 17 May, 2023; v1 submitted 3 February, 2023;
originally announced February 2023.
-
pyRDDLGym: From RDDL to Gym Environments
Authors:
Ayal Taitler,
Michael Gimelfarb,
Jihwan Jeong,
Sriram Gopalakrishnan,
Martin Mladenov,
Xiaotian Liu,
Scott Sanner
Abstract:
We present pyRDDLGym, a Python framework for auto-generation of OpenAI Gym environments from RDDL declerative description. The discrete time step evolution of variables in RDDL is described by conditional probability functions, which fits naturally into the Gym step scheme. Furthermore, since RDDL is a lifted description, the modification and scaling up of environments to support multiple entities…
▽ More
We present pyRDDLGym, a Python framework for auto-generation of OpenAI Gym environments from RDDL declerative description. The discrete time step evolution of variables in RDDL is described by conditional probability functions, which fits naturally into the Gym step scheme. Furthermore, since RDDL is a lifted description, the modification and scaling up of environments to support multiple entities and different configurations becomes trivial rather than a tedious process prone to errors. We hope that pyRDDLGym will serve as a new wind in the reinforcement learning community by enabling easy and rapid development of benchmarks due to the unique expressive power of RDDL. By providing explicit access to the model in the RDDL description, pyRDDLGym can also facilitate research on hybrid approaches for learning from interaction while leveraging model knowledge. We present the design and built-in examples of pyRDDLGym, and the additions made to the RDDL language that were incorporated into the framework.
△ Less
Submitted 5 February, 2024; v1 submitted 10 November, 2022;
originally announced November 2022.
-
Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities
Authors:
Ruohan Zhan,
Konstantina Christakopoulou,
Ya Le,
Jayden Ooi,
Martin Mladenov,
Alex Beutel,
Craig Boutilier,
Ed H. Chi,
Minmin Chen
Abstract:
Most existing recommender systems focus primarily on matching users to content which maximizes user satisfaction on the platform. It is increasingly obvious, however, that content providers have a critical influence on user satisfaction through content creation, largely determining the content pool available for recommendation. A natural question thus arises: can we design recommenders taking into…
▽ More
Most existing recommender systems focus primarily on matching users to content which maximizes user satisfaction on the platform. It is increasingly obvious, however, that content providers have a critical influence on user satisfaction through content creation, largely determining the content pool available for recommendation. A natural question thus arises: can we design recommenders taking into account the long-term utility of both users and content providers? By doing so, we hope to sustain more providers and a more diverse content pool for long-term user satisfaction. Understanding the full impact of recommendations on both user and provider groups is challenging. This paper aims to serve as a research investigation of one approach toward building a provider-aware recommender, and evaluating its impact in a simulated setup.
To characterize the user-recommender-provider interdependence, we complement user modeling by formalizing provider dynamics as well. The resulting joint dynamical system gives rise to a weakly-coupled partially observable Markov decision process driven by recommender actions and user feedback to providers. We then build a REINFORCE recommender agent, coined EcoAgent, to optimize a joint objective of user utility and the counterfactual utility lift of the provider associated with the recommended content, which we show to be equivalent to maximizing overall user utility and the utilities of all providers on the platform under some mild assumptions. To evaluate our approach, we introduce a simulation environment capturing the key interactions among users, providers, and the recommender. We offer a number of simulated experiments that shed light on both the benefits and the limitations of our approach. These results help understand how and when a provider-aware recommender agent is of benefit in building multi-stakeholder recommender systems.
△ Less
Submitted 5 May, 2021;
originally announced May 2021.
-
Supersonic Ramjet Engine Inlet for Jovian Flight
Authors:
Martin N. Karanikolov,
Nedislav S. Veselinov,
Dimitar M. Mladenov
Abstract:
This paper discusses the analysis performed on a supersonic ramjet engine inlet for flight in the atmosphere of Jupiter. Since the Jovian atmosphere lacks oxygen, the thrust will be generated by nuclear fission heating in the heat chamber. The first task to solve in the design in a ramjet engine is to design the supersonic inlet. The developed design methodology utilizes theoretical calculations a…
▽ More
This paper discusses the analysis performed on a supersonic ramjet engine inlet for flight in the atmosphere of Jupiter. Since the Jovian atmosphere lacks oxygen, the thrust will be generated by nuclear fission heating in the heat chamber. The first task to solve in the design in a ramjet engine is to design the supersonic inlet. The developed design methodology utilizes theoretical calculations and Computational Fluid Dynamics (CFD) simulations. The analytical model used to calculate the gas parameters in front of the heat chamber, and the CFD analysis, used to define the inlet geometry, are discussed. The results from the analytical model and CFD are compared and used for validation of the design approach. The calculated pressure losses and the mass flow allow the determination of important parameters required for the design of the aircraft, such as the reactor power, the thrust, the maximum mass, and the overall external dimensions.
△ Less
Submitted 22 March, 2021;
originally announced March 2021.
-
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Authors:
Martin Mladenov,
Chih-Wei Hsu,
Vihan Jain,
Eugene Ie,
Christopher Colby,
Nicolas Mayoraz,
Hubert Pham,
Dustin Tran,
Ivan Vendrov,
Craig Boutilier
Abstract:
The development of recommender systems that optimize multi-turn interaction with users, and model the interactions of different agents (e.g., users, content providers, vendors) in the recommender ecosystem have drawn increasing attention in recent years. Developing and training models and algorithms for such recommenders can be especially difficult using static datasets, which often fail to offer…
▽ More
The development of recommender systems that optimize multi-turn interaction with users, and model the interactions of different agents (e.g., users, content providers, vendors) in the recommender ecosystem have drawn increasing attention in recent years. Developing and training models and algorithms for such recommenders can be especially difficult using static datasets, which often fail to offer the types of counterfactual predictions needed to evaluate policies over extended horizons. To address this, we develop RecSim NG, a probabilistic platform for the simulation of multi-agent recommender systems. RecSim NG is a scalable, modular, differentiable simulator implemented in Edward2 and TensorFlow. It offers: a powerful, general probabilistic programming language for agent-behavior specification; tools for probabilistic inference and latent-variable model learning, backed by automatic differentiation and tracing; and a TensorFlow-based runtime for running simulations on accelerated hardware. We describe RecSim NG and illustrate how it can be used to create transparent, configurable, end-to-end models of a recommender ecosystem, complemented by a small set of simple use cases that demonstrate how RecSim NG can help both researchers and practitioners easily develop and train novel algorithms for recommender systems.
△ Less
Submitted 14 March, 2021;
originally announced March 2021.
-
Meta-Thompson Sampling
Authors:
Branislav Kveton,
Mikhail Konobeev,
Manzil Zaheer,
Chih-wei Hsu,
Martin Mladenov,
Craig Boutilier,
Csaba Szepesvari
Abstract:
Efficient exploration in bandits is a fundamental online learning problem. We propose a variant of Thompson sampling that learns to explore better as it interacts with bandit instances drawn from an unknown prior. The algorithm meta-learns the prior and thus we call it MetaTS. We propose several efficient implementations of MetaTS and analyze it in Gaussian bandits. Our analysis shows the benefit…
▽ More
Efficient exploration in bandits is a fundamental online learning problem. We propose a variant of Thompson sampling that learns to explore better as it interacts with bandit instances drawn from an unknown prior. The algorithm meta-learns the prior and thus we call it MetaTS. We propose several efficient implementations of MetaTS and analyze it in Gaussian bandits. Our analysis shows the benefit of meta-learning and is of a broader interest, because we derive a novel prior-dependent Bayes regret bound for Thompson sampling. Our theory is complemented by empirical evaluation, which shows that MetaTS quickly adapts to the unknown prior.
△ Less
Submitted 23 June, 2021; v1 submitted 11 February, 2021;
originally announced February 2021.
-
Flight in the Jovian Stratosphere. Engine Concept and Flight Altitude Determination
Authors:
Nedislav S. Veselinov,
Martin N. Karanikolov,
Vladislav V. Shihskin,
Dimitar M. Mladenov
Abstract:
An effective method for detailed observation of the Solar System planets is the use of vehicles that can perform flight in their atmospheres, with the most promising of them being Flyers (aircraft for other planets atmospheres). Besides the advantage of probing the atmosphere directly, they have the ability to fly on selected direction and altitude, making them suitable for collecting information…
▽ More
An effective method for detailed observation of the Solar System planets is the use of vehicles that can perform flight in their atmospheres, with the most promising of them being Flyers (aircraft for other planets atmospheres). Besides the advantage of probing the atmosphere directly, they have the ability to fly on selected direction and altitude, making them suitable for collecting information over large areas. Equipping the Flyer with nuclear propulsion will allow it to conduct flight for months without the need of combustible fuel or oxidizer to be carried on board. Among the planets of the Solar System and their satellites, Jupiter is a viable target for exploration, since it features thick atmosphere suitable for aerodynamic flight, there is no solid surface that can be contaminated after end of the mission, and the atmospheric data for designing a Flyer is readily available. This paper proposes a mathematical model for evaluating the thrust, the lift and the maximum allowable mass for horizontal steady flight as functions of the altitude and different heat chamber temperatures.
△ Less
Submitted 16 September, 2020;
originally announced September 2020.
-
Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach
Authors:
Martin Mladenov,
Elliot Creager,
Omer Ben-Porat,
Kevin Swersky,
Richard Zemel,
Craig Boutilier
Abstract:
Most recommender systems (RS) research assumes that a user's utility can be maximized independently of the utility of the other agents (e.g., other users, content providers). In realistic settings, this is often not true---the dynamics of an RS ecosystem couple the long-term utility of all agents. In this work, we explore settings in which content providers cannot remain viable unless they receive…
▽ More
Most recommender systems (RS) research assumes that a user's utility can be maximized independently of the utility of the other agents (e.g., other users, content providers). In realistic settings, this is often not true---the dynamics of an RS ecosystem couple the long-term utility of all agents. In this work, we explore settings in which content providers cannot remain viable unless they receive a certain level of user engagement. We formulate the recommendation problem in this setting as one of equilibrium selection in the induced dynamical system, and show that it can be solved as an optimal constrained matching problem. Our model ensures the system reaches an equilibrium with maximal social welfare supported by a sufficiently diverse set of viable providers. We demonstrate that even in a simple, stylized dynamical RS model, the standard myopic approach to recommendation---always matching a user to the best provider---performs poorly. We develop several scalable techniques to solve the matching problem, and also draw connections to various notions of user regret and fairness, arguing that these outcomes are fairer in a utilitarian sense.
△ Less
Submitted 18 August, 2020; v1 submitted 31 July, 2020;
originally announced August 2020.
-
Meta-Learning Bandit Policies by Gradient Ascent
Authors:
Branislav Kveton,
Martin Mladenov,
Chih-Wei Hsu,
Manzil Zaheer,
Csaba Szepesvari,
Craig Boutilier
Abstract:
Most bandit policies are designed to either minimize regret in any problem instance, making very few assumptions about the underlying environment, or in a Bayesian sense, assuming a prior distribution over environment parameters. The former are often too conservative in practical settings, while the latter require assumptions that are hard to verify in practice. We study bandit problems that fall…
▽ More
Most bandit policies are designed to either minimize regret in any problem instance, making very few assumptions about the underlying environment, or in a Bayesian sense, assuming a prior distribution over environment parameters. The former are often too conservative in practical settings, while the latter require assumptions that are hard to verify in practice. We study bandit problems that fall between these two extremes, where the learning agent has access to sampled bandit instances from an unknown prior distribution $\mathcal{P}$ and aims to achieve high reward on average over the bandit instances drawn from $\mathcal{P}$. This setting is of a particular importance because it lays foundations for meta-learning of bandit policies and reflects more realistic assumptions in many practical domains. We propose the use of parameterized bandit policies that are differentiable and can be optimized using policy gradients. This provides a broadly applicable framework that is easy to implement. We derive reward gradients that reflect the structure of bandit problems and policies, for both non-contextual and contextual settings, and propose a number of interesting policies that are both differentiable and have low regret. Our algorithmic and theoretical contributions are supported by extensive experiments that show the importance of baseline subtraction, learned biases, and the practicality of our approach on a range problems.
△ Less
Submitted 5 January, 2021; v1 submitted 9 June, 2020;
originally announced June 2020.
-
Differentiable Bandit Exploration
Authors:
Craig Boutilier,
Chih-Wei Hsu,
Branislav Kveton,
Martin Mladenov,
Csaba Szepesvari,
Manzil Zaheer
Abstract:
Exploration policies in Bayesian bandits maximize the average reward over problem instances drawn from some distribution $\mathcal{P}$. In this work, we learn such policies for an unknown distribution $\mathcal{P}$ using samples from $\mathcal{P}$. Our approach is a form of meta-learning and exploits properties of $\mathcal{P}$ without making strong assumptions about its form. To do this, we param…
▽ More
Exploration policies in Bayesian bandits maximize the average reward over problem instances drawn from some distribution $\mathcal{P}$. In this work, we learn such policies for an unknown distribution $\mathcal{P}$ using samples from $\mathcal{P}$. Our approach is a form of meta-learning and exploits properties of $\mathcal{P}$ without making strong assumptions about its form. To do this, we parameterize our policies in a differentiable way and optimize them by policy gradients, an approach that is general and easy to implement. We derive effective gradient estimators and introduce novel variance reduction techniques. We also analyze and experiment with various bandit policy classes, including neural networks and a novel softmax policy. The latter has regret guarantees and is a natural starting point for our optimization. Our experiments show the versatility of our approach. We also observe that neural network policies can learn implicit biases expressed only through the sampled instances.
△ Less
Submitted 9 June, 2020; v1 submitted 17 February, 2020;
originally announced February 2020.
-
Mechanics of the Infinitesimal Gyroscopes on the Mylar Balloons and Their Action-Angle Analysis
Authors:
Vasyl Kovalchuk,
Ivailo M. Mladenov
Abstract:
Here we apply the general scheme for description of the mechanics of infinitesimal bodies in the Riemannian spaces to the examples of geodetic and non-geodetic (for two different model potentials) motions of infinitesimal rotators on the Mylar balloons. The structure of partial degeneracy is investigated with the help of the corresponding Hamilton-Jacobi equation and action-angle analysis. In all…
▽ More
Here we apply the general scheme for description of the mechanics of infinitesimal bodies in the Riemannian spaces to the examples of geodetic and non-geodetic (for two different model potentials) motions of infinitesimal rotators on the Mylar balloons. The structure of partial degeneracy is investigated with the help of the corresponding Hamilton-Jacobi equation and action-angle analysis. In all situations it was found that for any of the six disjoint regions in the phase space among the three action variables only two of them are essential for the description of our models at the level of the old quantum theory (according to the Bohr-Sommerfeld postulates). Moreover, in both non-geodetic models the action variables were intertwined with the quantum number $N$ corresponding to the quantization of the radii $r$ of the inflated Mylar balloons.
△ Less
Submitted 7 October, 2019; v1 submitted 3 October, 2019;
originally announced October 2019.
-
RecSim: A Configurable Simulation Platform for Recommender Systems
Authors:
Eugene Ie,
Chih-wei Hsu,
Martin Mladenov,
Vihan Jain,
Sanmit Narvekar,
Jing Wang,
Rui Wu,
Craig Boutilier
Abstract:
We propose RecSim, a configurable platform for authoring simulation environments for recommender systems (RSs) that naturally supports sequential interaction with users. RecSim allows the creation of new environments that reflect particular aspects of user behavior and item structure at a level of abstraction well-suited to pushing the limits of current reinforcement learning (RL) and RS technique…
▽ More
We propose RecSim, a configurable platform for authoring simulation environments for recommender systems (RSs) that naturally supports sequential interaction with users. RecSim allows the creation of new environments that reflect particular aspects of user behavior and item structure at a level of abstraction well-suited to pushing the limits of current reinforcement learning (RL) and RS techniques in sequential interactive recommendation problems. Environments can be easily configured that vary assumptions about: user preferences and item familiarity; user latent state and its dynamics; and choice models and other user response behavior. We outline how RecSim offers value to RL and RS researchers and practitioners, and how it can serve as a vehicle for academic-industrial collaboration.
△ Less
Submitted 26 September, 2019; v1 submitted 11 September, 2019;
originally announced September 2019.
-
Advantage Amplification in Slowly Evolving Latent-State Environments
Authors:
Martin Mladenov,
Ofer Meshi,
Jayden Ooi,
Dale Schuurmans,
Craig Boutilier
Abstract:
Latent-state environments with long horizons, such as those faced by recommender systems, pose significant challenges for reinforcement learning (RL). In this work, we identify and analyze several key hurdles for RL in such environments, including belief state error and small action advantage. We develop a general principle of advantage amplification that can overcome these hurdles through the use…
▽ More
Latent-state environments with long horizons, such as those faced by recommender systems, pose significant challenges for reinforcement learning (RL). In this work, we identify and analyze several key hurdles for RL in such environments, including belief state error and small action advantage. We develop a general principle of advantage amplification that can overcome these hurdles through the use of temporal abstraction. We propose several aggregation methods and prove they induce amplification in certain settings. We also bound the loss in optimality incurred by our methods in environments where latent state evolves slowly and demonstrate their performance empirically in a stylized user-modeling task.
△ Less
Submitted 29 May, 2019;
originally announced May 2019.
-
Empirical Bayes Regret Minimization
Authors:
Chih-Wei Hsu,
Branislav Kveton,
Ofer Meshi,
Martin Mladenov,
Csaba Szepesvari
Abstract:
Most bandit algorithm designs are purely theoretical. Therefore, they have strong regret guarantees, but also are often too conservative in practice. In this work, we pioneer the idea of algorithm design by minimizing the empirical Bayes regret, the average regret over problem instances sampled from a known distribution. We focus on a tractable instance of this problem, the confidence interval and…
▽ More
Most bandit algorithm designs are purely theoretical. Therefore, they have strong regret guarantees, but also are often too conservative in practice. In this work, we pioneer the idea of algorithm design by minimizing the empirical Bayes regret, the average regret over problem instances sampled from a known distribution. We focus on a tractable instance of this problem, the confidence interval and posterior width tuning, and propose an efficient algorithm for solving it. The tuning algorithm is analyzed and evaluated in multi-armed, linear, and generalized linear bandits. We report several-fold reductions in Bayes regret for state-of-the-art bandit algorithms, simply by optimizing over a small sample from a distribution.
△ Less
Submitted 10 June, 2020; v1 submitted 4 April, 2019;
originally announced April 2019.
-
Planning and Learning with Stochastic Action Sets
Authors:
Craig Boutilier,
Alon Cohen,
Amit Daniely,
Avinatan Hassidim,
Yishay Mansour,
Ofer Meshi,
Martin Mladenov,
Dale Schuurmans
Abstract:
In many practical uses of reinforcement learning (RL) the set of actions available at a given state is a random variable, with realizations governed by an exogenous stochastic process. Somewhat surprisingly, the foundations for such sequential decision processes have been unaddressed. In this work, we formalize and investigate MDPs with stochastic action sets (SAS-MDPs) to provide these foundation…
▽ More
In many practical uses of reinforcement learning (RL) the set of actions available at a given state is a random variable, with realizations governed by an exogenous stochastic process. Somewhat surprisingly, the foundations for such sequential decision processes have been unaddressed. In this work, we formalize and investigate MDPs with stochastic action sets (SAS-MDPs) to provide these foundations. We show that optimal policies and value functions in this model have a structure that admits a compact representation. From an RL perspective, we show that Q-learning with sampled action sets is sound. In model-based settings, we consider two important special cases: when individual actions are available with independent probabilities; and a sampling-based model for unknown distributions. We develop poly-time value and policy iteration methods for both cases; and in the first, we offer a poly-time linear programming solution.
△ Less
Submitted 12 February, 2021; v1 submitted 7 May, 2018;
originally announced May 2018.
-
Lifted Convex Quadratic Programming
Authors:
Martin Mladenov,
Leonard Kleinhans,
Kristian Kersting
Abstract:
Symmetry is the essential element of lifted inference that has recently demon- strated the possibility to perform very efficient inference in highly-connected, but symmetric probabilistic models models. This raises the question, whether this holds for optimisation problems in general. Here we show that for a large class of optimisation methods this is actually the case. More precisely, we introduc…
▽ More
Symmetry is the essential element of lifted inference that has recently demon- strated the possibility to perform very efficient inference in highly-connected, but symmetric probabilistic models models. This raises the question, whether this holds for optimisation problems in general. Here we show that for a large class of optimisation methods this is actually the case. More precisely, we introduce the concept of fractional symmetries of convex quadratic programs (QPs), which lie at the heart of many machine learning approaches, and exploit it to lift, i.e., to compress QPs. These lifted QPs can then be tackled with the usual optimization toolbox (off-the-shelf solvers, cutting plane algorithms, stochastic gradients etc.). If the original QP exhibits symmetry, then the lifted one will generally be more compact, and hence their optimization is likely to be more efficient.
△ Less
Submitted 14 June, 2016;
originally announced June 2016.
-
The Symbolic Interior Point Method
Authors:
Martin Mladenov,
Vaishak Belle,
Kristian Kersting
Abstract:
A recent trend in probabilistic inference emphasizes the codification of models in a formal syntax, with suitable high-level features such as individuals, relations, and connectives, enabling descriptive clarity, succinctness and circumventing the need for the modeler to engineer a custom solver. Unfortunately, bringing these linguistic and pragmatic benefits to numerical optimization has proven s…
▽ More
A recent trend in probabilistic inference emphasizes the codification of models in a formal syntax, with suitable high-level features such as individuals, relations, and connectives, enabling descriptive clarity, succinctness and circumventing the need for the modeler to engineer a custom solver. Unfortunately, bringing these linguistic and pragmatic benefits to numerical optimization has proven surprisingly challenging. In this paper, we turn to these challenges: we introduce a rich modeling language, for which an interior-point method computes approximate solutions in a generic way. While logical features easily complicates the underlying model, often yielding intricate dependencies, we exploit and cache local structure using algebraic decision diagrams (ADDs). Indeed, standard matrix-vector algebra is efficiently realizable in ADDs, but we argue and show that well-known optimization methods are not ideal for ADDs. Our engine, therefore, invokes a sophisticated matrix-free approach. We demonstrate the flexibility of the resulting symbolic-numeric optimizer on decision making and compressed sensing tasks with millions of non-zero entries.
△ Less
Submitted 14 June, 2016; v1 submitted 26 May, 2016;
originally announced May 2016.
-
On MKdV Equations Related to the Affine Kac-Moody Algebra $A_{5}^{(2)}$
Authors:
Vladimir S. Gerdjikov,
Dimitar M. Mladenov,
Aleksander A. Stefanov,
Stanislav K. Varbev
Abstract:
We have derived a new system of mKdV-type equations which can be related to the affine Lie algebra $A_{5}^{(2)}$. This system of partial differential equations is integrable via the inverse scattering method. It admits a Hamiltonian formulation and the corresponding Hamiltonian is also given. The Riemann-Hilbert problem for the Lax operator is formulated and its spectral properties are discussed.
We have derived a new system of mKdV-type equations which can be related to the affine Lie algebra $A_{5}^{(2)}$. This system of partial differential equations is integrable via the inverse scattering method. It admits a Hamiltonian formulation and the corresponding Hamiltonian is also given. The Riemann-Hilbert problem for the Lax operator is formulated and its spectral properties are discussed.
△ Less
Submitted 4 December, 2015;
originally announced December 2015.
-
Soliton equations related to the affine Kac-Moody algebra D^(1)_4
Authors:
Vladimir S. Gerdjikov,
Dimitar M. Mladenov,
Aleksander A. Stefanov,
Stanislav K. Varbev
Abstract:
We have derived the hierarchy of soliton equations associated with the untwisted affine Kac-Moody algebra D^(1)_4 by calculating the corresponding recursion operators. The Hamiltonian formulation of the equations from the hierarchy is also considered. As an example we have explicitly presented the first non-trivial member of the hierarchy, which is an one-parameter family of mKdV equations. We hav…
▽ More
We have derived the hierarchy of soliton equations associated with the untwisted affine Kac-Moody algebra D^(1)_4 by calculating the corresponding recursion operators. The Hamiltonian formulation of the equations from the hierarchy is also considered. As an example we have explicitly presented the first non-trivial member of the hierarchy, which is an one-parameter family of mKdV equations. We have also considered the spectral properties of the Lax operator and introduced a minimal set of scattering data.
△ Less
Submitted 9 July, 2015; v1 submitted 7 December, 2014;
originally announced December 2014.
-
Integrable equations and recursion operators related to the affine Lie algebras $A^{(1)}_{r}$
Authors:
Vladimir S. Gerdjikov,
Dimitar M. Mladenov,
Aleksander A. Stefanov,
Stanislav K. Varbev
Abstract:
We have derived a family of equations related to the untwisted affine Lie algebras $A^{(1)}_{r}$ using a Coxeter $\mathbb{Z}_{r+1}$ reduction. They represent the third member of the hierarchy of soliton equations related to the algebra. We also give some particular examples and impose additional reductions.
We have derived a family of equations related to the untwisted affine Lie algebras $A^{(1)}_{r}$ using a Coxeter $\mathbb{Z}_{r+1}$ reduction. They represent the third member of the hierarchy of soliton equations related to the algebra. We also give some particular examples and impose additional reductions.
△ Less
Submitted 2 November, 2014;
originally announced November 2014.
-
Relational Linear Programs
Authors:
Kristian Kersting,
Martin Mladenov,
Pavel Tokmakov
Abstract:
We propose relational linear programming, a simple framework for combing linear programs (LPs) and logic programs. A relational linear program (RLP) is a declarative LP template defining the objective and the constraints through the logical concepts of objects, relations, and quantified variables. This allows one to express the LP objective and constraints relationally for a varying number of indi…
▽ More
We propose relational linear programming, a simple framework for combing linear programs (LPs) and logic programs. A relational linear program (RLP) is a declarative LP template defining the objective and the constraints through the logical concepts of objects, relations, and quantified variables. This allows one to express the LP objective and constraints relationally for a varying number of individuals and relations among them without enumerating them. Together with a logical knowledge base, effectively a logical program consisting of logical facts and rules, it induces a ground LP. This ground LP is solved using lifted linear programming. That is, symmetries within the ground LP are employed to reduce its dimensionality, if possible, and the reduced program is solved using any off-the-shelf LP solver. In contrast to mainstream LP template languages like AMPL, which features a mixture of declarative and imperative programming styles, RLP's relational nature allows a more intuitive representation of optimization problems over relational domains. We illustrate this empirically by experiments on approximate inference in Markov logic networks using LP relaxations, on solving Markov decision processes, and on collective inference using LP support vector machines.
△ Less
Submitted 12 October, 2014;
originally announced October 2014.
-
Dimension Reduction via Colour Refinement
Authors:
Martin Grohe,
Kristian Kersting,
Martin Mladenov,
Erkal Selman
Abstract:
Colour refinement is a basic algorithmic routine for graph isomorphism testing, appearing as a subroutine in almost all practical isomorphism solvers. It partitions the vertices of a graph into "colour classes" in such a way that all vertices in the same colour class have the same number of neighbours in every colour class. Tinhofer (Disc. App. Math., 1991), Ramana, Scheinerman, and Ullman (Disc.…
▽ More
Colour refinement is a basic algorithmic routine for graph isomorphism testing, appearing as a subroutine in almost all practical isomorphism solvers. It partitions the vertices of a graph into "colour classes" in such a way that all vertices in the same colour class have the same number of neighbours in every colour class. Tinhofer (Disc. App. Math., 1991), Ramana, Scheinerman, and Ullman (Disc. Math., 1994) and Godsil (Lin. Alg. and its App., 1997) established a tight correspondence between colour refinement and fractional isomorphisms of graphs, which are solutions to the LP relaxation of a natural ILP formulation of graph isomorphism.
We introduce a version of colour refinement for matrices and extend existing quasilinear algorithms for computing the colour classes. Then we generalise the correspondence between colour refinement and fractional automorphisms and develop a theory of fractional automorphisms and isomorphisms of matrices.
We apply our results to reduce the dimensions of systems of linear equations and linear programs. Specifically, we show that any given LP L can efficiently be transformed into a (potentially) smaller LP L' whose number of variables and constraints is the number of colour classes of the colour refinement algorithm, applied to a matrix associated with the LP. The transformation is such that we can easily (by a linear mapping) map both feasible and optimal solutions back and forth between the two LPs. We demonstrate empirically that colour refinement can indeed greatly reduce the cost of solving linear programs.
△ Less
Submitted 30 April, 2014; v1 submitted 22 July, 2013;
originally announced July 2013.
-
Analytic description and explicit parametrization of the equilibrium shapes of elastic rings and tubes under uniform hydrostatic pressure
Authors:
Peter A. Djondjorov,
Vassil M. Vassilev,
Ivailo M. Mladenov
Abstract:
The parametric equations of the plane curves determining the equilibrium shapes that a uniform inextensible elastic ring or tube could take subject to a uniform hydrostatic pressure are presented in an explicit analytic form. The determination of the equilibrium shape of such a structure corresponding to a given pressure is reduced to the solution of two transcendental equations. The shapes with p…
▽ More
The parametric equations of the plane curves determining the equilibrium shapes that a uniform inextensible elastic ring or tube could take subject to a uniform hydrostatic pressure are presented in an explicit analytic form. The determination of the equilibrium shape of such a structure corresponding to a given pressure is reduced to the solution of two transcendental equations. The shapes with points of contact and the corresponding (contact) pressures are determined by the solutions of three transcendental equations. The analytic results presented here confirm many of the previous numerical results on this subject but the results concerning the shapes with lines of contact reported up to now are revised.
△ Less
Submitted 3 August, 2010;
originally announced August 2010.
-
On the Uncertainty Relations in Stochastic Mechanics
Authors:
D. A. Trifonov,
B. A. Nikolov,
I. M. Mladenov
Abstract:
It is shown that the Bohm equations for the phase $S$ and squared modulus $ρ$ of the quantum mechanical wave function can be derived from the classical ensemble equations admiting an aditional momentum $p_s$ of the form proportional to the osmotic velocity in the Nelson stochastic mechanics and using the variational principle with appropriate change of variables. The possibility to treat grad…
▽ More
It is shown that the Bohm equations for the phase $S$ and squared modulus $ρ$ of the quantum mechanical wave function can be derived from the classical ensemble equations admiting an aditional momentum $p_s$ of the form proportional to the osmotic velocity in the Nelson stochastic mechanics and using the variational principle with appropriate change of variables. The possibility to treat grad$S$ and $p_s$ as two parts of the momentum of quantum ensemble particles is considered from the view point of uncertainty relations of Robertson - Schroedinger type on the examples of the stochastic image of quantum mechanical canonical coherent and squeezed states.
△ Less
Submitted 18 October, 2009; v1 submitted 23 February, 2009;
originally announced February 2009.
-
Cylindrical equilibrium shapes of fluid membranes
Authors:
V. M. Vassilev,
P. A. Djondjorov,
I. M. Mladenov
Abstract:
Within the framework of the well-known curvature models, a fluid lipid bilayer membrane is regarded as a surface embedded in the three-dimensional Euclidean space whose equilibrium shapes are described in terms of its mean and Gaussian curvatures by the so-called membrane shape equation. In the present paper, all solutions to this equation determining cylindrical membrane shapes are found and pr…
▽ More
Within the framework of the well-known curvature models, a fluid lipid bilayer membrane is regarded as a surface embedded in the three-dimensional Euclidean space whose equilibrium shapes are described in terms of its mean and Gaussian curvatures by the so-called membrane shape equation. In the present paper, all solutions to this equation determining cylindrical membrane shapes are found and presented, together with the expressions for the corresponding position vectors, in explicit analytic form. The necessary and sufficient conditions for such a surface to be closed are derived and several sufficient conditions for its directrix to be simple or self-intersecting are given.
△ Less
Submitted 22 October, 2009; v1 submitted 6 March, 2008;
originally announced March 2008.
-
Light-cone SU(2) Yang-Mills theory and conformal mechanics
Authors:
V. P. Gerdt,
A. M. Khvedelidze,
D. M. Mladenov
Abstract:
We examine the mechanical matrix model that can be derived from the SU(2) Yang-Mills light-cone field theory by restricting the gauge fields to depend on the light-cone time alone. We use Dirac's generalized Hamiltonian approach. In contrast to its well-known instant-time counterpart the light-cone version of SU(2) Yang-Mills mechanics has in addition to the constraints, generating the SU(2) gau…
▽ More
We examine the mechanical matrix model that can be derived from the SU(2) Yang-Mills light-cone field theory by restricting the gauge fields to depend on the light-cone time alone. We use Dirac's generalized Hamiltonian approach. In contrast to its well-known instant-time counterpart the light-cone version of SU(2) Yang-Mills mechanics has in addition to the constraints, generating the SU(2) gauge transformations, the new first and second class constraints also. On account of all of these constraints a complete reduction in number of the degrees of freedom is performed. It is argued that the classical evolution of the unconstrained degrees of freedom is equivalent to a free one-dimensional particle dynamics. Considering the complex solutions to the second class constraints we show at this time that the unconstrained Hamiltonian system represents the well-known model of conformal mechanics with a ``strength'' of the inverse square interaction determined by the value of the gauge field spin.
△ Less
Submitted 2 December, 2005; v1 submitted 3 October, 2002;
originally announced October 2002.
-
Analysis of constraints in light-cone version of SU(2) Yang-Mills mechanics
Authors:
V. P. Gerdt,
A. M. Khvedelidze,
D. M. Mladenov
Abstract:
We study the classical dynamics of mechanical model obtained from the light-cone version of SU(2) Yang-Mills field theory under the supposition of gauge potential dependence only on ``time'' along the light-cone direction. The computer algebra system Maple was used strongly to compute and separate the complete set of constraints. In contrast to the instant form of Yang-Mills mechanics the constr…
▽ More
We study the classical dynamics of mechanical model obtained from the light-cone version of SU(2) Yang-Mills field theory under the supposition of gauge potential dependence only on ``time'' along the light-cone direction. The computer algebra system Maple was used strongly to compute and separate the complete set of constraints. In contrast to the instant form of Yang-Mills mechanics the constraints here represent a mixed form of first and second-class constraints and reduce the number of the physical degrees of freedom up to four canonical one.
△ Less
Submitted 12 September, 2002;
originally announced September 2002.
-
Bianchi type I cosmology and the Euler-Calogero-Sutherland model
Authors:
A. M. Khvedelidze,
D. M. Mladenov
Abstract:
The Bianchi type I cosmological model is brought into a form where the evolution of observables is governed by the unconstrained Hamiltonian that coincides with the Hamiltonian describing the relative motion of particles in the integrable three-body hyperbolic Euler-Calogero-Sutherland system.
The Bianchi type I cosmological model is brought into a form where the evolution of observables is governed by the unconstrained Hamiltonian that coincides with the Hamiltonian describing the relative motion of particles in the integrable three-body hyperbolic Euler-Calogero-Sutherland system.
△ Less
Submitted 15 December, 2002; v1 submitted 14 August, 2002;
originally announced August 2002.
-
Unconstrained SU(2) Yang-Mills Theory with Topological Term in the Long-Wavelength Approximation
Authors:
A. M. Khvedelidze,
D. M. Mladenov,
H. -P. Pavel,
G. Röpke
Abstract:
The Hamiltonian reduction of SU(2) Yang-Mills theory for an arbitrary θangle to an unconstrained nonlocal theory of a self-interacting positive definite symmetric 3 \times 3 matrix field S(x) is performed. It is shown that, after exact projection to a reduced phase space, the density of the Pontryagin index remains a pure divergence, proving the θindependence of the unconstrained theory obtained…
▽ More
The Hamiltonian reduction of SU(2) Yang-Mills theory for an arbitrary θangle to an unconstrained nonlocal theory of a self-interacting positive definite symmetric 3 \times 3 matrix field S(x) is performed. It is shown that, after exact projection to a reduced phase space, the density of the Pontryagin index remains a pure divergence, proving the θindependence of the unconstrained theory obtained. An expansion of the nonlocal kinetic part of the Hamiltonian in powers of the inverse coupling constant and truncation to lowest order, however, lead to violation of the θindependence of the theory. In order to maintain this property on the level of the local approximate theory, a modified expansion in the inverse coupling constant is suggested, which for a vanishing θangle coincides with the original expansion. The corresponding approximate Lagrangian up to second order in derivatives is obtained, and the explicit form of the unconstrained analogue of the Chern-Simons current linear in derivatives is given. Finally, for the case of degenerate field configurations S(x) with rank|S| = 1, a nonlinear σ-type model is obtained, with the Pontryagin topological term reducing to the Hopf invariant of the mapping from the three-sphere S^3 to the unit two-sphere S^2 in the Whitehead form.
△ Less
Submitted 6 July, 2003; v1 submitted 21 February, 2002;
originally announced February 2002.
-
On Unconstrained SU(2) Gluodynamics with Theta Angle
Authors:
A. M. Khvedelidze,
D. M. Mladenov,
H. -P. Pavel,
G. Röpke
Abstract:
The Hamiltonian reduction of classical SU(2) Yang-Mills field theory to the equivalent unconstrained theory of gauge invariant local dynamical variables is generalized to the case of nonvanishing theta angle. It is shown that for any theta angle the elimination of the pure gauge degrees of freedom leads to a corresponding unconstrained nonlocal theory of self-interacting second rank symmetric te…
▽ More
The Hamiltonian reduction of classical SU(2) Yang-Mills field theory to the equivalent unconstrained theory of gauge invariant local dynamical variables is generalized to the case of nonvanishing theta angle. It is shown that for any theta angle the elimination of the pure gauge degrees of freedom leads to a corresponding unconstrained nonlocal theory of self-interacting second rank symmetric tensor fields, and that the obtained classical unconstrained gluodynamics with different theta angles are canonically equivalent as on the original constrained level.
△ Less
Submitted 27 February, 2002; v1 submitted 2 October, 2001;
originally announced October 2001.
-
Generalized Calogero-Moser-Sutherland models from geodesic motion on GL(n, R) group manifold
Authors:
A. M. Khvedelidze,
D. M. Mladenov
Abstract:
It is shown that geodesic motion on the GL(n, R) group manifold endowed with the bi-invariant metric d s^2 = tr(g^{-1} d g)^2 corresponds to a generalization of the hyperbolic n-particle Calogero-Moser-Sutherland model. In particular, considering the motion on Principal orbit stratum of the SO(n, R) group action, we arrive at dynamics of a generalized n-particle Calogero-Moser-Sutherland system…
▽ More
It is shown that geodesic motion on the GL(n, R) group manifold endowed with the bi-invariant metric d s^2 = tr(g^{-1} d g)^2 corresponds to a generalization of the hyperbolic n-particle Calogero-Moser-Sutherland model. In particular, considering the motion on Principal orbit stratum of the SO(n, R) group action, we arrive at dynamics of a generalized n-particle Calogero-Moser-Sutherland system with two types of internal degrees of freedom obeying SO(n, R) \bigoplus SO(n, R) algebra. For the Singular orbit strata of SO(n, R) group action the geodesic motion corresponds to certain deformations of the Calogero-Moser-Sutherland model in a sense of description of particles with different masses. The mass ratios depend on the type of Singular orbit stratum and are determined by its degeneracy. Using reduction due to discrete and continuous symmetries of the system a relation to II A_n Euler-Calogero-Moser-Sutherland model is demonstrated.
△ Less
Submitted 7 June, 2002; v1 submitted 26 March, 2001;
originally announced March 2001.
-
Classical mechanics on GL(n, R) group and Euler-Calogero-Sutherland model
Authors:
A. M. Khvedelidze,
D. M. Mladenov
Abstract:
Relations between the free motion on the GL^+(n, R) group manifold and the dynamics of an n-particle system with spin degrees of freedom on a line interacting with the pairwise 1/sinh^2 x ``potential'' (Euler-Calogero-Sutherland model) is discussed in the framework of Hamiltonian reduction. Two kinds of reductions of the degrees of freedom are considered: due to the continuous invariance and due…
▽ More
Relations between the free motion on the GL^+(n, R) group manifold and the dynamics of an n-particle system with spin degrees of freedom on a line interacting with the pairwise 1/sinh^2 x ``potential'' (Euler-Calogero-Sutherland model) is discussed in the framework of Hamiltonian reduction. Two kinds of reductions of the degrees of freedom are considered: due to the continuous invariance and due to the discrete symmetry. It is shown that after projection on the corresponding invariant manifolds the resulting Hamiltonian system represents the Euler-Calogero-Sutherland model in both cases.
△ Less
Submitted 30 January, 2001; v1 submitted 18 January, 2001;
originally announced January 2001.
-
Euler-Calogero-Moser system from SU(2) Yang-Mills theory
Authors:
A. M. Khvedelidze,
D. M. Mladenov
Abstract:
The relation between SU(2) Yang-Mills mechanics, originated from the 4-dimensional SU(2) Yang-Mills theory under the supposition of spatial homogeneity of the gauge fields, and the Euler-Calogero-Moser model is discussed in the framework of Hamiltonian reduction. Two kinds of reductions of the degrees of freedom are considered: due to the gauge invariance and due to the discrete symmetry. In the…
▽ More
The relation between SU(2) Yang-Mills mechanics, originated from the 4-dimensional SU(2) Yang-Mills theory under the supposition of spatial homogeneity of the gauge fields, and the Euler-Calogero-Moser model is discussed in the framework of Hamiltonian reduction. Two kinds of reductions of the degrees of freedom are considered: due to the gauge invariance and due to the discrete symmetry. In the former case, it is shown that after elimination of the gauge degrees of freedom from the SU(2) Yang-Mills mechanics the resulting unconstrained system represents the ID_3 Euler-Calogero-Moser model with an external fourth-order potential. Whereas in the latter, the IA_6 Euler-Calogero-Moser model embedded in an external potential is derived whose projection onto the invariant submanifold through the discrete symmetry coincides again with the SU(2) Yang-Mills mechanics. Based on this connection, the equations of motion of the SU(2) Yang-Mills mechanics in the limit of the zero coupling constant are presented in the Lax form.
△ Less
Submitted 19 March, 2000; v1 submitted 4 June, 1999;
originally announced June 1999.
-
Hamiltonian reduction of SU(2) Dirac-Yang-Mills mechanics
Authors:
S. A. Gogilidze,
A. M. Khvedelidze,
D. M. Mladenov,
H. -P. Pavel
Abstract:
The SU(2) gauge invariant Dirac-Yang-Mills mechanics of spatially homogeneous isospinor and gauge fields is considered in the framework of the generalized Hamiltonian approach. The unconstrained Hamiltonian system equivalent to the model is obtained using the gaugeless method of Hamiltonian reduction. The latter includes the Abelianization of the first class constraints, putting the second class…
▽ More
The SU(2) gauge invariant Dirac-Yang-Mills mechanics of spatially homogeneous isospinor and gauge fields is considered in the framework of the generalized Hamiltonian approach. The unconstrained Hamiltonian system equivalent to the model is obtained using the gaugeless method of Hamiltonian reduction. The latter includes the Abelianization of the first class constraints, putting the second class constraints into the canonical form and performing a canonical transformation to a set of adapted coordinates such that a subset of the new canonical pairs coincides with the second class constraints and part of the new momenta is equal to the Abelian constraints. In the adapted basis the pure gauge degrees of freedom automatically drop out from the consideration after projection of the model onto the constraint shell. Apart from the elimination of these ignorable degrees of freedom a further Hamiltonian reduction is achieved due to the three dimensional group of rigid symmetry possessed by the system.
△ Less
Submitted 15 July, 1997;
originally announced July 1997.