-
Ant-inspired Walling Strategies for Scalable Swarm Separation: Reinforcement Learning Approaches Based on Finite State Machines
Authors:
Shenbagaraj Kannapiran,
Elena Oikonomou,
Albert Chu,
Spring Berman,
Theodore P. Pavlic
Abstract:
In natural systems, emergent structures often arise to balance competing demands. Army ants, for example, form temporary "walls" that prevent interference between foraging trails. Inspired by this behavior, we developed two decentralized controllers for heterogeneous robotic swarms to maintain spatial separation while executing concurrent tasks. The first is a finite-state machine (FSM)-based cont…
▽ More
In natural systems, emergent structures often arise to balance competing demands. Army ants, for example, form temporary "walls" that prevent interference between foraging trails. Inspired by this behavior, we developed two decentralized controllers for heterogeneous robotic swarms to maintain spatial separation while executing concurrent tasks. The first is a finite-state machine (FSM)-based controller that uses encounter-triggered transitions to create rigid, stable walls. The second integrates FSM states with a Deep Q-Network (DQN), dynamically optimizing separation through emergent "demilitarized zones." In simulation, both controllers reduce mixing between subgroups, with the DQN-enhanced controller improving adaptability and reducing mixing by 40-50% while achieving faster convergence.
△ Less
Submitted 26 October, 2025;
originally announced October 2025.
-
VLMs have Tunnel Vision: Evaluating Nonlocal Visual Reasoning in Leading VLMs
Authors:
Shmuel Berman,
Jia Deng
Abstract:
Visual Language Models (VLMs) excel at complex visual tasks such as VQA and chart understanding, yet recent work suggests they struggle with simple perceptual tests. We present an evaluation that tests vision-language models' capacity for nonlocal visual reasoning -- reasoning that requires chaining evidence collected from multiple, possibly distant, regions of an image. We isolate three distinct…
▽ More
Visual Language Models (VLMs) excel at complex visual tasks such as VQA and chart understanding, yet recent work suggests they struggle with simple perceptual tests. We present an evaluation that tests vision-language models' capacity for nonlocal visual reasoning -- reasoning that requires chaining evidence collected from multiple, possibly distant, regions of an image. We isolate three distinct forms of non-local vision: comparative perception, which demands holding two images in working memory and comparing them; saccadic search, which requires making discrete, evidence-driven jumps to locate successive targets; and smooth visual search, which involves searching smoothly along a continuous contour. Flagship models (e.g., Gemini 2.5 Pro, Claude Vision 3.7, GPT-o4-mini), even those that perform well on prior primitive-vision benchmarks, fail these tests and barely exceed random accuracy on two variants of our tasks that are trivial for humans. Our structured evaluation suite allows us to test if VLMs can perform similar visual algorithms to humans. Our findings show that despite gains in raw visual acuity, current models lack core visual reasoning capabilities.
△ Less
Submitted 4 July, 2025;
originally announced July 2025.
-
Facts Do Care About Your Language: Assessing Answer Quality of Multilingual LLMs
Authors:
Yuval Kansal,
Shmuel Berman,
Lydia Liu
Abstract:
Factuality is a necessary precursor to useful educational tools. As adoption of Large Language Models (LLMs) in education continues of grow, ensuring correctness in all settings is paramount. Despite their strong English capabilities, LLM performance in other languages is largely untested. In this work, we evaluate the correctness of the Llama3.1 family of models in answering factual questions app…
▽ More
Factuality is a necessary precursor to useful educational tools. As adoption of Large Language Models (LLMs) in education continues of grow, ensuring correctness in all settings is paramount. Despite their strong English capabilities, LLM performance in other languages is largely untested. In this work, we evaluate the correctness of the Llama3.1 family of models in answering factual questions appropriate for middle and high school students. We demonstrate that LLMs not only provide extraneous and less truthful information, but also exacerbate existing biases against rare languages.
△ Less
Submitted 3 June, 2025;
originally announced June 2025.
-
Grokking vs. Learning: Same Features, Different Encodings
Authors:
Dmitry Manning-Coe,
Jacopo Gliozzi,
Alexander G. Stapleton,
Edward Hirst,
Giuseppe De Tomasi,
Barry Bradlyn,
David S. Berman
Abstract:
Grokking typically achieves similar loss to ordinary, "steady", learning. We ask whether these different learning paths - grokking versus ordinary training - lead to fundamental differences in the learned models. To do so we compare the features, compressibility, and learning dynamics of models trained via each path in two tasks. We find that grokked and steadily trained models learn the same feat…
▽ More
Grokking typically achieves similar loss to ordinary, "steady", learning. We ask whether these different learning paths - grokking versus ordinary training - lead to fundamental differences in the learned models. To do so we compare the features, compressibility, and learning dynamics of models trained via each path in two tasks. We find that grokked and steadily trained models learn the same features, but there can be large differences in the efficiency with which these features are encoded. In particular, we find a novel "compressive regime" of steady training in which there emerges a linear trade-off between model loss and compressibility, and which is absent in grokking. In this regime, we can achieve compression factors 25x times the base model, and 5x times the compression achieved in grokking. We then track how model features and compressibility develop through training. We show that model development in grokking is task-dependent, and that peak compressibility is achieved immediately after the grokking plateau. Finally, novel information-geometric measures are introduced which demonstrate that models undergoing grokking follow a straight path in information space.
△ Less
Submitted 3 February, 2025;
originally announced February 2025.
-
Surface Modification and Subsequent Fermi Density Enhancement of Bi(111)
Authors:
Kuanysh Zhussupbekov,
Killian Walshe,
Brian Walls,
Andrei Ionov,
Sergei I. Bozhko,
Andrei Ksenz,
Rais N. Mozhchil,
Ainur Zhussupbekova,
Karsten Fleischer,
Samuel Berman,
Ivan Zhilyaev,
David D. O'Regan,
Igor V. Shvets
Abstract:
Defects introduced to the surface of Bi(111) break the translational symmetry and modify the surface states locally. We present a theoretical and experimental study of the 2D defects on the surface of Bi(111) and the states that they induce. Bi crystals cleaved in ultrahigh vacuum (UHV) at low temperature (110 K) and the resulting ion-etched surface are investigated by low-energy electron diffract…
▽ More
Defects introduced to the surface of Bi(111) break the translational symmetry and modify the surface states locally. We present a theoretical and experimental study of the 2D defects on the surface of Bi(111) and the states that they induce. Bi crystals cleaved in ultrahigh vacuum (UHV) at low temperature (110 K) and the resulting ion-etched surface are investigated by low-energy electron diffraction (LEED), X-ray photoelectron spectroscopy, ultraviolet photoelectron spectroscopy (UPS), and scanning tunneling microscopy (STM) as well as spectroscopy (STS) techniques in combination with density functional theory (DFT) calculations. STS measurements of cleaved Bi(111) reveal that a commonly observed bilayer step edge has a lower density of states (DOS) around the Fermi level as compared to the atomic-flat terrace. Following ion bombardment, the Bi(111) surface reveals anomalous behavior at both 110 and 300 K: Surface periodicity is observed by LEED, and a significant increase in the number of bilayer step edges and energetically unfavorable monolayer steps is observed by STM. It is suggested that the newly exposed monolayer steps and the type A bilayer step edges result in an increase to the surface Fermi density as evidenced by UPS measurements and the Kohn-Sham DOS. These states appear to be thermodynamically stable under UHV conditions.
△ Less
Submitted 19 December, 2024;
originally announced December 2024.
-
Curvature of an exotic 7-sphere
Authors:
David S. Berman,
Martin Cederwall,
Tancredi Schettini Gherardini
Abstract:
We study the geometry of the Gromoll-Meyer sphere, one of Milnor's exotic $7$-spheres. We focus on a Kaluza-Klein Ansatz, with a round $S^4$ as base space, unit $S^3$ as fibre, and $k=1,2$ $SU(2)$ instantons as gauge fields, where all quantities admit an elegant description in quaternionic language. The metric's moduli space coincides with the $k=1,2$ instantons' moduli space quotiented by the iso…
▽ More
We study the geometry of the Gromoll-Meyer sphere, one of Milnor's exotic $7$-spheres. We focus on a Kaluza-Klein Ansatz, with a round $S^4$ as base space, unit $S^3$ as fibre, and $k=1,2$ $SU(2)$ instantons as gauge fields, where all quantities admit an elegant description in quaternionic language. The metric's moduli space coincides with the $k=1,2$ instantons' moduli space quotiented by the isometry of the base, plus an additional $\mathbb{R}^+$ factor corresponding to the radius of the base, $r$. We identify a "center" of the $k=2$ instanton moduli space with enhanced symmetry. This $k=2$ solution is used together with the maximally symmetric $k=1$ solution to obtain a metric of maximal isometry, $SO(3)\times O(2)$, and to explicitly compute its Ricci tensor. This allows us to put a bound on $r$ to ensure positive Ricci curvature, which implies various energy conditions for an $8$-dimensional static space-time. This construction then enables a concrete examination of the properties of the sectional curvature.
△ Less
Submitted 5 December, 2024; v1 submitted 2 October, 2024;
originally announced October 2024.
-
A Survey on Small-Scale Testbeds for Connected and Automated Vehicles and Robot Swarms
Authors:
Armin Mokhtarian,
Jianye Xu,
Patrick Scheffe,
Maximilian Kloock,
Simon Schäfer,
Heeseung Bang,
Viet-Anh Le,
Sangeet Ulhas,
Johannes Betz,
Sean Wilson,
Spring Berman,
Liam Paull,
Amanda Prorok,
Bassam Alrifaee
Abstract:
Connected and automated vehicles and robot swarms hold transformative potential for enhancing safety, efficiency, and sustainability in the transportation and manufacturing sectors. Extensive testing and validation of these technologies is crucial for their deployment in the real world. While simulations are essential for initial testing, they often have limitations in capturing the complex dynami…
▽ More
Connected and automated vehicles and robot swarms hold transformative potential for enhancing safety, efficiency, and sustainability in the transportation and manufacturing sectors. Extensive testing and validation of these technologies is crucial for their deployment in the real world. While simulations are essential for initial testing, they often have limitations in capturing the complex dynamics of real-world interactions. This limitation underscores the importance of small-scale testbeds. These testbeds provide a realistic, cost-effective, and controlled environment for testing and validating algorithms, acting as an essential intermediary between simulation and full-scale experiments. This work serves to facilitate researchers' efforts in identifying existing small-scale testbeds suitable for their experiments and provide insights for those who want to build their own. In addition, it delivers a comprehensive survey of the current landscape of these testbeds. We derive 62 characteristics of testbeds based on the well-known sense-plan-act paradigm and offer an online table comparing 23 small-scale testbeds based on these characteristics. The online table is hosted on our designated public webpage https://bassamlab.github.io/testbeds-survey, and we invite testbed creators and developers to contribute to it. We closely examine nine testbeds in this paper, demonstrating how the derived characteristics can be used to present testbeds. Furthermore, we discuss three ongoing challenges concerning small-scale testbeds that we identified, i.e., small-scale to full-scale transition, sustainability, and power and resource management.
△ Less
Submitted 21 November, 2024; v1 submitted 26 August, 2024;
originally announced August 2024.
-
The temporal conceptual data modelling language TREND
Authors:
Sonia Berman,
C. Maria Keet,
Tamindran Shunmugam
Abstract:
Temporal conceptual data modelling, as an extension to regular conceptual data modelling languages such as EER and UML class diagrams, has received intermittent attention across the decades. It is receiving renewed interest in the context of, among others, business process modelling that needs robust expressive data models to complement them. None of the proposed temporal conceptual data modelling…
▽ More
Temporal conceptual data modelling, as an extension to regular conceptual data modelling languages such as EER and UML class diagrams, has received intermittent attention across the decades. It is receiving renewed interest in the context of, among others, business process modelling that needs robust expressive data models to complement them. None of the proposed temporal conceptual data modelling languages have been tested on understandability and usability by modellers, however, nor is it clear which temporal constraints would be used by modellers or whether the ones included are the relevant temporal constraints. We therefore sought to investigate temporal representations in temporal conceptual data modelling languages, design a, to date, most expressive language, TREND, through small-scale qualitative experiments, and finalise the graphical notation and modelling and understanding in large scale experiments. This involved a series of 11 experiments with over a thousand participants in total, having created 246 temporal conceptual data models. Key outcomes are that choice of label for transition constraints had limited impact, as did extending explanations of the modelling language, but expressing what needs to be modelled in controlled natural language did improve model quality. The experiments also indicate that more training may be needed, in particular guidance for domain experts, to achieve adoption of temporal conceptual data modelling by the community.
△ Less
Submitted 18 August, 2024;
originally announced August 2024.
-
Solving Zebra Puzzles Using Constraint-Guided Multi-Agent Systems
Authors:
Shmuel Berman,
Kathleen McKeown,
Baishakhi Ray
Abstract:
Prior research has enhanced the ability of Large Language Models (LLMs) to solve logic puzzles using techniques such as chain-of-thought prompting or introducing a symbolic representation. These frameworks are still usually insufficient to solve complicated logical problems, such as Zebra puzzles, due to the inherent complexity of translating natural language clues into logical statements. We intr…
▽ More
Prior research has enhanced the ability of Large Language Models (LLMs) to solve logic puzzles using techniques such as chain-of-thought prompting or introducing a symbolic representation. These frameworks are still usually insufficient to solve complicated logical problems, such as Zebra puzzles, due to the inherent complexity of translating natural language clues into logical statements. We introduce a multi-agent system, ZPS, that integrates LLMs with an off the shelf theorem prover. This system tackles the complex puzzle-solving task by breaking down the problem into smaller, manageable parts, generating SMT (Satisfiability Modulo Theories) code to solve them with a theorem prover, and using feedback between the agents to repeatedly improve their answers. We also introduce an automated grid puzzle grader to assess the correctness of our puzzle solutions and show that the automated grader is reliable by evaluating it in a user-study. Our approach shows improvement in all three LLMs we tested, with GPT-4 showing 166% improvement in the number of fully correct solutions.
△ Less
Submitted 9 July, 2024; v1 submitted 4 July, 2024;
originally announced July 2024.
-
PathFinder: Attention-Driven Dynamic Non-Line-of-Sight Tracking with a Mobile Robot
Authors:
Shenbagaraj Kannapiran,
Sreenithy Chandran,
Suren Jayasuriya,
Spring Berman
Abstract:
The study of non-line-of-sight (NLOS) imaging is growing due to its many potential applications, including rescue operations and pedestrian detection by self-driving cars. However, implementing NLOS imaging on a moving camera remains an open area of research. Existing NLOS imaging methods rely on time-resolved detectors and laser configurations that require precise optical alignment, making it dif…
▽ More
The study of non-line-of-sight (NLOS) imaging is growing due to its many potential applications, including rescue operations and pedestrian detection by self-driving cars. However, implementing NLOS imaging on a moving camera remains an open area of research. Existing NLOS imaging methods rely on time-resolved detectors and laser configurations that require precise optical alignment, making it difficult to deploy them in dynamic environments. This work proposes a data-driven approach to NLOS imaging, PathFinder, that can be used with a standard RGB camera mounted on a small, power-constrained mobile robot, such as an aerial drone. Our experimental pipeline is designed to accurately estimate the 2D trajectory of a person who moves in a Manhattan-world environment while remaining hidden from the camera's field-of-view. We introduce a novel approach to process a sequence of dynamic successive frames in a line-of-sight (LOS) video using an attention-based neural network that performs inference in real-time. The method also includes a preprocessing selection metric that analyzes images from a moving camera which contain multiple vertical planar surfaces, such as walls and building facades, and extracts planes that return maximum NLOS information. We validate the approach on in-the-wild scenes using a drone for video capture, thus demonstrating low-cost NLOS imaging in dynamic capture environments.
△ Less
Submitted 7 April, 2024;
originally announced April 2024.
-
NCoder -- A Quantum Field Theory approach to encoding data
Authors:
David S. Berman,
Marc S. Klinger,
Alexander G. Stapleton
Abstract:
In this paper we present a novel approach to interpretable AI inspired by Quantum Field Theory (QFT) which we call the NCoder. The NCoder is a modified autoencoder neural network whose latent layer is prescribed to be a subset of $n$-point correlation functions. Regarding images as draws from a lattice field theory, this architecture mimics the task of perturbatively constructing the effective act…
▽ More
In this paper we present a novel approach to interpretable AI inspired by Quantum Field Theory (QFT) which we call the NCoder. The NCoder is a modified autoencoder neural network whose latent layer is prescribed to be a subset of $n$-point correlation functions. Regarding images as draws from a lattice field theory, this architecture mimics the task of perturbatively constructing the effective action of the theory order by order in an expansion using Feynman diagrams. Alternatively, the NCoder may be regarded as simulating the procedure of statistical inference whereby high dimensional data is first summarized in terms of several lower dimensional summary statistics (here the $n$-point correlation functions), and subsequent out-of-sample data is generated by inferring the data generating distribution from these statistics. In this way the NCoder suggests a fascinating correspondence between perturbative renormalizability and the sufficiency of models. We demonstrate the efficacy of the NCoder by applying it to the generation of MNIST images, and find that generated images can be correctly classified using only information from the first three $n$-point functions of the image distribution.
△ Less
Submitted 3 June, 2025; v1 submitted 1 February, 2024;
originally announced February 2024.
-
Reconciling the theoretical and experimental electronic structure of NbO2
Authors:
Samuel Berman,
Ainur Zhussupbekova,
Jos E. Boschker,
Jutta Schwarzkopf,
David D. O'Regan,
Igor V. Shvets,
Kuanysh Zhussupbekov
Abstract:
Metal-insulator transition materials such as NbO2 have generated much excitement in recent years for their potential applications in computing and sensing. NbO2 has generated considerable debate over the nature of the phase transition, and the values for the band gap/band widths in the insulating phase. We present a combined theoretical and experimental study of the band gap and electronic structu…
▽ More
Metal-insulator transition materials such as NbO2 have generated much excitement in recent years for their potential applications in computing and sensing. NbO2 has generated considerable debate over the nature of the phase transition, and the values for the band gap/band widths in the insulating phase. We present a combined theoretical and experimental study of the band gap and electronic structure of the insulating phase of NbO2. We carry out ab-initio density functional theory plus U calculations, directly determining U and J parameters for both the Nb 4d and O 2p subspaces through the recently introduced minimum-tracking linear response method. We find a fundamental bulk band gap of 0.80 eV for the full DFT+U+J theory. We also perform calculations and measurements for a (100) oriented thin film. Scanning tunnelling spectroscopy measurements show that the surface band gap varies from 0.75 eV to 1.35 eV due to an excess of oxygen in and near the surface region of the film. Slab calculations indicate metallicity localised at the surface region caused by an energy level shift consistent with a reduction in Coulomb repulsion. We demonstrate that this effect in combination with the simple, low cost DFT+U+J method can account for the band widths and p-d gap observed in X-ray photoelectron spectroscopy experiments. Overall, our results indicate the possible presence of a 2D anisotropic metallic layer at the (100) surface of NbO2.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
Unravelling the atomic and electronic structure of nanocrystals on superconducting Nb(110): Impact of the oxygen monolayer
Authors:
Samuel Berman,
Ainur Zhussupbekova,
Brian Walls,
Killian Walshe,
Sergei I. Bozhko,
Andrei Ionov,
David D. O'Regan,
Igor V. Shvets,
Kuanysh Zhussupbekov
Abstract:
The Niobium surface is almost always covered by a native oxide layer which greatly influences the performance of superconducting devices. Here we investigate the highly stable Niobium oxide overlayer of Nb(110), which is characterised by its distinctive nanocrystal structure as observed by scanning tunnelling microscopy (STM). Our ab-initio density functional theory (DFT) calculations show that a…
▽ More
The Niobium surface is almost always covered by a native oxide layer which greatly influences the performance of superconducting devices. Here we investigate the highly stable Niobium oxide overlayer of Nb(110), which is characterised by its distinctive nanocrystal structure as observed by scanning tunnelling microscopy (STM). Our ab-initio density functional theory (DFT) calculations show that a subtle reconstruction in the surface Niobium atoms gives rise to rows of 4-fold coordinated oxygen separated by regions of 3-fold coordinated oxygen. The 4-fold oxygen rows are determined to be the source of the nanocrystal pattern observed in STM, and the two chemical states of oxygen observed in core-level X-ray photoelectron spectroscopy (XPS) are ascribed to the 3-fold and 4-fold oxygens. Furthermore, we find excellent agreement between the DFT calculated electronic structure with scanning tunnelling spectroscopy and valence XPS measurements.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
Density Stabilization Strategies for Nonholonomic Agents on Compact Manifolds
Authors:
Karthik Elamvazhuthi,
Spring Berman
Abstract:
In this article, we consider the problem of stabilizing stochastic processes, which are constrained to a bounded Euclidean domain or a compact smooth manifold, to a given target probability density. Most existing works on modeling and control of robotic swarms that use PDE models assume that the robots' dynamics are holonomic, and hence, the associated stochastic processes have generators that are…
▽ More
In this article, we consider the problem of stabilizing stochastic processes, which are constrained to a bounded Euclidean domain or a compact smooth manifold, to a given target probability density. Most existing works on modeling and control of robotic swarms that use PDE models assume that the robots' dynamics are holonomic, and hence, the associated stochastic processes have generators that are elliptic. We relax this assumption on the ellipticity of the generator of the stochastic processes, and consider the more practical case of the stabilization problem for a swarm of agents whose dynamics are given by a controllable driftless control-affine system. We construct state-feedback control laws that exponentially stabilize a swarm of nonholonomic agents to a target probability density that is sufficiently regular. State-feedback laws can stabilize a swarm only to target probability densities that are positive everywhere. To stabilize the swarm to probability densities that possibly have disconnected supports, we introduce a semilinear PDE model of a collection of interacting agents governed by a hybrid switching diffusion process. The interaction between the agents is modeled using a (mean-field) feedback law that is a function of the local density of the swarm, with the switching parameters as the control inputs. We show that the semilinear PDE system is globally asymptotically stable about the given target probability density. The stabilization strategies are verified without inter-agent interactions is verified numerically for agents that evolve according to the Brockett integrator and a nonholonomic system on the special orthogonal group of 3-dimensional rotations $SO(3)$. The stabilization strategy with inter-agent interactions is verified numerically for agents that evolve according to the Brockett integrator and a holonomic system on the sphere $S^2$.
△ Less
Submitted 7 May, 2024; v1 submitted 30 August, 2023;
originally announced August 2023.
-
Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network
Authors:
Shenbagaraj Kannapiran,
Nalin Bendapudi,
Ming-Yuan Yu,
Devarth Parikh,
Spring Berman,
Ankit Vora,
Gaurav Pandey
Abstract:
Robust feature matching forms the backbone for most Visual Simultaneous Localization and Mapping (vSLAM), visual odometry, 3D reconstruction, and Structure from Motion (SfM) algorithms. However, recovering feature matches from texture-poor scenes is a major challenge and still remains an open area of research. In this paper, we present a Stereo Visual Odometry (StereoVO) technique based on point a…
▽ More
Robust feature matching forms the backbone for most Visual Simultaneous Localization and Mapping (vSLAM), visual odometry, 3D reconstruction, and Structure from Motion (SfM) algorithms. However, recovering feature matches from texture-poor scenes is a major challenge and still remains an open area of research. In this paper, we present a Stereo Visual Odometry (StereoVO) technique based on point and line features which uses a novel feature-matching mechanism based on an Attention Graph Neural Network that is designed to perform well even under adverse weather conditions such as fog, haze, rain, and snow, and dynamic lighting conditions such as nighttime illumination and glare scenarios. We perform experiments on multiple real and synthetic datasets to validate the ability of our method to perform StereoVO under low visibility weather and lighting conditions through robust point and line matches. The results demonstrate that our method achieves more line feature matches than state-of-the-art line matching algorithms, which when complemented with point feature matches perform consistently well in adverse weather and dynamic lighting conditions.
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
Bayesian Renormalization
Authors:
David S. Berman,
Marc S. Klinger,
Alexander G. Stapleton
Abstract:
In this note we present a fully information theoretic approach to renormalization inspired by Bayesian statistical inference, which we refer to as Bayesian Renormalization. The main insight of Bayesian Renormalization is that the Fisher metric defines a correlation length that plays the role of an emergent RG scale quantifying the distinguishability between nearby points in the space of probabilit…
▽ More
In this note we present a fully information theoretic approach to renormalization inspired by Bayesian statistical inference, which we refer to as Bayesian Renormalization. The main insight of Bayesian Renormalization is that the Fisher metric defines a correlation length that plays the role of an emergent RG scale quantifying the distinguishability between nearby points in the space of probability distributions. This RG scale can be interpreted as a proxy for the maximum number of unique observations that can be made about a given system during a statistical inference experiment. The role of the Bayesian Renormalization scheme is subsequently to prepare an effective model for a given system up to a precision which is bounded by the aforementioned scale. In applications of Bayesian Renormalization to physical systems, the emergent information theoretic scale is naturally identified with the maximum energy that can be probed by current experimental apparatus, and thus Bayesian Renormalization coincides with ordinary renormalization. However, Bayesian Renormalization is sufficiently general to apply even in circumstances in which an immediate physical scale is absent, and thus provides an ideal approach to renormalization in data science contexts. To this end, we provide insight into how the Bayesian Renormalization scheme relates to existing methods for data compression and data generation such as the information bottleneck and the diffusion learning paradigm. We conclude by designing an explicit form of Bayesian Renormalization inspired by Wilson's momentum shell renormalization scheme in Quantum Field Theory. We apply this Bayesian Renormalization scheme to a simple Neural Network and verify the sense in which it organizes the parameters of the model according to a hierarchy of information theoretic importance.
△ Less
Submitted 9 October, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Metal-Organic Chemical Vapor Deposition of PtSe2
Authors:
Maximilian Prechtl,
Marc Busch,
Oliver Hartwig,
Kangho Lee,
Tanja Stimpel-Lindner,
Cormac Ó Coileáin,
Kuanysh Zhussupbekov,
Ainur Zhussupbekova,
Samuel Berman,
Igor V. Shvets,
Georg S. Duesberg
Abstract:
Platinum diselenide (PtSe2), a novel two-dimensional material from the class of noble-metal dichalcogenide (NMD), has recently received significant attention due to its outstanding properties. PtSe2, which undergoes a semi metallic to semiconductor transition when thinned, offers a band-gap in the infrared range and good air stability. These properties make it a prime active material in optoelectr…
▽ More
Platinum diselenide (PtSe2), a novel two-dimensional material from the class of noble-metal dichalcogenide (NMD), has recently received significant attention due to its outstanding properties. PtSe2, which undergoes a semi metallic to semiconductor transition when thinned, offers a band-gap in the infrared range and good air stability. These properties make it a prime active material in optoelectronic and chemical sensing devices. However, a synthesis method that can produce large-scale and reliable high quality PtSe2 is highly sought after. Here, we present PtSe2 growth by metal organic chemical vapor deposition. Films were grown on a variety of centimeter scale substrates and were characterized by Raman, X-ray photoelectron and X-ray diffraction spectroscopy, as well as scanning tunneling microscopy and spectroscopy. Domains within the films are found to be up to several hundred nanometers in size, and atomic scale measurements show their highly ordered crystalline structure. The thickness of homogenous films can be controlled via the growth time. This work provides fundamental guidance for the synthesis and implementation of high quality, large-scale PtSe2 layers, hence offering the key requirement for the implementation of PtSe2 in future electronic devices.
△ Less
Submitted 2 February, 2023; v1 submitted 31 January, 2023;
originally announced January 2023.
-
The Inverse of Exact Renormalization Group Flows as Statistical Inference
Authors:
David S. Berman,
Marc S. Klinger
Abstract:
We build on the view of the Exact Renormalization Group (ERG) as an instantiation of Optimal Transport described by a functional convection-diffusion equation. We provide a new information theoretic perspective for understanding the ERG through the intermediary of Bayesian Statistical Inference. This connection is facilitated by the Dynamical Bayesian Inference scheme, which encodes Bayesian infer…
▽ More
We build on the view of the Exact Renormalization Group (ERG) as an instantiation of Optimal Transport described by a functional convection-diffusion equation. We provide a new information theoretic perspective for understanding the ERG through the intermediary of Bayesian Statistical Inference. This connection is facilitated by the Dynamical Bayesian Inference scheme, which encodes Bayesian inference in the form of a one parameter family of probability distributions solving an integro-differential equation derived from Bayes' law. In this note, we demonstrate how the Dynamical Bayesian Inference equation is, itself, equivalent to a diffusion equation which we dub Bayesian Diffusion. Identifying the features that define Bayesian Diffusion, and mapping them onto the features that define the ERG, we obtain a dictionary outlining how renormalization can be understood as the inverse of statistical inference.
△ Less
Submitted 1 May, 2024; v1 submitted 21 December, 2022;
originally announced December 2022.
-
Single-Shot Domain Adaptation via Target-Aware Generative Augmentation
Authors:
Rakshith Subramanyam,
Kowshik Thopalli,
Spring Berman,
Pavan Turaga,
Jayaraman J. Thiagarajan
Abstract:
The problem of adapting models from a source domain using data from any target domain of interest has gained prominence, thanks to the brittle generalization in deep neural networks. While several test-time adaptation techniques have emerged, they typically rely on synthetic data augmentations in cases of limited target data availability. In this paper, we consider the challenging setting of singl…
▽ More
The problem of adapting models from a source domain using data from any target domain of interest has gained prominence, thanks to the brittle generalization in deep neural networks. While several test-time adaptation techniques have emerged, they typically rely on synthetic data augmentations in cases of limited target data availability. In this paper, we consider the challenging setting of single-shot adaptation and explore the design of augmentation strategies. We argue that augmentations utilized by existing methods are insufficient to handle large distribution shifts, and hence propose a new approach SiSTA (Single-Shot Target Augmentations), which first fine-tunes a generative model from the source domain using a single-shot target, and then employs novel sampling strategies for curating synthetic target data. Using experiments with a state-of-the-art domain adaptation method, we find that SiSTA produces improvements as high as 20\% over existing baselines under challenging shifts in face attribute detection, and that it performs competitively to oracle models obtained by training on a larger target dataset.
△ Less
Submitted 29 October, 2022;
originally announced October 2022.
-
CHARTOPOLIS: A Small-Scale Labor-art-ory for Research and Reflection on Autonomous Vehicles, Human-Robot Interaction, and Sociotechnical Imaginaries
Authors:
Sangeet Sankaramangalam Ulhas,
Aditya Ravichander,
Kathryn A. Johnson,
Theodore P. Pavlic,
Lance Gharavi,
Spring Berman
Abstract:
CHARTOPOLIS is a multi-faceted sociotechnical testbed meant to aid in building connections among engineers, psychologists, anthropologists, ethicists, and artists. Superficially, it is an urban autonomous-vehicle testbed that includes both a physical environment for small-scale robotic vehicles as well as a high-fidelity virtual replica that provides extra flexibility by way of computer simulation…
▽ More
CHARTOPOLIS is a multi-faceted sociotechnical testbed meant to aid in building connections among engineers, psychologists, anthropologists, ethicists, and artists. Superficially, it is an urban autonomous-vehicle testbed that includes both a physical environment for small-scale robotic vehicles as well as a high-fidelity virtual replica that provides extra flexibility by way of computer simulation. However, both environments have been developed to allow for participatory simulation with human drivers as well. Each physical vehicle can be remotely operated by human drivers that have a driver-seat point of view that immerses them within the small-scale testbed, and those same drivers can also pilot high-fidelity models of those vehicles in a virtual replica of the environment. Juxtaposing human driving performance across these two contexts will help identify to what extent human driving behaviors are sensorimotor responses or involve psychological engagement with a system that has physical, not virtual, side effects and consequences. Furthermore, through collaboration with artists, we have designed the physical testbed to make tangible the reality that technological advancement causes the history of a city to fork into multiple, parallel timelines that take place within populations whose increasing isolation effectively creates multiple independent cities in one. Ultimately, CHARTOPOLIS is meant to challenge engineers to take a more holistic view when designing autonomous systems, while also enabling them to gather novel data that will assist them in making these systems more trustworthy.
△ Less
Submitted 1 October, 2022;
originally announced October 2022.
-
Configuration Tracking Control of a Multi-Segment Soft Robotic Arm Using a Cosserat Rod Model
Authors:
Azadeh Doroudchi,
Zhi Qiao,
Wenlong Zhang,
Spring Berman
Abstract:
Controlling soft continuum robotic arms is challenging due to their hyper-redundancy and dexterity. In this paper we demonstrate, for the first time, closed-loop control of the configuration space variables of a soft robotic arm, composed of independently controllable segments, using a Cosserat rod model of the robot and the distributed sensing and actuation capabilities of the segments. Our contr…
▽ More
Controlling soft continuum robotic arms is challenging due to their hyper-redundancy and dexterity. In this paper we demonstrate, for the first time, closed-loop control of the configuration space variables of a soft robotic arm, composed of independently controllable segments, using a Cosserat rod model of the robot and the distributed sensing and actuation capabilities of the segments. Our controller solves the inverse dynamic problem by simulating the Cosserat rod model in MATLAB using a computationally efficient numerical solution scheme, and it applies the computed control output to the actual robot in real time. The position and orientation of the tip of each segment are measured in real time, while the remaining unknown variables that are needed to solve the inverse dynamics are estimated simultaneously in the simulation. We implement the controller on a multi-segment silicone robotic arm with pneumatic actuation, using a motion capture system to measure the segments' positions and orientations. The controller is used to reshape the arm into configurations that are achieved through different combinations of bending and extension deformations in 3D space. The resulting tracking performance indicates the effectiveness of the controller and the accuracy of the simulated Cosserat rod model that is used to estimate the unmeasured variables.
△ Less
Submitted 30 September, 2022;
originally announced October 2022.
-
Twisted Self-duality
Authors:
David S. Berman,
Tancredi Schettini Gherardini
Abstract:
We examine a generalisation of the usual self-duality equations for Yang-Mills theory when the colour space admits a non-trivial involution. This involution allows us to construct a non-trivial twist which may be combined with the Hodge star to form a twisted self-dual curvature. We will construct a simple example of twisted self-duality for $su(2) \oplus su(2)$ gauge theory along with its explici…
▽ More
We examine a generalisation of the usual self-duality equations for Yang-Mills theory when the colour space admits a non-trivial involution. This involution allows us to construct a non-trivial twist which may be combined with the Hodge star to form a twisted self-dual curvature. We will construct a simple example of twisted self-duality for $su(2) \oplus su(2)$ gauge theory along with its explicit solutions and then dimensionally reduce from four dimensions to obtain families of non-trivial non-linear equations in lower dimensions. This twisted self-duality constraint will be shown to arise in E_7 exceptional field theory through a Scherk-Schwarz reduction and we will show how an Eguchi-Hanson gravitational instanton also obeys the twisted self-duality condition.
△ Less
Submitted 14 September, 2022; v1 submitted 21 August, 2022;
originally announced August 2022.
-
NMPC-LBF: Nonlinear MPC with Learned Barrier Function for Decentralized Safe Navigation of Multiple Robots in Unknown Environments
Authors:
Amir Salimi Lafmejani,
Spring Berman,
Georgios Fainekos
Abstract:
In this paper, we present a decentralized control approach based on a Nonlinear Model Predictive Control (NMPC) method that employs barrier certificates for safe navigation of multiple nonholonomic wheeled mobile robots in unknown environments with static and/or dynamic obstacles. This method incorporates a Learned Barrier Function (LBF) into the NMPC design in order to guarantee safe robot naviga…
▽ More
In this paper, we present a decentralized control approach based on a Nonlinear Model Predictive Control (NMPC) method that employs barrier certificates for safe navigation of multiple nonholonomic wheeled mobile robots in unknown environments with static and/or dynamic obstacles. This method incorporates a Learned Barrier Function (LBF) into the NMPC design in order to guarantee safe robot navigation, i.e., prevent robot collisions with other robots and the obstacles. We refer to our proposed control approach as NMPC-LBF. Since each robot does not have a priori knowledge about the obstacles and other robots, we use a Deep Neural Network (DeepNN) running in real-time on each robot to learn the Barrier Function (BF) only from the robot's LiDAR and odometry measurements. The DeepNN is trained to learn the BF that separates safe and unsafe regions. We implemented our proposed method on simulated and actual Turtlebot3 Burger robot(s) in different scenarios. The implementation results show the effectiveness of the NMPC-LBF method at ensuring safe navigation of the robots.
△ Less
Submitted 16 August, 2022;
originally announced August 2022.
-
Robust optimal density control of robotic swarms
Authors:
Carlo Sinigaglia,
Andrea Manzoni,
Francesco Braghin,
Spring Berman
Abstract:
In this paper, we propose a computationally efficient, robust density control strategy for the mean-field model of a robotic swarm. We formulate a static optimal control problem (OCP) that computes a robot velocity field which drives the swarm to a target equilibrium density, and we prove the stability of the controlled system in the presence of transient perturbations and uncertainties in the ini…
▽ More
In this paper, we propose a computationally efficient, robust density control strategy for the mean-field model of a robotic swarm. We formulate a static optimal control problem (OCP) that computes a robot velocity field which drives the swarm to a target equilibrium density, and we prove the stability of the controlled system in the presence of transient perturbations and uncertainties in the initial conditions. The density dynamics are described by a linear elliptic advection-diffusion equation in which the control enters bilinearly into the advection term. The well-posedness of the state problem is ensured by an integral constraint. We prove the existence of optimal controls by embedding the state constraint into the weak formulation of the state dynamics. The resulting control field is space-dependent and does not require any communication between robots or costly density estimation algorithms. Based on the properties of the primal and dual systems, we first propose a method to accommodate the state constraint. Exploiting the properties of the state dynamics and associated controls, we then construct a modified dynamic OCP to speed up the convergence to the target equilibrium density of the associated static problem. We then show that the finite-element discretization of the static and dynamic OCPs inherits the structure and several useful properties of their infinite-dimensional formulations. Finally, we demonstrate the effectiveness of our control approach through numerical simulations of scenarios with obstacles and an external velocity field.
△ Less
Submitted 4 December, 2022; v1 submitted 25 May, 2022;
originally announced May 2022.
-
On the Dynamics of Inference and Learning
Authors:
David S. Berman,
Jonathan J. Heckman,
Marc Klinger
Abstract:
Statistical Inference is the process of determining a probability distribution over the space of parameters of a model given a data set. As more data becomes available this probability distribution becomes updated via the application of Bayes' theorem. We present a treatment of this Bayesian updating process as a continuous dynamical system. Statistical inference is then governed by a first order…
▽ More
Statistical Inference is the process of determining a probability distribution over the space of parameters of a model given a data set. As more data becomes available this probability distribution becomes updated via the application of Bayes' theorem. We present a treatment of this Bayesian updating process as a continuous dynamical system. Statistical inference is then governed by a first order differential equation describing a trajectory or flow in the information geometry determined by a parametric family of models. We solve this equation for some simple models and show that when the Cramér-Rao bound is saturated the learning rate is governed by a simple $1/T$ power-law, with $T$ a time-like variable denoting the quantity of data. The presence of hidden variables can be incorporated in this setting, leading to an additional driving term in the resulting flow equation. We illustrate this with both analytic and numerical examples based on Gaussians and Gaussian Random Processes and inference of the coupling constant in the 1D Ising model. Finally we compare the qualitative behaviour exhibited by Bayesian flows to the training of various neural networks on benchmarked data sets such as MNIST and CIFAR10 and show how that for networks exhibiting small final losses the simple power-law is also satisfied.
△ Less
Submitted 19 April, 2022;
originally announced April 2022.
-
Indirect Optimal Control of Advection-Diffusion Fields through Robotic Swarms
Authors:
Carlo Sinigaglia,
Andrea Manzoni,
Francesco Braghin,
Spring Berman
Abstract:
In this paper, we consider the problem of optimally guiding a large-scale swarm of underwater vehicles that is tasked with the indirect control of an advection-diffusion environmental field. The microscopic vehicle dynamics are governed by a stochastic differential equation with drift. The drift terms model the self-propelled velocity of the vehicle and the velocity field of the currents. In the m…
▽ More
In this paper, we consider the problem of optimally guiding a large-scale swarm of underwater vehicles that is tasked with the indirect control of an advection-diffusion environmental field. The microscopic vehicle dynamics are governed by a stochastic differential equation with drift. The drift terms model the self-propelled velocity of the vehicle and the velocity field of the currents. In the mean-field setting, the macroscopic vehicle dynamics are governed by a Kolmogorov forward equation in the form of a linear parabolic advection-diffusion equation. The environmental field is governed by an advection-diffusion equation in which the advection term is defined by the fluid velocity field. The vehicles are equipped with on-board actuators that enable the swarm to act as a distributed source in the environmental field, modulated by a scalar control parameter that determines the local source intensity. In this setting, we formulate an optimal control problem to compute the vehicle velocity and actuator intensity fields that drive the environmental field to a desired distribution within a specified amount of time. In other words, we design optimal vector and scalar actuation fields to indirectly control the environmental field through a distributed source, produced by the swarm. After proving an existence result for the solution of the optimal control problem, we discretize and solve the problem using the Finite Element Method (FEM). The FEM discretization naturally provides an operator that represents the bilinear way in which the controls enter into the dynamics of the vehicle swarm and the environmental field. Finally, we show through numerical simulations the effectiveness of our control strategy in regulating the environmental field to zero or to a desired distribution in the presence of a double-gyre flow field.
△ Less
Submitted 18 February, 2022;
originally announced February 2022.
-
Probabilistic Consensus on Feature Distribution for Multi-robot Systems with Markovian Exploration Dynamics
Authors:
Aniket Shirsat,
Shatadal Mishra,
Wenlong Zhang,
Spring Berman
Abstract:
In this paper, we present a consensus-based decentralized multi-robot approach to reconstruct a discrete distribution of features, modeled as an occupancy grid map, that represent information contained in a bounded planar 2D environment, such as visual cues used for navigation or semantic labels associated with object detection. The robots explore the environment according to a random walk modeled…
▽ More
In this paper, we present a consensus-based decentralized multi-robot approach to reconstruct a discrete distribution of features, modeled as an occupancy grid map, that represent information contained in a bounded planar 2D environment, such as visual cues used for navigation or semantic labels associated with object detection. The robots explore the environment according to a random walk modeled by a discrete-time discrete-state (DTDS) Markov chain and estimate the feature distribution from their own measurements and the estimates communicated by neighboring robots, using a distributed Chernoff fusion protocol. We prove that under this decentralized fusion protocol, each robot's feature distribution converges to the ground truth distribution in an almost sure sense. We verify this result in numerical simulations that show that the Hellinger distance between the estimated and ground truth feature distributions converges to zero over time for each robot. We also validate our strategy through Software-In-The-Loop (SITL) simulations of quadrotors that search a bounded square grid for a set of visual features distributed on a discretized circle.
△ Less
Submitted 26 April, 2022; v1 submitted 7 February, 2022;
originally announced February 2022.
-
Double copying Exceptional Field theories
Authors:
David S. Berman,
Kwangeon Kim,
Kanghoon Lee
Abstract:
We examine exceptional field theory through the lens of the generalised double copy formalism. This allows us to construct classical solutions in M-theory using a generalised Kerr-Schild ansatz and along the way indicates hints towards a single copy of M-theory. Based on a talk at the Nankai Symposium given by DSB.
We examine exceptional field theory through the lens of the generalised double copy formalism. This allows us to construct classical solutions in M-theory using a generalised Kerr-Schild ansatz and along the way indicates hints towards a single copy of M-theory. Based on a talk at the Nankai Symposium given by DSB.
△ Less
Submitted 26 January, 2022;
originally announced January 2022.
-
Geometric Quantization: Particles, Fields and Strings
Authors:
David S Berman,
Gabriel Cardoso
Abstract:
These notes present an introduction to the method of geometric quantization. We discuss the main theorems in a style suitable for a theoretical physicist with an eye towards the physical motivation and the interpretation of the geometric construction as providing a solution to Dirac's axioms of quantization. We provide in detail the examples of free relativistic particles, their corresponding quan…
▽ More
These notes present an introduction to the method of geometric quantization. We discuss the main theorems in a style suitable for a theoretical physicist with an eye towards the physical motivation and the interpretation of the geometric construction as providing a solution to Dirac's axioms of quantization. We provide in detail the examples of free relativistic particles, their corresponding quantum fields, and the bosonic string using formalism of double field theory. Based on lectures written by Gabriel Cardoso.
△ Less
Submitted 2 January, 2022;
originally announced January 2022.
-
Machine Learning Calabi-Yau Hypersurfaces
Authors:
David S. Berman,
Yang-Hui He,
Edward Hirst
Abstract:
We revisit the classic database of weighted-P4s which admit Calabi-Yau 3-fold hypersurfaces equipped with a diverse set of tools from the machine-learning toolbox. Unsupervised techniques identify an unanticipated almost linear dependence of the topological data on the weights. This then allows us to identify a previously unnoticed clustering in the Calabi-Yau data. Supervised techniques are succe…
▽ More
We revisit the classic database of weighted-P4s which admit Calabi-Yau 3-fold hypersurfaces equipped with a diverse set of tools from the machine-learning toolbox. Unsupervised techniques identify an unanticipated almost linear dependence of the topological data on the weights. This then allows us to identify a previously unnoticed clustering in the Calabi-Yau data. Supervised techniques are successful in predicting the topological parameters of the hypersurface from its weights with an accuracy of R^2 > 95%. Supervised learning also allows us to identify weighted-P4s which admit Calabi-Yau hypersurfaces to 100% accuracy by making use of partitioning supported by the clustering behaviour.
△ Less
Submitted 19 January, 2022; v1 submitted 12 December, 2021;
originally announced December 2021.
-
Noise-induced aggregation of swimmers in the Kolmogorov flow
Authors:
Simon A. Berman,
Kyle S. Ferguson,
Nathaniel Bizzak,
Thomas H. Solomon,
Kevin A. Mitchell
Abstract:
We investigate a model for the dynamics of ellipsoidal microswimmers in an externally imposed, laminar Kolmogorov flow. Through a phase-space analysis of the dynamics without noise, we find that swimmers favor either cross-stream or rotational drift, depending on their swimming speed and aspect ratio. When including noise, i.e. rotational diffusion, we find that swimmers are driven into certain pa…
▽ More
We investigate a model for the dynamics of ellipsoidal microswimmers in an externally imposed, laminar Kolmogorov flow. Through a phase-space analysis of the dynamics without noise, we find that swimmers favor either cross-stream or rotational drift, depending on their swimming speed and aspect ratio. When including noise, i.e. rotational diffusion, we find that swimmers are driven into certain parts of phase space, leading to a nonuniform steady-state distribution. This distribution exhibits a transition from swimmer aggregation in low-shear regions of the flow to aggregation in high-shear regions as the swimmer's speed, aspect ratio, and rotational diffusivity are varied. To explain the nonuniform phase-space distribution of swimmers, we apply a weak-noise averaging principle that produces a reduced description of the stochastic swimmer dynamics. Using this technique, we find that certain swimmer trajectories are more favorable than others in the presence of weak rotational diffusion. By combing this information with the phase-space speed of swimmers along each trajectory, we predict the regions of phase space where swimmers tend to accumulate. The results of the averaging technique are in good agreement with direct calculations of the steady-state distributions of swimmers. In particular, our analysis explains the transition from low-shear to high-shear aggregation.
△ Less
Submitted 17 November, 2021;
originally announced November 2021.
-
Optimal Control of Velocity and Nonlocal Interactions in the Mean-Field Kuramoto Model
Authors:
Carlo Sinigaglia,
Francesco Braghin,
Spring Berman
Abstract:
In this paper, we investigate how the self-synchronization property of a swarm of Kuramoto oscillators can be controlled and exploited to achieve target densities and target phase coherence. In the limit of an infinite number of oscillators, the collective dynamics of the agents' density is described by a mean-field model in the form of a nonlocal PDE, where the nonlocality arises from the synchro…
▽ More
In this paper, we investigate how the self-synchronization property of a swarm of Kuramoto oscillators can be controlled and exploited to achieve target densities and target phase coherence. In the limit of an infinite number of oscillators, the collective dynamics of the agents' density is described by a mean-field model in the form of a nonlocal PDE, where the nonlocality arises from the synchronization mechanism. In this mean-field setting, we introduce two space-time dependent control inputs to affect the density of the oscillators: an angular velocity field that corresponds to a state feedback law for individual agents, and a control parameter that modulates the strength of agent interactions over space and time, i.e., a multiplicative control with respect to the integral nonlocal term. We frame the density tracking problem as a PDE-constrained optimization problem. The controlled synchronization and phase-locking are measured with classical polar order metrics. After establishing the mass conservation property of the mean-field model and bounds on its nonlocal term, a system of first-order necessary conditions for optimality is recovered using a Lagrangian method. The optimality system, comprising a nonlocal PDE for the state dynamics equation, the respective nonlocal adjoint dynamics, and the Euler equation, is solved iteratively following a standard Optimize-then-Discretize approach and an efficient numerical solver based on spectral methods. We demonstrate our approach for each of the two control inputs in simulation.
△ Less
Submitted 15 September, 2021;
originally announced September 2021.
-
Swimmer dynamics in externally-driven fluid flows: The role of noise
Authors:
Simon A. Berman,
Kevin A. Mitchell
Abstract:
We theoretically investigate the effect of random fluctuations on the motion of elongated microswimmers near hydrodynamic transport barriers in externally-driven fluid flows. Focusing on the two-dimensional hyperbolic flow, we consider the effects of translational and rotational diffusion as well as tumbling, i.e. sudden jumps in the swimmer orientation. Regardless of whether diffusion or tumbling…
▽ More
We theoretically investigate the effect of random fluctuations on the motion of elongated microswimmers near hydrodynamic transport barriers in externally-driven fluid flows. Focusing on the two-dimensional hyperbolic flow, we consider the effects of translational and rotational diffusion as well as tumbling, i.e. sudden jumps in the swimmer orientation. Regardless of whether diffusion or tumbling are the primary source of fluctuations, we find that noise significantly increases the probability that a swimmer crosses one-way barriers in the flow, which block the swimmer from returning to its initial position. We employ an asymptotic method for calculating the probability density of noisy swimmer trajectories in a given fluid flow, which produces solutions to the time-dependent Fokker-Planck equation in the weak-noise limit. This procedure mirrors the semiclassical approximation in quantum mechanics and similarly involves calculating the least-action paths of a Hamiltonian system derived from the swimmer's Fokker-Planck equation. Using the semiclassical technique, we compute (i) the steady-state orientation distribution of swimmers with rotational diffusion and tumbling and (ii) the probability that a diffusive swimmer crosses a one-way barrier. The semiclassical results compare favorably with Monte Carlo calculations.
△ Less
Submitted 22 November, 2021; v1 submitted 23 August, 2021;
originally announced August 2021.
-
Programming-By-Example by Programming-By-Example: Synthesis of Looping Programs
Authors:
Shmuel Berman,
Mark Santolucito
Abstract:
Program synthesis has seen many new applications in recent years, in large part thanks to the introduction of SyGuS. However, no existing SyGuS solvers have support for synthesizing recursive functions. We introduce an multi-phase algorithm for the synthesis of recursive ``looplike'' programs in SyGuS for programming-by-example. We solve constraints individually and treat them as ``unrolled`` exam…
▽ More
Program synthesis has seen many new applications in recent years, in large part thanks to the introduction of SyGuS. However, no existing SyGuS solvers have support for synthesizing recursive functions. We introduce an multi-phase algorithm for the synthesis of recursive ``looplike'' programs in SyGuS for programming-by-example. We solve constraints individually and treat them as ``unrolled`` examples of how a recursive program would behave, and solve for the generalized recursive solution. Our approach is modular and supports any SyGuS Solver.
△ Less
Submitted 19 August, 2021;
originally announced August 2021.
-
The single copy of the gravitational holonomy
Authors:
Rashid Alawadhi,
David S. Berman,
Chris D. White,
Sam Wikeley
Abstract:
The double copy is a well-established relationship between gravity and gauge theories. It relates perturbative scattering amplitudes as well as classical solutions, and recently there has been mounting evidence that it also applies to non-perturbative information. In this paper, we consider the holonomy properties of manifolds in gravity and prescribe a single copy of gravitational holonomy that d…
▽ More
The double copy is a well-established relationship between gravity and gauge theories. It relates perturbative scattering amplitudes as well as classical solutions, and recently there has been mounting evidence that it also applies to non-perturbative information. In this paper, we consider the holonomy properties of manifolds in gravity and prescribe a single copy of gravitational holonomy that differs from the holonomy in gauge theory. We discuss specific cases and give examples where the single copy holonomy group is reduced. Our results may prove useful in extending the classical double copy. We also clarify previous misconceptions in the literature regarding gravitational Wilson lines and holonomy.
△ Less
Submitted 2 July, 2021;
originally announced July 2021.
-
Towards Decentralized Human-Swarm Interaction by Means of Sequential Hand Gesture Recognition
Authors:
Zahi Kakish,
Sritanay Vedartham,
Spring Berman
Abstract:
In this work, we present preliminary work on a novel method for Human-Swarm Interaction (HSI) that can be used to change the macroscopic behavior of a swarm of robots with decentralized sensing and control. By integrating a small yet capable hand gesture recognition convolutional neural network (CNN) with the next-generation Robot Operating System \emph{ros2}, which enables decentralized implement…
▽ More
In this work, we present preliminary work on a novel method for Human-Swarm Interaction (HSI) that can be used to change the macroscopic behavior of a swarm of robots with decentralized sensing and control. By integrating a small yet capable hand gesture recognition convolutional neural network (CNN) with the next-generation Robot Operating System \emph{ros2}, which enables decentralized implementation of robot software for multi-robot applications, we demonstrate the feasibility of programming a swarm of robots to recognize and respond to a sequence of hand gestures that capable of correspond to different types of swarm behaviors. We test our approach using a sequence of gestures that modifies the target inter-robot distance in a group of three Turtlebot3 Burger robots in order to prevent robot collisions with obstacles. The approach is validated in three different Gazebo simulation environments and in a physical testbed that reproduces one of the simulated environments.
△ Less
Submitted 4 February, 2021;
originally announced February 2021.
-
Double Field Theory and Geometric Quantisation
Authors:
Luigi Alfonsi,
David S. Berman
Abstract:
We examine various properties of double field theory and the doubled string sigma model in the context of geometric quantisation. In particular we look at T-duality as the symplectic transformation related to an alternative choice of polarisation in the construction of the quantum bundle for the string. Following this perspective we adopt a variety of techniques from geometric quantisation to stud…
▽ More
We examine various properties of double field theory and the doubled string sigma model in the context of geometric quantisation. In particular we look at T-duality as the symplectic transformation related to an alternative choice of polarisation in the construction of the quantum bundle for the string. Following this perspective we adopt a variety of techniques from geometric quantisation to study the doubled space. One application is the construction of the double coherent state that provides the shortest distance in any duality frame and a stringy deformed Fourier transform.
△ Less
Submitted 2 June, 2021; v1 submitted 28 January, 2021;
originally announced January 2021.
-
Decentralized Multi-target Tracking with Multiple Quadrotors using a PHD Filter
Authors:
Aniket Shirsat,
Spring Berman
Abstract:
We consider a scenario in which a group of quadrotors is tasked at tracking multiple stationary targets in an unknown, bounded environment. The quadrotors search for targets along a spatial grid overlaid on the environment while performing a random walk on this grid modeled by a discrete-time discrete-state (DTDS) Markov chain. The quadrotors can transmit their estimates of the target locations to…
▽ More
We consider a scenario in which a group of quadrotors is tasked at tracking multiple stationary targets in an unknown, bounded environment. The quadrotors search for targets along a spatial grid overlaid on the environment while performing a random walk on this grid modeled by a discrete-time discrete-state (DTDS) Markov chain. The quadrotors can transmit their estimates of the target locations to other quadrotors that occupy their current location on the grid; thus, their communication network is time-varying and not necessarily connected. We model the search procedure as a renewal-reward process on the underlying DTDS Markov chain. To accommodate changes in the set of targets observed by each quadrotor as it explores the environment, along with uncertainties in the quadrotors' measurements of the targets, we formulate the tracking problem in terms of Random Finite Sets (RFS). The quadrotors use RFS-based Probability Hypothesis Density (PHD) filters to estimate the number of targets and their locations. We present a theoretical estimation framework, based on the Gaussian Mixture formulation of the PHD filter, and preliminary simulation results toward extending existing approaches for RFS-based multi-target tracking to a decentralized multi-robot strategy for multi-target tracking. We validate this approach with simulations of multi-target tracking scenarios with different densities of robots and targets, and we evaluate the average time required for the robots in each scenario to reach agreement on a common set of targets.
△ Less
Submitted 3 December, 2020;
originally announced December 2020.
-
The Classical Double Copy for M-theory from a Kerr-Schild Ansatz for Exceptional Field Theory
Authors:
David S. Berman,
Kwangeon Kim,
Kanghoon Lee
Abstract:
We construct the classical double copy formalism for M-theory. This extends the current state of the art by including the three form potential of eleven dimensional supergravity along with the metric. The key for this extension is to construct a Kerr-Schild type Ansatz for exceptional field theory. This Kerr-Schild Ansatz then allows us to find the solutions of charged objects such as the membrane…
▽ More
We construct the classical double copy formalism for M-theory. This extends the current state of the art by including the three form potential of eleven dimensional supergravity along with the metric. The key for this extension is to construct a Kerr-Schild type Ansatz for exceptional field theory. This Kerr-Schild Ansatz then allows us to find the solutions of charged objects such as the membrane from a set of single copy fields. The exceptional field theory formalism then automatically produces the IIB Kerr-Schild ansatz allowing the construction of the single copy for the fields of IIB supergravity (with manifest $SL(2)$ symmetry).
△ Less
Submitted 16 September, 2021; v1 submitted 16 October, 2020;
originally announced October 2020.
-
Multi-Robot Target Search using Probabilistic Consensus on Discrete Markov Chains
Authors:
Aniket Shirsat,
Karthik Elamvazhuthi,
Spring Berman
Abstract:
In this paper, we propose a probabilistic consensus-based multi-robot search strategy that is robust to communication link failures, and thus is suitable for disaster affected areas. The robots, capable of only local communication, explore a bounded environment according to a random walk modeled by a discrete-time discrete-state (DTDS) Markov chain and exchange information with neighboring robots,…
▽ More
In this paper, we propose a probabilistic consensus-based multi-robot search strategy that is robust to communication link failures, and thus is suitable for disaster affected areas. The robots, capable of only local communication, explore a bounded environment according to a random walk modeled by a discrete-time discrete-state (DTDS) Markov chain and exchange information with neighboring robots, resulting in a time-varying communication network topology. The proposed strategy is proved to achieve consensus, here defined as agreement on the presence of a static target, with no assumptions on the connectivity of the communication network. Using numerical simulations, we investigate the effect of the robot population size, domain size, and information uncertainty on the consensus time statistics under this scheme. We also validate our theoretical results with 3D physics-based simulations in Gazebo. The simulations demonstrate that all robots achieve consensus in finite time with the proposed search strategy over a range of robot densities in the environment.
△ Less
Submitted 20 September, 2020;
originally announced September 2020.
-
MutaGAN: A Seq2seq GAN Framework to Predict Mutations of Evolving Protein Populations
Authors:
Daniel S. Berman,
Craig Howser,
Thomas Mehoke,
Jared D. Evans
Abstract:
The ability to predict the evolution of a pathogen would significantly improve the ability to control, prevent, and treat disease. Despite significant progress in other problem spaces, deep learning has yet to contribute to the issue of predicting mutations of evolving populations. To address this gap, we developed a novel machine learning framework using generative adversarial networks (GANs) wit…
▽ More
The ability to predict the evolution of a pathogen would significantly improve the ability to control, prevent, and treat disease. Despite significant progress in other problem spaces, deep learning has yet to contribute to the issue of predicting mutations of evolving populations. To address this gap, we developed a novel machine learning framework using generative adversarial networks (GANs) with recurrent neural networks (RNNs) to accurately predict genetic mutations and evolution of future biological populations. Using a generalized time-reversible phylogenetic model of protein evolution with bootstrapped maximum likelihood tree estimation, we trained a sequence-to-sequence generator within an adversarial framework, named MutaGAN, to generate complete protein sequences augmented with possible mutations of future virus populations. Influenza virus sequences were identified as an ideal test case for this deep learning framework because it is a significant human pathogen with new strains emerging annually and global surveillance efforts have generated a large amount of publicly available data from the National Center for Biotechnology Information's (NCBI) Influenza Virus Resource (IVR). MutaGAN generated "child" sequences from a given "parent" protein sequence with a median Levenshtein distance of 2.00 amino acids. Additionally, the generator was able to augment the majority of parent proteins with at least one mutation identified within the global influenza virus population. These results demonstrate the power of the MutaGAN framework to aid in pathogen forecasting with implications for broad utility in evolutionary prediction for any protein population.
△ Less
Submitted 26 August, 2020;
originally announced August 2020.
-
Transport barriers to self-propelled particles in fluid flows
Authors:
Simon A. Berman,
John Buggeln,
David A. Brantley,
Kevin Mitchell,
Thomas H. Solomon
Abstract:
We present theory and experiments demonstrating the existence of invariant manifolds that impede the motion of microswimmers in two-dimensional fluid flows. One-way barriers are apparent in a hyperbolic fluid flow that block the swimming of both smooth-swimming and run-and-tumble \emph{Bacillus subtilis} bacteria. We identify key phase-space structures, called swimming invariant manifolds (SwIMs),…
▽ More
We present theory and experiments demonstrating the existence of invariant manifolds that impede the motion of microswimmers in two-dimensional fluid flows. One-way barriers are apparent in a hyperbolic fluid flow that block the swimming of both smooth-swimming and run-and-tumble \emph{Bacillus subtilis} bacteria. We identify key phase-space structures, called swimming invariant manifolds (SwIMs), that serve as separatrices between different regions of long-time swimmer behavior. When projected into $xy$-space, the edges of the SwIMs act as one-way barriers, consistent with the experiments.
△ Less
Submitted 27 October, 2020; v1 submitted 7 August, 2020;
originally announced August 2020.
-
Weyl doubling
Authors:
Rashid Alawadhi,
David S. Berman,
Bill Spence
Abstract:
We study a host of spacetimes where the Weyl curvature may be expressed algebraically in terms of an Abelian field strength. These include Type D spacetimes in four and higher dimensions which obey a simple quadratic relation between the field strength and the Weyl tensor, following the Weyl spinor double copy relation. However, we diverge from the usual double copy paradigm by taking the gauge fi…
▽ More
We study a host of spacetimes where the Weyl curvature may be expressed algebraically in terms of an Abelian field strength. These include Type D spacetimes in four and higher dimensions which obey a simple quadratic relation between the field strength and the Weyl tensor, following the Weyl spinor double copy relation. However, we diverge from the usual double copy paradigm by taking the gauge fields to be in the curved spacetime as opposed to an auxiliary flat space.
We show how for Gibbons-Hawking spacetimes with more than two centres a generalisation of the Weyl doubling formula is needed by including a derivative-dependent expression which is linear in the Abelian field strength. We also find a type of twisted doubling formula in a case of a manifold with Spin(7) holonomy in eight dimensions.
For Einstein Maxwell theories where there is an independent gauge field defined on spacetime, we investigate how the gauge fields determine the Weyl spacetime curvature via a doubling formula. We first show that this occurs for the Reissner-Nordstrom metric in any dimension, and that this generalises to the electrically-charged Born-Infeld solutions. Finally, we consider brane systems in supergravity, showing that a similar doubling formula applies. This Weyl formula is based on the field strength of the p-form potential that minimally couples to the brane and the brane world volume Killing vectors.
△ Less
Submitted 18 August, 2020; v1 submitted 7 July, 2020;
originally announced July 2020.
-
Using Reinforcement Learning to Herd a Robotic Swarm to a Target Distribution
Authors:
Zahi M. Kakish,
Karthik Elamvazhuthi,
Spring Berman
Abstract:
In this paper, we present a reinforcement learning approach to designing a control policy for a "leader" agent that herds a swarm of "follower" agents, via repulsive interactions, as quickly as possible to a target probability distribution over a strongly connected graph. The leader control policy is a function of the swarm distribution, which evolves over time according to a mean-field model in t…
▽ More
In this paper, we present a reinforcement learning approach to designing a control policy for a "leader" agent that herds a swarm of "follower" agents, via repulsive interactions, as quickly as possible to a target probability distribution over a strongly connected graph. The leader control policy is a function of the swarm distribution, which evolves over time according to a mean-field model in the form of an ordinary difference equation. The dependence of the policy on agent populations at each graph vertex, rather than on individual agent activity, simplifies the observations required by the leader and enables the control strategy to scale with the number of agents. Two Temporal-Difference learning algorithms, SARSA and Q-Learning, are used to generate the leader control policy based on the follower agent distribution and the leader's location on the graph. A simulation environment corresponding to a grid graph with 4 vertices was used to train and validate the control policies for follower agent populations ranging from 10 to 100. Finally, the control policies trained on 100 simulated agents were used to successfully redistribute a physical swarm of 10 small robots to a target distribution among 4 spatial regions.
△ Less
Submitted 12 December, 2020; v1 submitted 29 June, 2020;
originally announced June 2020.
-
The Geometry, Branes and Applications of Exceptional Field Theory
Authors:
David S. Berman,
Chris D. A. Blair
Abstract:
This is a review of exceptional field theory: a generalisation of Kaluza-Klein theory that unifies the metric and $p$-form gauge field degrees of freedom of supergravity into a generalised or extended geometry, whose additional coordinates may be viewed as conjugate to brane winding modes. This unifies the maximal supergravities, treating their previously-hidden exceptional Lie symmetries as a fun…
▽ More
This is a review of exceptional field theory: a generalisation of Kaluza-Klein theory that unifies the metric and $p$-form gauge field degrees of freedom of supergravity into a generalised or extended geometry, whose additional coordinates may be viewed as conjugate to brane winding modes. This unifies the maximal supergravities, treating their previously-hidden exceptional Lie symmetries as a fundamental geometric symmetry. Duality orbits of solutions simplify into single objects, that in many cases have simple geometric interpretations, for instance as wave or monopole-type solutions. It also provides a route to explore exotic or non-geometric aspects of M-theory, such as exotic branes, U-folds, and more novel sorts of non-Riemannian spaces.
△ Less
Submitted 21 October, 2024; v1 submitted 17 June, 2020;
originally announced June 2020.
-
Trapping of swimmers in a vortex lattice
Authors:
S. A. Berman,
K. A. Mitchell
Abstract:
We examine the motion of rigid, ellipsoidal swimmers subjected to a steady vortex flow in two dimensions. Numerical simulations of swimmers in a spatially periodic array of vortices reveal a range of possible behaviors, including trapping inside a single vortex and motility-induced diffusion across many vortices. While the trapping probability vanishes at a sufficiently high swimming speed, we fin…
▽ More
We examine the motion of rigid, ellipsoidal swimmers subjected to a steady vortex flow in two dimensions. Numerical simulations of swimmers in a spatially periodic array of vortices reveal a range of possible behaviors, including trapping inside a single vortex and motility-induced diffusion across many vortices. While the trapping probability vanishes at a sufficiently high swimming speed, we find that it exhibits surprisingly large oscillations as this critical swimming speed is approached. Strikingly, at even higher swimming speeds, we find swimmers that swim perpendicular to their elongation direction can again become trapped. To explain this complex behavior, we investigate the underlying swimmer phase-space geometry. We identify the fixed points and periodic orbits of the swimmer equations of motion that regulate swimmer trapping inside a single vortex cell. For low to intermediate swimming speeds, we find that a stable periodic orbit surrounded by invariant tori forms a transport barrier to swimmers and can trap them inside individual vortices. For swimming speeds approaching the maximum fluid speed, we find instead that perpendicular swimmers can be trapped by asymptotically stable fixed points. A bifurcation analysis of the stable periodic orbit and the fixed points explains the complex and non-monotonic breakdown and reemergence of swimmer trapping as the swimmer speed and shape are varied.
△ Less
Submitted 21 February, 2020;
originally announced February 2020.
-
S-duality and the Double Copy
Authors:
Rashid Alawadhi,
David S. Berman,
Bill Spence,
David Peinador Veiga
Abstract:
The double copy formalism provides an intriguing connection between gauge theories and gravity. It was first demonstrated in the perturbative context of scattering amplitudes but recently the formalism has been applied to exact classical solutions in gauge theories such as the monopole and instanton.
In this paper we will investigate how duality symmetries in the gauge theory double copy to grav…
▽ More
The double copy formalism provides an intriguing connection between gauge theories and gravity. It was first demonstrated in the perturbative context of scattering amplitudes but recently the formalism has been applied to exact classical solutions in gauge theories such as the monopole and instanton.
In this paper we will investigate how duality symmetries in the gauge theory double copy to gravity and relate these to solution generating transformations and the action of $Sl(2,R)$ in general relativity.
△ Less
Submitted 26 February, 2020; v1 submitted 15 November, 2019;
originally announced November 2019.
-
Reductions of Exceptional Field Theories
Authors:
David S. Berman,
Ray Otsuki
Abstract:
Double Field Theory (DFT) and Exceptional Field Theory (EFT), collectively called ExFTs, have proven to be a remarkably powerful new framework for string and M-theory. Exceptional field theories were constructed on a case by case basis as often each EFT has its own idiosyncrasies. Intuitively though, an $E_{n-1(n-1)}$ EFT must be contained in an $E_{n(n)}$ ExFT. In this paper we propose a generali…
▽ More
Double Field Theory (DFT) and Exceptional Field Theory (EFT), collectively called ExFTs, have proven to be a remarkably powerful new framework for string and M-theory. Exceptional field theories were constructed on a case by case basis as often each EFT has its own idiosyncrasies. Intuitively though, an $E_{n-1(n-1)}$ EFT must be contained in an $E_{n(n)}$ ExFT. In this paper we propose a generalised Kaluza-Klein ansatz to relate different ExFTs. We then discuss in more detail the different aspects of the relationship between various ExFTs including the coordinates, section condition and (pseudo)-Lagrangian densities. For the $E_{8(8)}$ EFT we describe a generalisation of the Mukhi-Papageorgakis mechanism to relate the d = 3 topological term in the $E_{8(8)}$ EFT to a Yang-Mills action in the $E_{7(7)}$ EFT.
△ Less
Submitted 26 February, 2020; v1 submitted 14 November, 2019;
originally announced November 2019.
-
Exotic Branes in M-Theory
Authors:
Ray Otsuki,
David S. Berman,
Edvard T. Musaev
Abstract:
We revisit curious objects in string and M-theory called exotic brane---objects that are highly non-perturbative, possessing a tension that scales less than $g_s^{-2}$ and are generically of low-codimension. They are non-geometric in the sense that they are only well-defined locally as supergravity solutions and require duality transformations to patch correctly, in addition to the usual diffeomor…
▽ More
We revisit curious objects in string and M-theory called exotic brane---objects that are highly non-perturbative, possessing a tension that scales less than $g_s^{-2}$ and are generically of low-codimension. They are non-geometric in the sense that they are only well-defined locally as supergravity solutions and require duality transformations to patch correctly, in addition to the usual diffeomorphisms and gauge transformations.
We argue that Double Field Theory (DFT) and Exceptional Field Theory (EFT) are the prime setting in which to examine such objects. To emphasise this, we construct an explicit solution in $E_{7(7)} \times \mathbb{R}^+$ EFT that unifies many of the codimension-2 exotic branes into a single well-behaved solution on an extended spacetime. We further argue that there are in fact an infinite number of exotic branes in string- and M-theory, many of which fall into a more general class of exotic branes that do not afford even a local description in terms of conventional supergravity.
△ Less
Submitted 25 March, 2019;
originally announced March 2019.
-
Automated Construction of Metric Maps using a Stochastic Robotic Swarm Leveraging Received Signal Strength
Authors:
Ragesh K. Ramachandran,
Spring Berman
Abstract:
In this work, we present a novel automated procedure for constructing a metric map of an unknown domain with obstacles using uncertain position data collected by a swarm of resource-constrained robots. The robots obtain this data during random exploration of the domain by combining onboard odometry information with noisy measurements of signals received from transmitters located outside the domain…
▽ More
In this work, we present a novel automated procedure for constructing a metric map of an unknown domain with obstacles using uncertain position data collected by a swarm of resource-constrained robots. The robots obtain this data during random exploration of the domain by combining onboard odometry information with noisy measurements of signals received from transmitters located outside the domain. This data is processed offline to compute a density function of the free space over a discretization of the domain. We use persistent homology techniques from topological data analysis to estimate a value for thresholding the density function, thereby segmenting the obstacle-occupied region in the unknown domain. Our approach is substantiated with theoretical results to prove its completeness and to analyze its time complexity. The effectiveness of the procedure is illustrated with numerical simulations conducted on six different domains, each with two signal transmitters.
△ Less
Submitted 13 March, 2019;
originally announced March 2019.