Skip to main content

Showing 1–50 of 86 results for author: Leonard, N

.
  1. arXiv:2504.11717  [pdf, other

    cs.RO eess.SY

    Safety with Agency: Human-Centered Safety Filter with Application to AI-Assisted Motorsports

    Authors: Donggeon David Oh, Justin Lidard, Haimin Hu, Himani Sinhmar, Elle Lazarski, Deepak Gopinath, Emily S. Sumner, Jonathan A. DeCastro, Guy Rosman, Naomi Ehrich Leonard, Jaime Fernández Fisac

    Abstract: We propose a human-centered safety filter (HCSF) for shared autonomy that significantly enhances system safety without compromising human agency. Our HCSF is built on a neural safety value function, which we first learn scalably through black-box interactions and then use at deployment to enforce a novel state-action control barrier function (Q-CBF) safety constraint. Since this Q-CBF safety filte… ▽ More

    Submitted 29 April, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

    Comments: Accepted to Robotics: Science and Systems (R:SS) 2025, 22 pages, 16 figures, 7 tables

  2. Leveraging deep learning for plant disease identification: a bibliometric analysis in SCOPUS from 2018 to 2024

    Authors: Enow Takang Achuo Albert, Ngalle Hermine Bille, Ngonkeu Mangaptche Eddy Leonard

    Abstract: This work aimed to present a bibliometric analysis of deep learning research for plant disease identification, with a special focus on generative modeling. A thorough analysis of SCOPUS-sourced bibliometric data from 253 documents was performed. Key performance metrics such as accuracy, precision, recall, and F1-score were analyzed for generative modeling. The findings highlighted significant cont… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

    Report number: Volume 9, 2025, 16--39

    Journal ref: Journal of Scientific Agriculture, 2025

  3. arXiv:2504.01878  [pdf, other

    eess.SY

    Tunable Thresholds and Frequency Encoding in a Spiking NOD Controller

    Authors: Ian Xul Belaustegui, Alessio Franci, Naomi Ehrich Leonard

    Abstract: Spiking Nonlinear Opinion Dynamics (S-NOD) is an excitable decision-making model inspired by the spiking dynamics of neurons. S-NOD enables the design of agile decision-making that can rapidly switch between decision options in response to a changing environment. In S-NOD, decisions are represented by continuous time, yet discrete, opinion spikes. Here, we extend previous analysis of S-NOD and exp… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

  4. arXiv:2502.00966  [pdf, other

    cs.RO cs.HC

    The Beatbots: A Musician-Informed Multi-Robot Percussion Quartet

    Authors: Isabella Pu, Jeff Snyder, Naomi Ehrich Leonard

    Abstract: Artistic creation is often seen as a uniquely human endeavor, yet robots bring distinct advantages to music-making, such as precise tempo control, unpredictable rhythmic complexities, and the ability to coordinate intricate human and robot performances. While many robotic music systems aim to mimic human musicianship, our work emphasizes the unique strengths of robots, resulting in a novel multi-r… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

    Comments: Copyright protected by IEEE/ACM, 10 pages, 4 figures, 1 table, in proceedings of 20th IEEE/ACM International Conference on Human-Robot Interaction (HRI 2025)

  5. arXiv:2410.17058  [pdf, other

    eess.SY

    Optimal gait design for nonlinear soft robotic crawlers

    Authors: Yenan Shen, Naomi Ehrich Leonard, Bassam Bamieh, Juncal Arbelaiz

    Abstract: Soft robots offer a frontier in robotics with enormous potential for safe human-robot interaction and agility in uncertain environments. A stepping stone towards unlocking their potential is a control theory tailored to soft robotics, including a principled framework for gait design. We analyze the problem of optimal gait design for a soft crawling body - the crawler. The crawler is an elastic bod… ▽ More

    Submitted 14 March, 2025; v1 submitted 22 October, 2024; originally announced October 2024.

  6. arXiv:2410.12993  [pdf, other

    math.DS physics.soc-ph q-bio.PE

    Opinion-driven risk perception and reaction in SIS epidemics

    Authors: Marcela Ordorica Arango, Anastasia Bizyaeva, Simon A. Levin, Naomi Ehrich Leonard

    Abstract: We present and analyze a mathematical model to study the feedback between behavior and epidemic spread in a population that is actively assessing and reacting to risk of infection. In our model, a population dynamically forms an opinion that reflects its willingness to engage in risky behavior (e.g., not wearing a mask in a crowded area) or reduce it (e.g., social distancing). We consider SIS epid… ▽ More

    Submitted 18 March, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

  7. arXiv:2410.00798  [pdf, other

    math.DS

    Fast-and-flexible decision-making with modulatory interactions

    Authors: Rodrigo Moreno-Morton, Anastasia Bizyaeva, Naomi Ehrich Leonard, Alessio Franci

    Abstract: Multi-agent systems in biology, society, and engineering are capable of making decisions through the dynamic interaction of their elements. Nonlinearity of the interactions is key for the speed, robustness, and flexibility of multi-agent decision-making. In this work we introduce modulatory, that is, multiplicative, in contrast to additive, interactions in a nonlinear opinion dynamics model of fas… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: 6 pages, 4 figures, submitted to American Control Conference 2025

    MSC Class: 37G99

  8. arXiv:2409.12420  [pdf, other

    math.AP cs.SI

    Spatially-invariant opinion dynamics on the circle

    Authors: Giovanna Amorim, Anastasia Bizyaeva, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose and analyze a nonlinear opinion dynamics model for an agent making decisions about a continuous distribution of options in the presence of input. Inspired by perceptual decision-making, we develop new theory for opinion formation in response to inputs about options distributed on the circle. Options on the circle can represent, e.g., the possible directions of perceived objects and resu… ▽ More

    Submitted 27 November, 2024; v1 submitted 18 September, 2024; originally announced September 2024.

  9. arXiv:2409.12317  [pdf, other

    eess.SY

    Spiking Nonlinear Opinion Dynamics (S-NOD) for Agile Decision-Making

    Authors: Charlotte Cathcart, Ian Xul Belaustegui, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We present, analyze, and illustrate a first-of-its-kind model of two-dimensional excitable (spiking) dynamics for decision-making over two options. The model, Spiking Nonlinear Opinion Dynamics (S-NOD), provides superior agility, characterized by fast, flexible, and adaptive response to rapid and unpredictable changes in context, environment, or information received about available options. S-NOD… ▽ More

    Submitted 27 November, 2024; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: 6 pages, 6 figures

  10. arXiv:2406.14746  [pdf, other

    cs.LG cs.RO

    Behavior-Inspired Neural Networks for Relational Inference

    Authors: Yulong Yang, Bowen Feng, Keqin Wang, Naomi Ehrich Leonard, Adji Bousso Dieng, Christine Allen-Blanchette

    Abstract: From pedestrians to Kuramoto oscillators, interactions between agents govern how dynamical systems evolve in space and time. Discovering how these agents relate to each other has the potential to improve our understanding of the often complex dynamics that underlie these systems. Recent works learn to categorize relationships between agents based on observations of their physical behavior. These a… ▽ More

    Submitted 11 March, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accept to The 28th International Conference on Artificial Intelligence and Statistics (AISTATS 2025)

  11. arXiv:2406.09810  [pdf, other

    cs.RO eess.SY

    Think Deep and Fast: Learning Neural Nonlinear Opinion Dynamics from Inverse Dynamic Games for Split-Second Interactions

    Authors: Haimin Hu, Jaime Fernández Fisac, Naomi Ehrich Leonard, Deepak Gopinath, Jonathan DeCastro, Guy Rosman

    Abstract: Non-cooperative interactions commonly occur in multi-agent scenarios such as car racing, where an ego vehicle can choose to overtake the rival, or stay behind it until a safe overtaking "corridor" opens. While an expert human can do well at making such time-sensitive decisions, autonomous agents are incapable of rapidly reasoning about complex, potentially conflicting options, leading to suboptima… ▽ More

    Submitted 26 April, 2025; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: IEEE International Conference on Robotics and Automation (ICRA) 2025

  12. arXiv:2405.20593  [pdf, other

    eess.SY cs.RO

    Excitable crawling

    Authors: Juncal Arbelaiz, Alessio Franci, Naomi Ehrich Leonard, Rodolphe Sepulchre, Bassam Bamieh

    Abstract: We propose and analyze the suitability of a spiking controller to engineer the locomotion of a soft robotic crawler. Inspired by the FitzHugh-Nagumo model of neural excitability, we design a bistable controller with an electrical flipflop circuit representation capable of generating spikes on-demand when coupled to the passive crawler mechanics. A proprioceptive sensory signal from the crawler mec… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 5 pages, MTNS 2024 extended abstract

  13. arXiv:2404.17756  [pdf, other

    cond-mat.soft cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.stat-mech physics.chem-ph

    Origins of suppressed self-diffusion of nanoscale constituents of a complex liquid

    Authors: Christian P. N. Tanner, Vivian R. K. Wall, Mumtaz Gababa, Joshua Portner, Ahhyun Jeong, Matthew J. Hurley, Nicholas Leonard, Jonathan G. Raybin, James K. Utterback, Ahyoung Kim, Andrei Fluerasu, Yanwen Sun, Johannes Moeller, Alexey Zozulya, Wonhyuk Jo, Felix Brausse, James Wrigley, Ulrike Boesenberg, Wei Lu, Roman Shayduk, Mohamed Youssef, Anders Madsen, David T. Limmer, Dmitri V. Talapin, Samuel W. Teitelbaum , et al. (1 additional authors not shown)

    Abstract: The ability to understand and ultimately control the transformations and properties of various nanoscale systems, from proteins to synthetic nanomaterial assemblies, hinges on the ability to uncover their dynamics on their characteristic length and time scales. Here, we use MHz X-ray photon correlation spectroscopy (XPCS) to directly elucidate the characteristic microsecond-dynamics of density flu… ▽ More

    Submitted 7 January, 2025; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: 7 pages, 4 figures

  14. arXiv:2404.16808  [pdf, other

    cond-mat.soft cond-mat.mes-hall cond-mat.mtrl-sci

    Enhancing nanoscale charged colloid crystallization near a metastable liquid binodal

    Authors: Christian P. N. Tanner, Vivian R. K. Wall, Joshua Portner, Ahhyun Jeong, Avishek Das, James K. Utterback, Leo M. Hamerlynck, Jonathan G. Raybin, Matthew J. Hurley, Nicholas Leonard, Rebecca B. Wai, Jenna A. Tan, Mumtaz Gababa, Chenhui Zhu, Eric Schaible, Christopher J. Tassone, David T. Limmer, Samuel W. Teitelbaum, Dmitri V. Talapin, Naomi S. Ginsberg

    Abstract: Bottom-up assembly of nanocrystals (NCs) into ordered arrays, or superlattices (SLs), is a promising route to design materials with new functionalities, but the degree of control over assembly into functional structures remains challenging. Using electrostatics, rather than density, to tune the interactions between semiconductor NCs, we watch self-assembly proceeding through a metastable liquid ph… ▽ More

    Submitted 28 April, 2025; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 13 pages, 4 figures

  15. arXiv:2403.05457  [pdf, other

    eess.SY

    Sparse dynamic network reconstruction through L1-regularization of a Lyapunov equation

    Authors: Ian Xul Belaustegui, Marcela Ordorica Arango, Román Rossi-Pool, Naomi Ehrich Leonard, Alessio Franci

    Abstract: An important problem in many areas of science is that of recovering interaction networks from simultaneous time-series of many interacting dynamical processes. A common approach is to use the elements of the correlation matrix or its inverse as proxies of the interaction strengths, but the reconstructed networks are necessarily undirected. Transfer entropy methods have been proposed to reconstruct… ▽ More

    Submitted 12 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  16. arXiv:2402.14174  [pdf, other

    cs.RO cs.AI eess.SY math.OC

    Blending Data-Driven Priors in Dynamic Games

    Authors: Justin Lidard, Haimin Hu, Asher Hancock, Zixu Zhang, Albert Gimó Contreras, Vikash Modi, Jonathan DeCastro, Deepak Gopinath, Guy Rosman, Naomi Ehrich Leonard, María Santos, Jaime Fernández Fisac

    Abstract: As intelligent robots like autonomous vehicles become increasingly deployed in the presence of people, the extent to which these systems should leverage model-based game-theoretic planners versus data-driven policies for safe, interaction-aware motion planning remains an open question. Existing dynamic game formulations assume all agents are task-driven and behave optimally. However, in reality, h… ▽ More

    Submitted 6 July, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 20 pages, 12 figures

  17. arXiv:2312.06395  [pdf, other

    cs.RO math.DS math.OC

    Threshold Decision-Making Dynamics Adaptive to Physical Constraints and Changing Environment

    Authors: Giovanna Amorim, María Santos, Shinkyu Park, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose a threshold decision-making framework for controlling the physical dynamics of an agent switching between two spatial tasks. Our framework couples a nonlinear opinion dynamics model that represents the evolution of an agent's preference for a particular task with the physical dynamics of the agent. We prove the bifurcation that governs the behavior of the coupled dynamics. We show by me… ▽ More

    Submitted 7 June, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  18. arXiv:2311.02204  [pdf, other

    q-bio.PE eess.SY math.DS

    Active risk aversion in SIS epidemics on networks

    Authors: Anastasia Bizyaeva, Marcela Ordorica Arango, Yunxiu Zhou, Simon Levin, Naomi Ehrich Leonard

    Abstract: We present and analyze an actively controlled Susceptible-Infected-Susceptible (actSIS) model of interconnected populations to study how risk aversion strategies, such as social distancing, affect network epidemics. A population using a risk aversion strategy reduces its contact rate with other populations when it perceives an increase in infection risk. The network actSIS model relies on two dist… ▽ More

    Submitted 16 October, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  19. arXiv:2308.14666  [pdf, other

    cs.CV cs.CE cs.LG

    Learning to Predict 3D Rotational Dynamics from Images of a Rigid Body with Unknown Mass Distribution

    Authors: Justice Mason, Christine Allen-Blanchette, Nicholas Zolman, Elizabeth Davison, Naomi Ehrich Leonard

    Abstract: In many real-world settings, image observations of freely rotating 3D rigid bodies may be available when low-dimensional measurements are not. However, the high-dimensionality of image data precludes the use of classical estimation techniques to learn the dynamics. The usefulness of standard deep learning methods is also limited, because an image of a rigid body reveals nothing about the distribut… ▽ More

    Submitted 10 April, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Previously appeared as arXiv:2209.11355v2, which was submitted as a replacement by accident. arXiv admin note: text overlap with arXiv:2209.11355

  20. arXiv:2308.02755  [pdf, other

    physics.soc-ph cs.MA cs.SI math.DS math.OC

    Multi-topic belief formation through bifurcations over signed social networks

    Authors: Anastasia Bizyaeva, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose and analyze a nonlinear dynamic model of continuous-time multi-dimensional belief formation over signed social networks. Our model accounts for the effects of a structured belief system, self-appraisal, internal biases, and various sources of cognitive dissonance posited by recent theories in social psychology. We prove that agents become opinionated as a consequence of a bifurcation. W… ▽ More

    Submitted 2 July, 2024; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: 16 pages, 7 figures

  21. arXiv:2306.07535  [pdf, other

    eess.SY

    Learning with Delayed Payoffs in Population Games using Kullback-Leibler Divergence Regularization

    Authors: Shinkyu Park, Naomi Ehrich Leonard

    Abstract: We study a multi-agent decision problem in large population games. Agents from multiple populations select strategies for repeated interactions with one another. At each stage of these interactions, agents use their decision-making model to revise their strategy selections based on payoffs determined by an underlying game. Their goal is to learn the strategies that correspond to the Nash equilibri… ▽ More

    Submitted 30 December, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

  22. arXiv:2305.17600  [pdf, other

    cs.LG cs.CV cs.GT cs.RO math.OC

    NashFormer: Leveraging Local Nash Equilibria for Semantically Diverse Trajectory Prediction

    Authors: Justin Lidard, Oswin So, Yanxia Zhang, Jonathan DeCastro, Xiongyi Cui, Xin Huang, Yen-Ling Kuo, John Leonard, Avinash Balachandran, Naomi Leonard, Guy Rosman

    Abstract: Interactions between road agents present a significant challenge in trajectory prediction, especially in cases involving multiple agents. Because existing diversity-aware predictors do not account for the interactive nature of multi-agent predictions, they may miss these important interaction outcomes. In this paper, we propose NashFormer, a framework for trajectory prediction that leverages game-… ▽ More

    Submitted 11 November, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 8 pages, 6 figures

  23. arXiv:2304.02687  [pdf, other

    eess.SY cs.RO

    Emergent Coordination through Game-Induced Nonlinear Opinion Dynamics

    Authors: Haimin Hu, Kensuke Nakamura, Kai-Chieh Hsu, Naomi Ehrich Leonard, Jaime Fernández Fisac

    Abstract: We present a multi-agent decision-making framework for the emergent coordination of autonomous agents whose intents are initially undecided. Dynamic non-cooperative games have been used to encode multi-agent interaction, but ambiguity arising from factors such as goal preference or the presence of multiple equilibria may lead to coordination issues, ranging from the "freezing robot" problem to uns… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  24. arXiv:2210.01642  [pdf, other

    cs.RO

    Proactive Opinion-Driven Robot Navigation around Human Movers

    Authors: Charlotte Cathcart, María Santos, Shinkyu Park, Naomi Ehrich Leonard

    Abstract: We propose, analyze, and experimentally verify a new proactive approach for robot social navigation driven by the robot's "opinion" for which way and by how much to pass human movers crossing its path. The robot forms an opinion over time according to nonlinear dynamics that depend on the robot's observations of human movers and its level of attention to these social cues. For these dynamics, it i… ▽ More

    Submitted 11 September, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 8 pages, 7 figures

  25. arXiv:2210.00353  [pdf, other

    math.OC cs.MA cs.SI math.DS physics.soc-ph

    Sustained oscillations in multi-topic belief dynamics over signed networks

    Authors: Anastasia Bizyaeva, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We study the dynamics of belief formation on multiple interconnected topics in networks of agents with a shared belief system. We establish sufficient conditions and necessary conditions under which sustained oscillations of beliefs arise on the network in a Hopf bifurcation and characterize the role of the communication graph and the belief system graph in shaping the relative phase and amplitude… ▽ More

    Submitted 22 March, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: 6 pages, 6 figures, accepted for publication in the 2023 American Control Conference proceedings

  26. arXiv:2209.11355  [pdf, other

    cs.CV cs.AI cs.LG

    Learning Interpretable Dynamics from Images of a Freely Rotating 3D Rigid Body

    Authors: Justice Mason, Christine Allen-Blanchette, Nicholas Zolman, Elizabeth Davison, Naomi Leonard

    Abstract: In many real-world settings, image observations of freely rotating 3D rigid bodies, such as satellites, may be available when low-dimensional measurements are not. However, the high-dimensionality of image data precludes the use of classical estimation techniques to learn the dynamics and a lack of interpretability reduces the usefulness of standard deep learning methods. In this work, we present… ▽ More

    Submitted 23 August, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 13 pages, 7 figures

  27. arXiv:2208.01800  [pdf, other

    cs.RO cs.MA

    Decentralized Learning With Limited Communications for Multi-robot Coverage of Unknown Spatial Fields

    Authors: Kensuke Nakamura, María Santos, Naomi Ehrich Leonard

    Abstract: This paper presents an algorithm for a team of mobile robots to simultaneously learn a spatial field over a domain and spatially distribute themselves to optimally cover it. Drawing from previous approaches that estimate the spatial field through a centralized Gaussian process, this work leverages the spatial structure of the coverage problem and presents a decentralized strategy where samples are… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: Accepted IROS 2022

  28. arXiv:2206.14893  [pdf, other

    math.DS cs.MA cs.SI math.OC

    Breaking indecision in multi-agent, multi-option dynamics

    Authors: Alessio Franci, Martin Golubitsky, Ian Stewart, Anastasia Bizyaeva, Naomi Ehrich Leonard

    Abstract: How does a group of agents break indecision when deciding about options with qualities that are hard to distinguish? Biological and artificial multi-agent systems, from honeybees and bird flocks to bacteria, robots, and humans, often need to overcome indecision when choosing among options in situations in which the performance or even the survival of the group are at stake. Breaking indecision is… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 36 pages

  29. arXiv:2203.11703  [pdf, other

    math.OC eess.SY math.DS

    Switching transformations for decentralized control of opinion patterns in signed networks: application to dynamic task allocation

    Authors: Anastasia Bizyaeva, Giovanna Amorim, Maria Santos, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose a new decentralized design method to control opinion patterns on signed networks of agents making decisions about two options and to switch the network from any opinion pattern to a new desired one. Our method relies on switching transformations, which switch the sign of an agent's opinion at a stable equilibrium by flipping the sign of its local interactions with its neighbors. The glo… ▽ More

    Submitted 31 May, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

  30. arXiv:2201.13288  [pdf, other

    math.OC cs.LG stat.ML

    A Regret Minimization Approach to Multi-Agent Control

    Authors: Udaya Ghai, Udari Madhushani, Naomi Leonard, Elad Hazan

    Abstract: We study the problem of multi-agent control of a dynamical system with known dynamics and adversarial disturbances. Our study focuses on optimal control without centralized precomputed policies, but rather with adaptive control policies for the different agents that are only equipped with a stabilizing controller. We give a reduction from any (standard) regret minimizing control method to a distri… ▽ More

    Submitted 25 February, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:7422-7434, 2022

  31. arXiv:2111.12482  [pdf, other

    stat.ML cs.LG

    One More Step Towards Reality: Cooperative Bandits with Imperfect Communication

    Authors: Udari Madhushani, Abhimanyu Dubey, Naomi Ehrich Leonard, Alex Pentland

    Abstract: The cooperative bandit problem is increasingly becoming relevant due to its applications in large-scale decision-making. However, most research for this problem focuses exclusively on the setting with perfect communication, whereas in most real-world distributed settings, communication is often over stochastic networks, with arbitrary corruptions and delays. In this paper, we study cooperative ban… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Journal ref: Conference on Neural Information Processing Systems, 2021

  32. arXiv:2110.07392  [pdf, other

    cs.LG cs.MA math.OC

    Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication

    Authors: Justin Lidard, Udari Madhushani, Naomi Ehrich Leonard

    Abstract: A challenge in reinforcement learning (RL) is minimizing the cost of sampling associated with exploration. Distributed exploration reduces sampling complexity in multi-agent RL (MARL). We investigate the benefits to performance in MARL when exploration is fully decentralized. Specifically, we consider a class of online, episodic, tabular $Q$-learning problems under time-varying reward and transiti… ▽ More

    Submitted 2 May, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted as a conference paper to American Control Conference (ACC) 2022

  33. Flavored Gauge-Mediated Supersymmetry Breaking Models with Discrete Non-Abelian Symmetries

    Authors: Shu Tian Eu, Lisa L. Everett, Todd Garon, Neil Leonard

    Abstract: We investigate flavored gauge mediation models in which the Higgs and messenger doublets are embedded in multiplets of the discrete non-Abelian symmetry S3. In these theories, the S3 symmetry correlates the flavor structure of the quark and lepton Yukawa couplings with the structure of the messenger Yukawa couplings that contribute to the soft supersymmetry breaking mass parameters. We provide a s… ▽ More

    Submitted 29 March, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 32 pages, 8 figures

    Journal ref: Phys. Rev. D 105, 055019, 2022

  34. arXiv:2110.04396  [pdf, other

    stat.ML cs.LG

    When to Call Your Neighbor? Strategic Communication in Cooperative Stochastic Bandits

    Authors: Udari Madhushani, Naomi Leonard

    Abstract: In cooperative bandits, a framework that captures essential features of collective sequential decision making, agents can minimize group regret, and thereby improve performance, by leveraging shared information. However, sharing information can be costly, which motivates developing policies that minimize group regret while also reducing the number of messages communicated by agents. Existing coope… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

  35. arXiv:2108.00966  [pdf, other

    physics.soc-ph cs.MA cs.SI math.DS math.OC

    Tuning Cooperative Behavior in Games with Nonlinear Opinion Dynamics

    Authors: Shinkyu Park, Anastasia Bizyaeva, Mari Kawakatsu, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We examine the tuning of cooperative behavior in repeated multi-agent games using an analytically tractable, continuous-time, nonlinear model of opinion dynamics. Each modeled agent updates its real-valued opinion about each available strategy in response to payoffs and other agent opinions, as observed over a network. We show how the model provides a principled and systematic means to investigate… ▽ More

    Submitted 23 November, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

  36. arXiv:2103.14764  [pdf, ps, other

    math.OC cs.MA eess.SY

    Control of Agreement and Disagreement Cascades with Distributed Inputs

    Authors: Anastasia Bizyaeva, Timothy Sorochkin, Alessio Franci, Naomi Ehrich Leonard

    Abstract: For a group of autonomous communicating agents, the ability to distinguish a meaningful input from disturbance, and come to collective agreement or disagreement in response to that input, is paramount for carrying out coordinated objectives. In this work we study how a cascade of opinion formation spreads through a group of networked decision-makers in response to a distributed input signal. Using… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: 7 pages, 4 figures

  37. arXiv:2103.12223  [pdf, other

    physics.soc-ph math.OC

    Analysis and control of agreement and disagreement opinion cascades

    Authors: Alessio Franci, Anastasia Bizyaeva, Shinkyu Park, Naomi Ehrich Leonard

    Abstract: We introduce and analyze a continuous time and state-space model of opinion cascades on networks of large numbers of agents that form opinions about two or more options. By leveraging our recent results on the emergence of agreement and disagreement states, we introduce novel tools to analyze and control agreement and disagreement opinion cascades. New notions of agreement and disagreement central… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

  38. arXiv:2011.07720  [pdf, other

    stat.ML cs.LG math.PR

    Distributed Bandits: Probabilistic Communication on $d$-regular Graphs

    Authors: Udari Madhushani, Naomi Ehrich Leonard

    Abstract: We study the decentralized multi-agent multi-armed bandit problem for agents that communicate with probability over a network defined by a $d$-regular graph. Every edge in the graph has probabilistic weight $p$ to account for the ($1\!-\!p$) probability of a communication link failure. At each time step, each agent chooses an arm and receives a numerical reward associated with the chosen arm. Afte… ▽ More

    Submitted 8 October, 2021; v1 submitted 15 November, 2020; originally announced November 2020.

  39. arXiv:2011.05927  [pdf, other

    cs.LG eess.SY math.OC

    On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension

    Authors: Udari Madhushani, Biswadip Dey, Naomi Ehrich Leonard, Amit Chakraborty

    Abstract: Value function based reinforcement learning (RL) algorithms, for example, $Q$-learning, learn optimal policies from datasets of actions, rewards, and state transitions. However, when the underlying state transition dynamics are stochastic and evolve on a high-dimensional space, generating independent and identically distributed (IID) data samples for creating these datasets poses a significant cha… ▽ More

    Submitted 28 March, 2022; v1 submitted 11 November, 2020; originally announced November 2020.

  40. arXiv:2010.12932  [pdf, other

    cs.LG cs.CV

    LagNetViP: A Lagrangian Neural Network for Video Prediction

    Authors: Christine Allen-Blanchette, Sushant Veer, Anirudha Majumdar, Naomi Ehrich Leonard

    Abstract: The dominant paradigms for video prediction rely on opaque transition models where neither the equations of motion nor the underlying physical quantities of the system are easily inferred. The equations of motion, as defined by Newton's second law, describe the time evolution of a physical system state and can therefore be applied toward the determination of future system states. In this paper, we… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

  41. arXiv:2009.13600  [pdf, other

    math.OC cs.SI eess.SY math.DS

    Patterns of Nonlinear Opinion Formation on Networks

    Authors: Anastasia Bizyaeva, Ayanna Matthews, Alessio Franci, Naomi Ehrich Leonard

    Abstract: When communicating agents form opinions about a set of possible options, agreement and disagreement are both possible outcomes. Depending on the context, either can be desirable or undesirable. We show that for nonlinear opinion dynamics on networks, and a variety of network structures, the spectral properties of the underlying adjacency matrix fully characterize the occurrence of either agreement… ▽ More

    Submitted 26 March, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: 6 pages, 4 figures; accepted to appear in 2021 American Control Conference proceedings

  42. arXiv:2009.04332  [pdf, other

    math.OC cs.SI eess.SY math.DS

    Nonlinear Opinion Dynamics with Tunable Sensitivity

    Authors: Anastasia Bizyaeva, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose a continuous-time multi-option nonlinear generalization of classical linear weighted-average opinion dynamics. Nonlinearity is introduced by saturating opinion exchanges, and this is enough to enable a significantly greater range of opinion-forming behaviors with our model as compared to existing linear and nonlinear models. For a group of agents that communicate opinions over a network… ▽ More

    Submitted 30 July, 2021; v1 submitted 9 September, 2020; originally announced September 2020.

  43. arXiv:2009.01339  [pdf, other

    math.OC stat.ML

    Heterogeneous Explore-Exploit Strategies on Multi-Star Networks

    Authors: Udari Madhushani, Naomi Leonard

    Abstract: We investigate the benefits of heterogeneity in multi-agent explore-exploit decision making where the goal of the agents is to maximize cumulative group reward. To do so we study a class of distributed stochastic bandit problems in which agents communicate over a multi-star network and make sequential choices among options in the same uncertain environment. Typically, in multi-agent bandit problem… ▽ More

    Submitted 1 December, 2020; v1 submitted 2 September, 2020; originally announced September 2020.

  44. arXiv:2008.04383  [pdf, other

    math.OC cs.MA math.DS

    Influence Spread in the Heterogeneous Multiplex Linear Threshold Model

    Authors: Yaofeng Desmond Zhong, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: The linear threshold model (LTM) has been used to study spread on single-layer networks defined by one inter-agent sensing modality and agents homogeneous in protocol. We define and analyze the heterogeneous multiplex LTM to study spread on multi-layer networks with each layer representing a different sensing modality and agents heterogeneous in protocol. Protocols are designed to distinguish sign… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

  45. arXiv:2007.01926  [pdf, other

    cs.LG eess.SY stat.ML

    Unsupervised Learning of Lagrangian Dynamics from Images for Prediction and Control

    Authors: Yaofeng Desmond Zhong, Naomi Ehrich Leonard

    Abstract: Recent approaches for modelling dynamics of physical systems with neural networks enforce Lagrangian or Hamiltonian structure to improve prediction and generalization. However, when coordinates are embedded in high-dimensional data such as images, these approaches either lose interpretability or can only be applied to one particular example. We introduce a new unsupervised neural network model tha… ▽ More

    Submitted 31 August, 2022; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: This version corrects an error in Equation (3) of the 2020 NeurIPS Proceedings paper

  46. arXiv:2007.01424  [pdf, ps, other

    physics.soc-ph math.DS q-bio.PE

    Active Control and Sustained Oscillations in actSIS Epidemic Dynamics

    Authors: Yunxiu Zhou, Simon A. Levin, Naomi E. Leonard

    Abstract: An actively controlled Susceptible-Infected-Susceptible (actSIS) contagion model is presented for studying epidemic dynamics with continuous-time feedback control of infection rates. Our work is inspired by the observation that epidemics can be controlled through decentralized disease-control strategies such as quarantining, sheltering in place, social distancing, etc., where individuals actively… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

  47. arXiv:2004.06171  [pdf, other

    cs.LG math.OC stat.ML

    Distributed Learning: Sequential Decision Making in Resource-Constrained Environments

    Authors: Udari Madhushani, Naomi Ehrich Leonard

    Abstract: We study cost-effective communication strategies that can be used to improve the performance of distributed learning systems in resource-constrained environments. For distributed learning in sequential decision making, we propose a new cost-effective partial communication protocol. We illustrate that with this protocol the group obtains the same order of performance that it obtains with full commu… ▽ More

    Submitted 13 April, 2020; originally announced April 2020.

  48. arXiv:2004.03793  [pdf, other

    math.OC cs.LG

    A Dynamic Observation Strategy for Multi-agent Multi-armed Bandit Problem

    Authors: Udari Madhushani, Naomi Ehrich Leonard

    Abstract: We define and analyze a multi-agent multi-armed bandit problem in which decision-making agents can observe the choices and rewards of their neighbors under a linear observation cost. Neighbors are defined by a network graph that encodes the inherent observation constraints of the system. We define a cost associated with observations such that at every instance an agent makes an observation it rece… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

  49. arXiv:2003.01312  [pdf, other

    math.OC cs.LG stat.ML

    Distributed Cooperative Decision Making in Multi-agent Multi-armed Bandits

    Authors: Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: We study a distributed decision-making problem in which multiple agents face the same multi-armed bandit (MAB), and each agent makes sequential choices among arms to maximize its own individual reward. The agents cooperate by sharing their estimates over a fixed communication graph. We consider an unconstrained reward model in which two or more agents can choose the same arm and collect independen… ▽ More

    Submitted 11 August, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

  50. arXiv:1909.11852  [pdf, other

    math.OC eess.SY math.DS

    A Continuous Threshold Model of Cascade Dynamics

    Authors: Yaofeng Desmond Zhong, Naomi Ehrich Leonard

    Abstract: We present a continuous threshold model (CTM) of cascade dynamics for a network of agents with real-valued activity levels that change continuously in time. The model generalizes the linear threshold model (LTM) from the literature, where an agent becomes active (adopts an innovation) if the fraction of its neighbors that are active is above a threshold. With the CTM we study the influence on casc… ▽ More

    Submitted 25 September, 2019; originally announced September 2019.