Search | arXiv e-print repository

Energy-efficient Merging of Connected and Automated Vehicles using Control Barrier Functions

Authors: Shreshta Rajakumar Deshpande, Mrdjan Jankovic

Abstract: Highway merges present difficulties for human drivers and automated vehicles due to incomplete situational awareness and a need for a structured (precedence, order) environment, respectively. In this paper, an unstructured merge algorithm is presented for connected and automated vehicles. There is neither precedence nor established passing order through the merge point. The algorithm relies on Con… ▽ More Highway merges present difficulties for human drivers and automated vehicles due to incomplete situational awareness and a need for a structured (precedence, order) environment, respectively. In this paper, an unstructured merge algorithm is presented for connected and automated vehicles. There is neither precedence nor established passing order through the merge point. The algorithm relies on Control Barrier Functions for safety (collision avoidance) and for coordination that arises from exponential instability of stall-equilibria in the inter-agent space. A Monte Carlo simulation comparison to a first-in-first-out approach shows improvement in traffic flow and a significant energy efficiency benefit. △ Less

Submitted 19 March, 2025; originally announced March 2025.

Comments: This work has been submitted to an ASME journal for possible publication and is under review. Paper summary: 6 pages, 3 figures, 5 tables

arXiv:2308.15633 [pdf, other]

The Impact of Reference-Command Preview on Human-in-the-Loop Control Behavior

Authors: Pedram Rabiee, S. Alireza Seyyed Mousavi, Amelia J. S. Sheffler, Erik Hellström, Mrdjan Jankovic, Mario A. Santillo, T. M. Seigler, Jesse B. Hoagg

Abstract: This article presents results from an experiment in which 44 human subjects interact with a dynamic system to perform 40 trials of a command-following task. The reference command is unpredictable and different on each trial, but all subjects have the same sequence of reference commands for the 40 trials. The subjects are divided into 4 groups of 11 subjects. One group performs the command-followin… ▽ More This article presents results from an experiment in which 44 human subjects interact with a dynamic system to perform 40 trials of a command-following task. The reference command is unpredictable and different on each trial, but all subjects have the same sequence of reference commands for the 40 trials. The subjects are divided into 4 groups of 11 subjects. One group performs the command-following task without preview of the reference command, and the other 3 groups are given preview of the reference command for different time lengths into the future (0.5 s, 1 s, 1.5 s). A subsystem identification algorithm is used to obtain best-fit models of each subject's control behavior on each trial. The time- and frequency-domain performance, as well as the identified models of the control behavior for the 4 groups are examined to investigate the effects of reference-command preview. The results suggest that preview tends to improve performance by allowing the subjects to compensate for sensory time delay and approximate the inverse dynamics in feedforward. However, too much preview may decrease performance by degrading the ability to use the correct phase lead in feedforward. △ Less

Submitted 29 August, 2023; originally announced August 2023.

Comments: Preprint submitted to IEEE Transactions on Cybernetics

arXiv:2207.07829 [pdf, other]

Robust AI Driving Strategy for Autonomous Vehicles

Authors: Subramanya Nageshrao, Yousaf Rahman, Vladimir Ivanovic, Mrdjan Jankovic, Eric Tseng, Michael Hafner, Dimitar Filev

Abstract: There has been significant progress in sensing, perception, and localization for automated driving, However, due to the wide spectrum of traffic/road structure scenarios and the long tail distribution of human driver behavior, it has remained an open challenge for an intelligent vehicle to always know how to make and execute the best decision on road given available sensing / perception / localiza… ▽ More There has been significant progress in sensing, perception, and localization for automated driving, However, due to the wide spectrum of traffic/road structure scenarios and the long tail distribution of human driver behavior, it has remained an open challenge for an intelligent vehicle to always know how to make and execute the best decision on road given available sensing / perception / localization information. In this chapter, we talk about how artificial intelligence and more specifically, reinforcement learning, can take advantage of operational knowledge and safety reflex to make strategical and tactical decisions. We discuss some challenging problems related to the robustness of reinforcement learning solutions and their implications to the practical design of driving strategies for autonomous vehicles. We focus on automated driving on highway and the integration of reinforcement learning, vehicle motion control, and control barrier function, leading to a robust AI driving strategy that can learn and adapt safely. △ Less

Submitted 16 July, 2022; originally announced July 2022.

arXiv:2207.04915 [pdf, other]

Multi-agent systems with CBF-based controllers -- collision avoidance and liveness from instability

Authors: Mrdjan Jankovic, Mario Santillo, Yan Wang

Abstract: Assuring system stability is typically a major control design objective. In this paper, we present a system where instability provides a crucial benefit. We consider multi-agent collision avoidance using Control Barrier Functions (CBF) and study trade-offs between safety and liveness -- the ability to reach a destination without large detours or gridlock. We compare two standard decentralized poli… ▽ More Assuring system stability is typically a major control design objective. In this paper, we present a system where instability provides a crucial benefit. We consider multi-agent collision avoidance using Control Barrier Functions (CBF) and study trade-offs between safety and liveness -- the ability to reach a destination without large detours or gridlock. We compare two standard decentralized policies, with only the local (host) control available, to co-optimization policies (PCCA and CCS) where everyone's (virtual) control action is available. The co-optimization policies compute control for everyone even though they lack information about others' intentions. For comparison, we use a Centralized, full information policy as the benchmark. One contribution of this paper is proving feasibility for the Centralized, PCCA, and CCS policies. Monte Carlo simulations show that decentralized, host-only control policies and CCS lack liveness while the PCCA policy performs as well as the Centralized. Next, we explain the observed results by considering two agents negotiating the passing order through an intersection. We show that the structure and stability of the resulting equilibria correlates with the observed propensity to gridlock -- the policies with unstable equilibria avoid gridlocks while those with stable ones do not. △ Less

Submitted 11 July, 2022; originally announced July 2022.

Comments: 12 pages, 13 figures. arXiv admin note: substantial text overlap with arXiv:2012.10261

arXiv:2204.00127 [pdf, ps, other]

Future-Focused Control Barrier Functions for Autonomous Vehicle Control

Authors: Mitchell Black, Mrdjan Jankovic, Abhishek Sharma, Dimitra Panagou

Abstract: In this paper, we introduce a class of future-focused control barrier functions (ff-CBF) aimed at improving traditionally myopic CBF based control design and study their efficacy in the context of an unsignaled four-way intersection crossing problem for collections of both communicating and non-communicating autonomous vehicles. Our novel ff-CBF encodes that vehicles take control actions that avoi… ▽ More In this paper, we introduce a class of future-focused control barrier functions (ff-CBF) aimed at improving traditionally myopic CBF based control design and study their efficacy in the context of an unsignaled four-way intersection crossing problem for collections of both communicating and non-communicating autonomous vehicles. Our novel ff-CBF encodes that vehicles take control actions that avoid collisions predicted under a zero-acceleration policy over an arbitrarily long future time interval. In this sense the ff-CBF defines a virtual barrier, a loosening of which we propose in the form of a relaxed future-focused CBF (rff-CBF) that allows a relaxation of the virtual ff-CBF barrier far from the physical barrier between vehicles. We study the performance of ff-CBF and rff-CBF based controllers on communicating vehicles via a series of simulated trials of the intersection scenario, and in particular highlight how the rff-CBF based controller empirically outperforms a benchmark controller from the literature by improving intersection throughput while preserving safety and feasibility. Finally, we demonstrate our proposed ff-CBF control law on an intersection scenario in the laboratory environment with a collection of 5 non-communicating AION ground rovers. △ Less

Submitted 3 October, 2022; v1 submitted 31 March, 2022; originally announced April 2022.

Comments: 8 pages, 7 figures, 2 tables, submitted to 2023 American Control Conference, under review

arXiv:2201.09447 [pdf, ps, other]

Prescribed-Time Safety Design for a Chain of Integrators

Authors: Imoleayo Abel, Drew Steeves, Miroslav Krstic, Mrdjan Jankovic

Abstract: Safety in dynamical systems is commonly pursued using control barrier functions (CBFs) which enforce safety-constraints over the entire duration of a system's evolution. We propose a prescribed-time safety (PTSf) design which enforces safety only for a finite time of interest to the user. While traditional CBF designs would keep the system away from the barrier longer than necessary, our PTSf desi… ▽ More Safety in dynamical systems is commonly pursued using control barrier functions (CBFs) which enforce safety-constraints over the entire duration of a system's evolution. We propose a prescribed-time safety (PTSf) design which enforces safety only for a finite time of interest to the user. While traditional CBF designs would keep the system away from the barrier longer than necessary, our PTSf design lets the system reach the barrier by the prescribed time and obey the operator's intent thereafter. To emphasize the capability of our design for safety constraints with high relative degrees, we focus our exposition on a chain of integrators where the safety condition is defined for the state furthest from the control input. In contrast to existing CBF-based methods for high-relative degree constraints, our approach involves choosing explicitly specified gains (instead of class $\mathcal{K}$ functions), and, with the aid of backstepping, operates in the entirety of the original safe set with no additional restriction on the initial conditions. With Quadratic Programming (QP) being employed in the design, in addition to backstepping and CBFs with a PTSf property, we refer to our design as a QP-backstepping PT-CBF design. For illustration, we include a simulation for the double-integrator system. △ Less

Submitted 21 March, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

Comments: Paper length shrunk to 6 pages to meet American Control Conference page limit restrictions

arXiv:2108.10491 [pdf, other]

Control Barrier Functions With Unmodeled Dynamics Using Integral Quadratic Constraints

Authors: Peter Seiler, Mrdjan Jankovic, Erik Hellstrom

Abstract: This paper presents a control design method that achieves safety for systems with unmodeled dynamics at the plant input. The proposed method combines control barrier functions (CBFs) and integral quadratic constraints (IQCs). Simplified, low-order models are often used in the design of the controller. Parasitic, unmodeled dynamics (e.g. actuator dynamics, time delays, etc) can lead to safety viola… ▽ More This paper presents a control design method that achieves safety for systems with unmodeled dynamics at the plant input. The proposed method combines control barrier functions (CBFs) and integral quadratic constraints (IQCs). Simplified, low-order models are often used in the design of the controller. Parasitic, unmodeled dynamics (e.g. actuator dynamics, time delays, etc) can lead to safety violations. The proposed method bounds the input-output behavior of these unmodeled dynamics in the time-domain using an alpha-IQC. The alpha-IQC is then incorporated into the CBF constraint to ensure safety. The approach is demonstrated with a simple example. △ Less

Submitted 23 August, 2021; originally announced August 2021.

arXiv:2104.11332 [pdf, other]

Backup Control Barrier Functions: Formulation and Comparative Study

Authors: Yuxiao Chen, Mrdjan Jankovic, Mario Santillo, Aaron D. Ames

Abstract: The backup control barrier function (CBF) was recently proposed as a tractable formulation that guarantees the feasibility of the CBF quadratic programming (QP) via an implicitly defined control invariant set. The control invariant set is based on a fixed backup policy and evaluated online by forward integrating the dynamics under the backup policy. This paper is intended as a tutorial of the back… ▽ More The backup control barrier function (CBF) was recently proposed as a tractable formulation that guarantees the feasibility of the CBF quadratic programming (QP) via an implicitly defined control invariant set. The control invariant set is based on a fixed backup policy and evaluated online by forward integrating the dynamics under the backup policy. This paper is intended as a tutorial of the backup CBF approach and a comparative study to some benchmarks. First, the backup CBF approach is presented step by step with the underlying math explained in detail. Second, we prove that the backup CBF always has a relative degree 1 under mild assumptions. Third, the backup CBF approach is compared with benchmarks such as Hamilton Jacobi PDE and Sum-of-Squares on the computation of control invariant sets, which shows that one can obtain a control invariant set close to the maximum control invariant set under a good backup policy for many practical problems. △ Less

Submitted 22 April, 2021; originally announced April 2021.

arXiv:2012.10261 [pdf, other]

Collision Avoidance and Liveness of Multi-agent Systems with CBF-based Controllers

Authors: Mrdjan Jankovic, Mario Santillo

Abstract: In this paper we consider multi-agent navigation with collision avoidance using Control Barrier Functions (CBF). In the case of non-communicating agents, we consider trade-offs between level of safety guarantee and liveness - the ability to reach destination in short time without large detours or gridlock. We compare several CBF-based driving policies against the benchmark established by the Centr… ▽ More In this paper we consider multi-agent navigation with collision avoidance using Control Barrier Functions (CBF). In the case of non-communicating agents, we consider trade-offs between level of safety guarantee and liveness - the ability to reach destination in short time without large detours or gridlock. We compare several CBF-based driving policies against the benchmark established by the Centralized controller that requires communication. One of the policies (CCS2) being compared is new and straddles the space between policies with only local control available and a more complex Predictor-Corrector for Collision Avoidance (PCCA) policy that adjusts local copies of everyone's control actions based on observed behavior. The paper establishes feasibility for the Centralized, PCCA and CCS2 policies. Monte Carlo simulations show that decentralized, host-only control policies lack liveness compared to the ones that use all the control inputs in calculations and that the PCCA policy performs equally well as the Centralized, even though it is decentralized. △ Less

Submitted 18 December, 2020; originally announced December 2020.

arXiv:2011.10023 [pdf, other]

Online decentralized decision making with inequality constraints: an ADMM approach

Authors: Yuxiao Chen, Mario Santillo, Mrdjan Jankovic, Aaron D. Ames

Abstract: We discuss an online decentralized decision making problem where the agents are coupled with affine inequality constraints. Alternating Direction Method of Multipliers (ADMM) is used as the computation engine and we discuss the convergence of the algorithm in an online setting. To be specific, when decisions have to be made sequentially with a fixed time step, there might not be enough time for th… ▽ More We discuss an online decentralized decision making problem where the agents are coupled with affine inequality constraints. Alternating Direction Method of Multipliers (ADMM) is used as the computation engine and we discuss the convergence of the algorithm in an online setting. To be specific, when decisions have to be made sequentially with a fixed time step, there might not be enough time for the ADMM to converge before the scenario changes and the decision needs to be updated. In this case, a suboptimal solution is employed and we analyze the optimality gap given the convergence condition. Moreover, in many cases, the decision making problem changes gradually over time. We propose a warm-start scheme to accelerate the convergence of ADMM and analyze the benefit of the warm-start. The proposed method is demonstrated in a decentralized multiagent control barrier function problem with simulation. △ Less

Submitted 19 November, 2020; originally announced November 2020.

arXiv:2008.12092 [pdf, other]

Collision Free Navigation with Interacting, Non-Communicating Obstacles

Authors: Mario Santillo, Mrdjan Jankovic

Abstract: In this paper we consider the problem of navigation and motion control in an area densely populated with other agents. We propose an algorithm that, without explicit communication and based on the information it has, computes the best control action for all the agents and implements its own. Notably, the host agent (the agent executing the algorithm) computes the differences between the other agen… ▽ More In this paper we consider the problem of navigation and motion control in an area densely populated with other agents. We propose an algorithm that, without explicit communication and based on the information it has, computes the best control action for all the agents and implements its own. Notably, the host agent (the agent executing the algorithm) computes the differences between the other agents' computed and observed control actions and treats them as known disturbances that are fed back into a robust control barrier function (RCBF) based quadratic program. A feedback loop is created because the computed control action for another agent depends on the previously used disturbance estimate. In the case of two interacting agents, stability of the feedback loop is proven and a performance guarantee in terms of constraint adherence is established. This holds whether the other agent executes the same algorithm or not. △ Less

Submitted 27 August, 2020; originally announced August 2020.

arXiv:1901.01816 [pdf]

doi 10.1109/JBHI.2018.2887067

A dual mode adaptive basal-bolus advisor based on reinforcement learning

Authors: Qingnan Sun, Marko V. Jankovic, João Budzinski, Brett Moore, Peter Diem, Christoph Stettler, Stavroula G. Mougiakakou

Abstract: Self-monitoring of blood glucose (SMBG) and continuous glucose monitoring (CGM) are commonly used by type 1 diabetes (T1D) patients to measure glucose concentrations. The proposed adaptive basal-bolus algorithm (ABBA) supports inputs from either SMBG or CGM devices to provide personalised suggestions for the daily basal rate and prandial insulin doses on the basis of the patients' glucose level on… ▽ More Self-monitoring of blood glucose (SMBG) and continuous glucose monitoring (CGM) are commonly used by type 1 diabetes (T1D) patients to measure glucose concentrations. The proposed adaptive basal-bolus algorithm (ABBA) supports inputs from either SMBG or CGM devices to provide personalised suggestions for the daily basal rate and prandial insulin doses on the basis of the patients' glucose level on the previous day. The ABBA is based on reinforcement learning (RL), a type of artificial intelligence, and was validated in silico with an FDA-accepted population of 100 adults under different realistic scenarios lasting three simulated months. The scenarios involve three main meals and one bedtime snack per day, along with different variabilities and uncertainties for insulin sensitivity, mealtime, carbohydrate amount, and glucose measurement time. The results indicate that the proposed approach achieves comparable performance with CGM or SMBG as input signals, without influencing the total daily insulin dose. The results are a promising indication that AI algorithmic approaches can provide personalised adaptive insulin optimisation and achieve glucose control - independently of the type of glucose monitoring technology. △ Less

Submitted 7 January, 2019; originally announced January 2019.

Comments: 9 pages, 8 figures, accepted by Journal of Biomedical and Health Informatics in December 2018

Showing 1–12 of 12 results for author: Jankovic, M