
Safety-Aware Reinforcement Learning for Control via Risk-Sensitive Action-Value Iteration and Quantile Regression

Authors: Clinton Enwerem, Aniruddh G. Puranic, John S. Baras, Calin Belta

Abstract: Mainstream approximate action-value iteration reinforcement learning (RL) algorithms suffer from overestimation bias, leading to suboptimal policies in high-variance stochastic environments. Quantile-based action-value iteration methods reduce this bias by learning a distribution of the expected cost-to-go using quantile regression. However, ensuring that the learned policy satisfies safety constraints remains a challenge when these constraints are not explicitly integrated into the RL framework. Existing methods often require complex neural architectures or manual tradeoffs due to combined cost functions. To address this, we propose a risk-regularized quantile-based algorithm integrating Conditional Value-at-Risk (CVaR) to enforce safety without complex architectures. We also provide theoretical guarantees on the contraction properties of the risk-sensitive distributional Bellman operator in Wasserstein space, ensuring convergence to a unique cost distribution. Simulations of a mobile robot in a dynamic reach-avoid task show that our approach leads to more goal successes, fewer collisions, and better safety-performance trade-offs compared to risk-neutral methods.

Submitted 7 June, 2025; originally announced June 2025.

Comments: 13 pages, 4 figures. Submission under review
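
Illustrative note: the abstract above combines quantile estimates of the cost-to-go with CVaR. The sketch below shows one common sample-based way to approximate CVaR from equally weighted quantile estimates and to fold it into action selection. The function names, the additive weighting `lam`, and the toy numbers are assumptions made for illustration; they are not taken from the paper.

```python
import math


def cvar_from_quantiles(quantiles, alpha):
    """Approximate CVaR_alpha of a COST distribution (upper tail) from
    equally weighted quantile estimates: the mean of the worst (1 - alpha) share."""
    if not quantiles:
        raise ValueError("need at least one quantile estimate")
    if not 0.0 <= alpha < 1.0:
        raise ValueError("alpha must lie in [0, 1)")
    ordered = sorted(quantiles)
    n = len(ordered)
    # Number of tail samples; rounding guards against floating-point noise.
    k = max(1, math.ceil(round((1.0 - alpha) * n, 9)))
    tail = ordered[n - k:]
    return sum(tail) / k


def risk_regularized_cost(quantiles, alpha, lam):
    """Hypothetical objective: mean cost plus lam times CVaR (an assumed form)."""
    mean_cost = sum(quantiles) / len(quantiles)
    return mean_cost + lam * cvar_from_quantiles(quantiles, alpha)


def pick_action(quantiles_per_action, alpha, lam):
    """Choose the action with the smallest risk-regularized cost."""
    return min(
        quantiles_per_action,
        key=lambda a: risk_regularized_cost(quantiles_per_action[a], alpha, lam),
    )
```

With `lam = 0` the selection is risk-neutral (mean cost only); a larger `lam` penalizes actions whose cost distribution has a heavy upper tail.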

arXiv:2501.13192 [pdf, ps, other]

Remote State Estimation over Unreliable Channels with Unreliable Feedback: Fundamental Limits

Authors: Touraj Soleymani, Mohamad Assaad, John S. Baras

Abstract: This article is concerned with networked estimation in a system composed of a source that is observed by a sensor, a remote monitor that needs to estimate the state of the source in real time, and a communication channel that connects the source to the monitor. The source is a partially observable dynamical process, and the communication channel is a packet-erasure channel with feedback. Our main objective is to obtain the fundamental performance limits of the underlying networked system in the sense of a causal tradeoff between the packet rate and the mean square error when both forward and backward channels are unreliable. We characterize an optimal coding policy profile consisting of a scheduling policy for the encoder and an estimation policy for the decoder. We complement our theoretical results with a numerical analysis, and compare the performance limits of the networked system in different communication regimes.

Submitted 22 January, 2025; originally announced January 2025.

Comments: arXiv admin note: text overlap with arXiv:2308.16085

arXiv:2408.08668 [pdf, other]

Robust Stochastic Shortest-Path Planning via Risk-Sensitive Incremental Sampling

Authors: Clinton Enwerem, Erfaun Noorani, John S. Baras, Brian M. Sadler

Abstract: With the pervasiveness of Stochastic Shortest-Path (SSP) problems in high-risk industries, such as last-mile autonomous delivery and supply chain management, robust planning algorithms are crucial for ensuring successful task completion while mitigating hazardous outcomes. Mainstream chance-constrained incremental sampling techniques for solving SSP problems tend to be overly conservative and typically do not consider the likelihood of undesirable tail events. We propose an alternative risk-aware approach inspired by the asymptotically-optimal Rapidly-Exploring Random Trees (RRT*) planning algorithm, which selects nodes along path segments with minimal Conditional Value-at-Risk (CVaR). Our motivation rests on the step-wise coherence of the CVaR risk measure and the optimal substructure of the SSP problem. Thus, optimizing with respect to the CVaR at each sampling iteration necessarily leads to an optimal path in the limit of the sample size. We validate our approach via numerical path planning experiments in a two-dimensional grid world with obstacles and stochastic path-segment lengths. Our simulation results show that incorporating risk into the tree growth process yields paths with lengths that are significantly less sensitive to variations in the noise parameter, or equivalently, paths that are more robust to environmental uncertainty. Algorithmic analyses reveal similar query time and memory space complexity to the baseline RRT* procedure, with only a marginal increase in processing time. This increase is offset by significantly lower noise sensitivity and reduced planner failure rates.

Submitted 16 August, 2024; originally announced August 2024.

Comments: Accepted for presentation at the 2024 IEEE Conference on Decision and Control (CDC)

arXiv:2405.16430 [pdf, other]

GAMEOPT+: Improving Fuel Efficiency in Unregulated Heterogeneous Traffic Intersections via Optimal Multi-agent Cooperative Control

Authors: Nilesh Suriyarachchi, Rohan Chandra, Arya Anantula, John S. Baras, Dinesh Manocha

Abstract: Better fuel efficiency leads to better financial security as well as a cleaner environment. We propose a novel approach for improving fuel efficiency in unstructured and unregulated traffic environments. Existing intelligent transportation solutions for improving fuel efficiency, however, apply only to traffic intersections with sparse traffic or traffic where drivers obey the regulations, or both. We propose GameOpt+, a novel hybrid approach for cooperative intersection control in dynamic, multi-lane, unsignalized intersections. GameOpt+ is a hybrid solution that combines an auction mechanism and an optimization-based trajectory planner. It generates a priority entrance sequence for each agent and computes velocity controls in real-time, taking less than 10 milliseconds even in high-density traffic with over 10,000 vehicles per hour. Compared to fully optimization-based methods, it operates 100 times faster while ensuring fairness, safety, and efficiency. Tested on the SUMO simulator, our algorithm improves throughput by at least 25%, reduces the time to reach the goal by at least 70%, and decreases fuel consumption by 50% compared to auction-based and signaled approaches using traffic lights and stop signs. GameOpt+ is also unaffected by unbalanced traffic inflows, whereas some of the other baselines encountered a decrease in performance in unbalanced traffic inflow environments.

Submitted 26 May, 2024; originally announced May 2024.

Comments: Journal Version

arXiv:2405.07381 [pdf, ps, other]

Networked Control with Hybrid Automatic Repeat Request Protocols

Authors: Touraj Soleymani, John S. Baras, Deniz Gündüz

Abstract: We study feedback control of a dynamical process over a lossy channel equipped with a hybrid automatic repeat request protocol that connects a sensor to an actuator. The dynamical process is modeled by a Gauss-Markov process, and the lossy channel by a packet-erasure channel with ideal feedback. We suppose that data is communicated in the format of packets with negligible quantization error. In such a networked control system, whenever a packet loss occurs, there exists a tradeoff between transmitting new sensory information with a lower success probability and retransmitting previously failed sensory information with a higher success probability. In essence, this is an inherent tradeoff between freshness and reliability. To address this tradeoff, we consider a linear-quadratic-regulator performance index, which penalizes state deviations and control efforts over a finite horizon, and jointly design optimal policies for an encoder and a decoder, which are collocated with the sensor and the actuator, respectively. Our emphasis here lies specifically on designing switching and control policies, rather than error-correcting codes. We derive the structural properties of the optimal encoding and decoding policies. We show that the former is a threshold switching policy and the latter is a certainty-equivalent control policy. In addition, we specify the iterative equations that the encoder and the decoder need to solve in order to implement the optimal policies.

Submitted 12 May, 2024; originally announced May 2024.

arXiv:2403.11932 [pdf, ps, other]

Consistency of Value of Information: Effects of Packet Loss and Time Delay in Networked Control Systems Tasks

Authors: Touraj Soleymani, John S. Baras, Siyi Wang, Sandra Hirche, Karl H. Johansson

Abstract: In this chapter, we study the consistency of the value of information (a semantic metric that claims to determine the right piece of information in networked control systems tasks) in a lossy and delayed communication regime. Our analysis begins with a focus on state estimation, and subsequently extends to feedback control. To that end, we make a causal tradeoff between the packet rate and the mean square error. Associated with this tradeoff, we demonstrate the existence of an optimal policy profile, comprising a symmetric threshold scheduling policy based on the value of information for the encoder and a non-Gaussian linear estimation policy for the decoder. Our structural results assert that the scheduling policy is expressible in terms of $3d-1$ variables related to the source and the channel, where $d$ is the time delay, and that the estimation policy incorporates no residual related to signaling. We then construct an optimal control policy by exploiting the separation principle.

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2403.11927 [pdf, ps, other]

Foundations of Value of Information: A Semantic Metric for Networked Control Systems Tasks

Authors: Touraj Soleymani, John S. Baras, Sandra Hirche, Karl H. Johansson

Abstract: In this chapter, we present our recent invention, i.e., the notion of the value of information, a semantic metric that is fundamental for networked control systems tasks. We begin our analysis by formulating a causal tradeoff between the packet rate and the regulation cost, with an encoder and a decoder as two distributed decision makers, and show that the valuation of information is conceivable and quantifiable grounded on this tradeoff. More precisely, we characterize an equilibrium, and quantify the value of information there as the variation in a value function with respect to a piece of sensory measurement that can be communicated from the encoder to the decoder at each time. We prove that, in feedback control of a dynamical process over a noiseless channel, the value of information is a function of the discrepancy between the state estimates at the encoder and the decoder, and that a data packet containing a sensory measurement at each time should be exchanged only if the value of information at that time is nonnegative. Finally, we prove that the characterized equilibrium is in fact globally optimal.

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2403.11926 [pdf, ps, other]

Relation between Value and Age of Information in Feedback Control

Authors: Touraj Soleymani, John S. Baras, Karl H. Johansson

Abstract: In this chapter, we investigate the value of information as a more comprehensive instrument than the age of information for optimally shaping the information flow in a networked control system. In particular, we quantify the value of information based on the variation in a value function, and discuss the structural properties of this metric. Through our analysis, we establish the mathematical relation between the value of information and the age of information. We prove that the value of information is in general a function of an estimation discrepancy that depends on the age of information and the primitive variables. In addition, we prove that there exists a condition under which the value of information becomes completely expressible in terms of the age of information. Nonetheless, we show that this condition is not achievable without a degradation in the performance of the system.

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2311.08491 [pdf, other]

doi 10.1109/ITSC55140.2022.9921986

Cooperative Bidirectional Mixed-Traffic Overtaking

Authors: Faizan M. Tariq, Nilesh Suriyarachchi, Christos Mavridis, John S. Baras

Abstract: Safe overtaking, especially in a bidirectional mixed-traffic setting, remains a key challenge for Connected Autonomous Vehicles (CAVs). The presence of human-driven vehicles (HDVs), behavior unpredictability, and blind spots resulting from sensor occlusion make this a challenging control problem. To overcome these difficulties, we propose a cooperative communication-based approach that utilizes the information shared between CAVs to reduce the effects of sensor occlusion while benefiting from the local velocity prediction based on past tracking data. Our control framework aims to perform overtaking maneuvers with the objective of maximizing velocity while prioritizing safety and passenger comfort. Our method is also capable of reactively adjusting its plan to dynamic changes in the environment. The performance of the proposed approach is verified using realistic traffic simulations.

Submitted 14 November, 2023; originally announced November 2023.

Comments: Published in: 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC)

Journal ref: 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), Macau, China, 2022, pp. 2494-2501

arXiv:2311.03284 [pdf, other]

Safe Collective Control under Noisy Inputs and Competing Constraints via Non-Smooth Barrier Functions

Authors: Clinton Enwerem, John S. Baras

Abstract: We consider the problem of safely coordinating ensembles of identical autonomous agents to conduct complex missions with conflicting safety requirements and under noisy control inputs. Using non-smooth control barrier functions (CBFs) and stochastic model-predictive control as springboards, and by adopting an extrinsic approach where the ensemble is treated as a unified dynamic entity, we devise a method to synthesize safety-aware control inputs for uncertain collectives. Drawing upon stochastic CBF theory and recent developments in Boolean CBF composition, our method proceeds by smoothing a Boolean-composed CBF and solving a stochastic optimization problem where each agent's forcing term is restricted to the affine subspace of control inputs certified by the combined CBF. For the smoothing step, we employ a polynomial approximation scheme, providing evidence for its advantage in generating more conservative yet sufficiently-filtered control inputs than the smoother but more aggressive equivalents produced from an approximation technique based on the log-sum-exp function. To further demonstrate the utility of the proposed method, we present an upper bound for the expected CBF approximation error, along with results from simulations of a single-integrator collective under velocity perturbations. Lastly, we compare these results with those obtained using a naive state-feedback controller lacking safety filters.

Submitted 28 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

Comments: Accepted to the 2024 European Control Conference. See Section VI.B (in particular, Theorem 1, Proposition 2, and Remark 2) for updates incorporating new results (from Reference 3) on almost-sure safety of ZCBFs
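
Illustrative note: the abstract above contrasts its polynomial smoothing scheme with a log-sum-exp approximation of a Boolean-composed CBF. As a rough sketch of the log-sum-exp baseline only, and assuming the Boolean AND of several safety conditions is encoded as the pointwise minimum of the individual barrier values, a smooth under-approximation of that minimum can be written as follows. The function name `soft_min` and the sharpness parameter `k` are illustrative choices, not names from the paper, and the paper's polynomial scheme is not reproduced here.

```python
import math


def soft_min(values, k):
    """Log-sum-exp smooth approximation of min(values).

    Computes -(1/k) * log(sum(exp(-k * v))) in a numerically stable form.
    For n values it satisfies: min - log(n)/k <= soft_min <= min.
    """
    if not values:
        raise ValueError("need at least one barrier value")
    if k <= 0:
        raise ValueError("sharpness k must be positive")
    m = min(values)
    # Shift by the minimum so every exponent is <= 0 (no overflow).
    total = sum(math.exp(-k * (v - m)) for v in values)
    return m - math.log(total) / k
```

A larger `k` tracks the true minimum more tightly, at the price of sharper gradients near the points where the active constraint switches.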

arXiv:2310.01538 [pdf, ps, other]

Risk-Sensitive Inhibitory Control for Safe Reinforcement Learning

Authors: Armin Lederer, Erfaun Noorani, John S. Baras, Sandra Hirche

Abstract: Humans have the ability to deviate from their natural behavior when necessary, which is a cognitive process called response inhibition. Similar approaches have independently received increasing attention in recent years for ensuring the safety of control. Realized using control barrier functions or predictive safety filters, these approaches can effectively ensure the satisfaction of state constraints through an online adaptation of nominal control laws, e.g., obtained through reinforcement learning. While the focus of these realizations of inhibitory control has been on risk-neutral formulations, human studies have shown a tight link between response inhibition and risk attitude. Inspired by this insight, we propose a flexible, risk-sensitive method for inhibitory control. Our method is based on a risk-aware condition for value functions, which guarantees the satisfaction of state constraints. We propose a method for learning these value functions using common techniques from reinforcement learning and derive sufficient conditions for its success. By enforcing the derived safety conditions online using the learned value function, risk-sensitive inhibitory control is effectively achieved. The effectiveness of the developed control scheme is demonstrated in simulations.

Submitted 2 October, 2023; originally announced October 2023.

Comments: The 62nd IEEE Conference on Decision and Control, Dec. 13-15, 2023, Singapore

arXiv:2309.12531 [pdf, other]

RCMS: Risk-Aware Crash Mitigation System for Autonomous Vehicles

Authors: Faizan M. Tariq, David Isele, John S. Baras, Sangjae Bae

Abstract: We propose a risk-aware crash mitigation system (RCMS), to augment any existing motion planner (MP), that enables an autonomous vehicle to perform evasive maneuvers in high-risk situations and minimize the severity of collision if a crash is inevitable. In order to facilitate a smooth transition between RCMS and MP, we develop a novel activation mechanism that combines instantaneous as well as predictive collision risk evaluation strategies in a unified hysteresis-band approach. For trajectory planning, we deploy a modular receding horizon optimization-based approach that minimizes a smooth situational risk profile, while adhering to the physical road limits as well as vehicular actuator limits. We demonstrate the performance of our approach in a simulation environment.

Submitted 21 September, 2023; originally announced September 2023.

Comments: Presented at the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC) 2023, Bilbao, Bizkaia, Spain

arXiv:2309.09156 [pdf, ps, other]

Consensus-Based Leader-Follower Formation Tracking for Control-Affine Nonlinear Multiagent Systems

Authors: Clinton Enwerem, John S. Baras

Abstract: In the typical multiagent formation tracking problem centered on consensus, the prevailing assumption in the literature is that the agents' nonlinear models can be approximated by integrator systems, by their feedback-linearized equivalents, or by dynamics composed of deterministic linear and nonlinear terms. The resulting approaches associated with such assumptions, however, are hardly applicable to general nonlinear systems. To this end, we present consensus-based control laws for multiagent formation tracking in finite-dimensional state space, with the agents represented by a more general class of dynamics: control-affine nonlinear systems. The agents also exchange information via a leader-follower communication topology modeled as an undirected and connected graph with a single leader node. By leveraging standard tools from algebraic graph theory and Lyapunov analysis, we first derive a locally asymptotically stabilizing formation tracking law. Next, to demonstrate the effectiveness of our approach, we present results from numerical simulations of an example in robotics. These results -- together with a comparison of the formation errors obtained with our approach and those realized via an optimization-based method -- further validate our theoretical propositions.

Submitted 17 September, 2023; originally announced September 2023.

Comments: To appear in the proceedings of the 9th International Conference on Control, Decision, and Information Technologies (CoDIT)

arXiv:2308.16446 [pdf, ps, other]

PASA: A Priori Adaptive Splitting Algorithm for the Split Delivery Vehicle Routing Problem

Authors: Nariman Torkzaban, Anousheh Gholami, John S. Baras, Bruce Golden

Abstract: The split delivery vehicle routing problem (SDVRP) is a relaxed variant of the capacitated vehicle routing problem (CVRP) where the restriction that each customer is visited precisely once is removed. Compared with CVRP, the SDVRP allows a reduction in the cost of the routes traveled by vehicles. The exact methods to solve the SDVRP are computationally expensive. Moreover, the complexity and difficult implementation of the state-of-the-art heuristic approaches hinder their application in real-life scenarios of the SDVRP. In this paper, we propose an easily understandable and effective approach to solve the SDVRP based on an a priori adaptive splitting algorithm (PASA). The idea of an a priori split strategy was first introduced in Chen et al. (2017). In this approach, the demand of the customers is split into smaller values using a fixed splitting rule in advance. Consequently, the original SDVRP instance is converted to a CVRP instance which is solved using an existing CVRP solver. While the proposed a priori splitting rule in Chen et al. (2017) is fixed for all customers regardless of their demand and location, we suggest an adaptive splitting rule that takes into account the distance of the customers to the depot and their demand values. Our experiments show that PASA can generate solutions comparable to the state-of-the-art but much faster. Furthermore, our algorithm outperforms the fixed a priori splitting rule proposed by Chen et al. (2017).

Submitted 31 August, 2023; originally announced August 2023.
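
Illustrative note: the abstract above describes converting an SDVRP instance into a CVRP instance by splitting each customer's demand in advance. The sketch below shows only the general shape of such a conversion with a deliberately simple fixed rule (pieces of at most a fixed fraction of vehicle capacity). It is neither the rule of Chen et al. (2017) nor the adaptive rule of PASA; the function names, the tuple layout, and the `fraction` parameter are assumptions for illustration.

```python
def split_demand(demand, piece_size):
    """Split an integer demand into pieces of at most piece_size units."""
    if demand < 0 or piece_size <= 0:
        raise ValueError("demand must be >= 0 and piece_size > 0")
    full, rest = divmod(demand, piece_size)
    return [piece_size] * full + ([rest] if rest else [])


def to_cvrp_customers(customers, capacity, fraction=0.25):
    """Replace each (name, location, demand) customer with co-located
    pseudo-customers whose demands sum to the original demand.

    The resulting list can be handed to any CVRP solver, which then visits
    each pseudo-customer exactly once; visits to different pieces of the
    same original customer play the role of split deliveries.
    """
    piece_size = max(1, int(capacity * fraction))  # hypothetical fixed rule
    expanded = []
    for name, location, demand in customers:
        for part in split_demand(demand, piece_size):
            expanded.append((name, location, part))
    return expanded
```

An adaptive rule in the spirit of the abstract would choose the piece sizes per customer, for example as a function of distance to the depot and demand, instead of using one `fraction` for everyone.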

arXiv:2308.15659 [pdf, other]

Enabling Cooperative Hybrid Beamforming in TDD-based Distributed MIMO Systems

Authors: Nariman Torkzaban, Amir Khojastepour, John S. Baras

Abstract: Distributed massive MIMO networks are envisioned to realize cooperative multi-point transmission in next-generation wireless systems. For efficient cooperative hybrid beamforming, the cluster of access points (APs) needs to obtain precise estimates of the uplink channel to perform reliable downlink precoding. However, due to the radio frequency (RF) impairments between the transceivers at the two end-points of the wireless channel, full channel reciprocity does not hold, which results in performance degradation in the cooperative hybrid beamforming (CHBF) unless a suitable reciprocity calibration mechanism is in place. We propose a two-step approach to calibrate any two hybrid nodes in the distributed MIMO system. We then present and utilize the novel concept of reciprocal tandem to propose a low-complexity approach for jointly calibrating the cluster of APs and estimating the downlink channel. Finally, we validate our calibration technique's effectiveness through numerical simulation.

Submitted 29 August, 2023; originally announced August 2023.

arXiv:2308.15650 [pdf, other]

Blind Cyclic Prefix-based CFO Estimation in MIMO-OFDM Systems

Authors: Nariman Torkzaban, Amir Khojastepour, John S. Baras

Abstract: Low-complexity estimation and correction of carrier frequency offset (CFO) are essential in orthogonal frequency division multiplexing (OFDM). In this paper, we propose a low-overhead blind CFO estimation technique based on cyclic prefix (CP), in multi-input multi-output (MIMO)-OFDM systems. We propose to use antenna diversity for CFO estimation. Given that the RF chains for all antenna elements at a communication node share the same clock, the carrier frequency offset (CFO) between two points may be estimated by using the combination of the received signal at all antennas. We improve our method by combining the antenna diversity with time diversity by considering the CP for multiple OFDM symbols. We provide a closed-form expression for CFO estimation and present algorithms that can considerably improve the CFO estimation performance at the expense of a linear increase in computational complexity. We validate the effectiveness of our estimation scheme via extensive numerical analysis.

Submitted 29 August, 2023; originally announced August 2023.

Comments: To Appear in Proceedings of IEEE Globecom 2023
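
Illustrative note: the abstract above estimates CFO blindly from the cyclic prefix and pools the received signals of all antennas. The sketch below shows the classical CP-correlation idea with a plain sum over antennas, under idealized assumptions: one OFDM symbol, a flat complex gain per antenna, and a CFO normalized to the subcarrier spacing with magnitude below 0.5. It is not claimed to be the paper's closed-form expression; all function names are illustrative.

```python
import cmath
import math


def add_cyclic_prefix(symbol, cp_len):
    """Prepend the last cp_len time-domain samples of the symbol."""
    return symbol[-cp_len:] + symbol


def apply_cfo(samples, eps, n_fft):
    """Rotate sample n by exp(j*2*pi*eps*n/n_fft); eps is the normalized CFO."""
    return [x * cmath.exp(2j * math.pi * eps * n / n_fft)
            for n, x in enumerate(samples)]


def estimate_cfo(rx_per_antenna, n_fft, cp_len):
    """Correlate each CP sample with its copy n_fft samples later and sum
    the correlations over all antennas before taking the phase.

    Because the CP repeats the symbol tail, each product has phase
    2*pi*eps (in the noise-free, flat-gain case), so the summed phase
    divided by 2*pi recovers eps.
    """
    acc = 0j
    for rx in rx_per_antenna:
        for n in range(cp_len):
            acc += rx[n + n_fft] * rx[n].conjugate()
    return cmath.phase(acc) / (2.0 * math.pi)
```

Summing the correlations before taking the phase lets antennas with stronger gains contribute more; extending the inner sum over the CPs of several consecutive symbols would add the time diversity mentioned in the abstract.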

arXiv:2305.11230 [pdf, other]

Learning Agent Interactions from Density Evolution in 3D Regions With Obstacles

Authors: Amoolya Tirumalai, Christos N. Mavridis, John S. Baras

Abstract: In this work, we study the inverse problem of identifying complex flocking dynamics in a domain cluttered with obstacles. We get inspiration from animal flocks moving in complex ways with capabilities far beyond what current robots can do. Owing to the difficulty of observing and recovering the trajectories of the agents, we focus on the dynamics of their probability densities, which are governed by partial differential equations (PDEs), namely compressible Euler equations subject to non-local forces. We formulate the inverse problem of learning interactions as a PDE-constrained optimization problem of minimizing the squared Hellinger distance between the histogram of the flock and the distribution associated to our PDEs. The numerical methods used to efficiently solve the PDE-constrained optimization problem are described. Realistic flocking data are simulated using the Boids model of flocking agents, which differs in nature from the reconstruction models used in our PDEs. Our analysis and simulated experiments show that the behavior of cohesive flocks can be recovered accurately with approximate PDE solutions.

Submitted 18 May, 2023; originally announced May 2023.

Comments: 6 pages, 5 figures, submitted to IEEE CDC 2023

arXiv:2304.14556 [pdf, other]

Mobile Network Slicing under Demand Uncertainty: A Stochastic Programming Approach

Authors: Anousheh Gholami, Nariman Torkzaban, John S. Baras

Abstract: Network slicing enables the deployment of multiple dedicated virtual sub-networks, i.e. slices on a shared physical infrastructure. Unlike traditional one-size-fits-all resource provisioning schemes, each network slice (NS) in 5G is tailored to the specific service requirements of a group of customers. An end-to-end (E2E) mobile NS orchestration requires the simultaneous provisioning of computing, storage, and networking resources across the core network (CN) and the radio access network (RAN). Constant temporospatial changes in mobile user demand profiles further complicate the E2E NSs resource provisioning beyond the limits of the existing best-effort schemes that are only effective under accurate demand forecasts for all slices. This paper proposes a practical two-time-scale resource provisioning framework for E2E network slicing under demand uncertainty. At each macro-scale instance, we assume that only the spatial probability distribution of the NS demands is available. We formulate the NSs resource allocation problem as a stochastic mixed integer program (SMIP) with the objective of minimizing the total resource cost at the CN and the RAN. At each microscale instance, utilizing the exact slice demand profiles, a linear program is solved to jointly minimize the unsupported traffic and the resource cost at the RAN. We verify the effectiveness of our resource allocation scheme through numerical experiments.

Submitted 27 April, 2023; originally announced April 2023.

arXiv:2303.00861 [pdf, other]

doi 10.1109/CDC51059.2022.9992401

SLAS: Speed and Lane Advisory System for Highway Navigation

Authors: Faizan M. Tariq, David Isele, John S. Baras, Sangjae Bae

Abstract: This paper proposes a hierarchical autonomous vehicle navigation architecture, composed of a high-level speed and lane advisory system (SLAS) coupled with low-level trajectory generation and trajectory following modules. Specifically, we target a multi-lane highway driving scenario where an autonomous ego vehicle navigates in traffic. We propose a novel receding horizon mixed-integer optimization based method for SLAS with the objective to minimize travel time while accounting for passenger comfort. We further incorporate various modifications in the proposed approach to improve the overall computational efficiency and achieve real-time performance. We demonstrate the efficacy of the proposed approach in contrast to the existing methods, when applied in conjunction with state-of-the-art trajectory generation and trajectory following frameworks, in a CARLA simulation environment.

Submitted 1 March, 2023; originally announced March 2023.

Comments: Presented at the IEEE 61st Conference on Decision and Control (CDC), Cancun, Mexico, 2022

Journal ref: 2022 IEEE 61st Conference on Decision and Control (CDC), Cancun, Mexico, 2022, pp. 6979-6986

arXiv:2302.05416 [pdf, other]

Approximate Dynamic Programming for a Mean-field Game of Traffic Flow: Existence and Uniqueness

Authors: Amoolya Tirumalai, John S. Baras

Abstract: Highway vehicular traffic is an inherently multi-agent problem. Traffic jams can appear and disappear mysteriously. We develop a method for traffic flow control that is applied at the vehicular level via mean-field games. We begin this work with a microscopic model of vehicles subject to control input, disturbances, noise, and a speed limit. We formulate a discounted-cost infinite-horizon robust mean-field game on the vehicles, and obtain the associated dynamic programming (DP) PDE system. We then perform approximate dynamic programming (ADP) using these equations to obtain a sub-optimal control for the traffic density adaptively. The sub-optimal controls are subject to an ODE-PDE system. We show that the ADP ODE-PDE system has a unique weak solution in a suitable Hilbert space using semigroup and successive approximation methods. We additionally give a numerical simulation, and interpret the results.

Submitted 4 June, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

Comments: 42 pages, 5 figures

arXiv:2301.07774 [pdf, other]

Capacitated Beam Placement for Multi-beam Non-Geostationary Satellite Systems

Authors: Nariman Torkzaban, Asim Zoulkarni, Anousheh Gholami, John S. Baras

Abstract: Non-geostationary (NGSO) satellite communications systems have attracted a lot of attention both from industry and academia, over the past several years. Beam placement is among the major resource allocation problems in multi-beam NGSO systems. In this paper, we formulate the beam placement problem as a Euclidean disk cover optimization model. We aim at minimizing the number of placed beams while satisfying the total downlink traffic demand of targeted ground terminals without exceeding the capacity of the placed beams. We present a low-complexity deterministic annealing (DA)-based algorithm to solve the NP-hard optimization model for near-optimal solutions. We further propose an extended variant of the previous model to ensure the traffic assigned to the beams is balanced. We verify the effectiveness of our proposed methods by means of numerical experiments and show that our scheme is superior to the state-of-the-art methods in that it covers the ground users by fewer number of beams on average.

Submitted 18 January, 2023; originally announced January 2023.

arXiv:2203.02321 [pdf, ps, other]

Actuator Scheduling for Linear Systems: A Convex Relaxation Approach

Authors: Junjie Jiao, Dipankar Maity, John S. Baras, Sandra Hirche

Abstract: In this letter, we investigate the problem of actuator scheduling for networked control systems. Given a stochastic linear system with a number of actuators, we consider the case that one actuator is activated at each time. This problem is combinatorial in nature and NP hard to solve. We propose a convex relaxation to the actuator scheduling problem, and use its solution as a reference to design an algorithm for solving the original scheduling problem. Using dynamic programming arguments, we provide a suboptimality bound of our proposed algorithm. Furthermore, we show that our framework can be extended to incorporate multiple actuators scheduling at each time and actuation costs. A simulation example is provided, which shows that our proposed method outperforms a random selection approach and a greedy selection approach.

Submitted 20 May, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

Comments: 8 pages, 4 figures

arXiv:2202.11572 [pdf, other]

GAMEOPT: Optimal Real-time Multi-Agent Planning and Control for Dynamic Intersections

Authors: Nilesh Suriyarachchi, Rohan Chandra, John S. Baras, Dinesh Manocha

Abstract: We propose GameOpt: a novel hybrid approach to cooperative intersection control for dynamic, multi-lane, unsignalized intersections. Safely navigating these complex and accident prone intersections requires simultaneous trajectory planning and negotiation among drivers. GameOpt is a hybrid formulation that first uses an auction mechanism to generate a priority entrance sequence for every agent, followed by an optimization-based trajectory planner that computes velocity controls that satisfy the priority sequence. This coupling operates at real-time speeds of less than 10 milliseconds in high density traffic of more than 10,000 vehicles/hr, 100 times faster than other fully optimization-based methods, while providing guarantees in terms of fairness, safety, and efficiency. Tested on the SUMO simulator, our algorithm improves throughput by at least 25%, time taken to reach the goal by 75%, and fuel consumption by 33% compared to auction-based approaches and signaled approaches using traffic-lights and stop signs.

Submitted 18 March, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

Comments: Submitted to ITSC 2022

arXiv:2202.06452 [pdf, other]

Multi-user Beam Alignment in Presence of Multi-path

Authors: Nariman Torkzaban, Mohammad A. Khojastepour, John S. Baras

Abstract: To overcome the high path-loss and the intense shadowing in millimeter-wave (mmWave) communications, effective beamforming schemes are required which incorporate narrow beams with high beamforming gains. The mmWave channel consists of a few spatial clusters each associated with an angle of departure (AoD). The narrow beams must be aligned with the channel AoDs to increase the beamforming gain. This is achieved through a procedure called beam alignment (BA). Most of the BA schemes in the literature consider channels with a single dominant path while in practice the channel has a few resolvable paths with different AoDs, hence, such BA schemes may not work correctly in the presence of multi-path or at the least do not exploit such multipath to achieve diversity or increase robustness. In this paper, we propose an efficient BA scheme in presence of multi-path. The proposed BA scheme transmits probing packets using a set of scanning beams and receives feedback for all the scanning beams at the end of the probing phase from each user. We formulate the BA scheme as minimizing the expected value of the average transmission beamwidth under different policies. The policy is defined as a function from the set of received feedback to the set of transmission beams (TB). In order to maximize the number of possible feedback sequences, we prove that the set of scanning beams (SB) has a special form, namely, Tulip Design. Consequently, we rewrite the minimization problem with a set of linear constraints and a reduced number of variables which is solved by using an efficient greedy algorithm.

Submitted 13 February, 2022; originally announced February 2022.

Comments: Accepted IEEE CISS 2022

arXiv:2202.03610 [pdf, other]

Codebook Design for Composite Beamforming in Next-generation mmWave Systems

Authors: Nariman Torkzaban, Mohammad A. Khojastepour, John S. Baras

Abstract: In pursuance of the unused spectrum in higher frequencies, millimeter wave (mmWave) bands have a pivotal role. However, the high path-loss and poor scattering associated with mmWave communications highlight the necessity of employing effective beamforming techniques. In order to efficiently search for the beam to serve a user and to jointly serve multiple users it is often required to use a composite beam which consists of multiple disjoint lobes. A composite beam covers multiple desired angular coverage intervals (ACIs) and ideally has maximum and uniform gain (smoothness) within each desired ACI, negligible gain (leakage) outside the desired ACIs, and sharp edges. We propose an algorithm for designing such ideal composite codebook by providing an analytical closed-form solution with low computational complexity. There is a fundamental trade-off between the gain, leakage and smoothness of the beams. Our design allows to achieve different values in such trade-off based on changing the design parameters. We highlight the shortcomings of the uniform linear arrays (ULAs) in building arbitrary composite beams. Consequently, we use a recently introduced twin-ULA (TULA) antenna structure to effectively resolve these inefficiencies. Numerical results are used to validate the theoretical findings.

Submitted 7 February, 2022; originally announced February 2022.

Comments: Accepted at IEEE WCNC 2022

arXiv:2111.06426 [pdf, other]

doi 10.23919/ACC53348.2022.9867331

A Robust Mean-field Game of Boltzmann-Vlasov-like Traffic Flow

Authors: Amoolya Tirumalai, John S. Baras

Abstract: Historically, traffic modelling approaches have taken either a particle-like (microscopic) approach, or a gas-like (meso- or macroscopic) approach. Until recently with the introduction of mean-field games to the controls community, there has not been a rigorous framework to facilitate passage between controls for the microscopic models and the macroscopic models. We begin this work with a particle-based model of autonomous vehicles subject to drag and unknown disturbances, noise, and a speed limit in addition to the control. We formulate a robust stochastic differential game on the particles. We pass formally to the infinite-particle limit to obtain a robust mean-field game PDE system. We solve the mean-field game PDE system numerically and discuss the results. In particular, we obtain an optimal control which increases the bulk velocity of the traffic flow while reducing congestion.

Submitted 14 November, 2021; v1 submitted 11 November, 2021; originally announced November 2021.

Comments: Omission of a line corrected in this version. 6 pages; 3 figures; 1 algorithm; Submitted to ACC 2022

arXiv:2111.05361 [pdf, ps, other]

Weak Solutions to an Euler Alignment System with Singular Interactions in a Bounded Domain

Authors: Amoolya Tirumalai, Christos Mavridis, John S. Baras

Abstract: Euler alignment systems appear as hydrodynamic limits of interacting self-propelled particle systems such as the (generalized) Cucker-Smale model. In this work, we study weak solutions to an Euler alignment system on smooth, bounded, connected domains. This particular Euler alignment system includes singular alignment, attraction, and repulsion interaction kernels which correspond to a Yukawa potential. We also include a confinement potential and self-propulsion. We embed the problem into an abstract Euler system to conclude that infinitely many weak solutions exist. We further show that we can construct solutions satisfying bounds on an energy quantity, and that the solutions satisfy a weak-strong uniqueness principle. Finally, we present an addition of leader-agents governed by controlled ODEs, and modification of the interactions to be Bessel potentials of fractional order $s > 2$.

Submitted 22 May, 2023; v1 submitted 9 November, 2021; originally announced November 2021.

Comments: 32 pages

arXiv:2110.08924 [pdf, other]

Sensor Scheduling for Linear Systems: A Covariance Tracking Approach

Authors: Dipankar Maity, David Hartman, John S. Baras

Abstract: We consider the classical sensor scheduling problem for linear systems where only one sensor is activated at each time. We show that the sensor scheduling problem has a close relation to the sensor design problem and the solution of a sensor schedule problem can be extracted from an equivalent sensor design problem. We propose a convex relaxation to the sensor design problem and a reference covariance trajectory is obtained from solving the relaxed sensor design problem. Afterwards, a covariance tracking algorithm is designed to obtain an approximate solution to the sensor scheduling problem using the reference covariance trajectory obtained from the sensor design problem. While the sensor scheduling problem is NP-hard, the proposed framework circumvents this computational complexity by decomposing this problem into a convex sensor design problem and a covariance tracking problem. We provide theoretical justification and a sub-optimality bound for the proposed method using dynamic programming. The proposed method is validated over several experiments portraying the efficacy of the framework.

Submitted 17 October, 2021; originally announced October 2021.

Comments: To appear in Automatica

arXiv:2108.09176 [pdf, other]

Controller Placement in SDN-enabled 5G Satellite-Terrestrial Networks

Authors: Nariman Torkzaban, John S. Baras

Abstract: SDN-enabled Integrated satellite-terrestrial networks (ISTNs), can provide several advantages including global seamless coverage, high reliability, low latency, etc. and can be a key enabler towards next generation networks. To deal with the complexity of the control and management of the integrated network, leveraging the concept of software-defined networking (SDN) will be helpful. In this regard, the SDN controller placement problem in SDN-enabled ISTNs becomes of paramount importance. In this paper, we formulate an optimization problem for the SDN controller placement with the objective of minimizing the average failure probability of SDN control paths to ensure the SDN switches receive the instructions in the most reliable fashion. Simultaneously, we aim at deploying the SDN controllers close to the satellite gateways to ensure the connection between the two layers occurs with the lowest latency. We first model the problem as a mixed integer linear program (MILP). To reduce the time complexity of the MILP model, we use submodular optimization techniques to generate near-optimal solutions in a time-efficient manner. Finally, we verify the effectiveness of our approach by means of simulation, showing that the approximation method results in a reasonable optimality gap with respect to the exact MILP solution.

Submitted 20 August, 2021; originally announced August 2021.

Comments: Accepted at IEEE Globecom 2021. arXiv admin note: substantial text overlap with arXiv:2103.08735

arXiv:2104.07853 [pdf, other]

On the Importance of Trust in Next-Generation Networked CPS Systems: An AI Perspective

Authors: Anousheh Gholami, Nariman Torkzaban, John S. Baras

Abstract: With the increasing scale, complexity, and heterogeneity of the next generation networked systems, seamless control, management, and security of such systems becomes increasingly challenging. Many diverse applications have driven interest in networked systems, including large-scale distributed learning, multi-agent optimization, 5G service provisioning, and network slicing, etc. In this paper, we propose trust as a measure to evaluate the status of network agents and improve the decision-making process. We interpret trust as a relation among entities that participate in various protocols. Trust relations are based on evidence created by the interactions of entities within a protocol and may be a composite of multiple metrics such as availability, reliability, resilience, etc. depending on application context. We first elaborate on the importance of trust as a metric and then present a mathematical framework for trust computation and aggregation within a network. Then we show in practice, how trust can be integrated into network decision-making processes by presenting two examples. In the first example, we show how utilizing the trust evidence can improve the performance and the security of Federated Learning. Second, we show how a 5G network resource provisioning framework can be improved when augmented with a trust-aware decision-making scheme. We verify the validity of our trust-based approach through simulations. Finally, we explain the challenges associated with aggregating the trust evidence and briefly explain our ideas to tackle them.

Submitted 15 April, 2021; originally announced April 2021.

arXiv:2104.03355 [pdf, other]

Value of information in networked control systems subject to delay

Authors: Siyi Wang, Qingchen Liu, Precious Ugo Abara, John S. Baras, Sandra Hirche

Abstract: In this paper, we study the trade-off between the transmission cost and the control performance of the multi-loop networked control system subject to network-induced delay. Within the linear-quadratic-Gaussian (LQG) framework, the joint design of control policy and networking strategy is decomposed into separation optimization problems. Based on the trade-off analysis, a scalable, delay-dependent Value-of-Information (VoI) based scheduling policy is constructed to quantify the value of transmitting the data packet, and enables the decision-makers embedded in subsystems to determine the transmission policy. The proposed scalable VoI inherits the task criticality of the previous VoI metric meanwhile is sensitive to the system parameters such as information freshness and network delays. The VoI-based scheduling policy is proved to outperform the periodical triggering policy and existing Age-of-Information (AoI) based policy for network control system under transmission delay. The effectiveness of the constructed VoI with arbitrary network delay is validated through numerical simulations.

Submitted 29 December, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

Comments: accepted CDC2021

arXiv:2103.14012 [pdf, ps, other]

doi 10.1109/TAC.2022.3194125

Value of Information in Feedback Control: Global Optimality

Authors: Touraj Soleymani, John S. Baras, Sandra Hirche, Karl H. Johansson

Abstract: The rate-regulation tradeoff, defined between two objective functions, one penalizing the packet rate and one the regulation cost, can express the fundamental performance bound of networked control systems. However, the characterization of the set of globally optimal solutions in this tradeoff for multi-dimensional Gauss-Markov processes has been an open problem. In the present article, we characterize a policy profile that belongs to this set without imposing any restrictions on the information structure or the policy structure. We prove that such a policy profile consists of a symmetric threshold triggering policy based on the value of information and a certainty-equivalent control policy based on a non-Gaussian linear estimator. These policies are deterministic and can be designed separately. Besides, we provide a global optimality analysis for the value of information $\text{VoI}_k$, a semantic metric that emerges from the rate-regulation tradeoff as the difference between the benefit and the cost of a data packet. We prove that it is globally optimal that a data packet containing sensory information at time $k$ be transmitted to the controller only if $\text{VoI}_k$ becomes nonnegative. These results have important implications in the areas of communication and control.

Submitted 4 May, 2022; v1 submitted 25 March, 2021; originally announced March 2021.

Comments: arXiv admin note: text overlap with arXiv:1812.07534

arXiv:2103.08735 [pdf, ps, other]

Joint Satellite Gateway Deployment & Controller Placement in Software-Defined 5G-Satellite Integrated Networks

Authors: Nariman Torkzaban, John S. Baras

Abstract: Several challenging optimization problems arise while considering the deployment of the space-air-ground integrated networks (SAGINs), among which the optimal satellite gateway deployment problem is of significant importance. Moreover, with the increasing interest in the software-defined integration of 5G networks and satellites, the existence of an effective scheme for optimal placement of SDN controllers is essential. In this paper, we discuss the interrelation between the two problems above and propose suitable methods to solve them under various network design criteria. We first provide a MILP model for solving the joint problem, and then motivate the decomposition of the model into two disjoint MILPs. We then show that the resulting problems can be modeled as the optimization of submodular set functions and can be solved efficiently with provable optimality gaps.

Submitted 19 March, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

arXiv:2102.08772 [pdf, ps, other]

Semi-linear Poisson-mediated Flocking in a Cucker-Smale Model

Authors: Christos N. Mavridis, Amoolya Tirumalai, John S. Baras, Ion Matei

Abstract: We propose a family of compactly supported parametric interaction functions in the general Cucker-Smale flocking dynamics such that the mean-field macroscopic system of mass and momentum balance equations with non-local damping terms can be converted from a system of partial integro-differential equations to an augmented system of partial differential equations in a compact set. We treat the interaction functions as Green's functions for an operator corresponding to a semi-linear Poisson equation and compute the density and momentum in a translating reference frame, i.e. one that is taken in reference to the flock's centroid. This allows us to consider the dynamics in a fixed, flock-centered compact set without loss of generality. We approach the computation of the non-local damping using the standard finite difference treatment of the chosen differential operator, resulting in a tridiagonal system which can be solved quickly.

Submitted 11 February, 2021; originally announced February 2021.

arXiv:2011.12772 [pdf, ps, other]

Event-triggered Feedback Control for Signal Temporal Logic Tasks

Authors: Lars Lindemann, Dipankar Maity, John S. Baras, Dimos V. Dimarogonas

Abstract: A framework for the event-triggered control synthesis under signal temporal logic (STL) tasks is proposed. In our previous work, a continuous-time feedback control law was designed, using the prescribed performance control technique, to satisfy STL tasks. We replace this continuous-time feedback control law by an event-triggered controller. The event-triggering mechanism is based on a maximum triggering interval and on a norm bound on the difference between the value of the current state and the value of the state at the last triggering instance. Simulations of a multi-agent system quantitatively show the efficacy of using an event-triggered controller to reduce communication and computation efforts.

Submitted 25 November, 2020; originally announced November 2020.

Comments: Conference on Decision and Control (2018), 6 pages

arXiv:2009.14446 [pdf, other]

Joint Mobility-Aware UAV Placement and Routing in Multi-Hop UAV Relaying Systems

Authors: Anousheh Gholami, Nariman Torkzaban, John S. Baras, Chrysa Papagianni

Abstract: Unmanned Aerial Vehicles (UAVs) have been extensively utilized to provide wireless connectivity in rural and under-developed areas, enhance network capacity and provide support for peaks or unexpected surges in user demand, mainly due to their fast deployment, cost-efficiency and superior communication performance resulting from Line of Sight (LoS)-dominated wireless channels. In order to exploit the benefits of UAVs as base stations or relays in a mobile network, a major challenge is to determine the optimal UAV placement and relocation strategy with respect to the mobility and traffic patterns of the ground network nodes. Moreover, considering that the UAVs form a multi-hop aerial network, capacity and connectivity constraints have significant impacts on the end-to-end network performance. To this end, we formulate the joint UAV placement and routing problem as a Mixed Integer Linear Program (MILP) and propose an approximation that leads to an LP rounding algorithm and achieves a balance between time-complexity and optimality.

Submitted 30 September, 2020; originally announced September 2020.

Comments: 15 Pages, Accepted at ADHOCNETS2020

arXiv:2009.07343 [pdf, other]

Trust-Aware Service Function Chain Embedding: A Path-Based Approach

Authors: Nariman Torkzaban, John S. Baras

Abstract: With the emergence of network function virtualization (NFV) and software-defined networking (SDN), the realization and implementation of service function chains (SFCs) have become much easier. An SFC is an ordered set of interconnected virtual network functions (VNFs). NFV allows for decoupling the network functions from proprietary hardware, realizing a software-based implementation of VNFs on commodity hardware, and SDN decouples the network control from its forwarding logic, allowing for a more flexible and programmable traffic routing among the VNFs. The SFC embedding problem (i.e. placement of SFCs on a shared substrate and establishing the corresponding traffic routes between the VNFs) has been extensively studied in the literature. In this paper, we extend a previous work on trust-aware service chain embedding by generalizing the role of trust, incorporating the trustworthiness of the service network links and substrate network paths into the SFC embedding decision process. We first introduce and formulate the path-based trust-aware service chain embedding problem as a mixed integer-linear program (MILP), and then provide an approximate model, based on selecting k-shortest candidate substrate paths for hosting each virtual link, to reduce the complexity of the model. We validate the performance of our methods through simulations and conduct a discussion on evaluating the methods and some operation trade-offs.

Submitted 5 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

Comments: 6 pages, Accepted at IEEE NFV-SDN 2020

arXiv:2007.07634 [pdf, ps, other]

Delay-sensitive Joint Optimal Control and Resource Management in Multi-loop Networked Control Systems

Authors: Mohammad H. Mamduhi, Dipankar Maity, Sandra Hirche, John S. Baras, Karl H. Johansson

Abstract: In the operation of networked control systems, where multiple processes share a resource-limited and time-varying cost-sensitive network, communication delay is inevitable and primarily influenced by, first, the control systems deploying intermittent sensor sampling to reduce the communication cost by restricting non-urgent transmissions, and second, the network performing resource management to minimize excessive traffic and eventually data loss. In a heterogeneous scenario, where control systems may tolerate only specific levels of sensor-to-controller latency, delay sensitivities need to be considered in the design of control and network policies to achieve the desired performance guarantees. We propose a cross-layer optimal co-design of control, sampling and resource management policies for an NCS consisting of multiple stochastic linear time-invariant systems which close their sensor-to-controller loops over a shared network. Aligned with advanced communication technology, we assume that the network offers a range of latency-varying transmission services for given prices. Local samplers decide either to pay higher cost to access a low-latency channel, or to delay sending a state sample at a reduced price. A resource manager residing in the network data-link layer arbitrates channel access and re-allocates resources if link capacities are exceeded.
The performance of the local closed-loop systems is measured by a combination of linear-quadratic Gaussian cost and a suitable communication cost, and the overall objective is to minimize a defined social cost by all three policy makers. We derive optimal control, sampling and resource allocation policies under different cross-layer awareness models, including constant and time-varying parameters, and show that higher awareness generally leads to performance enhancement at the expense of higher computational complexity.

Submitted 15 July, 2020; originally announced July 2020.

arXiv:2003.11693 [pdf, ps, other]

Order Effects of Measurements in Multi-Agent Hypothesis Testing

Authors: Aneesh Raghavan, John S. Baras

Abstract: In multi-agent systems, agents observe data, and use them to make inferences and take actions. As a result sensing and control naturally interfere, more so from a real-time perspective. A natural consequence is that in multi-agent systems there are propositions based on the set of observed events that might not be simultaneously verifiable, which leads to the need for probability structures that allow such \textit{incompatible events}. We revisit the structure of events in a multi-agent system and we introduce the necessary new models that incorporate such incompatible events in the formalism. These models are essential for building non-commutative probability models, which are different than the classical models based on the Kolmogorov construction. From this perspective, we revisit the concepts of \textit{event-state-operation structure} and the needed \textit{relationship of incompatibility} from the literature and use them as a tool to study the needed new algebraic structure of the set of events. We present an example from multi-agent hypothesis testing where the set of events does not form a Boolean algebra, but forms an ortholattice. A possible construction of a `noncommutative probability space', accounting for \textit{incompatible events} is discussed. We formulate and solve the binary hypothesis testing problem in the noncommutative probability space.
We illustrate the occurrence of `order effects' in the multi-agent hypothesis testing problem by computing the minimum probability of error that can be achieved with different orders of measurements.

Submitted 11 November, 2020; v1 submitted 25 March, 2020; originally announced March 2020.

Comments: Journal Paper Accepted

arXiv:2003.11612 [pdf, other]

Cooperative Hypothesis Testing by Two Observers with Asymmetric Information

Authors: Aneesh Raghavan, John S. Baras

Abstract: We consider the binary hypothesis testing problem with two observers. There are two possible states of nature (or hypotheses). Observations collected by the two observers are statistically related to the true state of nature. The knowledge of joint distribution of the observations collected and the true state of nature is unknown to the observers. There are two problems to be solved by the observers: (i) true state of nature is known: find the distribution of the local information collected; (ii) true state of nature is unknown: collaboratively estimate the same using the distributions found by solving the first problem. We present four algorithms, each having two phases where the two problems are solved, with emphasis on the information exchange between the observers and resulting patterns. We prove different properties of the algorithms including the following: (i) the probability spaces constructed as a consequence of solving the first problem are dependent on the information patterns at the observers; (ii) the rate of decay of probability of error of algorithms while solving the second problem is dependent on the information exchange between the observers. We present a numerical example demonstrating the four algorithms.

Submitted 17 September, 2024; v1 submitted 25 March, 2020; originally announced March 2020.

Comments: Journal Paper to be published

MSC Class: 93A14 93E10 62F03 60A05

arXiv:2003.10025 [pdf, other]

Interpretable machine learning models: a physics-based view

Authors: Ion Matei, Johan de Kleer, Christoforos Somarakis, Rahul Rai, John S. Baras

Abstract: To understand changes in physical systems and facilitate decisions, explaining how model predictions are made is crucial. We use model-based interpretability, where models of physical systems are constructed by composing basic constructs that explain locally how energy is exchanged and transformed. We use the port Hamiltonian (p-H) formalism to describe the basic constructs that contain physically interpretable processes commonly found in the behavior of physical systems. We describe how we can build models out of the p-H constructs and how we can train them. In addition we show how we can impose physical properties such as dissipativity that ensure numerical stability of the training process. We give examples on how to build and train models for describing the behavior of two physical systems: the inverted pendulum and swarm dynamics.

Submitted 22 March, 2020; originally announced March 2020.

arXiv:2002.03071 [pdf, other]

Joint Satellite Gateway Placement and Routing for Integrated Satellite-Terrestrial Networks

Authors: Nariman Torkzaban, Anousheh Gholami, Chrysa Papagianni, John S. Baras

Abstract: With the increasing attention to the integrated satellite-terrestrial networks (ISTNs), the satellite gateway placement problem becomes of paramount importance. The resulting network performance may vary depending on the different design strategies. In this paper, a joint satellite gateway placement and routing strategy for the terrestrial network is proposed to minimize the overall cost of gateway deployment and traffic routing, while adhering to the average delay requirement for traffic demands. Although traffic routing and gateway placement can be solved independently, the dependence between the routing decisions for different demands makes it more realistic to solve an aggregated model instead. We develop a mixed-integer linear program (MILP) formulation for the problem. We relax the integrality constraints to achieve a linear program (LP) which reduces time-complexity at the expense of a sub-optimal solution. We further propose a variant of the proposed model to balance the load between the selected gateways.

Submitted 5 October, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

Comments: 6 pages, In Proceedings of IEEE ICC 2020. https://ieeexplore.ieee.org/document/9149175 N. Torkzaban, A. Gholami, J. S. Baras and C. Papagianni, "Joint Satellite Gateway Placement and Routing for Integrated Satellite-Terrestrial Networks," ICC 2020 - 2020 IEEE International Conference on Communications (ICC), Dublin, Ireland, 2020, pp. 1-6. doi: 10.1109/ICC40277.2020.9149175

arXiv:1912.07848 [pdf, other]

doi 10.1016/j.ifacol.2020.12.2361

Fast, Composable Rescue Mission Planning for UAVs using Metric Temporal Logic

Authors: Usman A. Fiaz, John S. Baras

Abstract: We present a hybrid compositional approach for real-time mission planning for multi-rotor unmanned aerial vehicles (UAVs) in a time critical search and rescue scenario. Starting with a known environment, we specify the mission using Metric Temporal Logic (MTL) and use a hybrid dynamical model to capture the various modes of UAV operation. We then divide the mission into several sub-tasks by exploiting the invariant nature of safety and timing constraints along the way, and the different modes (i.e., dynamics) of the UAV. For each sub-task, we translate the MTL specifications into linear constraints and solve the associated optimal control problem for desired path, using a Mixed Integer Linear Program (MILP) solver. The complete path for the mission is constructed recursively by composing the individual optimal sub-paths. We show by simulations that the resulting suboptimal trajectories satisfy the mission specifications, and the proposed approach leads to significant reduction in computational complexity of the problem, making it possible to implement in real-time. Our proposed method ensures the safety of UAVs at all times and guarantees finite time mission completion. It is also shown that our approach scales up nicely for a large number of UAVs.

Submitted 28 September, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

Comments: Published as a conference paper at the IFAC World Congress 2020. arXiv admin note: substantial text overlap with arXiv:1904.03830

Journal ref: IFAC-PapersOnLine, Volume 53, Issue 2, 2020, Pages 15404-15411

arXiv:1911.09467 [pdf, ps, other]

Event-triggered Add-on Safety for Connected and Automated Vehicles Using Road-side Network Infrastructure

Authors: Mohammad H. Mamduhi, Ehsan Hashemi, John S. Baras, Karl H. Johansson

Abstract: This paper proposes an event-triggered add-on safety mechanism to adjust the control parameters for timely braking in a networked vehicular system while maintaining maneuverability. Passenger vehicle maneuverability is significantly affected by the combined-slip friction effect, in which larger longitudinal tire slips result in considerable drop in lateral tire forces. This is of higher importance when unexpected dangerous situations occur on the road and immediate actions, such as braking, need to be taken to avoid collision. Harsh braking can lead to high-slip and loss of maneuverability, hence, timely braking is essential to reduce high-slip scenarios. In addition to the vehicle's own active safety systems, the proposed event-triggered add-on safety is activated upon being informed about dangers by the road-side infrastructure. The aim is to incorporate the add-on safety feature to adjust the automatic control parameters for smooth and timely braking such that a collision is avoided while vehicle's maneuverability is maintained. We study two different wireless technologies for communication between the infrastructure and the vehicles, the Long-Term Evolution (LTE) and the fifth generation (5G) schemes. The framework is validated through high-fidelity software simulations and the advantages of including the add-on feature to augment the safety margins for each communication technology are evaluated.

Submitted 21 November, 2019; originally announced November 2019.

Comments: 8 pages, 6 figures, preprint submitted for IFAC 2020

arXiv:1909.02150 [pdf, other]

Drone-Assisted Communications for Remote Areas and Disaster Relief

Authors: Anousheh Gholami, Usman A. Fiaz, John S. Baras

Abstract: We explore an end-to-end (including access and backhaul links) UAV-assisted wireless communication system, considering both uplink and downlink traffics, with the goal of supporting demand of the Ground Users (GUs) using the minimum number of UAVs. Moreover, in order to extend the operational (flight) time of UAVs, we exploit an energy-aware routing scheme. Our intention is to design and analyze the access and backhaul connectivity of a drone-assisted communication network for remote and crowded areas and disaster relief, while minimizing the resources required i.e., the number of UAVs.

Submitted 4 September, 2019; originally announced September 2019.

Comments: Accepted at DGRS 2019

arXiv:1904.03830 [pdf, other]

A Hybrid Compositional Approach to Optimal Mission Planning for Multi-rotor UAVs using Metric Temporal Logic

Authors: Usman A. Fiaz, John S. Baras

Abstract: This paper investigates a hybrid compositional approach to optimal mission planning for multi-rotor Unmanned Aerial Vehicles (UAVs). We consider a time critical search and rescue scenario with two quadrotors in a constrained environment. Metric Temporal Logic (MTL) is used to formally describe the task specifications. In order to capture the various modes of UAV operation, we utilize a hybrid model for the system with linearized dynamics around different operating points. We divide the mission into several sub-tasks by exploiting the invariant nature of various task specifications, i.e., the mutual independence of safety and timing constraints along the way, and the different modes (i.e., dynamics) of the robot. For each sub-task, we translate the MTL formulae into linear constraints, and solve the associated optimal control problem for desired path using a Mixed Integer Linear Program (MILP) solver. The complete path is constructed by the composition of individual optimal sub-paths. We show that the resulting trajectory satisfies the task specifications, and the proposed approach leads to significant reduction in computational complexity of the problem, making it possible to implement in real-time.

Submitted 19 September, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

Comments: 8 pages, 5 figures, 1 table. Fixed typos, added new references

arXiv:1812.07534 [pdf, ps, other]

doi 10.1109/TAC.2021.3113472

Value of Information in Feedback Control: Quantification

Authors: Touraj Soleymani, John S. Baras, Sandra Hirche

Abstract: Although transmission of a data packet containing sensory information in a networked control system improves the quality of regulation, it has indeed a price from the communication perspective. It is, therefore, rational that such a data packet be transmitted only if it is valuable in the sense of a cost-benefit analysis. Yet, the fact is that little is known so far about this valuation of information and its connection with traditional event-triggered communication. In the present article, we study this intrinsic property of networked control systems by formulating a rate-regulation tradeoff between the packet rate and the regulation cost with an event trigger and a controller as two distributed decision makers, and show that the valuation of information is conceivable and quantifiable grounded on this tradeoff. In particular, we characterize an equilibrium in the rate-regulation tradeoff, and quantify the value of information $\text{VoI}_k$ there as the variation in a so-called value function with respect to a piece of sensory information that can be communicated to the controller at each time $k$. We prove that, for a multi-dimensional Gauss-Markov process, $\text{VoI}_k$ is a symmetric function of the discrepancy between the state estimates at the event trigger and the controller, and that a data packet containing sensory information at time $k$ should be transmitted to the controller only if $\text{VoI}_k$ is nonnegative.
Moreover, we discuss that $\text{VoI}_k$ can be computed with arbitrary accuracy, and that it can be approximated by a closed-form quadratic function with a performance guarantee.

Submitted 2 May, 2022; v1 submitted 18 December, 2018; originally announced December 2018.

arXiv:1810.10983 [pdf, ps, other]

Stochastic Control with Stale Information--Part I: Fully Observable Systems

Authors: Touraj Soleymani, John S. Baras, Karl H. Johansson

Abstract: In this study, we adopt age of information as a measure of the staleness of information, and take initial steps towards analyzing the control performance of stochastic systems with stale information. Our goals are to cast light on a fundamental limit on the information staleness that is required for a certain level of the control performance and to specify the corresponding stalest information pattern. In the asymptotic regime, such a limit asserts a critical information staleness that is required for stabilization. We achieve these goals by formulating the problem as a stochastic optimization problem and characterizing the associated optimal solutions. These solutions are in fact a control policy, which specifies the control inputs of the plant, and a queuing policy, which specifies the staleness of information at the controller.

Submitted 25 October, 2018; originally announced October 2018.

arXiv:1806.11206 [pdf, ps, other]

doi 10.1109/LCSYS.2018.2853648

Optimal LQG Control under Delay-dependent Costly Information

Authors: Dipankar Maity, Mohammad H. Mamduhi, Sandra Hirche, Karl Henrik Johansson, John S. Baras

Abstract: In the design of closed-loop networked control systems (NCSs), induced transmission delay between sensors and the control station is an often-present issue which compromises control performance and may even cause instability. A very relevant scenario in which network-induced delay needs to be investigated is costly usage of communication resources. More precisely, advanced communication technologies, e.g. 5G, are capable of offering latency-varying information exchange for different prices. Therefore, induced delay becomes a decision variable. It is then the matter of decision maker's willingness to either pay the required cost to have low-latency access to the communication resource, or delay the access at a reduced price. In this article, we consider optimal price-based bi-variable decision making problem for single-loop NCS with a stochastic linear time-invariant system. Assuming that communication incurs cost such that transmission with shorter delay is more costly, a decision maker determines the switching strategy between communication links of different delays such that an optimal balance between the control performance and the communication cost is maintained. In this article, we show that, under mild assumptions on the available information for decision makers, the separation property holds between the optimal link selecting and control policies. As the cost function is decomposable, the optimal policies are efficiently computed.

Submitted 28 June, 2018; originally announced June 2018.

Journal ref: IEEE Control Systems Letters ( Volume: 3, Issue: 1, Jan. 2019 )

arXiv:1802.09657 [pdf, other]

Event-Triggered Controller Synthesis for Dynamical Systems with Temporal Logic Constraints

Authors: Dipankar Maity, John S. Baras

Abstract: In this work, we propose an event-triggered control framework for dynamical systems with temporal logical constraints. Event-triggered control methodologies have proven to be very efficient in reducing sensing, communication and computation costs. When a continuous feedback control is replaced with an event-triggered strategy, the corresponding state trajectories also differ. In a system with logical constraints, such small deviation in the trajectory might lead to unsatisfiability of the logical constraints. In this work, we develop an approach where we ensure that the event-triggered state trajectory is confined within a tube of the ideal trajectory associated with the continuous state feedback. At the same time, we will ensure satisfiability of the logical constraints as well. Furthermore, we show that the proposed method works for delayed systems as long as the delay is bounded by a certain quantity.

Submitted 26 February, 2018; originally announced February 2018.

Showing 1–50 of 69 results for author: Baras, J S