-
Contextualized Autonomous Drone Navigation using LLMs Deployed in Edge-Cloud Computing
Authors:
Hongqian Chen,
Yun Tang,
Antonios Tsourdos,
Weisi Guo
Abstract:
Autonomous navigation is usually trained offline in diverse scenarios and fine-tuned online subject to real-world experiences. However, the real world is dynamic and changeable, and many environmental encounters/effects are not accounted for in real-time due to difficulties in describing them within offline training data or hard to describe even in online scenarios. However, we know that the human…
▽ More
Autonomous navigation is usually trained offline in diverse scenarios and fine-tuned online subject to real-world experiences. However, the real world is dynamic and changeable, and many environmental encounters/effects are not accounted for in real-time due to difficulties in describing them within offline training data or hard to describe even in online scenarios. However, we know that the human operator can describe these dynamic environmental encounters through natural language, adding semantic context. The research is to deploy Large Language Models (LLMs) to perform real-time contextual code adjustment to autonomous navigation. The challenge not evaluated in literature is what LLMs are appropriate and where should these computationally heavy algorithms sit in the computation-communication edge-cloud computing architectures. In this paper, we evaluate how different LLMs can adjust both the navigation map parameters dynamically (e.g., contour map shaping) and also derive navigation task instruction sets. We then evaluate which LLMs are most suitable and where they should sit in future edge-cloud of 6G telecommunication architectures.
△ Less
Submitted 1 April, 2025;
originally announced April 2025.
-
Region of Attraction Estimation Using Union Theorem in Sum-of-Squares Optimization
Authors:
Bhaskar Biswas,
Dmitry Ignatyev,
Argyrios Zolotas,
Antonios Tsourdos
Abstract:
Appropriate estimation of Region of Attraction for a nonlinear dynamical system plays a key role in system analysis and control design. Sum-of-Squares optimization is a powerful tool enabling Region of Attraction estimation for polynomial dynamical systems. Employment of a positive definite function called shape function within the Sum-of-Squares procedure helps to find a richer representation of…
▽ More
Appropriate estimation of Region of Attraction for a nonlinear dynamical system plays a key role in system analysis and control design. Sum-of-Squares optimization is a powerful tool enabling Region of Attraction estimation for polynomial dynamical systems. Employment of a positive definite function called shape function within the Sum-of-Squares procedure helps to find a richer representation of the Lyapunov function and a larger corresponding Region of Attraction estimation. However, existing Sum-of-Squares optimization techniques demonstrate very conservative results. The main novelty of this paper is the Union theorem which enables the use of multiple shape functions to create a polynomial Lyapunov function encompassing all the areas generated by the shape functions. The main contribution of this paper is a novel computationally-efficient numerical method for Region of Attraction estimation, which remarkably improves estimation performance and overcomes limitations of existing methods, while maintaining the resultant Lyapunov function polynomial, thus facilitating control system design and construction of control Lyapunov function with enhanced Region of Attraction using conventional Sum-of-Squares tools. A mathematical proof of the Union theorem along with its application to the numerical algorithm of Region of Attraction estimation is provided. The method yields significantly enlarged Region of Attraction estimations even for systems with non-symmetric or unbounded Region of Attraction, which is demonstrated via simulations of several benchmark examples.
△ Less
Submitted 19 May, 2023;
originally announced May 2023.
-
Incremental Correction in Dynamic Systems Modelled with Neural Networks for Constraint Satisfaction
Authors:
Namhoon Cho,
Hyo-Sang Shin,
Antonios Tsourdos,
Davide Amato
Abstract:
This study presents incremental correction methods for refining neural network parameters or control functions entering into a continuous-time dynamic system to achieve improved solution accuracy in satisfying the interim point constraints placed on the performance output variables. The proposed approach is to linearise the dynamics around the baseline values of its arguments, and then to solve fo…
▽ More
This study presents incremental correction methods for refining neural network parameters or control functions entering into a continuous-time dynamic system to achieve improved solution accuracy in satisfying the interim point constraints placed on the performance output variables. The proposed approach is to linearise the dynamics around the baseline values of its arguments, and then to solve for the corrective input required to transfer the perturbed trajectory to precisely known or desired values at specific time points, i.e., the interim points. Depending on the type of decision variables to adjust, parameter correction and control function correction methods are developed. These incremental correction methods can be utilised as a means to compensate for the prediction errors of pre-trained neural networks in real-time applications where high accuracy of the prediction of dynamical systems at prescribed time points is imperative. In this regard, the online update approach can be useful for enhancing overall targeting accuracy of finite-horizon control subject to point constraints using a neural policy. Numerical example demonstrates the effectiveness of the proposed approach in an application to a powered descent problem at Mars.
△ Less
Submitted 8 September, 2022;
originally announced September 2022.
-
Two-timescale Resource Allocation for Automated Networks in IIoT
Authors:
Yanhua He,
Yun Ren,
Zhenyu Zhou,
Shahid Mumtaz,
Saba Al-Rubaye,
Antonios Tsourdos,
Octavia A. Dobre
Abstract:
The rapid technological advances of cellular technologies will revolutionize network automation in industrial internet of things (IIoT). In this paper, we investigate the two-timescale resource allocation problem in IIoT networks with hybrid energy supply, where temporal variations of energy harvesting (EH), electricity price, channel state, and data arrival exhibit different granularity. The form…
▽ More
The rapid technological advances of cellular technologies will revolutionize network automation in industrial internet of things (IIoT). In this paper, we investigate the two-timescale resource allocation problem in IIoT networks with hybrid energy supply, where temporal variations of energy harvesting (EH), electricity price, channel state, and data arrival exhibit different granularity. The formulated problem consists of energy management at a large timescale, as well as rate control, channel selection, and power allocation at a small timescale. To address this challenge, we develop an online solution to guarantee bounded performance deviation with only causal information. Specifically, Lyapunov optimization is leveraged to transform the long-term stochastic optimization problem into a series of short-term deterministic optimization problems. Then, a low-complexity rate control algorithm is developed based on alternating direction method of multipliers (ADMM), which accelerates the convergence speed via the decomposition-coordination approach. Next, the joint channel selection and power allocation problem is transformed into a one-to-many matching problem, and solved by the proposed price-based matching with quota restriction. Finally, the proposed algorithm is verified through simulations under various system configurations.
△ Less
Submitted 24 March, 2022;
originally announced March 2022.
-
Bayesian Learning Approach to Model Predictive Control
Authors:
Namhoon Cho,
Seokwon Lee,
Hyo-Sang Shin,
Antonios Tsourdos
Abstract:
This study presents a Bayesian learning perspective towards model predictive control algorithms. High-level frameworks have been developed separately in the earlier studies on Bayesian learning and sampling-based model predictive control. On one hand, the Bayesian learning rule provides a general framework capable of generating various machine learning algorithms as special instances. On the other…
▽ More
This study presents a Bayesian learning perspective towards model predictive control algorithms. High-level frameworks have been developed separately in the earlier studies on Bayesian learning and sampling-based model predictive control. On one hand, the Bayesian learning rule provides a general framework capable of generating various machine learning algorithms as special instances. On the other hand, the dynamic mirror descent model predictive control framework is capable of diversifying sample-rollout-based control algorithms. However, connections between the two frameworks have still not been fully appreciated in the context of stochastic optimal control. This study combines the Bayesian learning rule point of view into the model predictive control setting by taking inspirations from the view of understanding model predictive controller as an online learner. The selection of posterior class and natural gradient approximation for the variational formulation governs diversification of model predictive control algorithms in the Bayesian learning approach to model predictive control. This alternative viewpoint complements the dynamic mirror descent framework through streamlining the explanation of design choices.
△ Less
Submitted 11 March, 2022; v1 submitted 5 March, 2022;
originally announced March 2022.
-
Variational Probabilistic Multi-Hypothesis Tracking
Authors:
Shuoyuan Xu,
Hyo-Sang Shin,
Antonios Tsourdos
Abstract:
This paper proposes a novel multi-target tracking (MTT) algorithm for scenarios with arbitrary numbers of measurements per target. We propose the variational probabilistic multi-hypothesis tracking (VPMHT) algorithm based on the variational Bayesian expectation-maximisation (VBEM) algorithm to resolve the MTT problem in the classic PMHT algorithm. With the introduction of variational inference, th…
▽ More
This paper proposes a novel multi-target tracking (MTT) algorithm for scenarios with arbitrary numbers of measurements per target. We propose the variational probabilistic multi-hypothesis tracking (VPMHT) algorithm based on the variational Bayesian expectation-maximisation (VBEM) algorithm to resolve the MTT problem in the classic PMHT algorithm. With the introduction of variational inference, the proposed VPMHT handles track-loss much better than the conventional probabilistic multi-hypothesis tracking (PMHT) while preserving a similar or even better tracking accuracy. Extensive numerical simulations are conducted to demonstrate the effectiveness of the proposed algorithm.
△ Less
Submitted 25 October, 2021;
originally announced October 2021.
-
Model Checking for Decision Making System of Long Endurance Unmanned Surface Vehicle
Authors:
Hanlin Niu,
Ze Ji,
Al Savvaris,
Antonios Tsourdos,
Joaquin Carrasco
Abstract:
This work aims to develop a model checking method to verify the decision making system of Unmanned Surface Vehicle (USV) in a long range surveillance mission. The scenario in this work was captured from a long endurance USV surveillance mission using C-Enduro, an USV manufactured by ASV Ltd. The C-Enduro USV may encounter multiple non-deterministic and concurrent problems including lost communicat…
▽ More
This work aims to develop a model checking method to verify the decision making system of Unmanned Surface Vehicle (USV) in a long range surveillance mission. The scenario in this work was captured from a long endurance USV surveillance mission using C-Enduro, an USV manufactured by ASV Ltd. The C-Enduro USV may encounter multiple non-deterministic and concurrent problems including lost communication signals, collision risk and malfunction. The vehicle is designed to utilise multiple energy sources from solar panel, wind turbine and diesel generator. The energy state can be affected by the solar irradiance condition, wind condition, states of the diesel generator, sea current condition and states of the USV. In this research, the states and the interactive relations between environmental uncertainties, sensors, USV energy system, USV and Ground Control Station (GCS) decision making systems are abstracted and modelled successfully using Kripke models. The desirable properties to be verified are expressed using temporal logic statement and finally the safety properties and the long endurance properties are verified using the model checker MCMAS, a model checker for multi-agent systems. The verification results are analyzed and show the feasibility of applying model checking method to retrospect the desirable property of the USV decision making system. This method could assist researcher to identify potential design error of decision making system in advance.
△ Less
Submitted 22 February, 2021; v1 submitted 21 February, 2021;
originally announced February 2021.
-
Improving Learning Effectiveness For Object Detection and Classification in Cluttered Backgrounds
Authors:
Vinorth Varatharasan,
Hyo-Sang Shin,
Antonios Tsourdos,
Nick Colosimo
Abstract:
Usually, Neural Networks models are trained with a large dataset of images in homogeneous backgrounds. The issue is that the performance of the network models trained could be significantly degraded in a complex and heterogeneous environment. To mitigate the issue, this paper develops a framework that permits to autonomously generate a training dataset in heterogeneous cluttered backgrounds. It is…
▽ More
Usually, Neural Networks models are trained with a large dataset of images in homogeneous backgrounds. The issue is that the performance of the network models trained could be significantly degraded in a complex and heterogeneous environment. To mitigate the issue, this paper develops a framework that permits to autonomously generate a training dataset in heterogeneous cluttered backgrounds. It is clear that the learning effectiveness of the proposed framework should be improved in complex and heterogeneous environments, compared with the ones with the typical dataset. In our framework, a state-of-the-art image segmentation technique called DeepLab is used to extract objects of interest from a picture and Chroma-key technique is then used to merge the extracted objects of interest into specific heterogeneous backgrounds. The performance of the proposed framework is investigated through empirical tests and compared with that of the model trained with the COCO dataset. The results show that the proposed framework outperforms the model compared. This implies that the learning effectiveness of the framework developed is superior to the models with the typical dataset.
△ Less
Submitted 27 February, 2020;
originally announced February 2020.
-
A Domain-Knowledge-Aided Deep Reinforcement Learning Approach for Flight Control Design
Authors:
Hyo-Sang Shin,
Shaoming He,
Antonios Tsourdos
Abstract:
This paper aims to examine the potential of using the emerging deep reinforcement learning techniques in flight control. Instead of learning from scratch, we suggest to leverage domain knowledge available in learning to improve learning efficiency and generalisability. More specifically, the proposed approach fixes the autopilot structure as typical three-loop autopilot and deep reinforcement lear…
▽ More
This paper aims to examine the potential of using the emerging deep reinforcement learning techniques in flight control. Instead of learning from scratch, we suggest to leverage domain knowledge available in learning to improve learning efficiency and generalisability. More specifically, the proposed approach fixes the autopilot structure as typical three-loop autopilot and deep reinforcement learning is utilised to learn the autopilot gains. To solve the flight control problem, we then formulate a Markovian decision process with a proper reward function that enable the application of reinforcement learning theory. Another type of domain knowledge is exploited for defining the reward function, by shaping reference inputs in consideration of important control objectives and using the shaped reference inputs in the reward function. The state-of-the-art deep deterministic policy gradient algorithm is utilised to learn an action policy that maps the observed states to the autopilot gains. Extensive empirical numerical simulations are performed to validate the proposed computational control algorithm.
△ Less
Submitted 11 November, 2020; v1 submitted 19 August, 2019;
originally announced August 2019.
-
Sample Greedy Gossip for Distributed Network-Wide Average Computation
Authors:
Hyo-Sang Shin,
Shaoming He,
Antonios Tsourdos
Abstract:
This paper investigates the problem of distributed network-wide averaging and proposes a new greedy gossip algorithm. Instead of finding the optimal path of each node in a greedy manner, the proposed approach utilises a suboptimal communication path by performing greedy selection among randomly selected active local nodes. Theoretical analysis on convergence speed is also performed to investigate…
▽ More
This paper investigates the problem of distributed network-wide averaging and proposes a new greedy gossip algorithm. Instead of finding the optimal path of each node in a greedy manner, the proposed approach utilises a suboptimal communication path by performing greedy selection among randomly selected active local nodes. Theoretical analysis on convergence speed is also performed to investigate the characteristics of the proposed algorithm. The main feature of the new algorithm is that it provides great flexibility and well balance between communication cost and convergence performance introduced by the stochastic sampling strategy. Extensive numerical simulations are performed to validate the analytic findings.
△ Less
Submitted 19 August, 2019; v1 submitted 27 March, 2019;
originally announced March 2019.