-
Data-Driven Existence and Design of Target Output Controllers
Authors:
Yuan Zhang,
Wenxuan Xu,
Mohamed Darouach,
Tyrone Fernando
Abstract:
Target output controllers aim at regulating a system's target outputs by placing poles of a suitable subsystem using partial state feedback, where full state controllability is not required. This paper establishes existence conditions for such controllers using input and partial state data, where the system dynamics are unknown. The approach bypasses traditional system identification steps and leverages the intrinsic structure of historical data to certify controller existence and synthesize a suitable feedback gain. Analytical characterizations are provided, ensuring that the resulting closed-loop system satisfies desired performance objectives such as pole placement or stabilization. Data-driven algorithms are then proposed to design target output controllers directly from data without identifying system parameters, where controllers with the order matching the number of target outputs and with minimum-order augmented target outputs are both addressed. Furthermore, a separation principle is revealed, decoupling the design of target output controllers from state observers. This enables the development of data-driven observer-based controllers that integrate estimation and control. Numerical examples validate the theoretical results and demonstrate the efficacy of the proposed approach.
Submitted 27 May, 2025;
originally announced May 2025.
-
Functional Controllability, Functional Stabilisability, and the Generalised Separation Principle
Authors:
Tyrone Fernando,
Mohamed Darouach
Abstract:
This paper introduces the new concepts of Functional Controllability and Functional Stabilisability, and establishes their duality with Functional Observability and Functional Detectability, respectively. We further present a Generalised Separation Principle, demonstrating that the classical Separation Principle emerges as a special case. Conditions for the existence of functional controllers of a specified order are derived. Importantly, the design framework does not require full controllability. Furthermore, we develop a functional observer-based controller design applicable to systems that are both uncontrollable and unobservable. The results presented generalise the classical full-state feedback control paradigm.
Submitted 20 May, 2025;
originally announced May 2025.
-
Existence and Design of Target Output Controllers
Authors:
Tyrone Fernando,
Mohamed Darouach
Abstract:
This paper introduces new conditions for target output controllability and provides existence conditions for placing a specific number of poles with a target output controller. Additionally, an algorithm is presented for the design of a target output controller. Controllability of the system under consideration is not required for designing target output controllers in this context. The findings in this paper extend the principles of full state feedback control. Moreover, we present conditions for static output feedback control under specific constraints. Several numerical examples are provided to illustrate the results.
Submitted 10 March, 2025;
originally announced March 2025.
-
Spectral-Enhanced Transformers: Leveraging Large-Scale Pretrained Models for Hyperspectral Object Tracking
Authors:
Shaheer Mohamed,
Tharindu Fernando,
Sridha Sridharan,
Peyman Moghadam,
Clinton Fookes
Abstract:
Hyperspectral object tracking using snapshot mosaic cameras is emerging as it provides enhanced spectral information alongside spatial data, contributing to a more comprehensive understanding of material properties. Using transformers, which have consistently outperformed convolutional neural networks (CNNs) in learning better feature representations, would be expected to be effective for Hyperspectral object tracking. However, training large transformers necessitates extensive datasets and prolonged training periods. This is particularly critical for complex tasks like object tracking, and the scarcity of large datasets in the hyperspectral domain acts as a bottleneck in achieving the full potential of powerful transformer models. This paper proposes an effective methodology that adapts large pretrained transformer-based foundation models for hyperspectral object tracking. We propose an adaptive, learnable spatial-spectral token fusion module that can be extended to any transformer-based backbone for learning inherent spatial-spectral features in hyperspectral data. Furthermore, our model incorporates a cross-modality training pipeline that facilitates effective learning across hyperspectral datasets collected with different sensor modalities. This enables the extraction of complementary knowledge from additional modalities, whether or not they are present during testing. Our proposed model also achieves superior performance with minimal training iterations.
Submitted 25 February, 2025;
originally announced February 2025.
-
Generic Diagonalizability, Structural Functional Observability and Output Controllability
Authors:
Yuan Zhang,
Tyrone Fernando,
Mohamed Darouach
Abstract:
This paper investigates the structural functional observability (SFO) and structural output controllability (SOC) of a class of systems with generically diagonalizable state matrices and explores the associated minimal sensor and actuator placement problems. The verification of SOC and the corresponding sensor and actuator placement problems, i.e., the problems of determining the minimum number of outputs and inputs required to achieve SFO and SOC, respectively, remain open for general systems, which motivates our focus on a class of systems enabling polynomial-time solutions. Along this line, we first define and characterize generically diagonalizable systems, referring to structured systems for which almost all realizations of the state matrices are diagonalizable. We then develop computationally efficient criteria for SFO and SOC within the context of generically diagonalizable systems. Our work expands the class of systems amenable to polynomial-time SOC verification. Thanks to the simplicity of the obtained criteria, we derive closed-form solutions for determining the minimal sensor placement to achieve SFO and the minimal actuator deployment to achieve SOC in such systems, along with efficient weighted maximum matching based and weighted maximum flow based algorithms. For more general systems to achieve SFO, an upper bound is given by identifying a non-decreasing property of SFO with respect to a specific class of edge additions, which is shown to be optimal under certain circumstances.
Submitted 25 September, 2024;
originally announced September 2024.
-
AI, Entrepreneurs, and Privacy: Deep Learning Outperforms Humans in Detecting Entrepreneurs from Image Data
Authors:
Martin Obschonka,
Christian Fisch,
Tharindu Fernando,
Clinton Fookes
Abstract:
Occupational outcomes like entrepreneurship are generally considered personal information that individuals should have the autonomy to disclose. With the advancing capability of artificial intelligence (AI) to infer private details from widely available human-centric data (e.g., social media), it is crucial to investigate whether AI can accurately extract private occupational information from such data. In this study, we demonstrate that deep neural networks can classify individuals as entrepreneurs with high accuracy based on facial images sourced from Crunchbase, a premier source for entrepreneurship data. Utilizing a dataset comprising facial images of 40,728 individuals, including both entrepreneurs and non-entrepreneurs, we train a Convolutional Neural Network (CNN) using a contrastive learning approach based on pairs of facial images (one entrepreneur and one non-entrepreneur per pair). While human experts (n=650) and trained participants (n=133) were unable to classify entrepreneurs with accuracy above chance levels (>50%), our AI model achieved a classification accuracy of 79.51%. Several robustness tests indicate that this high level of accuracy is maintained under various conditions. These results indicate privacy risks for entrepreneurs.
Submitted 8 March, 2025; v1 submitted 19 August, 2024;
originally announced September 2024.
-
Functional Observability, Structural Functional Observability and Optimal Sensor Placement
Authors:
Yuan Zhang,
Tyrone Fernando,
Mohamed Darouach
Abstract:
In this paper, new characterizations for functional observability, functional detectability, and structural functional observability (SFO) are developed, and based on them, the related optimal sensor placement problems are investigated. A novel concept of modal functional observability coinciding with the notion of modal observability is proposed. This notion introduces necessary and sufficient conditions for functional observability and detectability in a unified way without resorting to system observability decomposition, and facilitates the design of a functionally observable/detectable system. Afterwards, SFO is redefined rigorously from a generic perspective, in contrast to the definition of structural observability. A complete graph-theoretic characterization for SFO is proposed. Based on these results, the problems of selecting the minimal sensors from a prior set to achieve functional observability and SFO are shown to be NP-hard. Nevertheless, supermodular set functions are established, leading to greedy heuristics that can find approximate solutions to these problems with provable guarantees in polynomial time. A closed-form solution along with a constructive procedure is also given for the unconstrained case on systems with diagonalizable state matrices. Notably, our results also yield a polynomial-time verifiable case for structural target controllability, a problem that may be hard otherwise.
Submitted 17 September, 2024; v1 submitted 17 July, 2023;
originally announced July 2023.
-
Multi-Slice Net: A novel light weight framework for COVID-19 Diagnosis
Authors:
Harshala Gammulle,
Tharindu Fernando,
Sridha Sridharan,
Simon Denman,
Clinton Fookes
Abstract:
This paper presents a novel lightweight COVID-19 diagnosis framework using CT scans. Our system utilises a novel two-stage approach to generate robust and efficient diagnoses across heterogeneous patient-level inputs. We use a powerful backbone network as a feature extractor to capture discriminative slice-level features. These features are aggregated by a lightweight network to obtain a patient-level diagnosis. The aggregation network is carefully designed to have a small number of trainable parameters while also possessing sufficient capacity to generalise to diverse variations within different CT volumes and to adapt to noise introduced during the data acquisition. We achieve a significant performance increase over the baselines when benchmarked on the SPGC COVID-19 Radiomics Dataset, despite having only 2.5 million trainable parameters and requiring only 0.623 seconds on average to process a single patient's CT volume using an Nvidia-GeForce RTX 2080 GPU.
Submitted 8 August, 2021;
originally announced August 2021.
-
Robust and Interpretable Temporal Convolution Network for Event Detection in Lung Sound Recordings
Authors:
Tharindu Fernando,
Sridha Sridharan,
Simon Denman,
Houman Ghaemmaghami,
Clinton Fookes
Abstract:
This paper proposes a novel framework for lung sound event detection, segmenting continuous lung sound recordings into discrete events and performing recognition on each event. Exploiting the lightweight nature of Temporal Convolution Networks (TCNs) and their superior results compared to their recurrent counterparts, we propose a lightweight, yet robust, and completely interpretable framework for lung sound event detection. We propose the use of a multi-branch TCN architecture and exploit a novel fusion strategy to combine the resultant features from these branches. This not only allows the network to retain the most salient information across different temporal granularities and disregard irrelevant information, but also allows our network to process recordings of arbitrary length. The proposed method is evaluated on multiple public and in-house benchmarks of irregular and noisy recordings of the respiratory auscultation process for the identification of numerous auscultation events including inhalation, exhalation, crackles, wheeze, stridor, and rhonchi. We exceed the state-of-the-art results in all evaluations. Furthermore, we empirically analyse the effect of the proposed multi-branch TCN architecture and the feature fusion strategy and provide quantitative and qualitative evaluations to illustrate their efficiency. Moreover, we provide an end-to-end model interpretation pipeline that interprets the operations of all the components of the proposed framework. Our analysis of different feature fusion strategies shows that the proposed feature concatenation method leads to better suppression of non-informative features, which drastically reduces the classifier overhead, resulting in a robust lightweight network. The lightweight nature of our model allows it to be deployed in end-user devices such as smartphones, and it has the ability to generate predictions in real-time.
Submitted 30 June, 2021;
originally announced June 2021.
-
Deep Learning for Medical Anomaly Detection -- A Survey
Authors:
Tharindu Fernando,
Harshala Gammulle,
Simon Denman,
Sridha Sridharan,
Clinton Fookes
Abstract:
Machine learning-based medical anomaly detection is an important problem that has been extensively studied. Numerous approaches have been proposed across various medical application domains and we observe several similarities across these distinct applications. Despite this comparability, we observe a lack of structured organisation of these diverse research applications such that their advantages and limitations can be studied. The principal aim of this survey is to provide a thorough theoretical analysis of popular deep learning techniques in medical anomaly detection. In particular, we contribute a coherent and systematic review of state-of-the-art techniques, comparing and contrasting their architectural differences as well as training algorithms. Furthermore, we provide a comprehensive overview of deep model interpretation strategies that can be used to interpret model decisions. In addition, we outline the key limitations of existing deep medical anomaly detection techniques and propose key research directions for further investigation.
Submitted 13 April, 2021; v1 submitted 3 December, 2020;
originally announced December 2020.
-
Attention Driven Fusion for Multi-Modal Emotion Recognition
Authors:
Darshana Priyasad,
Tharindu Fernando,
Simon Denman,
Clinton Fookes,
Sridha Sridharan
Abstract:
Deep learning has emerged as a powerful alternative to hand-crafted methods for emotion recognition on combined acoustic and text modalities. Baseline systems model emotion information in text and acoustic modes independently using Deep Convolutional Neural Networks (DCNN) and Recurrent Neural Networks (RNN), followed by applying attention, fusion, and classification. In this paper, we present a deep learning-based approach to exploit and fuse text and acoustic data for emotion classification. We utilize a SincNet layer, based on parameterized sinc functions with band-pass filters, to extract acoustic features from raw audio followed by a DCNN. This approach learns filter banks tuned for emotion recognition and provides more effective features compared to directly applying convolutions over the raw speech signal. For text processing, we use two branches (a DCNN and a Bi-directional RNN followed by a DCNN) in parallel where cross attention is introduced to infer the N-gram level correlations on hidden representations received from the Bi-RNN. Following existing state-of-the-art, we evaluate the performance of the proposed system on the IEMOCAP dataset. Experimental results indicate that the proposed system outperforms existing methods, achieving 3.5% improvement in weighted accuracy.
Submitted 10 October, 2020; v1 submitted 23 September, 2020;
originally announced September 2020.
-
A Robust Interpretable Deep Learning Classifier for Heart Anomaly Detection Without Segmentation
Authors:
Theekshana Dissanayake,
Tharindu Fernando,
Simon Denman,
Sridha Sridharan,
Houman Ghaemmaghami,
Clinton Fookes
Abstract:
Traditionally, abnormal heart sound classification is framed as a three-stage process. The first stage involves segmenting the phonocardiogram to detect fundamental heart sounds, after which features are extracted and classification is performed. Some researchers in the field argue the segmentation step is an unwanted computational burden, whereas others embrace it as a prior step to feature extraction. When comparing accuracies achieved by studies that have segmented heart sounds before analysis with those that have skipped that step, the question of whether to segment heart sounds before feature extraction is still open. In this study, we explicitly examine the importance of heart sound segmentation as a prior step for heart sound classification, and then seek to apply the obtained insights to propose a robust classifier for abnormal heart sound detection. Furthermore, recognizing the pressing need for explainable Artificial Intelligence (AI) models in the medical domain, we also unveil hidden representations learned by the classifier using model interpretation techniques. Experimental results demonstrate that segmentation plays an essential role in abnormal heart sound classification. Our new classifier is also shown to be robust, stable and most importantly, explainable, with an accuracy of almost 100% on the widely used PhysioNet dataset.
Submitted 29 September, 2020; v1 submitted 21 May, 2020;
originally announced May 2020.
-
Heart Sound Segmentation using Bidirectional LSTMs with Attention
Authors:
Tharindu Fernando,
Houman Ghaemmaghami,
Simon Denman,
Sridha Sridharan,
Nayyar Hussain,
Clinton Fookes
Abstract:
This paper proposes a novel framework for the segmentation of phonocardiogram (PCG) signals into heart states, exploiting the temporal evolution of the PCG as well as considering the salient information that it provides for the detection of the heart state. We propose the use of recurrent neural networks and exploit recent advancements in attention-based learning to segment the PCG signal. This allows the network to identify the most salient aspects of the signal and disregard uninformative information. The proposed method attains state-of-the-art performance on multiple benchmarks including both human and animal heart recordings. Furthermore, we empirically analyse different feature combinations including envelope features, wavelet and Mel Frequency Cepstral Coefficients (MFCC), and provide quantitative measurements that explore the importance of different features in the proposed approach. We demonstrate that a recurrent neural network coupled with attention mechanisms can effectively learn from irregular and noisy PCG recordings. Our analysis of different feature combinations shows that MFCC features and their derivatives offer the best performance compared to classical wavelet and envelope features. Heart sound segmentation is a crucial pre-processing step for many diagnostic applications. The proposed method provides a cost-effective alternative to labour-intensive manual segmentation, and provides a more accurate segmentation than existing methods. As such, it can improve the performance of further analysis including the detection of murmurs and ejection clicks. The proposed method is also applicable for detection and segmentation of other one-dimensional biomedical signals.
Submitted 1 April, 2020;
originally announced April 2020.
-
Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection
Authors:
Tharindu Fernando,
Sridha Sridharan,
Mitchell McLaren,
Darshana Priyasad,
Simon Denman,
Clinton Fookes
Abstract:
This paper presents a novel framework for Speech Activity Detection (SAD). Inspired by the recent success of multi-task learning approaches in the speech processing domain, we propose a novel joint learning framework for SAD. We utilise generative adversarial networks to automatically learn a loss function for joint prediction of the frame-wise speech/non-speech classifications together with the next audio segment. In order to exploit the temporal relationships within the input signal, we propose a temporal discriminator which aims to ensure that the predicted signal is temporally consistent. We evaluate the proposed framework on multiple public benchmarks, including NIST OpenSAT'17, AMI Meeting and HAVIC, where we demonstrate its capability to outperform state-of-the-art SAD approaches. Furthermore, our cross-database evaluations demonstrate the robustness of the proposed approach across different languages, accents, and acoustic environments.
Submitted 1 April, 2020;
originally announced April 2020.
-
Neural Memory Networks for Seizure Type Classification
Authors:
David Ahmedt-Aristizabal,
Tharindu Fernando,
Simon Denman,
Lars Petersson,
Matthew J. Aburn,
Clinton Fookes
Abstract:
Classification of seizure type is a key step in the clinical process for evaluating an individual who presents with seizures. It determines the course of clinical diagnosis and treatment, and its impact stretches beyond the clinical domain to epilepsy research and the development of novel therapies. Automated identification of seizure type may facilitate understanding of the disease, and seizure detection and prediction has been the focus of recent research that has sought to exploit the benefits of machine learning and deep learning architectures. Nevertheless, there is not yet a definitive solution for automating the classification of seizure type, a task that must currently be performed by an expert epileptologist. Inspired by recent advances in neural memory networks (NMNs), we introduce a novel approach for the classification of seizure type using electrophysiological data. We first explore the performance of traditional deep learning techniques which use convolutional and recurrent neural networks, and enhance these architectures by using external memory modules with trainable neural plasticity. We show that our model achieves a state-of-the-art weighted F1 score of 0.945 for seizure type classification on the TUH EEG Seizure Corpus with the IBM TUSZ preprocessed data. This work highlights the potential of neural memory networks to support the field of epilepsy research, along with biomedical research and signal analysis more broadly.
Submitted 29 January, 2020; v1 submitted 10 December, 2019;
originally announced December 2019.