-
Numerical Techniques for the Maximum Likelihood Toeplitz Covariance Matrix Estimation: Part I. Symmetric Toeplitz Matrices
Authors:
Yuri Abramovich,
Victor Abramovich,
Tanit Pongsiri
Abstract:
In several applications, one must estimate a real-valued (symmetric) Toeplitz covariance matrix, typically shifted by the conjugated diagonal matrices of phase progression and phase "calibration" errors. Unlike the Hermitian Toeplitz covariance matrices, these symmetric matrices have a unique potential capability of being estimated regardless of these beam-steering phase progression and/or phase "calibration" errors. This unique capability is the primary motivation of this paper.
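The structural fact behind this abstract can be illustrated with a minimal sketch. It is not the paper's estimator. It only builds a real symmetric Toeplitz matrix from a list of lags, applies a diagonal phase matrix and its conjugate on either side, and checks that the entry magnitudes are unchanged. The function names, lag values, and phase values are assumptions chosen for illustration.

```python
import cmath


def symmetric_toeplitz(lags):
    """Real symmetric Toeplitz matrix: entry (i, j) depends only on |i - j|."""
    n = len(lags)
    return [[lags[abs(i - j)] for j in range(n)] for i in range(n)]


def apply_phase_errors(matrix, phases):
    """Return D * matrix * D^H with D = diag(exp(1j * phase_k))."""
    d = [cmath.exp(1j * p) for p in phases]
    n = len(matrix)
    return [[d[i] * matrix[i][j] * d[j].conjugate() for j in range(n)]
            for i in range(n)]


# Hypothetical lags and per-sensor phase errors (illustrative values only).
lags = [2.0, 1.0, -0.5, 0.25]
phases = [0.3, -1.1, 2.0, 0.7]

R = symmetric_toeplitz(lags)
R_shifted = apply_phase_errors(R, phases)

# The phase factors have unit modulus, so every entry keeps its magnitude.
for i in range(len(R)):
    print([round(abs(x), 6) for x in R_shifted[i]])
```

The shifted matrix is Hermitian but in general no longer real, and the magnitudes of its entries still follow the Toeplitz pattern of the original lags.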
Submitted 1 July, 2025;
originally announced July 2025.
-
On sample-based functional observability of linear systems
Authors:
Isabelle Krauss,
Victor G. Lopez,
Matthias A. Müller
Abstract:
Sample-based observability characterizes the ability to reconstruct the internal state of a dynamical system by using limited output information, i.e., when measurements are only infrequently and/or irregularly available. In this work, we investigate the concept of functional observability, which refers to the ability to infer a function of the system state from the outputs, within a sample-based framework. Here, we give necessary and sufficient conditions for a system to be sample-based functionally observable, and formulate conditions on the sampling schemes such that these are satisfied. Furthermore, we provide a numerical example, where we demonstrate the applicability of the obtained results.
Submitted 30 June, 2025;
originally announced June 2025.
-
Securing the Sky: Integrated Satellite-UAV Physical Layer Security for Low-Altitude Wireless Networks
Authors:
Jiahui Li,
Geng Sun,
Xiaoyu Sun,
Fang Mei,
Jingjing Wang,
Xiangwang Hou,
Daxin Tian,
Victor C. M. Leung
Abstract:
Low-altitude wireless networks (LAWNs) have garnered significant attention in the forthcoming 6G networks. In LAWNs, satellites with wide coverage and unmanned aerial vehicles (UAVs) with flexible mobility can complement each other to form integrated satellite-UAV networks, providing ubiquitous and high-speed connectivity for low-altitude operations. However, the higher line-of-sight probability in low-altitude airspace increases transmission security concerns. In this work, we present a collaborative beamforming-based physical layer security scheme for LAWNs. We introduce the fundamental aspects of integrated satellite-UAV networks, physical layer security, UAV swarms, and collaborative beamforming for LAWN applications. Following this, we highlight several opportunities for collaborative UAV swarm secure applications enabled by satellite networks, including achieving physical layer security in scenarios involving data dissemination, data relay, eavesdropper collusion, and imperfect eavesdropper information. Next, we detail two case studies: a secure relay system and a two-way aerial secure communication framework specifically designed for LAWN environments. Simulation results demonstrate that these physical layer security schemes are effective and beneficial for secure low-altitude wireless communications. A short practicality analysis shows that the proposed method is applicable to LAWN scenarios. Finally, we discuss current challenges and future research directions for enhancing security in LAWNs.
Submitted 29 June, 2025;
originally announced June 2025.
-
Identifiability and Maximum Likelihood Estimation for System Identification of Networks of Dynamical Systems
Authors:
Anders Hansson,
João Victor Galvão da Mata,
Martin S. Andersen
Abstract:
In this paper we investigate identifiability and maximum likelihood estimation for direct system identification of networks of dynamical systems. We provide necessary and sufficient conditions for network identifiability in terms of Gröbner bases. We show that the maximum likelihood approach is both consistent and efficient, which is in contrast to existing prediction error approaches. Moreover, our approach has wider applicability, i.e., it is applicable whenever network identifiability holds. Finally, we show that we can formulate the maximum likelihood problem without the use of a predictor, which is the key to numerically being able to solve it efficiently.
Submitted 28 June, 2025; v1 submitted 25 June, 2025;
originally announced June 2025.
-
Estimating Spatially-Dependent GPS Errors Using a Swarm of Robots
Authors:
Praneeth Somisetty,
Robert Griffin,
Victor M. Baez,
Miguel F. Arevalo-Castiblanco,
Aaron T. Becker,
Jason M. O'Kane
Abstract:
External factors, including urban canyons and adversarial interference, can lead to Global Positioning System (GPS) inaccuracies that vary as a function of the position in the environment. This study addresses the challenge of estimating a static, spatially-varying error function using a team of robots. We introduce a State Bias Estimation Algorithm (SBE) whose purpose is to estimate the GPS biases. The central idea is to use sensed estimates of the range and bearing to the other robots in the team to estimate changes in bias across the environment. A set of drones moves in a 2D environment, each sampling data from GPS, range, and bearing sensors. The biases calculated by the SBE at estimated positions are used to train a Gaussian Process Regression (GPR) model. We use a Sparse Gaussian process-based Informative Path Planning (IPP) algorithm that identifies high-value regions of the environment for data collection. The swarm plans paths that maximize information gain in each iteration, further refining their understanding of the environment's positional bias landscape. We evaluated SBE and IPP in simulation and compared the IPP methodology to an open-loop strategy.
Submitted 26 June, 2025; v1 submitted 24 June, 2025;
originally announced June 2025.
-
Overcoming Occlusions in the Wild: A Multi-Task Age Head Approach to Age Estimation
Authors:
Waqar Tanveer,
Laura Fernández-Robles,
Eduardo Fidalgo,
Víctor González-Castro,
Enrique Alegre
Abstract:
Facial age estimation has achieved considerable success under controlled conditions. However, in unconstrained real-world scenarios, which are often referred to as 'in the wild', age estimation remains challenging, especially when faces are partially occluded, which may obscure their visibility. To address this limitation, we propose a new approach integrating generative adversarial networks (GANs) and transformer architectures to enable robust age estimation from occluded faces. We employ an SN-Patch GAN to effectively remove occlusions, while an Attentive Residual Convolution Module (ARCM), paired with a Swin Transformer, enhances feature representation. Additionally, we introduce a Multi-Task Age Head (MTAH) that combines regression and distribution learning, further improving age estimation under occlusion. Experimental results on the FG-NET, UTKFace, and MORPH datasets demonstrate that our proposed approach surpasses existing state-of-the-art techniques for occluded facial age estimation by achieving an MAE of $3.00$, $4.54$, and $2.53$ years, respectively.
Submitted 16 June, 2025;
originally announced June 2025.
-
Exploring Audio Cues for Enhanced Test-Time Video Model Adaptation
Authors:
Runhao Zeng,
Qi Deng,
Ronghao Zhang,
Shuaicheng Niu,
Jian Chen,
Xiping Hu,
Victor C. M. Leung
Abstract:
Test-time adaptation (TTA) aims to boost the generalization capability of a trained model by conducting self-/unsupervised learning during the testing phase. While most existing TTA methods for video primarily utilize visual supervisory signals, they often overlook the potential contribution of inherent audio data. To address this gap, we propose a novel approach that incorporates audio information into video TTA. Our method capitalizes on the rich semantic content of audio to generate audio-assisted pseudo-labels, a new concept in the context of video TTA. Specifically, we propose an audio-to-video label mapping method by first employing pre-trained audio models to classify audio signals extracted from videos and then mapping the audio-based predictions to video label spaces through large language models, thereby establishing a connection between the audio categories and video labels. To effectively leverage the generated pseudo-labels, we present a flexible adaptation cycle that determines the optimal number of adaptation iterations for each sample, based on changes in loss and consistency across different views. This enables a customized adaptation process for each sample. Experimental results on two widely used datasets (UCF101-C and Kinetics-Sounds-C), as well as on two newly constructed audio-video TTA datasets (AVE-C and AVMIT-C) with various corruption types, demonstrate the superiority of our approach. Our method consistently improves adaptation performance across different video classification models and represents a significant step forward in integrating audio information into video TTA. Code: https://github.com/keikeiqi/Audio-Assisted-TTA.
Submitted 14 June, 2025;
originally announced June 2025.
-
Less Conservative Adaptive Gain-scheduling Control for Continuous-time Systems with Polytopic Uncertainties
Authors:
Ariany C. Oliveira,
Victor C. S. Campos,
Leonardo A. Mozelli
Abstract:
The synthesis of adaptive gain-scheduling controllers is discussed for continuous-time linear models characterized by polytopic uncertainties. The proposed approach computes the control law assuming the parameters as uncertain and adaptively provides an estimate for the gain-scheduling implementation. Conservativeness is reduced using our recent results on describing uncertainty: i) a structural relaxation that casts the parameters as outer terms and introduces slack variables; and ii) a precise topological representation that describes the mismatch between the uncertainty and its estimate. Numerical examples illustrate a high degree of relaxation in comparison with the state-of-the-art.
Submitted 14 June, 2025;
originally announced June 2025.
-
Enhancing Privacy: The Utility of Stand-Alone Synthetic CT and MRI for Tumor and Bone Segmentation
Authors:
André Ferreira,
Kunpeng Xie,
Caroline Wilpert,
Gustavo Correia,
Felix Barajas Ordonez,
Tiago Gil Oliveira,
Maike Bode,
Robert Siepmann,
Frank Hölzle,
Rainer Röhrig,
Jens Kleesiek,
Daniel Truhn,
Jan Egger,
Victor Alves,
Behrus Puladi
Abstract:
AI requires extensive datasets, while medical data is subject to high data protection. Anonymization is essential, but poses a challenge for some regions, such as the head, as identifying structures overlap with regions of clinical interest. Synthetic data offers a potential solution, but studies often lack rigorous evaluation of realism and utility. Therefore, we investigate to what extent synthetic data can replace real data in segmentation tasks. We employed head and neck cancer CT scans and brain glioma MRI scans from two large datasets. Synthetic data were generated using generative adversarial networks and diffusion models. We evaluated the quality of the synthetic data using MAE, MS-SSIM, Radiomics and a Visual Turing Test (VTT) performed by 5 radiologists, and their usefulness in segmentation tasks using DSC. Radiomics indicates high fidelity of synthetic MRIs, but the generative models fall short in producing highly realistic CT tissue, with correlation coefficients of 0.8784 and 0.5461 for MRI and CT tumors, respectively. DSC results indicate limited utility of synthetic data: tumor segmentation achieved DSC=0.064 on CT and 0.834 on MRI, while bone segmentation achieved a mean DSC=0.841. A relation between DSC and correlation is observed, but is limited by the complexity of the task. VTT results show synthetic CTs' utility, but with limited educational applications. Synthetic data can be used independently for the segmentation task, although limited by the complexity of the structures to segment. Advancing generative models to better tolerate heterogeneous inputs and learn subtle details is essential for enhancing their realism and expanding their application potential.
Submitted 13 June, 2025;
originally announced June 2025.
-
Adapting to Heterophilic Graph Data with Structure-Guided Neighbor Discovery
Authors:
Victor M. Tenorio,
Madeline Navarro,
Samuel Rey,
Santiago Segarra,
Antonio G. Marques
Abstract:
Graph Neural Networks (GNNs) often struggle with heterophilic data, where connected nodes may have dissimilar labels, as they typically assume homophily and rely on local message passing. To address this, we propose creating alternative graph structures by linking nodes with similar structural attributes (e.g., role-based or global), thereby fostering higher label homophily on these new graphs. We theoretically prove that GNN performance can be improved by utilizing graphs with fewer false positive edges (connections between nodes of different classes) and that considering multiple graph views increases the likelihood of finding such beneficial structures. Building on these insights, we introduce Structure-Guided GNN (SG-GNN), an architecture that processes the original graph alongside the newly created structural graphs, adaptively learning to weigh their contributions. Extensive experiments on various benchmark datasets, particularly those with heterophilic characteristics, demonstrate that our SG-GNN achieves state-of-the-art or highly competitive performance, highlighting the efficacy of exploiting structural information to guide GNNs.
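The notion of label homophily used in this abstract can be made concrete with a small sketch. It uses the common edge-homophily ratio (the fraction of edges whose endpoints share a label), which may differ from the exact measure used in the paper. The toy graph, the labels, and the "structural" edge set are hypothetical and chosen only for illustration.

```python
def edge_homophily(edges, labels):
    """Fraction of edges whose two endpoints carry the same label."""
    if not edges:
        raise ValueError("edge list must not be empty")
    same = sum(1 for u, v in edges if labels[u] == labels[v])
    return same / len(edges)


# Hypothetical toy graph: most original edges join nodes of different classes.
labels = {0: "a", 1: "a", 2: "b", 3: "b"}
original_edges = [(0, 2), (0, 3), (1, 2), (0, 1)]

# Hypothetical alternative graph linking nodes with similar structural roles.
structural_edges = [(0, 1), (2, 3)]

print(edge_homophily(original_edges, labels))
print(edge_homophily(structural_edges, labels))
```

A higher ratio on the alternative graph corresponds to fewer edges between nodes of different classes, which is the property the abstract argues benefits GNN performance.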
Submitted 10 June, 2025;
originally announced June 2025.
-
Multivariate Probabilistic Assessment of Speech Quality
Authors:
Fredrik Cumlin,
Xinyu Liang,
Victor Ungureanu,
Chandan K. A. Reddy,
Christian Schüldt,
Saikat Chatterjee
Abstract:
The mean opinion score (MOS) is a standard metric for assessing speech quality, but its singular focus fails to identify specific distortions when low scores are observed. The NISQA dataset addresses this limitation by providing ratings across four additional dimensions: noisiness, coloration, discontinuity, and loudness, alongside MOS. In this paper, we extend the explored univariate MOS estimation to a multivariate framework by modeling these dimensions jointly using a multivariate Gaussian distribution. Our approach utilizes Cholesky decomposition to predict covariances without imposing restrictive assumptions and extends probabilistic affine transformations to a multivariate context. Experimental results show that our model performs on par with state-of-the-art methods in point estimation, while uniquely providing uncertainty and correlation estimates across speech quality dimensions. This enables better diagnosis of poor speech quality and informs targeted improvements.
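How a Cholesky factor yields a valid covariance matrix can be sketched as follows. This is not the authors' model. It assumes that a network emits dim * (dim + 1) / 2 unconstrained numbers, that the diagonal of the factor is made positive with an exponential (one common choice, assumed here), and that the covariance is then formed as L times its transpose. The function names and raw values are hypothetical.

```python
import math


def cholesky_factor_from_raw(raw, dim):
    """Fill a lower-triangular factor row by row from unconstrained values."""
    if len(raw) != dim * (dim + 1) // 2:
        raise ValueError("expected dim * (dim + 1) / 2 raw values")
    factor = [[0.0] * dim for _ in range(dim)]
    k = 0
    for i in range(dim):
        for j in range(i + 1):
            # exp keeps the diagonal strictly positive (assumed choice).
            factor[i][j] = math.exp(raw[k]) if i == j else raw[k]
            k += 1
    return factor


def covariance_from_factor(factor):
    """Sigma = L L^T, symmetric positive definite by construction."""
    n = len(factor)
    return [[sum(factor[i][k] * factor[j][k] for k in range(n))
             for j in range(n)] for i in range(n)]


# Hypothetical raw outputs for three quality dimensions.
raw = [0.1, -0.4, 0.0, 0.7, 0.2, -0.3]
L = cholesky_factor_from_raw(raw, 3)
sigma = covariance_from_factor(L)
for row in sigma:
    print([round(x, 4) for x in row])
```

Because any lower-triangular factor with a positive diagonal gives a positive definite product, the off-diagonal covariances are free to take any sign without an extra constraint.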
Submitted 5 June, 2025;
originally announced June 2025.
-
Model Splitting Enhanced Communication-Efficient Federated Learning for CSI Feedback
Authors:
Yanjie Dong,
Haijun Zhang,
Gaojie Chen,
Xiaoyi Fan,
Victor C. M. Leung,
Xiping Hu
Abstract:
Recent advancements have introduced federated machine learning-based channel state information (CSI) compression before the user equipments (UEs) upload the downlink CSI to the base transceiver station (BTS). However, most existing algorithms impose a high communication overhead due to frequent parameter exchanges between UEs and BTS. In this work, we propose a model splitting approach with a shared model at the BTS and multiple local models at the UEs to reduce communication overhead. Moreover, we implant a pipeline module at the BTS to reduce training time. By limiting exchanges of boundary parameters during forward and backward passes, our algorithm can significantly reduce the exchanged parameters over the benchmarks during federated CSI feedback training.
Submitted 4 June, 2025;
originally announced June 2025.
-
Second-order AAA algorithms for structured data-driven modeling
Authors:
Michael S. Ackermann,
Ion Victor Gosea,
Serkan Gugercin,
Steffen W. R. Werner
Abstract:
The data-driven modeling of dynamical systems has become an essential tool for the construction of accurate computational models from real-world data. In this process, the inherent differential structures underlying the considered physical phenomena are often neglected, making the reinterpretation of the learned models in a physically meaningful sense very challenging. In this work, we present three data-driven modeling approaches for the construction of dynamical systems with second-order differential structure directly from frequency domain data. Based on the second-order structured barycentric form, we extend the well-known Adaptive Antoulas-Anderson algorithm to the case of second-order systems. Depending on the available computational resources, we propose variations of the proposed method that prioritize either higher computation speed or greater modeling accuracy, and we present a theoretical analysis for the expected accuracy and performance of the proposed methods. Three numerical examples demonstrate the effectiveness of our new structured approaches in comparison to classical unstructured data-driven modeling.
Submitted 2 June, 2025;
originally announced June 2025.
-
Distributed Intelligence in the Computing Continuum with Active Inference
Authors:
Victor Casamayor Pujol,
Boris Sedlak,
Tommaso Salvatori,
Karl Friston,
Schahram Dustdar
Abstract:
The Computing Continuum (CC) is an emerging Internet-based computing paradigm that spans from local Internet of Things sensors and constrained edge devices to large-scale cloud data centers. Its goal is to orchestrate a vast array of diverse and distributed computing resources to support the next generation of Internet-based applications. However, the distributed, heterogeneous, and dynamic nature of CC platforms demands distributed intelligence for adaptive and resilient service management. This article introduces a distributed stream processing pipeline as a CC use case, where each service is managed by an Active Inference (AIF) agent. These agents collaborate to fulfill service needs specified by SLOiDs, a term we introduce to denote Service Level Objectives that are aware of their deployed devices, meaning that non-functional requirements must consider the characteristics of the hosting device. We demonstrate how AIF agents can be modeled and deployed alongside distributed services to manage them autonomously. Our experiments show that AIF agents achieve over 90% SLOiD fulfillment when using tested transition models, and around 80% when learning the models during deployment. We compare their performance to a multi-agent reinforcement learning algorithm, finding that while both approaches yield similar results, MARL requires extensive training, whereas AIF agents can operate effectively from the start. Additionally, we evaluate the behavior of AIF agents in offloading scenarios, observing a strong capacity for adaptation. Finally, we outline key research directions to advance AIF integration in CC platforms.
Submitted 30 May, 2025;
originally announced May 2025.
-
DeepInverse: A Python package for solving imaging inverse problems with deep learning
Authors:
Julián Tachella,
Matthieu Terris,
Samuel Hurault,
Andrew Wang,
Dongdong Chen,
Minh-Hai Nguyen,
Maxime Song,
Thomas Davies,
Leo Davy,
Jonathan Dong,
Paul Escande,
Johannes Hertrich,
Zhiyuan Hu,
Tobías I. Liaudat,
Nils Laurent,
Brett Levac,
Mathurin Massias,
Thomas Moreau,
Thibaut Modrzyk,
Brayan Monroy,
Sebastian Neumayer,
Jérémy Scanvic,
Florian Sarron,
Victor Sechaud,
Georg Schramm
, et al. (2 additional authors not shown)
Abstract:
DeepInverse is an open-source PyTorch-based library for solving imaging inverse problems. The library covers all crucial steps in image reconstruction from the efficient implementation of forward operators (e.g., optics, MRI, tomography), to the definition and resolution of variational problems and the design and training of advanced neural network architectures. In this paper, we describe the main functionality of the library and discuss the main design choices.
Submitted 17 June, 2025; v1 submitted 26 May, 2025;
originally announced May 2025.
-
Towards a Spatiotemporal Fusion Approach to Precipitation Nowcasting
Authors:
Felipe Curcio,
Pedro Castro,
Augusto Fonseca,
Rafaela Castro,
Raquel Franco,
Eduardo Ogasawara,
Victor Stepanenko,
Fabio Porto,
Mariza Ferro,
Eduardo Bezerra
Abstract:
With the increasing availability of meteorological data from various sensors, numerical models and reanalysis products, the need for efficient data integration methods has become paramount for improving weather forecasts and hydrometeorological studies. In this work, we propose a data fusion approach for precipitation nowcasting by integrating data from meteorological and rain gauge stations in the Rio de Janeiro metropolitan area with ERA5 reanalysis data and GFS numerical weather prediction. We employ the spatiotemporal deep learning architecture called STConvS2S, leveraging a structured dataset covering a 9 x 11 grid. The study spans from January 2011 to October 2024, and we evaluate the impact of integrating three surface station systems. Among the tested configurations, the fusion-based model achieves an F1-score of 0.2033 for forecasting heavy precipitation events (greater than 25 mm/h) at a one-hour lead time. Additionally, we present an ablation study to assess the contribution of each station network and propose a refined inference strategy for precipitation nowcasting, integrating the GFS numerical weather prediction (NWP) data with in-situ observations.
Submitted 25 May, 2025;
originally announced May 2025.
-
Sufficient Conditions for Detectability of Approximately Discretized Nonlinear Systems
Authors:
Seth Siriya,
Julian D. Schiller,
Victor G. Lopez,
Matthias A. Müller
Abstract:
In many sampled-data applications, observers are designed based on approximately discretized models of continuous-time systems, where usually only the discretized system is analyzed in terms of its detectability. In this paper, we show that if the continuous-time system satisfies certain linear matrix inequality (LMI) conditions, and the sampling period of the discretization scheme is sufficiently small, then the whole family of discretized systems (parameterized by the sampling period) satisfies analogous discrete-time LMI conditions that imply detectability. Our results are applicable to general discretization schemes, as long as they produce approximate models whose linearizations are in some sense consistent with the linearizations of the continuous-time ones. We explicitly show that the Euler and second-order Runge-Kutta methods satisfy this condition. A batch-reactor system example is provided to highlight the usefulness of our results from a practical perspective.
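The idea that an approximate discretization is consistent with the continuous-time model for small sampling periods can be illustrated on a scalar linear test system x' = a x. This sketch does not reproduce the paper's LMI conditions. It only compares the one-step factors of the Euler and explicit second-order Runge-Kutta schemes against the exact factor exp(a h). The value of a and the sampling periods are assumptions chosen for illustration.

```python
import math


def euler_factor(a, h):
    """One Euler step on x' = a x maps x to (1 + a h) x."""
    return 1.0 + a * h


def rk2_factor(a, h):
    """One explicit second-order Runge-Kutta step on x' = a x."""
    z = a * h
    return 1.0 + z + 0.5 * z * z


def one_step_error(factor, a, h):
    """Gap between the exact factor exp(a h) and the scheme's factor."""
    return abs(math.exp(a * h) - factor(a, h))


a = -2.0  # hypothetical stable pole
for h in (0.1, 0.05, 0.025):
    print(h, one_step_error(euler_factor, a, h),
          one_step_error(rk2_factor, a, h))
```

Halving the sampling period shrinks the Euler one-step error by roughly a factor of four and the second-order error by roughly a factor of eight, so both discretized models approach the exact one as the sampling period tends to zero.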
Submitted 23 May, 2025;
originally announced May 2025.
-
Reverse-Speech-Finder: A Neural Network Backtracking Architecture for Generating Alzheimer's Disease Speech Samples and Improving Diagnosis Performance
Authors:
Victor OK Li,
Yang Han,
Jacqueline CK Lam,
Lawrence YL Cheung
Abstract:
This study introduces Reverse-Speech-Finder (RSF), a groundbreaking neural network backtracking architecture designed to enhance Alzheimer's Disease (AD) diagnosis through speech analysis. Leveraging the power of pre-trained large language models, RSF identifies and utilizes the most probable AD-specific speech markers, addressing both the scarcity of real AD speech samples and the challenge of limited interpretability in existing models. RSF's unique approach consists of three core innovations: Firstly, it exploits the observation that speech markers most probable of predicting AD, defined as the most probable speech-markers (MPMs), must have the highest probability of activating those neurons (in the neural network) with the highest probability of predicting AD, defined as the most probable neurons (MPNs). Secondly, it utilizes a speech token representation at the input layer, allowing backtracking from MPNs to identify the most probable speech-tokens (MPTs) of AD. Lastly, it develops an innovative backtracking method to track backwards from the MPNs to the input layer, identifying the MPTs and the corresponding MPMs, and ingeniously uncovering novel speech markers for AD detection. Experimental results demonstrate RSF's superiority over traditional methods such as SHAP and Integrated Gradients, achieving a 3.5% improvement in accuracy and a 3.2% boost in F1-score. By generating speech data that encapsulates novel markers, RSF not only mitigates the limitations of real data scarcity but also significantly enhances the robustness and accuracy of AD diagnostic models. These findings underscore RSF's potential as a transformative tool in speech-based AD detection, offering new insights into AD-related linguistic deficits and paving the way for more effective non-invasive early intervention strategies.
Submitted 23 May, 2025;
originally announced May 2025.
-
A system identification approach to clustering vector autoregressive time series
Authors:
Zuogong Yue,
Xinyi Wang,
Victor Solo
Abstract:
Clustering time series based on their underlying dynamics continues to attract researchers because of its value in assisting complex system modelling. Most current time series clustering methods handle only scalar time series, treat them as white noise, or rely on domain knowledge for high-quality feature construction, where the autocorrelation pattern/feature is mostly ignored. Instead of relying on heuristic feature/metric construction, the system identification approach allows treating vector time series clustering by explicitly considering their underlying autoregressive dynamics. We first derive a clustering algorithm based on a mixture autoregressive model. Unfortunately, it turns out to have significant computational problems. We then derive a `small-noise' limiting version of the algorithm, which we call k-LMVAR (Limiting Mixture Vector AutoRegression), that is computationally manageable. We develop an associated BIC criterion for choosing the number of clusters and model order. The algorithm performs very well in comparative simulations and also scales well computationally.
Submitted 20 May, 2025;
originally announced May 2025.
-
Federated learning in low-resource settings: A chest imaging study in Africa -- Challenges and lessons learned
Authors:
Jorge Fabila,
Lidia Garrucho,
Víctor M. Campello,
Carlos Martín-Isla,
Karim Lekadir
Abstract:
This study explores the use of Federated Learning (FL) for tuberculosis (TB) diagnosis using chest X-rays in low-resource settings across Africa. FL allows hospitals to collaboratively train AI models without sharing raw patient data, addressing privacy concerns and data scarcity that hinder traditional centralized models. The research involved hospitals and research centers in eight African countries. Most sites used local datasets, while Ghana and The Gambia used public ones. The study compared locally trained models with a federated model built across all institutions to evaluate FL's real-world feasibility. Despite its promise, implementing FL in sub-Saharan Africa faces challenges such as poor infrastructure, unreliable internet, limited digital literacy, and weak AI regulations. Some institutions were also reluctant to share model updates due to data control concerns. In conclusion, FL shows strong potential for enabling AI-driven healthcare in underserved regions, but broader adoption will require improvements in infrastructure, education, and regulatory support.
Submitted 20 May, 2025;
originally announced May 2025.
-
Cross-layer Integrated Sensing and Communication: A Joint Industrial and Academic Perspective
Authors:
Henk Wymeersch,
Nuutti Tervo,
Stefan Wänstedt,
Sharief Saleh,
Joerg Ahlendorf,
Ozgur Akgul,
Vasileios Tsekenis,
Sokratis Barmpounakis,
Liping Bai,
Martin Beale,
Rafael Berkvens,
Nabeel Nisar Bhat,
Hui Chen,
Shrayan Das,
Claude Desset,
Antonio de la Oliva,
Prajnamaya Dass,
Jeroen Famaey,
Hamed Farhadi,
Gerhard P. Fettweis,
Yu Ge,
Hao Guo,
Rreze Halili,
Katsuyuki Haneda,
Abdur Rahman Mohamed Ismail, et al. (18 additional authors not shown)
Abstract:
Integrated sensing and communication (ISAC) enables radio systems to simultaneously sense and communicate with their environment. This paper, developed within the Hexa-X-II project funded by the European Union, presents a comprehensive cross-layer vision for ISAC in 6G networks, integrating insights from physical-layer design, hardware architectures, AI-driven intelligence, and protocol-level innovations. We begin by revisiting the foundational principles of ISAC, highlighting synergies and trade-offs between sensing and communication across different integration levels. Enabling technologies, such as multiband operation, massive and distributed MIMO, non-terrestrial networks, reconfigurable intelligent surfaces, and machine learning, are analyzed in conjunction with hardware considerations including waveform design, synchronization, and full-duplex operation. To bridge implementation and system-level evaluation, we introduce a quantitative cross-layer framework linking design parameters to key performance and value indicators. By synthesizing perspectives from both academia and industry, this paper outlines how deeply integrated ISAC can transform 6G into a programmable and context-aware platform supporting applications from reliable wireless access to autonomous mobility and digital twinning.
Submitted 16 May, 2025;
originally announced May 2025.
-
Learning Nonlinear Dynamics in Physical Modelling Synthesis using Neural Ordinary Differential Equations
Authors:
Victor Zheleznov,
Stefan Bilbao,
Alec Wright,
Simon King
Abstract:
Modal synthesis methods are a long-standing approach for modelling distributed musical systems. In some cases, extensions are possible in order to handle geometric nonlinearities. One such case is the high-amplitude vibration of a string, where geometric nonlinearities lead to perceptually important effects, including pitch glides and a dependence of brightness on striking amplitude. A modal decomposition leads to a coupled nonlinear system of ordinary differential equations. Recent applied machine learning approaches (in particular neural ordinary differential equations) have been used to model lumped dynamic systems such as electronic circuits automatically from data. In this work, we examine how modal decomposition can be combined with neural ordinary differential equations for modelling distributed musical systems. The proposed model leverages the analytical solution for linear vibration of the system's modes and employs a neural network to account for nonlinear dynamic behaviour. Physical parameters of a system remain easily accessible after training, without the need for a parameter encoder in the network architecture. As an initial proof of concept, we generate synthetic data for a nonlinear transverse string and show that the model can be trained to reproduce the nonlinear dynamics of the system. Sound examples are presented.
Submitted 15 May, 2025;
originally announced May 2025.
-
Load-independent Metrics for Benchmarking Force Controllers
Authors:
Victor Shime,
Elisa G. Vergamini,
Cícero Zanette,
Leonardo F. dos Santos,
Lucca Maitan,
Andrea Calanca,
Thiago Boaventura
Abstract:
Torque-controlled actuators are critical components in mechatronic systems that closely interact with their environment, such as legged robots, collaborative manipulators, and exoskeletons. The performance and stability of these actuators depend not only on controller design and system dynamics but also significantly on load characteristics, which may include interactions with humans or unstructured environments. This load dependence highlights the need for frameworks that properly assess and compare torque controllers independent of specific loading conditions. In this short paper, we concisely present a modeling approach that captures the impact of load on the closed-loop dynamics of torque-controlled systems. Based on this model, we propose new methods and quantitative metrics, including the Passivity Index Interval, which blends passivity and small-gain theory to offer a less conservative measure of coupled stability than passivity alone. These metrics can be used alongside traditional control performance indicators, such as settling time and bandwidth, to provide a more comprehensive characterization of torque-controlled systems. We demonstrate the application of the proposed metrics through experimental comparisons of linear actuator force controllers.
Submitted 13 May, 2025;
originally announced May 2025.
-
Distributed Event-Triggered Nash Equilibrium Seeking for Noncooperative Games
Authors:
Victor Hugo Pereira Rodrigues,
Tiago Roux Oliveira,
Miroslav Krstic,
Tamer Basar
Abstract:
We propose locally convergent Nash equilibrium seeking algorithms for $N$-player noncooperative games, which use distributed event-triggered pseudo-gradient estimates. The proposed approach employs sinusoidal perturbations to estimate the pseudo-gradients of unknown quadratic payoff functions. This is the first instance of noncooperative games being tackled in a model-free fashion with event-triggered extremum seeking. Each player evaluates independently the deviation between the corresponding current pseudo-gradient estimate and its last broadcasted value from the event-triggering mechanism to tune individually the player action, while they preserve collectively the closed-loop stability/convergence. We guarantee Zeno behavior avoidance by establishing a minimum dwell-time to avoid infinitely fast switching. In particular, the stability analysis is carried out using Lyapunov's method and averaging for systems with discontinuous right-hand sides. We quantify the size of the ultimate small residual sets around the Nash equilibrium and illustrate the theoretical results numerically on an oligopoly setting.
Submitted 10 May, 2025;
originally announced May 2025.
-
Multi-User Beamforming with Deep Reinforcement Learning in Sensing-Aided Communication
Authors:
Xiyu Wang,
Gilberto Berardinelli,
Hei Victor Cheng,
Petar Popovski,
Ramoni Adeogun
Abstract:
Mobile users are prone to experience beam failure due to beam drifting in millimeter wave (mmWave) communications. Sensing can help alleviate beam drifting with timely beam changes and low overhead since it does not need user feedback. This work studies the problem of optimizing sensing-aided communication by dynamically managing beams allocated to mobile users. A multi-beam scheme is introduced, which allocates multiple beams to the users that need an update on the angle of departure (AoD) estimates and a single beam to the users that have satisfied AoD estimation precision. A deep reinforcement learning (DRL) assisted method is developed to optimize the beam allocation policy, relying only upon the sensing echoes. For comparison, a heuristic AoD-based method using an approximated Cramér-Rao lower bound (CRLB) for allocation is also presented. Both methods require neither user feedback nor prior state evolution information. Results show that the DRL-assisted method achieves a considerable throughput gain over the conventional beam sweeping method and the AoD-based method, and that it is robust to different user speeds.
Submitted 9 May, 2025;
originally announced May 2025.
-
$\mathcal{H}_2$-optimal model reduction of linear quadratic-output systems by multivariate rational interpolation
Authors:
Sean Reiter,
Ion Victor Gosea,
Igor Pontes Duff,
Serkan Gugercin
Abstract:
This paper addresses the $\mathcal{H}_2$-optimal approximation of linear dynamical systems with quadratic-output functions, also known as linear quadratic-output systems. Our major contributions are threefold. First, we derive interpolation-based first-order optimality conditions for the linear quadratic-output $\mathcal{H}_2$ minimization problem. These conditions correspond to the mixed-multipoint tangential interpolation of the full-order linear- and quadratic-output transfer functions, and generalize the Meier-Luenberger optimality framework for the $\mathcal{H}_2$-optimal model reduction of linear time-invariant systems. Second, given the interpolation data, we show how to enforce these mixed-multipoint tangential interpolation conditions explicitly by Petrov-Galerkin projection of the full-order model matrices. Third, to find the optimal interpolation data, we build on this projection framework and propose a generalization of the iterative rational Krylov algorithm for the $\mathcal{H}_2$-optimal model reduction of linear quadratic-output systems, called LQO-IRKA. Upon convergence, LQO-IRKA produces a reduced linear quadratic-output system that satisfies the interpolatory optimality conditions. The method only requires solving shifted linear systems and matrix-vector products, thus making it suitable for large-scale problems. Numerical examples are included to illustrate the effectiveness of the proposed method.
Submitted 5 May, 2025;
originally announced May 2025.
-
Securing 5G and Beyond-Enabled UAV Networks: Resilience Through Multiagent Learning and Transformers Detection
Authors:
Joseanne Viana,
Hamed Farkhari,
Victor P Gil Jimenez
Abstract:
Achieving resilience remains a significant challenge for Unmanned Aerial Vehicle (UAV) communications in 5G and 6G networks. Although UAVs benefit from superior positioning capabilities, rate optimization techniques, and extensive line-of-sight (LoS) range, these advantages alone cannot guarantee high reliability across diverse UAV use cases. This limitation becomes particularly evident in urban environments, where UAVs face vulnerability to jamming attacks and where LoS connectivity is frequently compromised by buildings and other physical obstructions. This paper introduces DET-FAIR-WINGS (Detection-Enhanced Transformer Framework for AI-Resilient Wireless Networks in Ground UAV Systems), a novel solution designed to enhance reliability in UAV communications under attacks. Our system leverages multi-agent reinforcement learning (MARL) and transformer-based detection algorithms to identify attack patterns within the network and subsequently select the most appropriate mechanisms to strengthen reliability in authenticated UAV-Base Station links. The DET-FAIR-WINGS approach integrates both discrete and continuous parameters. Discrete parameters include retransmission attempts, bandwidth partitioning, and notching mechanisms, while continuous parameters encompass beam angles and elevations from both the Base Station (BS) and user devices. The detection part integrates a transformer in the agents to speed up training. Our findings demonstrate that replacing fixed retransmission counts with AI-integrated flexible approaches in 5G networks significantly reduces latency by optimizing decision-making processes within 5G layers.
Submitted 3 May, 2025;
originally announced May 2025.
-
Real-Time, Single-Ear, Wearable ECG Reconstruction, R-Peak Detection, and HR/HRV Monitoring
Authors:
Carlos Santos,
Sebastian Frey,
Andrea Cossettini,
Luca Benini,
Victor Kartsch
Abstract:
Biosignal monitoring, in particular heart activity through heart rate (HR) and heart rate variability (HRV) tracking, is vital in enabling continuous, non-invasive tracking of physiological and cognitive states. Recent studies have explored compact, head-worn devices for HR and HRV monitoring to improve usability and reduce stigma. However, this approach is challenged by the current reliance on wet electrodes, which limits usability, the weakness of ear-derived signals, making HR/HRV extraction more complex, and the incompatibility of current algorithms for embedded deployment. This work introduces a single-ear wearable system for real-time ECG (Electrocardiogram) parameter estimation, which directly runs on BioGAP, an energy-efficient device for biosignal acquisition and processing. By combining SoA in-ear electrode technology, an optimized DeepMF algorithm, and BioGAP, our proposed subject-independent approach allows for robust extraction of HR/HRV parameters directly on the device with just 36.7 uJ/inference at comparable performance with respect to the current state-of-the-art architecture, achieving 0.49 bpm and 25.82 ms for HR/HRV mean errors, respectively and an estimated battery life of 36h with a total system power consumption of 7.6 mW. Clinical relevance: The ability to reconstruct ECG signals and extract HR and HRV paves the way for continuous, unobtrusive cardiovascular monitoring with head-worn devices. In particular, the integration of cardiovascular measurements in everyday-use devices (such as earbuds) has potential in continuous at-home monitoring to enable early detection of cardiovascular irregularities.
Submitted 3 May, 2025;
originally announced May 2025.
-
Impairments are Clustered in Latents of Deep Neural Network-based Speech Quality Models
Authors:
Fredrik Cumlin,
Xinyu Liang,
Victor Ungureanu,
Chandan K. A. Reddy,
Christian Schüldt,
Saikat Chatterjee
Abstract:
In this article, we provide an experimental observation: Deep neural network (DNN) based speech quality assessment (SQA) models have inherent latent representations where many types of impairments are clustered. While DNN-based SQA models are not trained for impairment classification, our experiments show good impairment classification results in an appropriate SQA latent representation. We investigate the clustering of impairments using various kinds of audio degradations that include different types of noises, waveform clipping, gain transition, pitch shift, compression, reverberation, etc. To visualize the clusters we perform classification of impairments in the SQA-latent representation domain using a standard k-nearest neighbor (kNN) classifier. We also develop a new DNN-based SQA model, named DNSMOS+, to examine whether an improvement in SQA leads to an improvement in impairment classification. The classification accuracy is 94% for LibriAugmented dataset with 16 types of impairments and 54% for ESC-50 dataset with 50 types of real noises.
Submitted 30 April, 2025;
originally announced April 2025.
-
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches
Authors:
Guodong Shen,
Yuqi Ouyang,
Junru Lu,
Yixuan Yang,
Victor Sanchez
Abstract:
Despite the prevailing transition from single-task to multi-task approaches in video anomaly detection, we observe that many adopt sub-optimal frameworks for individual proxy tasks. Motivated by this, we contend that optimizing single-task frameworks can advance both single- and multi-task approaches. Accordingly, we leverage middle-frame prediction as the primary proxy task, and introduce an effective hybrid framework designed to generate accurate predictions for normal frames and flawed predictions for abnormal frames. This hybrid framework is built upon a bi-directional structure that seamlessly integrates both vision transformers and ConvLSTMs. Specifically, we utilize this bi-directional structure to fully analyze the temporal dimension by predicting frames in both forward and backward directions, significantly boosting the detection stability. Given the transformer's capacity to model long-range contextual dependencies, we develop a convolutional temporal transformer that efficiently associates feature maps from all context frames to generate attention-based predictions for target frames. Furthermore, we devise a layer-interactive ConvLSTM bridge that facilitates the smooth flow of low-level features across layers and time-steps, thereby strengthening predictions with fine details. Anomalies are eventually identified by scrutinizing the discrepancies between target frames and their corresponding predictions. Several experiments conducted on public benchmarks affirm the efficacy of our hybrid framework, whether used as a standalone single-task approach or integrated as a branch in a multi-task approach. These experiments also underscore the advantages of merging vision transformers and ConvLSTMs for video anomaly detection.
Submitted 20 April, 2025;
originally announced April 2025.
-
Analysis of the MICCAI Brain Tumor Segmentation -- Metastases (BraTS-METS) 2025 Lighthouse Challenge: Brain Metastasis Segmentation on Pre- and Post-treatment MRI
Authors:
Nazanin Maleki,
Raisa Amiruddin,
Ahmed W. Moawad,
Nikolay Yordanov,
Athanasios Gkampenis,
Pascal Fehringer,
Fabian Umeh,
Crystal Chukwurah,
Fatima Memon,
Bojan Petrovic,
Justin Cramer,
Mark Krycia,
Elizabeth B. Shrickel,
Ichiro Ikuta,
Gerard Thompson,
Lorenna Vidal,
Vilma Kosovic,
Adam E. Goldman-Yassen,
Virginia Hill,
Tiffany So,
Sedra Mhana,
Albara Alotaibi,
Nathan Page,
Prisha Bhatia,
Yasaman Sharifi, et al. (218 additional authors not shown)
Abstract:
Despite continuous advancements in cancer treatment, brain metastatic disease remains a significant complication of primary cancer and is associated with an unfavorable prognosis. One approach for improving diagnosis, management, and outcomes is to implement algorithms based on artificial intelligence for the automated segmentation of both pre- and post-treatment MRI brain images. Such algorithms rely on volumetric criteria for lesion identification and treatment response assessment, which are still not available in clinical practice. Therefore, it is critical to establish rapid volumetric segmentation methods that can be translated to clinical practice and that are trained on high-quality annotated data. The BraTS-METS 2025 Lighthouse Challenge aims to address this critical need by establishing inter-rater and intra-rater variability in dataset annotation, generating high-quality annotated datasets from four individual instances of segmentation by neuroradiologists while being recorded on video (two instances segmenting "from scratch" and two instances after AI pre-segmentation). This high-quality annotated dataset will be used for the testing phase of the 2025 Lighthouse challenge and will be publicly released at the completion of the challenge. The 2025 Lighthouse challenge will also release the 2023 and 2024 segmented datasets that were annotated using an established pipeline of pre-segmentation, student annotation, two neuroradiologists checking, and one neuroradiologist finalizing the process. It builds upon its previous edition by including post-treatment cases in the dataset. Using these high-quality annotated datasets, the 2025 Lighthouse challenge plans to test benchmark algorithms for automated segmentation of pre- and post-treatment brain metastases (BM), trained on diverse and multi-institutional datasets of MRI images obtained from patients with brain metastases.
Submitted 6 May, 2025; v1 submitted 16 April, 2025;
originally announced April 2025.
-
Robust Visual Servoing under Human Supervision for Assembly Tasks
Authors:
Victor Nan Fernandez-Ayala,
Jorge Silva,
Meng Guo,
Dimos V. Dimarogonas
Abstract:
We propose a framework enabling mobile manipulators to reliably complete pick-and-place tasks for assembling structures from construction blocks. The picking uses an eye-in-hand visual servoing controller for object tracking with Control Barrier Functions (CBFs) to ensure fiducial markers in the blocks remain visible. An additional robot with an eye-to-hand setup ensures precise placement, critical for structural stability. We integrate human-in-the-loop capabilities for flexibility and fault correction and analyze robustness to camera pose errors, proposing adapted barrier functions to handle them. Lastly, experiments validate the framework on 6-DoF mobile arms.
Submitted 16 April, 2025;
originally announced April 2025.
-
Event-Triggered Source Seeking Control for Nonholonomic Systems
Authors:
Victor Hugo Pereira Rodrigues,
Tiago Roux Oliveira,
Miroslav Krstic
Abstract:
This paper introduces an event-triggered source seeking control (ET-SSC) for autonomous vehicles modeled as the nonholonomic unicycle. The classical source seeking control is enhanced with static-triggering conditions to enable aperiodic and less frequent updates of the system's input signals, offering a resource-aware control design. Our convergence analysis is based on time-scaling combined with Lyapunov and averaging theories for systems with discontinuous right-hand sides. ET-SSC ensures exponentially stable behavior for the resulting average system, leading to practical asymptotic convergence to a small neighborhood of the source point. We guarantee the avoidance of Zeno behavior by establishing a minimum dwell time to prevent infinitely fast switching. The performance optimization is aligned with classical continuous-time source seeking algorithms while balancing system performance with actuation resource consumption. Our ET-SSC algorithm, the first of its kind, allows for arbitrarily large inter-sampling times, overcoming the limitations of classical sampled-data implementations for source seeking control.
Submitted 10 April, 2025;
originally announced April 2025.
-
Enabling Continuous 5G Connectivity in Aircraft through Low Earth Orbit Satellites
Authors:
Raúl Parada,
Victor Monzon Baeza,
Carlos Horcajo Fernández de Gamboa,
Rocío Serrano Camacho,
Carlos Monzo
Abstract:
As air travel demand increases, uninterrupted high-speed internet access becomes essential. However, current satellite-based systems face latency and connectivity challenges. While prior research has focused on terrestrial 5G and geostationary satellites, there is a gap in optimizing Low Earth Orbit (LEO)-based 5G systems for aircraft. This study evaluates the feasibility of deployment strategies and of improving signal quality with LEO satellites for seamless in-flight 5G connectivity. Using MATLAB and Simulink, we model satellite trajectories, aircraft movement, and handover mechanisms, complemented by ray-tracing techniques for in-cabin signal analysis. Results show that the proposed LEO satellite configurations enhance coverage and reduce latency, with sequential handovers minimizing service interruptions. These findings contribute to advancing in-flight 5G networks, improving passenger experience, and supporting real-time global connectivity solutions.
Submitted 9 April, 2025;
originally announced April 2025.
-
Unit-Vector Control Design under Saturating Actuators
Authors:
Andevaldo da Encarnação Vitório,
Pedro Henrique Silva Coutinho,
Iury Bessa,
Victor Hugo Pereira Rodrigues,
Tiago Roux Oliveira
Abstract:
This paper deals with unit vector control design for multivariable polytopic uncertain systems under saturating actuators. For that purpose, we propose LMI-based conditions to design the unit vector control gain such that the origin of the closed-loop system is finite-time stable. Moreover, an optimization problem is provided to obtain an enlarged estimate of the region of attraction of the equilibrium point for the closed-loop system, where the convergence of trajectories is ensured even in the presence of saturation functions. Numerical simulations illustrate the effectiveness of the proposed approach.
Submitted 9 April, 2025;
originally announced April 2025.
-
Matched Topological Subspace Detector
Authors:
Chengen Liu,
Victor M. Tenorio,
Antonio G. Marques,
Elvin Isufi
Abstract:
Topological spaces, represented by simplicial complexes, capture richer relationships than graphs by modeling interactions not only between nodes but also among higher-order entities, such as edges or triangles. This motivates the representation of information defined in irregular domains as topological signals. By leveraging the spectral dualities of Hodge and Dirac theory, practical topological signals often concentrate in specific spectral subspaces (e.g., gradient or curl). For instance, in a foreign currency exchange network, the exchange flow signals typically satisfy the arbitrage-free condition and hence are curl-free. However, the presence of anomalies can disrupt these conditions, causing the signals to deviate from such subspaces. In this work, we formulate a hypothesis testing framework to detect whether simplicial complex signals lie in specific subspaces in a principled and tractable manner. Concretely, we propose Neyman-Pearson matched topological subspace detectors for signals defined at a single simplicial level (such as edges) or jointly across all levels of a simplicial complex. The proposed energy-based projection detectors handle missing values, provide closed-form performance analysis, and effectively capture the unique topological properties of the data. We demonstrate the effectiveness of the proposed topological detectors on various real-world data, including foreign currency exchange networks.
Submitted 8 April, 2025;
originally announced April 2025.
-
AI-Driven Tactical Communications and Networking for Defense: A Survey and Emerging Trends
Authors:
Victor Monzon Baeza,
Raúl Parada,
Laura Concha Salor,
Carlos Monzo
Abstract:
The integration of Artificial Intelligence (AI) in military communications and networking is reshaping modern defense strategies, enhancing secure data exchange, real-time situational awareness, and autonomous decision-making. This survey explores how AI-driven technologies improve tactical communication networks, radar-based data transmission, UAV-assisted relay systems, and electronic warfare resilience. The study highlights AI applications in adaptive signal processing, multi-agent coordination for network optimization, radar-assisted target tracking, and AI-driven electronic countermeasures. Our work introduces a novel three-criteria evaluation methodology. It systematically assesses AI applications based on general system objectives, communications constraints in the military domain, and critical tactical environmental factors. We analyze key AI techniques for different types of learning applied to multi-domain network interoperability and distributed data information fusion in military operations. We also address challenges such as adversarial AI threats, the real-time adaptability of autonomous communication networks, and the limitations of current AI models under battlefield conditions. Finally, we discuss emerging trends in self-healing networks, AI-augmented decision support systems, and intelligent spectrum allocation. We provide a structured roadmap for future AI-driven defense communications and networking research.
Submitted 7 April, 2025;
originally announced April 2025.
-
A Simultaneous Approach for Training Neural Differential-Algebraic Systems of Equations
Authors:
Laurens R. Lueg,
Victor Alves,
Daniel Schicksnus,
John R. Kitchin,
Carl D. Laird,
Lorenz T. Biegler
Abstract:
Scientific machine learning is an emerging field that broadly describes the combination of scientific computing and machine learning to address challenges in science and engineering. Within the context of differential equations, this has produced highly influential methods, such as neural ordinary differential equations (NODEs). Recent works extend this line of research to consider neural differential-algebraic systems of equations (DAEs), where some unknown relationships within the DAE are learned from data. Training neural DAEs, similarly to neural ODEs, is computationally expensive, as it requires the solution of a DAE for every parameter update. Further, the rigorous consideration of algebraic constraints is difficult within common deep learning training algorithms such as stochastic gradient descent. In this work, we apply the simultaneous approach to neural DAE problems, resulting in a fully discretized nonlinear optimization problem, which is solved to local optimality and simultaneously obtains the neural network parameters and the solution to the corresponding DAE. We extend recent work demonstrating the simultaneous approach for neural ODEs, by presenting a general framework to solve neural DAEs, with explicit consideration of hybrid models, where some components of the DAE are known, e.g. physics-informed constraints. Furthermore, we present a general strategy for improving the performance and convergence of the nonlinear programming solver, based on solving an auxiliary problem for initialization and approximating Hessian terms. We achieve promising results in terms of accuracy, model generalizability and computational cost, across different problem settings such as sparse data, unobserved states and multiple trajectories. Lastly, we provide several promising future directions to improve the scalability and robustness of our approach.
Submitted 6 April, 2025;
originally announced April 2025.
-
Optimized Vehicular Antenna Placement for Phase-Coherent Positioning
Authors:
Victor Pettersson,
Musa Furkan Keskin,
Carina Marcus,
Henk Wymeersch
Abstract:
Distributed multi-antenna systems are an important enabling technology for future intelligent transportation systems (ITS), showing promising performance in vehicular communications and near-field (NF) localization applications. This work investigates optimal deployments of phase-coherent sub-arrays on a vehicle for NF localization in terms of a Cramér-Rao lower bound (CRLB)-based metric. Sub-array placements consider practical geometrical constraints on a three-dimensional vehicle model accounting for self-occlusions. Results show that, for coherent NF localization of the vehicle, the aperture spanned by the sub-arrays should be maximized and a larger number of sub-arrays results in more even coverage over the vehicle orientations under a fixed total number of antenna elements, contrasting with the outcomes of incoherent localization. Moreover, while coherent NF processing significantly enhances accuracy, it also leads to more intricate cost functions, necessitating computationally more complex algorithms than incoherent processing.
Submitted 28 March, 2025;
originally announced March 2025.
-
Control of Humanoid Robots with Parallel Mechanisms using Kinematic Actuation Models
Authors:
Victor Lutz,
Ludovic de Matteïs,
Virgile Batto,
Nicolas Mansard
Abstract:
Inspired by the mechanical design of Cassie, several recently released humanoid robots use actuator configurations in which the motor is displaced from the joint location to optimize the leg inertia. This in turn induces a non-linearity in the reduction ratio of the transmission, which is often neglected when computing the robot motion (e.g. by trajectory optimization or reinforcement learning) and only accounted for at control time. This paper proposes an analytical method to efficiently handle this non-linearity. Using this actuation model, we demonstrate that we can leverage the dynamic abilities of the non-linear transmission while only modeling the inertia of the main serial chain of the leg, without approximating the motor capabilities or the joint range. Based on analytical inverse kinematics, our method does not need any numerical routines dedicated to the closed-kinematics actuation, hence leading to very efficient computations. Our study focuses on two mechanisms widely used in recent humanoid robots: the four-bar knee linkage and a parallel 2-DoF ankle mechanism. We integrate these models into optimization-based (DDP) and learning-based (PPO) control approaches. A comparison of our model against a simplified model that completely neglects closed chains is then shown in simulation.
Submitted 28 March, 2025;
originally announced March 2025.
-
Vision-to-Music Generation: A Survey
Authors:
Zhaokai Wang,
Chenxi Bao,
Le Zhuo,
Jingrui Han,
Yang Yue,
Yihong Tang,
Victor Shea-Jay Huang,
Yue Liao
Abstract:
Vision-to-music generation, including video-to-music and image-to-music tasks, is a significant branch of multimodal artificial intelligence demonstrating vast application prospects in fields such as film scoring, short video creation, and dance music synthesis. However, compared to the rapid development of modalities like text and images, research in vision-to-music is still in its preliminary stage due to its complex internal structure and the difficulty of modeling dynamic relationships with video. Existing surveys focus on general music generation without comprehensive discussion of vision-to-music. In this paper, we systematically review the research progress in the field of vision-to-music generation. We first analyze the technical characteristics and core challenges for three input types (general videos, human movement videos, and images) and two output types (symbolic music and audio music). We then summarize the existing methodologies for vision-to-music generation from the architecture perspective. A detailed review of common datasets and evaluation metrics is provided. Finally, we discuss current challenges and promising directions for future research. We hope our survey can inspire further innovation in vision-to-music generation and the broader field of multimodal generation in academic research and industrial applications. To track the latest work and foster further innovation in this field, we are continuously maintaining a GitHub repository at https://github.com/wzk1015/Awesome-Vision-to-Music-Generation.
Submitted 27 March, 2025;
originally announced March 2025.
-
Insights into the explainability of Lasso-based DeePC for nonlinear systems
Authors:
Gianluca Giacomelli,
Simone Formentin,
Victor G. Lopez,
Matthias A. Müller,
Valentina Breschi
Abstract:
Data-enabled Predictive Control (DeePC) has recently gained the spotlight as an easy-to-use control technique that allows for constraint handling while relying on raw data only. Initially proposed for linear time-invariant systems, several DeePC extensions are now available to cope with nonlinear systems. Nonetheless, these solutions mainly focus on ensuring the controller's effectiveness, overlooking the explainability of the final result. As a step toward explaining the outcome of DeePC for the control of nonlinear systems, in this paper, we focus on analyzing the earliest and simplest DeePC approach proposed to cope with nonlinearities in the controlled system, using a Lasso regularization. Our theoretical analysis highlights that the decisions undertaken by DeePC with Lasso regularization are unexplainable, as control actions are determined by data incoherent with the system's local behavior. This result is true even when the available input/output samples are grouped according to the different operating conditions explored during data collection. Our numerical study confirms these findings, highlighting the benefits of data grouping in terms of performance while showing that explainability remains a challenge in control design via DeePC.
Submitted 11 April, 2025; v1 submitted 24 March, 2025;
originally announced March 2025.
-
The Impact of Artificial Intelligence on Emergency Medicine: A Review of Recent Advances
Authors:
Gustavo Correia,
Victor Alves,
Paulo Novais
Abstract:
Artificial Intelligence (AI) is revolutionizing emergency medicine by enhancing diagnostic processes and improving patient outcomes. This article provides a review of the current applications of AI in emergency imaging studies, focusing on the last five years of advancements. AI technologies, particularly machine learning and deep learning, are pivotal in interpreting complex imaging data, offering rapid, accurate diagnoses and potentially surpassing traditional diagnostic methods. Studies highlighted within the article demonstrate AI's capabilities in accurately detecting conditions such as fractures, pneumothorax, and pulmonary diseases from various imaging modalities including X-rays, CT scans, and MRIs. Furthermore, AI's ability to predict clinical outcomes like mechanical ventilation needs illustrates its potential in crisis resource optimization. Despite these advancements, the integration of AI into clinical practice presents challenges such as data privacy, algorithmic bias, and the need for extensive validation across diverse settings. This review underscores the transformative potential of AI in emergency settings, advocating for a future where AI and clinical expertise synergize to elevate patient care standards.
Submitted 17 March, 2025;
originally announced March 2025.
-
Blockchain-Enabled Management Framework for Federated Coalition Networks
Authors:
Jorge Álvaro González,
Ana María Saiz García,
Victor Monzon Baeza
Abstract:
In a globalized and interconnected world, interoperability has become a key concept for advancing tactical scenarios. Federated Coalition Networks (FCN) enable cooperation between entities from multiple nations while allowing each to maintain control over their systems. However, this interoperability necessitates the sharing of increasing amounts of information between different tactical assets, raising the need for stronger security measures. Emerging technologies like blockchain are driving a revolution in secure communications, paving the way for new tactical scenarios. In this work, we propose a blockchain-based framework to enhance the resilience and security of the management of these networks. We offer a guide to FCN design that helps a broad audience understand military networks in international missions through a use case and key functions applied to a proposed architecture. To validate this framework, we evaluate its effectiveness and performance in information encryption.
Submitted 12 March, 2025;
originally announced March 2025.
-
Efficient Coordination and Synchronization of Multi-Robot Systems Under Recurring Linear Temporal Logic
Authors:
Davide Peron,
Victor Nan Fernandez-Ayala,
Eleftherios E. Vlahakis,
Dimos V. Dimarogonas
Abstract:
We consider multi-robot systems under recurring tasks formalized as linear temporal logic (LTL) specifications. To solve the planning problem efficiently, we propose a bottom-up approach combining offline plan synthesis with online coordination, dynamically adjusting plans via real-time communication. To address action delays, we introduce a synchronization mechanism ensuring coordinated task execution, leading to a multi-agent coordination and synchronization framework that is adaptable to a wide range of multi-robot applications. The software package is developed in Python and ROS2 for broad deployment. We validate our findings through lab experiments involving nine robots showing enhanced adaptability compared to previous methods. Additionally, we conduct simulations with up to ninety agents to demonstrate the reduced computational complexity and the scalability features of our work.
Submitted 23 February, 2025;
originally announced February 2025.
-
Inferring System and Optimal Control Parameters of Closed-Loop Systems from Partial Observations
Authors:
Victor Geadah,
Juncal Arbelaiz,
Harrison Ritz,
Nathaniel D. Daw,
Jonathan D. Cohen,
Jonathan W. Pillow
Abstract:
We consider the joint problem of system identification and inverse optimal control for discrete-time stochastic Linear Quadratic Regulators. We analyze finite and infinite time horizons in a partially observed setting, where the state is observed noisily. To recover closed-loop system parameters, we develop inference methods based on probabilistic state-space model (SSM) techniques. First, we show that the system parameters are non-identifiable from closed-loop measurements in the infinite-horizon setting, and we provide exact and numerical methods to disentangle the parameters. Second, we show that parameter identifiability can be improved by either (1) incorporating additional partial measurements of the control signals or (2) moving to the finite-horizon setting. We further illustrate the performance of our methodology through numerical examples.
Submitted 20 February, 2025;
originally announced February 2025.
-
Radar Network for Gait Monitoring: Technology and Validation
Authors:
Ignacio E. López-Delgado,
Víctor Navarro-López,
Francisco Grandas-Pérez,
Juan I. Godino-Llorente,
Jesús Grajal
Abstract:
In recent years, radar-based devices have emerged as an alternative approach for gait monitoring. However, the radar configuration and the algorithms used to extract the gait parameters often differ between contributions, lacking a systematic evaluation of the most appropriate setup. Additionally, radar-based studies often exclude motorically impaired subjects, leaving it unclear whether the existing algorithms are applicable to such populations. In this paper, a radar network is developed and validated by monitoring the gait of five healthy individuals and three patients with Parkinson's disease. Six configurations and four algorithms were compared using Vicon as ground-truth to determine the most appropriate solution for gait monitoring. The best results were obtained using only three nodes: two oriented towards the feet and one towards the torso. The most accurate stride velocity and distance in the state of the art were obtained with this configuration. Moreover, we show that analyzing the feet velocity increases the reliability of the temporal parameters, especially with aged or motorically impaired subjects. The contribution is significant for the implementation of radar networks in clinical and domestic environments, as it addresses critical aspects concerning the radar network configuration and algorithms.
Submitted 27 May, 2025; v1 submitted 18 February, 2025;
originally announced February 2025.
-
Inception networks, Data Augmentation and Transfer Learning in EEG-based photosensitivity diagnosis
Authors:
Fernando Moncada Martins,
Víctor M. González,
José R. Villar,
Beatriz García López,
Ana Isabel Gómez-Menéndez
Abstract:
Photosensitivity refers to a neurophysiological condition in which the brain generates epileptic discharges known as Photoparoxysmal Responses (PPR) in response to light flashes. In severe cases, these PPR can lead to epileptic seizures. The standardized diagnostic procedure for this condition is called Intermittent Photic Stimulation. During this procedure, the patient is exposed to a flashing light, aiming to trigger these epileptic reactions while preventing their full development. Meanwhile, brain activity is monitored using Electroencephalography, which is visually analyzed by clinical staff to identify these responses. Hence, the automatic detection of PPR becomes a highly unbalanced problem that has been barely studied in the literature due to photosensitivity's low prevalence. This research tackles this problem and proposes using Inception-based Deep Learning (DL) neural networks that, together with transfer learning, are trained on epileptic seizure detection and fine-tuned on the automatic PPR detection task. A Data Augmentation (DA) technique is also applied to balance the available data set, and its effects on the DL models are evaluated. The proposal outperformed state-of-the-art solutions in the literature, achieving higher values on standard performance metrics, with DA significantly improving Sensitivity without affecting Accuracy and Specificity. This project is currently being developed with patients from Burgos University Hospital, Spain.
Submitted 31 January, 2025;
originally announced February 2025.
-
Towards Patient-Specific Surgical Planning for Bicuspid Aortic Valve Repair: Fully Automated Segmentation of the Aortic Valve in 4D CT
Authors:
Zaiyang Guo,
Ningjun J Dong,
Harold Litt,
Natalie Yushkevich,
Melanie Freas,
Jessica Nunez,
Victor Ferrari,
Jilei Hao,
Shir Goldfinger,
Matthew A. Jolley,
Joseph Bavaria,
Nimesh Desai,
Alison M. Pouch
Abstract:
The bicuspid aortic valve (BAV) is the most prevalent congenital heart defect and may require surgery for complications such as stenosis, regurgitation, and aortopathy. BAV repair surgery is effective but challenging due to the heterogeneity of BAV morphology. Multiple imaging modalities can be employed to assist the quantitative assessment of BAVs for surgical planning. Contrast-enhanced 4D computed tomography (CT) produces volumetric temporal sequences with excellent contrast and spatial resolution. Segmentation of the aortic cusps and root in these images is an essential step in creating patient specific models for visualization and quantification. While deep learning-based methods are capable of fully automated segmentation, no BAV-specific model exists. Among valve segmentation studies, there has been limited quantitative assessment of the clinical usability of the segmentation results. In this work, we developed a fully automated multi-label BAV segmentation pipeline based on nnU-Net. The predicted segmentations were used to carry out surgically relevant morphological measurements including geometric cusp height, commissural angle and annulus diameter, and the results were compared against manual segmentation. Automated segmentation achieved average Dice scores of over 0.7 and symmetric mean distance below 0.7 mm for all three aortic cusps and the root wall. Clinically relevant benchmarks showed good consistency between manual and predicted segmentations. Overall, fully automated BAV segmentation of 3D frames in 4D CT can produce clinically usable measurements for surgical risk stratification, but the temporal consistency of segmentations needs to be improved.
Submitted 13 February, 2025;
originally announced February 2025.
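The abstract above reports segmentation quality as a Dice score. As a reminder of what that metric measures, here is a minimal sketch for two binary masks. The function name `dice_score` and the toy masks are illustrative assumptions and are not taken from the paper's nnU-Net-based pipeline; returning 1.0 when both masks are empty is one common convention, assumed here.

```python
def dice_score(pred, truth):
    """Dice coefficient 2|A∩B| / (|A| + |B|) for two flat binary masks."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of voxels")
    # Voxels labelled foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, truth) if p and t)
    # Total foreground voxels across the two masks.
    size_sum = sum(1 for p in pred if p) + sum(1 for t in truth if t)
    if size_sum == 0:
        # Assumed convention: two empty masks agree perfectly.
        return 1.0
    return 2.0 * intersection / size_sum


# Toy example (hypothetical data): 2 shared foreground voxels, 3 in each mask.
predicted = [1, 1, 0, 0, 1]
manual = [1, 0, 0, 1, 1]
score = dice_score(predicted, manual)
```

For the toy masks the score is 2·2 / (3 + 3) = 2/3; identical masks give 1 and disjoint masks give 0.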
-
Integrated Optimization and Game Theory Framework for Fair Cost Allocation in Community Microgrids
Authors:
K. Victor Sam Moses Babu,
Pratyush Chakraborty,
Mayukha Pal
Abstract:
Fair cost allocation in community microgrids remains a significant challenge due to the complex interactions between multiple participants with varying load profiles, distributed energy resources, and storage systems. Traditional cost allocation methods often fail to adequately address the dynamic nature of participant contributions and benefits, leading to inequitable distribution of costs and reduced participant satisfaction. This paper presents a novel framework integrating multi-objective optimization with cooperative game theory for fair and efficient microgrid operation and cost allocation. The proposed approach combines mixed-integer linear programming for optimal resource dispatch with Shapley value analysis for equitable benefit distribution, ensuring both system efficiency and participant satisfaction. The framework was validated using real-world data across six distinct operational scenarios, demonstrating significant improvements in both technical and economic performance. Results show peak demand reductions ranging from 7.8% to 62.6%, solar utilization rates reaching 114.8% through effective storage integration, and cooperative gains of up to $1,801.01 per day. The Shapley value-based allocation achieved balanced benefit-cost distributions, with net positions ranging from -16.0% to +14.2% across different load categories, ensuring sustainable participant cooperation.
Submitted 12 February, 2025;
originally announced February 2025.
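The abstract above relies on Shapley value analysis for cost allocation. The following is a minimal sketch of an exact Shapley computation for a small cooperative cost game, averaging each participant's marginal cost over all join orders. The participant names, the fixed shared cost of 6, and the per-participant usage figures are hypothetical toy values, not data from the paper, and the paper's dispatch optimization is not modeled here.

```python
from itertools import permutations


def shapley_values(players, cost):
    """Exact Shapley values: mean marginal cost of each player over all orderings."""
    totals = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        coalition = frozenset()
        for p in order:
            with_p = coalition | {p}
            # Marginal cost of adding p to the players who joined before it.
            totals[p] += cost(with_p) - cost(coalition)
            coalition = with_p
    return {p: totals[p] / len(orderings) for p in players}


# Hypothetical toy game: a shared fixed cost plus each member's own usage cost.
USAGE = {"A": 1.0, "B": 2.0, "C": 3.0}
FIXED = 6.0


def toy_cost(coalition):
    if not coalition:
        return 0.0
    return FIXED + sum(USAGE[p] for p in coalition)


allocation = shapley_values(list(USAGE), toy_cost)
```

In this toy game each participant pays its own usage plus an equal third of the fixed cost, and the allocations sum to the cost of the full coalition. Enumerating all orderings grows factorially with the number of participants, so this form only suits small groups.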