-
Sequential Change Point Detection in High-dimensional Vector Auto-regressive Models
Authors:
Yuhan Tian,
Abolfazl Safikhani
Abstract:
Sequential (online) change-point detection involves continuously monitoring time-series data and triggering an alarm when shifts in the data distribution are detected. We propose an algorithm for real-time identification of alterations in the transition matrices of high-dimensional vector autoregressive models. The algorithm estimates transition matrices and error term variances using regularization techniques applied to training data, then computes a specific test statistic to detect changes in transition matrices as new data batches arrive. We establish the asymptotic normality of the test statistic under the scenario of no change points, subject to mild conditions. An alarm is raised when the calculated test statistic exceeds a predefined quantile of the standard normal distribution. We demonstrate that, as the size of the change (jump size) increases, the test power approaches one. The effectiveness of the algorithm is validated empirically across various simulation scenarios. Finally, we present two applications of the proposed methodology: analyzing shocks in S&P 500 data and detecting the timing of seizures in EEG data.
Submitted 12 December, 2024;
originally announced December 2024.
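The alarm rule described in the abstract above (raise an alarm once the test statistic exceeds a predefined standard-normal quantile) can be sketched as follows. This is only an illustration under stated assumptions: the paper's actual test statistic is not given in the abstract, so the per-batch values are made up, and the function name `first_alarm`, the one-sided comparison, and the level `alpha` are hypothetical choices rather than the authors' procedure.

```python
from statistics import NormalDist


def first_alarm(statistics_stream, alpha=0.05):
    """Return the index of the first batch whose statistic exceeds the
    (1 - alpha) standard-normal quantile, or None if no alarm is raised.

    Assumption: a one-sided rule; the abstract only says "exceeds a
    predefined quantile of the standard normal distribution".
    """
    threshold = NormalDist().inv_cdf(1.0 - alpha)
    for batch_index, value in enumerate(statistics_stream):
        if value > threshold:
            return batch_index
    return None


# Made-up, already-standardized statistics, one per incoming data batch.
batch_statistics = [0.3, -1.1, 0.8, 1.2, 3.4, 4.0]
alarm_at = first_alarm(batch_statistics, alpha=0.01)
print(alarm_at)
```

Monitoring stops at the first exceedance, which matches the sequential setting in which later batches are never needed once an alarm fires.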
-
Learning k-Inductive Control Barrier Certificates for Unknown Nonlinear Dynamics Beyond Polynomials
Authors:
Ben Wooding,
Abolfazl Lavaei
Abstract:
This work is concerned with synthesizing safety controllers for discrete-time nonlinear systems beyond polynomials with unknown mathematical models using the notion of k-inductive control barrier certificates (k-CBCs). Conventional CBC conditions (with k=1) for ensuring safety over dynamical systems are often restrictive, as they require the CBCs to be non-increasing at every time step. Inspired by the success of k-induction in software verification, k-CBCs relax this requirement by allowing the barrier function to be non-increasing over k steps, while permitting k-1 (one-step) increases, each up to a threshold epsilon. This relaxation enhances the likelihood of finding feasible k-CBCs while providing safety guarantees across the dynamical systems. Despite showing promise, existing approaches for constructing k-CBCs often rely on precise mathematical knowledge of system dynamics, which is frequently unavailable in practical scenarios. In this work, we address the case where the underlying dynamics are unknown, a common occurrence in real-world applications, and employ the concept of persistency of excitation, grounded in Willems et al.'s fundamental lemma. This result implies that input-output data from a single trajectory can capture the behavior of an unknown system, provided the collected data fulfills a specific rank condition. We employ sum-of-squares (SOS) programming to synthesize the k-CBC as well as the safety controller directly from data while ensuring the safe behavior of the unknown system. The efficacy of our approach is demonstrated through a set of physical benchmarks with unknown dynamics, including a DC motor, an RLC circuit, a nonlinear nonpolynomial car, and a nonlinear polynomial Lorenz attractor.
Submitted 10 December, 2024;
originally announced December 2024.
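The relaxed decrease requirement described in the abstract above (one-step increases of at most epsilon, but no increase over k steps) can be illustrated by checking it along a single sampled sequence of barrier values. This is a rough sketch, not the paper's synthesis method: the function name and the sample values are hypothetical, and a real k-CBC must hold over the whole state set together with conditions on the initial and unsafe sets, which this check ignores.

```python
def satisfies_k_inductive_decrease(barrier_values, k, epsilon):
    """Check two conditions along one sequence of barrier values:
    (a) every one-step increase is at most epsilon, and
    (b) the value does not increase over any window of k steps.
    """
    n = len(barrier_values)
    one_step_ok = all(
        barrier_values[t + 1] - barrier_values[t] <= epsilon
        for t in range(n - 1)
    )
    k_step_ok = all(
        barrier_values[t + k] <= barrier_values[t]
        for t in range(n - k)
    )
    return one_step_ok and k_step_ok


# Hypothetical barrier values along one trajectory.
values = [1.0, 1.2, 0.9, 1.1, 0.8, 0.7]
print(satisfies_k_inductive_decrease(values, k=2, epsilon=0.3))
print(satisfies_k_inductive_decrease(values, k=1, epsilon=0.3))
```

With k=1 the second condition reduces to the conventional requirement that the barrier never increases, which this sequence violates; with k=2 the same sequence passes.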
-
Development of Neural Network-Based Optimal Control Pulse Generator for Quantum Logic Gates Using the GRAPE Algorithm in NMR Quantum Computer
Authors:
Ebrahim Khaleghian,
Arash Fath Lipaei,
Abolfazl Bahrampour,
Morteza Nikaeen,
Alireza Bahrampour
Abstract:
In this paper, we introduce a neural network to generate the optimal control pulses for general single-qubit quantum logic gates, within a Nuclear Magnetic Resonance (NMR) quantum computer. By utilizing a neural network, we can efficiently implement any single-qubit quantum logic gates within a reasonable time scale. The network is trained by control pulses generated by the GRAPE algorithm, all starting from the same initial point. After implementing the network, we tested it using numerical simulations. Also, we present the results of applying Neural Network-generated pulses to a three-qubit benchtop NMR system and compare them with simulation outcomes. These numerical and experimental results showcase the precision of the Neural Network-generated pulses in executing the desired dynamics. Ultimately, by developing the neural network using the GRAPE algorithm, we discover the function that maps any single-qubit gate to its corresponding pulse shape. This model enables the real-time generation of arbitrary single-qubit pulses. When combined with the GRAPE-generated pulse for the CNOT gate, it creates a comprehensive and effective set of universal gates. This set can efficiently implement any algorithm in noisy intermediate-scale quantum computers (NISQ era), thereby enhancing the capabilities of quantum optimal control in this domain. Additionally, this approach can be extended to other quantum computer platforms with similar Hamiltonians.
Submitted 8 December, 2024;
originally announced December 2024.
-
Model-Agnostic Meta-Learning for Fault Diagnosis of Induction Motors in Data-Scarce Environments with Varying Operating Conditions and Electric Drive Noise
Authors:
Ali Pourghoraba,
MohammadSadegh KhajueeZadeh,
Ali Amini,
Abolfazl Vahedi,
Gholam Reza Agah,
Akbar Rahideh
Abstract:
Reliable mechanical fault detection with limited data is crucial for the effective operation of induction machines, particularly given the real-world challenges present in industrial datasets, such as significant imbalances between healthy and faulty samples and the scarcity of data representing faulty conditions. This research introduces an innovative meta-learning approach to address these issues, focusing on mechanical fault detection in induction motors across diverse operating conditions while mitigating the adverse effects of drive noise in scenarios with limited data. The process of identifying faults under varying operating conditions is framed as a few-shot classification challenge and approached through a model-agnostic meta-learning strategy. Specifically, this approach begins with training a meta-learner across multiple interconnected fault-diagnosis tasks conducted under different operating conditions. In this stage, cross-entropy is utilized to optimize parameters and develop a robust representation of the tasks. Subsequently, the parameters of the meta-learner are fine-tuned for new tasks, enabling rapid adaptation using only a small number of samples. This method achieves excellent accuracy in fault detection across various conditions, even when data availability is restricted. The findings indicate that the proposed model outperforms other sophisticated techniques, providing enhanced generalization and quicker adaptation. The accuracy of fault diagnosis reaches a minimum of 99%, underscoring the model's effectiveness for reliable fault identification.
Submitted 3 April, 2025; v1 submitted 5 December, 2024;
originally announced December 2024.
-
A Physics-Informed Scenario Approach with Data Mitigation for Safety Verification of Nonlinear Systems
Authors:
Ali Aminzadeh,
MohammadHossein Ashoori,
Amy Nejati,
Abolfazl Lavaei
Abstract:
This paper develops a physics-informed scenario approach for safety verification of nonlinear systems using barrier certificates (BCs) to ensure that system trajectories remain within safe regions over an infinite time horizon. Designing BCs often relies on an accurate dynamics model; however, such models are often imprecise due to the model complexity involved, particularly when dealing with highly nonlinear systems. In such cases, while scenario approaches effectively address the safety problem using collected data to construct a guaranteed BC for the dynamical system, they often require substantial amounts of data (sometimes millions of samples) due to exponential sample complexity. To address this, we propose a physics-informed scenario approach that selects data samples such that the outputs of the physics-based model and the observed data are sufficiently close (within a specified threshold). This approach guides the scenario optimization process to eliminate redundant samples and significantly reduce the required dataset size. We demonstrate the capability of our approach in mitigating the amount of data required for scenario optimizations with both deterministic (i.e., confidence 1) and probabilistic (i.e., confidence between 0 and 1) guarantees. We validate our physics-informed scenario approach through two physical case studies, showcasing its practical application in reducing the required data.
Submitted 5 December, 2024;
originally announced December 2024.
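The sample-selection step described in the abstract above (keep samples for which the physics-based model output and the observed data agree within a threshold) might look like the sketch below. Everything concrete here is an assumption for illustration: the function name, the max-norm distance, the toy linear model, and the data are hypothetical, and the abstract does not say how the selected samples enter the scenario program.

```python
def select_consistent_samples(samples, physics_model, threshold):
    """Keep (state, observed_next) pairs whose observed successor lies
    within `threshold` (max-norm, an assumed choice) of the prediction
    made by the physics-based model."""
    kept = []
    for state, observed_next in samples:
        predicted_next = physics_model(state)
        gap = max(abs(p - o) for p, o in zip(predicted_next, observed_next))
        if gap <= threshold:
            kept.append((state, observed_next))
    return kept


def toy_physics_model(state):
    """Hypothetical physics-based model: each coordinate halves per step."""
    return tuple(0.5 * coordinate for coordinate in state)


# Hypothetical (state, observed next state) pairs.
observations = [
    ((1.0, 2.0), (0.5, 1.0)),   # matches the model exactly
    ((2.0, 2.0), (1.4, 1.0)),   # first coordinate is off by 0.4
    ((0.0, 4.0), (0.05, 2.0)),  # off by 0.05
]
selected = select_consistent_samples(observations, toy_physics_model, threshold=0.1)
print(len(selected))
```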
-
Learning Robust Safety Controllers for Uncertain Input-Affine Polynomial Systems
Authors:
Omid Akbarzadeh,
Abolfazl Lavaei
Abstract:
This paper offers a direct data-driven approach for learning robust control barrier certificates (R-CBCs) and robust safety controllers (R-SCs) for discrete-time input-affine polynomial systems with unknown dynamics under unknown-but-bounded disturbances. The proposed method relies on data from input-state observations collected over a finite-time horizon while satisfying a specific rank condition to ensure the system is persistently excited. Our data-driven scheme enables the synthesis of R-CBCs and R-SCs directly from observed data, bypassing the need for explicit modeling of the system's dynamics and thus ensuring robust system safety against disturbances within a finite time horizon. Our proposed approach is formulated as a sum-of-squares (SOS) optimization problem, providing a structured design framework. Two case studies showcase our method's capability to provide robust safety guarantees for unknown input-affine polynomial systems under bounded disturbances, demonstrating its practical effectiveness.
Submitted 5 December, 2024;
originally announced December 2024.
-
Certified Learning of Incremental ISS Controllers for Unknown Nonlinear Polynomial Dynamics
Authors:
Mahdieh Zaker,
David Angeli,
Abolfazl Lavaei
Abstract:
Incremental input-to-state stability (delta-ISS) offers a robust framework to ensure that small input variations result in proportionally minor deviations in the state of a nonlinear system. This property is essential in practical applications where input precision cannot be guaranteed. However, analyzing delta-ISS demands detailed knowledge of system dynamics to assess the state's incremental response to input changes, posing a challenge in real-world scenarios where mathematical models are unknown. In this work, we develop a data-driven approach to design delta-ISS Lyapunov functions together with their corresponding delta-ISS controllers for continuous-time input-affine nonlinear systems with polynomial dynamics, ensuring the delta-ISS property is achieved without requiring knowledge of the system dynamics. In our data-driven scheme, we collect only two sets of input-state trajectories from sufficiently excited dynamics, as introduced by Willems et al.'s fundamental lemma. By fulfilling a specific rank condition, we design delta-ISS controllers using the collected samples through formulating a sum-of-squares optimization program. The effectiveness of our data-driven approach is evidenced by its application on a physical case study.
Submitted 5 December, 2024;
originally announced December 2024.
-
Abstraction-based Control of Unknown Continuous-Space Models with Just Two Trajectories
Authors:
Behrad Samari,
Mahdieh Zaker,
Abolfazl Lavaei
Abstract:
Finite abstractions (a.k.a. symbolic models) offer an effective scheme for approximating the complex continuous-space systems with simpler models in the discrete-space domain. A crucial aspect, however, is to establish a formal relation between the original system and its symbolic model, ensuring that a discrete controller designed for the symbolic model can be effectively implemented as a hybrid controller (using an interface map) for the original system. This task becomes even more challenging when the exact mathematical model of the continuous-space system is unknown. To address this, the existing literature mainly employs scenario-based data-driven methods, which require collecting a large amount of data from the original system. In this work, we propose a data-driven framework that utilizes only two input-state trajectories collected from unknown nonlinear polynomial systems to synthesize a hybrid controller, enabling the desired behavior on the unknown system through the controller derived from its symbolic model. To accomplish this, we employ the concept of alternating simulation functions (ASFs) to quantify the closeness between the state trajectories of the unknown system and its data-driven symbolic model. By satisfying a specific rank condition on the collected data, which intuitively ensures that the unknown system is persistently excited, we directly design an ASF and its corresponding hybrid controller using finite-length data without explicitly identifying the unknown system, while providing correctness guarantees. This is achieved through proposing a data-based sum-of-squares (SOS) optimization program, enabling a systematic approach to the design process. We illustrate the effectiveness of our data-driven approach through a case study.
Submitted 5 December, 2024;
originally announced December 2024.
-
Geographical Information Alignment Boosts Traffic Analysis via Transpose Cross-attention
Authors:
Xiangyu Jiang,
Xiwen Chen,
Hao Wang,
Abolfazl Razi
Abstract:
Traffic accident prediction is crucial for enhancing road safety and mitigating congestion, and recent Graph Neural Networks (GNNs) have shown promise in modeling the inherent graph-based traffic data. However, existing GNN-based approaches often overlook or do not explicitly exploit geographic position information, which often plays a critical role in understanding spatial dependencies. This is also aligned with our observation, where accident locations are often highly relevant. To address this issue, we propose a plug-and-play module for common GNN frameworks, termed Geographic Information Alignment (GIA). This module can efficiently fuse the node feature and geographic position information through a novel Transpose Cross-attention mechanism. Due to the large number of nodes for traffic data, the conventional cross-attention mechanism performing the node-wise alignment may be infeasible with limited computational resources. Instead, we take the transpose operation for Query, Key, and Value in the Cross-attention mechanism, which substantially reduces the computation cost while maintaining sufficient information. Experimental results for both traffic occurrence prediction and severity prediction (severity levels based on the interval of recorded crash counts) on large-scale city-wise datasets confirm the effectiveness of our proposed method. For example, our method can obtain gains ranging from 1.3% to 10.9% in F1 score and 0.3% to 4.8% in AUC.
Submitted 3 December, 2024;
originally announced December 2024.
-
Many-MobileNet: Multi-Model Augmentation for Robust Retinal Disease Classification
Authors:
Hao Wang,
Wenhui Zhu,
Xuanzhao Dong,
Yanxi Chen,
Xin Li,
Peijie Qiu,
Xiwen Chen,
Vamsi Krishna Vasa,
Yujian Xiong,
Oana M. Dumitrascu,
Abolfazl Razi,
Yalin Wang
Abstract:
In this work, we propose Many-MobileNet, an efficient model fusion strategy for retinal disease classification using lightweight CNN architecture. Our method addresses key challenges such as overfitting and limited dataset variability by training multiple models with distinct data augmentation strategies and different model complexities. Through this fusion technique, we achieved robust generalization in data-scarce domains while balancing computational efficiency with feature extraction capabilities.
Submitted 3 December, 2024;
originally announced December 2024.
-
Messenger size optimality in cellular communications
Authors:
Arash Tirandaz,
Abolfazl Ramezanpour,
Vivi Rottschäfer,
Mehrad Babaei,
Andrei Zinovyev,
Alireza Mashaghi
Abstract:
Living cells presumably employ optimized information transfer methods, enabling efficient communication even in noisy environments. As expected, the efficiency of chemical communications between cells depends on the properties of the molecular messenger. Evidence suggests that proteins from narrow ranges of molecular masses have been naturally selected to mediate cellular communications, yet the underlying communication design principles are not understood. Using a simple physical model that considers the cost of chemical synthesis, diffusion, molecular binding, and degradation, we show that optimal mass values exist that ensure efficient communication of various types of signals. Our findings provide insights into the design principles of biological communications and can be used to engineer chemically communicating biomimetic systems.
Submitted 1 December, 2024;
originally announced December 2024.
-
Overview of NR Enhancements for Extended Reality (XR) in 3GPP 5G-Advanced
Authors:
Margarita Gapeyenko,
Stefano Paris,
Markus Isomaki,
Boyan Yanakiev,
Abolfazl Amiri,
Benoist Sébire,
Jorma Kaikkonen,
Chunli Wu,
Klaus I. Pedersen
Abstract:
Extended reality (XR) is unlocking numerous possibilities and continues attracting individuals and larger groups across different business sectors. With Virtual reality (VR), Augmented reality (AR), or Mixed reality (MR) it is possible to improve the way we access, deliver and exchange information in education, health care, entertainment, and many other aspects of our daily lives. However, to fully exploit the potential of XR, it is important to provide reliable, fast and secure wireless connectivity to the users of XR and that requires refining existing solutions and tailoring those to support XR services. This article presents a tutorial on 3GPP 5G-Advanced Release 18 XR activities, summarizing physical as well as higher layer enhancements introduced for New Radio considering the specifics of XR. In addition, we also describe enhancements across 5G system architecture that impacted radio access network. Furthermore, the paper provides system-level simulation results for several Release 18 enhancements to show their benefits in terms of XR capacity and power saving gains. Finally, it concludes with an overview of future work in Release 19 that continues developing features to support XR services.
Submitted 1 December, 2024;
originally announced December 2024.
-
Needle: A Generative AI-Powered Multi-modal Database for Answering Complex Natural Language Queries
Authors:
Mahdi Erfanian,
Mohsen Dehghankar,
Abolfazl Asudeh
Abstract:
Multi-modal datasets, like those involving images, often miss the detailed descriptions that properly capture the rich information encoded in each item. This makes answering complex natural language queries a major challenge in this domain. In particular, unlike the traditional nearest neighbor search, where the tuples and the query are represented as points in a single metric space, these settings involve queries and tuples embedded in fundamentally different spaces, making the traditional query answering methods inapplicable. Existing literature addresses this challenge for image datasets through vector representations jointly trained on natural language and images. This technique, however, underperforms for complex queries due to various reasons.
This paper takes a step towards addressing this challenge by introducing a Generative-based Monte Carlo method that utilizes foundation models to generate synthetic samples that capture the complexity of the natural language query and represent it in the same metric space as the multi-modal data.
Following this method, we propose Needle, a database for image data retrieval. Instead of relying on contrastive learning or metadata-searching approaches, our system is based on synthetic data generation to capture the complexities of natural language queries. Our system is open-source and ready for deployment, designed to be easily adopted by researchers and developers. The comprehensive experiments on various benchmark datasets verify that this system significantly outperforms state-of-the-art text-to-image retrieval methods in the literature. Any foundation model and embedder can be easily integrated into Needle to improve the performance, piggybacking on the advancements in these technologies.
Submitted 2 June, 2025; v1 submitted 30 November, 2024;
originally announced December 2024.
-
Rank It, Then Ask It: Input Reranking for Maximizing the Performance of LLMs on Symmetric Tasks
Authors:
Mohsen Dehghankar,
Abolfazl Asudeh
Abstract:
Large language models (LLMs) have quickly emerged as practical and versatile tools that provide new solutions for a wide range of domains. In this paper, we consider the application of LLMs on symmetric tasks where a query is asked on an (unordered) bag of elements. Examples of such tasks include answering aggregate queries on a database table. In general, when the bag contains a large number of elements, LLMs tend to overlook some elements, leading to challenges in generating accurate responses to the query. LLMs receive their inputs as ordered sequences. However, in this problem, we leverage the fact that the symmetric input is not ordered, and reordering should not affect the LLM's response.
Observing that LLMs are less likely to miss elements at certain positions of the input, we introduce the problem of LLM input reranking: to find a ranking of the input that maximizes the LLM's accuracy for the given query without making explicit assumptions about the query. Finding the optimal ranking requires identifying (i) the relevance of each input element for answering the query and (ii) the importance of each rank position for the LLM's attention. We develop algorithms for estimating these values efficiently utilizing a helper LLM. We conduct comprehensive experiments on different synthetic and real datasets to validate our proposal and to evaluate the effectiveness of our proposed algorithms. Our experiments confirm that our reranking approach improves the accuracy of the LLMs on symmetric tasks by up to 99% proximity to the optimum upper bound.
Submitted 30 November, 2024;
originally announced December 2024.
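Once the two quantities named in the abstract above are available (a relevance score per input element and an importance score per rank position), one simple way to combine them is to put the most relevant elements at the most attended positions. The sketch below assumes the goal is to maximize the sum of relevance times position importance, in which case sorting both lists is optimal by the rearrangement inequality. That objective, the function name, and all scores are assumptions for illustration; the paper's estimation algorithms and exact objective are not given in the abstract.

```python
def rerank(elements, relevance, position_importance):
    """Place the most relevant elements at the most important positions.

    relevance[i] scores elements[i]; position_importance[p] scores slot p.
    Both are assumed to be estimated beforehand (e.g. by a helper LLM).
    """
    count = len(elements)
    by_relevance = sorted(range(count), key=lambda i: relevance[i], reverse=True)
    by_importance = sorted(range(count), key=lambda p: position_importance[p], reverse=True)
    ordered = [None] * count
    for element_index, position in zip(by_relevance, by_importance):
        ordered[position] = elements[element_index]
    return ordered


# Hypothetical scores: the last and first slots receive the most attention.
items = ["a", "b", "c", "d"]
relevance_scores = [0.1, 0.9, 0.5, 0.3]
slot_importance = [0.8, 0.2, 0.3, 0.9]
reordered = rerank(items, relevance_scores, slot_importance)
print(reordered)
```

Because the task is symmetric, any permutation of the input is a valid prompt, so the reordering changes only where the model is likely to pay attention, not the content of the query.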
-
Scene Co-pilot: Procedural Text to Video Generation with Human in the Loop
Authors:
Zhaofang Qian,
Abolfazl Sharifi,
Tucker Carroll,
Ser-Nam Lim
Abstract:
Video generation has achieved impressive quality, but it still suffers from artifacts such as temporal inconsistency and violation of physical laws. Leveraging 3D scenes can fundamentally resolve these issues by providing precise control over scene entities. To facilitate the easy generation of diverse photorealistic scenes, we propose Scene Copilot, a framework combining large language models (LLMs) with a procedural 3D scene generator. Specifically, Scene Copilot consists of Scene Codex, BlenderGPT, and Human in the loop. Scene Codex is designed to translate textual user input into commands understandable by the 3D scene generator. BlenderGPT provides users with an intuitive and direct way to precisely control the generated 3D scene and the final output video. Furthermore, users can utilize Blender UI to receive instant visual feedback. Additionally, we have curated a procedural dataset of objects in code format to further enhance our system's capabilities. Each component works seamlessly together to support users in generating desired 3D scenes. Extensive experiments demonstrate the capability of our framework in customizing 3D scenes and video generation.
Submitted 26 November, 2024;
originally announced November 2024.
-
Degrees of Freedom of Cache-Aided Interference Channels Assisted by Active Intelligent Reflecting Surfaces
Authors:
Abolfazl Changizi,
Ali H. Abdollahi Bafghi,
Masoumeh Nasiri-Kenari
Abstract:
This paper studies cache-aided wireless networks in the presence of active intelligent reflecting surfaces (IRS) from an information-theoretic perspective. Specifically, we explore interference management in a cache-aided wireless network assisted by an active IRS, to enhance the achievable degrees of freedom (DoF). To this end, we jointly design the content placement, delivery phase, and phase shifts of the IRS and propose a one-shot achievable scheme. Our scheme exploits transmitters' cooperation, cache contents (as side information), interference alignment, and IRS capabilities, adapting to the network's parameters. We derive the achievable one-shot sum-DoF for different sizes of cache memories, network configurations, and numbers of IRS elements. Our results highlight the potential of deploying an IRS in cache-aided wireless communication systems, underscoring the enhancement of achievable DoF for various parameter regimes, particularly when the sizes of the caches (especially at the transmitters) are inadequate. Notably, we show that access to an IRS with a sufficient number of elements enables the achievement of the maximum possible DoF for various parameter regimes of interest.
Submitted 26 November, 2024;
originally announced November 2024.
-
sbi reloaded: a toolkit for simulation-based inference workflows
Authors:
Jan Boelts,
Michael Deistler,
Manuel Gloeckler,
Álvaro Tejero-Cantero,
Jan-Matthis Lueckmann,
Guy Moss,
Peter Steinbach,
Thomas Moreau,
Fabio Muratore,
Julia Linhart,
Conor Durkan,
Julius Vetter,
Benjamin Kurt Miller,
Maternus Herold,
Abolfazl Ziaeemehr,
Matthijs Pals,
Theo Gruner,
Sebastian Bischoff,
Nastya Krouglova,
Richard Gao,
Janne K. Lappalainen,
Bálint Mucsányi,
Felix Pei,
Auguste Schulz,
Zinovia Stefanidi
, et al. (8 additional authors not shown)
Abstract:
Scientists and engineers use simulators to model empirically observed phenomena. However, tuning the parameters of a simulator to ensure its outputs match observed data presents a significant challenge. Simulation-based inference (SBI) addresses this by enabling Bayesian inference for simulators, identifying parameters that match observed data and align with prior knowledge. Unlike traditional Bayesian inference, SBI only needs access to simulations from the model and does not require evaluations of the likelihood function. In addition, SBI algorithms do not require gradients through the simulator, allow for massive parallelization of simulations, and can perform inference for different observations without further simulations or training, thereby amortizing inference. Over the past years, we have developed, maintained, and extended $\texttt{sbi}$, a PyTorch-based package that implements Bayesian SBI algorithms based on neural networks. The $\texttt{sbi}$ toolkit implements a wide range of inference methods, neural network architectures, sampling methods, and diagnostic tools. It provides well-tested default settings while also offering the flexibility to fully customize every step of the simulation-based inference workflow. Taken together, the $\texttt{sbi}$ toolkit enables scientists and engineers to apply state-of-the-art SBI methods to black-box simulators, opening up new possibilities for aligning simulations with empirically observed data.
Submitted 26 November, 2024;
originally announced November 2024.
-
RobustFormer: Noise-Robust Pre-training for images and videos
Authors:
Ashish Bastola,
Nishant Luitel,
Hao Wang,
Danda Pani Paudel,
Roshani Poudel,
Abolfazl Razi
Abstract:
While deep learning models are powerful tools that revolutionized many areas, they are also vulnerable to noise as they rely heavily on learning patterns and features from the exact details of the clean data. Transformers, which have become the backbone of modern vision models, are no exception. Current Discrete Wavelet Transforms (DWT) based methods do not benefit from masked autoencoder (MAE) pre-training since the inverse DWT (iDWT) introduced in these approaches is computationally inefficient and lacks compatibility with video inputs in transformer architectures.
In this work, we present RobustFormer, a method that overcomes these limitations by enabling noise-robust pre-training for both images and videos, improving the efficiency of DWT-based methods by removing the need for computationally inefficient iDWT steps and simplifying the attention mechanism. To our knowledge, the proposed method is the first DWT-based method compatible with video inputs and masked pre-training. Our experiments show that MAE-based pre-training allows us to bypass the iDWT step, greatly reducing computation. Through extensive tests on benchmark datasets, RobustFormer achieves state-of-the-art results for both image and video tasks.
Submitted 20 November, 2024;
originally announced November 2024.
-
Investigation of Vibrational Frequency of Canine Vocal Folds Using a Two-Way Fluid-Solid Interaction Analysis
Authors:
Abolfazl Mohammadi Gorjaei,
Mohammad Ali Nazari,
Asghar Afshari,
Saeed Farzad-Mohajeri,
Pascal Perrier
Abstract:
Introduction: Speech is an integral component of human communication, requiring the coordinated efforts of various organs to produce sound (Titze & Alipour, 2006). The glottis region plays a crucial role in voice production. As air emanating from the lungs in a confined space interacts with the vocal folds (VFs) within the human body, it gives rise to voice (Alipour & Vigmostad, 2012). Understanding the mechanics of this process is important, but studying VFs in vivo is difficult. Nevertheless, the orientation, shape, and size of VF fibers have been extracted with synchrotron X-ray microtomography (Bailly et al., 2018). The mechanical properties of both human and animal VFs have been investigated through various methodologies in the literature. They have been studied using the uniaxial extension test (Alipour & Vigmostad, 2012) assuming linear behavior, while the nonlinearity and anisotropy of VFs have been determined using a multiscale method (Miri et al., 2013). Pipette aspiration has also been used to extract in vivo elastic properties of VFs (Scheible et al., 2023), and the mechanical behavior of VF layers in tension, compression, and shear has been studied (Cochereau et al., 2020). Fluid-structure interaction (FSI) simulations provide a valuable tool to gain a deeper understanding of voice production (Ghorbani et al., 2022). These simulations allow us to model the dynamic interplay between the VFs and air. Our research focuses on investigating the mechanical properties of canine vocal folds and utilizing these findings in an FSI simulation. Through this simulation, we aim to unravel how these mechanical properties affect voice production.
Methods: To investigate the mechanical properties of canine VFs, an in vitro study was conducted involving 6 mixed-breed dogs. The samples were harvested from canine cadavers euthanized for reasons unrelated to this study. The VFs were harvested and tested 3-4 hours after the animals were sacrificed. Experimental trials were carried out using the STM-1 device (SANTAM Co.), equipped with a 100 kg load cell. Seven uniaxial tensile tests were performed on each sample, with displacement rates of 1, 5, 10, 20, 40, 60, and 120 mm/min. The very slow rate of 1 mm/min was chosen to assess only elastic properties, eliminating viscosity effects. Various hyperelastic models were fitted to the experimental data. Subsequently, for each model, both the mean and standard deviation (SD) were determined for the hyperelastic model parameters and their residuals. For the FSI analysis, we used a simplified laryngeal model: a hollow cylinder with a diameter of 50 mm and a thickness of 3 mm. The overall length of the larynx was set at 100 mm. The VFs were modeled as a circular disc with a small elliptical fissure in the middle of the cylinder section. Boundary conditions were established based on pressure differentials, with the inlet gauge pressure set at 1200 Pa and the relative pressure at the outlet set to 0. To account for the turbulent nature of airflow within the larynx, we employed the K-epsilon method to solve the differential equations of motion in a two-way fluid-structure interaction simulation using ANSYS FLUENT 2021. This approach enabled us to investigate how the acquired mechanical properties of canine vocal folds affect the FSI simulations during phonation, resulting in a more comprehensive understanding of their impact. To determine the vibrational frequency of VFs, we calculated the time it took to reach maximum displacement and then quadrupled this value to obtain the period of vibration.
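The last step of the abstract above (quadrupling the time to maximum displacement to get the period) can be sketched as a short calculation. This is only an illustration: the function name and the sample value of 2.5 ms are hypothetical and do not come from the paper, and the sketch assumes the motion starts from rest at the equilibrium position so that maximum displacement is reached after a quarter period.

```python
def vibration_frequency(time_to_max_displacement_s: float) -> float:
    """Estimate vibration frequency (Hz) from the time to first maximum displacement.

    Assumes the first maximum is reached after one quarter of a period,
    so the period is four times the measured time.
    """
    if time_to_max_displacement_s <= 0:
        raise ValueError("time to maximum displacement must be positive")
    period_s = 4.0 * time_to_max_displacement_s  # quarter period -> full period
    return 1.0 / period_s                        # frequency is the inverse of the period


# Hypothetical input: maximum displacement reached after 2.5 ms.
frequency_hz = vibration_frequency(0.0025)
```

The quarter-period assumption only holds for roughly sinusoidal motion; a strongly asymmetric opening and closing phase would need the full displacement history instead.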
Submitted 19 November, 2024;
originally announced November 2024.
-
Towards Mitigating Sim2Real Gaps: A Formal Quantitative Approach
Authors:
P Sangeerth,
Abolfazl Lavaei,
Pushpak Jagtap
Abstract:
In this paper, we introduce the notion of simulation-gap functions to formally quantify the potential gap between an approximate nominal mathematical model and the high-fidelity simulator representation of a real system. Given a nominal mathematical model alongside a quantified simulation gap, the system can be conceptualized as one characterized by bounded states and input-dependent disturbances. This allows us to leverage the existing powerful model-based control algorithms effectively, ensuring the enforcement of desired specifications while guaranteeing a seamless transition from simulation to real-world application. To provide a formal guarantee for quantifying the simulation gap, we develop a data-driven approach. In particular, we collect data using high-fidelity simulators, leveraging recent advancements in Real-to-Sim transfer to ensure close alignment with reality. We demonstrate the effectiveness of the proposed method through experiments conducted on a nonlinear pendulum system and a nonlinear Turtlebot model in simulators.
Submitted 18 November, 2024;
originally announced November 2024.
-
Efficiency of energy-consuming random walkers: Variability in energy helps
Authors:
Mohsen Ghasemi Nezhadhaghighi,
Abolfazl Ramezanpour
Abstract:
Energy considerations can significantly affect the behavior of a population of energy-consuming agents with limited energy budgets, for instance, in the movement of people in a city. We consider a population of interacting agents with an initial energy budget walking on a graph according to an exploration and return (to home) strategy that is based on the current energy of the agent. Each move reduces the available energy depending on the flow of movements and the strength of interactions, and the movement ends when an agent returns home with a negative energy. We observe that a uniform distribution of initial energy budgets results in a larger number of visited sites per consumed energy (efficiency) compared to the case in which all agents have the same initial energy, provided that returning home is relevant from the beginning of the process. The uniform energy distribution also reduces the uncertainty in the total travel times (entropy production), an effect that is more pronounced when the strength of interactions and exploration play the relevant role in the movement process. That is, variability in the energies can help to increase the efficiency and reduce the entropy production, especially in the presence of strong interactions.
Submitted 12 November, 2024;
originally announced November 2024.
-
Data-Driven Control of Large-Scale Networks with Formal Guarantees: A Small-Gain Free Approach
Authors:
Behrad Samari,
Amy Nejati,
Abolfazl Lavaei
Abstract:
This paper offers a data-driven divide-and-conquer strategy to analyze large-scale interconnected networks, characterized by both unknown mathematical models and interconnection topologies. Our data-driven scheme treats an unknown network as an interconnection of individual agents (a.k.a. subsystems) and aims at constructing their symbolic models, referred to as discrete-domain representations of unknown agents, by collecting data from their trajectories. The primary objective is to synthesize a control strategy that guarantees desired behaviors over an unknown network by employing local controllers, derived from symbolic models of individual agents. To achieve this, we leverage the concept of alternating sub-bisimulation function (ASBF) to capture the closeness between state trajectories of each unknown agent and its data-driven symbolic model. Under a newly developed data-driven compositional condition, we then establish an alternating bisimulation function (ABF) between an unknown network and its symbolic model, based on ASBFs of individual agents, while providing correctness guarantees. Despite the sample complexity in existing work being exponential with respect to the network size, we demonstrate that our divide-and-conquer strategy significantly reduces it to a linear scale with respect to the number of agents. We also showcase that our data-driven compositional condition does not necessitate the traditional small-gain condition, which demands precise knowledge of the interconnection topology for its fulfillment. We apply our data-driven findings to two benchmarks comprising unknown networks with an arbitrary, a-priori undefined number of agents and unknown interconnection topologies.
Submitted 11 November, 2024;
originally announced November 2024.
-
An Efficient Matrix Multiplication Algorithm for Accelerating Inference in Binary and Ternary Neural Networks
Authors:
Mohsen Dehghankar,
Mahdi Erfanian,
Abolfazl Asudeh
Abstract:
Despite their tremendous success and versatility, Deep Neural Networks (DNNs) such as Large Language Models (LLMs) suffer from inference inefficiency and rely on advanced computational infrastructure. To address these challenges and make these models more accessible and cost-effective, we propose algorithms to improve the inference time and memory efficiency of DNNs with binary and ternary weight matrices. Focusing on matrix multiplication as the bottleneck operation of inference, we observe that, once trained, the weight matrices of a model no longer change. This allows us to preprocess these matrices and create indices that help reduce the storage requirements by a logarithmic factor while enabling our efficient inference algorithms. Specifically, for an $n\times n$ weight matrix, our efficient algorithm guarantees a time complexity of $O(\frac{n^2}{\log n})$, a logarithmic factor improvement over the standard vector-matrix multiplication. Besides theoretical analysis, we conduct extensive experiments to evaluate the practical efficiency of our algorithms. Our results confirm the superiority of our approach with respect to both time and memory, as we observed a reduction in the multiplication time of up to 29x and in memory usage of up to 6x. When applied to LLMs, our experiments show up to a 5.24x speedup in the inference time.
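To make the idea of preprocessing a fixed binary weight matrix more concrete, here is a minimal sketch of one generic way to trade a one-time index for cheaper vector-matrix products. It is not the authors' algorithm: the function names `preprocess` and `multiply`, the block width `k`, and the bucket-sum scheme are all assumptions chosen for illustration. Columns are grouped into blocks of `k`; each row's bits within a block form an integer pattern that is computed once, and at inference time the entries of `v` are summed per pattern before being distributed to the block's columns.

```python
import numpy as np


def preprocess(W: np.ndarray, k: int):
    """One-time index for a binary matrix W (entries 0/1), assumed fixed after training.

    For every block of up to k columns, store the integer bit pattern of each row.
    """
    n_rows, n_cols = W.shape
    blocks = []
    for start in range(0, n_cols, k):
        block = W[:, start:start + k].astype(np.int64)  # shape (n_rows, width)
        width = block.shape[1]
        bit_values = 1 << np.arange(width)              # bit j <-> column start + j
        patterns = block @ bit_values                   # one integer code per row
        blocks.append((start, width, patterns))
    return blocks


def multiply(v: np.ndarray, blocks, n_cols: int) -> np.ndarray:
    """Compute v @ W using the precomputed row patterns instead of W itself."""
    out = np.zeros(n_cols, dtype=np.float64)
    for start, width, patterns in blocks:
        # Sum the entries of v that share the same row pattern: one pass over v.
        buckets = np.bincount(patterns, weights=v, minlength=1 << width)
        codes = np.arange(1 << width)
        for j in range(width):
            has_bit = ((codes >> j) & 1) == 1
            # Column start + j receives every bucket whose pattern has bit j set.
            out[start + j] = buckets[has_bit].sum()
    return out


rng = np.random.default_rng(0)
W = rng.integers(0, 2, size=(16, 16))  # hypothetical binary weight matrix
v = rng.standard_normal(16)
result = multiply(v, preprocess(W, k=3), n_cols=16)
```

Each block costs one pass over `v` plus work proportional to `width * 2**width`, so a block width that grows logarithmically with the matrix size keeps the second term small. Whether this matches the indexing scheme or the exact bounds of the paper is not something the abstract states.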
Submitted 2 May, 2025; v1 submitted 9 November, 2024;
originally announced November 2024.
-
Mining the Minoria: Unknown, Under-represented, and Under-performing Minority Groups
Authors:
Mohsen Dehghankar,
Abolfazl Asudeh
Abstract:
For a variety of reasons, such as privacy, data in the wild often lacks the grouping information required for identifying minorities. On the other hand, it is known that machine learning models are only as good as the data they are trained on and, hence, may underperform for under-represented minority groups. The missing grouping information presents a dilemma for responsible data scientists, who find themselves in an unknown-unknown situation: not only do they lack access to the grouping attributes, but they also do not know which groups to consider.
This paper is an attempt to address this dilemma. Specifically, we propose a minority mining problem, where we find vectors in the attribute space that reveal potential groups that are under-represented and under-performing. Technically speaking, we propose a geometric transformation of data into a dual space and use notions such as the arrangement of hyperplanes to design an efficient algorithm for the problem in lower dimensions. Generalizing our solution to higher dimensions suffers from the curse of dimensionality. Therefore, we propose a solution based on smart exploration of the search space for such cases. We conduct comprehensive experiments using real-world and synthetic datasets alongside the theoretical analysis. Our experimental results demonstrate the effectiveness of our proposed solutions in mining the unknown, under-represented, and under-performing minorities.
Submitted 20 April, 2025; v1 submitted 7 November, 2024;
originally announced November 2024.
-
Current Trends in Global Quantum Metrology
Authors:
Chiranjib Mukhopadhyay,
Victor Montenegro,
Abolfazl Bayat
Abstract:
Quantum sensors are now universally acknowledged as one of the most promising near-term quantum technologies. The traditional formulation of quantum sensing introduces a concrete bound on ultimate precision through the so-called local sensing framework, in which significant prior information about the unknown parameter value is implicitly assumed. Moreover, the framework provides a systematic approach for optimizing the sensing protocol. In contrast, the paradigm of global sensing aims to find a precision bound for parameter estimation in the absence of such prior information. In recent years, vigorous research has been pursued to describe the contours of global quantum estimation. Here, we review some of these emerging developments. They concern both finding ultimate precision bounds with respect to appropriate figures of merit in the global sensing paradigm and the search for algorithms that achieve these bounds. We categorize these developments into two largely mutually exclusive camps: one employing Bayesian updating and the other seeking to generalize the frequentist picture of local sensing towards the global paradigm. In the first approach, in order to achieve the best performance, one has to optimize the measurement settings adaptively. In the second approach, the measurement setting is fixed; however, the challenge is to identify this fixed measurement optimally.
Submitted 12 February, 2025; v1 submitted 6 November, 2024;
originally announced November 2024.
-
Enhancing Graph Neural Networks in Large-scale Traffic Incident Analysis with Concurrency Hypothesis
Authors:
Xiwen Chen,
Sayed Pedram Haeri Boroujeni,
Xin Shu,
Huayu Li,
Abolfazl Razi
Abstract:
Despite recent progress in reducing road fatalities, the persistently high rate of traffic-related deaths highlights the necessity for improved safety interventions. Leveraging large-scale graph-based nationwide road network data across 49 states in the USA, our study first posits the Concurrency Hypothesis from intuitive observations, suggesting a significant likelihood of incidents occurring at neighboring nodes within the road network. To quantify this phenomenon, we introduce two novel metrics, Average Neighbor Crash Density (ANCD) and Average Neighbor Crash Continuity (ANCC), and subsequently employ them in statistical tests to validate the hypothesis rigorously. Building upon this foundation, we propose the Concurrency Prior (CP) method, a powerful approach designed to enhance the predictive capabilities of general Graph Neural Network (GNN) models in semi-supervised traffic incident prediction tasks. Our method allows GNNs to incorporate concurrent incident information, as mentioned in the hypothesis, via tokenization with negligible extra parameters.
Extensive experiments on real-world data across states and cities in the USA demonstrate that integrating CP into 12 state-of-the-art GNN architectures leads to significant improvements, with gains ranging from 3% to 13% in F1 score and 1.3% to 9% in AUC metrics. The code is publicly available at https://github.com/xiwenc1/Incident-GNN-CP.
Submitted 4 November, 2024;
originally announced November 2024.
-
Noisy Stark probes as quantum-enhanced sensors
Authors:
Saubhik Sarkar,
Abolfazl Bayat
Abstract:
Wannier-Stark localization has been proven to be a resource for quantum-enhanced sensitivity for precise estimation of a gradient field. An extremely promising feature of such probes is their ability to showcase such enhanced scaling even dynamically with system size, on top of the quadratic scaling in time. In this paper, we address the issue of decoherence that occurs during time evolution and characterize how that affects the sensing performance. We determine the parameter domains in which the enhancement is sustained under dephasing dynamics. In addition, we consider an effective non-Hermitian description of the open quantum system dynamics for describing the effect of decoherence on the sensing performance of the probe. By investigating the static and dynamic properties of the non-Hermitian Hamiltonians, we show that quantum-enhanced sensitivity can indeed be sustained over a certain range of decoherence strength for Wannier-Stark probes. This is demonstrated with two examples of non-Hermitian systems with non-reciprocal couplings.
Submitted 3 June, 2025; v1 submitted 4 November, 2024;
originally announced November 2024.
-
Quantum-enhanced sensing of spin-orbit coupling without fine-tuning
Authors:
Bin Yi,
Abolfazl Bayat,
Saubhik Sarkar
Abstract:
Spin-orbit coupling plays an important role in both fundamental physics and technological applications. Precise estimation of the spin-orbit coupling is necessary for accurate design across various physical setups such as solid-state devices and quantum hardware. Here, we exploit quantum features in a 1D quantum wire for estimating the Rashba spin-orbit coupling with enhanced sensitivity beyond the capability of classical probes. The Heisenberg-limited enhanced precision is achieved across a wide range of parameters and does not require fine-tuning. This advantage is directly related to the gap-closing nature of the probe across the entire relevant range of parameters. It provides a clear advantage over conventional criticality-based quantum sensors, in which quantum-enhanced sensitivity can only be achieved through fine-tuning around the phase transition point. We have demonstrated quantum-enhanced sensitivity for both single-particle and interacting many-body probes. In addition to extending our results to thermal states and the multi-parameter scenario, we have provided a measurement basis that performs close to the ultimate precision.
Submitted 12 November, 2024; v1 submitted 1 November, 2024;
originally announced November 2024.
-
A Multiphysics Analysis and Investigation of Soft Magnetics Effect on IPMSM: Case Study Dynamometer
Authors:
Ali Amini,
MohammadSadegh KhajueeZadeh,
Abolfazl Vahedi
Abstract:
Interior Permanent Magnet Synchronous Motors (IPMSMs) have attracted attention in industry owing to their advantages. In many cases, static tests are not sufficient, and electric machines must be investigated under dynamic conditions. A dynamometer system is therefore employed to investigate the dynamic behavior of the electric machine under test. Among dynamometers, the alternating-current (AC) dynamometer is the best choice because basic dynamometers cannot handle highly complex loads. In this study, two IPMSMs with V-type and Delta-type rotor configurations are designed and proposed for use in an AC dynamometer. Any electrical or mechanical non-ideality in the electric machines of an AC dynamometer causes errors in the measurement of the motor under test. The electrical and mechanical behavior of a system depends significantly on the soft magnetic materials used, in addition to its physical and magnetic configuration. Accordingly, a multiphysics analysis is performed with a FEM tool in which the soft magnetic material of the rotor and stator cores is varied, and the behavior of the electric motors in the AC dynamometer is compared under the same electrical and mechanical operating conditions. Finally, the soft magnetic material most suitable for the AC dynamometer is identified.
Submitted 22 November, 2024; v1 submitted 31 October, 2024;
originally announced October 2024.
-
Non-Hermitian Discrete Time Crystals
Authors:
Rozhin Yousefjani,
Angelo Carollo,
Krzysztof Sacha,
Saif Al-Kuwari,
Abolfazl Bayat
Abstract:
Discrete time crystals (DTC) exhibit a special non-equilibrium phase of matter in periodically driven many-body systems with spontaneous breaking of time translational symmetry. The presence of decoherence generally enhances thermalization and destroys the coherence required for the existence of DTC. In this letter, we devise a mechanism for establishing a stable DTC with period-doubling oscillations in an open quantum system that is governed by a properly tailored non-Hermitian Hamiltonian. We find a specific class of non-reciprocal couplings in our non-Hermitian dynamics which prevents thermalization through eigenstate ordering. This choice of non-Hermitian dynamics significantly enhances the stability of the DTC against imperfect pulses. Through a comprehensive analysis, we determine the phase diagram of the system in terms of pulse imperfection.
Submitted 30 October, 2024;
originally announced October 2024.
-
Equitable Federated Learning with Activation Clustering
Authors:
Antesh Upadhyay,
Abolfazl Hashemi
Abstract:
Federated learning is a prominent distributed learning paradigm that incorporates collaboration among diverse clients, promotes data locality, and thus ensures privacy. These clients have their own technological, cultural, and other biases in the process of data generation. However, the present standard often ignores this bias/heterogeneity, perpetuating bias against certain groups rather than mitigating it. In response to this concern, we propose an equitable clustering-based framework where the clients are categorized/clustered based on how similar they are to each other. We propose a unique way to construct the similarity matrix that uses activation vectors. Furthermore, we propose a client weighting mechanism to ensure that each cluster receives equal importance and establish an $O(1/\sqrt{K})$ rate of convergence to reach an $ε$-stationary solution. We assess the effectiveness of our proposed strategy against common baselines, demonstrating its efficacy in terms of reducing the bias existing amongst various client clusters and consequently ameliorating algorithmic bias against specific groups.
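One simple way to read "each cluster receives equal importance" is sketched below. The abstract does not give the actual weighting rule, so this is an assumption for illustration only: every cluster gets the same total weight, split evenly among its clients. The function name `equal_cluster_weights` and the example cluster assignment are hypothetical.

```python
from collections import Counter


def equal_cluster_weights(cluster_ids):
    """Return one aggregation weight per client.

    Assumed rule (not taken from the paper): each cluster receives a total
    weight of 1 / number_of_clusters, shared equally by its clients.
    """
    sizes = Counter(cluster_ids)   # number of clients in each cluster
    n_clusters = len(sizes)
    return [1.0 / (n_clusters * sizes[c]) for c in cluster_ids]


# Hypothetical assignment: three clients in cluster 0, one client in cluster 1.
weights = equal_cluster_weights([0, 0, 0, 1])
```

Under this rule the single client in the small cluster carries as much total weight as the three clients in the large cluster combined, which is the sense in which a minority cluster is not drowned out during aggregation.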
Submitted 1 November, 2024; v1 submitted 24 October, 2024;
originally announced October 2024.
-
Gradual Domain Adaptation via Manifold-Constrained Distributionally Robust Optimization
Authors:
Amir Hossein Saberi,
Amir Najafi,
Ala Emrani,
Amin Behjati,
Yasaman Zolfimoselo,
Mahdi Shadrooy,
Abolfazl Motahari,
Babak H. Khalaj
Abstract:
The aim of this paper is to address the challenge of gradual domain adaptation within a class of manifold-constrained data distributions. In particular, we consider a sequence of $T\ge2$ data distributions $P_1,\ldots,P_T$ undergoing a gradual shift, where each pair of consecutive measures $P_i,P_{i+1}$ are close to each other in Wasserstein distance. We have a supervised dataset of size $n$ sampled from $P_0$, while for the subsequent distributions in the sequence, only unlabeled i.i.d. samples are available. Moreover, we assume that all distributions exhibit a known favorable attribute, such as (but not limited to) having intra-class soft/hard margins. In this context, we propose a methodology rooted in Distributionally Robust Optimization (DRO) with an adaptive Wasserstein radius. We theoretically show that this method guarantees the classification error across all $P_i$s can be suitably bounded. Our bounds rely on a newly introduced compatibility measure, which fully characterizes the error propagation dynamics along the sequence. Specifically, for inadequately constrained distributions, the error can exponentially escalate as we progress through the gradual shifts. Conversely, for appropriately constrained distributions, the error can be demonstrated to be linear or even entirely eradicated. We have substantiated our theoretical findings through several experimental results.
Submitted 17 October, 2024;
originally announced October 2024.
-
Saturable global quantum sensing
Authors:
Chiranjib Mukhopadhyay,
Matteo G. A. Paris,
Abolfazl Bayat
Abstract:
Conventional formulation of quantum sensing has been mostly developed in the context of local estimation, where the unknown parameter is roughly known. In contrast, global sensing, where the prior information is incomplete and the unknown parameter is only known to lie within a broad interval, is practically more engaging but has received far less theoretical attention. Available formulations of global sensing rely on adaptive Bayesian strategies requiring on-the-fly change in measurement settings, or minimizing average uncertainty yielding unsaturable bounds. Here, we provide an operationally motivated approach to global sensing for fixed but optimized settings. Our scheme yields a saturable precision bound optimizing the measurement as well as the probe preparation simultaneously. The formalism is general and computationally scalable for generic bosonic multimode Gaussian or many-particle free-fermionic quantum sensors. We illustrate the implications for Gaussian thermometry and Gaussian phase estimation by showing that the optimal measurement changes, either gradually or abruptly, from homodyne for local sensing, towards heterodyne for global sensing. In contrast, for fermionic transverse XY probes, the optimal measurement basis stays fixed independent of width.
Submitted 20 June, 2025; v1 submitted 15 October, 2024;
originally announced October 2024.
-
Exponentially-enhanced quantum sensing with many-body phase transitions
Authors:
Saubhik Sarkar,
Abolfazl Bayat,
Sougato Bose,
Roopayan Ghosh
Abstract:
Quantum sensors based on critical many-body systems are known to exhibit enhanced sensing capability. Such enhancements typically scale algebraically with the probe size. Going beyond algebraic advantage and reaching exponential scaling has remained elusive when all the resources, such as the preparation time, are taken into account. In this work, we show that many-body systems featuring first order quantum phase transitions can indeed achieve exponential scaling of sensitivity, thanks to their exponential energy gap closing. Remarkably, even after considering the preparation time using local adiabatic driving, the exponential scaling is sustained. Our results are demonstrated through comprehensive analysis of three paradigmatic models exhibiting first order phase transitions, namely Grover, $p$-spin, and biclique models. We show that this scaling survives moderate decoherence during state preparation and also can be optimally measured in experimentally available basis. Our findings comply with the fundamental bounds and we show that one can harness the exponential advantage through an adaptive strategy even away from the phase transition point.
Submitted 3 June, 2025; v1 submitted 15 October, 2024;
originally announced October 2024.
-
Adaptive Data Transport Mechanism for UAV Surveillance Missions in Lossy Environments
Authors:
Niloufar Mehrabi,
Sayed Pedram Haeri Boroujeni,
Jenna Hofseth,
Abolfazl Razi,
Long Cheng,
Manveen Kaur,
James Martin,
Rahul Amin
Abstract:
Unmanned Aerial Vehicles (UAVs) play an increasingly critical role in Intelligence, Surveillance, and Reconnaissance (ISR) missions such as border patrolling and criminal detection, thanks to their ability to access remote areas and transmit real-time imagery to processing servers. However, UAVs are highly constrained by payload size, power limits, and communication bandwidth, necessitating the development of highly selective and efficient data transmission strategies. This has driven the development of various compression and optimal transmission technologies for UAVs. Nevertheless, most methods strive to preserve maximal information in transferred video frames, missing the fact that only certain parts of images/video frames might offer meaningful contributions to the ultimate mission objectives in ISR scenarios involving moving object detection and tracking (OD/OT). This paper adopts a different perspective and offers an alternative AI-driven scheduling policy that prioritizes selecting regions of the image that significantly contribute to the mission objective. The key idea is tiling the image into small patches and developing a deep reinforcement learning (DRL) framework that assigns higher transmission probabilities to patches that present higher overlaps with the detected object of interest, while penalizing sharp transitions over consecutive frames to promote smooth scheduling shifts. Although we used YOLOv8 object detection and the UDP transmission protocol as a benchmark testing scenario, the idea is general and applicable to different transmission protocols and OD/OT methods. To further boost the system's performance and avoid OD errors for cluttered image patches, we integrate it with interframe interpolations.
Submitted 30 September, 2024;
originally announced October 2024.
-
Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed Networks
Authors:
Yiyue Chen,
Abolfazl Hashemi,
Haris Vikalo
Abstract:
Distributed stochastic non-convex optimization problems have recently received attention due to the growing interest of signal processing, computer vision, and natural language processing communities in applications deployed over distributed learning systems (e.g., federated learning). We study the setting where the data is distributed across the nodes of a time-varying directed network, a topology suitable for modeling dynamic networks experiencing communication delays and straggler effects. The network nodes, which can access only their local objectives and query a stochastic first-order oracle to obtain gradient estimates, collaborate to minimize a global objective function by exchanging messages with their neighbors. We propose an algorithm, novel to this setting, that leverages stochastic gradient descent with momentum and gradient tracking to solve distributed non-convex optimization problems over time-varying networks. To analyze the algorithm, we tackle the challenges that arise when analyzing dynamic network systems which communicate gradient acceleration components. We prove that the algorithm's oracle complexity is $\mathcal{O}(1/ε^{1.5})$, and that under the Polyak-Łojasiewicz (PL) condition the algorithm converges linearly to a steady error state. The proposed scheme is tested on several learning tasks: a non-convex logistic regression experiment on the MNIST dataset, an image classification task on the CIFAR-10 dataset, and an NLP classification test on the IMDB dataset. We further present numerical simulations with an objective that satisfies the PL condition. The results demonstrate superior performance of the proposed framework compared to the existing related methods.
Submitted 11 October, 2024;
originally announced October 2024.
-
Efficient learning of differential network in multi-source non-paranormal graphical models
Authors:
Mojtaba Nikahd,
Seyed Abolfazl Motahari
Abstract:
This paper addresses learning of sparse structural changes, or the differential network, between two classes of non-paranormal graphical models. We assume a multi-source and heterogeneous dataset is available for each class, where the covariance matrices are identical for all non-paranormal graphical models. The differential network, which is encoded by the difference precision matrix, can then be decoded by optimizing a lasso penalized D-trace loss function. To this end, an efficient approach is proposed that outputs the exact solution path, outperforming the previous methods that only sample from the solution path at pre-selected regularization parameters. Notably, our proposed method has low computational complexity, especially when the differential network is sparse. Our simulations on synthetic data demonstrate a superior performance for our strategy in terms of speed and accuracy compared to an existing method. Moreover, our strategy in combining datasets from multiple sources is shown to be very effective in inferring differential networks in real-world problems. This is backed by our experimental results on drug resistance in tumor cancers. In the latter case, our strategy outputs important genes for drug resistance which are already confirmed by various independent studies.
Submitted 3 October, 2024;
originally announced October 2024.
-
From Data to Control: A Formal Compositional Framework for Large-Scale Interconnected Networks
Authors:
Omid Akbarzadeh,
Amy Nejati,
Abolfazl Lavaei
Abstract:
We introduce a compositional data-driven methodology with noisy data for designing fully-decentralized safety controllers applicable to large-scale interconnected networks, encompassing a vast number of subsystems with unknown mathematical models. Our compositional scheme leverages the interconnection topology and breaks down the network analysis into the examination of distinct subsystems. This is accompanied by utilizing a concept of control storage certificates (CSCs) to capture joint dissipativity-type properties among subsystems. These CSCs are instrumental in a compositional derivation of a control barrier certificate (CBC) specialized for the interconnected network, thereby ensuring its safety. In our data-driven scheme, we gather only a single noise-corrupted input-state trajectory from each unknown subsystem within a specified time frame. By fulfilling a specific rank condition, this process facilitates the construction of a CSC for each subsystem. Following this, by adhering to compositional dissipativity reasoning, we compose CSCs derived from noisy data and build a CBC for the unknown network, ensuring its safety over an infinite time horizon, while providing correctness guarantees. We demonstrate that our compositional data-driven approach significantly enhances the design of a CBC and its robust safety controller under noisy data across the interconnected network. This advancement is achieved by reducing the computational complexity from a polynomial growth in relation to network dimension, when using sum-of-squares (SOS) optimization, to a linear scale based on the number of subsystems. We apply our data-driven findings to a variety of benchmarks, involving physical networks with unknown models and diverse interconnection topologies.
Submitted 17 June, 2025; v1 submitted 19 September, 2024;
originally announced September 2024.
-
SoccerNet 2024 Challenges Results
Authors:
Anthony Cioppa,
Silvio Giancola,
Vladimir Somers,
Victor Joos,
Floriane Magera,
Jan Held,
Seyed Abolfazl Ghasemzadeh,
Xin Zhou,
Karolina Seweryn,
Mateusz Kowalczyk,
Zuzanna Mróz,
Szymon Łukasik,
Michał Hałoń,
Hassan Mkhallati,
Adrien Deliège,
Carlos Hinojosa,
Karen Sanchez,
Amir M. Mansourian,
Pierre Miralles,
Olivier Barnich,
Christophe De Vleeschouwer,
Alexandre Alahi,
Bernard Ghanem,
Marc Van Droogenbroeck,
Adam Gorski
, et al. (59 additional authors not shown)
Abstract:
The SoccerNet 2024 challenges represent the fourth annual video understanding challenges organized by the SoccerNet team. These challenges aim to advance research across multiple themes in football, including broadcast video understanding, field understanding, and player understanding. This year, the challenges encompass four vision-based tasks. (1) Ball Action Spotting, focusing on precisely localizing when and which soccer actions related to the ball occur, (2) Dense Video Captioning, focusing on describing the broadcast with natural language and anchored timestamps, (3) Multi-View Foul Recognition, a novel task focusing on analyzing multiple viewpoints of a potential foul incident to classify whether a foul occurred and assess its severity, (4) Game State Reconstruction, another novel task focusing on reconstructing the game state from broadcast videos onto a 2D top-view map of the field. Detailed information about the tasks, challenges, and leaderboards can be found at https://www.soccer-net.org, with baselines and development kits available at https://github.com/SoccerNet.
Submitted 16 September, 2024;
originally announced September 2024.
-
From a Single Trajectory to Safety Controller Synthesis of Discrete-Time Nonlinear Polynomial Systems
Authors:
Behrad Samari,
Omid Akbarzadeh,
Mahdieh Zaker,
Abolfazl Lavaei
Abstract:
This work is concerned with developing a data-driven approach for learning control barrier certificates (CBCs) and associated safety controllers for discrete-time nonlinear polynomial systems with unknown mathematical models, guaranteeing system safety over an infinite time horizon. The proposed approach leverages measured data acquired through an input-output observation, referred to as a single trajectory, collected over a specified time horizon. By fulfilling a certain rank condition, which ensures the unknown system is persistently excited by the collected data, we design a CBC and its corresponding safety controller directly from the finite-length observed data, without explicitly identifying the unknown dynamical system. This is achieved by proposing a data-based sum-of-squares (SOS) optimization program to systematically design CBCs and their safety controllers. We validate our data-driven approach over two physical case studies, a jet engine and a Lorenz system, demonstrating the efficacy of our proposed method.
Submitted 16 September, 2024;
originally announced September 2024.
-
Compositional Design of Safety Controllers for Large-scale Stochastic Hybrid Systems
Authors:
Mahdieh Zaker,
Omid Akbarzadeh,
Behrad Samari,
Abolfazl Lavaei
Abstract:
In this work, we propose a compositional scheme based on small-gain reasoning for the safety controller synthesis of interconnected stochastic hybrid systems with both continuous evolutions and instantaneous jumps. In our proposed setting, we first offer an augmented scheme to represent each stochastic hybrid subsystem with continuous and discrete evolutions in a unified framework, ensuring that the state trajectories match those of the original hybrid systems. We then introduce the concept of augmented control sub-barrier certificates (A-CSBC) for each subsystem, which allows the construction of augmented control barrier certificates (A-CBC) for interconnected systems and their safety controllers under small-gain compositional conditions. We eventually leverage the constructed A-CBC and quantify a guaranteed probabilistic bound on the safety of the interconnected system. While the computational complexity of designing a barrier certificate and its safety controller grows polynomially with network dimension when using a sum-of-squares (SOS) optimization program, our compositional approach significantly reduces it to a linear scale with respect to the number of subsystems. We verify the efficacy of our proposed approach over an interconnected stochastic hybrid system composed of $1000$ nonlinear subsystems.
Submitted 16 September, 2024;
originally announced September 2024.
-
Loop corrections for hard spheres in Hamming space
Authors:
Abolfazl Ramezanpour,
Saman Moghimi-Araghi
Abstract:
We begin with an exact expression for the entropy of a system of hard spheres within the Hamming space. This entropy relies on probability marginals, which are determined by an extended set of Belief Propagation (BP) equations. The BP probability marginals are functions of auxiliary variables which are introduced to model the effects of loopy interactions on a tree-structured interaction graph. We explore various reasonable and approximate probability distributions, ensuring they align with the exact solutions of the BP equations. Our approach is based on an ansatz of (in)homogeneous cavity marginals respecting the permutation symmetry of the problem. Through thorough analysis, we aim to minimize errors in the BP equations. Our findings support the conjecture that the maximum packing density asymptotically conforms to the lower bound proposed by Gilbert and Varshamov, further validated by the solution of the loopy BP equations.
Submitted 5 September, 2024;
originally announced September 2024.
-
Cardinality of groups and rings via the idempotency of infinite cardinals
Authors:
Abolfazl Tarizadeh
Abstract:
An important classical result in ZFC asserts that every infinite cardinal number is idempotent. Using this fact, we obtain several algebraic results in this article. The first result asserts that an infinite Abelian group has a proper subgroup with the same cardinality if and only if it is not a Prüfer group. In the second result, the cardinality of any monoid-ring $R[M]$ (not necessarily commutative) is calculated. In particular, the cardinality of every polynomial ring with any number of variables (possibly infinite) is easily computed. Next, it is shown that every commutative ring and its total ring of fractions have the same cardinality. This set-theoretic observation leads us to a notion in ring theory that we call a balanced ring (i.e. a ring that is canonically isomorphic to its total ring of fractions). Every zero-dimensional ring is a balanced ring. Then we show that a Noetherian ring is a balanced ring if and only if its localization at every maximal ideal has zero depth. It is also proved that every self-injective ring (injective as a module over itself) is a balanced ring.
Submitted 4 September, 2024;
originally announced September 2024.
-
Leveraging Blockchain and ANFIS for Optimal Supply Chain Management
Authors:
Amirfarhad Farhadi,
Homayoun Safarpour Motealegh Mahalegi,
Abolfazl Pourrezaeian Firouzabad,
Azadeh Zamanifar,
Majid Sorouri
Abstract:
The supply chain is a critical segment of the product manufacturing cycle, continuously influenced by risky, uncertain, and undesirable events. Optimizing flexibility in the supply chain presents a complex, multi-objective, and nonlinear programming challenge. In the poultry supply chain, the development of mass customization capabilities has led manufacturing companies to increasingly focus on offering tailored and customized services for individual products. To safeguard against data tampering and ensure the integrity of setup costs and overall profitability, a multi-signature decentralized finance (DeFi) protocol, integrated with the IoT on a blockchain platform, is proposed. Managing the poultry supply chain involves uncertainties that may not account for parameters such as delivery time to retailers, reorder time, and the number of requested products. To address these challenges, this study employs an adaptive neuro-fuzzy inference system (ANFIS), combining neural networks with fuzzy logic to compensate for the lack of data training in parameter identification. Through MATLAB simulations, the study investigates the average shop delivery duration, the reorder time, and the number of products per order. By implementing the proposed technique, the average delivery time decreases from 40 to 37 minutes, the reorder time decreases from five to four days, and the quantity of items requested per order grows from six to eleven. Additionally, the ANFIS model enhances overall supply chain performance by reducing transaction times by 15% compared to conventional systems, thereby improving real-time responsiveness and boosting transparency in supply chain operations, effectively resolving operational issues.
Submitted 2 September, 2024; v1 submitted 30 August, 2024;
originally announced August 2024.
-
Review: Quantum Metrology and Sensing with Many-Body Systems
Authors:
Victor Montenegro,
Chiranjib Mukhopadhyay,
Rozhin Yousefjani,
Saubhik Sarkar,
Utkarsh Mishra,
Matteo G. A. Paris,
Abolfazl Bayat
Abstract:
The main power of quantum sensors is achieved when the probe is composed of several particles. In this situation, quantum features such as entanglement contribute to enhancing the precision of quantum sensors beyond the capacity of classical sensors. Originally, quantum sensing was formulated for non-interacting particles that are prepared in a special form of maximally entangled states. These probes are extremely sensitive to decoherence, and any interaction between particles is detrimental to their performance. An alternative framework for quantum sensing has been developed exploiting quantum many-body systems, where the interaction between particles plays a crucial role. In this review, we investigate different aspects of the latter approach for quantum metrology and sensing. Many-body probes have been used in both equilibrium and non-equilibrium scenarios. Quantum criticality has been identified as a resource for achieving quantum-enhanced sensitivity in both scenarios. In equilibrium, various types of criticalities, such as first-order, second-order, topological, and localization phase transitions, have been exploited for sensing purposes. In non-equilibrium scenarios, quantum-enhanced sensitivity has been discovered for Floquet, dissipative, and time crystal phase transitions. While each type of these criticalities has its own characteristics, the presence of one feature is crucial for achieving quantum-enhanced sensitivity: the energy/quasi-energy gap closing. In non-equilibrium quantum sensing, time is another parameter that can affect the sensitivity of the probe. Typically, the sensitivity enhances as the probe evolves in time. In general, a more complete understanding of resources for non-equilibrium quantum sensors is now rapidly evolving. In this review, we provide an overview of recent progress in quantum metrology and sensing using many-body systems.
Submitted 7 June, 2025; v1 submitted 27 August, 2024;
originally announced August 2024.
-
Submodular Maximization Approaches for Equitable Client Selection in Federated Learning
Authors:
Andrés Catalino Castillo Jiménez,
Ege C. Kaya,
Lintao Ye,
Abolfazl Hashemi
Abstract:
In a conventional Federated Learning framework, client selection for training typically involves the random sampling of a subset of clients in each iteration. However, this random selection often leads to disparate performance among clients, raising concerns regarding fairness, particularly in applications where equitable outcomes are crucial, such as in medical or financial machine learning tasks. This disparity typically becomes more pronounced with the advent of performance-centric client sampling techniques. This paper introduces two novel methods, namely SUBTRUNC and UNIONFL, designed to address the limitations of random client selection. Both approaches utilize submodular function maximization to achieve more balanced models. By modifying the facility location problem, they aim to mitigate the fairness concerns associated with random selection. SUBTRUNC leverages client loss information to diversify solutions, while UNIONFL relies on historical client selection data to ensure a more equitable performance of the final model. Moreover, these algorithms are accompanied by robust theoretical guarantees regarding convergence under reasonable assumptions. The efficacy of these methods is demonstrated through extensive evaluations across heterogeneous scenarios, revealing significant improvements in fairness as measured by a client dissimilarity metric.
Submitted 27 August, 2024; v1 submitted 24 August, 2024;
originally announced August 2024.
-
MPL: Lifting 3D Human Pose from Multi-view 2D Poses
Authors:
Seyed Abolfazl Ghasemzadeh,
Alexandre Alahi,
Christophe De Vleeschouwer
Abstract:
Estimating 3D human poses from 2D images is challenging due to occlusions and projective acquisition. Learning-based approaches have been largely studied to address this challenge, both in single and multi-view setups. These solutions however fail to generalize to real-world cases due to the lack of (multi-view) 'in-the-wild' images paired with 3D poses for training. For this reason, we propose combining 2D pose estimation, for which large and rich training datasets exist, and 2D-to-3D pose lifting, using a transformer-based network that can be trained from synthetic 2D-3D pose pairs. Our experiments demonstrate decreases up to 45% in MPJPE errors compared to the 3D pose obtained by triangulating the 2D poses. The framework's source code is available at https://github.com/aghasemzadeh/OpenMPL .
Submitted 20 August, 2024;
originally announced August 2024.
-
On the Existence of Shimura curves in the Prym locus of abelian covers of projective line
Authors:
Abolfazl Mohajer
Abstract:
Using the theory of Higgs bundles and their stability properties associated to fibered surfaces and the Viehweg-Zuo characterization of Shimura curves in the moduli space of abelian varieties in terms of Higgs bundles, we prove that there do not exist any non-compact Shimura curves in the Prym locus of totally ramified $\Z_{2p}$- or $\Z_{2p}\times (\Z_{p})^{m-1}$-covers of the projective line in $A_{g}$ for $g\geq 8$, where $p\geq 5$ is a prime number.
Submitted 25 July, 2024;
originally announced August 2024.
-
Problem of Locating and Allocating Charging Equipment for Battery Electric Buses under Stochastic Charging Demand
Authors:
Sadjad Bazarnovi,
Taner Cokyasar,
Omer Verbas,
Abolfazl Kouros Mohammadian
Abstract:
Bus electrification plays a crucial role in advancing urban transportation sustainability. Battery Electric Buses (BEBs), however, often need recharging, making the Problem of Locating and Allocating Charging Equipment for BEBs (PLACE-BEB) essential for efficient operations. This study proposes an optimization framework to solve the PLACE-BEB by determining the optimal placement of charger types at potential locations under stochastic charging demand. Leveraging the existing stochastic location literature, we develop a Mixed-Integer Non-Linear Program (MINLP) to model the problem. To solve this problem, we develop an exact solution method that minimizes the costs related to building charging stations, charger allocation, travel to stations, and average queueing and charging times. Queueing dynamics are modeled using an M/M/s queue, with the number of servers at each location treated as a decision variable. To improve scalability, we implement Simulated Annealing (SA) and a Genetic Algorithm (GA), allowing for efficient solutions to large-scale problems. The computational performance of the methods was thoroughly evaluated, revealing that SA was effective for small-scale problems, while GA outperformed the other methods on large-scale instances. A case study comparing garage-only, other-only, and mixed scenarios, along with joint deployment, highlighted the cost benefits of a collaborative and comprehensive approach. Sensitivity analyses showed that waiting time is a key factor to consider in decision-making.
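To illustrate the M/M/s queueing component mentioned in this abstract, the mean waiting time in queue can be computed with the standard Erlang C formula. This is a minimal sketch of that textbook formula only, not the authors' MINLP or solution method; the function name `mms_mean_wait` and the example rates are assumptions made for illustration.

```python
import math


def mms_mean_wait(arrival_rate, service_rate, servers):
    """Mean time spent waiting in queue (Wq) for a stable M/M/s queue."""
    if servers < 1 or arrival_rate <= 0 or service_rate <= 0:
        raise ValueError("rates must be positive and servers must be at least 1")
    offered_load = arrival_rate / service_rate  # a = lambda / mu
    utilization = offered_load / servers        # rho = a / s
    if utilization >= 1:
        raise ValueError("queue is unstable: utilization must be below 1")
    # Erlang C: probability that an arriving customer has to wait.
    head = sum(offered_load ** k / math.factorial(k) for k in range(servers))
    tail = offered_load ** servers / (math.factorial(servers) * (1 - utilization))
    prob_wait = tail / (head + tail)
    # Wq = P(wait) / (s * mu - lambda)
    return prob_wait / (servers * service_rate - arrival_rate)


# Hypothetical station: 2 buses arrive per hour, each charger serves 2 buses per hour.
wait_with_two_chargers = mms_mean_wait(2.0, 2.0, 2)
```

Because the waiting time depends nonlinearly on the number of servers, treating that number as a decision variable is one reason the resulting model is nonlinear.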
Submitted 15 October, 2024; v1 submitted 9 August, 2024;
originally announced August 2024.
-
Ballistic Entanglement Cloud after a Boundary Quench
Authors:
Bedoor Alkurtass,
Abolfazl Bayat,
Pasquale Sodano,
Sougato Bose,
Henrik Johannesson
Abstract:
Entanglement has been extensively used to characterize the structure of strongly correlated many-body systems. Most of these analyses focus on either the spatial properties of entanglement or its temporal behavior. Negativity, as an entanglement measure, quantifies entanglement between different non-complementary blocks of a many-body system. Here, we consider a combined spatial-temporal analysis of entanglement negativity in a strongly correlated many-body system to characterize the complex formation of correlations through the non-equilibrium dynamics of such systems. A bond defect is introduced through a local quench at one of the boundaries of a uniform Heisenberg spin chain. Using negativity and entanglement entropy, computed by the time-dependent density matrix renormalization group, we analyze the extension of entanglement in the model as a function of time. We find that an entanglement cloud is formed, detached from the boundary spin and composed of spins with which it is highly entangled. The cloud travels ballistically in the chain until it reaches the other end, where it reflects back and the cycle repeats. The revival dynamics exhibits an intriguing contraction (expansion) of the cloud as it moves away from (towards) the boundary spin.
Submitted 20 March, 2025; v1 submitted 23 July, 2024;
originally announced July 2024.