-
VISTA: Vision-Language Inference for Training-Free Stock Time-Series Analysis
Authors:
Tina Khezresmaeilzadeh,
Parsa Razmara,
Seyedarmin Azizi,
Mohammad Erfan Sadeghi,
Erfan Baghaei Potraghloo
Abstract:
Stock price prediction remains a complex and high-stakes task in financial analysis, traditionally addressed using statistical models or, more recently, language models. In this work, we introduce VISTA (Vision-Language Inference for Stock Time-series Analysis), a novel, training-free framework that leverages Vision-Language Models (VLMs) for multi-modal stock forecasting. VISTA prompts a VLM with both textual representations of historical stock prices and their corresponding line charts to predict future price values. By combining numerical and visual modalities in a zero-shot setting and using carefully designed chain-of-thought prompts, VISTA captures complementary patterns that unimodal approaches often miss. We benchmark VISTA against standard baselines, including ARIMA and text-only LLM-based prompting methods. Experimental results show that VISTA outperforms these baselines by up to 89.83%, demonstrating the effectiveness of multi-modal inference for stock time-series analysis and highlighting the potential of VLMs in financial forecasting tasks without requiring task-specific training.
Submitted 11 June, 2025; v1 submitted 24 May, 2025;
originally announced May 2025.
-
Closed-loop control of seizure activity via real-time seizure forecasting by reservoir neuromorphic computing
Authors:
Maryam Sadeghi,
Darío Fernández Khatiboun,
Yasser Rezaeiyan,
Saima Rizwan,
Alessandro Barcellona,
Andrea Merello,
Marco Crepaldi,
Gabriella Panuccio,
Farshad Moradi
Abstract:
Closed-loop brain stimulation holds potential as a personalized treatment for drug-resistant epilepsy (DRE) but still suffers from limitations that result in highly variable efficacy. First, stimulation is typically delivered upon detection of the seizure to abort rather than prevent it; second, the stimulation parameters are established by trial and error, requiring lengthy rounds of fine-tuning, which delay steady-state therapeutic efficacy. Here, we address these limitations by leveraging the potential of neuromorphic computing. We present a system capable of driving personalized free-run stimulations based on seizure forecasting, wherein each forecast triggers an electrical pulse rather than an arbitrarily predefined fixed-frequency stimulus train. We validate the system against hippocampal spheroids coupled to a 3D microelectrode array as a simplified testbed, showing that it can achieve seizure reduction >97% while primarily using instantaneous stimulation frequencies within 20 Hz, well below what is typically used in clinical settings. Our work demonstrates the potential of neuromorphic systems as a next-generation neuromodulation strategy for personalized DRE treatment.
Submitted 4 May, 2025;
originally announced May 2025.
-
Exploring Metamaterial Lasers through Non-Hermitian Scattering Formalism
Authors:
Özge Beyza Vardar,
Uğur Tamer,
Mohammad Mehdi Sadeghi,
Mustafa Sarısaman
Abstract:
This study explores the exciting properties of metamaterials and their innovative applications in non-Hermitian physics, with particular emphasis on the scattering formalism, a key topic of recent research. We have analyzed how light behaves in a negative index metamaterial (NIM), allowing us to develop a transfer matrix and identify the essential conditions for the occurrence of spectral singularities. These findings are crucial for fine-tuning system parameters that will drive the development of metamaterial slab lasers and coherent perfect absorber (CPA) systems. Overall, our research demonstrates the enormous potential of metamaterials and their significant role in driving innovation in various technology areas.
Submitted 2 May, 2025;
originally announced May 2025.
-
ELAB: Extensive LLM Alignment Benchmark in Persian Language
Authors:
Zahra Pourbahman,
Fatemeh Rajabi,
Mohammadhossein Sadeghi,
Omid Ghahroodi,
Somaye Bakhshaei,
Arash Amini,
Reza Kazemi,
Mahdieh Soleymani Baghshah
Abstract:
This paper presents a comprehensive evaluation framework for aligning Persian Large Language Models (LLMs) with critical ethical dimensions, including safety, fairness, and social norms. It addresses the gaps in existing LLM evaluation frameworks by adapting them to Persian linguistic and cultural contexts. The benchmark comprises three types of Persian-language data: (i) translated data, (ii) new data generated synthetically, and (iii) new naturally collected data. We translate Anthropic Red Teaming data, AdvBench, HarmBench, and DecodingTrust into Persian. Furthermore, we create ProhibiBench-fa, SafeBench-fa, FairBench-fa, and SocialBench-fa as new datasets to address harmful and prohibited content in indigenous culture. Moreover, we collect an extensive dataset, GuardBench-fa, to account for Persian cultural norms. By combining these datasets, our work establishes a unified framework for evaluating Persian LLMs, offering a new approach to culturally grounded alignment evaluation. A systematic evaluation of Persian LLMs is performed across the three alignment aspects: safety (avoiding harmful content), fairness (mitigating biases), and social norms (adhering to culturally accepted behaviors). We present a publicly available leaderboard that benchmarks Persian LLMs with respect to safety, fairness, and social norms at: https://huggingface.co/spaces/MCILAB/LLM_Alignment_Evaluation.
Submitted 16 April, 2025;
originally announced April 2025.
-
Full-Diversity Construction-D Lattices: Design and Decoding Perspective on Block-Fading Channels
Authors:
Maryam Sadeghi,
Hassan Khodaiemehr,
Chen Feng
Abstract:
This paper introduces a novel framework for constructing algebraic lattices based on Construction-D, leveraging nested linear codes and prime ideals from algebraic number fields. We focus on the application of these lattices in block-fading (BF) channels, which are characterized by piecewise-constant fading across blocks of transmitted symbols. This approach results in a semi-systematic generator matrix, providing a structured foundation for high-dimensional lattice design for BF channels. The proposed Construction-D lattices exhibit the full diversity property, making them highly effective for error performance improvement. We also develop an efficient decoding algorithm designed specifically for full-diversity Construction-D lattices.
Simulations indicate that the proposed lattices notably enhance error performance compared to full-diversity Construction-A lattices in diversity-2 cases. Interestingly, unlike AWGN channels, the expected performance enhancement of Construction-D over Construction-A, resulting from an increased number of nested code levels, was observed only in the two-level and diversity-2 cases. This phenomenon is likely attributed to the intensified effects of error propagation that occur during successive cancellation at higher levels, as well as the higher diversity orders.
These findings highlight the promise of Construction-D lattices as an effective coding strategy for enhancing communication reliability in BF channels.
Submitted 15 April, 2025;
originally announced April 2025.
-
The effect of thermal misbalance on magnetohydrodynamic modes in coronal magnetic cylinders
Authors:
S. M. Hejazi,
T. Van Doorsselaere,
M. Sadeghi,
D. Y. Kolotkov,
J. Hermans
Abstract:
This study investigates the dispersion of magnetohydrodynamic waves influenced by thermal misbalance in a cylindrical configuration with a finite axial magnetic field within solar coronal plasmas. Specifically, it examines how thermal misbalance, characterized by two distinct timescales directly linked to the cooling and heating functions, influences the dispersion relation. This investigation is a key approach for understanding non-adiabatic effects on the behaviour of these waves. Our findings reveal that the effect of thermal misbalance on fast sausage and kink modes, consistent with previous studies on slabs, is small but slightly more pronounced than previously thought. The impact is smaller at long-wavelength limits but increases at shorter wavelengths, leading to higher damping rates. This minor effect on fast modes occurs despite the complex interaction of thermal misbalance terms within the dispersion relation, even at low-frequency limits defined by the characteristic timescales. Additionally, a very small amplification is observed, indicating a suppressed damping state for the long-wavelength fundamental fast kink mode. In contrast, slow magnetoacoustic modes are significantly affected by thermal misbalance, with the cusp frequency shifting slightly to lower values, which is significant for smaller longitudinal wavenumbers. This thermal misbalance likely accounts for the substantial attenuation observed in the propagation of slow magnetoacoustic waves within the solar atmosphere. The long-wavelength limit leads to an analytical expression that accurately describes the frequency shifts in slow modes due to misbalance, closely aligning with both numerical and observational results.
Submitted 3 February, 2025;
originally announced February 2025.
-
VeriFact: Verifying Facts in LLM-Generated Clinical Text with Electronic Health Records
Authors:
Philip Chung,
Akshay Swaminathan,
Alex J. Goodell,
Yeasul Kim,
S. Momsen Reincke,
Lichy Han,
Ben Deverett,
Mohammad Amin Sadeghi,
Abdel-Badih Ariss,
Marc Ghanem,
David Seong,
Andrew A. Lee,
Caitlin E. Coombes,
Brad Bradshaw,
Mahir A. Sufian,
Hyo Jung Hong,
Teresa P. Nguyen,
Mohammad R. Rasouli,
Komal Kamra,
Mark A. Burbridge,
James C. McAvoy,
Roya Saffary,
Stephen P. Ma,
Dev Dash,
James Xie, et al. (4 additional authors not shown)
Abstract:
Methods to ensure factual accuracy of text generated by large language models (LLMs) in clinical medicine are lacking. VeriFact is an artificial intelligence system that combines retrieval-augmented generation and LLM-as-a-Judge to verify whether LLM-generated text is factually supported by a patient's medical history based on their electronic health record (EHR). To evaluate this system, we introduce VeriFact-BHC, a new dataset that decomposes Brief Hospital Course narratives from discharge summaries into a set of simple statements with clinician annotations for whether each statement is supported by the patient's EHR clinical notes. Whereas the highest agreement between clinicians was 88.5%, VeriFact achieves up to 92.7% agreement when compared to a denoised and adjudicated average human clinician ground truth, suggesting that VeriFact exceeds the average clinician's ability to fact-check text against a patient's medical record. VeriFact may accelerate the development of LLM-based EHR applications by removing current evaluation bottlenecks.
Submitted 27 January, 2025;
originally announced January 2025.
-
A theoretical framework to explain non-Nash equilibrium strategic behavior in experimental games
Authors:
Mojtaba Madadi Asl,
Mehdi Sadeghi
Abstract:
Conventional game theory assumes that players are perfectly rational. In a realistic situation, however, players are rarely perfectly rational. This bounded rationality is one of the main reasons why the predictions of Nash equilibrium in normative game theory often diverge from human behavior in real experiments. Motivated by the Boltzmann weight formalism, here we present a theoretical framework to predict the non-Nash equilibrium probabilities of possible outcomes in strategic games by focusing on the differences in expected payoffs of players rather than traditional utility metrics. In this model, bounded rationality is parameterized by assigning a temperature to each player, reflecting their level of rationality by interpolating between two decision-making regimes, i.e., utility maximization and equiprobable choices. Our framework predicts all possible joint strategies and is able to determine the relative probabilities for multiple pure or mixed strategy equilibria. To validate model predictions, by analyzing experimental data we demonstrated that our model can successfully explain non-Nash equilibrium strategic behavior in experimental games. Our approach reinterprets the concept of temperature in game theory, leveraging the development of theoretical frameworks to bridge the gap between the predictions of normative game theory and the results of behavioral experiments.
Submitted 20 January, 2025;
originally announced January 2025.
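The temperature-based interpolation described in this abstract can be illustrated with a standard Boltzmann (logit) choice rule. This is a minimal sketch under stated assumptions, not the authors' model: the function name `boltzmann_choice` and the payoff values are hypothetical, and a single player's choice over fixed expected payoffs is assumed.

```python
import math

def boltzmann_choice(payoffs, temperature):
    """Choice probabilities proportional to exp(payoff / temperature).

    Assumption: a plain Boltzmann weighting, used here only to illustrate
    how a temperature interpolates between two decision regimes.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Shift by the largest payoff for numerical stability. Only payoff
    # differences matter, so the probabilities are unchanged.
    top = max(payoffs)
    weights = [math.exp((p - top) / temperature) for p in payoffs]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical expected payoffs of three strategies.
expected_payoffs = [3.0, 1.0, 0.0]

# Low temperature: nearly all probability on the best strategy.
cold = boltzmann_choice(expected_payoffs, 0.05)
# High temperature: close to equiprobable choices.
hot = boltzmann_choice(expected_payoffs, 1000.0)
```

Because the weights depend only on payoff differences, the low-temperature limit approaches utility maximization and the high-temperature limit approaches uniform choice, which matches the two regimes named in the abstract.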
-
Information, entropy and the paradox of choice: A theoretical framework for understanding choice satisfaction
Authors:
Mojtaba Madadi Asl,
Kamal Hajian,
Rouzbeh Torabi,
Mehdi Sadeghi
Abstract:
Choice overload occurs when individuals feel overwhelmed by an excessive number of options. Experimental evidence suggests that a larger selection can complicate the decision-making process. Consequently, choice satisfaction may diminish when the costs of making a choice outweigh its benefits, indicating that satisfaction follows an inverted U-shaped relationship with the size of the choice set. However, the theoretical underpinnings of this phenomenon remain underexplored. Here, we present a theoretical framework based on relative entropy and effective information to elucidate the inverted U-shaped relationship between satisfaction and choice set size. We begin by positing that individuals assign a probability distribution to a choice set based on their preferences, characterized by an observed Shannon entropy. We then define a maximum entropy that corresponds to a worst-case scenario where individuals are indifferent among options, leading to equal probabilities for all alternatives. We hypothesized that satisfaction is related to the probability of identifying an ideal choice within the set. By comparing observed entropy to maximum entropy, we derive the effective information of choice probabilities, demonstrating that this metric reflects satisfaction with the options available. For smaller choice sets, individuals can more easily identify their best option, resulting in a sharper probability distribution around the preferred choice and, consequently, minimum entropy, which signifies maximum information and satisfaction. Conversely, in larger choice sets, individuals struggle to compare and evaluate all alternatives, leading to missed opportunities and increased entropy. This smooth probability distribution ultimately reduces choice satisfaction, thereby producing the observed inverted U-shaped trend.
Submitted 17 December, 2024;
originally announced December 2024.
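The comparison of observed entropy with maximum entropy in this abstract can be sketched numerically. The snippet below assumes that effective information is the difference between the maximum entropy (uniform preferences over N options) and the observed Shannon entropy; the abstract does not give the exact formula, so this form, the function names, and the example distributions are assumptions for illustration.

```python
import math

def shannon_entropy(probs):
    """Shannon entropy in bits; zero-probability options contribute nothing."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def effective_information(probs):
    """Assumed form: maximum entropy log2(N) minus observed entropy, in bits."""
    return math.log2(len(probs)) - shannon_entropy(probs)

# Hypothetical preference distributions over four options.
sharp = [0.85, 0.05, 0.05, 0.05]  # one clearly preferred option
flat = [0.25, 0.25, 0.25, 0.25]   # indifference among all options

info_sharp = effective_information(sharp)
info_flat = effective_information(flat)
```

Under this assumption the indifferent distribution carries zero effective information, while a distribution peaked on one option carries more, in line with the abstract's link between sharper distributions and higher satisfaction.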
-
Thermodynamic Behavior of a 4D Nonminimal Maxwell-AdS Black Hole
Authors:
Mehdi Sadeghi,
Faramarz Rahmani
Abstract:
In this paper, we derive a black hole solution within the Einstein-Maxwell framework incorporating a nonminimal coupling between the Ricci tensor and the Maxwell field strength tensor, using a perturbative approach. We subsequently explore the thermodynamic phase transitions of the black hole in an extended phase space, analyzing both canonical and grand canonical ensembles. Our findings reveal that the system exhibits Van der Waals-like behavior in both ensembles. Moreover, for sufficiently small values of electric charge and Maxwell potential, the thermodynamics is dominated by a Hawking-Page phase transition.
Submitted 3 June, 2025; v1 submitted 11 December, 2024;
originally announced December 2024.
-
Long-distance feedback to cold atoms coupled to an optical nanofiber
Authors:
Mohammad Sadeghi,
Wayne Crump,
Scott Parkins,
Maarten Hoogerland
Abstract:
We investigate the interaction of spontaneous emission photons generated by a strongly driven laser-cooled atom sample with that same sample after a time delay, which is important for establishing long-distance entanglement between quantum systems. The photons are emitted into an optical nanofiber, connected to a length of conventional optical fiber and reflected back using a Fiber-Bragg Grating mirror. We show that the photon count rates as a function of exciting laser frequency and intensity follow a simple model.
Submitted 1 December, 2024;
originally announced December 2024.
-
Nonlinear Yang-Mills AdS black brane and DC conductivity
Authors:
Mehdi Sadeghi
Abstract:
In this paper, we examine Einstein-Hilbert gravity featuring a cosmological constant and a non-abelian nonlinear electromagnetic field that is minimally coupled to gravity. We first present the black brane solution for this model and subsequently calculate the color non-abelian DC conductivity for this solution using AdS/CFT duality. Our results retrieve the Yang-Mills model in the limit as $q_1$ approaches zero.
Submitted 5 December, 2024; v1 submitted 1 December, 2024;
originally announced December 2024.
-
Joint Beamforming and Speaker-Attributed ASR for Real Distant-Microphone Meeting Transcription
Authors:
Can Cui,
Imran Ahamad Sheikh,
Mostafa Sadeghi,
Emmanuel Vincent
Abstract:
Distant-microphone meeting transcription is a challenging task. State-of-the-art end-to-end speaker-attributed automatic speech recognition (SA-ASR) architectures lack a multichannel noise and reverberation reduction front-end, which limits their performance. In this paper, we introduce a joint beamforming and SA-ASR approach for real meeting transcription. We first describe a data alignment and augmentation method to pretrain a neural beamformer on real meeting data. We then compare fixed, hybrid, and fully neural beamformers as front-ends to the SA-ASR model. Finally, we jointly optimize the fully neural beamformer and the SA-ASR model. Experiments on the real AMI corpus show that, while state-of-the-art multi-frame cross-channel attention-based channel fusion fails to improve ASR performance, fine-tuning SA-ASR on the fixed beamformer's output and jointly fine-tuning SA-ASR with the neural beamformer reduce the word error rate by 8% and 9% relative, respectively.
Submitted 29 October, 2024;
originally announced October 2024.
-
Diffusion-based Unsupervised Audio-visual Speech Enhancement
Authors:
Jean-Eudes Ayilo,
Mostafa Sadeghi,
Romain Serizel,
Xavier Alameda-Pineda
Abstract:
This paper proposes a new unsupervised audio-visual speech enhancement (AVSE) approach that combines a diffusion-based audio-visual speech generative model with a non-negative matrix factorization (NMF) noise model. First, the diffusion model is pre-trained on clean speech conditioned on corresponding video data to simulate the speech generative distribution. This pre-trained model is then paired with the NMF-based noise model to estimate clean speech iteratively. Specifically, a diffusion-based posterior sampling approach is implemented within the reverse diffusion process, where after each iteration, a speech estimate is obtained and used to update the noise parameters. Experimental results confirm that the proposed AVSE approach not only outperforms its audio-only counterpart but also generalizes better than a recent supervised-generative AVSE method. Additionally, the new inference algorithm offers a better balance between inference speed and performance compared to the previous diffusion-based method. Code and demo available at: https://jeaneudesayilo.github.io/fast_UdiffSE
Submitted 15 January, 2025; v1 submitted 4 October, 2024;
originally announced October 2024.
-
Efficient Noise Mitigation for Enhancing Inference Accuracy in DNNs on Mixed-Signal Accelerators
Authors:
Seyedarmin Azizi,
Mohammad Erfan Sadeghi,
Mehdi Kamal,
Massoud Pedram
Abstract:
In this paper, we propose a framework to enhance the robustness of the neural models by mitigating the effects of process-induced and aging-related variations of analog computing components on the accuracy of the analog neural networks. We model these variations as the noise affecting the precision of the activations and introduce a denoising block inserted between selected layers of a pre-trained model. We demonstrate that training the denoising block significantly increases the model's robustness against various noise levels. To minimize the overhead associated with adding these blocks, we present an exploration algorithm to identify optimal insertion points for the denoising blocks. Additionally, we propose a specialized architecture to efficiently execute the denoising blocks, which can be integrated into mixed-signal accelerators. We evaluate the effectiveness of our approach using Deep Neural Network (DNN) models trained on the ImageNet and CIFAR-10 datasets. The results show that on average, by accepting 2.03% parameter count overhead, the accuracy drop due to the variations reduces from 31.7% to 1.15%.
Submitted 27 September, 2024;
originally announced September 2024.
-
Hydrodynamics of Arcsin AdS Black Brane
Authors:
Mehdi Sadeghi
Abstract:
In this paper, we explore a modified black brane within AdS spacetime, characterized by the Lagrangian density $\frac{1}{q}\arcsin(qR) - 2\Lambda$. Due to the absence of an analytic solution, we approach the Einstein equations using a perturbative method, extending our analysis to the second order in $q$. Subsequently, we compute the ratio of shear viscosity to entropy density. Our results suggest that the KSS Bound is not saturated in this model.
Submitted 22 September, 2024;
originally announced September 2024.
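As a worked illustration of the perturbative setup in this abstract, the Lagrangian density can be expanded with the Taylor series $\arcsin x = x + \frac{x^{3}}{6} + \frac{3x^{5}}{40} + \dots$. The expansion below is a sketch derived only from the Lagrangian quoted in the abstract; how the authors organize their perturbation theory beyond that is not stated there. The KSS bound is written in its usual form with $\hbar = k_B = 1$.

```latex
\begin{align}
  \frac{1}{q}\arcsin(qR) - 2\Lambda
    &= R - 2\Lambda + \frac{q^{2}R^{3}}{6} + \frac{3q^{4}R^{5}}{40}
       + \mathcal{O}\!\left(q^{6}\right), \\
  \lim_{q \to 0}\left[\frac{1}{q}\arcsin(qR) - 2\Lambda\right]
    &= R - 2\Lambda, \\
  \frac{\eta}{s} &\ge \frac{1}{4\pi}
    \quad \text{(KSS bound)}.
\end{align}
```

The leading correction is of order $q^{2}$, so truncating at second order in $q$ keeps the $R^{3}$ term, and the limit $q \to 0$ recovers the Einstein-Hilbert Lagrangian with a cosmological constant.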
-
Approximating particle-based clustering dynamics by stochastic PDEs
Authors:
Nathalie Wehlitz,
Mohsen Sadeghi,
Alberto Montefusco,
Christof Schütte,
Grigorios A. Pavliotis,
Stefanie Winkelmann
Abstract:
This work proposes stochastic partial differential equations (SPDEs) as a practical tool to replicate clustering effects of more detailed particle-based dynamics. Inspired by membrane-mediated receptor dynamics on cell surfaces, we formulate a stochastic particle-based model for diffusion and pairwise interaction of particles, leading to intriguing clustering phenomena. Employing numerical simulation and cluster detection methods, we explore the approximation of the particle-based clustering dynamics through mean-field approaches. We find that SPDEs successfully reproduce spatiotemporal clustering dynamics, not only in the initial cluster formation period, but also on longer time scales where the successive merging of clusters cannot be tracked by deterministic mean-field models. The computational efficiency of the SPDE approach allows us to generate extensive statistical data for parameter estimation in a simpler model that uses a Markov jump process to capture the temporal evolution of the cluster number.
Submitted 20 January, 2025; v1 submitted 12 July, 2024;
originally announced July 2024.
-
Exploring Facial Biomarkers for Depression through Temporal Analysis of Action Units
Authors:
Aditya Parikh,
Misha Sadeghi,
Bjorn Eskofier
Abstract:
Depression, characterized by persistent sadness and loss of interest, significantly impairs daily functioning and is now a widespread mental disorder. Traditional diagnostic methods rely on subjective assessments, necessitating objective approaches for accurate diagnosis. Our study investigates the use of facial action units (AUs) and emotions as biomarkers for depression. We analyzed facial expressions from video data of participants classified with or without depression. Our methodology involved detailed feature extraction, mean intensity comparisons of key AUs, and the application of time series classification models. Furthermore, we employed Principal Component Analysis (PCA) and various clustering algorithms to explore the variability in emotional expression patterns. Results indicate significant differences in the intensities of AUs associated with sadness and happiness between the groups, highlighting the potential of facial analysis in depression assessment.
Submitted 18 July, 2024;
originally announced July 2024.
-
CHOSEN: Compilation to Hardware Optimization Stack for Efficient Vision Transformer Inference
Authors:
Mohammad Erfan Sadeghi,
Arash Fayyazi,
Suhas Somashekar,
Armin Abdollahi,
Massoud Pedram
Abstract:
Vision Transformers (ViTs) represent a groundbreaking shift in machine learning approaches to computer vision. Unlike traditional approaches, ViTs employ the self-attention mechanism, which has been widely used in natural language processing, to analyze image patches. Despite their advantages in modeling visual tasks, deploying ViTs on hardware platforms, notably Field-Programmable Gate Arrays (FPGAs), introduces considerable challenges. These challenges stem primarily from the non-linear calculations and high computational and memory demands of ViTs. This paper introduces CHOSEN, a software-hardware co-design framework that addresses these challenges and automates ViT deployment on FPGAs in order to maximize performance. Our framework is built upon three fundamental contributions: (i) a multi-kernel design that maximizes bandwidth, mainly by exploiting multiple DDR memory banks; (ii) approximate non-linear functions that exhibit minimal accuracy degradation and make efficient use of the available logic blocks on the FPGA; and (iii) an efficient compiler that maximizes the performance and memory efficiency of the computing kernels through a novel design space exploration algorithm, which searches for the hardware configuration with optimal throughput and latency. Compared to the state-of-the-art ViT accelerators, CHOSEN achieves 1.5x and 1.42x improvements in throughput on the DeiT-S and DeiT-B models, respectively.
Submitted 10 June, 2025; v1 submitted 17 July, 2024;
originally announced July 2024.
-
Exponential Modification of AdS Black Hole and Thermodynamic Behavior
Authors:
Mehdi Sadeghi,
Faramarz Rahmani
Abstract:
In this paper, we present an exponential modification for the action of an AdS black hole in the absence of a matter field. An approximated black hole solution is obtained up to the third order of perturbation coefficient. A thermodynamic investigation in canonical ensemble shows that the behavior of a Van der Waals fluid is not seen in this model. Nevertheless, the study of thermodynamic potentials and other related quantities suggests that the thermodynamic phase transitions of the first and second types can occur in this model. The forms of the phase transitions are more similar to the Hawking-Page phase transitions.
Submitted 11 July, 2024; v1 submitted 5 July, 2024;
originally announced July 2024.
-
PEANO-ViT: Power-Efficient Approximations of Non-Linearities in Vision Transformers
Authors:
Mohammad Erfan Sadeghi,
Arash Fayyazi,
Seyedarmin Azizi,
Massoud Pedram
Abstract:
The deployment of Vision Transformers (ViTs) on hardware platforms, especially Field-Programmable Gate Arrays (FPGAs), presents many challenges, which are mainly due to the substantial computational and power requirements of their non-linear functions, notably layer normalization, softmax, and Gaussian Error Linear Unit (GELU). These critical functions pose significant obstacles to efficient hardware implementation due to their complex mathematical operations and the inherent resource count and architectural limitations of FPGAs. PEANO-ViT offers a novel approach to streamlining the implementation of the layer normalization layer by introducing a division-free technique that simultaneously approximates the division and square root function. Additionally, PEANO-ViT provides a multi-scale division strategy to eliminate division operations in the softmax layer, aided by a Pade-based approximation for the exponential function. Finally, PEANO-ViT introduces a piece-wise linear approximation for the GELU function, carefully designed to bypass the computationally intensive operations associated with GELU. In our comprehensive evaluations, PEANO-ViT exhibits minimal accuracy degradation (<= 0.5% for DeiT-B) while significantly enhancing power efficiency, achieving improvements of 1.91x, 1.39x, and 8.01x for layer normalization, softmax, and GELU, respectively. This improvement is achieved through substantial reductions in DSP, LUT, and register counts for these non-linear operations. Consequently, PEANO-ViT enables efficient deployment of Vision Transformers on resource- and power-constrained FPGA platforms.
Submitted 16 August, 2024; v1 submitted 20 June, 2024;
originally announced June 2024.
-
ADEP: A Novel Approach Based on Discriminator-Enhanced Encoder-Decoder Architecture for Accurate Prediction of Adverse Effects in Polypharmacy
Authors:
Katayoun Kobraei,
Mehrdad Baradaran,
Seyed Mohsen Sadeghi,
Raziyeh Masumshah,
Changiz Eslahchi
Abstract:
Motivation: Unanticipated drug-drug interactions (DDIs) pose significant risks in polypharmacy, emphasizing the need for predictive methods. Recent advancements in computational techniques aim to address this challenge.
Methods: We introduce ADEP, a novel approach integrating a discriminator and an encoder-decoder model to address data sparsity and enhance feature extraction. ADEP employs a three-part model, including multiple classification methods, to predict adverse effects in polypharmacy.
Results: Evaluation on benchmark datasets shows ADEP outperforms well-known methods such as GGI-DDI, SSF-DDI, LSFC, DPSP, GNN-DDI, MSTE, MDF-SA-DDI, NNPS, DDIMDL, Random Forest, K-Nearest-Neighbor, Logistic Regression, and Decision Tree. Key metrics include Accuracy, AUROC, AUPRC, F-score, Recall, Precision, False Negatives, and False Positives. ADEP achieves more accurate predictions of adverse effects in polypharmacy. A case study with real-world data illustrates ADEP's practical application in identifying potential DDIs and preventing adverse effects.
Conclusions: ADEP significantly advances the prediction of polypharmacy adverse effects, offering improved accuracy and reliability. Its innovative architecture enhances feature extraction from sparse medical data, improving medication safety and patient outcomes.
Availability: Source code and datasets are available at https://github.com/m0hssn/ADEP.
Submitted 31 May, 2024;
originally announced June 2024.
-
Forward-Backward Knowledge Distillation for Continual Clustering
Authors:
Mohammadreza Sadeghi,
Zihan Wang,
Narges Armanfard
Abstract:
Unsupervised Continual Learning (UCL) is a burgeoning field in machine learning, focusing on enabling neural networks to sequentially learn tasks without explicit label information. Catastrophic Forgetting (CF), where models forget previously learned tasks upon learning new ones, poses a significant challenge in continual learning, especially in UCL, where labeled information of data is not accessible. CF mitigation strategies, such as knowledge distillation and replay buffers, often face memory inefficiency and privacy issues. Although current research in UCL has endeavored to refine data representations and address CF in streaming data contexts, there is a noticeable lack of algorithms specifically designed for unsupervised clustering. To fill this gap, in this paper, we introduce the concept of Unsupervised Continual Clustering (UCC). We propose Forward-Backward Knowledge Distillation for unsupervised Continual Clustering (FBCC) to counteract CF within the context of UCC. FBCC employs a single continual learner (the ``teacher'') with a cluster projector, along with multiple student models, to address the CF issue. The proposed method consists of two phases: Forward Knowledge Distillation, where the teacher learns new clusters while retaining knowledge from previous tasks with guidance from specialized student models, and Backward Knowledge Distillation, where a student model mimics the teacher's behavior to retain task-specific knowledge, aiding the teacher in subsequent tasks. FBCC marks a pioneering approach to UCC, demonstrating enhanced performance and memory efficiency in clustering across various tasks, outperforming the application of clustering algorithms to the latent space of state-of-the-art UCL algorithms.
Submitted 29 May, 2024;
originally announced May 2024.
-
A Global Data-Driven Model for The Hippocampus and Nucleus Accumbens of Rat From The Local Field Potential Recordings (LFP)
Authors:
Maedeh Sadeghi,
Mahdi Aliyari Shoorehdeli,
Shole jamali,
Abbas Haghparast
Abstract:
In brain neural networks, Local Field Potential (LFP) signals represent the dynamic flow of information. Analyzing LFP clinical data plays a critical role in improving our understanding of brain mechanisms. One way to enhance our understanding of these mechanisms is to identify a global model to predict brain signals in different situations. This paper identifies a global data-driven model based on LFP recordings of the Nucleus Accumbens and Hippocampus regions in freely moving rats. The LFP is recorded from each rat in two different situations: before and after the process of getting a reward, which can be either a drug (Morphine) or natural food (such as popcorn or biscuit). Five machine learning methods are compared to develop this model: Long Short-Term Memory (LSTM), Echo State Network (ESN), Deep Echo State Network (DeepESN), Radial Basis Function (RBF), and Local Linear Model Tree (LoLiMoT). LoLiMoT was chosen because it performed best among all methods. This model can predict the future states of these regions with one pre-trained model. Identifying this model showed that Morphine and natural rewards do not change the dynamic features of neurons in these regions.
Submitted 10 May, 2024;
originally announced May 2024.
-
Deep Clustering with Self-Supervision using Pairwise Similarities
Authors:
Mohammadreza Sadeghi,
Narges Armanfard
Abstract:
Deep clustering incorporates embedding into clustering to find a lower-dimensional space appropriate for clustering. In this paper, we propose a novel deep clustering framework with self-supervision using pairwise similarities (DCSS). The proposed method consists of two successive phases. In the first phase, we propose to form hypersphere-like groups of similar data points, i.e. one hypersphere per cluster, employing an autoencoder that is trained using cluster-specific losses. The hyper-spheres are formed in the autoencoder's latent space. In the second phase, we propose to employ pairwise similarities to create a $K$-dimensional space that is capable of accommodating more complex cluster distributions, hence providing more accurate clustering performance. $K$ is the number of clusters. The autoencoder's latent space obtained in the first phase is used as the input of the second phase. The effectiveness of both phases is demonstrated on seven benchmark datasets by conducting a rigorous set of experiments.
Submitted 6 May, 2024;
originally announced May 2024.
-
Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications
Authors:
Can Cui,
Imran Ahamad Sheikh,
Mostafa Sadeghi,
Emmanuel Vincent
Abstract:
Past studies on end-to-end meeting transcription have focused on model architecture and have mostly been evaluated on simulated meeting data. We present a novel study aiming to optimize the use of a Speaker-Attributed ASR (SA-ASR) system in real-life scenarios, such as the AMI meeting corpus, for improved speaker assignment of speech segments. First, we propose a pipeline tailored to real-life applications involving Voice Activity Detection (VAD), Speaker Diarization (SD), and SA-ASR. Second, we advocate using VAD output segments to fine-tune the SA-ASR model, considering that it is also applied to VAD segments during test, and show that this results in a relative reduction of Speaker Error Rate (SER) up to 28%. Finally, we explore strategies to enhance the extraction of the speaker embedding templates used as inputs by the SA-ASR system. We show that extracting them from SD output rather than annotated speaker segments results in a relative SER reduction up to 20%.
Submitted 5 September, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
Non-Abelian Exponential Yang-Mills AdS Black Brane and Transport Coefficients
Authors:
Mehdi Sadeghi,
Faramaz Rahmani
Abstract:
In this paper, an AdS black brane solution of Einstein-Hilbert gravity with a non-abelian exponential gauge theory of Yang-Mills type is introduced. The DC conductivity and the ratio of shear viscosity to entropy density, two important transport coefficients, are calculated using the Kubo formula in the context of AdS/CFT duality. Our results recover the Yang-Mills model in the $q\to \infty$ limit.
Submitted 17 June, 2024; v1 submitted 7 March, 2024;
originally announced March 2024.
-
Optimizing Near Field Computation in the MLFMA Algorithm with Data Redundancy and Performance Modeling on a Single GPU
Authors:
Morteza Sadeghi,
Abdolreza Torabi
Abstract:
The Multilevel Fast Multipole Algorithm (MLFMA) has known applications in scientific modeling in the fields of telecommunications, physics, mechanics, and chemistry. Accelerating the far-field calculation using GPUs and GPU clusters for large-scale problems has been studied for more than a decade. The acceleration of the Near Field Computation (P2P operator), however, has been less of a concern because it does not face the distributed-processing challenges that the far field does. This article proposes a modification of the P2P algorithm and uses performance models to determine its optimality criteria. By modeling the speedup, we found that making threads independent by creating redundancy in the data makes the algorithm nearly 13 times faster than the non-redundant mode for lower-density (higher-frequency) problems.
Submitted 3 March, 2024;
originally announced March 2024.
-
SmartEx: A Framework for Generating User-Centric Explanations in Smart Environments
Authors:
Mersedeh Sadeghi,
Lars Herbold,
Max Unterbusch,
Andreas Vogelsang
Abstract:
Explainability is crucial for complex systems like pervasive smart environments, as they collect and analyze data from various sensors, follow multiple rules, and control different devices resulting in behavior that is not trivial and, thus, should be explained to the users. The current approaches, however, offer flat, static, and algorithm-focused explanations. User-centric explanations, on the other hand, consider the recipient and context, providing personalized and context-aware explanations. To address this gap, we propose an approach to incorporate user-centric explanations into smart environments. We introduce a conceptual model and a reference architecture for characterizing and generating such explanations. Our work is the first technical solution for generating context-aware and granular explanations in smart environments. Our architecture implementation demonstrates the feasibility of our approach through various scenarios.
Submitted 20 February, 2024;
originally announced February 2024.
-
Generating Context-Aware Contrastive Explanations in Rule-based Systems
Authors:
Lars Herbold,
Mersedeh Sadeghi,
Andreas Vogelsang
Abstract:
Human explanations are often contrastive, meaning that they do not answer the indeterminate "Why?" question, but instead "Why P, rather than Q?". Automatically generating contrastive explanations is challenging because the contrastive event (Q) represents the expectation of a user in contrast to what happened. We present an approach that predicts a potential contrastive event in situations where a user asks for an explanation in the context of rule-based systems. Our approach analyzes a situation that needs to be explained and then selects the most likely rule a user may have expected instead of what the user has observed. This contrastive event is then used to create a contrastive explanation that is presented to the user. We have implemented the approach as a plugin for a home automation system and demonstrate its feasibility in four test scenarios.
Submitted 20 February, 2024;
originally announced February 2024.
-
The Phase Transition of $4D$ Yang-Mills Charged GB AdS Black Hole with Cloud of Strings
Authors:
Faramarz Rahmani,
Mehdi Sadeghi
Abstract:
In this paper, we present an exact spherically symmetric and Yang-Mills charged AdS black hole solution in the context of $4D$ Einstein-Gauss-Bonnet (EGB) gravity in the presence of a cloud of strings. The regularity of the solution is checked, and its thermodynamics is studied. The critical behavior, the types of phase transitions in the canonical ensemble, the Joule-Thomson expansion, the Clapeyron equation, and the critical exponents are investigated.
Submitted 14 September, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge
Authors:
Simon Leglaive,
Matthieu Fraticelli,
Hend ElGhazaly,
Léonie Borne,
Mostafa Sadeghi,
Scott Wisdom,
Manuel Pariente,
John R. Hershey,
Daniel Pressnitzer,
Jon P. Barker
Abstract:
Supervised models for speech enhancement are trained using artificially generated mixtures of clean speech and noise signals. However, the synthetic training conditions may not accurately reflect real-world conditions encountered during testing. This discrepancy can result in poor performance when the test domain significantly differs from the synthetic training domain. To tackle this issue, the UDASE task of the 7th CHiME challenge aimed to leverage real-world noisy speech recordings from the test domain for unsupervised domain adaptation of speech enhancement models. Specifically, this test domain corresponds to the CHiME-5 dataset, characterized by real multi-speaker and conversational speech recordings made in noisy and reverberant domestic environments, for which ground-truth clean speech signals are not available. In this paper, we present the objective and subjective evaluations of the systems that were submitted to the CHiME-7 UDASE task, and we provide an analysis of the results. This analysis reveals a limited correlation between subjective ratings and several supervised nonintrusive performance metrics recently proposed for speech enhancement. Conversely, the results suggest that more traditional intrusive objective metrics can be used for in-domain performance evaluation using the reverberant LibriCHiME-5 dataset developed for the challenge. The subjective evaluation indicates that all systems successfully reduced the background noise, but always at the expense of increased distortion. Out of the four speech enhancement methods evaluated subjectively, only one demonstrated an improvement in overall quality compared to the unprocessed noisy speech, highlighting the difficulty of the task. The tools and audio material created for the CHiME-7 UDASE task are shared with the community.
Submitted 10 July, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
NU-Class Net: A Novel Approach for Video Quality Enhancement
Authors:
Parham Zilouchian Moghaddam,
Mehdi Modarressi,
Mohammad Amin Sadeghi
Abstract:
Video content has experienced a surge in popularity, asserting its dominance over internet traffic and Internet of Things (IoT) networks. Video compression has long been regarded as the primary means of efficiently managing the substantial multimedia traffic generated by video-capturing devices. Nevertheless, video compression algorithms entail significant computational demands in order to achieve substantial compression ratios. This complexity presents a formidable challenge when implementing efficient video coding standards in resource-constrained embedded systems, such as IoT edge node cameras. To tackle this challenge, this paper introduces NU-Class Net, an innovative deep-learning model designed to mitigate compression artifacts stemming from lossy compression codecs. This enhancement significantly elevates the perceptible quality of low-bit-rate videos. By employing the NU-Class Net, the video encoder within the video-capturing node can reduce output quality, thereby generating low-bit-rate videos and effectively curtailing both computation and bandwidth requirements at the edge. On the decoder side, which is typically less encumbered by resource limitations, NU-Class Net is applied after the video decoder to compensate for artifacts and approximate the quality of the original video. Experimental results affirm the efficacy of the proposed model in enhancing the perceptible quality of videos, especially those streamed at low bit rates.
Submitted 3 June, 2024; v1 submitted 2 January, 2024;
originally announced January 2024.
-
Inverse anisotropic catalysis and complexity
Authors:
Mojtaba Shahbazi,
Mehdi Sadeghi
Abstract:
In this work, the effect of anisotropy on computational complexity is studied using the CA proposal in a holographic two-sided black brane dual of a strongly coupled gauge theory. It is shown that, due to the confinement-deconfinement phase transition, there are two different behaviors: increasing the anisotropy increases the complexity growth rate at small anisotropy and decreases it at large anisotropy. In the extreme case, very large anisotropy leads to the unity of the complexity growth rate and the complexity itself, which means that in this case the target state can be reached from the reference state with no effort. Moreover, we suggest that $\frac{1}{M}\frac{dC}{dt}$ is a better representation of the system's degrees of freedom than the complexity growth rate $\frac{dC}{dt}$ and show how it is related to inverse anisotropic catalysis. In addition, we consider the one-sided black brane dual to the quantum quench and show that an increase in anisotropy comes with a decrease in complexity regardless of the anisotropy value, because the system does not experience a phase transition.
Submitted 27 February, 2025; v1 submitted 1 January, 2024;
originally announced January 2024.
-
End-to-end Joint Punctuated and Normalized ASR with a Limited Amount of Punctuated Training Data
Authors:
Can Cui,
Imran Ahamad Sheikh,
Mostafa Sadeghi,
Emmanuel Vincent
Abstract:
Joint punctuated and normalized automatic speech recognition (ASR), which outputs transcripts with and without punctuation and casing, remains challenging due to the lack of paired speech and punctuated text data in most ASR corpora. We propose two approaches to train an end-to-end joint punctuated and normalized ASR system using limited punctuated data. The first approach uses a language model to convert normalized training transcripts into punctuated transcripts. This achieves better performance on out-of-domain test data, with up to 17% relative Punctuation-Case-aware Word Error Rate (PC-WER) reduction. The second approach uses a single decoder conditioned on the type of output. This yields a 42% relative PC-WER reduction compared to Whisper-base and a 4% relative (normalized) WER reduction compared to the normalized output of a punctuated-only model. Additionally, our proposed model demonstrates the feasibility of a joint ASR system using as little as 5% punctuated training data with a moderate (2.42% absolute) PC-WER increase.
Submitted 29 October, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
Patch-Wise Self-Supervised Visual Representation Learning: A Fine-Grained Approach
Authors:
Ali Javidani,
Mohammad Amin Sadeghi,
Babak Nadjar Araabi
Abstract:
Self-supervised visual representation learning traditionally focuses on image-level instance discrimination. Our study introduces an innovative, fine-grained dimension by integrating patch-level discrimination into these methodologies. This integration allows for the simultaneous analysis of local and global visual features, thereby enriching the quality of the learned representations. Initially, the original images undergo spatial augmentation. Subsequently, we employ a distinctive photometric patch-level augmentation, where each patch is individually augmented, independent from other patches within the same view. This approach generates a diverse training dataset with distinct color variations in each segment. The augmented images are then processed through a self-distillation learning framework, utilizing the Vision Transformer (ViT) as its backbone. The proposed method minimizes the representation distances across both image and patch levels to capture details from macro to micro perspectives. To this end, we present a simple yet effective patch-matching algorithm to find the corresponding patches across the augmented views. Thanks to the efficient structure of the patch-matching algorithm, our method reduces computational complexity compared to similar approaches. Consequently, we achieve an advanced understanding of the model without adding significant computational requirements. We have extensively pretrained our method on datasets of varied scales, such as Cifar10, ImageNet-100, and ImageNet-1K. It demonstrates superior performance over state-of-the-art self-supervised representation learning methods in image classification and downstream tasks, such as copy detection and image retrieval. The implementation of our method is accessible on GitHub.
Submitted 3 June, 2024; v1 submitted 28 October, 2023;
originally announced October 2023.
-
Efficient Active Deep Decoding of Linear Codes using Importance Sampling
Authors:
Hassan Noghrei,
Mohammad-Reza Sadeghi,
Wai Ho Mow
Abstract:
The quality and quantity of data used for training greatly influence the performance and effectiveness of deep learning models. In the context of error correction, it is essential to generate high-quality samples that are neither excessively noisy nor entirely correct but close to the decoding region's decision boundary. To accomplish this objective, this paper utilizes a restricted version of a recent result on Importance Sampling (IS) distribution for fast performance evaluation of linear codes. The IS distribution is used over the segmented observation space and integrated with active learning. This combination allows for the iterative generation of samples from the shells whose acquisition functions, defined as the error probabilities conditioned on each shell, fall within a specific range. By intelligently sampling based on the proposed IS distribution, significant improvements are demonstrated in the performance of BCH(63,36) and BCH(63,45) codes with cycle-reduced parity-check matrices. The proposed IS-based-active Weight Belief Propagation (WBP) decoder shows improvements of up to 0.4dB in the waterfall region and up to 1.9dB in the error-floor region of the BER curve, over the conventional WBP. This approach can be easily adapted to generate efficient samples to train any other deep learning-based decoder.
Submitted 20 October, 2023;
originally announced October 2023.
-
End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis
Authors:
Can Cui,
Imran Ahamad Sheikh,
Mostafa Sadeghi,
Emmanuel Vincent
Abstract:
We present an end-to-end multichannel speaker-attributed automatic speech recognition (MC-SA-ASR) system that combines a Conformer-based encoder with multi-frame cross-channel attention and a speaker-attributed Transformer-based decoder. To the best of our knowledge, this is the first model that efficiently integrates ASR and speaker identification modules in a multichannel setting. On simulated mixtures of LibriSpeech data, our system reduces the word error rate (WER) by up to 12% and 16% relative compared to previously proposed single-channel and multichannel approaches, respectively. Furthermore, we investigate the impact of different input features, including multichannel magnitude and phase information, on the ASR performance. Finally, our experiments on the AMI corpus confirm the effectiveness of our system for real-world multichannel meeting transcription.
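As a hedged illustration of what a relative WER reduction means, the sketch below computes it from two word error rates. The function name and the numeric values are hypothetical and are not taken from the paper.

```python
def relative_wer_reduction(baseline_wer: float, system_wer: float) -> float:
    """Return the relative WER reduction of a system over a baseline, as a fraction."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - system_wer) / baseline_wer


# Hypothetical values for illustration only: a baseline at 25% WER and a
# system at 22% WER correspond to a 12% relative reduction.
example = relative_wer_reduction(0.25, 0.22)
```

A relative figure depends on the baseline: the same 3-point absolute drop is a larger relative gain when the baseline WER is lower.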
Submitted 16 October, 2023;
originally announced October 2023.
-
Diffusion-based speech enhancement with a weighted generative-supervised learning loss
Authors:
Jean-Eudes Ayilo,
Mostafa Sadeghi,
Romain Serizel
Abstract:
Diffusion-based generative models have recently gained attention in speech enhancement (SE), providing an alternative to conventional supervised methods. These models transform clean speech training samples into Gaussian noise centered at noisy speech, and subsequently learn a parameterized model to reverse this process, conditionally on noisy speech. Unlike supervised methods, generative-based SE approaches usually rely solely on an unsupervised loss, which may result in less efficient incorporation of conditioned noisy speech. To address this issue, we propose augmenting the original diffusion training objective with a mean squared error (MSE) loss, measuring the discrepancy between estimated enhanced speech and ground-truth clean speech at each reverse process iteration. Experimental results demonstrate the effectiveness of our proposed methodology.
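One way to read the augmented objective described above is as a weighted sum of the two losses. This is a schematic sketch only: the symbols $\mathcal{L}_{\text{diff}}$ (the original diffusion training loss), $\lambda$ (a weighting factor), $\hat{s}_t$ (the enhanced-speech estimate at reverse step $t$) and $s$ (the ground-truth clean speech) are notation assumed here, not taken from the paper.

$$\mathcal{L}_{\text{total}} = \mathcal{L}_{\text{diff}} + \lambda \, \mathbb{E}\left[ \lVert \hat{s}_t - s \rVert_2^2 \right]$$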
Submitted 19 September, 2023;
originally announced September 2023.
-
Unsupervised speech enhancement with diffusion-based generative models
Authors:
Berné Nortier,
Mostafa Sadeghi,
Romain Serizel
Abstract:
Recently, conditional score-based diffusion models have gained significant attention in the field of supervised speech enhancement, yielding state-of-the-art performance. However, these methods may face challenges when generalising to unseen conditions. To address this issue, we introduce an alternative approach that operates in an unsupervised manner, leveraging the generative power of diffusion models. Specifically, in a training phase, a clean speech prior distribution is learnt in the short-time Fourier transform (STFT) domain using score-based diffusion models, allowing it to unconditionally generate clean speech from Gaussian noise. Then, we develop a posterior sampling methodology for speech enhancement by combining the learnt clean speech prior with a noise model for speech signal inference. The noise parameters are simultaneously learnt along with clean speech estimation through an iterative expectation-maximisation (EM) approach. To the best of our knowledge, this is the first work exploring diffusion-based generative models for unsupervised speech enhancement, demonstrating promising results compared to a recent variational auto-encoder (VAE)-based unsupervised approach and a state-of-the-art diffusion-based supervised method. It thus opens a new direction for future research in unsupervised speech enhancement.
Submitted 19 September, 2023;
originally announced September 2023.
-
Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoder
Authors:
Mostafa Sadeghi,
Romain Serizel
Abstract:
In this paper, we address the unsupervised speech enhancement problem based on the recurrent variational autoencoder (RVAE). This approach offers promising generalization performance over its supervised counterpart. Nevertheless, the iterative variational expectation-maximization (VEM) process involved at test time, which relies on a variational inference method, results in high computational complexity. To tackle this issue, we present efficient sampling techniques based on Langevin dynamics and Metropolis-Hastings algorithms, adapted to EM-based speech enhancement with RVAE. By directly sampling from the intractable posterior distribution within the EM process, we circumvent the intricacies of variational inference. We conduct a series of experiments, comparing the proposed methods with VEM and a state-of-the-art supervised speech enhancement approach based on diffusion models. The results reveal that our sampling-based algorithms significantly outperform VEM, not only in terms of computational efficiency but also in overall performance. Furthermore, when compared to the supervised baseline, our methods showcase robust generalization performance in mismatched test conditions.
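To make the sampling idea concrete, here is a minimal sketch of a generic random-walk Metropolis-Hastings sampler on a toy one-dimensional target. It is not the paper's algorithm: the RVAE posterior, the EM loop and the Langevin variant are omitted, and the function names, step size and standard-normal target are assumptions chosen for illustration.

```python
import math
import random


def metropolis_hastings(log_target, x0, n_samples, step=1.0, seed=0):
    """Draw samples from an unnormalized density given its log, via random-walk MH."""
    rng = random.Random(seed)
    x = x0
    log_p = log_target(x)
    samples = []
    for _ in range(n_samples):
        # Symmetric Gaussian proposal centred on the current state.
        proposal = x + rng.gauss(0.0, step)
        log_p_new = log_target(proposal)
        # Accept with probability min(1, p_new / p_old); the proposal is
        # symmetric, so the proposal densities cancel in the ratio.
        if math.log(1.0 - rng.random()) < log_p_new - log_p:
            x, log_p = proposal, log_p_new
        samples.append(x)
    return samples


def log_standard_normal(x):
    """Log-density of a standard normal, up to an additive constant (toy target)."""
    return -0.5 * x * x


samples = metropolis_hastings(log_standard_normal, x0=0.0, n_samples=20000)
```

Only an unnormalized log-density is needed, which is why this family of samplers is applicable when the posterior is intractable.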
Submitted 19 September, 2023;
originally announced September 2023.
-
Conductivity and Shear Viscosity of $arcsin$-Yang-Mills AdS Black Brane
Authors:
Mehdi Sadeghi,
S. M. Moosavi Khansari
Abstract:
In this paper, a non-abelian $arcsin$-Yang-Mills AdS black brane solution is introduced. Then, the color non-abelian direct current (DC) conductivity and the ratio of shear viscosity to entropy density of this model are calculated using fluid-gravity duality. Our results show that the Kovtun, Son and Starinets (KSS) bound is saturated, with the ratio exactly equal to $\frac{1}{4\pi}$, but the color conductivity bound is violated for this model. Also, our results recover the Yang-Mills AdS black brane when the coupling of Yang-Mills and gravity fields approaches zero.
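For reference, the KSS bound is a conjectured lower bound on the ratio of shear viscosity $\eta$ to entropy density $s$, and saturation means that equality holds. In natural units ($\hbar = k_B = 1$) it reads:

$$\frac{\eta}{s} \geq \frac{1}{4\pi}$$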
Submitted 24 August, 2023; v1 submitted 17 August, 2023;
originally announced August 2023.
-
Explanation Needs in App Reviews: Taxonomy and Automated Detection
Authors:
Max Unterbusch,
Mersedeh Sadeghi,
Jannik Fischbach,
Martin Obaidi,
Andreas Vogelsang
Abstract:
Explainability, i.e. the ability of a system to explain its behavior to users, has become an important quality of software-intensive systems. Recent work has focused on methods for generating explanations for various algorithmic paradigms (e.g., machine learning, self-adaptive systems). There is relatively little work on what situations and types of behavior should be explained. There is also a lack of support for eliciting explainability requirements. In this work, we explore the need for explanation expressed by users in app reviews. We manually coded a set of 1,730 app reviews from 8 apps and derived a taxonomy of Explanation Needs. We also explore several approaches to automatically identify Explanation Needs in app reviews. Our best classifier identifies Explanation Needs in 486 unseen reviews of 4 different apps with a weighted F-score of 86%. Our work contributes to a better understanding of users' Explanation Needs. Automated tools can help engineers focus on these needs and ultimately elicit valid Explanation Needs.
Submitted 10 July, 2023;
originally announced July 2023.
-
The CHiME-7 UDASE task: Unsupervised domain adaptation for conversational speech enhancement
Authors:
Simon Leglaive,
Léonie Borne,
Efthymios Tzinis,
Mostafa Sadeghi,
Matthieu Fraticelli,
Scott Wisdom,
Manuel Pariente,
Daniel Pressnitzer,
John R. Hershey
Abstract:
Supervised speech enhancement models are trained using artificially generated mixtures of clean speech and noise signals, which may not match real-world recording conditions at test time. This mismatch can lead to poor performance if the test domain significantly differs from the synthetic training domain. This paper introduces the unsupervised domain adaptation for conversational speech enhancement (UDASE) task of the 7th CHiME challenge. This task aims to leverage real-world noisy speech recordings from the target domain for unsupervised domain adaptation of speech enhancement models. The target domain corresponds to the multi-speaker reverberant conversational speech recordings of the CHiME-5 dataset, for which the ground-truth clean speech reference is unavailable. Given a CHiME-5 recording, the task is to estimate the clean, potentially multi-speaker, reverberant speech, removing the additive background noise. We discuss the motivation for the CHiME-7 UDASE task and describe the data, the task, and the baseline system.
Submitted 2 October, 2023; v1 submitted 7 July, 2023;
originally announced July 2023.
-
Open Source-based Over-The-Air 5G New Radio Sidelink Testbed
Authors:
Melissa Elkadi,
Doekseong Kim,
Ejaz Ahmed,
Moein Sadeghi,
Anh Le,
Paul Russell,
Bo Ryu
Abstract:
The focus of this paper is to demonstrate an over-the-air (OTA) 5G new radio (NR) sidelink communication prototype. 5G NR sidelink communications allow NR UEs to transfer data independently without the assistance of a base station (gNB), which enables V2X communications, including platooning, autonomous driving, sensor extension, industrial IoT, public safety communication and much more. Our design leverages the open-source OpenAirInterface5G (OAI) software, which operates on software-defined radios (SDRs) and can be easily extended for mesh networking. The software includes all signal processing components specified by the 3GPP 5G sidelink standards, including Low-Density Parity Check (LDPC) encoding/decoding, polar encoding/decoding, data and control multiplexing, modulation/demodulation, and orthogonal frequency-division multiplexing (OFDM) modulation/demodulation. It can be configured to operate with different bands, bandwidths, and antenna settings. The first milestone in this work was to demonstrate the completed Physical Sidelink Broadcast Channel (PSBCH) development, which conducts synchronization between a Synchronization Reference (SyncRef) UE and a nearby UE. The SyncRef UE broadcasts a sidelink synchronization signal block (S-SSB) periodically, which the nearby UE detects and uses to synchronize its timing and frequency components with the SyncRef UE. Once a connection is established, the next developmental milestone is to transmit real data (text messages) via the Physical Sidelink Shared Channel (PSSCH). Our PHY sidelink framework is tested using both an RF simulator and an OTA testbed with multiple nearby UEs. Beyond the development of synchronization and data transmission/reception in 5G sidelink, we conclude with various performance tests and validation experiments. The results of these metrics show that our simulator is comparable to the OTA testbed.
Submitted 6 October, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
The Phase Transition of Non-minimal Yang-Mills AdS Black Brane
Authors:
Mehdi Sadeghi,
Faramaz Rahmani
Abstract:
In this paper, we study the phase transition of the non-minimal coupling of Einstein-Hilbert gravity and an electric field of Yang-Mills type in AdS space-time. We couple the Ricci scalar to the Yang-Mills invariant to obtain a modified theory of gravity. A black brane solution is introduced up to the first order of the term $RF^{(a)}_{μα}F^{(a)μα} $ in this model. Then, the phase transition of this solution is investigated in the canonical ensemble. Our investigation shows that only second-order phase transition behavior is seen in this model. Also, due to the coupling of the Yang-Mills field and the Ricci scalar, there are differences from the phase transitions of the usual minimal models. We show that in the absence of non-minimal coupling there is no phase transition.
Submitted 20 January, 2024; v1 submitted 8 June, 2023;
originally announced June 2023.
-
DC: Depth Control on Quantum Classical Circuit
Authors:
Movahhed Sadeghi,
Soheil Khadirsharbiyani,
Mostafa Eghbali Zarch,
Mahmut Taylan Kandemir
Abstract:
The growing prevalence of noisy intermediate-scale quantum (NISQ) systems has brought a heightened focus on the issue of circuit reliability. Several quantum computing activities, such as circuit design and multi-qubit mapping, are focused on enhancing reliability through different optimization techniques. The optimization of quantum classical circuits has been the subject of substantial research, with a focus on techniques such as ancilla-qubit reuse and tactics aimed at minimizing circuit size and depth. Nevertheless, the reliability of bigger and more complex circuits remains a difficulty due to potential failures or the need for time-consuming compilation processes, despite the use of modern optimization strategies.
This study presents a Depth Control (DC) methodology that involves slicing and lowering the depth of conventional circuits. This strategy aims to improve the reliability and decrease the mapping costs associated with quantum hardware. DC provides reliable outcomes for circuits of indefinite size on any NISQ system. The experimental findings demonstrate that the use of DC leads to a substantial improvement in the Probability of Success Threshold (PST), with an average increase of 11x compared to non-DC baselines. Furthermore, DC exhibits a notable superiority over the next best outcome by ensuring accurate outputs with a considerable margin. In addition, the use of Depth Control (DC) enables mapping and routing optimizations to be executed in polynomial time, which represents an advancement over previously suggested methods that require exponential time.
Submitted 30 September, 2023; v1 submitted 19 May, 2023;
originally announced May 2023.
-
The Effect of Flow and Magnetic Twist on Resonant Absorption of Slow MHD Waves in Magnetic Flux Tubes
Authors:
Mohammad Sadeghi,
Karam Bahari,
Kayoomars Karami
Abstract:
Observations show that twisted magnetic flux tubes and plasma flows are present throughout the solar atmosphere. The main purpose of this work is to obtain the damping rate of sausage modes in the presence of magnetic twist and plasma flow. We obtain the dispersion relation for sausage modes in the slow continuum in an inhomogeneous layer under magnetic pore conditions, and then solve it numerically. For the selected profiles of density, magnetic field, and plasma flow as functions of radius across the inhomogeneous layer, we show that the effect of the twisted magnetic field on resonant absorption is greater at low plasma flow speeds than at high speeds.
Submitted 9 April, 2023;
originally announced April 2023.
-
AdS Black Hole with Cylindrical Symmetry
Authors:
Mehdi Sadeghi,
Ramin Anvari Asl,
Mohammad Shamseh
Abstract:
In this paper, we consider Einstein-Hilbert gravity in the presence of a cosmological constant with cylindrical symmetry to introduce the black hole solution of this model. Here, we solve Einstein's vacuum field equation and then calculate the appropriate metric for this problem.
Submitted 25 March, 2023;
originally announced March 2023.
-
Holographic Aspects of Non-minimal $RF^{(a)}_{μα}F^{(a)μα} $ Black Brane
Authors:
Mehdi Sadeghi
Abstract:
In this paper, we consider Einstein-Hilbert gravity in the presence of cosmological constant and an electric field of Yang-Mills type, which is minimally coupled to gravity. We couple the Ricci scalar to the Yang-Mills invariant to obtain a modified theory of gravity. The black brane solution of this model is introduced up to the first order of the $RF^{(a)}_{μα}F^{(a)μα} $ term. Then, the color non-abelian direct current (DC) conductivity and the ratio of shear viscosity to entropy density are calculated for this solution. Our results recover the Yang-Mills Schwarzschild AdS black brane in the limit of $q_2 \to 0$.
Submitted 14 July, 2023; v1 submitted 14 February, 2023;
originally announced February 2023.