-
The Effects of GitHub Copilot on Computing Students' Programming Effectiveness, Efficiency, and Processes in Brownfield Programming Tasks
Authors:
Md Istiak Hossain Shihab,
Christopher Hundhausen,
Ahsun Tariq,
Summit Haque,
Yunhan Qiao,
Brian Mulanda
Abstract:
When graduates of computing degree programs enter the software industry, they will most likely join teams working on legacy code bases developed by people other than themselves. In these so-called brownfield software development settings, generative artificial intelligence (GenAI) coding assistants like GitHub Copilot are rapidly transforming software development practices, yet the impact of GenAI…
▽ More
When graduates of computing degree programs enter the software industry, they will most likely join teams working on legacy code bases developed by people other than themselves. In these so-called brownfield software development settings, generative artificial intelligence (GenAI) coding assistants like GitHub Copilot are rapidly transforming software development practices, yet the impact of GenAI on student programmers performing brownfield development tasks remains underexplored. This paper investigates how GitHub Copilot influences undergraduate students' programming performance, behaviors, and understanding when completing brownfield programming tasks in which they add new code to an unfamiliar code base. We conducted a controlled experiment in which 10 undergraduate computer science students completed highly similar brownfield development tasks with and without Copilot in a legacy web application. Using a mixed-methods approach combining performance analysis, behavioral analysis, and exit interviews, we found that students completed tasks 35% faster (p < 0.05) and made 50% more solution progress p (< 0.05) when using Copilot. Moreover, our analysis revealed that, when using Copilot, students spent 11% less time manually writing code (p < 0.05), and 12% less time conducting web searches (p < 0.05), providing evidence of a fundamental shift in how they engaged in programming. In exit interviews, students reported concerns about not understanding how or why Copilot suggestions work. This research suggests the need for computing educators to develop new pedagogical approaches that leverage GenAI assistants' benefits while fostering reflection on how and why GenAI suggestions address brownfield programming tasks. Complete study results and analysis are presented at https://ghcopilot-icer.github.io/.
△ Less
Submitted 11 June, 2025;
originally announced June 2025.
-
A case study of translating sonifications across musical cultures for an educational application
Authors:
Chris M. Harrison,
James W. Trayford,
Arron George,
Leigh Harrison,
Rubén García-Benito,
Shirin Haque,
Rose Shepherd
Abstract:
Sonification can be part of educational resources that can be accessible to those who prefer, or require, non-visual learning methods. Furthermore, sonification can contribute to an engaging multi-sensory learning experience, which are known to benefit general learners. Whilst some sonification can be relatively agnostic to musical culture, many sonifications are subject to culturally influenced c…
▽ More
Sonification can be part of educational resources that can be accessible to those who prefer, or require, non-visual learning methods. Furthermore, sonification can contribute to an engaging multi-sensory learning experience, which are known to benefit general learners. Whilst some sonification can be relatively agnostic to musical culture, many sonifications are subject to culturally influenced choices, such as the chosen harmonies, rhythmic structures, and instrumentation. This is important when considering how universally inclusive and relatable sonification-based educational resources will be. Here we present a case study of translating a sonification-based educational show about the Solar System, that was originally designed with influences from Euro-American (Western-classical) music, to be more culturally relevant to the Caribbean region. We describe the motivation, approach, some of the challenges, and the initial feedback of the resulting output of the project. Finally, we provide reflections on the importance of further work exploring how educational sonifications can transcend international borders and musical cultures.
△ Less
Submitted 9 June, 2025;
originally announced June 2025.
-
Reactive Glass Metal Interaction under Ambient Conditions Enables Surface Modification of Gold Nanoislands
Authors:
Sinorul Haque,
Shweta R. Keshri,
G. Ganesh,
Kaustuv Chatterjee,
Shubhangi Majumdar,
Sudheer Ganisetti,
Indrajeet Mandal,
Dudekula Althaf Basha,
Prabir Pal,
Pramit K Chowdhury,
Niharika Joshi,
Subrahmanyam Sappati,
Nitya Nand Gosvami,
Eswaraiah Varrla,
N. M. Anoop Krishnan,
Amarnath R. Allu
Abstract:
Stabilizing gold nanoparticles with tunable surface composition via reactive metal support interactions under ambient conditions remains a significant challenge. We discovered that a reactive glass metal interaction (RGMI) under ambient conditions, driven by the intrinsic catalytic activity of gold nanoislands (GNIs) and the unique properties of sodium aluminophosphosilicate glass, including its c…
▽ More
Stabilizing gold nanoparticles with tunable surface composition via reactive metal support interactions under ambient conditions remains a significant challenge. We discovered that a reactive glass metal interaction (RGMI) under ambient conditions, driven by the intrinsic catalytic activity of gold nanoislands (GNIs) and the unique properties of sodium aluminophosphosilicate glass, including its chemical composition, molar volume, and high Na ion mobility, enables the formation of robustly anchored GNIs with altered surface compositions. Comprehensive characterization reveals that the adsorption of Na and P at the GNI surfaces induces lattice distortions in the Au(111) planes. Additionally, a smooth GNI glass interface significantly influences the hot carrier dynamics of the GNIs. Altogether, RGMI presents a versatile strategy for engineering stable, multi element nanostructures with potential applications in heterogeneous catalysis, sensing, and optoelectronics.
△ Less
Submitted 20 May, 2025;
originally announced May 2025.
-
Early Accessibility: Automating Alt-Text Generation for UI Icons During App Development
Authors:
Sabrina Haque,
Christoph Csallner
Abstract:
Alt-text is essential for mobile app accessibility, yet UI icons often lack meaningful descriptions, limiting accessibility for screen reader users. Existing approaches either require extensive labeled datasets, struggle with partial UI contexts, or operate post-development, increasing technical debt. We first conduct a formative study to determine when and how developers prefer to generate icon a…
▽ More
Alt-text is essential for mobile app accessibility, yet UI icons often lack meaningful descriptions, limiting accessibility for screen reader users. Existing approaches either require extensive labeled datasets, struggle with partial UI contexts, or operate post-development, increasing technical debt. We first conduct a formative study to determine when and how developers prefer to generate icon alt-text. We then explore the ALTICON approach for generating alt-text for UI icons during development using two fine-tuned models: a text-only large language model that processes extracted UI metadata and a multi-modal model that jointly analyzes icon images and textual context. To improve accuracy, the method extracts relevant UI information from the DOM tree, retrieves in-icon text via OCR, and applies structured prompts for alt-text generation. Our empirical evaluation with the most closely related deep-learning and vision-language models shows that ALTICON generates alt-text that is of higher quality while not requiring a full-screen input.
△ Less
Submitted 17 April, 2025;
originally announced April 2025.
-
Evidence of Phase Transition from Binary Neutron Star Merger
Authors:
Sagnik Chatterjee,
Shamim Haque,
Kamal Krishna Nath,
Ritam Mallick,
Rana Nandi
Abstract:
Binary neutron-star mergers offer crucial insights into the matter properties of neutron stars. We present direct evidence of phase transition on the observational signatures from such events. Our study employs a range of equations of states with hadron-quark phase transition surveying from Maxwell construction smoothened up to the Gibbs construction using a control parameter $Δp$. This smoothenin…
▽ More
Binary neutron-star mergers offer crucial insights into the matter properties of neutron stars. We present direct evidence of phase transition on the observational signatures from such events. Our study employs a range of equations of states with hadron-quark phase transition surveying from Maxwell construction smoothened up to the Gibbs construction using a control parameter $Δp$. This smoothening parameter allows us to explore different mixed phases and analyse their direct impact on merger dynamics. Post-merger gravitational wave emissions reveal the expression of specific signatures in the spectogram and power spectral density, serving as a distinct signature of equations of state with mixed phases. We found additional peaks in power spectral density that were fully responsible from the post-merger remnant experiencing a phase transition. Alongside this signature, the nature of phase transition transition leaves specific imprints on the spectrogram, leading to a two-folded signature from gravitational wave analysis. Furthermore, we establish a direct correlation between our findings and the threshold mass for prompt collapse. Our analysis also provides key evidence against a Maxwell-type phase transition for GW170817 if the post-merger is believed to have experienced a prompt collapse into a black hole. We show that $Δp \gtrsim 0.04$ is required for such a scenario. Effects of $Δp$ have also been observed on the ejecta mass from the event, which can affect the kilonova afterglow.
△ Less
Submitted 29 March, 2025;
originally announced March 2025.
-
RxRx3-core: Benchmarking drug-target interactions in High-Content Microscopy
Authors:
Oren Kraus,
Federico Comitani,
John Urbanik,
Kian Kenyon-Dean,
Lakshmanan Arumugam,
Saber Saberian,
Cas Wognum,
Safiye Celik,
Imran S. Haque
Abstract:
High Content Screening (HCS) microscopy datasets have transformed the ability to profile cellular responses to genetic and chemical perturbations, enabling cell-based inference of drug-target interactions (DTI). However, the adoption of representation learning methods for HCS data has been hindered by the lack of accessible datasets and robust benchmarks. To address this gap, we present RxRx3-core…
▽ More
High Content Screening (HCS) microscopy datasets have transformed the ability to profile cellular responses to genetic and chemical perturbations, enabling cell-based inference of drug-target interactions (DTI). However, the adoption of representation learning methods for HCS data has been hindered by the lack of accessible datasets and robust benchmarks. To address this gap, we present RxRx3-core, a curated and compressed subset of the RxRx3 dataset, and an associated DTI benchmarking task. At just 18GB, RxRx3-core significantly reduces the size barrier associated with large-scale HCS datasets while preserving critical data necessary for benchmarking representation learning models against a zero-shot DTI prediction task. RxRx3-core includes 222,601 microscopy images spanning 736 CRISPR knockouts and 1,674 compounds at 8 concentrations. RxRx3-core is available on HuggingFace and Polaris, along with pre-trained embeddings and benchmarking code, ensuring accessibility for the research community. By providing a compact dataset and robust benchmarks, we aim to accelerate innovation in representation learning methods for HCS data and support the discovery of novel biological insights.
△ Less
Submitted 30 May, 2025; v1 submitted 25 March, 2025;
originally announced March 2025.
-
Finite-Time Bounds for Two-Time-Scale Stochastic Approximation with Arbitrary Norm Contractions and Markovian Noise
Authors:
Siddharth Chandak,
Shaan Ul Haque,
Nicholas Bambos
Abstract:
Two-time-scale Stochastic Approximation (SA) is an iterative algorithm with applications in reinforcement learning and optimization. Prior finite time analysis of such algorithms has focused on fixed point iterations with mappings contractive under Euclidean norm. Motivated by applications in reinforcement learning, we give the first mean square bound on non linear two-time-scale SA where the iter…
▽ More
Two-time-scale Stochastic Approximation (SA) is an iterative algorithm with applications in reinforcement learning and optimization. Prior finite time analysis of such algorithms has focused on fixed point iterations with mappings contractive under Euclidean norm. Motivated by applications in reinforcement learning, we give the first mean square bound on non linear two-time-scale SA where the iterations have arbitrary norm contractive mappings and Markovian noise. We show that the mean square error decays at a rate of $O(1/n^{2/3})$ in the general case, and at a rate of $O(1/n)$ in a special case where the slower timescale is noiseless. Our analysis uses the generalized Moreau envelope to handle the arbitrary norm contractions and solutions of Poisson equation to deal with the Markovian noise. By analyzing the SSP Q-Learning algorithm, we give the first $O(1/n)$ bound for an algorithm for asynchronous control of MDPs under the average reward criterion. We also obtain a rate of $O(1/n)$ for Q-Learning with Polyak-averaging and provide an algorithm for learning Generalized Nash Equilibrium (GNE) for strongly monotone games which converges at a rate of $O(1/n^{2/3})$.
△ Less
Submitted 24 March, 2025;
originally announced March 2025.
-
CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation
Authors:
Masud Ahmed,
Zahid Hasan,
Syed Arefinul Haque,
Abu Zaher Md Faridee,
Sanjay Purushotham,
Suya You,
Nirmalya Roy
Abstract:
Traditional transformer-based semantic segmentation relies on quantized embeddings. However, our analysis reveals that autoencoder accuracy on segmentation mask using quantized embeddings (e.g. VQ-VAE) is 8% lower than continuous-valued embeddings (e.g. KL-VAE). Motivated by this, we propose a continuous-valued embedding framework for semantic segmentation. By reformulating semantic mask generatio…
▽ More
Traditional transformer-based semantic segmentation relies on quantized embeddings. However, our analysis reveals that autoencoder accuracy on segmentation mask using quantized embeddings (e.g. VQ-VAE) is 8% lower than continuous-valued embeddings (e.g. KL-VAE). Motivated by this, we propose a continuous-valued embedding framework for semantic segmentation. By reformulating semantic mask generation as a continuous image-to-embedding diffusion process, our approach eliminates the need for discrete latent representations while preserving fine-grained spatial and semantic details. Our key contribution includes a diffusion-guided autoregressive transformer that learns a continuous semantic embedding space by modeling long-range dependencies in image features. Our framework contains a unified architecture combining a VAE encoder for continuous feature extraction, a diffusion-guided transformer for conditioned embedding generation, and a VAE decoder for semantic mask reconstruction. Our setting facilitates zero-shot domain adaptation capabilities enabled by the continuity of the embedding space. Experiments across diverse datasets (e.g., Cityscapes and domain-shifted variants) demonstrate state-of-the-art robustness to distribution shifts, including adverse weather (e.g., fog, snow) and viewpoint variations. Our model also exhibits strong noise resilience, achieving robust performance ($\approx$ 95% AP compared to baseline) under gaussian noise, moderate motion blur, and moderate brightness/contrast variations, while experiencing only a moderate impact ($\approx$ 90% AP compared to baseline) from 50% salt and pepper noise, saturation and hue shifts. Code available: https://github.com/mahmed10/CAMSS.git
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
A fast and slightly robust covariance estimator
Authors:
John Duchi,
Saminul Haque,
Rohith Kuditipudi
Abstract:
Let $\mathcal{Z} = \{Z_1, \dots, Z_n\} \stackrel{\mathrm{i.i.d.}}{\sim} P \subset \mathbb{R}^d$ from a distribution $P$ with mean zero and covariance $Σ$. Given a dataset $\mathcal{X}$ such that $d_{\mathrm{ham}}(\mathcal{X}, \mathcal{Z}) \leq \varepsilon n$, we are interested in finding an efficient estimator $\widehatΣ$ that achieves…
▽ More
Let $\mathcal{Z} = \{Z_1, \dots, Z_n\} \stackrel{\mathrm{i.i.d.}}{\sim} P \subset \mathbb{R}^d$ from a distribution $P$ with mean zero and covariance $Σ$. Given a dataset $\mathcal{X}$ such that $d_{\mathrm{ham}}(\mathcal{X}, \mathcal{Z}) \leq \varepsilon n$, we are interested in finding an efficient estimator $\widehatΣ$ that achieves $\mathrm{err}(\widehatΣ, Σ) := \|Σ^{-\frac{1}{2}}\widehatΣΣ^{-\frac{1}{2}} - I\| _{\mathrm{op}} \leq 1/2$. We focus on the low contamination regime $\varepsilon = o(1/\sqrt{d}$). In this regime, prior work required either $Ω(d^{3/2})$ samples or runtime that is exponential in $d$. We present an algorithm that, for subgaussian data, has near-linear sample complexity $n = \widetildeΩ(d)$ and runtime $O((n+d)^{ω+ \frac{1}{2}})$, where $ω$ is the matrix multiplication exponent. We also show that this algorithm works for heavy-tailed data with near-linear sample complexity, but in a smaller regime of $\varepsilon$. Concurrent to our work, Diakonikolas et al. [2024] give Sum-of-Squares estimators that achieve similar sample complexity but with large polynomial runtime.
△ Less
Submitted 27 February, 2025;
originally announced February 2025.
-
A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms
Authors:
Zaiwei Chen,
Sheng Zhang,
Zhe Zhang,
Shaan Ul Haque,
Siva Theja Maguluri
Abstract:
We study the problem of solving fixed-point equations for seminorm-contractive operators and establish foundational results on the non-asymptotic behavior of iterative algorithms in both deterministic and stochastic settings. Specifically, in the deterministic setting, we prove a fixed-point theorem for seminorm-contractive operators, showing that iterates converge geometrically to the kernel of t…
▽ More
We study the problem of solving fixed-point equations for seminorm-contractive operators and establish foundational results on the non-asymptotic behavior of iterative algorithms in both deterministic and stochastic settings. Specifically, in the deterministic setting, we prove a fixed-point theorem for seminorm-contractive operators, showing that iterates converge geometrically to the kernel of the seminorm. In the stochastic setting, we analyze the corresponding stochastic approximation (SA) algorithm under seminorm-contractive operators and Markovian noise, providing a finite-sample analysis for various stepsize choices.
A benchmark for equation solving is linear systems of equations, where the convergence behavior of fixed-point iteration is closely tied to the stability of linear dynamical systems. In this special case, our results provide a complete characterization of system stability with respect to a seminorm, linking it to the solution of a Lyapunov equation in terms of positive semi-definite matrices. In the stochastic setting, we establish a finite-sample analysis for linear Markovian SA without requiring the Hurwitzness assumption.
Our theoretical results offer a unified framework for deriving finite-sample bounds for various reinforcement learning algorithms in the average reward setting, including TD($λ$) for policy evaluation (which is a special case of solving a Poisson equation) and Q-learning for control.
△ Less
Submitted 19 February, 2025;
originally announced February 2025.
-
Probing the self-coherence of primordial quantum fluctuations with complexity
Authors:
Arpan Bhattacharya,
Suddhasattwa Brahma,
S. Shajidul Haque,
Jacob S. Lund,
Arpon Paul
Abstract:
A smoking gun for our current paradigm of the early universe would be direct evidence for the quantum mechanical origin of density perturbations which are conjectured to seed the large scale structure of our universe. A recently-proposed novel phenomenon is that of \textit{recoherence}, wherein a specific interaction between the adiabatic and the entropic sector leads to the adiabatic mode retaini…
▽ More
A smoking gun for our current paradigm of the early universe would be direct evidence for the quantum mechanical origin of density perturbations which are conjectured to seed the large scale structure of our universe. A recently-proposed novel phenomenon is that of \textit{recoherence}, wherein a specific interaction between the adiabatic and the entropic sector leads to the adiabatic mode retaining a coherent state after a transient increase in linear entropy. In this paper, we choose the most general Gaussian action and analyze the evolution of linear entropy, complexity of purification (COP), and complexity of formation (COF) to capture the interplay between decoherence and recoherence in this model. In the presence of two types of couplings that drive these two opposing characteristics, we highlight how COF is an efficient tool for diagnosing dynamics for such an open quantum system.
△ Less
Submitted 13 February, 2025;
originally announced February 2025.
-
Accurately Estimating Unreported Infections using Information Theory
Authors:
Jiaming Cui,
Bijaya Adhikari,
Arash Haddadan,
A S M Ahsan-Ul Haque,
Jilles Vreeken,
Anil Vullikanti,
B. Aditya Prakash
Abstract:
One of the most significant challenges in combating against the spread of infectious diseases was the difficulty in estimating the true magnitude of infections. Unreported infections could drive up disease spread, making it very hard to accurately estimate the infectivity of the pathogen, therewith hampering our ability to react effectively. Despite the use of surveillance-based methods such as se…
▽ More
One of the most significant challenges in combating against the spread of infectious diseases was the difficulty in estimating the true magnitude of infections. Unreported infections could drive up disease spread, making it very hard to accurately estimate the infectivity of the pathogen, therewith hampering our ability to react effectively. Despite the use of surveillance-based methods such as serological studies, identifying the true magnitude is still challenging. This paper proposes an information theoretic approach for accurately estimating the number of total infections. Our approach is built on top of Ordinary Differential Equations (ODE) based models, which are commonly used in epidemiology and for estimating such infections. We show how we can help such models to better compute the number of total infections and identify the parametrization by which we need the fewest bits to describe the observed dynamics of reported infections. Our experiments on COVID-19 spread show that our approach leads to not only substantially better estimates of the number of total infections but also better forecasts of infections than standard model calibration based methods. We additionally show how our learned parametrization helps in modeling more accurate what-if scenarios with non-pharmaceutical interventions. Our approach provides a general method for improving epidemic modeling which is applicable broadly.
△ Less
Submitted 26 January, 2025;
originally announced February 2025.
-
Nonresonant nonlinear magnonics in an antiferromagnet
Authors:
Gu-Feng Zhang,
Sheikh Rubaiat Ul Haque,
Kelson J. Kaj,
Xiang Chen,
Urban F. P. Seifert,
Jingdi Zhang,
Kevin A. Cremin,
Leon Balents,
Stephen D. Wilson,
Richard D. Averitt
Abstract:
Antiferromagnets exhibit rapid spin dynamics in a net zero magnetic background which enables novel spintronic applications and interrogation of many-body quantum phenomena. The layered antiferromagnet Sr$_2$IrO$_4$ hosts an exotic spin one-half Mott insulating state with an electronic gap arising from on-site Coulomb repulsion and strong spin-orbit coupling. This makes Sr$_2$IrO$_4$ an interesting…
▽ More
Antiferromagnets exhibit rapid spin dynamics in a net zero magnetic background which enables novel spintronic applications and interrogation of many-body quantum phenomena. The layered antiferromagnet Sr$_2$IrO$_4$ hosts an exotic spin one-half Mott insulating state with an electronic gap arising from on-site Coulomb repulsion and strong spin-orbit coupling. This makes Sr$_2$IrO$_4$ an interesting candidate to interrogate dynamical attributes of the magnetic order using ultrafast laser pulses. We investigate the magnetization dynamics of Sr$_2$IrO$_4$ following circularly-polarized photoexcitation with below-gap mid-infrared (mid-IR -- 9 $μm$) and above-gap near-infrared (near-IR -- 1.3 $μm$) pulses. In both cases, we observe excitation of a zone-center coherent magnon mode featuring a 0.5 THz oscillation in the pump-induced Kerr-rotation signal. However, only below-gap excitation exhibits a helicity dependent response and linear (quadratic) scaling of the coherent magnon amplitude with excitation fluence (electric field). Moreover, below-gap excitation has a magnon generation efficiency that is at least two orders of magnitude greater in comparison to above-gap excitation. Our analysis indicates that the helicity dependence and enhanced generation efficiency arises from a unique one-photon two-magnon coupling mechanism for magnon generation. Thus, preferential spin-photon coupling without photoexcitation of electrons permits extremely efficient magnon generation. Our results reveal new possibilities for ultrafast control of antiferromagnets.
△ Less
Submitted 15 November, 2024;
originally announced November 2024.
-
Translating current ALP photon coupling strength bounds to the Randall-Sundrum model
Authors:
Shihabul Haque,
Sourov Roy,
Soumitra SenGupta
Abstract:
In this article, we look at the current bounds on the coupling strength of axion-like particles (ALPs) with two photons in the context of the Randall-Sundrum (RS) model. We relate the coupling strength to the compactification radius that governs the size of the extra dimension in the RS warped geometry model and show how the current bounds on the ALP can be used to derive appropriate constraints o…
▽ More
In this article, we look at the current bounds on the coupling strength of axion-like particles (ALPs) with two photons in the context of the Randall-Sundrum (RS) model. We relate the coupling strength to the compactification radius that governs the size of the extra dimension in the RS warped geometry model and show how the current bounds on the ALP can be used to derive appropriate constraints on the size of the extra fifth dimension in the RS model. We show that the resulting constraints fail to resolve the gauge hierarchy problem for light/ultralight ALPs and require a massive ALP of at least $m_{a} \gtrsim 0.1$ [GeV] to be relevant in the context of the hierarchy problem when the gauge field is in the bulk.
△ Less
Submitted 18 November, 2024; v1 submitted 13 November, 2024;
originally announced November 2024.
-
Global well-posedness and equicontinuity for mKdV in modulation spaces
Authors:
Saikatul Haque,
Rowan Killip,
Monica Visan,
Yunfeng Zhang
Abstract:
We establish global well-posedness for both the defocusing and focusing complex-valued modified Korteweg--de Vries equations on the real line in modulation spaces $M_p^{s,2}(\mathbb{R})$, for all $1\leq p<\infty$ and $0\leq s<3/2-1/p$. We will also show that such solutions admit global-in-time bounds in these spaces and that equicontinuous sets of initial data lead to equicontinuous ensembles of o…
▽ More
We establish global well-posedness for both the defocusing and focusing complex-valued modified Korteweg--de Vries equations on the real line in modulation spaces $M_p^{s,2}(\mathbb{R})$, for all $1\leq p<\infty$ and $0\leq s<3/2-1/p$. We will also show that such solutions admit global-in-time bounds in these spaces and that equicontinuous sets of initial data lead to equicontinuous ensembles of orbits. Indeed, such information forms a crucial part of our well-posedness argument.
△ Less
Submitted 7 November, 2024;
originally announced November 2024.
-
ViTally Consistent: Scaling Biological Representation Learning for Cell Microscopy
Authors:
Kian Kenyon-Dean,
Zitong Jerry Wang,
John Urbanik,
Konstantin Donhauser,
Jason Hartford,
Saber Saberian,
Nil Sahin,
Ihab Bendidi,
Safiye Celik,
Marta Fay,
Juan Sebastian Rodriguez Vera,
Imran S Haque,
Oren Kraus
Abstract:
Large-scale cell microscopy screens are used in drug discovery and molecular biology research to study the effects of millions of chemical and genetic perturbations on cells. To use these images in downstream analysis, we need models that can map each image into a feature space that represents diverse biological phenotypes consistently, in the sense that perturbations with similar biological effec…
▽ More
Large-scale cell microscopy screens are used in drug discovery and molecular biology research to study the effects of millions of chemical and genetic perturbations on cells. To use these images in downstream analysis, we need models that can map each image into a feature space that represents diverse biological phenotypes consistently, in the sense that perturbations with similar biological effects have similar representations. In this work, we present the largest foundation model for cell microscopy data to date, a new 1.9 billion-parameter ViT-G/8 MAE trained on over 8 billion microscopy image crops. Compared to a previous published ViT-L/8 MAE, our new model achieves a 60% improvement in linear separability of genetic perturbations and obtains the best overall performance on whole-genome biological relationship recall and replicate consistency benchmarks. Beyond scaling, we developed two key methods that improve performance: (1) training on a curated and diverse dataset; and, (2) using biologically motivated linear probing tasks to search across each transformer block for the best candidate representation of whole-genome screens. We find that many self-supervised vision transformers, pretrained on either natural or microscopy images, yield significantly more biologically meaningful representations of microscopy images in their intermediate blocks than in their typically used final blocks. More broadly, our approach and results provide insights toward a general strategy for successfully building foundation models for large-scale biological data.
△ Less
Submitted 4 November, 2024;
originally announced November 2024.
-
Detection of Dark Matter using levitated nanoparticles within a Bessel-Gaussian beam via Yukawa coupling
Authors:
Iftekher S. Chowdhury,
Binay Prakash Akhouri,
Shah Haque,
Martin H. Bacci,
Eric Howard
Abstract:
We present a novel experimental approach to detect dark matter by probing Yukawa interactions, commonly referred to as a fifth force, between dark matter and baryonic matter. Our method involves optically levitating nanoparticles within a Bessel-Gaussian beam to detect minute forces exerted by potential dark matter interaction with test masses. The non-diffracting properties of Bessel-Gaussian bea…
▽ More
We present a novel experimental approach to detect dark matter by probing Yukawa interactions, commonly referred to as a fifth force, between dark matter and baryonic matter. Our method involves optically levitating nanoparticles within a Bessel-Gaussian beam to detect minute forces exerted by potential dark matter interaction with test masses. The non-diffracting properties of Bessel-Gaussian beams, combined with feedback cooling techniques, provide exceptional sensitivity to small perturbations in the motion of the nanoparticles. This setup allows for precise control over trapping conditions and enhances the detection sensitivity to forces on the order of \(10^{-18}\) N. We explore the parameter space of the Yukawa interaction, focusing on the coupling strength (\(α\)) and interaction range (\(λ\)), and discuss the potential of this experiment to place new constraints on dark matter couplings, complementing existing direct detection methods.
△ Less
Submitted 29 October, 2024;
originally announced October 2024.
-
Stochastic Approximation with Unbounded Markovian Noise: A General-Purpose Theorem
Authors:
Shaan Ul Haque,
Siva Theja Maguluri
Abstract:
Motivated by engineering applications such as resource allocation in networks and inventory systems, we consider average-reward Reinforcement Learning with unbounded state space and reward function. Recent works studied this problem in the actor-critic framework and established finite sample bounds assuming access to a critic with certain error guarantees. We complement their work by studying Temp…
▽ More
Motivated by engineering applications such as resource allocation in networks and inventory systems, we consider average-reward Reinforcement Learning with unbounded state space and reward function. Recent works studied this problem in the actor-critic framework and established finite sample bounds assuming access to a critic with certain error guarantees. We complement their work by studying Temporal Difference (TD) learning with linear function approximation and establishing finite-time bounds with the optimal $\mathcal{O}\left(1/ε^2\right)$ sample complexity. These results are obtained using the following general-purpose theorem for non-linear Stochastic Approximation (SA).
Suppose that one constructs a Lyapunov function for a non-linear SA with certain drift condition. Then, our theorem establishes finite-time bounds when this SA is driven by unbounded Markovian noise under suitable conditions. It serves as a black box tool to generalize sample guarantees on SA from i.i.d. or martingale difference case to potentially unbounded Markovian noise. The generality and the mild assumption of the setup enables broad applicability of our theorem. We illustrate its power by studying two more systems: (i) We improve upon the finite-time bounds of $Q$-learning by tightening the error bounds and also allowing for a larger class of behavior policies. (ii) We establish the first ever finite-time bounds for distributed stochastic optimization of high-dimensional smooth strongly convex function using cyclic block coordinate descent.
△ Less
Submitted 28 October, 2024;
originally announced October 2024.
-
Near Exact Privacy Amplification for Matrix Mechanisms
Authors:
Christopher A. Choquette-Choo,
Arun Ganesh,
Saminul Haque,
Thomas Steinke,
Abhradeep Thakurta
Abstract:
We study the problem of computing the privacy parameters for DP machine learning when using privacy amplification via random batching and noise correlated across rounds via a correlation matrix $\textbf{C}$ (i.e., the matrix mechanism). Past work on this problem either only applied to banded $\textbf{C}$, or gave loose privacy parameters. In this work, we give a framework for computing near-exact…
▽ More
We study the problem of computing the privacy parameters for DP machine learning when using privacy amplification via random batching and noise correlated across rounds via a correlation matrix $\textbf{C}$ (i.e., the matrix mechanism). Past work on this problem either only applied to banded $\textbf{C}$, or gave loose privacy parameters. In this work, we give a framework for computing near-exact privacy parameters for any lower-triangular, non-negative $\textbf{C}$. Our framework allows us to optimize the correlation matrix $\textbf{C}$ while accounting for amplification, whereas past work could not. Empirically, we show this lets us achieve smaller RMSE on prefix sums than the previous state-of-the-art (SOTA). We also show that we can improve on the SOTA performance on deep learning tasks. Our two main technical tools are (i) using Monte Carlo accounting to bypass composition, which was the main technical challenge for past work, and (ii) a "balls-in-bins" batching scheme that enables easy privacy analysis and is closer to practical random batching than Poisson sampling.
△ Less
Submitted 20 March, 2025; v1 submitted 8 October, 2024;
originally announced October 2024.
-
Mutual information and correlation measures in holographic RG flows
Authors:
Iftekher S. Chowdhury,
Binay Prakash Akhouri,
Shah Haque,
Eric Howard
Abstract:
This paper investigates the behavior of mutual information, entanglement negativity, and multipartite correlations in holographic RG flows, particularly during phase transitions. Mutual information provides a UV-finite measure of total correlations between subsystems, while entanglement negativity and multipartite correlations offer finer insights into quantum structures, especially near critical…
▽ More
This paper investigates the behavior of mutual information, entanglement negativity, and multipartite correlations in holographic RG flows, particularly during phase transitions. Mutual information provides a UV-finite measure of total correlations between subsystems, while entanglement negativity and multipartite correlations offer finer insights into quantum structures, especially near critical points. Through numerical simulations, we show that while mutual information remains relatively smooth, both entanglement negativity and multipartite correlations exhibit sharp changes near phase transitions. These results support the hypothesis that multipartite correlations play a dominant role in signaling critical phenomena in strongly coupled quantum systems.
△ Less
Submitted 6 October, 2024;
originally announced October 2024.
-
Finite-temperature CFT in Rindler Vacuum
Authors:
Iftekher S. Chowdhury,
Binay Prakash Akhouri,
Shah Haque,
Eric Howard
Abstract:
This paper investigates the finite-temperature behavior of Conformal Field Theory (CFT) in Rindler vacuum, focusing on the relation between acceleration and thermality in quantum field theory. We illustrate how uniformly accelerated observers perceive the vacuum as a thermal state via Unruh effect, shedding light on the thermal properties of Rindler horizon. Through numerical simulations of the he…
▽ More
This paper investigates the finite-temperature behavior of Conformal Field Theory (CFT) in Rindler vacuum, focusing on the relation between acceleration and thermality in quantum field theory. We illustrate how uniformly accelerated observers perceive the vacuum as a thermal state via Unruh effect, shedding light on the thermal properties of Rindler horizon. Through numerical simulations of the heat kernel, Unruh temperature, Planck distribution, and detector response, we demonstrate that acceleration enhances the thermal characteristics of quantum fields. These results provide important insights into horizon-induced thermality, with significant implications for black hole thermodynamics and quantum gravity.
△ Less
Submitted 30 September, 2024;
originally announced October 2024.
-
Autoencoder-based learning of Quantum phase transitions in the two-component Bose-Hubbard model
Authors:
Iftekher S. Chowdhury,
Binay Prakash Akhouri,
Shah Haque,
Eric Howard
Abstract:
This paper investigates the use of autoencoders and machine learning methods for detecting and analyzing quantum phase transitions in the Two-Component Bose-Hubbard Model. By leveraging deep learning models such as autoencoders, we investigate latent space representations, reconstruction error analysis, and cluster distance calculations to identify phase boundaries and critical points. The study i…
▽ More
This paper investigates the use of autoencoders and machine learning methods for detecting and analyzing quantum phase transitions in the Two-Component Bose-Hubbard Model. By leveraging deep learning models such as autoencoders, we investigate latent space representations, reconstruction error analysis, and cluster distance calculations to identify phase boundaries and critical points. The study is supplemented by dimensionality reduction techniques such as PCA and t-SNE for latent space visualization. The results demonstrate the potential of autoencoders to describe the dynamics of quantum phase transitions.
△ Less
Submitted 27 September, 2024;
originally announced September 2024.
-
Inferring Alt-text For UI Icons With Large Language Models During App Development
Authors:
Sabrina Haque,
Christoph Csallner
Abstract:
Ensuring accessibility in mobile applications remains a significant challenge, particularly for visually impaired users who rely on screen readers. User interface icons are essential for navigation and interaction and often lack meaningful alt-text, creating barriers to effective use. Traditional deep learning approaches for generating alt-text require extensive datasets and struggle with the dive…
▽ More
Ensuring accessibility in mobile applications remains a significant challenge, particularly for visually impaired users who rely on screen readers. User interface icons are essential for navigation and interaction and often lack meaningful alt-text, creating barriers to effective use. Traditional deep learning approaches for generating alt-text require extensive datasets and struggle with the diversity and imbalance of icon types. More recent Vision Language Models (VLMs) require complete UI screens, which can be impractical during the iterative phases of app development. To address these issues, we introduce a novel method using Large Language Models (LLMs) to autonomously generate informative alt-text for mobile UI icons with partial UI data. By incorporating icon context, that include class, resource ID, bounds, OCR-detected text, and contextual information from parent and sibling nodes, we fine-tune an off-the-shelf LLM on a small dataset of approximately 1.4k icons, yielding IconDesc. In an empirical evaluation and a user study IconDesc demonstrates significant improvements in generating relevant alt-text. This ability makes IconDesc an invaluable tool for developers, aiding in the rapid iteration and enhancement of UI accessibility.
△ Less
Submitted 7 October, 2024; v1 submitted 26 September, 2024;
originally announced September 2024.
-
Simulating black hole quantum dynamics on an optical lattice using the complex Sachdev-Ye-Kitaev model
Authors:
Iftekher S. Chowdhury,
Binay Prakash Akhouri,
Shah Haque,
Martin H. Bacci,
Eric Howard
Abstract:
We propose a low energy model for simulating an analog black hole on an optical lattice using ultracold atoms. Assuming the validity of the holographic principle, we employ the Sachdev-Ye-Kitaev (SYK) model, which describes a system of randomly infinite range interacting fermions, also conjectured to be an exactly solvable UV-complete model for an extremal black hole in a higher dimensional Anti-d…
▽ More
We propose a low energy model for simulating an analog black hole on an optical lattice using ultracold atoms. Assuming the validity of the holographic principle, we employ the Sachdev-Ye-Kitaev (SYK) model, which describes a system of randomly infinite range interacting fermions, also conjectured to be an exactly solvable UV-complete model for an extremal black hole in a higher dimensional Anti-de Sitter (AdS) dilaton gravity. At low energies, the SYK model exhibits an emergent conformal symmetry and is dual to the extremal black hole solution in near AdS2 spacetime. Furthermore, we show how the SYK maximally chaotic behaviour at large N limit, found to be dual to a gauge theory in higher dimensions, can also be employed as a non-trivial investigation tool for the holographic principle. The proposed setup is a theoretical platform to realize the SYK model with relevant exotic effects and behaviour at low energies as a highly non-trivial example of the AdS/CFT duality and a framework for studying black holes.
△ Less
Submitted 24 September, 2024;
originally announced September 2024.
-
On the Hardy-Hénon heat equation with an inverse square potential
Authors:
Divyang G. Bhimani,
Saikatul Haque,
Masahiro Ikeda
Abstract:
We study Cauchy problem for the Hardy-Hénon parabolic equation with an inverse square potential, namely, \[\partial_tu -Δu+a|x|^{-2} u= |x|^γ F_α(u),\] where $a\ge-(\frac{d-2}{2})^2,$ $γ\in \mathbb R$, $α>1$ and $F_α(u)=μ|u|^{α-1}u, μ|u|^α$ or $μu^α$, $μ\in \{-1,0,1\}$. We establish sharp fixed time-time decay estimates for heat semigroups $e^{-t (-Δ+ a|x|^{-2})}$ in weighted Lebesgue spaces, whic…
▽ More
We study Cauchy problem for the Hardy-Hénon parabolic equation with an inverse square potential, namely, \[\partial_tu -Δu+a|x|^{-2} u= |x|^γ F_α(u),\] where $a\ge-(\frac{d-2}{2})^2,$ $γ\in \mathbb R$, $α>1$ and $F_α(u)=μ|u|^{α-1}u, μ|u|^α$ or $μu^α$, $μ\in \{-1,0,1\}$. We establish sharp fixed time-time decay estimates for heat semigroups $e^{-t (-Δ+ a|x|^{-2})}$ in weighted Lebesgue spaces, which is of independent interest. As an application, we establish:
$\bullet$ Local well-posedness (LWP) in scale subcritical and critical weighted Lebesgue spaces.
$\bullet$ Small data global existence in critical weighted Lebesgue spaces.
$\bullet$ Under certain conditions on $γ$ and $α,$ we show that local solution cannot be extended to global one for certain initial data in the subcritical regime. Thus, finite time blow-up in the subcritical Lebesgue space norm is exhibited.
$\bullet$ We also demonstrate nonexistence of local positive weak solution (and hence failure of LWP) in supercritical case for $α>1+\frac{2+γ}{d}$ the Fujita exponent. This indicates that subcriticality or criticality are necessary in the first point above.
In summary, we establish a sharp dissipative estimate and addresses short and long time behaviors of solutions. In particular, we complement several classical results and shed new light on the dynamics of the considered equation.
△ Less
Submitted 17 July, 2024;
originally announced July 2024.
-
Harvesting Private Medical Images in Federated Learning Systems with Crafted Models
Authors:
Shanghao Shi,
Md Shahedul Haque,
Abhijeet Parida,
Marius George Linguraru,
Y. Thomas Hou,
Syed Muhammad Anwar,
Wenjing Lou
Abstract:
Federated learning (FL) allows a set of clients to collaboratively train a machine-learning model without exposing local training samples. In this context, it is considered to be privacy-preserving and hence has been adopted by medical centers to train machine-learning models over private data. However, in this paper, we propose a novel attack named MediLeak that enables a malicious parameter serv…
▽ More
Federated learning (FL) allows a set of clients to collaboratively train a machine-learning model without exposing local training samples. In this context, it is considered to be privacy-preserving and hence has been adopted by medical centers to train machine-learning models over private data. However, in this paper, we propose a novel attack named MediLeak that enables a malicious parameter server to recover high-fidelity patient images from the model updates uploaded by the clients. MediLeak requires the server to generate an adversarial model by adding a crafted module in front of the original model architecture. It is published to the clients in the regular FL training process and each client conducts local training on it to generate corresponding model updates. Then, based on the FL protocol, the model updates are sent back to the server and our proposed analytical method recovers private data from the parameter updates of the crafted module. We provide a comprehensive analysis for MediLeak and show that it can successfully break the state-of-the-art cryptographic secure aggregation protocols, designed to protect the FL systems from privacy inference attacks. We implement MediLeak on the MedMNIST and COVIDx CXR-4 datasets. The results show that MediLeak can nearly perfectly recover private images with high recovery rates and quantitative scores. We further perform downstream tasks such as disease classification with the recovered data, where our results show no significant performance degradation compared to using the original training samples.
△ Less
Submitted 13 July, 2024;
originally announced July 2024.
-
Universal Early-Time Growth in Quantum Circuit Complexity
Authors:
S. Shajidul Haque,
Ghadir Jafari,
Bret Underwood
Abstract:
We show that quantum circuit complexity for the unitary time evolution operator of any time-independent Hamiltonian is bounded by linear growth at early times, independent of any choices of the fundamental gates or cost metric. Deviations from linear early-time growth arise from the commutation algebra of the gates and are manifestly negative for any circuit, decreasing the linear growth rate and…
▽ More
We show that quantum circuit complexity for the unitary time evolution operator of any time-independent Hamiltonian is bounded by linear growth at early times, independent of any choices of the fundamental gates or cost metric. Deviations from linear early-time growth arise from the commutation algebra of the gates and are manifestly negative for any circuit, decreasing the linear growth rate and leading to a bound on the growth rate of complexity of a circuit at early times. We illustrate this general result by applying it to qubit and harmonic oscillator systems, including the coupled and anharmonic oscillator. By discretizing free and interacting scalar field theories on a lattice, we are also able to extract the early-time behavior and dependence on the lattice spacing of complexity of these field theories in the continuum limit, demonstrating how this approach applies to systems that have been previously difficult to study using existing techniques for quantum circuit complexity.
△ Less
Submitted 12 October, 2024; v1 submitted 18 June, 2024;
originally announced June 2024.
-
Spread Complexity of High Energy Neutrino Propagation over Astrophysical Distances
Authors:
Khushboo Dixit,
S. Shajidul Haque,
Soebur Razzaque
Abstract:
Spread complexity measures the minimized spread of quantum states over all choices of basis. It generalizes Krylov operator complexity to quantum states under continuous Hamiltonian evolution. In this paper, we study spread complexity in the context of high-energy astrophysical neutrinos and propose a new flavor ratio based on complexity. Our findings indicate that our proposal might favor an init…
▽ More
Spread complexity measures the minimized spread of quantum states over all choices of basis. It generalizes Krylov operator complexity to quantum states under continuous Hamiltonian evolution. In this paper, we study spread complexity in the context of high-energy astrophysical neutrinos and propose a new flavor ratio based on complexity. Our findings indicate that our proposal might favor an initial ratio of fluxes as $φ_{ν_e}^0: φ_{ν_μ}^0: φ_{ν_τ}^0 = 1:0:0$ over a more generally expected ratio of $1:2:0$, when the IceCube neutrino observatory achieves its projected sensitivity to discriminate between flavors. Additionally, complexity-based definitions of flavor ratios exhibit a slight but nonzero sensitivity to the neutrino mass ordering, which traditional flavor ratios cannot capture.
△ Less
Submitted 24 December, 2024; v1 submitted 11 June, 2024;
originally announced June 2024.
-
Short-Term Electricity Demand Forecasting of Dhaka City Using CNN with Stacked BiLSTM
Authors:
Kazi Fuad Bin Akhter,
Sadia Mobasshira,
Saief Nowaz Haque,
Mahjub Alam Khan Hesham,
Tanvir Ahmed
Abstract:
The precise forecasting of electricity demand also referred to as load forecasting, is essential for both planning and managing a power system. It is crucial for many tasks, including choosing which power units to commit to, making plans for future power generation capacity, enhancing the power network, and controlling electricity consumption. As Bangladesh is a developing country, the electricity…
▽ More
The precise forecasting of electricity demand also referred to as load forecasting, is essential for both planning and managing a power system. It is crucial for many tasks, including choosing which power units to commit to, making plans for future power generation capacity, enhancing the power network, and controlling electricity consumption. As Bangladesh is a developing country, the electricity infrastructure is critical for economic growth and employment in this country. Accurate forecasting of electricity demand is crucial for ensuring that this country has a reliable and sustainable electricity supply to meet the needs of its growing population and economy. The complex and nonlinear behavior of such energy systems inhibits the creation of precise algorithms. Within this context, this paper aims to propose a hybrid model of Convolutional Neural Network (CNN) and stacked Bidirectional Long-short Term Memory (BiLSTM) architecture to perform an accurate short-term forecast of the electricity demand of Dhaka city. Short-term forecasting is ordinarily done to anticipate load for the following few hours to a few weeks. Normalization techniques have been also investigated because of the sensitivity of these models towards the input range. The proposed approach produced the best prediction results in comparison to the other benchmark models (LSTM, CNN- BiLSTM and CNN-LSTM) used in the study, with MAPE 1.64%, MSE 0.015, RMSE 0.122 and MAE 0.092. The result of the proposed model also outperformed some of the existing works on load-forecasting.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Ultrafast Broadband Strong-Field Tunnelling in Asymmetric Nanogaps for Time-Resolved Nanoscopy
Authors:
Haoqing Ning,
Marios Maimaris,
Jiewen Wei,
Emilie Gérouville,
Evangelos Moutoulas,
Zhu Meng,
Clement Ferchaud,
Dmitry Maslennikov,
Navendu Mondal,
Tong Wang,
Colin Chow,
Aleksandar P. Ivanov,
Joshua B. Edel,
Saif A. Haque,
Misha Ivanov,
Jon P. Marangos,
Dimitra G. Georgiadou,
Artem A. Bakulin
Abstract:
Femtosecond-fast and nanometre-size pulses of electrons are emerging as unique probes for ultrafast dynamics at the nanoscale. Presently, such pulses are achievable only in highly sophisticated ultrafast electron microscopes or equally complex setups involving few-cycle-pulsed lasers with stable carrier-envelope phase (CEP) and nanotip probes. Here, we show that the generation of femtosecond pulse…
▽ More
Femtosecond-fast and nanometre-size pulses of electrons are emerging as unique probes for ultrafast dynamics at the nanoscale. Presently, such pulses are achievable only in highly sophisticated ultrafast electron microscopes or equally complex setups involving few-cycle-pulsed lasers with stable carrier-envelope phase (CEP) and nanotip probes. Here, we show that the generation of femtosecond pulses of nanoscale tunnelling electrons can be achieved in any ultrafast optical laboratory, using any (deep-UV to mid-IR) femtosecond laser in combination with photosensitive asymmetric nanogap (PAN) diodes fabricated via easy-to-scale adhesion lithography. The dominant mechanism producing tunnelling electrons in PANs is strong-field emission, which is easily achievable without CEP locking or external bias voltage. We employ PANs to demonstrate ultrafast nanoscopy of metal-halide perovskite quantum dots immobilised inside a 10-nm Al/Au nanogap and to characterise laser pulses across the entire optical region (266-6700 nm). Short electron pulses in PANs open the way towards scalable on-chip femtosecond electron measurements and novel design approaches for integrated ultrafast sensing nanodevices.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Cosmological Singularity and Power-Law Solutions in Modified Gravity
Authors:
Saurya Das,
S. Shajidul Haque,
Seturumane Tema
Abstract:
A bouncing Universe avoids the big-bang singularity. Using the time-like and null Raychaudhhuri equations, we explore whether the bounce near the big-bang, within a broad spectrum of modified theories of gravity, allows for cosmologically relevant power-law solutions under reasonable physical conditions. Our study shows that certain modified theories of gravity, such as Stelle gravity, do not demo…
▽ More
A bouncing Universe avoids the big-bang singularity. Using the time-like and null Raychaudhhuri equations, we explore whether the bounce near the big-bang, within a broad spectrum of modified theories of gravity, allows for cosmologically relevant power-law solutions under reasonable physical conditions. Our study shows that certain modified theories of gravity, such as Stelle gravity, do not demonstrate singularity resolution under any reasonable conditions, while others including $f(R)$ gravity and Brans-Dicke theory can demonstrate singularity resolution under suitable conditions. For these theories, we show that the accelerating solution is slightly favoured over ekypyrosis.
△ Less
Submitted 30 August, 2024; v1 submitted 15 May, 2024;
originally announced May 2024.
-
A Survey on Multimodal Wearable Sensor-based Human Action Recognition
Authors:
Jianyuan Ni,
Hao Tang,
Syed Tousiful Haque,
Yan Yan,
Anne H. H. Ngu
Abstract:
The combination of increased life expectancy and falling birth rates is resulting in an aging population. Wearable Sensor-based Human Activity Recognition (WSHAR) emerges as a promising assistive technology to support the daily lives of older individuals, unlocking vast potential for human-centric applications. However, recent surveys in WSHAR have been limited, focusing either solely on deep lear…
▽ More
The combination of increased life expectancy and falling birth rates is resulting in an aging population. Wearable Sensor-based Human Activity Recognition (WSHAR) emerges as a promising assistive technology to support the daily lives of older individuals, unlocking vast potential for human-centric applications. However, recent surveys in WSHAR have been limited, focusing either solely on deep learning approaches or on a single sensor modality. In real life, our human interact with the world in a multi-sensory way, where diverse information sources are intricately processed and interpreted to accomplish a complex and unified sensing system. To give machines similar intelligence, multimodal machine learning, which merges data from various sources, has become a popular research area with recent advancements. In this study, we present a comprehensive survey from a novel perspective on how to leverage multimodal learning to WSHAR domain for newcomers and researchers. We begin by presenting the recent sensor modalities as well as deep learning approaches in HAR. Subsequently, we explore the techniques used in present multimodal systems for WSHAR. This includes inter-multimodal systems which utilize sensor modalities from both visual and non-visual systems and intra-multimodal systems that simply take modalities from non-visual systems. After that, we focus on current multimodal learning approaches that have applied to solve some of the challenges existing in WSHAR. Specifically, we make extra efforts by connecting the existing multimodal literature from other domains, such as computer vision and natural language processing, with current WSHAR area. Finally, we identify the corresponding challenges and potential research direction in current WSHAR area for further improvement.
△ Less
Submitted 14 April, 2024;
originally announced April 2024.
-
An information-theoretic lower bound in time-uniform estimation
Authors:
John C. Duchi,
Saminul Haque
Abstract:
We present an information-theoretic lower bound for the problem of parameter estimation with time-uniform coverage guarantees. Via a new a reduction to sequential testing, we obtain stronger lower bounds that capture the hardness of the time-uniform setting. In the case of location model estimation, logistic regression, and exponential family models, our $Ω(\sqrt{n^{-1}\log \log n})$ lower bound i…
▽ More
We present an information-theoretic lower bound for the problem of parameter estimation with time-uniform coverage guarantees. Via a new a reduction to sequential testing, we obtain stronger lower bounds that capture the hardness of the time-uniform setting. In the case of location model estimation, logistic regression, and exponential family models, our $Ω(\sqrt{n^{-1}\log \log n})$ lower bound is sharp to within constant factors in typical settings.
△ Less
Submitted 11 June, 2024; v1 submitted 13 February, 2024;
originally announced February 2024.
-
Location Agnostic Adaptive Rain Precipitation Prediction using Deep Learning
Authors:
Md Shazid Islam,
Md Saydur Rahman,
Md Saad Ul Haque,
Farhana Akter Tumpa,
Md Sanzid Bin Hossain,
Abul Al Arabi
Abstract:
Rain precipitation prediction is a challenging task as it depends on weather and meteorological features which vary from location to location. As a result, a prediction model that performs well at one location does not perform well at other locations due to the distribution shifts. In addition, due to global warming, the weather patterns are changing very rapidly year by year which creates the pos…
▽ More
Rain precipitation prediction is a challenging task as it depends on weather and meteorological features which vary from location to location. As a result, a prediction model that performs well at one location does not perform well at other locations due to the distribution shifts. In addition, due to global warming, the weather patterns are changing very rapidly year by year which creates the possibility of ineffectiveness of those models even at the same location as time passes. In our work, we have proposed an adaptive deep learning-based framework in order to provide a solution to the aforementioned challenges. Our method can generalize the model for the prediction of precipitation for any location where the methods without adaptation fail. Our method has shown 43.51%, 5.09%, and 38.62% improvement after adaptation using a deep neural network for predicting the precipitation of Paris, Los Angeles, and Tokyo, respectively.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Comparative Evaluation of Weather Forecasting using Machine Learning Models
Authors:
Md Saydur Rahman,
Farhana Akter Tumpa,
Md Shazid Islam,
Abul Al Arabi,
Md Sanzid Bin Hossain,
Md Saad Ul Haque
Abstract:
Gaining a deeper understanding of weather and being able to predict its future conduct have always been considered important endeavors for the growth of our society. This research paper explores the advancements in understanding and predicting nature's behavior, particularly in the context of weather forecasting, through the application of machine learning algorithms. By leveraging the power of ma…
▽ More
Gaining a deeper understanding of weather and being able to predict its future conduct have always been considered important endeavors for the growth of our society. This research paper explores the advancements in understanding and predicting nature's behavior, particularly in the context of weather forecasting, through the application of machine learning algorithms. By leveraging the power of machine learning, data mining, and data analysis techniques, significant progress has been made in this field. This study focuses on analyzing the contributions of various machine learning algorithms in predicting precipitation and temperature patterns using a 20-year dataset from a single weather station in Dhaka city. Algorithms such as Gradient Boosting, AdaBoosting, Artificial Neural Network, Stacking Random Forest, Stacking Neural Network, and Stacking KNN are evaluated and compared based on their performance metrics, including Confusion matrix measurements. The findings highlight remarkable achievements and provide valuable insights into their performances and features correlation.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
The Early Universe as an Open Quantum System: Complexity and Decoherence
Authors:
Arpan Bhattacharyya,
Suddhasattwa Brahma,
S. Shajidul Haque,
Jacob S. Lund,
Arpon Paul
Abstract:
In this work, we extend previous results, demonstrating how complexity in an open quantum system can identify decoherence between two fields, even in the presence of an accelerating background. Using the curved-space Caldeira-Leggett two-field model in de Sitter as our toy model, we discover a distinctive feature in the growth of complexity of purification, providing an alternative diagnostic for…
▽ More
In this work, we extend previous results, demonstrating how complexity in an open quantum system can identify decoherence between two fields, even in the presence of an accelerating background. Using the curved-space Caldeira-Leggett two-field model in de Sitter as our toy model, we discover a distinctive feature in the growth of complexity of purification, providing an alternative diagnostic for studying decoherence when the adiabatic perturbation is coupled to a heavy field. This paper initiates a new pathway to explore the features of quantum complexity in an accelerating background, thereby expanding our understanding of the evolution of primordial cosmological perturbations in the early universe.
△ Less
Submitted 22 January, 2024;
originally announced January 2024.
-
Tight Finite Time Bounds of Two-Time-Scale Linear Stochastic Approximation with Markovian Noise
Authors:
Shaan Ul Haque,
Sajad Khodadadian,
Siva Theja Maguluri
Abstract:
Stochastic approximation (SA) is an iterative algorithm for finding the fixed point of an operator using noisy samples and widely used in optimization and Reinforcement Learning (RL). The noise in RL exhibits a Markovian structure, and in some cases, such as gradient temporal difference (GTD) methods, SA is employed in a two-time-scale framework. This combination introduces significant theoretical…
▽ More
Stochastic approximation (SA) is an iterative algorithm for finding the fixed point of an operator using noisy samples and widely used in optimization and Reinforcement Learning (RL). The noise in RL exhibits a Markovian structure, and in some cases, such as gradient temporal difference (GTD) methods, SA is employed in a two-time-scale framework. This combination introduces significant theoretical challenges for analysis.
We derive an upper bound on the error for the iterations of linear two-time-scale SA with Markovian noise. We demonstrate that the mean squared error decreases as $trace (Σ^y)/k + o(1/k)$ where $k$ is the number of iterates, and $Σ^y$ is an appropriately defined covariance matrix. A key feature of our bounds is that the leading term, $Σ^y$, exactly matches with the covariance in the Central Limit Theorem (CLT) for the two-time-scale SA, and we call them tight finite-time bounds. We illustrate their use in RL by establishing sample complexity for off-policy algorithms, TDC, GTD, and GTD2.
A special case of linear two-time-scale SA that is extensively studied is linear SA with Polyak-Ruppert averaging. We present tight finite time bounds corresponding to the covariance matrix of the CLT. Such bounds can be used to study TD-learning with Polyak-Ruppert averaging.
△ Less
Submitted 11 May, 2025; v1 submitted 30 December, 2023;
originally announced January 2024.
-
Complexity and Operator Growth for Quantum Systems in Dynamic Equilibrium
Authors:
Cameron Beetar,
Nitin Gupta,
S. Shajidul Haque,
Jeff Murugan,
Hendrik J R Van Zyl
Abstract:
Krylov complexity is a measure of operator growth in quantum systems, based on the number of orthogonal basis vectors needed to approximate the time evolution of an operator. In this paper, we study the Krylov complexity of a $\mathsf{PT}$-symmetric system of oscillators, which exhibits two phase transitions that separate a dissipative state, a Rabi-oscillation state, and an ultra-strongly coupled…
▽ More
Krylov complexity is a measure of operator growth in quantum systems, based on the number of orthogonal basis vectors needed to approximate the time evolution of an operator. In this paper, we study the Krylov complexity of a $\mathsf{PT}$-symmetric system of oscillators, which exhibits two phase transitions that separate a dissipative state, a Rabi-oscillation state, and an ultra-strongly coupled regime. We use a generalization of the $su(1,1)$ algebra associated to the Bateman oscillator to describe the Hamiltonian of the coupled system, and construct a set of coherent states associated with this algebra. We compute the Krylov (spread) complexity using these coherent states, and find that it can distinguish between the $\mathsf{PT}$-symmetric and $\mathsf{PT}$ symmetry-broken phases. We also show that the Krylov complexity reveals the ill-defined nature of the vacuum of the Bateman oscillator, which is a special case of our system. Our results demonstrate the utility of Krylov complexity as a tool to probe the properties and transitions of $\mathsf{PT}$-symmetric systems.
△ Less
Submitted 25 December, 2023;
originally announced December 2023.
-
Multi-Agent Join
Authors:
Vahid Ghadakchi,
Mian Xie,
Arash Termehchy,
Bakhtiyar Doskenov,
Bharghav Srikhakollu,
Summit Haque,
Huazheng Wang
Abstract:
It is crucial to provide real-time performance in many applications, such as interactive and exploratory data analysis. In these settings, users often need to view subsets of query results quickly. It is challenging to deliver such results over large datasets for relational operators over multiple relations, such as join. Join algorithms usually spend a long time on scanning and attempting to join…
▽ More
It is crucial to provide real-time performance in many applications, such as interactive and exploratory data analysis. In these settings, users often need to view subsets of query results quickly. It is challenging to deliver such results over large datasets for relational operators over multiple relations, such as join. Join algorithms usually spend a long time on scanning and attempting to join parts of relations that may not generate any result. Current solutions usually require lengthy and repeated preprocessing, which is costly and may not be possible to do in many settings. Also, they often support restricted types of joins. In this paper, we outline a novel approach for achieving efficient join processing in which a scan operator of the join learns during query execution, the portions of its relations that might satisfy the join predicate. We further improve this method using an algorithm in which both scan operators collaboratively learn an efficient join execution strategy. We also show that this approach generalizes traditional and non-learning methods for joining. Our extensive empirical studies using standard benchmarks indicate that this approach outperforms similar methods considerably.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Boosting Stock Price Prediction with Anticipated Macro Policy Changes
Authors:
Md Sabbirul Haque,
Md Shahedul Amin,
Jonayet Miah,
Duc Minh Cao,
Ashiqul Haque Ahmed
Abstract:
Prediction of stock prices plays a significant role in aiding the decision-making of investors. Considering its importance, a growing literature has emerged trying to forecast stock prices with improved accuracy. In this study, we introduce an innovative approach for forecasting stock prices with greater accuracy. We incorporate external economic environment-related information along with stock pr…
▽ More
Prediction of stock prices plays a significant role in aiding the decision-making of investors. Considering its importance, a growing literature has emerged trying to forecast stock prices with improved accuracy. In this study, we introduce an innovative approach for forecasting stock prices with greater accuracy. We incorporate external economic environment-related information along with stock prices. In our novel approach, we improve the performance of stock price prediction by taking into account variations due to future expected macroeconomic policy changes as investors adjust their current behavior ahead of time based on expected future macroeconomic policy changes. Furthermore, we incorporate macroeconomic variables along with historical stock prices to make predictions. Results from this strongly support the inclusion of future economic policy changes along with current macroeconomic information. We confirm the supremacy of our method over the conventional approach using several tree-based machine-learning algorithms. Results are strongly conclusive across various machine learning models. Our preferred model outperforms the conventional approach with an RMSE value of 1.61 compared to an RMSE value of 1.75 from the conventional approach.
△ Less
Submitted 27 October, 2023;
originally announced November 2023.
-
Generative AI Model for Artistic Style Transfer Using Convolutional Neural Networks
Authors:
Jonayet Miah,
Duc M Cao,
Md Abu Sayed,
Md. Sabbirul Haque
Abstract:
Artistic style transfer, a captivating application of generative artificial intelligence, involves fusing the content of one image with the artistic style of another to create unique visual compositions. This paper presents a comprehensive overview of a novel technique for style transfer using Convolutional Neural Networks (CNNs). By leveraging deep image representations learned by CNNs, we demons…
▽ More
Artistic style transfer, a captivating application of generative artificial intelligence, involves fusing the content of one image with the artistic style of another to create unique visual compositions. This paper presents a comprehensive overview of a novel technique for style transfer using Convolutional Neural Networks (CNNs). By leveraging deep image representations learned by CNNs, we demonstrate how to separate and manipulate image content and style, enabling the synthesis of high-quality images that combine content and style in a harmonious manner. We describe the methodology, including content and style representations, loss computation, and optimization, and showcase experimental results highlighting the effectiveness and versatility of the approach across different styles and content
△ Less
Submitted 30 October, 2023; v1 submitted 27 October, 2023;
originally announced October 2023.
-
Advancing Brain Tumor Detection: A Thorough Investigation of CNNs, Clustering, and SoftMax Classification in the Analysis of MRI Images
Authors:
Jonayet Miah,
Duc M Cao,
Md Abu Sayed3,
Md Siam Taluckder,
Md Sabbirul Haque,
Fuad Mahmud
Abstract:
Brain tumors pose a significant global health challenge due to their high prevalence and mortality rates across all age groups. Detecting brain tumors at an early stage is crucial for effective treatment and patient outcomes. This study presents a comprehensive investigation into the use of Convolutional Neural Networks (CNNs) for brain tumor detection using Magnetic Resonance Imaging (MRI) images…
▽ More
Brain tumors pose a significant global health challenge due to their high prevalence and mortality rates across all age groups. Detecting brain tumors at an early stage is crucial for effective treatment and patient outcomes. This study presents a comprehensive investigation into the use of Convolutional Neural Networks (CNNs) for brain tumor detection using Magnetic Resonance Imaging (MRI) images. The dataset, consisting of MRI scans from both healthy individuals and patients with brain tumors, was processed and fed into the CNN architecture. The SoftMax Fully Connected layer was employed to classify the images, achieving an accuracy of 98%. To evaluate the CNN's performance, two other classifiers, Radial Basis Function (RBF) and Decision Tree (DT), were utilized, yielding accuracy rates of 98.24% and 95.64%, respectively. The study also introduced a clustering method for feature extraction, improving CNN's accuracy. Sensitivity, Specificity, and Precision were employed alongside accuracy to comprehensively evaluate the network's performance. Notably, the SoftMax classifier demonstrated the highest accuracy among the categorizers, achieving 99.52% accuracy on test data. The presented research contributes to the growing field of deep learning in medical image analysis. The combination of CNNs and MRI data offers a promising tool for accurately detecting brain tumors, with potential implications for early diagnosis and improved patient care.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
An Exploratory Study on Simulated Annealing for Feature Selection in Learning-to-Rank
Authors:
Mohd. Sayemul Haque,
Md. Fahim,
Muhammad Ibrahim
Abstract:
Learning-to-rank is an applied domain of supervised machine learning. As feature selection has been found to be effective for improving the accuracy of learning models in general, it is intriguing to investigate this process for learning-to-rank domain. In this study, we investigate the use of a popular meta-heuristic approach called simulated annealing for this task. Under the general framework o…
▽ More
Learning-to-rank is an applied domain of supervised machine learning. As feature selection has been found to be effective for improving the accuracy of learning models in general, it is intriguing to investigate this process for learning-to-rank domain. In this study, we investigate the use of a popular meta-heuristic approach called simulated annealing for this task. Under the general framework of simulated annealing, we explore various neighborhood selection strategies and temperature cooling schemes. We further introduce a new hyper-parameter called the progress parameter that can effectively be used to traverse the search space. Our algorithms are evaluated on five publicly benchmark datasets of learning-to-rank. For a better validation, we also compare the simulated annealing-based feature selection algorithm with another effective meta-heuristic algorithm, namely local beam search. Extensive experimental results shows the efficacy of our proposed models.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
Untargeted White-box Adversarial Attack with Heuristic Defence Methods in Real-time Deep Learning based Network Intrusion Detection System
Authors:
Khushnaseeb Roshan,
Aasim Zafar,
Sheikh Burhan Ul Haque
Abstract:
Network Intrusion Detection System (NIDS) is a key component in securing the computer network from various cyber security threats and network attacks. However, consider an unfortunate situation where the NIDS is itself attacked and vulnerable more specifically, we can say, How to defend the defender?. In Adversarial Machine Learning (AML), the malicious actors aim to fool the Machine Learning (ML)…
▽ More
Network Intrusion Detection System (NIDS) is a key component in securing the computer network from various cyber security threats and network attacks. However, consider an unfortunate situation where the NIDS is itself attacked and vulnerable more specifically, we can say, How to defend the defender?. In Adversarial Machine Learning (AML), the malicious actors aim to fool the Machine Learning (ML) and Deep Learning (DL) models to produce incorrect predictions with intentionally crafted adversarial examples. These adversarial perturbed examples have become the biggest vulnerability of ML and DL based systems and are major obstacles to their adoption in real-time and mission-critical applications such as NIDS. AML is an emerging research domain, and it has become a necessity for the in-depth study of adversarial attacks and their defence strategies to safeguard the computer network from various cyber security threads. In this research work, we aim to cover important aspects related to NIDS, adversarial attacks and its defence mechanism to increase the robustness of the ML and DL based NIDS. We implemented four powerful adversarial attack techniques, namely, Fast Gradient Sign Method (FGSM), Jacobian Saliency Map Attack (JSMA), Projected Gradient Descent (PGD) and Carlini & Wagner (C&W) in NIDS. We analyzed its performance in terms of various performance metrics in detail. Furthermore, the three heuristics defence strategies, i.e., Adversarial Training (AT), Gaussian Data Augmentation (GDA) and High Confidence (HC), are implemented to improve the NIDS robustness under adversarial attack situations. The complete workflow is demonstrated in real-time network with data packet flow. This research work provides the overall background for the researchers interested in AML and its implementation from a computer network security point of view.
△ Less
Submitted 7 October, 2023; v1 submitted 5 October, 2023;
originally announced October 2023.
-
Retail Demand Forecasting: A Comparative Study for Multivariate Time Series
Authors:
Md Sabbirul Haque,
Md Shahedul Amin,
Jonayet Miah
Abstract:
Accurate demand forecasting in the retail industry is a critical determinant of financial performance and supply chain efficiency. As global markets become increasingly interconnected, businesses are turning towards advanced prediction models to gain a competitive edge. However, existing literature mostly focuses on historical sales data and ignores the vital influence of macroeconomic conditions…
▽ More
Accurate demand forecasting in the retail industry is a critical determinant of financial performance and supply chain efficiency. As global markets become increasingly interconnected, businesses are turning towards advanced prediction models to gain a competitive edge. However, existing literature mostly focuses on historical sales data and ignores the vital influence of macroeconomic conditions on consumer spending behavior. In this study, we bridge this gap by enriching time series data of customer demand with macroeconomic variables, such as the Consumer Price Index (CPI), Index of Consumer Sentiment (ICS), and unemployment rates. Leveraging this comprehensive dataset, we develop and compare various regression and machine learning models to predict retail demand accurately.
△ Less
Submitted 23 August, 2023;
originally announced August 2023.
-
Prediction of Pneumonia and COVID-19 Using Deep Neural Networks
Authors:
M. S. Haque,
M. S. Taluckder,
S. B. Shawkat,
M. A. Shahriyar,
M. A. Sayed,
C. Modak
Abstract:
Pneumonia, caused by bacteria and viruses, is a rapidly spreading viral infection with global implications. Prompt identification of infected individuals is crucial for containing its transmission. This study explores the potential of medical image analysis to address this challenge. We propose machine-learning techniques for predicting Pneumonia from chest X-ray images. Chest X-ray imaging is vit…
▽ More
Pneumonia, caused by bacteria and viruses, is a rapidly spreading viral infection with global implications. Prompt identification of infected individuals is crucial for containing its transmission. This study explores the potential of medical image analysis to address this challenge. We propose machine-learning techniques for predicting Pneumonia from chest X-ray images. Chest X-ray imaging is vital for Pneumonia diagnosis due to its accessibility and cost-effectiveness. However, interpreting X-rays for Pneumonia detection can be complex, as radiographic features can overlap with other respiratory conditions. We evaluate the performance of different machine learning models, including DenseNet121, Inception Resnet-v2, Inception Resnet-v3, Resnet50, and Xception, using chest X-ray images of pneumonia patients. Performance measures and confusion matrices are employed to assess and compare the models. The findings reveal that DenseNet121 outperforms other models, achieving an accuracy rate of 99.58%. This study underscores the significance of machine learning in the accurate detection of Pneumonia, leveraging chest X-ray images. Our study offers insights into the potential of technology to mitigate the spread of pneumonia through precise diagnostics.
△ Less
Submitted 20 August, 2023;
originally announced August 2023.
-
A Novel Deep Learning based Model to Defend Network Intrusion Detection System against Adversarial Attacks
Authors:
Khushnaseeb Roshan,
Aasim Zafar,
Shiekh Burhan Ul Haque
Abstract:
Network Intrusion Detection System (NIDS) is an essential tool in securing cyberspace from a variety of security risks and unknown cyberattacks. A number of solutions have been implemented for Machine Learning (ML), and Deep Learning (DL) based NIDS. However, all these solutions are vulnerable to adversarial attacks, in which the malicious actor tries to evade or fool the model by injecting advers…
▽ More
Network Intrusion Detection System (NIDS) is an essential tool in securing cyberspace from a variety of security risks and unknown cyberattacks. A number of solutions have been implemented for Machine Learning (ML), and Deep Learning (DL) based NIDS. However, all these solutions are vulnerable to adversarial attacks, in which the malicious actor tries to evade or fool the model by injecting adversarial perturbed examples into the system. The main aim of this research work is to study powerful adversarial attack algorithms and their defence method on DL-based NIDS. Fast Gradient Sign Method (FGSM), Jacobian Saliency Map Attack (JSMA), Projected Gradient Descent (PGD) and Carlini & Wagner (C&W) are four powerful adversarial attack methods implemented against the NIDS. As a defence method, Adversarial Training is used to increase the robustness of the NIDS model. The results are summarized in three phases, i.e., 1) before the adversarial attack, 2) after the adversarial attack, and 3) after the adversarial defence. The Canadian Institute for Cybersecurity Intrusion Detection System 2017 (CICIDS-2017) dataset is used for evaluation purposes with various performance measurements like f1-score, accuracy etc.
△ Less
Submitted 31 July, 2023;
originally announced August 2023.
-
Krylov Complexity and Spectral Form Factor for Noisy Random Matrix Models
Authors:
Arpan Bhattacharyya,
S. Shajidul Haque,
Ghadir Jafari,
Jeff Murugan,
Dimakatso Rapotu
Abstract:
We study the spectral properties of two classes of random matrix models: non-Gaussian RMT with quartic and sextic potentials, and RMT with Gaussian noise. We compute and analyze the quantum Krylov complexity and the spectral form factor for both of these models. We find that both models show suppression of the spectral form factor at short times due to decoherence effects, but they differ in their…
▽ More
We study the spectral properties of two classes of random matrix models: non-Gaussian RMT with quartic and sextic potentials, and RMT with Gaussian noise. We compute and analyze the quantum Krylov complexity and the spectral form factor for both of these models. We find that both models show suppression of the spectral form factor at short times due to decoherence effects, but they differ in their long-time behavior. In particular, we show that the Krylov complexity for the non-Gaussian RMT and RMT with noise deviates from that of a Gaussian RMT. We discuss the implications and limitations of our results for quantum chaos and quantum information in open quantum systems. Our study reveals the distinct sensitivities of the spectral form factor and complexity to non-Gaussianity and noise, which contribute to the observed differences in the different time domains.
△ Less
Submitted 29 October, 2023; v1 submitted 28 July, 2023;
originally announced July 2023.
-
Statement-based Memory for Neural Source Code Summarization
Authors:
Aakash Bansal,
Siyuan Jiang,
Sakib Haque,
Collin McMillan
Abstract:
Source code summarization is the task of writing natural language descriptions of source code behavior. Code summarization underpins software documentation for programmers. Short descriptions of code help programmers understand the program quickly without having to read the code itself. Lately, neural source code summarization has emerged as the frontier of research into automated code summarizati…
▽ More
Source code summarization is the task of writing natural language descriptions of source code behavior. Code summarization underpins software documentation for programmers. Short descriptions of code help programmers understand the program quickly without having to read the code itself. Lately, neural source code summarization has emerged as the frontier of research into automated code summarization techniques. By far the most popular targets for summarization are program subroutines. The idea, in a nutshell, is to train an encoder-decoder neural architecture using large sets of examples of subroutines extracted from code repositories. The encoder represents the code and the decoder represents the summary. However, most current approaches attempt to treat the subroutine as a single unit. For example, by taking the entire subroutine as input to a Transformer or RNN-based encoder. But code behavior tends to depend on the flow from statement to statement. Normally dynamic analysis may shed light on this flow, but dynamic analysis on hundreds of thousands of examples in large datasets is not practical. In this paper, we present a statement-based memory encoder that learns the important elements of flow during training, leading to a statement-based subroutine representation without the need for dynamic analysis. We implement our encoder for code summarization and demonstrate a significant improvement over the state-of-the-art.
△ Less
Submitted 21 July, 2023;
originally announced July 2023.
-
Thermodynamic calculations using reverse Monte Carlo: Simultaneously tuning multiple short-range order parameters for 2D lattice adsorption problem
Authors:
Suhail Haque,
Abhijit Chatterjee
Abstract:
Lattice simulations are an important class of problems in crystalline solids, surface science, alloys, adsorption, absorption, separation, catalysis, to name a few. We describe a fast computational method for performing lattice thermodynamic calculations that is based on the use of the reverse Monte Carlo (RMC) technique and multiple short-range order (SRO) parameters. The approach is comparable i…
▽ More
Lattice simulations are an important class of problems in crystalline solids, surface science, alloys, adsorption, absorption, separation, catalysis, to name a few. We describe a fast computational method for performing lattice thermodynamic calculations that is based on the use of the reverse Monte Carlo (RMC) technique and multiple short-range order (SRO) parameters. The approach is comparable in accuracy to the Metropolis Monte Carlo (MC) method. The equilibrium configuration is determined in 5-10 Newton-Raphson iterations by solving a system of coupled nonlinear algebraic flux equations. This makes the RMC-based method computationally more efficient than MC, given that MC typically requires sampling of millions of configurations. The technique is applied to the interacting 2D adsorption problem. Unlike grand canonical MC, RMC is found to be adept at tackling geometric frustration, as it is able to quickly and correctly provide the ordered c(2x2) adlayer configuration for Cl adsorbed on a Cu (100) surface.
△ Less
Submitted 21 July, 2023;
originally announced July 2023.