-
Joint neutrino oscillation analysis from the T2K and NOvA experiments
Authors:
NOvA,
T2K Collaborations,
:,
K. Abe,
S. Abe,
S. Abubakar,
M. A. Acero,
B. Acharya,
P. Adamson,
H. Adhkary,
R. Akutsu,
H. Alarakia-Charles,
Y. I. Alj Hakim,
S. Alonso Monsalve,
N. Anfimov,
L. Anthony,
A. Antoshkin,
S. Aoki,
K. A. Apte,
T. Arai,
T. Arihara,
S. Arimoto,
E. Arrieta-Diaz,
Y. Ashida,
L. Asquith
, et al. (577 additional authors not shown)
Abstract:
The landmark discovery that neutrinos have mass and can change type (or "flavor") as they propagate -- a process called neutrino oscillation -- has opened up a rich array of theoretical and experimental questions being actively pursued today. Neutrino oscillation remains the most powerful experimental tool for addressing many of these questions, including whether neutrinos violate charge-parity (C…
▽ More
The landmark discovery that neutrinos have mass and can change type (or "flavor") as they propagate -- a process called neutrino oscillation -- has opened up a rich array of theoretical and experimental questions being actively pursued today. Neutrino oscillation remains the most powerful experimental tool for addressing many of these questions, including whether neutrinos violate charge-parity (CP) symmetry, which has possible connections to the unexplained preponderance of matter over antimatter in the universe. Oscillation measurements also probe the mass-squared differences between the different neutrino mass states ($Δm^2$), whether there are two light states and a heavier one (normal ordering) or vice versa (inverted ordering), and the structure of neutrino mass and flavor mixing. Here, we carry out the first joint analysis of data sets from NOvA and T2K, the two currently operating long-baseline neutrino oscillation experiments (hundreds of kilometers of neutrino travel distance), taking advantage of our complementary experimental designs and setting new constraints on several neutrino sector parameters. This analysis provides new precision on the $Δm^2_{32}$ mass difference, finding $2.43^{+0.04}_{-0.03}\ \left(-2.48^{+0.03}_{-0.04}\right)\times 10^{-3}~\mathrm{eV}^2$ in the normal (inverted) ordering, as well as a $3σ$ interval on $δ_{\rm CP}$ of $[-1.38π,\ 0.30π]$ $\left([-0.92π,\ -0.04π]\right)$ in the normal (inverted) ordering. The data show no strong preference for either mass ordering, but notably if inverted ordering were assumed true within the three-flavor mixing paradigm, then our results would provide evidence of CP symmetry violation in the lepton sector.
△ Less
Submitted 24 October, 2025; v1 submitted 22 October, 2025;
originally announced October 2025.
-
What is Implementation Science; and Why It Matters for Bridging the Artificial Intelligence Innovation-to-Application Gap in Medical Imaging
Authors:
Ahmad Fayaz-Bakhsh,
Janice Tania,
Syaheerah Lebai Lutfi,
Abhinav K. Jha,
Arman Rahmim
Abstract:
The transformative potential of artificial intelligence (AI) in medical Imaging (MI) is well recognized. Yet despite promising reports in research settings, many AI tools fail to achieve clinical adoption in practice. In fact, more generally, there is a documented 17-year average delay between evidence generation and implementation of a technology1. Implementation science (IS) may provide a practi…
▽ More
The transformative potential of artificial intelligence (AI) in medical Imaging (MI) is well recognized. Yet despite promising reports in research settings, many AI tools fail to achieve clinical adoption in practice. In fact, more generally, there is a documented 17-year average delay between evidence generation and implementation of a technology1. Implementation science (IS) may provide a practical, evidence-based framework to bridge the gap between AI development and real-world clinical imaging use that helps shorten this lag through systematic frameworks, strategies, and hybrid research designs. We outline challenges specific to AI adoption in MI workflows, including infrastructural, educational, and cultural barriers. We highlight the complementary roles of effectiveness research and implementation research, emphasizing hybrid study designs and the role of integrated KT (iKT), stakeholder engagement, and equity-focused co-creation in designing sustainable and generalizable solutions. We discuss integration of Human-Computer Interaction (HCI) frameworks in MI towards usable AI. Adopting IS is not only a methodological advancement; it is a strategic imperative for accelerating translation of innovation into improved patient outcomes.
△ Less
Submitted 20 October, 2025; v1 submitted 14 October, 2025;
originally announced October 2025.
-
PyMigTool: a tool for end-to-end Python library migration
Authors:
Mohayeminul Islam,
Ajay Kumar Jha,
May Mahmoud,
Sarah Nadi
Abstract:
Library migration is the process of replacing a library with a similar one in a software project. Manual library migration is time consuming and error prone, as it requires developers to understand the Application Programming Interfaces (API) of both libraries, map equivalent APIs, and perform the necessary code transformations. Due to the difficulty of the library migration process, most of the e…
▽ More
Library migration is the process of replacing a library with a similar one in a software project. Manual library migration is time consuming and error prone, as it requires developers to understand the Application Programming Interfaces (API) of both libraries, map equivalent APIs, and perform the necessary code transformations. Due to the difficulty of the library migration process, most of the existing automated techniques and tooling stop at the API mapping stage or support a limited set of libraries and code transformations. In this paper, we develop an end-to-end solution that can automatically migrate code between any arbitrary pair of Python libraries that provide similar functionality. Due to the promising capabilities of Large Language Models (LLMs) in code generation and transformation, we use LLMs as the primary engine for migration. Before building the tool, we first study the capabilities of LLMs for library migration on a benchmark of 321 real-world library migrations. We find that LLMs can effectively perform library migration, but some post-processing steps can further improve the performance. Based on this, we develop PyMigTool, a command line application that combines the power of LLMs, static analysis, and dynamic analysis to provide accurate library migration. We evaluate PyMigTool on 717 real-world Python applications that are not from our benchmark. We find that PyMigTool can migrate 32% of the migrations with complete correctness. Of the remaining migrations, only 14% of the migration-related changes are left for developers to fix for more than half of the projects.
△ Less
Submitted 9 October, 2025;
originally announced October 2025.
-
Humanoid Everyday: A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation
Authors:
Zhenyu Zhao,
Hongyi Jing,
Xiawei Liu,
Jiageng Mao,
Abha Jha,
Hanwen Yang,
Rong Xue,
Sergey Zakharor,
Vitor Guizilini,
Yue Wang
Abstract:
From loco-motion to dextrous manipulation, humanoid robots have made remarkable strides in demonstrating complex full-body capabilities. However, the majority of current robot learning datasets and benchmarks mainly focus on stationary robot arms, and the few existing humanoid datasets are either confined to fixed environments or limited in task diversity, often lacking human-humanoid interaction…
▽ More
From loco-motion to dextrous manipulation, humanoid robots have made remarkable strides in demonstrating complex full-body capabilities. However, the majority of current robot learning datasets and benchmarks mainly focus on stationary robot arms, and the few existing humanoid datasets are either confined to fixed environments or limited in task diversity, often lacking human-humanoid interaction and lower-body locomotion. Moreover, there are a few standardized evaluation platforms for benchmarking learning-based policies on humanoid data. In this work, we present Humanoid Everyday, a large-scale and diverse humanoid manipulation dataset characterized by extensive task variety involving dextrous object manipulation, human-humanoid interaction, locomotion-integrated actions, and more. Leveraging a highly efficient human-supervised teleoperation pipeline, Humanoid Everyday aggregates high-quality multimodal sensory data, including RGB, depth, LiDAR, and tactile inputs, together with natural language annotations, comprising 10.3k trajectories and over 3 million frames of data across 260 tasks across 7 broad categories. In addition, we conduct an analysis of representative policy learning methods on our dataset, providing insights into their strengths and limitations across different task categories. For standardized evaluation, we introduce a cloud-based evaluation platform that allows researchers to seamlessly deploy their policies in our controlled setting and receive performance feedback. By releasing Humanoid Everyday along with our policy learning analysis and a standardized cloud-based evaluation platform, we intend to advance research in general-purpose humanoid manipulation and lay the groundwork for more capable and embodied robotic agents in real-world scenarios. Our dataset, data collection code, and cloud evaluation website are made publicly available on our project website.
△ Less
Submitted 9 October, 2025;
originally announced October 2025.
-
Numerical Demonstration of Kolmogorov Scaling in Magnetohydrodynamic Turbulence
Authors:
Manthan Verma,
Abhishek K. Jha,
Shashwat Nirgudkar,
Mahendra K. Verma
Abstract:
The two leading models of isotropic magnetohydrodynamic (MHD) turbulence have competing predictions: $k^{-5/3}$ (Kolmogorov) and $k^{-3/2}$ (Iroshnikov-Kraichnan) scalings. This paper identifies the valid MHD turbulence model using high-resolution numerical and diagnostics-structure functions, intermittency exponents, and energy spectra and fluxes of imbalance MHD. The energy spectra of our forced…
▽ More
The two leading models of isotropic magnetohydrodynamic (MHD) turbulence have competing predictions: $k^{-5/3}$ (Kolmogorov) and $k^{-3/2}$ (Iroshnikov-Kraichnan) scalings. This paper identifies the valid MHD turbulence model using high-resolution numerical and diagnostics-structure functions, intermittency exponents, and energy spectra and fluxes of imbalance MHD. The energy spectra of our forced MHD simulations on $8192^2$, $4096^2$, $1024^3$, and $512^3$ support Kolmogorov's k^{-5/3} spectrum over Iroshnikov-Kraichnan's k^{-3/2} spectrum, but the difference in the spectral exponents is small. However, the numerically computed third-order structure functions and intermittency exponents support Kolmogorov scaling in both two and three dimensions. Also, the energy fluxes of the imbalance MHD follow the predictions of Kolmogorov scaling. These results would help in better modelling of solar wind, solar corona, and dynamos.
△ Less
Submitted 6 October, 2025;
originally announced October 2025.
-
SAGE: Streaming Agreement-Driven Gradient Sketches for Representative Subset Selection
Authors:
Ashish Jha,
Salman Ahmadi-Asl
Abstract:
Training modern neural networks on large datasets is computationally and energy intensive. We present SAGE, a streaming data-subset selection method that maintains a compact Frequent Directions (FD) sketch of gradient geometry in $O(\ell D)$ memory and prioritizes examples whose sketched gradients align with a consensus direction. The approach eliminates $N \times N$ pairwise similarities and expl…
▽ More
Training modern neural networks on large datasets is computationally and energy intensive. We present SAGE, a streaming data-subset selection method that maintains a compact Frequent Directions (FD) sketch of gradient geometry in $O(\ell D)$ memory and prioritizes examples whose sketched gradients align with a consensus direction. The approach eliminates $N \times N$ pairwise similarities and explicit $N \times \ell$ gradient stores, yielding a simple two-pass, GPU-friendly pipeline. Leveraging FD's deterministic approximation guarantees, we analyze how agreement scoring preserves gradient energy within the principal sketched subspace. Across multiple benchmarks, SAGE trains with small kept-rate budgets while retaining competitive accuracy relative to full-data training and recent subset-selection baselines, and reduces end-to-end compute and peak memory. Overall, SAGE offers a practical, constant-memory alternative that complements pruning and model compression for efficient training.
△ Less
Submitted 8 October, 2025; v1 submitted 2 October, 2025;
originally announced October 2025.
-
Market-Driven Subset Selection for Budgeted Training
Authors:
Ashish Jha,
Valentin Leplat,
AH Phan
Abstract:
Training large language models on massive datasets is computationally expensive, yet empirical evidence suggests that substantial portions of training examples contribute minimally to final performance. Data subset selection addresses this inefficiency by identifying small, high-utility subsets under resource constraints. However, example utility is inherently multi-faceted, encompassing uncertain…
▽ More
Training large language models on massive datasets is computationally expensive, yet empirical evidence suggests that substantial portions of training examples contribute minimally to final performance. Data subset selection addresses this inefficiency by identifying small, high-utility subsets under resource constraints. However, example utility is inherently multi-faceted, encompassing uncertainty, distributional rarity, and diversity signals that are heterogeneous and typically combined through ad hoc weighted sums lacking theoretical grounding. We propose a market-based framework that treats each training example as a tradeable contract and employs the Logarithmic Market Scoring Rule to aggregate multiple utility signals into coherent prices. Heterogeneous signals act as traders, a single liquidity parameter controls concentration versus smoothing, and topic-wise normalization ensures calibrated aggregation. Token budgets are handled explicitly through a price-per-token decision rule with an interpretable length-bias parameter. We establish theoretical connections to maximum-entropy aggregation and provide utility recovery guarantees under noisy but monotone signals. On GSM8K mathematical reasoning under strict 60k-token budgets, our selector achieves parity with strong single-signal baselines while exhibiting lower variance and incurring less than 0.1 GPU-hour overhead. On AGNews classification at 5-25\% retention rates, the market formulation delivers competitive accuracy with improved stability. Our framework unifies multi-signal data curation under fixed computational budgets for prompt-level reasoning and classification tasks.
△ Less
Submitted 20 October, 2025; v1 submitted 2 October, 2025;
originally announced October 2025.
-
LLM-Enhanced, Data-Driven Personalized and Equitable Clinician Scheduling: A Predict-then-Optimize Approach
Authors:
Anjali Jha,
Wanqing Chen,
Maxim Eckmann,
Ian Stockwell,
Jianwu Wang,
Kai Sun
Abstract:
Clinician scheduling remains a persistent challenge due to limited clinical resources and fluctuating demands. This complexity is especially acute in large academic anesthesiology departments as physicians balance responsibilities across multiple clinical sites with conflicting priorities. Further, scheduling must account for individual clinical and lifestyle preferences to ensure job satisfaction…
▽ More
Clinician scheduling remains a persistent challenge due to limited clinical resources and fluctuating demands. This complexity is especially acute in large academic anesthesiology departments as physicians balance responsibilities across multiple clinical sites with conflicting priorities. Further, scheduling must account for individual clinical and lifestyle preferences to ensure job satisfaction and well-being. Traditional approaches, often based on statistical or rule-based optimization models, rely on structured data and explicit domain knowledge. However, these methods often overlook unstructured information, e.g., free-text notes from routinely administered clinician well-being surveys and scheduling platforms. These notes may reveal implicit and underutilized clinical resources. Neglecting such information can lead to misaligned schedules, increased burnout, overlooked staffing flexibility, and suboptimal utilization of available resources. To address this gap, we propose a predict-then-optimize framework that integrates classification-based clinician availability predictions with a mixed-integer programming schedule optimization model. Large language models (LLMs) are employed to extract actionable preferences and implicit constraints from unstructured schedule notes, enhancing the reliability of availability predictions. These predictions then inform the schedule optimization considering four objectives: first, ensuring clinical full-time equivalent compliance, second, reducing workload imbalances by enforcing equitable proportions of shift types, third, maximizing clinician availability for assigned shifts, and fourth, schedule consistency. By combining the interpretive power of LLMs with the rigor of mathematical optimization, our framework provides a robust, data-driven solution that enhances operational efficiency while supporting equity and clinician well-being.
△ Less
Submitted 2 October, 2025;
originally announced October 2025.
-
RL-Guided Data Selection for Language Model Finetuning
Authors:
Animesh Jha,
Harshit Gupta,
Ananjan Nandi
Abstract:
Data selection for finetuning Large Language Models (LLMs) can be framed as a budget-constrained optimization problem: maximizing a model's downstream performance under a strict training data budget. Solving this problem is generally intractable, and existing approximate approaches are pretraining-oriented and transfer poorly to the fine-tuning setting. We reformulate this problem as a tractable M…
▽ More
Data selection for finetuning Large Language Models (LLMs) can be framed as a budget-constrained optimization problem: maximizing a model's downstream performance under a strict training data budget. Solving this problem is generally intractable, and existing approximate approaches are pretraining-oriented and transfer poorly to the fine-tuning setting. We reformulate this problem as a tractable Markov Decision Process (MDP) and train agents using various Reinforcement Learning (RL) methods to learn optimal data selection policies, guided by an efficient, proxy-model-based reward signal. Across four datasets, training on a $5\%$ subset selected by our approach matches or outperforms fine-tuning on the full dataset by up to $10.8$ accuracy points, while cutting wall-clock training time by up to $2 \times$, highlighting the promise of RL-guided data selection.
△ Less
Submitted 30 September, 2025;
originally announced September 2025.
-
Excitonic Energy Transfer in Red Algal Photosystem I Reveals an Evolutionary Bridge between Cyanobacteria and Plants
Authors:
Mengyuan Cui,
Zihui Liu,
Miriam Izzo,
Junhua Zhou,
Enhu He,
Vandana Tiwari,
Petar H. Lambrev,
R. J. Dwayne Miller,
Joanna Kargu,
Fulu Zheng,
Ajay Jha,
Hong-GuangDuan
Abstract:
Photosystem I converts light into chemical energy with near-unity quantum efficiency,yet its energy-transfer and charge-separation mechanisms remain debated. Evolution has diversified PSI architectures. The unicellular red algae Cyanidioschyzon merolae represents a key evolutionary intermediate,featuring a cyanobacterial-like monomeric core surrounded by three to five LHCR subunits. This hybrid or…
▽ More
Photosystem I converts light into chemical energy with near-unity quantum efficiency,yet its energy-transfer and charge-separation mechanisms remain debated. Evolution has diversified PSI architectures. The unicellular red algae Cyanidioschyzon merolae represents a key evolutionary intermediate,featuring a cyanobacterial-like monomeric core surrounded by three to five LHCR subunits. This hybrid organization provides a unique system to bridge mechanistic models across lineages. We applied two-dimensional electronic spectroscopy at ultralow temperatures to disentangle overlapping excitation pathways in C. merolae PSI. Cryogenic measurements suppressed thermal broadening, resolving five dynamical components: sub-picosecond equilibration acrossthe core-LHCR interface, subsequent population transfer into progressively lowerenergy manifolds, and slower feeding into red pools distributed across both core and antenna. On the longest timescales, a persistent ground-state bleach signifies excitons stabilised in terminal sinks. Notably, comparison of 8 K and 80 K spectra reveals that excitations are heterogeneously partitioned among multiple sinks at low disorder, whereas modest thermal activation promotes selective convergence into core-associated red chlorophylls. To interpret these dynamics, we employed atomistic excitonic Hamiltonians with time-nonlocal master equations, providing a quantitative framework for exciton migration and thermal redistribution. Together, these results demonstrate that C. merolae PSI broadens the kinetic funnel by distributing sinks across core and antenna, an evolutionary adaptation that extends spectral coverage whilst ensuring efficient trapping. These insights reconcile cyanobacterial and plant paradigms and illuminate how antenna expansion reshaped PSI function during the course of photosynthetic evolution.
△ Less
Submitted 29 September, 2025;
originally announced September 2025.
-
Low-Noise Nanoscale Vortex Sensor for Out-of-Plane Magnetic Field Detection
Authors:
Ajay Jha,
Alvaro Palomino,
Stéphane Auffret,
Hélène Béa,
Ricardo C. Sousa,
Liliana D. Buda-Prejbeanu,
Bernard Dieny
Abstract:
This study investigates a vortex sensor based on a nanoscale (sub-100 nm) magnetic tunnel junction (MTJ) with a strong shape anisotropy, designed for sensitivity to the out-of-plane magnetic field component ($H_z$). The sensor comprises a free layer with a vortex configuration and a perpendicularly magnetized reference layer, which provides a reproducible and linear response when excited by a perp…
▽ More
This study investigates a vortex sensor based on a nanoscale (sub-100 nm) magnetic tunnel junction (MTJ) with a strong shape anisotropy, designed for sensitivity to the out-of-plane magnetic field component ($H_z$). The sensor comprises a free layer with a vortex configuration and a perpendicularly magnetized reference layer, which provides a reproducible and linear response when excited by a perpendicular magnetic field. Experimental measurements and micromagnetic simulations were combined to systematically assess the influence of structural parameters, specifically aspect ratio and defect landscape, on key sensor performance metrics, including dynamic range, sensitivity, and detectivity. The out-of-plane vortex sensor demonstrates a significantly improved dynamic range exceeding 200 mT, compared to the 40-80 mT typical of conventional in-plane vortex sensors. Frequency-dependent noise measurements reveal that the sensor exhibits low intrinsic noise, along with improved detectivity and resolution. This performance is ascribed to the field-dependent expansion and contraction of the vortex core, which reduces Barkhausen-type noise caused by defect-induced pinning potentials. Moreover, the sub-100\,nm lateral dimensions of the sensor enable scalable array integration, providing further enhancements in noise and detectivity through collective averaging. These results underscore the potential of this sensor architecture for advanced magnetic field sensing applications requiring a wide dynamic range and high measurement accuracy at the same time.
△ Less
Submitted 20 September, 2025;
originally announced September 2025.
-
Quantum spread complexity as a probe of NSI, $CP$ Violation, and mass ordering in neutrino oscillations in matter
Authors:
Abhishek Kumar Jha,
Govind Krishna G,
Anandbhai Pravinbhai Prajapati,
Subhashish Banerjee
Abstract:
Quantum spread complexity characterizes how a quantum state evolves and becomes distributed over the Hilbert space under unitary dynamics. In this work, we employ a cost function as a quantitative measure of spread complexity. We investigate this cost function within the framework of three-flavor neutrino oscillations in vacuum and matter, incorporating the $CP$-violation phase and Non-Standard In…
▽ More
Quantum spread complexity characterizes how a quantum state evolves and becomes distributed over the Hilbert space under unitary dynamics. In this work, we employ a cost function as a quantitative measure of spread complexity. We investigate this cost function within the framework of three-flavor neutrino oscillations in vacuum and matter, incorporating the $CP$-violation phase and Non-Standard Interaction (NSI) effects, under both normal and inverted mass ordering scenarios. The cost function is evaluated for each scenario and analyzed with the corresponding neutrino transition probabilities for both initial muon neutrino and muon antineutrino flavor states. The results are presented using the energy where the first oscillation is maximum and baseline lengths of ongoing long-baseline accelerator neutrino experiments, including T2K and NOvA, as well as upcoming experiments such as DUNE and P2O. Our findings indicate that the difference in the cost function between normal and inverted mass orderings during neutrino propagation in matter is sensitive to these experiments, with the appropriate choice of NSI parameters and the best-fit $CP$-violation phase values.
△ Less
Submitted 11 September, 2025;
originally announced September 2025.
-
Measurement of muon neutrino induced charged current interactions without charged pions in the final state using a new T2K off-axis near detector WAGASCI-BabyMIND
Authors:
K. Abe,
S. Abe,
R. Akutsu,
H. Alarakia-Charles,
Y. I. Alj Hakim,
S. Alonso Monsalve,
L. Anthony,
S. Aoki,
K. A. Apte,
T. Arai,
T. Arihara,
S. Arimoto,
Y. Ashida,
E. T. Atkin,
N. Babu,
V. Baranov,
G. J. Barker,
G. Barr,
D. Barrow,
P. Bates,
L. Bathe-Peters,
M. Batkiewicz-Kwasniak,
N. Baudis,
V. Berardi,
L. Berns
, et al. (377 additional authors not shown)
Abstract:
We report a flux-integrated cross section measurement of muon neutrino interactions on water and hydrocarbon via charged current reactions without charged pions in the final state with the WAGASCI-BabyMIND detector which was installed in the T2K near detector hall in 2018. The detector is located 1.5$^\circ$ off-axis and is exposed to a more energetic neutrino flux than ND280, another T2K near det…
▽ More
We report a flux-integrated cross section measurement of muon neutrino interactions on water and hydrocarbon via charged current reactions without charged pions in the final state with the WAGASCI-BabyMIND detector which was installed in the T2K near detector hall in 2018. The detector is located 1.5$^\circ$ off-axis and is exposed to a more energetic neutrino flux than ND280, another T2K near detector, which is located at a different off-axis position. The total flux-integrated cross section is measured to be $1.26 \pm 0.18\,(stat.+syst.) \times 10^{-39} $ $\mathrm{cm^{2}/nucleon}$ on CH and $1.44 \pm 0.21\,(stat.+syst.) \times 10^{-39} $ $\mathrm{cm^{2}/nucleon}$ on H$_{2}$O. These results are compared to model predictions provided by the NEUT v5.3.2 and GENIE v2.8.0 MC generators and the measurements are compatible with these models. Differential cross sections in muon momentum and cosine of the muon scattering angle are also reported. This is the first such measurement reported with the WAGASCI-BabyMIND detector and utilizes the 2020 and 2021 datasets.
△ Less
Submitted 9 September, 2025;
originally announced September 2025.
-
From Diagnosis to Therapy: Progress in SPECT and PET Reconstruction for Theranostics
Authors:
Kweku Enninful,
Fardeen Ahmed,
Bradley Girod,
Richard Laforest,
Daniel L. J. Thorek,
Vikas Prasad,
Abhinav K. Jha
Abstract:
The theranostic paradigm enables personalization of treatment by selecting patients with a diagnostic radiopharmaceutical and monitoring therapy using a matched therapeutic isotope. This strategy relies on accurate image reconstruction of both pre-therapy and post-therapy images for patient selection and monitoring treatment. However, traditional reconstruction methods are hindered by challenges s…
▽ More
The theranostic paradigm enables personalization of treatment by selecting patients with a diagnostic radiopharmaceutical and monitoring therapy using a matched therapeutic isotope. This strategy relies on accurate image reconstruction of both pre-therapy and post-therapy images for patient selection and monitoring treatment. However, traditional reconstruction methods are hindered by challenges such as crosstalk in multi-isotope imaging and extremely low-count measurements when imaging of alpha- (α-) emitting therapies. Additionally, to fully realize the benefits of new imaging systems being developed for theranostic applications, advanced reconstruction techniques are needed. These needs, alongside the growing clinical adoption of theranostics, have spurred the development of novel PET and SPECT reconstruction algorithms. This review highlights recent progress and addresses critical challenges and unmet needs in theranostic image reconstruction.
△ Less
Submitted 8 September, 2025;
originally announced September 2025.
-
Thermal Fluctuation Driven Structural Relaxation in Undeformed Glasses: Unraveling the Evolution of Mechanical Stability
Authors:
Avinash Kumar Jha,
Shiladitya Sengupta
Abstract:
Glasses are mechanically rigid, still undergo structural relaxation which changes their properties and affects potential technological applications. Understanding the underlying physical processes is a problem of broad theoretical and practical interest. We investigate intermittent structural relaxation events or ``avalanches'' occurring inside glassy regime. Contrary to the more well-known avalan…
▽ More
Glasses are mechanically rigid, still undergo structural relaxation which changes their properties and affects potential technological applications. Understanding the underlying physical processes is a problem of broad theoretical and practical interest. We investigate intermittent structural relaxation events or ``avalanches'' occurring inside glassy regime. Contrary to the more well-known avalanches due to shear, here they are induced by thermal fluctuations in undeformed glass. By analyzing changes in structural, mechanical, dynamical, topological and vibrational properties of the system, we provide a multi-faceted characterization of avalanches. Overall we find that the system softens due to avalanches. Further, we develop a formalism to extract local measures of non-Affine displacement and tensorial strain for thermal amorphous solids in absence of any external deformation. Our analysis highlights a key difference between two types of driving: while the shear deformation response is dominated by volume preserving deviatoric strain, changes in local density must be considered to model response of undeformed glass under thermal noise. The observations suggest the idea of Generalized Strain Transformation Zones (GSTZ), where coupled shear and volume-changing deformations govern thermally-mediated plasticity. Our work paves the way for a unified description of elasto-plastic response of (athermal) mechanically deformed and thermally driven undeformed glasses.
△ Less
Submitted 7 September, 2025;
originally announced September 2025.
-
3D-Image Reconstruction using MIMO-SAR FMCW Radar
Authors:
Ayush Jha,
Dhanireddy Chandrika,
Chandra Sekhar Seelamantula,
Chetan Singh Thakur
Abstract:
With the advancement of millimeter-wave radar technology, Synthetic Aperture Radar (SAR) imaging at millimeter-wave frequencies has gained significant attention in both academic research and industrial applications. However, traditional SAR imaging algorithms primarily focus on extracting two-dimensional information from detected targets, which limits their potential for 3D scene reconstruction. I…
▽ More
With the advancement of millimeter-wave radar technology, Synthetic Aperture Radar (SAR) imaging at millimeter-wave frequencies has gained significant attention in both academic research and industrial applications. However, traditional SAR imaging algorithms primarily focus on extracting two-dimensional information from detected targets, which limits their potential for 3D scene reconstruction. In this work, we demonstrated a fast time-domain reconstruction algorithm for achieving high-resolution 3D radar imaging at millimeter-wave (mmWave) frequencies. This approach leverages a combination of virtual Multiple Input Multiple Output (MIMO) Frequency Modulated Continuous Wave (FMCW) radar with the precision of Synthetic Aperture Radar (SAR) technique, setting the stage for a new era of advanced radar imaging applications.
△ Less
Submitted 7 September, 2025;
originally announced September 2025.
-
PersonaTeaming: Exploring How Introducing Personas Can Improve Automated AI Red-Teaming
Authors:
Wesley Hanwen Deng,
Sunnie S. Y. Kim,
Akshita Jha,
Ken Holstein,
Motahhare Eslami,
Lauren Wilcox,
Leon A Gatys
Abstract:
Recent developments in AI governance and safety research have called for red-teaming methods that can effectively surface potential risks posed by AI models. Many of these calls have emphasized how the identities and backgrounds of red-teamers can shape their red-teaming strategies, and thus the kinds of risks they are likely to uncover. While automated red-teaming approaches promise to complement…
▽ More
Recent developments in AI governance and safety research have called for red-teaming methods that can effectively surface potential risks posed by AI models. Many of these calls have emphasized how the identities and backgrounds of red-teamers can shape their red-teaming strategies, and thus the kinds of risks they are likely to uncover. While automated red-teaming approaches promise to complement human red-teaming by enabling larger-scale exploration of model behavior, current approaches do not consider the role of identity. As an initial step towards incorporating people's background and identities in automated red-teaming, we develop and evaluate a novel method, PersonaTeaming, that introduces personas in the adversarial prompt generation process to explore a wider spectrum of adversarial strategies. In particular, we first introduce a methodology for mutating prompts based on either "red-teaming expert" personas or "regular AI user" personas. We then develop a dynamic persona-generating algorithm that automatically generates various persona types adaptive to different seed prompts. In addition, we develop a set of new metrics to explicitly measure the "mutation distance" to complement existing diversity measurements of adversarial prompts. Our experiments show promising improvements (up to 144.1%) in the attack success rates of adversarial prompts through persona mutation, while maintaining prompt diversity, compared to RainbowPlus, a state-of-the-art automated red-teaming method. We discuss the strengths and limitations of different persona types and mutation methods, shedding light on future opportunities to explore complementarities between automated and human red-teaming approaches.
△ Less
Submitted 27 October, 2025; v1 submitted 3 September, 2025;
originally announced September 2025.
-
Mix, Align, Distil: Reliable Cross-Domain Atypical Mitosis Classification
Authors:
Kaustubh Atey,
Sameer Anand Jha,
Gouranga Bala,
Amit Sethi
Abstract:
Atypical mitotic figures (AMFs) are important histopathological markers yet remain challenging to identify consistently, particularly under domain shift stemming from scanner, stain, and acquisition differences. We present a simple training-time recipe for domain-robust AMF classification in MIDOG 2025 Task 2. The approach (i) increases feature diversity via style perturbations inserted at early a…
▽ More
Atypical mitotic figures (AMFs) are important histopathological markers yet remain challenging to identify consistently, particularly under domain shift stemming from scanner, stain, and acquisition differences. We present a simple training-time recipe for domain-robust AMF classification in MIDOG 2025 Task 2. The approach (i) increases feature diversity via style perturbations inserted at early and mid backbone stages, (ii) aligns attention-refined features across sites using weak domain labels (Scanner, Origin, Species, Tumor) through an auxiliary alignment loss, and (iii) stabilizes predictions by distilling from an exponential moving average (EMA) teacher with temperature-scaled KL divergence. On the organizer-run preliminary leaderboard for atypical mitosis classification, our submission attains balanced accuracy of 0.8762, sensitivity of 0.8873, specificity of 0.8651, and ROC AUC of 0.9499. The method incurs negligible inference-time overhead, relies only on coarse domain metadata, and delivers strong, balanced performance, positioning it as a competitive submission for the MIDOG 2025 challenge.
△ Less
Submitted 28 August, 2025;
originally announced August 2025.
-
Sealing The Backdoor: Unlearning Adversarial Text Triggers In Diffusion Models Using Knowledge Distillation
Authors:
Ashwath Vaithinathan Aravindan,
Abha Jha,
Matthew Salaway,
Atharva Sandeep Bhide,
Duygu Nur Yaldiz
Abstract:
Text-to-image diffusion models have revolutionized generative AI, but their vulnerability to backdoor attacks poses significant security risks. Adversaries can inject imperceptible textual triggers into training data, causing models to generate manipulated outputs. Although text-based backdoor defenses in classification models are well-explored, generative models lack effective mitigation techniqu…
▽ More
Text-to-image diffusion models have revolutionized generative AI, but their vulnerability to backdoor attacks poses significant security risks. Adversaries can inject imperceptible textual triggers into training data, causing models to generate manipulated outputs. Although text-based backdoor defenses in classification models are well-explored, generative models lack effective mitigation techniques against. We address this by selectively erasing the model's learned associations between adversarial text triggers and poisoned outputs, while preserving overall generation quality. Our approach, Self-Knowledge Distillation with Cross-Attention Guidance (SKD-CAG), uses knowledge distillation to guide the model in correcting responses to poisoned prompts while maintaining image quality by exploiting the fact that the backdoored model still produces clean outputs in the absence of triggers. Using the cross-attention mechanism, SKD-CAG neutralizes backdoor influences at the attention level, ensuring the targeted removal of adversarial effects. Extensive experiments show that our method outperforms existing approaches, achieving removal accuracy 100\% for pixel backdoors and 93\% for style-based attacks, without sacrificing robustness or image fidelity. Our findings highlight targeted unlearning as a promising defense to secure generative models. Code and model weights can be found at https://github.com/Mystic-Slice/Sealing-The-Backdoor .
△ Less
Submitted 19 August, 2025;
originally announced August 2025.
-
Do VLMs Have Bad Eyes? Diagnosing Compositional Failures via Mechanistic Interpretability
Authors:
Ashwath Vaithinathan Aravindan,
Abha Jha,
Mihir Kulkarni
Abstract:
Vision-Language Models (VLMs) have shown remarkable performance in integrating visual and textual information for tasks such as image captioning and visual question answering. However, these models struggle with compositional generalization and object binding, which limit their ability to handle novel combinations of objects and their attributes. Our work explores the root causes of these failures…
▽ More
Vision-Language Models (VLMs) have shown remarkable performance in integrating visual and textual information for tasks such as image captioning and visual question answering. However, these models struggle with compositional generalization and object binding, which limit their ability to handle novel combinations of objects and their attributes. Our work explores the root causes of these failures using mechanistic interpretability techniques. We show evidence that individual neurons in the MLP layers of CLIP's vision encoder represent multiple features, and this "superposition" directly hinders its compositional feature representation which consequently affects compositional reasoning and object binding capabilities. We hope this study will serve as an initial step toward uncovering the mechanistic roots of compositional failures in VLMs. The code and supporting results can be found https://github.com/Mystic-Slice/Do-VLMs-Have-Bad-Eyes.
△ Less
Submitted 26 August, 2025; v1 submitted 19 August, 2025;
originally announced August 2025.
-
GRAFT: Gradient-Aware Fast MaxVol Technique for Dynamic Data Sampling
Authors:
Ashish Jha,
Anh huy Phan,
Razan Dibo,
Valentin Leplat
Abstract:
Training modern neural networks on large datasets is computationally and environmentally costly. We introduce GRAFT, a scalable in-training subset selection method that (i) extracts a low-rank feature representation for each batch, (ii) applies a Fast MaxVol sampler to select a small, diverse subset that spans the batch's dominant subspace, and (iii) dynamically adjusts the subset size using a gra…
▽ More
Training modern neural networks on large datasets is computationally and environmentally costly. We introduce GRAFT, a scalable in-training subset selection method that (i) extracts a low-rank feature representation for each batch, (ii) applies a Fast MaxVol sampler to select a small, diverse subset that spans the batch's dominant subspace, and (iii) dynamically adjusts the subset size using a gradient-approximation criterion. By operating in low-rank subspaces and training on carefully chosen examples instead of full batches, GRAFT preserves the training trajectory while reducing wall-clock time, energy consumption, and $\mathrm{CO}_2$ emissions. Across multiple benchmarks, GRAFT matches or exceeds recent selection baselines in both accuracy and efficiency, providing a favorable trade-off between accuracy, efficiency, and emissions.
△ Less
Submitted 22 August, 2025; v1 submitted 19 August, 2025;
originally announced August 2025.
-
CARGO: A Co-Optimization Framework for EV Charging and Routing in Goods Delivery Logistics
Authors:
Arindam Khanda,
Anurag Satpathy,
Amit Jha,
Sajal K. Das
Abstract:
With growing interest in sustainable logistics, electric vehicle (EV)-based deliveries offer a promising alternative for urban distribution. However, EVs face challenges due to their limited battery capacity, requiring careful planning for recharging. This depends on factors such as the charging point (CP) availability, cost, proximity, and vehicles' state of charge (SoC). We propose CARGO, a fram…
▽ More
With growing interest in sustainable logistics, electric vehicle (EV)-based deliveries offer a promising alternative for urban distribution. However, EVs face challenges due to their limited battery capacity, requiring careful planning for recharging. This depends on factors such as the charging point (CP) availability, cost, proximity, and vehicles' state of charge (SoC). We propose CARGO, a framework addressing the EV-based delivery route planning problem (EDRP), which jointly optimizes route planning and charging for deliveries within time windows. After proving the problem's NP-hardness, we propose a mixed integer linear programming (MILP)-based exact solution and a computationally efficient heuristic method. Using real-world datasets, we evaluate our methods by comparing the heuristic to the MILP solution, and benchmarking it against baseline strategies, Earliest Deadline First (EDF) and Nearest Delivery First (NDF). The results show up to 39% and 22% reductions in the charging cost over EDF and NDF, respectively, while completing comparable deliveries.
△ Less
Submitted 2 August, 2025;
originally announced August 2025.
-
confopt: A Library for Implementation and Evaluation of Gradient-based One-Shot NAS Methods
Authors:
Abhash Kumar Jha,
Shakiba Moradian,
Arjun Krishnakumar,
Martin Rapp,
Frank Hutter
Abstract:
Gradient-based one-shot neural architecture search (NAS) has significantly reduced the cost of exploring architectural spaces with discrete design choices, such as selecting operations within a model. However, the field faces two major challenges. First, evaluations of gradient-based NAS methods heavily rely on the DARTS benchmark, despite the existence of other available benchmarks. This overreli…
▽ More
Gradient-based one-shot neural architecture search (NAS) has significantly reduced the cost of exploring architectural spaces with discrete design choices, such as selecting operations within a model. However, the field faces two major challenges. First, evaluations of gradient-based NAS methods heavily rely on the DARTS benchmark, despite the existence of other available benchmarks. This overreliance has led to saturation, with reported improvements often falling within the margin of noise. Second, implementations of gradient-based one-shot NAS methods are fragmented across disparate repositories, complicating fair and reproducible comparisons and further development. In this paper, we introduce Configurable Optimizer (confopt), an extensible library designed to streamline the development and evaluation of gradient-based one-shot NAS methods. Confopt provides a minimal API that makes it easy for users to integrate new search spaces, while also supporting the decomposition of NAS optimizers into their core components. We use this framework to create a suite of new DARTS-based benchmarks, and combine them with a novel evaluation protocol to reveal a critical flaw in how gradient-based one-shot NAS methods are currently assessed. The code can be found at https://github.com/automl/ConfigurableOptimizer.
△ Less
Submitted 22 July, 2025;
originally announced July 2025.
-
A Study of Anatomical Priors for Deep Learning-Based Segmentation of Pheochromocytoma in Abdominal CT
Authors:
Tanjin Taher Toma,
Tejas Sudharshan Mathai,
Bikash Santra,
Pritam Mukherjee,
Jianfei Liu,
Wesley Jong,
Darwish Alabyad,
Vivek Batheja,
Abhishek Jha,
Mayank Patel,
Darko Pucar,
Jayadira del Rivero,
Karel Pacak,
Ronald M. Summers
Abstract:
Accurate segmentation of pheochromocytoma (PCC) in abdominal CT scans is essential for tumor burden estimation, prognosis, and treatment planning. It may also help infer genetic clusters, reducing reliance on expensive testing. This study systematically evaluates anatomical priors to identify configurations that improve deep learning-based PCC segmentation. We employed the nnU-Net framework to eva…
▽ More
Accurate segmentation of pheochromocytoma (PCC) in abdominal CT scans is essential for tumor burden estimation, prognosis, and treatment planning. It may also help infer genetic clusters, reducing reliance on expensive testing. This study systematically evaluates anatomical priors to identify configurations that improve deep learning-based PCC segmentation. We employed the nnU-Net framework to evaluate eleven annotation strategies for accurate 3D segmentation of pheochromocytoma, introducing a set of novel multi-class schemes based on organ-specific anatomical priors. These priors were derived from adjacent organs commonly surrounding adrenal tumors (e.g., liver, spleen, kidney, aorta, adrenal gland, and pancreas), and were compared against a broad body-region prior used in previous work. The framework was trained and tested on 105 contrast-enhanced CT scans from 91 patients at the NIH Clinical Center. Performance was measured using Dice Similarity Coefficient (DSC), Normalized Surface Distance (NSD), and instance-wise F1 score. Among all strategies, the Tumor + Kidney + Aorta (TKA) annotation achieved the highest segmentation accuracy, significantly outperforming the previously used Tumor + Body (TB) annotation across DSC (p = 0.0097), NSD (p = 0.0110), and F1 score (25.84% improvement at an IoU threshold of 0.5), measured on a 70-30 train-test split. The TKA model also showed superior tumor burden quantification (R^2 = 0.968) and strong segmentation across all genetic subtypes. In five-fold cross-validation, TKA consistently outperformed TB across IoU thresholds (0.1 to 0.5), reinforcing its robustness and generalizability. These findings highlight the value of incorporating relevant anatomical context into deep learning models to achieve precise PCC segmentation, offering a valuable tool to support clinical assessment and longitudinal disease monitoring in PCC patients.
△ Less
Submitted 24 July, 2025; v1 submitted 20 July, 2025;
originally announced July 2025.
-
Universal Scaling Laws in Freeway Traffic
Authors:
Garyoung Lee,
Aryaman Jha,
Kurt Wiesenfeld,
Jorge Laval
Abstract:
Traffic congestion, a daily frustration for millions and a multi-billion dollar drain on economies, has long resisted deep physical understanding. While simple theoretical models of traffic flow have suggested connections to critical phenomena and non-equilibrium universality, direct empirical validation is lacking. Using extensive, high-resolution vehicle trajectory data from the I-24 MOTION test…
▽ More
Traffic congestion, a daily frustration for millions and a multi-billion dollar drain on economies, has long resisted deep physical understanding. While simple theoretical models of traffic flow have suggested connections to critical phenomena and non-equilibrium universality, direct empirical validation is lacking. Using extensive, high-resolution vehicle trajectory data from the I-24 MOTION testbed, we show that traffic flow exhibits both a percolation phase transition that is self-organized critical and fluctuations consistent with the Kardar-Parisi-Zhang universality in 1+1 dimensions. This suggests that the complex and seemingly chaotic formation of traffic jams has predictable statistical properties, which opens new avenues in traffic science for developing advanced forecasting and management strategies grounded in universal scaling laws.
△ Less
Submitted 13 July, 2025;
originally announced July 2025.
-
Objective Task-based Evaluation of Quantitative Medical Imaging Methods: Emerging Frameworks and Future Directions
Authors:
Yan Liu,
Huitian Xia,
Nancy A. Obuchowski,
Richard Laforest,
Arman Rahmim,
Barry A. Siegel,
Abhinav K. Jha
Abstract:
Quantitative imaging (QI) is demonstrating strong promise across multiple clinical applications. For clinical translation of QI methods, objective evaluation on clinically relevant tasks is essential. To address this need, multiple evaluation strategies are being developed. In this paper, based on previous literature, we outline four emerging frameworks to perform evaluation studies of QI methods.…
▽ More
Quantitative imaging (QI) is demonstrating strong promise across multiple clinical applications. For clinical translation of QI methods, objective evaluation on clinically relevant tasks is essential. To address this need, multiple evaluation strategies are being developed. In this paper, based on previous literature, we outline four emerging frameworks to perform evaluation studies of QI methods. We first discuss the use of virtual imaging trials (VITs) to evaluate QI methods. Next, we outline a no-gold-standard evaluation framework to clinically evaluate QI methods without ground truth. Third, a framework to evaluate QI methods for joint detection and quantification tasks is outlined. Finally, we outline a framework to evaluate QI methods that output multi-dimensional parameters, such as radiomic features. We review these frameworks, discussing their utilities and limitations. Further, we examine future research areas in evaluation of QI methods. Given the recent advancements in PET, including long axial field-of-view scanners and the development of artificial-intelligence algorithms, we present these frameworks in the context of PET.
△ Less
Submitted 25 August, 2025; v1 submitted 6 July, 2025;
originally announced July 2025.
-
Behavioral Probability Weighting and Portfolio Optimization under Semi-Heavy Tails
Authors:
Ayush Jha,
Abootaleb Shirvani,
Ali M. Jaffri,
Svetlozar T. Rachev,
Frank J. Fabozzi
Abstract:
This paper develops a unified framework that integrates behavioral distortions into rational portfolio optimization by extracting implied probability weighting functions (PWFs) from optimal portfolios modeled under Gaussian and Normal-Inverse-Gaussian (NIG) return distributions. Using DJIA constituents, we construct mean-CVaR99 frontiers, alongwith Sharpe- and CVaR-maximizing portfolios, and estim…
▽ More
This paper develops a unified framework that integrates behavioral distortions into rational portfolio optimization by extracting implied probability weighting functions (PWFs) from optimal portfolios modeled under Gaussian and Normal-Inverse-Gaussian (NIG) return distributions. Using DJIA constituents, we construct mean-CVaR99 frontiers, alongwith Sharpe- and CVaR-maximizing portfolios, and estimate PWFs that capture nonlinear beliefs consistent with fear and greed. We show that increasing tail fatness amplifies these distortions and that shifts in the term structure of risk-free rates alter their curvature. The results highlight the importance of jointly modeling return asymmetry and belief distortions in portfolio risk management and capital allocation under extreme-risk environments.
△ Less
Submitted 5 July, 2025;
originally announced July 2025.
-
Testing T2K's Bayesian constraints with priors in alternate parameterisations
Authors:
The T2K Collaboration,
K. Abe,
S. Abe,
R. Akutsu,
H. Alarakia-Charles,
Y. I. Alj Hakim,
S. Alonso Monsalve,
L. Anthony,
S. Aoki,
K. A. Apte,
T. Arai,
T. Arihara,
S. Arimoto,
Y. Ashida,
E. T. Atkin,
N. Babu,
V. Baranov,
G. J. Barker,
G. Barr,
D. Barrow,
P. Bates,
L. Bathe-Peters,
M. Batkiewicz-Kwasniak,
N. Baudis,
V. Berardi
, et al. (379 additional authors not shown)
Abstract:
Bayesian analysis results require a choice of prior distribution. In long-baseline neutrino oscillation physics, the usual parameterisation of the mixing matrix induces a prior that privileges certain neutrino mass and flavour state symmetries. Here we study the effect of privileging alternate symmetries on the results of the T2K experiment. We find that constraints on the level of CP violation (a…
▽ More
Bayesian analysis results require a choice of prior distribution. In long-baseline neutrino oscillation physics, the usual parameterisation of the mixing matrix induces a prior that privileges certain neutrino mass and flavour state symmetries. Here we study the effect of privileging alternate symmetries on the results of the T2K experiment. We find that constraints on the level of CP violation (as given by the Jarlskog invariant) are robust under the choices of prior considered in the analysis. On the other hand, the degree of octant preference for the atmospheric angle depends on which symmetry has been privileged.
△ Less
Submitted 2 July, 2025;
originally announced July 2025.
-
Development and in silico imaging trial evaluation of a deep-learning-based transmission-less attenuation compensation method for DaT SPECT
Authors:
Zitong Yu,
Md Ashequr Rahman,
Zekun Li,
Chunwei Ying,
Hongyu An,
Tammie L. S. Benzinger,
Richard Laforest,
Jingqin Luo,
Scott A. Norris,
Abhinav K. Jha
Abstract:
Quantitative measures of dopamine transporter (DaT) uptake in caudate, putamen, and globus pallidus derived from DaT-single-photon emission computed tomography (SPECT) images are being investigated as biomarkers to diagnose, assess disease status, and track the progression of Parkinsonism. Reliable quantification from DaT-SPECT images requires performing attenuation compensation (AC), typically wi…
▽ More
Quantitative measures of dopamine transporter (DaT) uptake in caudate, putamen, and globus pallidus derived from DaT-single-photon emission computed tomography (SPECT) images are being investigated as biomarkers to diagnose, assess disease status, and track the progression of Parkinsonism. Reliable quantification from DaT-SPECT images requires performing attenuation compensation (AC), typically with a separate X-ray CT scan. Such CT-based AC (CTAC) has multiple challenges, a key one being the non-availability of X-ray CT component on many clinical SPECT systems. Even when a CT is available, the additional CT scan leads to increased radiation dose, costs, and complexity, potential quantification errors due to SPECT-CT misalignment, and higher training and regulatory requirements. To overcome the challenges with the requirement of a CT scan for AC in DaT SPECT, we propose a deep learning (DL)-based transmission-less AC method for DaT-SPECT (DaT-CTLESS). An in silico imaging trial, titled ISIT-DaT, was designed to evaluate the performance of DaT-CTLESS on the regional uptake quantification task. We observed that DaT-CTLESS yielded a significantly higher correlation with CTAC than that between UAC and CTAC on the regional DaT uptake quantification task. Further, DaT-CLTESS had an excellent agreement with CTAC on this task, significantly outperformed UAC in distinguishing patients with normal versus reduced putamen SBR, yielded good generalizability across two scanners, was generally insensitive to intra-regional uptake heterogeneity, demonstrated good repeatability, exhibited robust performance even as the size of the training data was reduced, and generally outperformed the other considered DL methods on the task of quantifying regional uptake across different training dataset sizes. These results provide a strong motivation for further clinical evaluation of DaT-CTLESS.
△ Less
Submitted 25 June, 2025;
originally announced June 2025.
-
Classification of Cattle Behavior and Detection of Heat (Estrus) using Sensor Data
Authors:
Druva Dhakshinamoorthy,
Avikshit Jha,
Sabyasachi Majumdar,
Devdulal Ghosh,
Ranjita Chakraborty,
Hena Ray
Abstract:
This paper presents a novel system for monitoring cattle behavior and detecting estrus (heat) periods using sensor data and machine learning. We designed and deployed a low-cost Bluetooth-based neck collar equipped with accelerometer and gyroscope sensors to capture real-time behavioral data from real cows, which was synced to the cloud. A labeled dataset was created using synchronized CCTV footag…
▽ More
This paper presents a novel system for monitoring cattle behavior and detecting estrus (heat) periods using sensor data and machine learning. We designed and deployed a low-cost Bluetooth-based neck collar equipped with accelerometer and gyroscope sensors to capture real-time behavioral data from real cows, which was synced to the cloud. A labeled dataset was created using synchronized CCTV footage to annotate behaviors such as feeding, rumination, lying, and others. We evaluated multiple machine learning models -- Support Vector Machines (SVM), Random Forests (RF), and Convolutional Neural Networks (CNN) -- for behavior classification. Additionally, we implemented a Long Short-Term Memory (LSTM) model for estrus detection using behavioral patterns and anomaly detection. Our system achieved over 93% behavior classification accuracy and 96% estrus detection accuracy on a limited test set. The approach offers a scalable and accessible solution for precision livestock monitoring, especially in resource-constrained environments.
△ Less
Submitted 19 June, 2025;
originally announced June 2025.
-
QForce-RL: Quantized FPGA-Optimized Reinforcement Learning Compute Engine
Authors:
Anushka Jha,
Tanushree Dewangan,
Mukul Lokhande,
Santosh Kumar Vishvakarma
Abstract:
Reinforcement Learning (RL) has outperformed other counterparts in sequential decision-making and dynamic environment control. However, FPGA deployment is significantly resource-expensive, as associated with large number of computations in training agents with high-quality images and possess new challenges. In this work, we propose QForce-RL takes benefits of quantization to enhance throughput and…
▽ More
Reinforcement Learning (RL) has outperformed other counterparts in sequential decision-making and dynamic environment control. However, FPGA deployment is significantly resource-expensive, as associated with large number of computations in training agents with high-quality images and possess new challenges. In this work, we propose QForce-RL takes benefits of quantization to enhance throughput and reduce energy footprint with light-weight RL architecture, without significant performance degradation. QForce-RL takes advantages from E2HRL to reduce overall RL actions to learn desired policy and QuaRL for quantization based SIMD for hardware acceleration. We have also provided detailed analysis for different RL environments, with emphasis on model size, parameters, and accelerated compute ops. The architecture is scalable for resource-constrained devices and provide parametrized efficient deployment with flexibility in latency, throughput, power, and energy efficiency. The proposed QForce-RL provides performance enhancement up to 2.3x and better FPS - 2.6x compared to SoTA works.
△ Less
Submitted 8 June, 2025;
originally announced June 2025.
-
Certified Unlearning for Neural Networks
Authors:
Anastasia Koloskova,
Youssef Allouah,
Animesh Jha,
Rachid Guerraoui,
Sanmi Koyejo
Abstract:
We address the problem of machine unlearning, where the goal is to remove the influence of specific training data from a model upon request, motivated by privacy concerns and regulatory requirements such as the "right to be forgotten." Unfortunately, existing methods rely on restrictive assumptions or lack formal guarantees. To this end, we propose a novel method for certified machine unlearning,…
▽ More
We address the problem of machine unlearning, where the goal is to remove the influence of specific training data from a model upon request, motivated by privacy concerns and regulatory requirements such as the "right to be forgotten." Unfortunately, existing methods rely on restrictive assumptions or lack formal guarantees. To this end, we propose a novel method for certified machine unlearning, leveraging the connection between unlearning and privacy amplification by stochastic post-processing. Our method uses noisy fine-tuning on the retain data, i.e., data that does not need to be removed, to ensure provable unlearning guarantees. This approach requires no assumptions about the underlying loss function, making it broadly applicable across diverse settings. We analyze the theoretical trade-offs in efficiency and accuracy and demonstrate empirically that our method not only achieves formal unlearning guarantees but also performs effectively in practice, outperforming existing baselines. Our code is available at https://github.com/stair-lab/certified-unlearning-neural-networks-icml-2025
△ Less
Submitted 10 June, 2025; v1 submitted 7 June, 2025;
originally announced June 2025.
-
A Sinusoidal Hull-White Model for Interest Rate Dynamics: Capturing Long-Term Periodicity in U.S. Treasury Yields
Authors:
Amit Kumar Jha
Abstract:
This study is motivated by empirical observations of periodic fluctuations in interest rates, notably long-term economic cycles spanning decades, which the conventional Hull-White short-rate model fails to adequately capture. To address this limitation, we propose an extension that incorporates a sinusoidal, time-varying mean reversion speed, allowing the model to reflect cyclic interest rate dyna…
▽ More
This study is motivated by empirical observations of periodic fluctuations in interest rates, notably long-term economic cycles spanning decades, which the conventional Hull-White short-rate model fails to adequately capture. To address this limitation, we propose an extension that incorporates a sinusoidal, time-varying mean reversion speed, allowing the model to reflect cyclic interest rate dynamics more effectively.
The model is calibrated using a comprehensive dataset of daily U.S. Treasury yield curves obtained from the Federal Reserve Economic Data (FRED) database, covering the period from January 1990 to December 2022. The dataset includes tenors of 1, 2, 3, 5, 7, 10, 20, and 30 years, with the most recent yields ranging from 1.22% (1-year) to 2.36% (30-year).
Calibration is performed using the Nelder-Mead optimization algorithm, and Monte Carlo simulations with 200 paths and a time step of 0.05 years. The resulting 30-year zero-coupon bond price under the proposed model is 0.43, compared to 0.47 under the standard Hull-White model. This corresponds to root mean squared errors of 0.12% and 0.14%, respectively, indicating a noticeable improvement in fit, particularly for longer maturities.
These results highlight the model's enhanced capability to capture long-term yield dynamics and suggest significant implications for bond pricing, interest rate risk management, and the valuation of interest rate derivatives. The findings also open avenues for further research into stochastic periodicity and alternative interest rate modeling frameworks.
△ Less
Submitted 27 May, 2025;
originally announced June 2025.
-
Results from the T2K experiment on neutrino mixing including a new far detector $μ$-like sample
Authors:
The T2K Collaboration,
K. Abe,
S. Abe,
R. Akutsu,
H. Alarakia-Charles,
Y. I. Alj Hakim,
S. Alonso Monsalve,
L. Anthony,
S. Aoki,
K. A. Apte,
T. Arai,
T. Arihara,
S. Arimoto,
Y. Ashida,
E. T. Atkin,
N. Babu,
V. Baranov,
G. J. Barker,
G. Barr,
D. Barrow,
P. Bates,
L. Bathe-Peters,
M. Batkiewicz-Kwasniak,
N. Baudis,
V. Berardi
, et al. (380 additional authors not shown)
Abstract:
T2K has made improved measurements of three-flavor neutrino mixing with 19.7(16.3)$\times 10^{20}$ protons on target in (anti-)neutrino-enhanced beam modes. A new sample of muon-neutrino events with tagged pions has been added at the far detector, increasing the neutrino-enhanced muon-neutrino sample size by 42.5%. In addition, new samples have been added at the near detector, and significant impr…
▽ More
T2K has made improved measurements of three-flavor neutrino mixing with 19.7(16.3)$\times 10^{20}$ protons on target in (anti-)neutrino-enhanced beam modes. A new sample of muon-neutrino events with tagged pions has been added at the far detector, increasing the neutrino-enhanced muon-neutrino sample size by 42.5%. In addition, new samples have been added at the near detector, and significant improvements have been made to the flux and neutrino interaction modeling. T2K data continues to prefer the normal mass ordering and upper octant of $\sin^2θ_{23}$ with a near-maximal value of the charge-parity violating phase with best-fit values in the normal ordering of $δ_{\scriptscriptstyle\mathrm{CP}}=-2.18\substack{+1.22 \\ -0.47}$, $\sin^2θ_{23}=0.559\substack{+0.018 \\ -0.078}$ and $Δm^2_{32}=(+2.506\substack{+0.039 \\ -0.052})\times 10^{-3}$ eV$^{2}$.
△ Less
Submitted 10 June, 2025; v1 submitted 6 June, 2025;
originally announced June 2025.
-
Winners vs. Losers: Momentum-based Strategies with Intertemporal Choice for ESG Portfolios
Authors:
Ayush Jha,
Abootaleb Shirvani,
Ali Jaffri,
Svetlozar T. Rachev,
Frank J. Fabozzi
Abstract:
This paper introduces a state-dependent momentum framework that integrates ESG regime switching with tail-risk-aware reward-risk metrics. Using a dynamic programming approach and solving a finite-horizon Bellman equation, we construct long-short momentum portfolios that adjust to changing ESG sentiment regimes. Unlike traditional momentum strategies based on historical returns, our approach incorp…
▽ More
This paper introduces a state-dependent momentum framework that integrates ESG regime switching with tail-risk-aware reward-risk metrics. Using a dynamic programming approach and solving a finite-horizon Bellman equation, we construct long-short momentum portfolios that adjust to changing ESG sentiment regimes. Unlike traditional momentum strategies based on historical returns, our approach incorporates the Stable Tail Adjusted Return ratio and Rachev ratio to better capture downside risk in turbulent markets. We apply this framework across three asset classes, Russell 3000 equities, Dow Jones~30 stocks, and cryptocurrencies, under both pro- and anti-ESG market regimes. We find that ESG-loser portfolios significantly outperform ESG-winner portfolios in pro-ESG regimes, a counterintuitive result suggesting that market overreaction to ESG sentiment creates short-term pricing inefficiencies. This pattern is robust across tail-sensitive performance metrics and is most pronounced under a two-week formation and holding period. Our framework highlights how ESG considerations and sentiment regimes alter return dynamics, offering practical guidance for investors seeking to implement responsive momentum strategies under sustainability constraints. These findings challenge conventional assumptions about ESG investing and underscore the importance of dynamic, regime-aware portfolio construction in environments shaped by regulatory signals, investor flows, and behavioral biases.
△ Less
Submitted 30 May, 2025;
originally announced May 2025.
-
UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification
Authors:
Poojah Ganesan,
Rajat Aayush Jha,
Dan Roth,
Vivek Gupta
Abstract:
Recent advances in large language models (LLMs) have greatly improved Text-to-SQL performance for single-table queries. But, it remains challenging in multi-table databases due to complex schema and relational operations. Existing methods often struggle with retrieving the right tables and columns, generating accurate JOINs and UNIONs, and generalizing across diverse schemas. To address these issu…
▽ More
Recent advances in large language models (LLMs) have greatly improved Text-to-SQL performance for single-table queries. But, it remains challenging in multi-table databases due to complex schema and relational operations. Existing methods often struggle with retrieving the right tables and columns, generating accurate JOINs and UNIONs, and generalizing across diverse schemas. To address these issues, we introduce UNJOIN, a two-stage framework that decouples the retrieval of schema elements from SQL logic generation. In the first stage, we merge the column names of all tables in the database into a single-table representation by prefixing each column with its table name. This allows the model to focus purely on accurate retrieval without being distracted by the need to write complex SQL logic. In the second stage, the SQL query is generated on this simplified schema and mapped back to the original schema by reconstructing JOINs, UNIONs, and relational logic. Evaluations on SPIDER and BIRD datasets show that UNJOIN matches or exceeds the state-of-the-art baselines. UNJOIN uses only schema information, which does not require data access or fine-tuning, making it scalable and adaptable across databases.
△ Less
Submitted 23 May, 2025;
originally announced May 2025.
-
Handloom Design Generation Using Generative Networks
Authors:
Rajat Kanti Bhattacharjee,
Meghali Nandi,
Amrit Jha,
Gunajit Kalita,
Ferdous Ahmed Barbhuiya
Abstract:
This paper proposes deep learning techniques of generating designs for clothing, focused on handloom fabric and discusses the associated challenges along with its application. The capability of generative neural network models in understanding artistic designs and synthesizing those is not yet explored well. In this work, multiple methods are employed incorporating the current state of the art gen…
▽ More
This paper proposes deep learning techniques of generating designs for clothing, focused on handloom fabric and discusses the associated challenges along with its application. The capability of generative neural network models in understanding artistic designs and synthesizing those is not yet explored well. In this work, multiple methods are employed incorporating the current state of the art generative models and style transfer algorithms to study and observe their performance for the task. The results are then evaluated through user score. This work also provides a new dataset NeuralLoom for the task of the design generation.
△ Less
Submitted 20 May, 2025;
originally announced May 2025.
-
Multivariate Affine GARCH with Heavy Tails: A Unified Framework for Portfolio Optimization and Option Valuation
Authors:
Ayush Jha,
Abootaleb Shirvani,
Ali Jaffri,
Svetlozar T. Rachev,
Frank J. Fabozzi
Abstract:
This paper develops and estimates a multivariate affine GARCH(1,1) model with Normal Inverse Gaussian innovations that captures time-varying volatility, heavy tails, and dynamic correlation across asset returns. We generalize the Heston-Nandi framework to a multivariate setting and apply it to 30 Dow Jones Industrial Average stocks. The model jointly supports three core financial applications: dyn…
▽ More
This paper develops and estimates a multivariate affine GARCH(1,1) model with Normal Inverse Gaussian innovations that captures time-varying volatility, heavy tails, and dynamic correlation across asset returns. We generalize the Heston-Nandi framework to a multivariate setting and apply it to 30 Dow Jones Industrial Average stocks. The model jointly supports three core financial applications: dynamic portfolio optimization, wealth path simulation, and option pricing. Closed-form solutions are derived for a Constant Relative Risk Aversion (CRRA) investor's intertemporal asset allocation, and we implement a forward-looking risk-adjusted performance comparison against Merton-style constant strategies. Using the model's conditional volatilities, we also construct implied volatility surfaces for European options, capturing skew and smile features. Empirically, we document substantial wealth-equivalent utility losses from ignoring time-varying correlation and tail risk. These findings underscore the value of a unified econometric framework for analyzing joint asset dynamics and for managing portfolio and derivative exposures under non-Gaussian risks.
△ Less
Submitted 17 May, 2025;
originally announced May 2025.
-
Advancing Remote and Continuous Cardiovascular Patient Monitoring through a Novel and Resource-efficient IoT-Driven Framework
Authors:
Sanam Nayab,
Sohail Raza Chohan,
Aqsa Jameel,
Syed Rehan Shah,
Syed Ahsan Masud Zaidi,
Aditya Nath Jha,
Kamran Siddique
Abstract:
Cardiovascular diseases are a leading cause of fatalities worldwide, often occurring suddenly with limited time for intervention. Current healthcare monitoring systems for cardiac patients rely heavily on hospitalization, which can be impractical for continuous monitoring. This paper presents a novel IoT-based solution for remote, real-time tracking of critical cardiac metrics, addressing the pres…
▽ More
Cardiovascular diseases are a leading cause of fatalities worldwide, often occurring suddenly with limited time for intervention. Current healthcare monitoring systems for cardiac patients rely heavily on hospitalization, which can be impractical for continuous monitoring. This paper presents a novel IoT-based solution for remote, real-time tracking of critical cardiac metrics, addressing the pressing need for accessible and continuous healthcare, particularly for the aging population in Pakistan. The proposed IoT kit measures essential parameters such as body temperature, heart rate (HR), blood pressure (BP), oxygen saturation (SPO2), and electrocardiography (ECG).
A key innovation of the system is its integration with a cloud-based application, enabling constant remote monitoring and incorporating an alarm mechanism to alert medical professionals for timely intervention, reducing the risk of catastrophic incidents. The system was tested in a clinical environment with 20 participants, demonstrating results closely aligned with those obtained using standard medical devices. The findings validate the system's potential for reliable remote monitoring, offering a significant step forward in proactive cardiac healthcare management. This novel approach combines IoT technology with cloud-based applications to provide a cost-effective and efficient solution for reducing unexpected fatalities among cardiac patients.
△ Less
Submitted 6 May, 2025;
originally announced May 2025.
-
Non-universal Impact of Cholesterol on Ionic Liquid-Membrane Interactions
Authors:
J. Gupta,
V. K. Sharma,
P. Hitaishi,
A. K. Jha,
J. B. Mitra,
H. Srinivasan,
S. Kumar,
A. Kumar,
S. K. Ghosh,
S. Mitra
Abstract:
Understanding the role of cholesterol in ionic liquid (IL)-membrane interactions is essential for advancing biomedical applications of ILs, including the development of innovative antimicrobial agents. In this study, we explore the intricate and multifaceted role of cholesterol in modulating IL-membrane interactions, employing a comprehensive suite of biophysical techniques. We systematically exam…
▽ More
Understanding the role of cholesterol in ionic liquid (IL)-membrane interactions is essential for advancing biomedical applications of ILs, including the development of innovative antimicrobial agents. In this study, we explore the intricate and multifaceted role of cholesterol in modulating IL-membrane interactions, employing a comprehensive suite of biophysical techniques. We systematically examine how IL alkyl chain length and membrane physical state influence the impact of cholesterol on IL-lipid membrane interaction. The incorporation of ILs is shown to increase the area per lipid in both pristine dipalmitoylphosphatidylcholine (DPPC) and DPPC-cholesterol membranes. Cholesterol modulates the impact of ILs on lipid conformation, membrane viscoelasticity, and phase behavior. Small-angle neutron scattering and dynamic light scattering measurements reveal that cholesterol mitigates IL-induced structural perturbations in vesicles. Our isothermal titration calorimetry measurements reveal that the presence of cholesterol significantly weakens the binding of ILs to membranes. Intriguingly, despite this reduced binding affinity, cholesterol-containing membranes demonstrate enhanced permeabilization. This counterintuitive effect is attributed to cholesterol's ordering of lipid membranes, which increases susceptibility to stress and defects. Our results underscore the complex and non-universal interplay between lipid composition, IL alkyl chain length, and membrane phase state. These insights provide a deeper understanding of cholesterol's role in IL-membrane interactions, paving the way for the design of advanced applications of ILs in antimicrobial therapy and drug delivery.
△ Less
Submitted 23 May, 2025; v1 submitted 2 May, 2025;
originally announced May 2025.
-
First Measurement of the Electron Neutrino Charged-Current Pion Production Cross Section on Carbon with the T2K Near Detector
Authors:
K. Abe,
S. Abe,
R. Akutsu,
H. Alarakia-Charles,
Y. I. Alj Hakim,
S. Alonso Monsalve,
L. Anthony,
S. Aoki,
K. A. Apte,
T. Arai,
T. Arihara,
S. Arimoto,
E. T. Atkin,
N. Babu,
V. Baranov,
G. J. Barker,
G. Barr,
D. Barrow,
P. Bates,
L. Bathe-Peters,
M. Batkiewicz-Kwasniak,
N. Baudis,
V. Berardi,
L. Berns,
S. Bhattacharjee
, et al. (371 additional authors not shown)
Abstract:
The T2K Collaboration presents the first measurement of electron neutrino-induced charged-current pion production on carbon in a restricted kinematical phase space. This is performed using data from the 2.5$^°$ off-axis near detector, ND280. The differential cross sections with respect to the outgoing electron and pion kinematics, in addition to the total flux-integrated cross section, are obtai…
▽ More
The T2K Collaboration presents the first measurement of electron neutrino-induced charged-current pion production on carbon in a restricted kinematical phase space. This is performed using data from the 2.5$^°$ off-axis near detector, ND280. The differential cross sections with respect to the outgoing electron and pion kinematics, in addition to the total flux-integrated cross section, are obtained. Comparisons between the measured and predicted cross section results using the Neut, Genie and NuWro Monte Carlo event generators are presented. The measured total flux-integrated cross section is [2.52 $\pm$ 0.52 (stat) $\pm$ 0.30 (sys)] x $10^{-39}$ cm$^2$ nucleon$^{-1}$, which is lower than the event generator predictions.
△ Less
Submitted 1 May, 2025;
originally announced May 2025.
-
FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models
Authors:
Mainak Singha,
Subhankar Roy,
Sarthak Mehrotra,
Ankit Jha,
Moloud Abdar,
Biplab Banerjee,
Elisa Ricci
Abstract:
In federated learning, textual prompt tuning adapts Vision-Language Models (e.g., CLIP) by tuning lightweight input tokens (or prompts) on local client data, while keeping network weights frozen. After training, only the prompts are shared by the clients with the central server for aggregation. However, textual prompt tuning suffers from overfitting to known concepts, limiting its generalizability…
▽ More
In federated learning, textual prompt tuning adapts Vision-Language Models (e.g., CLIP) by tuning lightweight input tokens (or prompts) on local client data, while keeping network weights frozen. After training, only the prompts are shared by the clients with the central server for aggregation. However, textual prompt tuning suffers from overfitting to known concepts, limiting its generalizability to unseen concepts. To address this limitation, we propose Multimodal Visual Prompt Tuning (FedMVP) that conditions the prompts on multimodal contextual information - derived from the input image and textual attribute features of a class. At the core of FedMVP is a PromptFormer module that synergistically aligns textual and visual features through a cross-attention mechanism. The dynamically generated multimodal visual prompts are then input to the frozen vision encoder of CLIP, and trained with a combination of CLIP similarity loss and a consistency loss. Extensive evaluation on 20 datasets, spanning three generalization settings, demonstrates that FedMVP not only preserves performance on in-distribution classes and domains, but also displays higher generalizability to unseen classes and domains, surpassing state-of-the-art methods by a notable margin of +1.57% - 2.26%. Code is available at https://github.com/mainaksingha01/FedMVP.
△ Less
Submitted 2 September, 2025; v1 submitted 29 April, 2025;
originally announced April 2025.
-
Probing the quantum speed limit and entanglement in flavor oscillations of neutrino-antineutrino system in curved spacetime
Authors:
Abhishek Kumar Jha,
Mriganka Dutta,
Mayank Pathak,
Subhashish Banerjee,
Banibrata Mukhopadhyay
Abstract:
We consider a spinning primordial black hole (PBH) described by the Kerr metric in Kerr-Schild polar coordinates. We derive an analytical expression for the four-vector gravitational potential in the underlying Hermitian Dirac Hamiltonian using these coordinates. This gravitational potential introduces an axial vector term in the Dirac equation in curved spacetime. We find that the magnitudes of t…
▽ More
We consider a spinning primordial black hole (PBH) described by the Kerr metric in Kerr-Schild polar coordinates. We derive an analytical expression for the four-vector gravitational potential in the underlying Hermitian Dirac Hamiltonian using these coordinates. This gravitational potential introduces an axial vector term in the Dirac equation in curved spacetime. We find that the magnitudes of the temporal and spatial components of the four-vector gravitational potential are significantly affected by the angle of the position vector of the spinor with respect to the spin axis of the PBH, its radial distance from the PBH, and the strength of the specific angular momentum of the PBH. These potentials modify the effective mass matrix of the neutrino-antineutrino system and significantly affect the transition probabilities during the flavor oscillation of the neutrino-antineutrino system. We then use the transition probability to investigate the quantum speed limit time bound ratio for the two-flavor oscillation of the neutrino-antineutrino system in curved spacetime. This helps us estimate how quickly the initial neutrino flavor state evolves over time under the influence of the gravitational field. Finally, we discuss quantum correlations such as entanglement entropy during the two-flavor oscillation of the neutrino-antineutrino system near a spinning PBH.
△ Less
Submitted 1 September, 2025; v1 submitted 28 April, 2025;
originally announced April 2025.
-
Backdoor Defense in Diffusion Models via Spatial Attention Unlearning
Authors:
Abha Jha,
Ashwath Vaithinathan Aravindan,
Matthew Salaway,
Atharva Sandeep Bhide,
Duygu Nur Yaldiz
Abstract:
Text-to-image diffusion models are increasingly vulnerable to backdoor attacks, where malicious modifications to the training data cause the model to generate unintended outputs when specific triggers are present. While classification models have seen extensive development of defense mechanisms, generative models remain largely unprotected due to their high-dimensional output space, which complica…
▽ More
Text-to-image diffusion models are increasingly vulnerable to backdoor attacks, where malicious modifications to the training data cause the model to generate unintended outputs when specific triggers are present. While classification models have seen extensive development of defense mechanisms, generative models remain largely unprotected due to their high-dimensional output space, which complicates the detection and mitigation of subtle perturbations. Defense strategies for diffusion models, in particular, remain under-explored. In this work, we propose Spatial Attention Unlearning (SAU), a novel technique for mitigating backdoor attacks in diffusion models. SAU leverages latent space manipulation and spatial attention mechanisms to isolate and remove the latent representation of backdoor triggers, ensuring precise and efficient removal of malicious effects. We evaluate SAU across various types of backdoor attacks, including pixel-based and style-based triggers, and demonstrate its effectiveness in achieving 100% trigger removal accuracy. Furthermore, SAU achieves a CLIP score of 0.7023, outperforming existing methods while preserving the model's ability to generate high-quality, semantically aligned images. Our results show that SAU is a robust, scalable, and practical solution for securing text-to-image diffusion models against backdoor attacks.
△ Less
Submitted 21 April, 2025;
originally announced April 2025.
-
A detection-task-specific deep-learning method to improve the quality of sparse-view myocardial perfusion SPECT images
Authors:
Zezhang Yang,
Zitong Yu,
Nuri Choi,
Abhinav K. Jha
Abstract:
Myocardial perfusion imaging (MPI) with single-photon emission computed tomography (SPECT) is a widely used and cost-effective diagnostic tool for coronary artery disease. However, the lengthy scanning time in this imaging procedure can cause patient discomfort, motion artifacts, and potentially inaccurate diagnoses due to misalignment between the SPECT scans and the CT-scans which are acquired fo…
▽ More
Myocardial perfusion imaging (MPI) with single-photon emission computed tomography (SPECT) is a widely used and cost-effective diagnostic tool for coronary artery disease. However, the lengthy scanning time in this imaging procedure can cause patient discomfort, motion artifacts, and potentially inaccurate diagnoses due to misalignment between the SPECT scans and the CT-scans which are acquired for attenuation compensation. Reducing projection angles is a potential way to shorten scanning time, but this can adversely impact the quality of the reconstructed images. To address this issue, we propose a detection-task-specific deep-learning method for sparse-view MPI SPECT images. This method integrates an observer loss term that penalizes the loss of anthropomorphic channel features with the goal of improving performance in perfusion defect-detection task. We observed that, on the task of detecting myocardial perfusion defects, the proposed method yielded an area under the receiver operating characteristic (ROC) curve (AUC) significantly larger than the sparse-view protocol. Further, the proposed method was observed to be able to restore the structure of the left ventricle wall, demonstrating ability to overcome sparse-sampling artifacts. Our preliminary results motivate further evaluations of the method.
△ Less
Submitted 22 April, 2025;
originally announced April 2025.
-
An Empirical Study of Python Library Migration Using Large Language Models
Authors:
Md Mohayeminul Islam,
Ajay Kumar Jha,
May Mahmoud,
Ildar Akhmetov,
Sarah Nadi
Abstract:
Library migration is the process of replacing one library with another library that provides similar functionality. Manual library migration is time consuming and error prone, as it requires developers to understand the APIs of both libraries, map them, and perform the necessary code transformations. Large Language Models (LLMs) are shown to be effective at generating and transforming code as well…
▽ More
Library migration is the process of replacing one library with another library that provides similar functionality. Manual library migration is time consuming and error prone, as it requires developers to understand the APIs of both libraries, map them, and perform the necessary code transformations. Large Language Models (LLMs) are shown to be effective at generating and transforming code as well as finding similar code, which are necessary upstream tasks for library migration. Such capabilities suggest that LLMs may be suitable for library migration. Accordingly, this paper investigates the effectiveness of LLMs for migration between Python libraries. We evaluate three LLMs, Llama 3.1, GPT-4o mini, and GPT-4o on PyMigBench, where we migrate 321 real-world library migrations that include 2,989 migration-related code changes. To measure correctness, we (1) compare the LLM's migrated code with the developers' migrated code in the benchmark and (2) run the unit tests available in the client repositories. We find that LLama 3.1, GPT-4o mini, and GPT-4o correctly migrate 89%, 89%, and 94% of the migration-related code changes, respectively. We also find that 36%, 52% and 64% of the LLama 3.1, GPT-4o mini, and GPT-4o migrations pass the same tests that passed in the developer's migration. To ensure the LLMs are not reciting the migrations, we also evaluate them on 10 new repositories where the migration never happened. Overall, our results suggest that LLMs can be effective in migrating code between libraries, but we also identify some open challenges.
△ Less
Submitted 12 October, 2025; v1 submitted 17 April, 2025;
originally announced April 2025.
-
Reasoning Towards Fairness: Mitigating Bias in Language Models through Reasoning-Guided Fine-Tuning
Authors:
Sanchit Kabra,
Akshita Jha,
Chandan K. Reddy
Abstract:
Recent advances in large-scale generative language models have shown that reasoning capabilities can significantly improve model performance across a variety of tasks. However, the impact of reasoning on a model's ability to mitigate stereotypical responses remains largely underexplored. In this work, we investigate the crucial relationship between a model's reasoning ability and fairness, and ask…
▽ More
Recent advances in large-scale generative language models have shown that reasoning capabilities can significantly improve model performance across a variety of tasks. However, the impact of reasoning on a model's ability to mitigate stereotypical responses remains largely underexplored. In this work, we investigate the crucial relationship between a model's reasoning ability and fairness, and ask whether improved reasoning capabilities can mitigate harmful stereotypical responses, especially those arising due to shallow or flawed reasoning. We conduct a comprehensive evaluation of multiple open-source LLMs, and find that larger models with stronger reasoning abilities exhibit substantially lower stereotypical bias on existing fairness benchmarks. Building on this insight, we introduce ReGiFT -- Reasoning Guided Fine-Tuning, a novel approach that extracts structured reasoning traces from advanced reasoning models and infuses them into models that lack such capabilities. We use only general-purpose reasoning and do not require any fairness-specific supervision for bias mitigation. Notably, we see that models fine-tuned using ReGiFT not only improve fairness relative to their non-reasoning counterparts but also outperform advanced reasoning models on fairness benchmarks. We also analyze how variations in the correctness of the reasoning traces and their length influence model fairness and their overall performance. Our findings highlight that enhancing reasoning capabilities is an effective, fairness-agnostic strategy for mitigating stereotypical bias caused by reasoning flaws.
△ Less
Submitted 5 June, 2025; v1 submitted 7 April, 2025;
originally announced April 2025.
-
Redefining Network Topology in Complex Systems: Merging Centrality Metrics, Spectral Theory, and Diffusion Dynamics
Authors:
Arsh Jha
Abstract:
This paper introduces a novel framework that combines traditional centrality measures with eigenvalue spectra and diffusion processes for a more comprehensive analysis of complex networks. While centrality measures such as degree, closeness, and betweenness have been commonly used to assess nodal importance, they provide limited insight into dynamic network behaviors. By incorporating eigenvalue a…
▽ More
This paper introduces a novel framework that combines traditional centrality measures with eigenvalue spectra and diffusion processes for a more comprehensive analysis of complex networks. While centrality measures such as degree, closeness, and betweenness have been commonly used to assess nodal importance, they provide limited insight into dynamic network behaviors. By incorporating eigenvalue analysis, which evaluates network robustness and connectivity through spectral properties, and diffusion processes that model information flow, this framework offers a deeper understanding of how networks function under dynamic conditions. Applied to synthetic networks, the approach identifies key nodes not only by centrality but also by their role in diffusion dynamics and vulnerability points, offering a multi-dimensional view that traditional methods alone cannot. This integrated analysis enables a more precise identification of critical nodes and potential weaknesses, with implications for improving network resilience in fields ranging from epidemiology to cybersecurity. Keywords: Centrality measures, eigenvalue spectra, diffusion processes, network analysis, network robustness, information flow, synthetic networks.
△ Less
Submitted 27 March, 2025;
originally announced March 2025.
-
Anvil: A General-Purpose Timing-Safe Hardware Description Language
Authors:
Jason Zhijingcheng Yu,
Aditya Ranjan Jha,
Umang Mathur,
Trevor E. Carlson,
Prateek Saxena
Abstract:
Expressing hardware designs using hardware description languages (HDLs) routinely involves using stateless signals whose values change according to their underlying registers. Unintended behaviours can arise when the stored values in these underlying registers are mutated while their dependent signals are expected to remain constant across multiple cycles. Such timing hazards are common because, w…
▽ More
Expressing hardware designs using hardware description languages (HDLs) routinely involves using stateless signals whose values change according to their underlying registers. Unintended behaviours can arise when the stored values in these underlying registers are mutated while their dependent signals are expected to remain constant across multiple cycles. Such timing hazards are common because, with a few exceptions, existing HDLs lack abstractions for values that remain unchanged over multiple clock cycles, delegating this responsibility to hardware designers. Designers must then carefully decide whether a value should remain unchanged, sometimes even across hardware modules. This paper proposes Anvil, an HDL which statically prevents timing hazards with a novel type system. Anvil is the only HDL we know of that guarantees timing safety, i.e., absence of timing hazards, without sacrificing expressiveness for cycle-level timing control or dynamic timing behaviours. Unlike many HLS languages that abstract away the differences between registers and signals, Anvil's type system exposes them fully while capturing the timing relationships between register value mutations and signal usages to enforce timing safety. This, in turn, enables safe composition of communicating hardware modules by static enforcement of timing contracts that encode timing constraints on shared signals. Such timing contracts can be specified parametric on abstract time points that can vary during run-time, allowing the type system to statically express dynamic timing behaviour. We have implemented Anvil and successfully used it to implement key timing-sensitive modules, comparing them against open-source SystemVerilog counterparts to demonstrate the practicality and expressiveness of the generated hardware.
△ Less
Submitted 27 October, 2025; v1 submitted 25 March, 2025;
originally announced March 2025.
-
ISIT-GEN: An in silico imaging trial to assess the inter-scanner generalizability of CTLESS for myocardial perfusion SPECT on defect-detection task
Authors:
Zitong Yu,
Nu Ri Choi,
Zezhang Yang,
Nancy A. Obuchowski,
Barry A. Siegel,
Abhinav K. Jha
Abstract:
A recently proposed scatter-window and deep learning-based attenuation compensation (AC) method for myocardial perfusion imaging (MPI) by single-photon emission computed tomography (SPECT), namely CTLESS, demonstrated promising performance on the clinical task of myocardial perfusion defect detection with retrospective data acquired on SPECT scanners from a single vendor. For clinical translation…
▽ More
A recently proposed scatter-window and deep learning-based attenuation compensation (AC) method for myocardial perfusion imaging (MPI) by single-photon emission computed tomography (SPECT), namely CTLESS, demonstrated promising performance on the clinical task of myocardial perfusion defect detection with retrospective data acquired on SPECT scanners from a single vendor. For clinical translation of CTLESS, it is important to assess the generalizability of CTLESS across different SPECT scanners. For this purpose, we conducted a virtual imaging trial, titled in silico imaging trial to assess generalizability (ISIT-GEN). ISIT-GEN assessed the generalizability of CTLESS on the cardiac perfusion defect detection task across SPECT scanners from three different vendors. The performance of CTLESS was compared with a standard-of-care CT-based AC (CTAC) method and a no-attenuation compensation (NAC) method using an anthropomorphic model observer. We observed that CTLESS had receiver operating characteristic (ROC) curves and area under the ROC curves similar to those of CTAC. Further, CTLESS was observed to significantly outperform the NAC method across three scanners. These results are suggestive of the inter-scanner generalizability of CTLESS and motivate further clinical evaluations. The study also highlights the value of using in silico imaging trials to assess the generalizability of deep learning-based AC methods feasibly and rigorously.
△ Less
Submitted 20 March, 2025;
originally announced March 2025.