-
Composer: A Search Framework for Hybrid Neural Architecture Design
Authors:
Bilge Acun,
Prasoon Sinha,
Newsha Ardalani,
Sangmin Bae,
Alicia Golden,
Chien-Yu Lin,
Meghana Madhyastha,
Fei Sun,
Neeraja J. Yadwadkar,
Carole-Jean Wu
Abstract:
Hybrid model architectures that combine computational primitives (e.g., Attention, MLP) in different ratios have shown promising performance beyond Transformers. Some studies have shown that different interleavings of primitives can affect model quality as well. However, prior works explore the hybrid model architecture design space manually. Due to the large design space and training costs, disco…
▽ More
Hybrid model architectures that combine computational primitives (e.g., Attention, MLP) in different ratios have shown promising performance beyond Transformers. Some studies have shown that different interleavings of primitives can affect model quality as well. However, prior works explore the hybrid model architecture design space manually. Due to the large design space and training costs, discovering hybrid models that combine key computational primitives for pre-training is challenging. In this work, we take a principled approach in designing a modular hybrid model architecture search framework -- Composer. Composer explores model architectures at a small scale and extrapolates the top-performing model architectures to a larger scale using our proposed scaling strategies. Using Composer, we discover new hybrid LLM architectures that outperform Llama 3.2. Compared to Llama 3.2 and previous state-of-the-art baselines, the new model architectures consistently reduce validation loss at parameter scales of 350M-3B and improve evaluation accuracy on the downstream tasks by up to 2.8-8.3% (1.1-3.1% on average) while improving both training and inference efficiency.
△ Less
Submitted 30 September, 2025;
originally announced October 2025.
-
Nearly optimal algorithms to learn sparse quantum Hamiltonians in physically motivated distances
Authors:
Amira Abbas,
Nunzia Cerrato,
Francisco Escudero Gutiérrez,
Dmitry Grinko,
Francesco Anna Mele,
Pulkit Sinha
Abstract:
We study the problem of learning Hamiltonians $H$ that are $s$-sparse in the Pauli basis, given access to their time evolution. Although Hamiltonian learning has been extensively investigated, two issues recur in much of the existing literature: the absence of matching lower bounds and the use of mathematically convenient but physically opaque error measures.
We address both challenges by introd…
▽ More
We study the problem of learning Hamiltonians $H$ that are $s$-sparse in the Pauli basis, given access to their time evolution. Although Hamiltonian learning has been extensively investigated, two issues recur in much of the existing literature: the absence of matching lower bounds and the use of mathematically convenient but physically opaque error measures.
We address both challenges by introducing two physically motivated distances between Hamiltonians and designing a nearly optimal algorithm with respect to one of these metrics. The first, time-constrained distance, quantifies distinguishability through dynamical evolution up to a bounded time. The second, temperature-constrained distance, captures distinguishability through thermal states at bounded inverse temperatures.
We show that $s$-sparse Hamiltonians with bounded operator norm can be learned in both distances with $O(s \log(1/ε))$ experiments and $O(s^2/ε)$ evolution time. For the time-constrained distance, we further establish lower bounds of $Ω((s/n)\log(1/ε) + s)$ experiments and $Ω(\sqrt{s}/ε)$ evolution time, demonstrating near-optimality in the number of experiments.
As an intermediate result, we obtain an algorithm that learns every Pauli coefficient of $s$-sparse Hamiltonians up to error $ε$ in $O(s\log(1/ε))$ experiments and $O(s/ε)$ evolution time, improving upon several recent results.
The source of this improvement is a new isolation technique, inspired by the Valiant-Vazirani theorem (STOC'85), which shows that NP is as easy as detecting unique solutions. This isolation technique allows us to query the time evolution of a single Pauli coefficient of a sparse Hamiltonian--even when the Pauli support of the Hamiltonian is unknown--ultimately enabling us to recover the Pauli support itself.
△ Less
Submitted 11 September, 2025;
originally announced September 2025.
-
Scaling of the Electrical Conductivity Spectra Reveals Distinct Transport Responses in A2SmTaO6 [A = Ba, Sr, Ca]
Authors:
Saswata Halder,
Binita Ghosh,
T. P. Sinha
Abstract:
Disorder plays an important role in materials science, influencing material behavior across different length scales. Imperfections like vacancies, atomic substitutions, lattice distortions, and microstructural inhomogeneities, disrupt ideal periodicity thereby altering physical properties. Analogous to spin-glass systems, electrical 'glassiness' arises when charge carriers confront disordered ener…
▽ More
Disorder plays an important role in materials science, influencing material behavior across different length scales. Imperfections like vacancies, atomic substitutions, lattice distortions, and microstructural inhomogeneities, disrupt ideal periodicity thereby altering physical properties. Analogous to spin-glass systems, electrical 'glassiness' arises when charge carriers confront disordered energy landscapes, leading to a broad range of relaxation times, especially in polycrystalline materials where dipoles experience competing exchange interactions. Complex impedance, permittivity, and electric modulus distill out separate resistive and capacitive effects, offering insights into how microstructural inhomogeneities affects conduction mechanism. In polycrystalline double perovskites A2SmTaO6 (A = Ba, Ca), with a power law driven ac conductivity, the hopping and relaxation of carriers is affected by both grains and grain boundaries. Scaling of ac conductivity and impedance response reveals correlation between conduction and relaxation timescales. The inhomogeneities in local energy landscape of 'frustrated' dipoles restrict the 'universality' of conduction mechanism across the bulk length scale.
△ Less
Submitted 29 August, 2025;
originally announced August 2025.
-
BRAIN: Bias-Mitigation Continual Learning Approach to Vision-Brain Understanding
Authors:
Xuan-Bac Nguyen,
Thanh-Dat Truong,
Pawan Sinha,
Khoa Luu
Abstract:
Memory decay makes it harder for the human brain to recognize visual objects and retain details. Consequently, recorded brain signals become weaker, uncertain, and contain poor visual context over time. This paper presents one of the first vision-learning approaches to address this problem. First, we statistically and experimentally demonstrate the existence of inconsistency in brain signals and i…
▽ More
Memory decay makes it harder for the human brain to recognize visual objects and retain details. Consequently, recorded brain signals become weaker, uncertain, and contain poor visual context over time. This paper presents one of the first vision-learning approaches to address this problem. First, we statistically and experimentally demonstrate the existence of inconsistency in brain signals and its impact on the Vision-Brain Understanding (VBU) model. Our findings show that brain signal representations shift over recording sessions, leading to compounding bias, which poses challenges for model learning and degrades performance. Then, we propose a new Bias-Mitigation Continual Learning (BRAIN) approach to address these limitations. In this approach, the model is trained in a continual learning setup and mitigates the growing bias from each learning step. A new loss function named De-bias Contrastive Learning is also introduced to address the bias problem. In addition, to prevent catastrophic forgetting, where the model loses knowledge from previous sessions, the new Angular-based Forgetting Mitigation approach is introduced to preserve learned knowledge in the model. Finally, the empirical experiments demonstrate that our approach achieves State-of-the-Art (SOTA) performance across various benchmarks, surpassing prior and non-continual learning methods.
△ Less
Submitted 25 August, 2025;
originally announced August 2025.
-
MIRAGE: KV Cache Optimization through Parameter Remapping for Multi-tenant LLM Serving
Authors:
Ruihao Li,
Shagnik Pal,
Vineeth Narayan Pullu,
Prasoon Sinha,
Jeeho Ryoo,
Lizy K. John,
Neeraja J. Yadwadkar
Abstract:
KV cache accelerates LLM inference by avoiding redundant computation, at the expense of memory. To support larger KV caches, prior work extends GPU memory with CPU memory via CPU-offloading. This involves swapping KV cache between GPU and CPU memory. However, because the cache updates dynamically, such swapping incurs high CPU memory traffic. We make a key observation that model parameters remain…
▽ More
KV cache accelerates LLM inference by avoiding redundant computation, at the expense of memory. To support larger KV caches, prior work extends GPU memory with CPU memory via CPU-offloading. This involves swapping KV cache between GPU and CPU memory. However, because the cache updates dynamically, such swapping incurs high CPU memory traffic. We make a key observation that model parameters remain constant during runtime, unlike the dynamically updated KV cache. Building on this, we introduce MIRAGE, which avoids KV cache swapping by remapping, and thereby repurposing, the memory allocated to model parameters for KV cache. This parameter remapping is especially beneficial in multi-tenant environments, where the memory used for the parameters of the inactive models can be more aggressively reclaimed. Exploiting the high CPU-GPU bandwidth offered by the modern hardware, such as the NVIDIA Grace Hopper Superchip, we show that MIRAGE significantly outperforms state-of-the-art solutions, achieving a reduction of 44.8%-82.5% in tail time-between-token latency, 20.7%-99.3% in tail time-to-first-token latency, and 6.6%-86.7% higher throughput compared to vLLM.
△ Less
Submitted 15 July, 2025;
originally announced July 2025.
-
Early Warning From Eccentric Compact Binaries: Template Initialization And Sub-dominant Mode Effects
Authors:
Priyanka Sinha,
R. Prasad,
Mukesh Kumar Singh,
Prayush Kumar,
Akash Maurya,
Kaushik Paul
Abstract:
Early warning of gravitational waves (GWs) is essential for multi-messenger observations of binary neutron star and black hole-neutron star merger events. In this study, we investigate early warning prospects from eccentric compact binaries, whose mergers are expected to comprise a significant fraction of detected GW events in the future. Eccentric binaries exhibit oscillatory frequency evolution,…
▽ More
Early warning of gravitational waves (GWs) is essential for multi-messenger observations of binary neutron star and black hole-neutron star merger events. In this study, we investigate early warning prospects from eccentric compact binaries, whose mergers are expected to comprise a significant fraction of detected GW events in the future. Eccentric binaries exhibit oscillatory frequency evolution, causing GW frequencies to recur multiple times through their coalescence. Consequently, generating eccentric waveform templates for early warning requires specification of initial conditions. While the standard approach involves initiating waveform generation when the orbit-averaged frequency enters the detector band, we compare this with an alternative approach that uses the periastron frequency as the starting point. Our analysis shows that initializing at the periastron frequency yields an improved signal-to-noise ratio and sky localization. Additionally, including subdominant modes alongside the dominant $(2,2)$ mode leads to further improvements in sky localization. We explore the parameter space of primary mass $m_1 \in [1.4, 15] \, M_\odot$, spin $χ_1 \in [0, 0.8]$, and eccentricity $e \leq 0.4$ across three detector configurations: O5, Voyager, and 3G. We find that in the O5 (Voyager) configuration, including eccentricity and subdominant modes, the sky localization area can be reduced by $2-80\% (2-85\%)$ at 1000 sq. deg. with increasing eccentricity from $e_5 = 0.1$ to $e_5 = 0.4$, yielding up to $41$ seconds (1 minute) of extra early warning time. For NSBH systems, subdominant modes contribute up to $70$ $(94)\%$ reduction for O5 (Voyager) scenario. In the 3G detector scenario, the sky area reduction due to eccentricity reaches $80\%$ (from $e_{2.5} = 0.1$ to $e_{2.5} = 0.4$) at 100 sq. deg., and subdominant modes enhance the reduction up to $98\%$ for NSBH systems.
△ Less
Submitted 9 July, 2025;
originally announced July 2025.
-
Price equilibria with positive margins in loyal-strategic markets with discrete prices
Authors:
Gurkirat Wadhwa,
Akansh Verma,
Veeraruna Kavitha,
Priyank Sinha
Abstract:
In competitive supply chains (SCs), pricing decisions are crucial, as they directly impact market share and profitability. Traditional SC models often assume continuous pricing for mathematical convenience, overlooking the practical reality of discrete price increments driven by currency constraints. Additionally, customer behavior, influenced by loyalty and strategic considerations, plays a signi…
▽ More
In competitive supply chains (SCs), pricing decisions are crucial, as they directly impact market share and profitability. Traditional SC models often assume continuous pricing for mathematical convenience, overlooking the practical reality of discrete price increments driven by currency constraints. Additionally, customer behavior, influenced by loyalty and strategic considerations, plays a significant role in purchasing decisions. To address these gaps, this study examines a SC model involving one supplier and two manufacturers, incorporating realistic factors such as customer demand segmentation and discrete price setting. Our analysis shows that the Nash equilibria (NE) among manufacturers are not unique, we then discuss the focal equilibrium. Our analysis also reveals that low denomination factors can lead to instability as the corresponding game does not have NE. Numerical simulations demonstrate that even small changes in price increments significantly affect the competitive dynamics and market share distribution.
△ Less
Submitted 5 June, 2025;
originally announced June 2025.
-
Circuit Partitioning Using Large Language Models for Quantum Compilation and Simulations
Authors:
Pranav Sinha,
Sumit Kumar Jha,
Sunny Raj
Abstract:
We are in the midst of the noisy intermediate-scale quantum (NISQ) era, where quantum computers are limited by noisy gates, some of which are more error-prone than others and can render the final computation incomprehensible. Quantum circuit compilation algorithms attempt to minimize these noisy gates when mapping quantum algorithms onto quantum hardware but face computational challenges that rest…
▽ More
We are in the midst of the noisy intermediate-scale quantum (NISQ) era, where quantum computers are limited by noisy gates, some of which are more error-prone than others and can render the final computation incomprehensible. Quantum circuit compilation algorithms attempt to minimize these noisy gates when mapping quantum algorithms onto quantum hardware but face computational challenges that restrict their application to circuits with no more than 5-6 qubits, necessitating the need to partition large circuits before the application of noisy quantum gate minimization algorithms. The existing generation of these algorithms is heuristic in nature and does not account for downstream gate minimization tasks. Large language models (LLMs) have the potential to change this and help improve quantum circuit partitions. This paper investigates the use of LLMs, such as Llama and Mistral, for partitioning quantum circuits by capitalizing on their abilities to understand and generate code, including QASM. Specifically, we teach LLMs to partition circuits using the quick partition approach of the Berkeley Quantum Synthesis Toolkit. Through experimental evaluations, we show that careful fine-tuning of open source LLMs enables us to obtain an accuracy of 53.4% for the partition task while over-the-shelf LLMs are unable to correctly partition circuits, using standard 1-shot and few-shot training approaches.
△ Less
Submitted 12 May, 2025;
originally announced May 2025.
-
Punitive policies to combat misreporting in dynamic supply chains
Authors:
Madhu Dhiman,
Atul Maurya,
Veeraruna Kavitha,
Priyank Sinha
Abstract:
Wholesale price contracts are known to be associated with double marginalization effects, which prevents supply chains from achieving their true market share. In a dynamic setting under information asymmetry, these inefficiencies manifest in the form of misreporting of the market potential by the manufacturer to the supplier, again leading to the loss of market share. We pose the dynamics of inter…
▽ More
Wholesale price contracts are known to be associated with double marginalization effects, which prevents supply chains from achieving their true market share. In a dynamic setting under information asymmetry, these inefficiencies manifest in the form of misreporting of the market potential by the manufacturer to the supplier, again leading to the loss of market share. We pose the dynamics of interaction between the supplier and manufacturer as the Stackelberg game and develop theoretical results for optimal punitive strategies that the supplier can implement to ensure that the manufacturer truthfully reveals the market potential in the single-stage setting. Later, we validate these results through the randomly generated, Monte-Carlo simulation based numerical examples.
△ Less
Submitted 11 May, 2025; v1 submitted 18 April, 2025;
originally announced April 2025.
-
Onset of Constituent Quark Number Scaling in Heavy-Ion Collisions at RHIC
Authors:
STAR Collaboration,
B. E. Aboona,
J. Adam,
L. Adamczyk,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. K. Alshammri,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
X. Bao,
K. Barish,
S. Behera,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (347 additional authors not shown)
Abstract:
Partonic collectivity is one of the necessary signatures for the formation of quark-gluon plasma in high-energy nuclear collisions. Number of constituent quarks (NCQ) scaling has been observed for hadron elliptic flow $v_2$ in top energy nuclear collisions at the Relativistic Heavy Ion Collider and the LHC, and this has been theoretically suggested as strong evidence for partonic collectivity. In…
▽ More
Partonic collectivity is one of the necessary signatures for the formation of quark-gluon plasma in high-energy nuclear collisions. Number of constituent quarks (NCQ) scaling has been observed for hadron elliptic flow $v_2$ in top energy nuclear collisions at the Relativistic Heavy Ion Collider and the LHC, and this has been theoretically suggested as strong evidence for partonic collectivity. In this Letter, a systematic analysis of $v_2$ of $π^{\pm}$, $K^{\pm}$, $K^{0}_{S}$, $p$, and $Λ$ in Au+Au collisions at ${\sqrt{s_{_{\rm{NN}}}}}$ = 3.2, 3.5, 3.9, and 4.5 GeV, with the STAR experiment at the Relativistic Heavy Ion Collider, is presented. NCQ scaling is markedly violated at 3.2 GeV, consistent with a hadronic-interaction dominated equation of state. However, as the collision energy increases, a gradual evolution to NCQ scaling is observed. This beam-energy dependence of $v_2$ for all hadrons studied provides evidence for the onset of dominant partonic interactions by ${\sqrt{s_{_{\rm{NN}}}}}$ = 4.5 GeV.
△ Less
Submitted 11 August, 2025; v1 submitted 2 April, 2025;
originally announced April 2025.
-
Magneto transport of pressure induced flatbands in large angle twisted bilayer graphene
Authors:
Ayan Mondal,
Priyanka Sinha,
Bheema Lingam Chittari
Abstract:
Twisted bilayer graphene (TBG) exhibits flat electronic bands at the so-called magic angle ($\sim 1.1^\circ$), leading to strong electron correlations and emergent quantum phases such as superconductivity and correlated insulating states. However, beyond the magic angle, the band structure generally remains dispersive, diminishing interaction-driven phenomena. In this work, we explore the equivale…
▽ More
Twisted bilayer graphene (TBG) exhibits flat electronic bands at the so-called magic angle ($\sim 1.1^\circ$), leading to strong electron correlations and emergent quantum phases such as superconductivity and correlated insulating states. However, beyond the magic angle, the band structure generally remains dispersive, diminishing interaction-driven phenomena. In this work, we explore the equivalence between pressure-induced flatbands and the magic-angle flatband in large-angle TBG by systematically analyzing the role of interlayer coupling modifications under perpendicular pressure. We show that pressure-induced flatbands exhibit spatial localization similar to magic-angle TBG, with charge density concentrated in the AA-stacked regions. Furthermore, the Hall conductivity and magneto-transport properties under an external magnetic field reveal that these pressure-induced flatbands share key signatures with the quantum Hall response of magic-angle TBG. The obtained Hofstadter spectrum shows four consistent low-energy gaps across all twist angles under pressure, which align with the calculated Hall conductivity plateaus. Our findings suggest that pressure offers an alternative pathway to engineer flat electronic bands and correlated states in TBG, extending the landscape of tunable moiré materials beyond the constraints of the magic angle.
△ Less
Submitted 28 March, 2025;
originally announced March 2025.
-
Stabilizer Ranks, Barnes Wall Lattices and Magic Monotones
Authors:
Amolak Ratan Kalra,
Pulkit Sinha
Abstract:
In 2024, Kliuchnikov and Schönnenbeck showed a connection between the Barnes Wall lattices, stabilizer states and Clifford operations. In this work, we study their results and relate them to the problem of lower bounding stabilizer ranks. We show the first quantitative lower bound on stabilizer fidelity as a function of stabilizer ranks, which reproduces the linear-by-log lower bound for…
▽ More
In 2024, Kliuchnikov and Schönnenbeck showed a connection between the Barnes Wall lattices, stabilizer states and Clifford operations. In this work, we study their results and relate them to the problem of lower bounding stabilizer ranks. We show the first quantitative lower bound on stabilizer fidelity as a function of stabilizer ranks, which reproduces the linear-by-log lower bound for $χ_δ({|{H}\rangle^{ \otimes n}})$, i.e, on the approximate stabilizer rank of $|H\rangle^{\otimes n}$. In fact, we show that the lower bound holds even when the fidelity between the approximation and ${|H\rangle}^{\otimes n}$ is exponentially small, which is currently the best lower bound in this regime.
Next, we define a new magic monotone for pure states, the Barnes Wall norm, and its corresponding approximate variant. We upper bound these monotones by the $CS$-count of state preparation, and also by the stabilizer ranks. In particular, the upper bound given by the $CS$-count is tight, in the sense that we exhibit states that achieve the bound.
Apart from these results, we give a Fidelity Amplification algorithm, which provides a trade-off between approximation error and the stabilizer rank. As a corollary, it gives us a way to compose approximate stabilizer decompositions into approximate decompositions of their tensor products.
Finally, we provide an alternate, elementary proof of the existence and density of product states with maximal stabilizer ranks, which was first proven by Lovitz and Steffan (2022), where they used results from algebraic geometry.
△ Less
Submitted 6 March, 2025;
originally announced March 2025.
-
Uniform Resampling vs. Image Blur: Aliasing Approximation via Isotropic Gaussian Filtering
Authors:
Suayb S. Arslan,
Lukas Vogelsang,
Michal Fux,
Pawan Sinha
Abstract:
One of the key approximations to range simulation is downscaling the image, dictated by the natural trigonometric relationships that arise due to long-distance viewing. It is well-known that standard downsampling applied to an image without prior low-pass filtering leads to a type of signal distortion called \textit{aliasing}. In this study, we aim at modeling the distortion due to aliasing and sh…
▽ More
One of the key approximations to range simulation is downscaling the image, dictated by the natural trigonometric relationships that arise due to long-distance viewing. It is well-known that standard downsampling applied to an image without prior low-pass filtering leads to a type of signal distortion called \textit{aliasing}. In this study, we aim at modeling the distortion due to aliasing and show that a downsampled/upsampled image after an interpolation process can be very well approximated through the application of isotropic Gaussian low-pass filtering to the original image. In other words, the distortion due to aliasing can approximately be generated by low-pass filtering the image with a carefully determined cut-off frequency. We have found that the standard deviation of the isotropic Gaussian kernel $σ$ and the reduction factor $m$ (also called downsampling ratio) satisfy an approximate $m \approx 2 σ$ relationship. We provide both theoretical and practical arguments using two relatively small face datasets (Chicago DB, LRFID) as well as TinyImageNet to corroborate this empirically observed relationship.
△ Less
Submitted 8 May, 2025; v1 submitted 17 February, 2025;
originally announced February 2025.
-
Smoothness of Classical Limit in KMOC Formalism
Authors:
Pritish Sinha
Abstract:
In this paper, we revisit the smoothness of the classical limit of inclusive observables in the formalism developed by Kosower, Maybee and O'Connell (KMOC). Building on the earlier work [1-3], we prove that the classical limit of three classes of inclusive observables, namely scattering angle, radiative field and angular impulse is smooth and does not suffer from any so-called superclassical diver…
▽ More
In this paper, we revisit the smoothness of the classical limit of inclusive observables in the formalism developed by Kosower, Maybee and O'Connell (KMOC). Building on the earlier work [1-3], we prove that the classical limit of three classes of inclusive observables, namely scattering angle, radiative field and angular impulse is smooth and does not suffer from any so-called superclassical divergences at all orders in perturbation. Our analysis goes some way in showing that KMOC formalism can be used to compute classical radiation by simply focussing on all the terms that scale as $\hbar^0$ as all the terms that scale with inverse power of $\hbar$ vanish.
△ Less
Submitted 18 October, 2025; v1 submitted 23 January, 2025;
originally announced January 2025.
-
iServe: An Intent-based Serving System for LLMs
Authors:
Dimitrios Liakopoulos,
Tianrui Hu,
Prasoon Sinha,
Neeraja J. Yadwadkar
Abstract:
Large Language Models (LLMs) are becoming ubiquitous across industries, where applications demand they fulfill diverse user intents. However, developers currently face the challenge of manually exploring numerous deployment configurations - combinations of parallelism and compression techniques that impact resource usage, latency, cost, and accuracy - to meet these intents. Assessing the impact of…
▽ More
Large Language Models (LLMs) are becoming ubiquitous across industries, where applications demand they fulfill diverse user intents. However, developers currently face the challenge of manually exploring numerous deployment configurations - combinations of parallelism and compression techniques that impact resource usage, latency, cost, and accuracy - to meet these intents. Assessing the impact of these configurations on user metrics requires extensive, costly profiling for each model. Existing approaches avoid this expense by using fixed, static configurations, but this often leads to sub-optimal performance and higher costs. Moreover, none of these solutions dynamically adapt to changing user intents to balance latency and cost, effectively. We present iServe, an automated, intent-based system for distributed LLM inference. Instead of manually selecting deployment configurations, developers simply specify their intent - such as minimizing latency, reducing cost, or meeting specific targets for either. iServe introduces fingerprints, lightweight representations of LLMs, to efficiently estimate how different configurations impact latency and memory usage. Based on these insights and GPU availability, iServe dynamically selects the optimal configuration to align with the user's intent. For various LLMs and query arrival rates, iServe best meets user intents compared to state-of-the-art systems by reducing latency by 77.62% and SLO violations by 7.09x while improving GPU throughput by 4.72x. Moreover, iServe's fingerprint-based profiling reduces profiling cost by 6.05x (GPU-hours) compared to baselines.
△ Less
Submitted 8 January, 2025;
originally announced January 2025.
-
Level crossing instabilities in inviscid isothermal compressible Couette flow
Authors:
Govind S. Krishnaswami,
Sonakshi Sachdev,
Pritish Sinha
Abstract:
We study the linear stability of inviscid steady parallel flow of an ideal gas in a channel of finite width. Compressible isothermal two-dimensional monochromatic perturbations are considered. The eigenvalue problem governing density and velocity perturbations is a compressible version of Rayleigh's equation and involves two parameters: a flow Mach number $M$ and the perturbation wavenumber $k$. F…
▽ More
We study the linear stability of inviscid steady parallel flow of an ideal gas in a channel of finite width. Compressible isothermal two-dimensional monochromatic perturbations are considered. The eigenvalue problem governing density and velocity perturbations is a compressible version of Rayleigh's equation and involves two parameters: a flow Mach number $M$ and the perturbation wavenumber $k$. For an odd background velocity profile, there is a $\mathbb{Z}_2 \times \mathbb{Z}_2$ symmetry and growth rates $γ$ come in symmetrically placed 4-tuples in the complex eigenplane. Specializing to uniform background vorticity Couette flow, we find an infinite tower of noninflectional eigenmodes and derive stability theorems and bounds on growth rates. We show that eigenmodes are neutrally stable for small $k$ and small $M$ but that they otherwise display an infinite sequence of stability transitions with increasing $k$ or $M$. Using a search algorithm based on the Fredholm alternative, we find that the transitions are associated to level crossings between neighboring eigenmodes. Repeated level crossings result in windows of instability. For a given eigenmode, they are arranged in a zebra-like striped pattern on the $k$-$M$ plane. A canonical square-root power law form for $γ(k,M)$ in the vicinity of a stability transition is identified. In addition to the discrete spectrum, we find a continuous spectrum of eigenmodes that are always neutrally stable but fail to be smooth across critical layers.
△ Less
Submitted 30 December, 2024;
originally announced December 2024.
-
COBRA: A Continual Learning Approach to Vision-Brain Understanding
Authors:
Xuan-Bac Nguyen,
Manuel Serna-Aguilera,
Arabinda Kumar Choudhary,
Pawan Sinha,
Xin Li,
Khoa Luu
Abstract:
Vision-Brain Understanding (VBU) aims to extract visual information perceived by humans from brain activity recorded through functional Magnetic Resonance Imaging (fMRI). Despite notable advancements in recent years, existing studies in VBU continue to face the challenge of catastrophic forgetting, where models lose knowledge from prior subjects as they adapt to new ones. Addressing continual lear…
▽ More
Vision-Brain Understanding (VBU) aims to extract visual information perceived by humans from brain activity recorded through functional Magnetic Resonance Imaging (fMRI). Despite notable advancements in recent years, existing studies in VBU continue to face the challenge of catastrophic forgetting, where models lose knowledge from prior subjects as they adapt to new ones. Addressing continual learning in this field is, therefore, essential. This paper introduces a novel framework called Continual Learning for Vision-Brain (COBRA) to address continual learning in VBU. Our approach includes three novel modules: a Subject Commonality (SC) module, a Prompt-based Subject Specific (PSS) module, and a transformer-based module for fMRI, denoted as MRIFormer module. The SC module captures shared vision-brain patterns across subjects, preserving this knowledge as the model encounters new subjects, thereby reducing the impact of catastrophic forgetting. On the other hand, the PSS module learns unique vision-brain patterns specific to each subject. Finally, the MRIFormer module contains a transformer encoder and decoder that learns the fMRI features for VBU from common and specific patterns. In a continual learning setup, COBRA is trained in new PSS and MRIFormer modules for new subjects, leaving the modules of previous subjects unaffected. As a result, COBRA effectively addresses catastrophic forgetting and achieves state-of-the-art performance in both continual learning and vision-brain reconstruction tasks, surpassing previous methods.
△ Less
Submitted 6 August, 2025; v1 submitted 25 November, 2024;
originally announced November 2024.
-
Quantum-Brain: Quantum-Inspired Neural Network Approach to Vision-Brain Understanding
Authors:
Hoang-Quan Nguyen,
Xuan-Bac Nguyen,
Hugh Churchill,
Arabinda Kumar Choudhary,
Pawan Sinha,
Samee U. Khan,
Khoa Luu
Abstract:
Vision-brain understanding aims to extract semantic information about brain signals from human perceptions. Existing deep learning methods for vision-brain understanding are usually introduced in a traditional learning paradigm missing the ability to learn the connectivities between brain regions. Meanwhile, the quantum computing theory offers a new paradigm for designing deep learning models. Mot…
▽ More
Vision-brain understanding aims to extract semantic information about brain signals from human perceptions. Existing deep learning methods for vision-brain understanding are usually introduced in a traditional learning paradigm missing the ability to learn the connectivities between brain regions. Meanwhile, the quantum computing theory offers a new paradigm for designing deep learning models. Motivated by the connectivities in the brain signals and the entanglement properties in quantum computing, we propose a novel Quantum-Brain approach, a quantum-inspired neural network, to tackle the vision-brain understanding problem. To compute the connectivity between areas in brain signals, we introduce a new Quantum-Inspired Voxel-Controlling module to learn the impact of a brain voxel on others represented in the Hilbert space. To effectively learn connectivity, a novel Phase-Shifting module is presented to calibrate the value of the brain signals. Finally, we introduce a new Measurement-like Projection module to present the connectivity information from the Hilbert space into the feature space. The proposed approach can learn to find the connectivities between fMRI voxels and enhance the semantic information obtained from human perceptions. Our experimental results on the Natural Scene Dataset benchmarks illustrate the effectiveness of the proposed method with Top-1 accuracies of 95.1% and 95.6% on image and brain retrieval tasks and an Inception score of 95.3% on fMRI-to-image reconstruction task. Our proposed quantum-inspired network brings a potential paradigm to solving the vision-brain problems via the quantum computing theory.
△ Less
Submitted 14 August, 2025; v1 submitted 20 November, 2024;
originally announced November 2024.
-
Dimension Independent and Computationally Efficient Shadow Tomography
Authors:
Pulkit Sinha
Abstract:
We describe a new shadow tomography algorithm that uses $n=Θ(\sqrt{m}\log m/ε^2)$ samples, for $m$ measurements and additive error $ε$, which is independent of the dimension of the quantum state being learned. This stands in contrast to all previously known algorithms that improve upon the naive approach. The sample complexity also has optimal dependence on $ε$. Additionally, this algorithm is eff…
▽ More
We describe a new shadow tomography algorithm that uses $n=Θ(\sqrt{m}\log m/ε^2)$ samples, for $m$ measurements and additive error $ε$, which is independent of the dimension of the quantum state being learned. This stands in contrast to all previously known algorithms that improve upon the naive approach. The sample complexity also has optimal dependence on $ε$. Additionally, this algorithm is efficient in various aspects, including quantum memory usage (possibly even $O(1)$), gate complexity, classical computation, and robustness to qubit measurement noise. It can also be implemented as a read-once quantum circuit with low quantum memory usage, i.e., it will hold only one copy of $ρ$ in memory, and discard it before asking for a new one, with the additional memory needed being $O(m\log n)$. Our approach builds on the idea of using noisy measurements, but instead of focusing on gentleness in trace distance, we focus on the \textit{gentleness in shadows}, i.e., we show that the noisy measurements do not significantly perturb the expected values.
△ Less
Submitted 2 November, 2024;
originally announced November 2024.
-
NP-hardness of testing equivalence to sparse polynomials and to constant-support polynomials
Authors:
Omkar Baraskar,
Agrim Dewan,
Chandan Saha,
Pulkit Sinha
Abstract:
An $s$-sparse polynomial has at most $s$ monomials with nonzero coefficients. The Equivalence Testing problem for sparse polynomials (ETsparse) asks to decide if a given polynomial $f$ is equivalent to (i.e., in the orbit of) some $s$-sparse polynomial. In other words, given $f \in \mathbb{F}[\mathbf{x}]$ and $s \in \mathbb{N}$, ETsparse asks to check if there exist…
▽ More
An $s$-sparse polynomial has at most $s$ monomials with nonzero coefficients. The Equivalence Testing problem for sparse polynomials (ETsparse) asks to decide if a given polynomial $f$ is equivalent to (i.e., in the orbit of) some $s$-sparse polynomial. In other words, given $f \in \mathbb{F}[\mathbf{x}]$ and $s \in \mathbb{N}$, ETsparse asks to check if there exist $A \in \mathrm{GL}(|\mathbf{x}|, \mathbb{F})$ and $\mathbf{b} \in \mathbb{F}^{|\mathbf{x}|}$ such that $f(A\mathbf{x} + \mathbf{b})$ is $s$-sparse. We show that ETsparse is NP-hard over any field $\mathbb{F}$, if $f$ is given in the sparse representation, i.e., as a list of nonzero coefficients and exponent vectors. This answers a question posed in [Gupta-Saha-Thankey, SODA'23] and [Baraskar-Dewan-Saha, STACS'24]. The result implies that the Minimum Circuit Size Problem (MCSP) is NP-hard for a dense subclass of depth-$3$ arithmetic circuits if the input is given in sparse representation. We also show that approximating the smallest $s_0$ such that a given $s$-sparse polynomial $f$ is in the orbit of some $s_0$-sparse polynomial to within a factor of $s^{\frac{1}{3} - ε}$ is NP-hard for any $ε> 0$; observe that $s$-factor approximation is trivial as the input is $s$-sparse. Finally, we show that for any constant $σ\geq 5$, checking if a polynomial (given in sparse representation) is in the orbit of some support-$σ$ polynomial is NP-hard. Support of a polynomial $f$ is the maximum number of variables present in any monomial of $f$. These results are obtained via direct reductions from the $3$-SAT problem.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
Phase error rate estimation in QKD with imperfect detectors
Authors:
Devashish Tupkary,
Shlok Nahar,
Pulkit Sinha,
Norbert Lütkenhaus
Abstract:
We present a finite-size security proof of the decoy-state BB84 QKD protocol against coherent attacks, using entropic uncertainty relations, for imperfect detectors. We apply this result to the case of detectors with imperfectly characterized basis-efficiency mismatch. Our proof works by obtaining a suitable bound on the phase error rate, without requiring any new modifications to the protocol ste…
▽ More
We present a finite-size security proof of the decoy-state BB84 QKD protocol against coherent attacks, using entropic uncertainty relations, for imperfect detectors. We apply this result to the case of detectors with imperfectly characterized basis-efficiency mismatch. Our proof works by obtaining a suitable bound on the phase error rate, without requiring any new modifications to the protocol steps or hardware. It is applicable to imperfectly characterized detectors, and only requires the maximum relative difference in detection efficiencies and dark count rates of the detectors to be characterized. Moreover, our proof allows Eve to choose detector efficiencies and dark count rates in their allowed ranges in each round, thereby addressing an important problem of detector side channels. We prove security in the variable-length framework, where users are allowed to adaptively determine the length of key to be produced, and number of bits to be used for error-correction, based on observations made during the protocol. We quantitatively demonstrate the effect of basis-efficiency mismatch by applying our results to the decoy-state BB84 protocol.
△ Less
Submitted 27 June, 2025; v1 submitted 30 August, 2024;
originally announced August 2024.
-
Configural processing as an optimized strategy for robust object recognition in neural networks
Authors:
Hojin Jang,
Pawan Sinha,
Xavier Boix
Abstract:
Configural processing, the perception of spatial relationships among an object's components, is crucial for object recognition. However, the teleology and underlying neurocomputational mechanisms of such processing are still elusive, notwithstanding decades of research. We hypothesized that processing objects via configural cues provides a more robust means to recognizing them relative to local fe…
▽ More
Configural processing, the perception of spatial relationships among an object's components, is crucial for object recognition. However, the teleology and underlying neurocomputational mechanisms of such processing are still elusive, notwithstanding decades of research. We hypothesized that processing objects via configural cues provides a more robust means to recognizing them relative to local featural cues. We evaluated this hypothesis by devising identification tasks with composite letter stimuli and comparing different neural network models trained with either only local or configural cues available. We found that configural cues yielded more robust performance to geometric transformations such as rotation or scaling. Furthermore, when both features were simultaneously available, configural cues were favored over local featural cues. Layerwise analysis revealed that the sensitivity to configural cues emerged later relative to local feature cues, possibly contributing to the robustness to pixel-level transformations. Notably, this configural processing occurred in a purely feedforward manner, without the need for recurrent computations. Our findings with letter stimuli were successfully extended to naturalistic face images. Thus, our study provides neurocomputational evidence that configural processing emerges in a naïve network based on task contingencies, and is beneficial for robust object processing under varying viewing conditions.
△ Less
Submitted 18 July, 2024;
originally announced July 2024.
-
Strain induced tunable band gap and optical properties of graphene on hexagonal boron nitride
Authors:
Priyanka Sinha,
Prasanta K. Panigrahi,
Bheemalingam Chittari
Abstract:
In this study, we highlight the potential of strain engineering in graphene/hBN (hexagonal Boron nitride) 2D heterostructures, enabling their use as wide-range light absorbers with significant implications for optoelectronic applications. We systematically investigate the electronic and optical properties of graphene/hBN under the application of strain, considering various stacking geometries with…
▽ More
In this study, we highlight the potential of strain engineering in graphene/hBN (hexagonal Boron nitride) 2D heterostructures, enabling their use as wide-range light absorbers with significant implications for optoelectronic applications. We systematically investigate the electronic and optical properties of graphene/hBN under the application of strain, considering various stacking geometries within the framework of density-functional theory. The semimetallic graphene layer upon aligning on the insulating hexagonal boron nitride sheet opens a few tens of meV band gap at the Dirac point due to the induced on-site energy differences on the two sublattices of graphene. Here, we demonstrate that by simultaneously tuning the interlayer distance and lattice constant, this band gap can be significantly increased to 1 eV. Interestingly, in both scenarios (small and large band gaps), the material undergoes a transition from a semiconductor to a semimetallic state. Importantly, the tunability of this band gap is strongly influenced by the specific stacking configuration. We further explored the optical properties across a broad spectrum, revealing that the presence of a strain-induced band gap fundamentally alters how light interacts with the system.
△ Less
Submitted 15 July, 2024;
originally announced July 2024.
-
Real-time Digital RF Emulation -- II: A Near Memory Custom Accelerator
Authors:
Mandovi Mukherjee,
Xiangyu Mao,
Nael Rahman,
Coleman DeLude,
Joe Driscoll,
Sudarshan Sharma,
Payman Behnam,
Uday Kamal,
Jongseok Woo,
Daehyun Kim,
Sharjeel Khan,
Jianming Tong,
Jamin Seo,
Prachi Sinha,
Madhavan Swaminathan,
Tushar Krishna,
Santosh Pande,
Justin Romberg,
Saibal Mukhopadhyay
Abstract:
A near memory hardware accelerator, based on a novel direct path computational model, for real-time emulation of radio frequency systems is demonstrated. Our evaluation of hardware performance uses both application-specific integrated circuits (ASIC) and field programmable gate arrays (FPGA) methodologies: 1). The ASIC testchip implementation, using TSMC 28nm CMOS, leverages distributed autonomous…
▽ More
A near memory hardware accelerator, based on a novel direct path computational model, for real-time emulation of radio frequency systems is demonstrated. Our evaluation of hardware performance uses both application-specific integrated circuits (ASIC) and field programmable gate arrays (FPGA) methodologies: 1). The ASIC testchip implementation, using TSMC 28nm CMOS, leverages distributed autonomous control to extract concurrency in compute as well as low latency. It achieves a $518$ MHz per channel bandwidth in a prototype $4$-node system. The maximum emulation range supported in this paradigm is $9.5$ km with $0.24$ $μ$s of per-sample emulation latency. 2). The FPGA-based implementation, evaluated on a Xilinx ZCU104 board, demonstrates a $9$-node test case (two Transmitters, one Receiver, and $6$ passive reflectors) with an emulation range of $1.13$ km to $27.3$ km at $215$ MHz bandwidth.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
BRACTIVE: A Brain Activation Approach to Human Visual Brain Learning
Authors:
Xuan-Bac Nguyen,
Hojin Jang,
Xin Li,
Samee U. Khan,
Pawan Sinha,
Khoa Luu
Abstract:
The human brain is a highly efficient processing unit, and understanding how it works can inspire new algorithms and architectures in machine learning. In this work, we introduce a novel framework named Brain Activation Network (BRACTIVE), a transformer-based approach to studying the human visual brain. The primary objective of BRACTIVE is to align the visual features of subjects with their corres…
▽ More
The human brain is a highly efficient processing unit, and understanding how it works can inspire new algorithms and architectures in machine learning. In this work, we introduce a novel framework named Brain Activation Network (BRACTIVE), a transformer-based approach to studying the human visual brain. The primary objective of BRACTIVE is to align the visual features of subjects with their corresponding brain representations using functional Magnetic Resonance Imaging (fMRI) signals. It enables us to identify the brain's Regions of Interest (ROIs) in the subjects. Unlike previous brain research methods, which can only identify ROIs for one subject at a time and are limited by the number of subjects, BRACTIVE automatically extends this identification to multiple subjects and ROIs. Our experiments demonstrate that BRACTIVE effectively identifies person-specific regions of interest, such as face and body-selective areas, aligning with neuroscience findings and indicating potential applicability to various object categories. More importantly, we found that leveraging human visual brain activity to guide deep neural networks enhances performance across various benchmarks. It encourages the potential of BRACTIVE in both neuroscience and machine intelligence studies.
△ Less
Submitted 2 September, 2025; v1 submitted 29 May, 2024;
originally announced May 2024.
-
Correlations of event activity with hard and soft processes in $p$ + Au collisions at $\sqrt{s_\mathrm{NN}}$ = 200 GeV at STAR
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (338 additional authors not shown)
Abstract:
With the STAR experiment at the BNL Relativisic Heavy Ion Collider, we characterize $\sqrt{s_\mathrm{NN}}$ = 200 GeV p+Au collisions by event activity (EA) measured within the pseudorapidity range $eta$ $in$ [-5, -3.4] in the Au-going direction and report correlations between this EA and hard- and soft- scale particle production at midrapidity ($η$ $\in$ [-1, 1]). At the soft scale, charged partic…
▽ More
With the STAR experiment at the BNL Relativisic Heavy Ion Collider, we characterize $\sqrt{s_\mathrm{NN}}$ = 200 GeV p+Au collisions by event activity (EA) measured within the pseudorapidity range $eta$ $in$ [-5, -3.4] in the Au-going direction and report correlations between this EA and hard- and soft- scale particle production at midrapidity ($η$ $\in$ [-1, 1]). At the soft scale, charged particle production in low-EA p+Au collisions is comparable to that in p+p collisions and increases monotonically with increasing EA. At the hard scale, we report measurements of high transverse momentum (pT) jets in events of different EAs. In contrast with the soft particle production, high-pT particle production and EA are found to be inversely related. To investigate whether this is a signal of jet quenching in high-EA events, we also report ratios of pT imbalance and azimuthal separation of dijets in high- and low-EA events. Within our measurement precision, no significant differences are observed, disfavoring the presence of jet quenching in the highest 30% EA p+Au collisions at $\sqrt{s_\mathrm{NN}}$ = 200 GeV.
△ Less
Submitted 21 October, 2024; v1 submitted 12 April, 2024;
originally announced April 2024.
-
The Argument for Meta-Modeling-Based Approaches to Hardware Generation Languages
Authors:
Johannes Schreiner,
Daniel Gerl,
Robert Kunzelmann,
Paritosh Kumar Sinha,
Wolfgang Ecker
Abstract:
The rapid evolution of Integrated Circuit (IC) development necessitates innovative methodologies such as code generation to manage complexity and increase productivity. Using the right methodology for generator development to maximize the capability and, most notably, the feasibility of generators is a crucial part of this work. Meta-Modeling-based approaches drawing on the principles of Model Dri…
▽ More
The rapid evolution of Integrated Circuit (IC) development necessitates innovative methodologies such as code generation to manage complexity and increase productivity. Using the right methodology for generator development to maximize the capability and, most notably, the feasibility of generators is a crucial part of this work. Meta-Modeling-based approaches drawing on the principles of Model Driven Architecture (MDA) are a promising methodology for generator development. The goal of this paper is to show why such an MDA-based approach can provide extremely powerful generators with minimal implementation effort and to demonstrate that this approach is a superior alternative to the most advanced hardware generation languages such as SpinalHDL and Chisel. For this purpose, this paper provides an in-depth comparison of the Meta-Modeling approach against these hardware generation languages, highlighting the unique advantages of a Meta-Modeling-based approach and summarizes the benefits.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Proper vs Improper Quantum PAC learning
Authors:
Ashwin Nayak,
Pulkit Sinha
Abstract:
A basic question in the PAC model of learning is whether proper learning is harder than improper learning. In the classical case, there are examples of concept classes with VC dimension $d$ that have sample complexity $Ω\left(\frac dε\log\frac1ε\right)$ for proper learning with error $ε$, while the complexity for improper learning is O$\!\left(\frac dε\right)$. One such example arises from the Cou…
▽ More
A basic question in the PAC model of learning is whether proper learning is harder than improper learning. In the classical case, there are examples of concept classes with VC dimension $d$ that have sample complexity $Ω\left(\frac dε\log\frac1ε\right)$ for proper learning with error $ε$, while the complexity for improper learning is O$\!\left(\frac dε\right)$. One such example arises from the Coupon Collector problem.
Motivated by the efficiency of proper versus improper learning with quantum samples, Arunachalam, Belovs, Childs, Kothari, Rosmanis, and de Wolf (TQC 2020) studied an analogue, the Quantum Coupon Collector problem. Curiously, they discovered that for learning size $k$ subsets of $[n]$ the problem has sample complexity $Θ(k\log\min\{k,n-k+1\})$, in contrast with the complexity of $Θ(k\log k)$ for Coupon Collector. This effectively negates the possibility of a separation between the two modes of learning via the quantum problem, and Arunachalam et al.\ posed the possibility of such a separation as an open question.
In this work, we first present an algorithm for the Quantum Coupon Collector problem with sample complexity that matches the sharper lower bound of $(1-o_k(1))k\ln\min\{k,n-k+1\}$ shown recently by Bab Hadiashar, Nayak, and Sinha (IEEE TIT 2024), for the entire range of the parameter $k$. Next, we devise a variant of the problem, the Quantum Padded Coupon Collector. We prove that its sample complexity matches that of the classical Coupon Collector problem for both modes of learning, thereby exhibiting the same asymptotic separation between proper and improper quantum learning as mentioned above. The techniques we develop in the process can be directly applied to any form of padded quantum data. We hope that padding can more generally lift other forms of classical learning behaviour to the quantum setting.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Shabari: Delayed Decision-Making for Faster and Efficient Serverless Functions
Authors:
Prasoon Sinha,
Kostis Kaffes,
Neeraja J. Yadwadkar
Abstract:
Serverless computing relieves developers from the burden of resource management, thus providing ease-of-use to the users and the opportunity to optimize resource utilization for the providers. However, today's serverless systems lack performance guarantees for function invocations, thus limiting support for performance-critical applications: we observed severe performance variability (up to 6x). P…
▽ More
Serverless computing relieves developers from the burden of resource management, thus providing ease-of-use to the users and the opportunity to optimize resource utilization for the providers. However, today's serverless systems lack performance guarantees for function invocations, thus limiting support for performance-critical applications: we observed severe performance variability (up to 6x). Providers lack visibility into user functions and hence find it challenging to right-size them: we observed heavy resource underutilization (up to 80%). To understand the causes behind the performance variability and underutilization, we conducted a measurement study of commonly deployed serverless functions and learned that the function performance and resource utilization depend crucially on function semantics and inputs. Our key insight is to delay making resource allocation decisions until after the function inputs are available. We introduce Shabari, a resource management framework for serverless systems that makes decisions as late as possible to right-size each invocation to meet functions' performance objectives (SLOs) and improve resource utilization. Shabari uses an online learning agent to right-size each function invocation based on the features of the function input and makes cold-start-aware scheduling decisions. For a range of serverless functions and inputs, Shabari reduces SLO violations by 11-73% while not wasting any vCPUs and reducing wasted memory by 64-94% in the median case, compared to state-of-the-art systems, including Aquatope, Parrotfish, and Cypress.
△ Less
Submitted 25 January, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
Measurement of flow coefficients in high-multiplicity $p$+Au, $d$+Au and $^{3}$He$+$Au collisions at $\sqrt{s_{_{\mathrm{NN}}}}$=200 GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (343 additional authors not shown)
Abstract:
Flow coefficients ($v_2$ and $v_3$) are measured in high-multiplicity $p$+Au, $d$+Au, and $^{3}$He$+$Au collisions at a center-of-mass energy of $\sqrt{s_{_{\mathrm{NN}}}}$ = 200 GeV using the STAR detector. The measurements utilize two-particle correlations with a pseudorapidity requirement of $|η| <$ 0.9 and a pair gap of $|Δη|>1.0$. The primary focus is on analysis methods, particularly the sub…
▽ More
Flow coefficients ($v_2$ and $v_3$) are measured in high-multiplicity $p$+Au, $d$+Au, and $^{3}$He$+$Au collisions at a center-of-mass energy of $\sqrt{s_{_{\mathrm{NN}}}}$ = 200 GeV using the STAR detector. The measurements utilize two-particle correlations with a pseudorapidity requirement of $|η| <$ 0.9 and a pair gap of $|Δη|>1.0$. The primary focus is on analysis methods, particularly the subtraction of non-flow contributions. Four established non-flow subtraction methods are applied to determine $v_n$, validated using the HIJING event generator. $v_n$ values are compared across the three collision systems at similar multiplicities; this comparison cancels the final state effects and isolates the impact of initial geometry. While $v_2$ values show differences among these collision systems, $v_3$ values are largely similar, consistent with expectations of subnucleon fluctuations in the initial geometry. The ordering of $v_n$ differs quantitatively from previous measurements using two-particle correlations with a larger rapidity gap, which, according to model calculations, can be partially attributed to the effects of longitudinal flow decorrelations. The prospects for future measurements to improve our understanding of flow decorrelation and subnucleonic fluctuations are also discussed.
△ Less
Submitted 6 November, 2024; v1 submitted 12 December, 2023;
originally announced December 2023.
-
Poisson Geometric Formulation of Quantum Mechanics
Authors:
Pritish Sinha,
Ankit Yadav
Abstract:
We study the Poisson geometrical formulation of quantum mechanics for finite dimensional mixed and pure states. Equivalently, we show that quantum mechanics can be understood in the language of classical mechanics. We review the symplectic structure of the Hilbert space and identify its canonical coordinates. We extend the geometric picture to the space of density matrices $D_N^+$. We find it is n…
▽ More
We study the Poisson geometrical formulation of quantum mechanics for finite dimensional mixed and pure states. Equivalently, we show that quantum mechanics can be understood in the language of classical mechanics. We review the symplectic structure of the Hilbert space and identify its canonical coordinates. We extend the geometric picture to the space of density matrices $D_N^+$. We find it is not symplectic but admits a linear $\mathfrak{su}(N)$ Poisson structure. We identify Casimir surfaces of $D_N^+$ and show that the space of pure states $P_N \equiv \mathbb{C}P^{N-1}$ is one of its symplectic submanifolds which is an intersection of primitive Casimirs. We identify generic symplectic submanifolds of $D_N^+$ and calculate their dimensions. We find that $D_N^+$ is singularly foliated by the symplectic leaves of varying dimensions, also known as coadjoint orbits. We also find an ascending chain of Poisson submanifolds $D_N^M \subset D_N^{M+1}$ for $ 1 \leq M \leq N-1$. Each such Poisson submanifold $D_N^M$ is obtained by tracing out the $\mathbb{C}^M$ states from the bipartite system $\mathbb{C}^N \times \mathbb{C}^M$ and is an intersection of $N-M$ primitive Casimirs of $D_N^+$. Their Poisson structure is induced from the symplectic structure of the bipartite system. We also show their foliations. Finally, we study the positive semi-definite geometry of the symplectic submanifold $E_N^M$ consisting of the mixed states with maximum entropy in $D_N^M$.
△ Less
Submitted 3 June, 2024; v1 submitted 9 December, 2023;
originally announced December 2023.
-
Brainformer: Mimic Human Visual Brain Functions to Machine Vision Models via fMRI
Authors:
Xuan-Bac Nguyen,
Xin Li,
Pawan Sinha,
Samee U. Khan,
Khoa Luu
Abstract:
Human perception plays a vital role in forming beliefs and understanding reality. A deeper understanding of brain functionality will lead to the development of novel deep neural networks. In this work, we introduce a novel framework named Brainformer, a straightforward yet effective Transformer-based framework, to analyze Functional Magnetic Resonance Imaging (fMRI) patterns in the human perceptio…
▽ More
Human perception plays a vital role in forming beliefs and understanding reality. A deeper understanding of brain functionality will lead to the development of novel deep neural networks. In this work, we introduce a novel framework named Brainformer, a straightforward yet effective Transformer-based framework, to analyze Functional Magnetic Resonance Imaging (fMRI) patterns in the human perception system from a machine-learning perspective. Specifically, we present the Multi-scale fMRI Transformer to explore brain activity patterns through fMRI signals. This architecture includes a simple yet efficient module for high-dimensional fMRI signal encoding and incorporates a novel embedding technique called 3D Voxels Embedding. Secondly, drawing inspiration from the functionality of the brain's Region of Interest, we introduce a novel loss function called Brain fMRI Guidance Loss. This loss function mimics brain activity patterns from these regions in the deep neural network using fMRI data. This work introduces a prospective approach to transferring knowledge from human perception to neural networks. Our experiments demonstrate that leveraging fMRI information allows the machine vision model to achieve results comparable to State-of-the-Art methods in various image recognition tasks.
△ Less
Submitted 26 November, 2024; v1 submitted 30 November, 2023;
originally announced December 2023.
-
Production of Protons and Light Nuclei in Au+Au Collisions at $\sqrt{s_{\mathrm{NN}}}$ = 3 GeV with the STAR Detector
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (342 additional authors not shown)
Abstract:
We report the systematic measurement of protons and light nuclei production in Au+Au collisions at $\sqrt{s_{\mathrm{NN}}}$ = 3 GeV by the STAR experiment at the Relativistic Heavy Ion Collider (RHIC). The transverse momentum ($p_{T}$) spectra of protons ($p$), deuterons ($d$), tritons ($t$), $^{3}\mathrm{He}$, and $^{4}\mathrm{He}$ are measured from mid-rapidity to target rapidity for different c…
▽ More
We report the systematic measurement of protons and light nuclei production in Au+Au collisions at $\sqrt{s_{\mathrm{NN}}}$ = 3 GeV by the STAR experiment at the Relativistic Heavy Ion Collider (RHIC). The transverse momentum ($p_{T}$) spectra of protons ($p$), deuterons ($d$), tritons ($t$), $^{3}\mathrm{He}$, and $^{4}\mathrm{He}$ are measured from mid-rapidity to target rapidity for different collision centralities. We present the rapidity and centrality dependence of particle yields ($dN/dy$), average transverse momentum ($\langle p_{T}\rangle$), yield ratios ($d/p$, $t/p$,$^{3}\mathrm{He}/p$, $^{4}\mathrm{He}/p$), as well as the coalescence parameters ($B_2$, $B_3$). The 4$π$ yields for various particles are determined by utilizing the measured rapidity distributions, $dN/dy$. Furthermore, we present the energy, centrality, and rapidity dependence of the compound yield ratios ($N_{p} \times N_{t} / N_{d}^{2}$) and compare them with various model calculations. The physics implications of those results on the production mechanism of light nuclei and on QCD phase structure are discussed.
△ Less
Submitted 23 October, 2024; v1 submitted 18 November, 2023;
originally announced November 2023.
-
Magnetotransport properties of a twisted bilayer graphene in the presence of external electric and magnetic field
Authors:
Priyanka Sinha,
Ayan Mondal,
Simão Meneses João,
Bheema Lingam Chittari
Abstract:
We extensively investigate the electronic and transport properties of a twisted bilayer graphene when subjected to both an external perpendicular electric field and a magnetic field. Using a basic tight-binding model, we show the flat electronic band properties as well as the density of states (DOS), both without and with the applied electric field. In the presence of an electric field, the degene…
▽ More
We extensively investigate the electronic and transport properties of a twisted bilayer graphene when subjected to both an external perpendicular electric field and a magnetic field. Using a basic tight-binding model, we show the flat electronic band properties as well as the density of states (DOS), both without and with the applied electric field. In the presence of an electric field, the degeneracy at the Dirac points is lifted where the non-monotonic behavior of the energy gap exists, especially for twist angles below 3$^\circ$. We also study the behavior of the Landau levels (LL) spectra for different twist angles within a very low energy range. These LL spectra get modified under the influence of the external electric field. Moreover, we calculate the dc Hall conductivity ($σ_{xy}$) for a very large system using the Kernel Polynomial Method (KPM). Interestingly, $σ_{xy}$ makes a transition from a half-integer to an integer quantum Hall effect, \textit{i.e.} the value of $σ_{xy}$ shifts from $\pm 4(n+1/2) (2e^2/h)$ ($n$ is an integer) to $\pm 2n (2e^2/h)$ around a small twist angle of $θ=2.005^\circ$. At this angle, $σ_{xy}$ acquires a Hall plateau at zero Fermi energy. However, the behavior of $σ_{xy}$ remains unaltered when the system is exposed to the electric field, particularly at the magic angle where the bands in both layers can hybridize and strong interlayer coupling plays a crucial role.
△ Less
Submitted 10 November, 2023;
originally announced November 2023.
-
Measurements of charged-particle multiplicity dependence of higher-order net-proton cumulants in $p$+$p$ collisions at $\sqrt{s} =$ 200 GeV from STAR at RHIC
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (338 additional authors not shown)
Abstract:
We report on the charged-particle multiplicity dependence of net-proton cumulant ratios up to sixth order from $\sqrt{s}=200$ GeV $p$+$p$ collisions at the Relativistic Heavy Ion Collider (RHIC). The measured ratios $C_{4}/C_{2}$, $C_{5}/C_{1}$, and $C_{6}/C_{2}$ decrease with increased charged-particle multiplicity and rapidity acceptance. Neither the Skellam baselines nor PYTHIA8 calculations ac…
▽ More
We report on the charged-particle multiplicity dependence of net-proton cumulant ratios up to sixth order from $\sqrt{s}=200$ GeV $p$+$p$ collisions at the Relativistic Heavy Ion Collider (RHIC). The measured ratios $C_{4}/C_{2}$, $C_{5}/C_{1}$, and $C_{6}/C_{2}$ decrease with increased charged-particle multiplicity and rapidity acceptance. Neither the Skellam baselines nor PYTHIA8 calculations account for the observed multiplicity dependence. In addition, the ratios $C_{5}/C_{1}$ and $C_{6}/C_{2}$ approach negative values in the highest-multiplicity events, which implies that thermalized QCD matter may be formed in $p$+$p$ collisions.
△ Less
Submitted 4 September, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Estimate of Background Baseline and Upper Limit on the Chiral Magnetic Effect in Isobar Collisions at $\sqrt{s_{\text{NN}}}=200$ GeV at the Relativistic Heavy-Ion Collider
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
E. Alpatov,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (333 additional authors not shown)
Abstract:
For the search of the chiral magnetic effect (CME), STAR previously presented the results from isobar collisions (${^{96}_{44}\text{Ru}}+{^{96}_{44}\text{Ru}}$, ${^{96}_{40}\text{Zr}}+{^{96}_{40}\text{Zr}}$) obtained through a blind analysis. The ratio of results in Ru+Ru to Zr+Zr collisions for the CME-sensitive charge-dependent azimuthal correlator ($Δγ$), normalized by elliptic anisotropy (…
▽ More
For the search of the chiral magnetic effect (CME), STAR previously presented the results from isobar collisions (${^{96}_{44}\text{Ru}}+{^{96}_{44}\text{Ru}}$, ${^{96}_{40}\text{Zr}}+{^{96}_{40}\text{Zr}}$) obtained through a blind analysis. The ratio of results in Ru+Ru to Zr+Zr collisions for the CME-sensitive charge-dependent azimuthal correlator ($Δγ$), normalized by elliptic anisotropy ($v_{2}$), was observed to be close to but systematically larger than the inverse multiplicity ratio. The background baseline for the isobar ratio, $Y = \frac{(Δγ/v_{2})^{\text{Ru}}}{(Δγ/v_{2})^{\text{Zr}}}$, is naively expected to be $\frac{(1/N)^{\text{Ru}}}{(1/N)^{\text{Zr}}}$; however, genuine two- and three-particle correlations are expected to alter it. We estimate the contributions to $Y$ from those correlations, utilizing both the isobar data and HIJING simulations. After including those contributions, we arrive at a final background baseline for $Y$, which is consistent with the isobar data. We extract an upper limit for the CME fraction in the $Δγ$ measurement of approximately $10\%$ at a $95\%$ confidence level on in isobar collisions at $\sqrt{s_{\text{NN}}} = 200$ GeV, with an expected $15\%$ difference in their squared magnetic fields.
△ Less
Submitted 17 July, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Observation of the Antimatter Hypernucleus $^4_{\barΛ}\overline{\hbox{H}}$
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (342 additional authors not shown)
Abstract:
At the origin of the Universe, asymmetry between the amount of created matter and antimatter led to the matter-dominated Universe as we know today. The origins of this asymmetry remain not completely understood yet. High-energy nuclear collisions create conditions similar to the Universe microseconds after the Big Bang, with comparable amounts of matter and antimatter. Much of the created antimatt…
▽ More
At the origin of the Universe, asymmetry between the amount of created matter and antimatter led to the matter-dominated Universe as we know today. The origins of this asymmetry remain not completely understood yet. High-energy nuclear collisions create conditions similar to the Universe microseconds after the Big Bang, with comparable amounts of matter and antimatter. Much of the created antimatter escapes the rapidly expanding fireball without annihilating, making such collisions an effective experimental tool to create heavy antimatter nuclear objects and study their properties, hoping to shed some light on existing questions on the asymmetry between matter and antimatter. Here we report the first observation of the antimatter hypernucleus \hbox{$^4_{\barΛ}\overline{\hbox{H}}$}, composed of a $\barΛ$ , an antiproton and two antineutrons. The discovery was made through its two-body decay after production in ultrarelativistic heavy-ion collisions by the STAR experiment at the Relativistic Heavy Ion Collider. In total, 15.6 candidate \hbox{$^4_{\barΛ}\overline{\hbox{H}}$} antimatter hypernuclei are obtained with an estimated background count of 6.4. The lifetimes of the antihypernuclei \hbox{$^3_{\barΛ}\overline{\hbox{H}}$} and \hbox{$^4_{\barΛ}\overline{\hbox{H}}$} are measured and compared with the lifetimes of their corresponding hypernuclei, testing the symmetry between matter and antimatter. Various production yield ratios among (anti)hypernuclei and (anti)nuclei are also measured and compared with theoretical model predictions, shedding light on their production mechanisms.
△ Less
Submitted 8 June, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
CNN-based automatic segmentation of Lumen & Media boundaries in IVUS images using closed polygonal chains
Authors:
Pavel Sinha,
Ioannis Psaromiligkos,
Zeljko Zilic
Abstract:
We propose an automatic segmentation method for lumen and media with irregular contours in IntraVascular ultra-sound (IVUS) images. In contrast to most approaches that broadly label each pixel as either lumen, media, or background, we propose to approximate the lumen and media contours by closed polygonal chains. The chain vertices are placed at fixed angles obtained by dividing the entire 360\deg…
▽ More
We propose an automatic segmentation method for lumen and media with irregular contours in IntraVascular ultra-sound (IVUS) images. In contrast to most approaches that broadly label each pixel as either lumen, media, or background, we propose to approximate the lumen and media contours by closed polygonal chains. The chain vertices are placed at fixed angles obtained by dividing the entire 360\degree~angular space into equally spaced angles, and we predict their radius using an adaptive-subband-decomposition CNN. We consider two loss functions during training. The first is a novel loss function using the Jaccard Measure (JM) to quantify the similarities between the predicted lumen and media segments and the corresponding ground-truth image segments. The second loss function is the traditional Mean Squared Error. The proposed architecture significantly reduces computational costs by replacing the popular auto-encoder structure with a simple CNN as the encoder and the decoder is reduced to simply joining the consecutive predicted points. We evaluated our network on the publicly available IVUS-Challenge-2011 dataset using two performance metrics, namely JM and Hausdorff Distance (HD). The evaluation results show that our proposed network mostly outperforms the state-of-the-art lumen and media segmentation methods.
△ Less
Submitted 29 September, 2023;
originally announced September 2023.
-
Deep Representation Learning for Prediction of Temporal Event Sets in the Continuous Time Domain
Authors:
Parag Dutta,
Kawin Mayilvaghanan,
Pratyaksha Sinha,
Ambedkar Dukkipati
Abstract:
Temporal Point Processes (TPP) play an important role in predicting or forecasting events. Although these problems have been studied extensively, predicting multiple simultaneously occurring events can be challenging. For instance, more often than not, a patient gets admitted to a hospital with multiple conditions at a time. Similarly people buy more than one stock and multiple news breaks out at…
▽ More
Temporal Point Processes (TPP) play an important role in predicting or forecasting events. Although these problems have been studied extensively, predicting multiple simultaneously occurring events can be challenging. For instance, more often than not, a patient gets admitted to a hospital with multiple conditions at a time. Similarly people buy more than one stock and multiple news breaks out at the same time. Moreover, these events do not occur at discrete time intervals, and forecasting event sets in the continuous time domain remains an open problem. Naive approaches for extending the existing TPP models for solving this problem lead to dealing with an exponentially large number of events or ignoring set dependencies among events. In this work, we propose a scalable and efficient approach based on TPPs to solve this problem. Our proposed approach incorporates contextual event embeddings, temporal information, and domain features to model the temporal event sets. We demonstrate the effectiveness of our approach through extensive experiments on multiple datasets, showing that our model outperforms existing methods in terms of prediction metrics and computational efficiency. To the best of our knowledge, this is the first work that solves the problem of predicting event set intensities in the continuous time domain by using TPPs.
△ Less
Submitted 29 September, 2023;
originally announced September 2023.
-
Results on Elastic Cross Sections in Proton-Proton Collisions at $\sqrt{s} = 510$ GeV with the STAR Detector at RHIC
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (343 additional authors not shown)
Abstract:
We report results on an elastic cross section measurement in proton-proton collisions at a center-of-mass energy $\sqrt{s}=510$ GeV, obtained with the Roman Pot setup of the STAR experiment at the Relativistic Heavy Ion Collider (RHIC). The elastic differential cross section is measured in the four-momentum transfer squared range $0.23 \leq -t \leq 0.67$ GeV$^2$. We find that a constant slope $B$…
▽ More
We report results on an elastic cross section measurement in proton-proton collisions at a center-of-mass energy $\sqrt{s}=510$ GeV, obtained with the Roman Pot setup of the STAR experiment at the Relativistic Heavy Ion Collider (RHIC). The elastic differential cross section is measured in the four-momentum transfer squared range $0.23 \leq -t \leq 0.67$ GeV$^2$. We find that a constant slope $B$ does not fit the data in the aforementioned $t$ range, and we obtain a much better fit using a second-order polynomial for $B(t)$. The $t$ dependence of $B$ is determined using six subintervals of $t$ in the STAR measured $t$ range, and is in good agreement with the phenomenological models. The measured elastic differential cross section $\mathrm{d}σ/\mathrm{dt}$ agrees well with the results obtained at $\sqrt{s} = 546$ GeV for proton--antiproton collisions by the UA4 experiment. We also determine that the integrated elastic cross section within the STAR $t$-range is $σ^\mathrm{fid}_\mathrm{el} = 462.1 \pm 0.9 (\mathrm{stat.}) \pm 1.1 (\mathrm {syst.}) \pm 11.6 (\mathrm {scale})$~$μ\mathrm{b}$.
△ Less
Submitted 6 May, 2024; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Longitudinal and transverse spin transfer to $Λ$ and $\overlineΛ$ hyperons in polarized $p$+$p$ collisions at $\sqrt{s} = 200$ GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
D. M. Anderson,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai
, et al. (357 additional authors not shown)
Abstract:
The longitudinal and transverse spin transfers to $Λ$ ($\overlineΛ$) hyperons in polarized proton-proton collisions are expected to be sensitive to the helicity and transversity distributions, respectively, of (anti-)strange quarks in the proton, and to the corresponding polarized fragmentation functions. We report improved measurements of the longitudinal spin transfer coefficient, $D_{LL}$, and…
▽ More
The longitudinal and transverse spin transfers to $Λ$ ($\overlineΛ$) hyperons in polarized proton-proton collisions are expected to be sensitive to the helicity and transversity distributions, respectively, of (anti-)strange quarks in the proton, and to the corresponding polarized fragmentation functions. We report improved measurements of the longitudinal spin transfer coefficient, $D_{LL}$, and the transverse spin transfer coefficient, $D_{TT}$, to $Λ$ and $\overlineΛ$ in polarized proton-proton collisions at $\sqrt{s}$ = 200 GeV by the STAR experiment at RHIC. The data set includes longitudinally polarized proton-proton collisions with an integrated luminosity of 52 pb$^{-1}$, and transversely polarized proton-proton collisions with a similar integrated luminosity. Both data sets have about twice the statistics of previous results and cover a kinematic range of $|η_{Λ(\overlineΛ)}|$ $<$ 1.2 and transverse momentum $p_{T,{Λ(\overlineΛ)}}$ up to 8 GeV/$c$. We also report the first measurements of the hyperon spin transfer coefficients $D_{LL}$ and $D_{TT}$ as a function of the fractional jet momentum $z$ carried by the hyperon, which can provide more direct constraints on the polarized fragmentation functions.
△ Less
Submitted 7 December, 2023; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Reaction plane correlated triangular flow in Au+Au collisions at $\sqrt{s_{NN}}=3$ GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (341 additional authors not shown)
Abstract:
We measure triangular flow relative to the reaction plane at 3 GeV center-of-mass energy in Au+Au collisions at the BNL Relativistic Heavy Ion Collider. A significant $v_3$ signal for protons is observed, which increases for higher rapidity, higher transverse momentum, and more peripheral collisions. The triangular flow is essentially rapidity-odd with a slope at mid-rapidity, $dv_3/dy|_{(y=0)}$,…
▽ More
We measure triangular flow relative to the reaction plane at 3 GeV center-of-mass energy in Au+Au collisions at the BNL Relativistic Heavy Ion Collider. A significant $v_3$ signal for protons is observed, which increases for higher rapidity, higher transverse momentum, and more peripheral collisions. The triangular flow is essentially rapidity-odd with a slope at mid-rapidity, $dv_3/dy|_{(y=0)}$, opposite in sign compared to the slope for directed flow. No significant $v_3$ signal is observed for charged pions and kaons. Comparisons with models suggest that a mean field potential is required to describe these results, and that the triangular shape of the participant nucleons is the result of stopping and nuclear geometry.
△ Less
Submitted 19 April, 2024; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Upper Limit on the Chiral Magnetic Effect in Isobar Collisions at the Relativistic Heavy-Ion Collider
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
E. Alpatov,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (333 additional authors not shown)
Abstract:
The chiral magnetic effect (CME) is a phenomenon that arises from the QCD anomaly in the presence of an external magnetic field. The experimental search for its evidence has been one of the key goals of the physics program of the Relativistic Heavy-Ion Collider. The STAR collaboration has previously presented the results of a blind analysis of isobar collisions (…
▽ More
The chiral magnetic effect (CME) is a phenomenon that arises from the QCD anomaly in the presence of an external magnetic field. The experimental search for its evidence has been one of the key goals of the physics program of the Relativistic Heavy-Ion Collider. The STAR collaboration has previously presented the results of a blind analysis of isobar collisions (${^{96}_{44}\text{Ru}}+{^{96}_{44}\text{Ru}}$, ${^{96}_{40}\text{Zr}}+{^{96}_{40}\text{Zr}}$) in the search for the CME. The isobar ratio ($Y$) of CME-sensitive observable, charge separation scaled by elliptic anisotropy, is close to but systematically larger than the inverse multiplicity ratio, the naive background baseline. This indicates the potential existence of a CME signal and the presence of remaining nonflow background due to two- and three-particle correlations, which are different between the isobars. In this post-blind analysis, we estimate the contributions from those nonflow correlations as a background baseline to $Y$, utilizing the isobar data as well as Heavy Ion Jet Interaction Generator simulations. This baseline is found consistent with the isobar ratio measurement, and an upper limit of 10% at 95% confidence level is extracted for the CME fraction in the charge separation measurement in isobar collisions at $\sqrt{s_{\rm NN}}=200$ GeV.
△ Less
Submitted 17 July, 2024; v1 submitted 31 August, 2023;
originally announced August 2023.
-
Probing nuclear structure using elliptic flow of strange and multi-strange hadrons in isobar collisions
Authors:
Priyanshi Sinha
Abstract:
Isobar collisions, $^{96}_{44}$Ru+$^{96}_{44}$Ru and $^{96}_{40}$Zr+$^{96}_{40}$Zr, at $\sqrt{s_{\mathrm {NN}}}$ = 200 GeV have been performed at RHIC in order to study the charge separation along the magnetic field, called the Chiral Magnetic Effect (CME). The difference in nuclear deformation and structure between the two isobar nuclei may result in a difference in the flow magnitudes. Hence, el…
▽ More
Isobar collisions, $^{96}_{44}$Ru+$^{96}_{44}$Ru and $^{96}_{40}$Zr+$^{96}_{40}$Zr, at $\sqrt{s_{\mathrm {NN}}}$ = 200 GeV have been performed at RHIC in order to study the charge separation along the magnetic field, called the Chiral Magnetic Effect (CME). The difference in nuclear deformation and structure between the two isobar nuclei may result in a difference in the flow magnitudes. Hence, elliptic flow measurements for these collisions give direct information about the initial state anisotropies. Strange and multi-strange hadrons have a small hadronic cross-section compared to light hadrons, making them an excellent probe for understanding the initial state anisotropies of the medium produced in these isobar collisions. The collected datasets include approximately two billion events for each of the isobar species and provide a unique opportunity for statistics hungry measurements. In this proceeding, we will report the elliptic flow ($v_{2}$) measurement of $K_{s}^{0}$, $Λ$, $\overlineΛ$, $φ$, $Ξ^{-}$, $\overlineΞ^{+}$, $Ω^{-}$, and $\overlineΩ^{+}$ at mid-rapidity for Ru+Ru and Zr+Zr collisions at $\sqrt{s_{\mathrm {NN}}}$ = 200 GeV. The centrality and transverse momentum ($p_{T}$) dependence of $v_{2}$ of (multi-)strange hadrons will be shown. System size dependence of $v_{2}$ will be shown by comparing the $v_{2}$ results obtained from Cu+Cu, Au+Au, and U+U collisions. The number of constituent quark (NCQ) scaling for these strange hadrons will also be tested. We will also compare the $p_{T}$-integrated $v_{2}$ for these two isobar collisions. Transport model calculations will be compared to data to provide further quantitative constraints on the nuclear structure.
△ Less
Submitted 11 August, 2023;
originally announced August 2023.
-
Jet-hadron correlations with respect to the event plane in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions in STAR
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai,
H. Caines
, et al. (340 additional authors not shown)
Abstract:
Angular distributions of charged particles relative to jet axes are studied in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions as a function of the jet orientation with respect to the event plane. This differential study tests the expected path-length dependence of energy loss experienced by a hard-scattered parton as it traverses the hot and dense medium formed in heavy-ion collisions. A seco…
▽ More
Angular distributions of charged particles relative to jet axes are studied in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions as a function of the jet orientation with respect to the event plane. This differential study tests the expected path-length dependence of energy loss experienced by a hard-scattered parton as it traverses the hot and dense medium formed in heavy-ion collisions. A second-order event plane is used in the analysis as an experimental estimate of the reaction plane formed by the collision impact parameter and the beam direction. Charged-particle jets with $15 < p_{\rm T, jet} <$ 20 and $20 < p_{\rm T, jet} <$ 40 GeV/$c$ were reconstructed with the anti-$k_{\rm T}$ algorithm with radius parameter setting of (R=0.4) in the 20-50\% centrality bin to maximize the initial-state eccentricity of the interaction region. The reaction plane fit method is implemented to remove the flow-modulated background with better precision than prior methods. Yields and widths of jet-associated charged-hadron distributions are extracted in three angular bins between the jet axis and the event plane. The event-plane (EP) dependence is further quantified by ratios of the associated yields in different EP bins. No dependence on orientation of the jet axis with respect to the event plane is seen within the uncertainties in the kinematic regime studied. This finding is consistent with a similar experimental observation by ALICE in $\sqrt{s_{\mathrm{NN}}}$ = 2.76 TeV Pb+Pb collision data.
△ Less
Submitted 20 March, 2024; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Exploring the Electrical Transport Properties and Insulator-Metal Transition in Polycrystalline Pr$_2$MgZrO$_6$: Insights from Conductivity and Impedance Spectroscopy
Authors:
Moumin Rudra,
T. P. Sinha
Abstract:
The ac electrical transport properties of polycrystalline Pr$_2$MgZrO$_6$ (PMZ) have been investigated using conductivity and impedance spectroscopic techniques. The crystal structure of PMZ has been determined to be monoclinic through a combination of X-ray diffraction and Raman spectroscopic studies. Ag mode in the Raman spectra has been identified as the breathing vibration of the ZrO6 octahedr…
▽ More
The ac electrical transport properties of polycrystalline Pr$_2$MgZrO$_6$ (PMZ) have been investigated using conductivity and impedance spectroscopic techniques. The crystal structure of PMZ has been determined to be monoclinic through a combination of X-ray diffraction and Raman spectroscopic studies. Ag mode in the Raman spectra has been identified as the breathing vibration of the ZrO6 octahedra. The ac conductivity spectra of PMZ exhibit distinct characteristics at different temperature ranges. At lower temperatures (less than 420 K), the spectra are fitted using a double power law, indicating the involvement of multiple microstructural features. On the other hand, at higher temperatures (greater than 460 K), the spectra follow Jonscher's law, suggesting a simpler conduction mechanism. Through the analysis of conductivity, permittivity, and impedance, an insulator-metal transition has been observed around 452 K. This transition signifies a significant change in the electrical properties of PMZ and provides valuable insights into its conductive nature.
△ Less
Submitted 1 July, 2023;
originally announced July 2023.
-
Phase transition in oxygen-intercalated pseudocapacitor Pr$_2$MgZrO$_6$ electrode: A combined structural and conductivity analysis
Authors:
Moumin Rudra,
S. Saha,
T. P. Sinha
Abstract:
The phase transition behavior and charge storage mechanism of Pr$_2$MgZrO$_6$ (PMM), an oxygen-intercalated pseudocapacitor, were investigated through crystal structure analysis, Raman spectroscopy, ac conductivity spectroscopy, X ray photoelectron spectroscopy, and electrochemical spectroscopy. The crystal structure analysis and vibration studies revealed a phase transition in PMM, following the…
▽ More
The phase transition behavior and charge storage mechanism of Pr$_2$MgZrO$_6$ (PMM), an oxygen-intercalated pseudocapacitor, were investigated through crystal structure analysis, Raman spectroscopy, ac conductivity spectroscopy, X ray photoelectron spectroscopy, and electrochemical spectroscopy. The crystal structure analysis and vibration studies revealed a phase transition in PMM, following the sequence 14 to 87 to 225. High temperature Raman spectroscopy demonstrated a significant feature of a monoclinic to tetragonal phase transformation in PMM. The ac conductivity spectroscopy exhibited a semiconductor to metal transition in PMM. X-ray photoelectron spectroscopy of the Mn 2p state confirmed the presence of oxygen vacancies in PMM at room temperature. Furthermore, the electrochemical performance of PMM as an electrode was evaluated. The PMM electrode displayed an intercalated pseudocapacitive nature, exhibiting a maximum possible specific capacitance of 257.57 F. The charge storage process of the PMM electrode was thoroughly reviewed and discussed, shedding light on the underlying mechanisms.
△ Less
Submitted 1 July, 2023;
originally announced July 2023.
-
Is Grad-CAM Explainable in Medical Images?
Authors:
Subhashis Suara,
Aayush Jha,
Pratik Sinha,
Arif Ahmed Sekh
Abstract:
Explainable Deep Learning has gained significant attention in the field of artificial intelligence (AI), particularly in domains such as medical imaging, where accurate and interpretable machine learning models are crucial for effective diagnosis and treatment planning. Grad-CAM is a baseline that highlights the most critical regions of an image used in a deep learning model's decision-making proc…
▽ More
Explainable Deep Learning has gained significant attention in the field of artificial intelligence (AI), particularly in domains such as medical imaging, where accurate and interpretable machine learning models are crucial for effective diagnosis and treatment planning. Grad-CAM is a baseline that highlights the most critical regions of an image used in a deep learning model's decision-making process, increasing interpretability and trust in the results. It is applied in many computer vision (CV) tasks such as classification and explanation. This study explores the principles of Explainable Deep Learning and its relevance to medical imaging, discusses various explainability techniques and their limitations, and examines medical imaging applications of Grad-CAM. The findings highlight the potential of Explainable Deep Learning and Grad-CAM in improving the accuracy and interpretability of deep learning models in medical imaging. The code is available in (will be available).
△ Less
Submitted 19 July, 2023;
originally announced July 2023.
-
A Structurally Regularized CNN Architecture via Adaptive Subband Decomposition
Authors:
Pavel Sinha,
Ioannis Psaromiligkos,
Zeljko Zilic
Abstract:
We propose a generalized convolutional neural network (CNN) architecture that first decomposes the input signal into subbands by an adaptive filter bank structure, and then uses convolutional layers to extract features from each subband independently. Fully connected layers finally combine the extracted features to perform classification. The proposed architecture restrains each of the subband CNN…
▽ More
We propose a generalized convolutional neural network (CNN) architecture that first decomposes the input signal into subbands by an adaptive filter bank structure, and then uses convolutional layers to extract features from each subband independently. Fully connected layers finally combine the extracted features to perform classification. The proposed architecture restrains each of the subband CNNs from learning using the entire input signal spectrum, resulting in structural regularization. Our proposed CNN architecture is fully compatible with the end-to-end learning mechanism of typical CNN architectures and learns the subband decomposition from the input dataset. We show that the proposed CNN architecture has attractive properties, such as robustness to input and weight-and-bias quantization noise, compared to regular full-band CNN architectures. Importantly, the proposed architecture significantly reduces computational costs, while maintaining state-of-the-art classification accuracy.
Experiments on image classification tasks using the MNIST, CIFAR-10/100, Caltech-101, and ImageNet-2012 datasets show that the proposed architecture allows accuracy surpassing state-of-the-art results. On the ImageNet-2012 dataset, we achieved top-5 and top-1 validation set accuracy of 86.91% and 69.73%, respectively. Notably, the proposed architecture offers over 90% reduction in computation cost in the inference path and approximately 75% reduction in back-propagation (per iteration) with just a single-layer subband decomposition. With a 2-layer subband decomposition, the computational gains are even more significant with comparable accuracy results to the single-layer decomposition.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Effect of nuclear structure on particle production in relativistic heavy-ion collisions using the AMPT model
Authors:
P. Sinha,
V. Bairathi,
K. Gopal,
C. Jena,
S. Kabana
Abstract:
We report first study of transverse momentum ($p_\mathrm{T}$) spectra for $π^{\pm}$, $K^{\pm}$, $p$, and $\bar{p}$ in isobar, $^{96}_{44}$Ru+$^{96}_{44}$Ru and $^{96}_{40}$Zr+$^{96}_{40}$Zr, collisions at $\sqrt{s_{\mathrm{NN}}} = 200$ GeV using a multi-phase transport (AMPT) model. Particle yields ($dN/dy$), average transverse momenta ($\langle p_\mathrm{T} \rangle$), and particle ratios are repo…
▽ More
We report first study of transverse momentum ($p_\mathrm{T}$) spectra for $π^{\pm}$, $K^{\pm}$, $p$, and $\bar{p}$ in isobar, $^{96}_{44}$Ru+$^{96}_{44}$Ru and $^{96}_{40}$Zr+$^{96}_{40}$Zr, collisions at $\sqrt{s_{\mathrm{NN}}} = 200$ GeV using a multi-phase transport (AMPT) model. Particle yields ($dN/dy$), average transverse momenta ($\langle p_\mathrm{T} \rangle$), and particle ratios are reported in various collision systems with different parameterizations of the Woods-Saxon (WS) distribution. We observed a maximum difference of 5% in the particle yields in peripheral collisions when we included a quadrupole and octupole deformation and a nuclear size difference between the isobars. The $π^{-}$/$π^{+}$ ratio is smaller in Ru+Ru collisions compared to Zr+Zr collisions indicating an effect of isospin due to difference in number of protons and neutrons between the two nuclei. The $K^{-}$/$K^{+}$ ratio is same in both the systems indicating the dominance of the pair production mechanism in the kaon production. The $\bar{p}/p$ ratio is further smaller in Ru+Ru collisions than Zr+Zr collisions, indicating the effect of baryon stopping in addition to the isospin effect. A system size dependence is observed in $dN/dy$ and $\langle p_\mathrm{T} \rangle$ when we compare the results from isobar collisions with Au+Au and U+U collisions.
△ Less
Submitted 9 July, 2024; v1 submitted 23 May, 2023;
originally announced May 2023.