Search | arXiv e-print repository

doi 10.1103/PhysRevA.110.042423

Quantum channels that destroy negative conditional entropy

Authors: PV Srinidhi, Indranil Chakrabarty, Samyadeb Bhattacharya, Nirman Ganguly

Abstract: Counter-intuitive to classical notions, quantum conditional entropy can be negative, playing a pivotal role in information-processing tasks. This article delves deeply into quantum channels, emphasizing negative conditional entropy breaking channels (NCEB) and introducing negative conditional entropy annihilating channels (NCEA). We characterize these channels from both topological and information-theoretic perspectives, examining their properties when combined serially and NCEB in parallel. Our exploration extends to complementary channels associated with NCEB, leading to the introduction of information-leaking channels. Utilizing the parameters of the standard depolarizing channel, we provide tangible examples and further characterization. We demonstrate the relationship of NCEB and NCEA with newly introduced channels like coherent information breaking (CIB) and mutual information breaking (MIB), along with standard channels like zero capacity channels. Preservation of quantum resources is an integral constituent of quantum information theory. Recognizing this, we lay prescriptions to detect channels that do not break the negativity of conditional entropy, ensuring the conservation of this quantum resource.

Submitted 4 November, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

Comments: 14 pages, 5 figures

Journal ref: Phys. Rev. A 110 (2024), 042423
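
To make the notion of negative conditional entropy in the abstract above concrete, here is a minimal sketch. It is not taken from the paper. It assumes a two-qubit isotropic state rho_p = p |Phi+><Phi+| + (1 - p) I/4, which is what a depolarizing channel of the assumed form D_p(rho) = p rho + (1 - p) I/2 produces when applied to one half of a maximally entangled pair. For this family the joint spectrum is known in closed form and the marginal is maximally mixed, so S(A|B) = S(AB) - S(B) can be evaluated with the standard library alone. The function names are hypothetical.

```python
from math import log2


def shannon_entropy(probs):
    """Entropy in bits of a probability vector; zero entries contribute nothing."""
    return -sum(q * log2(q) for q in probs if q > 0.0)


def isotropic_conditional_entropy(p):
    """S(A|B) = S(AB) - S(B) for rho_p = p |Phi+><Phi+| + (1 - p) I/4, 0 <= p <= 1.

    Assumption: the eigenvalues of rho_p are (1 + 3p)/4 (once) and (1 - p)/4
    (three times), and the reduced state on B is I/2.
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie in [0, 1]")
    joint_spectrum = [(1 + 3 * p) / 4] + [(1 - p) / 4] * 3
    marginal_spectrum = [0.5, 0.5]
    return shannon_entropy(joint_spectrum) - shannon_entropy(marginal_spectrum)


if __name__ == "__main__":
    for p in (0.0, 0.5, 0.9, 1.0):
        print(f"p = {p:.1f}  S(A|B) = {isotropic_conditional_entropy(p):+.3f} bits")
```

In this toy family the sign of S(A|B) flips somewhere between p = 0.5 and p = 0.9. Deciding whether a channel is NCEB in the paper's sense would require examining all input states, which this sketch does not attempt.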

arXiv:2311.14787 [pdf, other]

doi 10.1088/1361-648X/ad5d42

Rashba splitting in polar-nonpolar sandwich heterostructure: A DFT Study

Authors: Sanchari Bhattacharya, Sanjoy Datta

Abstract: In this study, we employ density functional theory (DFT) based first-principles calculations to investigate the spin-orbit effects in the electronic structure of a polar-nonpolar sandwich heterostructure, namely LAO$_{2.5}$/STO$_{5.5}$/LAO$_{2.5}$. Our focus on the Ti-3d bands reveals an inverted ordering of the STO-$\rm t_{2g}$ orbitals near the n-type interface, consistent with earlier experimental work. In contrast, toward the p-type interface, the orbital ordering aligns with the natural ordering of STO orbitals, influenced by crystal field splitting. Interestingly, we have found a strong inter-orbital coupling between the $t_{2g}$ and $e_g$ orbitals, which has not been reported earlier in $\rm SrTiO_3$ based 2D systems. Additionally, our observations highlight that the cubic Rashba splitting in this system surpasses the linear Rashba splitting, contrary to experimental findings. This comprehensive analysis contributes to a refined understanding of the role of orbital mixing in Rashba splitting in the sandwich oxide heterostructures.

Submitted 24 November, 2023; originally announced November 2023.

Journal ref: J. Phys.: Condens. Matter 36 (2024) 405701 (10pp)

arXiv:2311.12046 [pdf, other]

LATIS: Lambda Abstraction-based Thermal Image Super-resolution

Authors: Gargi Panda, Soumitra Kundu, Saumik Bhattacharya, Aurobinda Routray

Abstract: Single image super-resolution (SISR) is an effective technique to improve the quality of low-resolution thermal images. Recently, transformer-based methods have achieved significant performance in SISR. However, in the SR task, only a small number of pixels are involved in the transformer's self-attention (SA) mechanism due to the computational complexity of the attention mechanism. The lambda abstraction is a promising alternative to SA in modeling long-range interactions while being computationally more efficient. This paper presents lambda abstraction-based thermal image super-resolution (LATIS), a novel lightweight architecture for SISR of thermal images. LATIS sequentially captures local and global information using the local and global feature block (LGFB). In LGFB, we introduce a global feature extraction (GFE) module based on the lambda abstraction mechanism, and a channel-shuffle and convolution (CSConv) layer to encode local context. Besides, to improve the performance further, we propose a differentiable patch-wise histogram-based loss function. Experimental results demonstrate that our LATIS, with the least model parameters and complexity, achieves performance better than or comparable to state-of-the-art methods across multiple datasets.

Submitted 17 November, 2023; originally announced November 2023.

arXiv:2311.11434 [pdf, ps, other]

Low spin spectroscopy of neutron-rich 43,44,45Cl via β and βn decay

Authors: V. Tripathi, S. Bhattacharya, E. Rubino, C. Benetti, J. F. Perello, S. L. Tabor, S. N. Liddick, P. C. Bender, M. P. Carpenter, J. J. Carroll, A. Chester, C. J. Chiara, K. Childers, B. R. Clark, B. P. Crider, J. T. Harke, R. Jain, B. Longfellow, S. Luitel, M. Mogannam, T. H. Ogunbeku, A. L. Richard, S. Saha, N. Shimizu, O. A. Shehu , et al. (5 additional authors not shown)

Abstract: β decay of the neutron-rich isotopes 43,45S, studied at the National Superconducting Cyclotron Laboratory, is reported here. β-delayed γ transitions were detected by an array of 16 clover detectors surrounding the Beta Counting Station, which consists of a 40x40 Double Sided Silicon Strip Detector followed by a Single Sided Silicon Strip Detector. β decay half-lives have been extracted for 43,45S by correlating implants and decays in the pixelated implant detector with further coincidence with γ transitions in the daughter nucleus. The level structure of 43,45Cl is expanded by the addition of 20 new γ transitions in 43Cl and 8 in 45Cl with the observation of core excited negative-parity states for the first time. For 45S decay, a large fraction of the β decay strength goes to delayed neutron emission populating states in 44Cl, which are also presented. Comparison of experimental observations is made to detailed shell-model calculations using the SDPFSDG-MU interaction to highlight the role of the diminished N = 28 neutron shell gap and the near degeneracy of the proton s1/2 and d3/2 orbitals on the structure of the neutron-rich Cl isotopes. The current work also provides further support to a ground state spin-parity assignment of 3/2+ in 45Cl.

Submitted 19 November, 2023; originally announced November 2023.

arXiv:2311.09713 [pdf, other]

doi 10.1103/PhysRevC.107.064318

Search for the origin of wobbling motion in the $ A \approx 130 $ region: The case of $^{131}$Xe

Authors: S. Chakraborty, S. Bhattacharyya, R. Banik, Soumik Bhattacharya, G. Mukherjee, C. Bhattacharya, S. Biswas, S. Rajbanshi, Shabir Dar, S. Nandi, Sajad Ali, S. Chatterjee, S. Das, S. Das Gupta, S. S. Ghugre, A. Goswami, A. Lemasson, Debasish Mondal, S. Mukhopadhyay, A. Navin, H. Pai, Surajit Pal, Deepak Pandit, R. Raut, Prithwijita Ray , et al. (2 additional authors not shown)

Abstract: In-beam $γ$-ray spectroscopy of $^{131}$Xe has been carried out to study the structure of the intruder $νh_{11/2}$ band. Excited states were populated via an $α$-induced fusion-evaporation reaction at E$_α = 38$ MeV. Inspection of $γγ$-coincidence data resulted in the identification of a new rotational sequence. Based on the systematics of excitation energy, assigned spin-parity, decay pattern, and the electromagnetic character of the inter-band $ΔI = 1$ $γ$-transitions, this sequence is proposed as the unfavoured signature partner of the $νh_{11/2}$ band. The structure of this band is further illuminated in the light of the triaxial particle rotor model (TPRM). The possibility of wobbling excitation in $N = 77$ Xe-Ba-Ce isotones has been explored in a systematic manner.

Submitted 16 November, 2023; originally announced November 2023.

Journal ref: Phys. Rev. C 107, 064318 (2023)

arXiv:2311.08781 [pdf, other]

Measurement of the Hoyle State Radiative Transition Width

Authors: T. K. Rana, Deepak Pandit, S. Manna, S. Kundu, K. Banerjee, A. Sen, R. Pandey, G. Mukherjee, T. K. Ghosh, S. S. Nayak, R. Shil, P. Karmakar, K. Atreya, K. Rani, D. Paul, Rajkumar Santra, A. Sultana, S. Basu, S. Pal, S. Sadhukhan, Debasish Mondal, S. Mukhopadhyay, Srijit Bhattacharya, Surajit Pal, Pankaj Pant , et al. (8 additional authors not shown)

Abstract: The radiative decay of the Hoyle state is the doorway to the production of heavier elements in stellar environments. Here we report an exclusive measurement of electric quadrupole (E$_2$) transitions of the Hoyle state to the ground state of $^{12}$C through the $^{12}$C(p, p$^\prime$$γ$$γ$)$^{12}$C reaction. Triple coincidence measurement yields a value of radiative branching ratio $Γ_{rad}$/$Γ$ = 4.01 (30) $\times$ 10$^{-4}$. The result has been corroborated by an independent experiment based on the complete kinematical measurement via the $^{12}$C(p, p$^\prime$)$^{12}$C reaction ($Γ_{rad}$/$Γ$ = 4.04 (30) $\times$ 10$^{-4}$). Using our results together with the currently adopted values of $Γ_π$(E$_0$)/$Γ$ and $Γ_π$($E_0$), the radiative width of the Hoyle state is found to be 3.75 (40) $\times$ 10$^{-3}$ eV. We emphasize here that our result is not in agreement with the 34$\%$ increase in the radiative decay width of the Hoyle state measured recently but is consistent with the currently adopted value.

Submitted 26 November, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

Comments: 7 pages, 5 figures

Journal ref: Phys. Lett. B 859 (2024) 139083
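
The radiative width quoted in the abstract above is obtained by combining the measured branching ratio with two adopted quantities. The relation below is a sketch of that arithmetic, assuming the usual decomposition in which the total width is fixed by the pair-decay width and its branching ratio. The adopted input values do not appear in the abstract, so they are left symbolic here.

```latex
\Gamma_{\rm rad}
  = \frac{\Gamma_{\rm rad}}{\Gamma}
    \left[\frac{\Gamma_\pi(E_0)}{\Gamma}\right]^{-1}
    \Gamma_\pi(E_0),
\qquad
\Gamma = \frac{\Gamma_{\rm rad}}{\Gamma_{\rm rad}/\Gamma}
  \approx \frac{3.75\times10^{-3}\ \text{eV}}{4.01\times10^{-4}}
  \approx 9.4\ \text{eV}.
```

The second expression only divides the two central values quoted in the abstract, without propagating uncertainties, to show the total width they imply.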

arXiv:2311.08367 [pdf, other]

Arboricity-Dependent Algorithms for Edge Coloring

Authors: Sayan Bhattacharya, Martín Costa, Nadav Panski, Shay Solomon

Abstract: The problem of edge coloring has been extensively studied over the years. Recently, this problem has received significant attention in the dynamic setting, where we are given a dynamic graph evolving via a sequence of edge insertions and deletions and our objective is to maintain an edge coloring of the graph. Currently, it is not known whether it is possible to maintain a $(Δ + O(Δ^{1 - μ}))$-edge coloring in $\tilde{O}(1)$ update time, for any constant $μ > 0$, where $Δ$ is the maximum degree of the graph. In this paper, we show how to efficiently maintain a $(Δ + O(α))$-edge coloring in $\tilde O(1)$ amortized update time, where $α$ is the arboricity of the graph. Thus, we answer this question in the affirmative for graphs of sufficiently small arboricity.

Submitted 7 February, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

Comments: Started to circulate in September 2023

arXiv:2311.08027 [pdf, other]

A practical key-recovery attack on LWE-based key-encapsulation mechanism schemes using Rowhammer

Authors: Puja Mondal, Suparna Kundu, Sarani Bhattacharya, Angshuman Karmakar, Ingrid Verbauwhede

Abstract: Physical attacks are serious threats to cryptosystems deployed in the real world. In this work, we propose a microarchitectural end-to-end attack methodology on generic lattice-based post-quantum key encapsulation mechanisms to recover the long-term secret key. Our attack targets a critical component of the Fujisaki-Okamoto transform that is used in the construction of almost all lattice-based key encapsulation mechanisms. We demonstrate our attack model on practical schemes such as Kyber and Saber by using Rowhammer. We show that our attack is highly practical and imposes few preconditions on the attacker to succeed. As an additional contribution, we propose an improved version of the plaintext checking oracle, which is used by almost all physical attack strategies on lattice-based key-encapsulation mechanisms. Our improvement reduces the number of queries to the plaintext checking oracle by as much as $39\%$ for Saber and approximately $23\%$ for Kyber768. This can be of independent interest and can also be used to reduce the complexity of other attacks.

Submitted 14 November, 2023; originally announced November 2023.

ACM Class: E.3.3

arXiv:2311.07103 [pdf, other]

doi 10.1016/j.nimb.2023.05.053

Particle Identification at VAMOS++ with Machine Learning Techniques

Authors: Y. Cho, Y. H. Kim, S. Choi, J. Park, S. Bae, K. I. Hahn, Y. Son, A. Navin, A. Lemasson, M. Rejmund, D. Ramos, D. Ackermann, A. Utepov, C. Fourgeres, J. C. Thomas, J. Goupil, G. Fremont, G. de France, Y. X. Watanabe, Y. Hirayama, S. Jeong, T. Niwase, H. Miyatake, P. Schury, M. Rosenbusch , et al. (23 additional authors not shown)

Abstract: A multi-nucleon transfer reaction between a 136Xe beam and a 198Pt target was performed using the VAMOS++ spectrometer at GANIL to study the structure of n-rich nuclei around N=126. Unambiguous charge state identification was obtained by combining two supervised machine learning methods, deep neural network (DNN) and positional correction using a gradient-boosting decision tree (GBDT). The new method reduced the complexity of the kinetic energy calibration and outperformed the conventional method, improving the charge state resolution by 8%.

Submitted 14 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

Journal ref: Nuclear Instruments and Methods in Physics Research Section B: Beam Interactions with Materials and Atoms, Volume 541, August 2023, Pages 240-242

arXiv:2311.03315 [pdf, other]

doi 10.1103/PhysRevD.110.L061501

Gravitational memory signal from neutrino self-interactions in supernova

Authors: Soumya Bhattacharya, Debanjan Bose, Indranil Chakraborty, Arpan Hait, Subhendra Mohanty

Abstract: Neutrinos with large self-interactions, arising from exchange of light scalars or vectors with mass $M_φ \simeq 10\,{\rm MeV}$, can play a useful role in cosmology for structure formation and solving the Hubble tension. It has been proposed that large self-interactions of neutrinos may change the observed properties of supernovae, such as the neutrino luminosity or the duration of the neutrino burst. In this paper, we study the gravitational wave memory signal arising from supernova neutrinos. Our results reveal that the memory signal for self-interacting neutrinos is weaker than that for free-streaming neutrinos in the high frequency range. Implications for detecting and differentiating between such signals for planned space-borne detectors, DECIGO and BBO, are also discussed.

Submitted 4 September, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

Comments: Revised, rewritten version with some additions and omissions, matches with the published version in PRD letters

arXiv:2311.03267 [pdf, ps, other]

Nibbling at Long Cycles: Dynamic (and Static) Edge Coloring in Optimal Time

Authors: Sayan Bhattacharya, Martín Costa, Nadav Panski, Shay Solomon

Abstract: We consider the problem of maintaining a $(1+ε)Δ$-edge coloring in a dynamic graph $G$ with $n$ nodes and maximum degree at most $Δ$. The state-of-the-art update time is $O_ε(\text{polylog}(n))$, by Duan, He and Zhang [SODA'19] and by Christiansen [STOC'23], and more precisely $O(\log^7 n/ε^2)$, where $Δ = Ω(\log^2 n / ε^2)$. The following natural question arises: What is the best possible update time of an algorithm for this task? More specifically, \textbf{can we bring it all the way down to some constant} (for constant $ε$)? This question coincides with the \emph{static} time barrier for the problem: Even for $(2Δ-1)$-coloring, there is only a naive $O(m \log Δ)$-time algorithm. We answer this fundamental question in the affirmative, by presenting a dynamic $(1+ε)Δ$-edge coloring algorithm with $O(\log^4 (1/ε)/ε^9)$ update time, provided $Δ = Ω_ε(\text{polylog}(n))$. As a corollary, we also get the first linear time (for constant $ε$) \emph{static} algorithm for $(1+ε)Δ$-edge coloring; in particular, we achieve a running time of $O(m \log (1/ε)/ε^2)$. We obtain our results by carefully combining a variant of the \textsc{Nibble} algorithm from Bhattacharya, Grandoni and Wajc [SODA'21] with the subsampling technique of Kulkarni, Liu, Sah, Sawhney and Tarnawski [STOC'22].

Submitted 6 November, 2023; originally announced November 2023.

Comments: Accepted at SODA 2024
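
The abstract above mentions $(2Δ-1)$-edge coloring as the easy baseline. The sketch below illustrates why that palette always suffices: an edge shares an endpoint with at most $2(Δ-1)$ other edges, so a greedy scan always finds a free color among $2Δ-1$. This is a plain textbook greedy, written for illustration only. It is neither the $O(m \log Δ)$-time algorithm the abstract refers to nor the paper's dynamic algorithm, and the function names are hypothetical.

```python
from collections import defaultdict


def greedy_edge_coloring(edges):
    """Greedily color the edges of a loop-free graph with at most 2*Delta - 1 colors.

    `edges` is an iterable of (u, v) pairs with u != v and no repeated pairs.
    Returns a dict mapping each edge to a color in {0, ..., 2*Delta - 2}.
    """
    used = defaultdict(set)  # vertex -> colors already on its incident edges
    coloring = {}
    for u, v in edges:
        blocked = used[u] | used[v]
        color = 0
        while color in blocked:  # at most 2*(Delta - 1) colors can be blocked
            color += 1
        coloring[(u, v)] = color
        used[u].add(color)
        used[v].add(color)
    return coloring


def max_degree(edges):
    """Maximum vertex degree Delta of the graph given by its edge list."""
    degree = defaultdict(int)
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return max(degree.values(), default=0)
```

The inner `while` loop makes this scan slower than the bounds discussed in the abstract; it is meant only to show that the $2Δ-1$ palette is never exhausted.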

arXiv:2311.03084 [pdf, other]

A Simple yet Efficient Ensemble Approach for AI-generated Text Detection

Authors: Harika Abburi, Kalyani Roy, Michael Suesserman, Nirmala Pudota, Balaji Veeramani, Edward Bowen, Sanmitra Bhattacharya

Abstract: Recent Large Language Models (LLMs) have demonstrated remarkable capabilities in generating text that closely resembles human writing across a wide range of styles and genres. However, such capabilities are prone to potential abuse, such as fake news generation, spam email creation, and misuse in academic assignments. Hence, it is essential to build automated approaches capable of distinguishing between artificially generated text and human-authored text. In this paper, we propose a simple yet efficient solution to this problem by ensembling predictions from multiple constituent LLMs. Compared to previous state-of-the-art approaches, which are perplexity-based or use ensembles with a number of LLMs, our condensed ensembling approach uses only two constituent LLMs to achieve comparable performance. Experiments conducted on four benchmark datasets for generative text classification show performance improvements in the range of 0.5 to 100\% compared to previous state-of-the-art approaches. We also study the influence that the training data from individual LLMs have on model performance. We found that substituting commercially-restrictive Generative Pre-trained Transformer (GPT) data with data generated from other open language models such as Falcon, Large Language Model Meta AI (LLaMA2), and Mosaic Pretrained Transformers (MPT) is a feasible alternative when developing generative text detectors. Furthermore, to demonstrate zero-shot generalization, we experimented with an English essays dataset, and results suggest that our ensembling approach can handle new data effectively.

Submitted 7 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

arXiv:2311.02915 [pdf, other]

Electromagnetic extension of Buchdahl bound in $f(R,T)$ gravity

Authors: Soumik Bhattacharya, Ranjan Sharma, Sunil D. Maharaj

Abstract: We develop a static charged stellar model in $f(R,T)$ gravity where the modification is assumed to be linear in $T$ which is the trace of the energy momentum tensor. The exterior spacetime of the charged object is described by the Reissner-Nordström metric. The interior solution is obtained by invoking the Buchdahl-Vaidya-Tikekar ansatz, for the metric potential $g_{rr}$, which has a clear geometric interpretation. A detailed physical analysis of the model clearly shows distinct physical features of the resulting stellar configuration under such a modification. We find the maximum compactness bound for such a class of compact stars which is a generalization of the Buchdahl bound for a charged sphere described in $f(R,T)$ gravity. Our result shows physical behaviour that is distinct from general relativity.

Submitted 6 November, 2023; originally announced November 2023.

arXiv:2311.01858 [pdf, other]

doi 10.1103/PhysRevB.109.024427

Magnetic properties and spin dynamics in a spin-orbit driven Jeff = 1/2 triangular lattice antiferromagnet

Authors: J. Khatua, S. Bhattacharya, A. M. Strydom, A. Zorko, J. S. Lord, A. Ozarowski, E. Kermarrec, P. Khuntia

Abstract: Frustration-induced strong quantum fluctuations accompanied by spin-orbit coupling and crystal electric field can give rise to rich and diverse magnetic phenomena associated with unconventional low-energy excitations in rare-earth based quantum magnets. Herein, we present crystal structure, magnetic susceptibility, specific heat, muon spin relaxation (muSR), and electron spin resonance (ESR) studies on the polycrystalline samples of Ba6Yb2Ti4O17 in which Yb3+ ions constitute a perfect triangular lattice in the ab-plane without detectable anti-site disorder between atomic sites. The Curie-Weiss fit of low-temperature magnetic susceptibility data suggests the spin-orbit entangled Jeff = 1/2 degrees of freedom of Yb3+ spin with weak antiferromagnetic exchange interactions in the Kramers doublet ground state. The zero-field specific heat data reveal the presence of long-range magnetic order at TN = 77 mK which is suppressed in a magnetic field of 1 T. The broad maximum in specific heat is attributed to the Schottky anomaly implying the Zeeman splitting of the Kramers doublet ground state. The ESR measurements suggest the presence of anisotropic exchange interaction between the moments of Yb3+ spins and the well separated Kramers doublet state. muSR experiments reveal a fluctuating state of Yb3+ spins in the temperature range 0.1 K-100 K owing to depopulation of crystal electric field levels, which suggests that the Kramers doublets are well separated, consistent with thermodynamic and ESR results. In addition to the intraplane nearest-neighbor superexchange interaction, the interplane exchange interaction and anisotropy are expected to stabilize the long-range ordered state in this triangular lattice antiferromagnet.

Submitted 3 November, 2023; originally announced November 2023.

Journal ref: Phys. Rev. B 109, 024427 (2024)

arXiv:2311.01534 [pdf, other]

Approximate Multiagent Reinforcement Learning for On-Demand Urban Mobility Problem on a Large Map (extended version)

Authors: Daniel Garces, Sushmita Bhattacharya, Dimitri Bertsekas, Stephanie Gil

Abstract: In this paper, we focus on the autonomous multiagent taxi routing problem for a large urban environment where the location and number of future ride requests are unknown a priori, but can be estimated by an empirical distribution. Recent theory has shown that a rollout algorithm with a stable base policy produces a near-optimal stable policy. In the routing setting, a policy is stable if its execution keeps the number of outstanding requests uniformly bounded over time. Although rollout-based approaches are well-suited for learning cooperative multiagent policies with considerations for future demand, applying such methods to a large urban environment can be computationally expensive due to the large number of taxis required for stability. In this paper, we aim to address the computational bottleneck of multiagent rollout by proposing an approximate multiagent rollout-based two-phase algorithm that reduces computational costs, while still achieving a stable near-optimal policy. Our approach partitions the graph into sectors based on the predicted demand and the maximum number of taxis that can run sequentially given the user's computational resources. The algorithm then applies instantaneous assignment (IA) for re-balancing taxis across sectors and a sector-wide multiagent rollout algorithm that is executed in parallel for each sector. We provide two main theoretical results: 1) we characterize the number of taxis $m$ that is sufficient for IA to be stable; 2) we derive a necessary condition on $m$ to maintain stability for IA as time goes to infinity. Our numerical results show that our approach achieves stability for an $m$ that satisfies the theoretical conditions. We also empirically demonstrate that our proposed two-phase algorithm has equivalent performance to the one-at-a-time rollout over the entire map, but with significantly lower runtimes.

Submitted 18 February, 2025; v1 submitted 2 November, 2023; originally announced November 2023.

Comments: 12 pages, 5 figures, 1 lemma, and 2 theorems

arXiv:2310.20638 [pdf, other]

Histopathological Image Analysis with Style-Augmented Feature Domain Mixing for Improved Generalization

Authors: Vaibhav Khamankar, Sutanu Bera, Saumik Bhattacharya, Debashis Sen, Prabir Kumar Biswas

Abstract: Histopathological images are essential for medical diagnosis and treatment planning, but interpreting them accurately using machine learning can be challenging due to variations in tissue preparation, staining and imaging protocols. Domain generalization aims to address such limitations by enabling the learning models to generalize to new datasets or populations. Style transfer-based data augmentation is an emerging technique that can be used to improve the generalizability of machine learning models for histopathological images. However, existing style transfer-based methods can be computationally expensive, and they rely on artistic styles, which can negatively impact model accuracy. In this study, we propose a feature domain style mixing technique that uses adaptive instance normalization to generate style-augmented versions of images. We compare our proposed method with existing style transfer-based data augmentation methods and find that it performs similarly or better, despite requiring less computation and time. Our results demonstrate the potential of feature domain statistics mixing in the generalization of learning models for histopathological image analysis.

Submitted 31 October, 2023; originally announced October 2023.

Comments: Paper is published in MedAGI 2023 (MICCAI 2023 1st International Workshop on Foundation Models for General Medical AI) Code link: https://github.com/Vaibhav-Khamankar/FuseStyle Paper link: https://nbviewer.org/github/MedAGI/medagi.github.io/blob/main/src/assets/papers/P17.pdf
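
The abstract above describes mixing feature-domain statistics through adaptive instance normalization (AdaIN). The sketch below shows the general idea on a single feature channel flattened to a list of floats. The abstract does not give the paper's exact formulation, so this sketch assumes the commonly used AdaIN form (normalize the content features, then rescale to target statistics) and a linear interpolation of mean and standard deviation. The names `adain_mix`, `alpha`, and `EPS` are hypothetical.

```python
from statistics import fmean, pstdev

EPS = 1e-5  # hypothetical stabiliser that avoids division by zero


def channel_stats(feature):
    """Mean and (population) standard deviation of one feature channel."""
    return fmean(feature), pstdev(feature)


def adain_mix(content, style, alpha):
    """Re-normalise `content` to statistics interpolated between content and style.

    alpha = 1 keeps the content statistics; alpha = 0 adopts the style statistics.
    Assumption: plain linear mixing of mean and standard deviation.
    """
    mu_c, sigma_c = channel_stats(content)
    mu_s, sigma_s = channel_stats(style)
    mu_mix = alpha * mu_c + (1 - alpha) * mu_s
    sigma_mix = alpha * sigma_c + (1 - alpha) * sigma_s
    return [sigma_mix * (x - mu_c) / (sigma_c + EPS) + mu_mix for x in content]
```

With `alpha = 0` the output takes on the mean and, up to the small `EPS` correction, the standard deviation of the style channel, while the relative ordering of the content values is preserved. In a real model the same operation would be applied per channel to intermediate feature maps.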

arXiv:2310.19436 [pdf, other]

doi 10.1007/s10714-024-03284-y

Resummation of local and non-local scalar self energies via the Schwinger-Dyson equation in de Sitter spacetime

Authors: Sourav Bhattacharya, Nitin Joshi, Kinsuk Roy

Abstract: We consider a massless and minimally coupled self interacting quantum scalar field in the inflationary de Sitter spacetime. The scalar potential is taken to be a hybrid, $V(φ)= λφ^4/4!+βφ^3/3!$ ($λ>0$). Compared to the earlier well studied $β=0$ case, the present potential has a rolling down effect due to the $φ^3$ term, along with the usual bounding effect due to the $φ^4$ term. We begin by constructing the Schwinger-Dyson equation for the scalar Feynman propagator up to two loop, at ${\cal O}(λ)$, ${\cal O}(β^2)$, ${\cal O}(λ^2)$ and ${\cal O}(λβ^2)$. We consider first the local part of the scalar self energy and compute the rest mass squared of the scalar field, dynamically generated via the late time non-perturbative secular logarithms, by resumming the daisy-like graphs. The logarithms associated here are sub-leading, compared to those associated with the non-local, leading terms. We also argue that unlike the quartic case, considering merely the one loop results for the purpose of resummation does not give us any sensible result here. We next construct the non-perturbative two particle irreducible effective action up to three loop and derive from it the Schwinger-Dyson equation once again. This equation is satisfied by the non-perturbative Feynman propagator. By series expanding this propagator, the resummed local part of the self energy is shown to yield the same dynamical mass as that of the above. We next use this equation to resum the effect of the non-local part of the scalar self energy, and show that even though the perturbatively corrected propagator shows secular growth at late times, there exists one resummed solution which is vanishing for large spatial separations, in qualitative agreement with that of the stochastic formalism.

Submitted 13 August, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

Comments: v2, 31pp, 11 figs; added references, discussion and clarifications; accepted in GRG

Journal ref: Gen. Rel. Grav 56:94 (2024)

arXiv:2310.19395 [pdf, ps, other]

Nonrelativistic spin splittings and altermagnetism in twisted bilayers of centrosymmetric antiferromagnets

Authors: Sajjan Sheoran, Saswata Bhattacharya

Abstract: Magnetism-driven nonrelativistic spin splittings (NRSS) are promising for highly efficient spintronics applications. Although 2D centrosymmetric (in four-dimensional spacetime) antiferromagnets are abundant, they have not received extensive research attention owing to symmetry-forbidden spin polarization and magnetization. Here, we demonstrate a paradigm to harness NRSS by twisting the bilayer of centrosymmetric antiferromagnets with commensurate twist angles. We observe $i$-wave altermagnetism and spin-momentum locking by first-principles simulations and symmetry analysis on prototypical MnPSe$_3$ and MnSe antiferromagnets. The strength of NRSS (up to 80 meVÅ) induced by twisting is comparable to SOC-induced linear Rashba-Dresselhaus effects. The results also demonstrate how applying biaxial strain and a vertical electric field tune the NRSS. The findings reveal the untapped potential of centrosymmetric antiferromagnets and thus expand the material's horizons in spintronics.

Submitted 1 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

arXiv:2310.18371 [pdf, ps, other]

In-Context Ability Transfer for Question Decomposition in Complex QA

Authors: Venktesh V, Sourangshu Bhattacharya, Avishek Anand

Abstract: Answering complex questions is a challenging task that requires question decomposition and multistep reasoning for arriving at the solution. While existing supervised and unsupervised approaches are specialized to a certain task and involve training, recently proposed prompt-based approaches offer generalizable solutions to tackle a wide variety of complex question-answering (QA) tasks. However, existing prompt-based approaches that are effective for complex QA tasks involve expensive hand annotations from experts in the form of rationales and are not generalizable to newer complex QA scenarios and tasks. We propose ICAT (In-Context Ability Transfer), which induces reasoning capabilities in LLMs without any LLM fine-tuning or manual annotation of in-context samples. We transfer the ability to decompose complex questions to simpler questions or generate step-by-step rationales to LLMs, by careful selection from available data sources of related tasks. We also propose an automated uncertainty-aware exemplar selection approach for selecting examples from transfer data sources. Finally, we conduct large-scale experiments on a variety of complex QA tasks involving numerical reasoning, compositional complex QA, and heterogeneous complex QA which require decomposed reasoning. We show that ICAT convincingly outperforms existing prompt-based solutions without involving any model training, showcasing the benefits of re-using existing abilities.

Submitted 26 October, 2023; originally announced October 2023.

Comments: 10 pages

arXiv:2310.17831 [pdf, other]

On monic abelian trace-one cubic polynomials

Authors: Shubhrajit Bhattacharya, Andrew O'Desky

Abstract: We compute the asymptotic number of monic trace-one integral polynomials with Galois group $C_3$ and bounded height. For such polynomials we compute a height function coming from toric geometry and introduce a parametrization using the quadratic cyclotomic field $\mathbb Q(\sqrt{-3})$. We also give a formula for the number of polynomials of the form $t^3 -t^2 + at + b \in \mathbb Z[t]$ with Galois group $C_3$ for a fixed integer $a$.

Submitted 26 October, 2023; originally announced October 2023.

Comments: 24 pages

MSC Class: 11C08; 11G50; 14M25

arXiv:2310.17420 [pdf, other]

Fully Dynamic $k$-Clustering in $\tilde O(k)$ Update Time

Authors: Sayan Bhattacharya, Martín Costa, Silvio Lattanzi, Nikos Parotsidis

Abstract: We present a $O(1)$-approximate fully dynamic algorithm for the $k$-median and $k$-means problems on metric spaces with amortized update time $\tilde O(k)$ and worst-case query time $\tilde O(k^2)$. We complement our theoretical analysis with the first in-depth experimental study for the dynamic $k$-median problem on general metrics, focusing on comparing our dynamic algorithm to the current state-of-the-art by Henzinger and Kale [ESA'20]. Finally, we also provide a lower bound for dynamic $k$-median which shows that any $O(1)$-approximate algorithm with $\tilde O(\text{poly}(k))$ query time must have $\tilde Ω(k)$ amortized update time, even in the incremental setting.

Submitted 26 October, 2023; originally announced October 2023.

Comments: Accepted at NeurIPS 2023

arXiv:2310.15552 [pdf, other]

Unveiling Multilinguality in Transformer Models: Exploring Language Specificity in Feed-Forward Networks

Authors: Sunit Bhattacharya, Ondrej Bojar

Abstract: Recent research suggests that the feed-forward module within Transformers can be viewed as a collection of key-value memories, where the keys learn to capture specific patterns from the input based on the training examples. The values then combine the output from the 'memories' of the keys to generate predictions about the next token. This leads to an incremental process of prediction that gradually converges towards the final token choice near the output layers. This interesting perspective raises questions about how multilingual models might leverage this mechanism. Specifically, for autoregressive models trained on two or more languages, do all neurons (across layers) respond equally to all languages? No! Our hypothesis centers around the notion that during pretraining, certain model parameters learn strong language-specific features, while others learn more language-agnostic (shared across languages) features. To validate this, we conduct experiments utilizing parallel corpora of two languages that the model was initially pretrained on. Our findings reveal that the layers closest to the network's input or output tend to exhibit more language-specific behaviour compared to the layers in the middle.

Submitted 24 October, 2023; originally announced October 2023.

arXiv:2310.13114 [pdf, other]

Generalized Parton Distributions from Lattice QCD with Asymmetric Momentum Transfer: Axial-vector case

Authors: Shohini Bhattacharya, Krzysztof Cichy, Martha Constantinou, Jack Dodson, Xiang Gao, Andreas Metz, Joshua Miller, Swagato Mukherjee, Peter Petreczky, Fernanda Steffens, Yong Zhao

Abstract: Recently, we made significant advancements in improving the computational efficiency of lattice QCD calculations for Generalized Parton Distributions (GPDs). This progress was achieved by adopting calculations of matrix elements in asymmetric frames, deviating from the computationally-expensive symmetric frame typically used, and allowing freedom in the choice for the distribution of the momentum transfer between the initial and final states. A crucial aspect of this approach involves the adoption of a Lorentz covariant parameterization for the matrix elements, introducing Lorentz-invariant amplitudes. This approach also allows us to propose an alternative definition of quasi-GPDs, ensuring frame independence and potentially reducing power corrections in matching to light-cone GPDs. In our previous work, we presented lattice QCD results for twist-2 unpolarized GPDs ($H$ and $E$) of quarks obtained from calculations performed in asymmetric frames at zero skewness. Building upon this work, we now introduce a novel Lorentz covariant parameterization for the axial-vector matrix elements. We employ this parameterization to compute the axial-vector GPD $\widetilde{H}$ at zero skewness, using an $N_f=2+1+1$ ensemble of twisted mass fermions with clover improvement. The light-quark masses employed in our calculations correspond to a pion mass of approximately 260 MeV.

Submitted 29 February, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

Comments: 32 pages, 20 figures. Version accepted for publication in Physical Review D

arXiv:2310.06479 [pdf]

doi 10.1049/icp.2023.0029

Photovoltaic grid-forming control strategy investigation using hardware-in-the-loop experiments

Authors: Somesh Bhattacharya, Chrysanthos Charalambous, Anja Banjac, Zoran Miletic, Thomas Strasser, Brian Azzopardi, Christina Papadimitriou, Venizelos Efthymiou, Alexis Polycarpou

Abstract: The frequency stability of a power system is of paramount importance, as fast frequency swings in the system can lead to oscillatory instability, and thereby blackouts. A grid-connected microgrid that can operate in the islanded mode can also possess such a deteriorating effect due to the higher share of converter-based sources. In this paper, a coordinated frequency control within a distribution network is discussed, with a higher share of Photovoltaics (PV). The main objective of this paper is to test the grid-forming capabilities of PVs, without the requirement of an energy storage in the network. The tests were carried out with the help of the Typhoon Hardware-in-the-loop (HIL) platform using a real Cypriot network feeder. The real-time results confirm the efficacy of the PV as a grid-forming inverter, provided it has sufficient input (irradiance) to provide for the loads within the system of interest. The grid-forming PV also possesses the capability of reconnection with the utility grid through a synchronizer switch that requires minimal communication, making the overall control independent of any other power source, subject to certain irradiance and loading conditions.

Submitted 10 October, 2023; originally announced October 2023.

Comments: 13th Mediterranean Conference on Power Generation, Transmission, Distribution and Energy Conversion (MEDPOWER 2022)

arXiv:2310.05172 [pdf, other]

Systematic Evaluation of Randomized Cache Designs against Cache Occupancy

Authors: Anirban Chakraborty, Nimish Mishra, Sayandeep Saha, Sarani Bhattacharya, Debdeep Mukhopadhyay

Abstract: Randomizing the address-to-set mapping and partitioning of the cache has been shown to be an effective mechanism in designing secured caches. Several designs have been proposed on a variety of rationales: (1) randomized design, (2) randomized-and-partitioned design, and (3) pseudo-fully associative design. This work fills in a crucial gap in current literature on randomized caches: currently most randomized cache designs defend only against contention-based attacks, and leave out considerations of cache occupancy. We perform a systematic evaluation of 5 randomized cache designs (CEASER, CEASER-S, MIRAGE, Scatter-Cache, and Sass-cache) against cache occupancy with respect to both performance and security. With respect to performance, we first establish that benchmarking strategies used by contemporary designs are unsuitable for a fair evaluation (because of differing cache configurations, choice of benchmarking suites, additional implementation-specific assumptions). We thus propose a uniform benchmarking strategy, which allows us to perform a fair and comparative analysis across all designs under various replacement policies. Likewise, with respect to security against cache occupancy attacks, we evaluate the cache designs against various threat assumptions: (1) covert channels, (2) process fingerprinting, and (3) AES key recovery (to the best of our knowledge, this work is the first to demonstrate full AES key recovery on a randomized cache design using cache occupancy attack). Our results establish the need to also consider cache occupancy side-channel in randomized cache design considerations.

Submitted 30 January, 2025; v1 submitted 8 October, 2023; originally announced October 2023.

arXiv:2310.04819 [pdf, other]

doi 10.1007/s11128-025-04700-1

Breaking absolute separability with quantum switch

Authors: Sravani Yanamandra, P V Srinidhi, Samyadeb Bhattacharya, Indranil Chakrabarty, Suchetana Goswami

Abstract: Absolute separable (AS) quantum states are those states from which it is impossible to create entanglement, even under global unitary operations. It is known from the resource theory of non-absolute separability that the set of absolute separable states forms a convex and compact set, and global unitaries are free operations. We show that the action of a quantum switch controlled by an ancilla qubit over the global unitaries can break this robustness of AS states and produce ordinary separable states. First, we consider bipartite qubit systems and find the effect of quantum switch starting from the states sitting on the boundary of the set of absolute separable states. As particular examples, we illustrate what happens to modified Werner states and Bell diagonal (BD) states. For the Bell diagonal states, we provide the structure for the set of AS BD states and show how the structure changes under the influence of a switch. Further, we consider numerical generalisation of the global unitary operations and show that it is always possible to take AS states out of the convex set under switching operations. We also generalised our results in higher dimensions.

Submitted 7 October, 2023; originally announced October 2023.

Comments: Comments are welcome!

Journal ref: Quantum Inf Process 24, 81 (2025)

arXiv:2310.04453 [pdf, other]

COVID-19 South African Vaccine Hesitancy Models Show Boost in Performance Upon Fine-Tuning on M-pox Tweets

Authors: Nicholas Perikli, Srimoy Bhattacharya, Blessing Ogbuokiri, Zahra Movahedi Nia, Benjamin Lieberman, Nidhi Tripathi, Salah-Eddine Dahbi, Finn Stevenson, Nicola Bragazzi, Jude Kong, Bruce Mellado

Abstract: Very large numbers of M-pox cases have, since the start of May 2022, been reported in non-endemic countries leading many to fear that the M-pox Outbreak would rapidly transition into another pandemic, while the COVID-19 pandemic ravages on. Given the similarities of M-pox with COVID-19, we chose to test the performance of COVID-19 models trained on South African twitter data on a hand-labelled M-pox dataset before and after fine-tuning. More than 20k M-pox-related tweets from South Africa were hand-labelled as being either positive, negative or neutral. After fine-tuning these COVID-19 models on the M-pox dataset, the F1-scores increased by more than 8% falling just short of 70%, but still outperforming state-of-the-art models and well-known classification algorithms. An LDA-based topic modelling procedure was used to compare the misclassified M-pox tweets of the original COVID-19 RoBERTa model with its fine-tuned version, and from this analysis, we were able to draw conclusions on how to build more sophisticated models.

Submitted 4 October, 2023; originally announced October 2023.

arXiv:2310.01903 [pdf, other]

The AstroSat UV Deep Field North: Direct determination of the UV Luminosity Function and its evolution from z~0.8-0.4

Authors: Souradeep Bhattacharya, Kanak Saha, Chayan Mondal

Abstract: We characterize the evolution of the rest-frame 1500 Å UV luminosity Function (UVLF) from AstroSat/UVIT F154W and N242W imaging in the Great Observatories Origins Survey North (GOODS-N) field. With deep FUV observations, we construct the UVLF for galaxies at z$<0.13$ and subsequently characterise it with a Schechter function fit. The fitted parameters are consistent with previous determinations. With deep NUV observations, we are able to construct the UVLF in seven redshift bins in the range z $\sim$ 0.8 - 0.4, with galaxies identified till $\sim$2 mag fainter than previous surveys, owing to the high angular-resolution of UVIT. The fitted Schechter function parameters are obtained for these UVLFs. At z $\sim$ 0.8 - 0.7, we also utilize Hubble Space Telescope (HST) F275W observations in the GOODS-N field to construct the UVLF in 2 redshift bins, whose fitted Schechter function parameters are then found to be consistent with that determined from UVIT at z $\sim$ 0.75. We thus probe the variation of the fitted UVLF parameters over z $\sim$ 0.8 - 0.4, a span of $\sim$2.7 Gyr in age. We find that the slope of the Schechter function, $α$, is at its steepest at z $\sim$ 0.65, implying highest star-formation at this instant with galaxies being relatively more passive before and after this time. We infer that this may be a short-lived instance of increased cosmic star-formation even though cosmic star-formation may be winding-down over longer timespan at this redshift range.

Submitted 25 June, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

Comments: Accepted for publication at MNRAS, 9 pages, 7 figures, 1 table

arXiv:2310.00917 [pdf, other]

Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance

Authors: Alloy Das, Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal, Saumik Bhattacharya

Abstract: The adaptation capability to a wide range of domains is crucial for scene text spotting models when deployed to real-world conditions. However, existing state-of-the-art (SOTA) approaches usually incorporate scene text detection and recognition simply by pretraining on natural scene text datasets, which do not directly exploit the intermediate feature representations between multiple domains. Here, we investigate the problem of domain-adaptive scene text spotting, i.e., training a model on multi-domain source data such that it can directly adapt to target domains rather than being specialized for a specific domain or scenario. Further, we investigate a transformer baseline called Swin-TESTR to focus on solving scene-text spotting for both regular and arbitrary-shaped scene text along with an exhaustive evaluation. The results clearly demonstrate the potential of intermediate representations to achieve significant performance on text spotting benchmarks across multiple domains (e.g. language, synth-to-real, and documents), both in terms of accuracy and efficiency.

Submitted 1 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

Comments: Accepted to the 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

arXiv:2309.15425 [pdf, other]

Quantitative Analysis of Social Influence & Digital Piracy Contagion with Differential Equations on Networks

Authors: Dibyajyoti Mallick, Kumar Gaurav, Saumik Bhattacharya, Sayantari Ghosh

Abstract: Studies of social contagions regularly borrow network models to study the propagation of social influences and opinions and to include social heterogeneity. Such studies provide valuable insights regarding these, but the social network structures cannot be well explored in their study. In this research, we methodically study the trends in online piracy with a continuous ODE approach and differential equations on graphs, to have a clear comparative view. We first formulate a compartmental model to mathematically study bifurcations and thresholds, and later move on with a network-based analysis to illustrate the proliferation of online piracy dynamics with an epidemiological approach over a social network. We propose a solution for this online piracy problem by developing awareness among individuals through media campaigns, which could be a useful factor for the eradication and control of online piracy. Next, using degree-block approximation, network analysis has been performed to investigate the phenomena from a heterogeneous approach and to derive the threshold condition for the persistence of piracy in the population in a steady state. Based on the behavioral responses of individuals in a society due to the effect of media, we examine the system through the aid of realistic parameter selection to better understand the complexity of the dynamics and propose control strategies.

Submitted 27 September, 2023; originally announced September 2023.

Comments: 21 pages, 6 figures

arXiv:2309.15011 [pdf, ps, other]

Characterization of LAPPD timing at CERN PS testbeam

Authors: Deb Sankar Bhattacharya, Andrea Bressan, Chandradoy Chatterjee, Silvia Dalla Torre, Mauro Gregori, Alexander Kiselev, Stefano Levorato, Anna Martin, Saverio Minutoli, Mikhail Osipenko, Richa Rai, Marco Ripani, Fulvio Tessarotto, Triloki Triloki

Abstract: Large Area Picosecond PhotoDetectors (LAPPDs) are photosensors based on microchannel plate technology with about 400 cm$^2$ sensitive area. The external readout plane of a capacitively coupled LAPPD can be segmented into pads providing a spatial resolution down to 1 mm scale. The LAPPD signals have about 0.5 ns risetime followed by a slightly longer falltime and their amplitude reaches a few dozen mV per single photoelectron. In this article, we report on the measurement of the time resolution of an LAPPD prototype in a test beam exercise at CERN PS. Most of the previous measurements of LAPPD time resolution had been performed with laser sources. In this article we report time resolution measurements obtained through the detection of Cherenkov radiation emitted by high energy hadrons. Our approach has been demonstrated capable of measuring time resolutions as fine as 25-30 ps. The available prototype had performance limitations, which prevented us from applying the optimal high voltage setting. The measured time resolution for single photoelectrons is about 80 ps r.m.s.

Submitted 26 September, 2023; originally announced September 2023.

Comments: 35 pages, 23 figures

arXiv:2309.10806 [pdf, other]

doi 10.1103/PhysRevA.109.062213

Relating CP divisibility of dynamical maps with compatibility of channels

Authors: Arindam Mitra, Debashis Saha, Samyadeb Bhattacharya, A. S. Majumdar

Abstract: The role of CP-indivisibility and incompatibility as valuable resources for various information-theoretic tasks is widely acknowledged. This study delves into the intricate relationship between CP-divisibility and channel compatibility. Our investigation focuses on the behaviour of incompatibility robustness of quantum channels for a pair of generic dynamical maps. We show that the incompatibility robustness of channels is monotonically non-increasing for a pair of generic CP-divisible dynamical maps. Further, our explicit study of the behaviour of incompatibility robustness with time for some specific dynamical maps reveals non-monotonic behaviour in the CP-indivisible regime. Additionally, we propose a measure of CP-indivisibility based on the incompatibility robustness of quantum channels. Our investigation provides valuable insights into the nature of quantum dynamical maps and their relevance in information-theoretic applications.

Submitted 1 May, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

Comments: 12 pages, 7 figures, typos fixed

Journal ref: Phys. Rev. A 109, 062213 (2024)

arXiv:2309.07755 [pdf, other]

Generative AI Text Classification using Ensemble LLM Approaches

Authors: Harika Abburi, Michael Suesserman, Nirmala Pudota, Balaji Veeramani, Edward Bowen, Sanmitra Bhattacharya

Abstract: Large Language Models (LLMs) have shown impressive performance across a variety of Artificial Intelligence (AI) and natural language processing tasks, such as content creation, report generation, etc. However, unregulated malign application of these models can create undesirable consequences such as generation of fake news, plagiarism, etc. As a result, accurate detection of AI-generated language can be crucial in responsible usage of LLMs. In this work, we explore 1) whether a certain body of text is AI generated or written by human, and 2) attribution of a specific language model in generating a body of text. Texts in both English and Spanish are considered. The datasets used in this study are provided as part of the Automated Text Identification (AuTexTification) shared task. For each of the research objectives stated above, we propose an ensemble neural model that generates probabilities from different pre-trained LLMs which are used as features to a Traditional Machine Learning (TML) classifier following it. For the first task of distinguishing between AI and human generated text, our model ranked in fifth and thirteenth place (with macro $F1$ scores of 0.733 and 0.649) for English and Spanish texts, respectively. For the second task on model attribution, our model ranked in first place with macro $F1$ scores of 0.625 and 0.653 for English and Spanish texts, respectively.

Submitted 14 September, 2023; originally announced September 2023.

arXiv:2309.04130 [pdf, other]

Gravitational wave memory for a class of static and spherically symmetric spacetimes

Authors: Soumya Bhattacharya, Shramana Ghosh

Abstract: This article aims at comparing gravitational wave memory effect in a Schwarzschild spacetime with that of other compact objects with static and spherically symmetric spacetime, with the purpose of proposing a procedure for differentiating between various compact object geometries. We do this by considering the relative evolution of two nearby test geodesics in different backgrounds in the presence and absence of a gravitational wave pulse and comparing them. Memory effect due to a gravitational wave would ensure that there is a permanent effect on each spacetime and the corresponding geodesic evolution, being metric dependent, would display distinct results in each case. For a complete picture, we have considered both displacement and velocity memory effect in each geometry.

Submitted 8 September, 2023; originally announced September 2023.

Comments: 21 pages, 14 figures

arXiv:2309.02746 [pdf, other]

Determination of dynamical ages of open clusters through the A$^+$ parameter -- II

Authors: Khushboo K. Rao, Kaushar Vaidya, Manan Agarwal, Shanmugha Balan, Souradeep Bhattacharya

Abstract: Blue straggler stars (BSS), one of the most massive members of star clusters, have been used for over a decade to investigate mass segregation and estimate the dynamical ages of globular clusters (GCs) and open clusters (OCs). This work is an extension of our previous study, in which we investigated a correlation between theoretically estimated dynamical ages and the observed $A^+_{\mathrm{rh}}$ values, which represent the sedimentation level of BSS with respect to the reference population. Here, we use the ML-MOC algorithm on Gaia EDR3 data to extend this analysis to 23 OCs. Using cluster properties and identified members, we estimate their dynamical and physical parameters. In order to estimate the $A^+_{\mathrm{rh}}$ values, we use the main sequence and main sequence turnoff stars as the reference population. OCs are observed to exhibit a wide range of degrees of dynamical evolution, ranging from dynamically young to late stages of intermediate dynamical age. Hence, we classify OCs into three distinct dynamical stages based on their relationship to $A^+_{\mathrm{rh}}$ and $N_{\text{relax}}$. NGC 2682 and King 2 are discovered to be the most evolved OCs, like Family III GCs, while Berkeley 18 is the least evolved OC. Melotte 66 and Berkeley 31 are peculiar OCs because none of their dynamical and physical parameters correlate with their BSS segregation levels.

Submitted 6 September, 2023; originally announced September 2023.

Comments: Accepted for publication at MNRAS

arXiv:2309.02074 [pdf, ps, other]

Approximate recoverability and the quantum data processing inequality

Authors: Saptak Bhattacharya

Abstract: In this paper, we discuss the quantum data processing inequality and its refinements that are physically meaningful in the context of approximate recoverability. An important conjecture regarding this due to Seshadreesan et al. in J. Phys. A: Math. Theor. 48 (2015) is disproved. We prove some inequalities capturing universal approximate recoverability with the Petz recovery map for the sandwiched quasi and Rényi relative entropies for the parameter $t=2$. We also obtain convexity theorems on some parametrized versions of the relative entropy and fidelity, which can be of independent interest.

Submitted 16 April, 2025; v1 submitted 5 September, 2023; originally announced September 2023.

Comments: 25 pages, 0 figures

MSC Class: 94A17 15A45

arXiv:2309.01707 [pdf, other]

On the $α$/Fe bimodality of the M31 disks

Authors: Chiaki Kobayashi, Souradeep Bhattacharya, Magda Arnaboldi, Ortwin Gerhard

Abstract: An outstanding question is whether the $α$/Fe bimodality exists in disk galaxies other than in the Milky Way. Here we present a bimodality using our state-of-the-art galactic chemical evolution models that can explain various observations in the Andromeda Galaxy (M31) disks, namely, elemental abundances both of planetary nebulae, and of red-giant branch stars recently observed with the James Webb Space Telescope. We find that in M31 a high-$α$ thicker-disk population out to 30 kpc formed by more intense initial star burst than in the Milky Way. We also find a young low-$α$ thin disk within 14 kpc, which is formed by a secondary star formation M31 underwent about 2-4.5 Gyr ago, probably triggered by a wet merger. In the outer disk, however, the planetary nebula observations indicate a slightly higher-$α$ young ($\sim$2.5 Gyr) population at a given metallicity, possibly formed by secondary star formation from almost pristine gas. Therefore, an $α$/Fe bimodality is seen in the inner disk ($<$14 kpc), while only a slight $α$/Fe offset of the young population is seen in the outer disk ($>$18 kpc). The appearance of the $α$/Fe bimodality depends on the merging history at various galactocentric radii, and wide-field multi-object spectroscopy is required for unveiling the history of M31.

Submitted 4 September, 2023; originally announced September 2023.

Comments: 7 pages, 3 figures, Accepted for publication in The Astrophysical Journal Letters

arXiv:2309.00745 [pdf, other]

Wall-attached convection under strong inclined magnetic fields

Authors: Shashwat Bhattacharya, Thomas Boeck, Dmitry Krasnov, Jörg Schumacher

Abstract: We employ a linear stability analysis and direct numerical simulations to study the characteristics of wall-modes in thermal convection in a rectangular box under strong and inclined magnetic fields. The walls of the convection cell are electrically insulated. The stability analysis assumes periodicity in the spanwise direction perpendicular to the plane of the homogeneous magnetic field. Our study shows that for a fixed vertical magnetic field, the imposition of horizontal magnetic fields results in an increase of the critical Rayleigh number along with a decrease in the wavelength of the wall modes. The wall modes become tilted along the direction of the resulting magnetic fields and therefore extend further into the bulk as the horizontal magnetic field is increased. Once the modes localized on the opposite walls interact, the critical Rayleigh number decreases again and eventually drops below the value for onset with a purely vertical field. We find that for sufficiently strong horizontal magnetic fields, the steady wall modes occupy the entire bulk and therefore convection is no longer restricted to the sidewalls. The above results are confirmed by direct numerical simulations of the nonlinear evolution of magnetoconvection.

Submitted 1 September, 2023; originally announced September 2023.

Comments: 25 pages, 18 figures

MSC Class: 76W05

arXiv:2309.00326 [pdf, ps, other]

doi 10.1140/epjc/s10052-024-12480-8

Formulation of Galilean relativistic Born-Infeld theory

Authors: Rabin Banerjee, Soumya Bhattacharya, Bibhas Ranjan Majhi

Abstract: In this paper, we formulate, for the first time, in a systematic manner, Galilean relativistic Born-Infeld action in detail. Exploiting maps connecting Lorentz relativistic and Galilean relativistic vectors, we construct the two limits (electric and magnetic) of Galilean relativistic Born-Infeld action from usual relativistic Born-Infeld theory. An action formalism is thereby derived. From this action, equations of motion are obtained either in the potential or field formulation. Galilean version of duality transformations involving the electric and magnetic fields are defined. They map the electric limit relations to the magnetic ones and vice-versa, exactly as happens for Galilean relativistic Maxwell theory. We also explicitly show the Galilean boost and gauge invariances of the theory in both limits.

Submitted 9 February, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

Comments: 13 pages, 4 tables, some comments and a table added, matches with the published version

Journal ref: Eur. Phys. J. C 84, 141 (2024)

arXiv:2308.15377 [pdf, other]

Unraveling anomalies in Deeply Virtual Compton Scattering

Authors: Shohini Bhattacharya, Yoshitaka Hatta, Werner Vogelsang

Abstract: We calculate the one-loop quark box diagrams relevant to polarized and unpolarized Deeply Virtual Compton Scattering by introducing an off-forward momentum $l^μ$ as an infrared regulator. This regularization approach allows us to reveal the poles associated with the chiral anomaly in the polarized scenario, as well as the trace anomaly in the unpolarized case. We provide an interpretation of our findings in the context of pertinent Generalized Parton Distributions (GPDs). Furthermore, we discuss the implications of these poles on the QCD factorization pertaining to Compton amplitudes.

Submitted 29 August, 2023; originally announced August 2023.

Comments: 7 pages, 1 figure; contribution to DIS2023: XXX International Workshop on Deep-Inelastic Scattering and Related Subjects, Michigan State University, USA, 27-31 March 2023

arXiv:2308.14988 [pdf, other]

Inferences on Mixing Probabilities and Ranking in Mixed-Membership Models

Authors: Sohom Bhattacharya, Jianqing Fan, Jikai Hou

Abstract: Network data is prevalent in numerous big data applications including economics and health networks where it is of prime importance to understand the latent structure of network. In this paper, we model the network using the Degree-Corrected Mixed Membership (DCMM) model. In DCMM model, for each node $i$, there exists a membership vector $\boldsymbol{\pi}_i = (\boldsymbol{\pi}_i(1), \boldsymbol{\pi}_i(2), \ldots, \boldsymbol{\pi}_i(K))$, where $\boldsymbol{\pi}_i(k)$ denotes the weight that node $i$ puts in community $k$. We derive novel finite-sample expansion for the $\boldsymbol{\pi}_i(k)$s which allows us to obtain asymptotic distributions and confidence interval of the membership mixing probabilities and other related population quantities. This fills an important gap on uncertainty quantification on the membership profile. We further develop a ranking scheme of the vertices based on the membership mixing probabilities on certain communities and perform relevant statistical inferences. A multiplier bootstrap method is proposed for ranking inference of individual member's profile with respect to a given community. The validity of our theoretical results is further demonstrated via numerical experiments in both real and synthetic data examples.

Submitted 28 August, 2023; originally announced August 2023.

arXiv:2308.12722 [pdf, other]

Accelerated Neural Network Training through Dimensionality Reduction for High-Throughput Screening of Topological Materials

Authors: Ruman Moulik, Ankita Phutela, Sajjan Sheoran, Saswata Bhattacharya

Abstract: Machine Learning facilitates building a large variety of models, starting from elementary linear regression models to very complex neural networks. Neural networks are currently limited by the size of data provided and the huge computational cost of training a model. This is especially problematic when dealing with a large set of features without much prior knowledge of how good or bad each individual feature is. We try tackling the problem using dimensionality reduction algorithms to construct more meaningful features. We also compare the accuracy and training times of raw data and data transformed after dimensionality reduction to deduce a sufficient number of dimensions without sacrificing accuracy. The indicated estimation is done using a lighter decision tree-based algorithm, AdaBoost, as it trains faster than neural networks. We have chosen the data from an online database of topological materials, Materiae. Our final goal is to construct a model to predict the topological properties of new materials from elementary properties.

Submitted 24 August, 2023; originally announced August 2023.

arXiv:2308.12473 [pdf, other]

High scale validity of two Higgs doublet scenarios with a real scalar singlet dark matter

Authors: Subhaditya Bhattacharya, Atri Dey, Jayita Lahiri, Biswarup Mukhopadhyaya

Abstract: We study the high-scale validity of two kinds of two Higgs doublet models (2HDM), namely, Type-II and Type-X, but with a scalar SU(2) singlet dark matter (DM) candidate in addition in each case. The additional quartic couplings involving the DM particle in the scalar potential in both the scenarios bring in additional constraints from the requirement of perturbative unitarity and vacuum stability. DM relic density and direct search constraints play a crucial role in this analysis as the perturbative unitarity of the DM-Higgs portal couplings primarily decide the high scale validity of the model. We find that, within the parameter regions thus restricted, the Type-II scenario must have a cut-off at around $10^6$ GeV, while the Type-X scenario admits of validity up to the Planck scale. However, only those regions which are valid up to about $10^8$ GeV in Type-X 2HDM are amenable to detection at the High-luminosity LHC (up to 3000 $fb^{-1}$), while most of the parameter space of the Type-II scenario mentioned above is likely to be detectable.

Submitted 23 August, 2023; originally announced August 2023.

Comments: 32 pages, 6 figures, 4 tables

arXiv:2308.11854 [pdf, other]

doi 10.5120/ijca2023923042

Finding the Perfect Fit: Applying Regression Models to ClimateBench v1.0

Authors: Anmol Chaure, Ashok Kumar Behera, Sudip Bhattacharya

Abstract: Climate projections using data driven machine learning models acting as emulators, is one of the prevailing areas of research to enable policy makers make informed decisions. Use of machine learning emulators as surrogates for computationally heavy GCM simulators reduces time and carbon footprints. In this direction, ClimateBench [1] is a recently curated benchmarking dataset for evaluating the performance of machine learning emulators designed for climate data. Recent studies have reported that despite being considered fundamental, regression models offer several advantages pertaining to climate emulations. In particular, by leveraging the kernel trick, regression models can capture complex relationships and improve their predictive capabilities. This study focuses on evaluating non-linear regression models using the aforementioned dataset. Specifically, we compare the emulation capabilities of three non-linear regression models. Among them, Gaussian Process Regressor demonstrates the best-in-class performance against standard evaluation metrics used for climate field emulation studies. However, Gaussian Process Regression suffers from being computational resource hungry in terms of space and time complexity. Alternatively, Support Vector and Kernel Ridge models also deliver competitive results, but there are certain trade-offs to be addressed.
Additionally, we are actively investigating the performance of composite kernels and techniques such as variational inference to further enhance the performance of the regression models and effectively model complex non-linear patterns, including phenomena like precipitation.

Submitted 22 August, 2023; originally announced August 2023.

Journal ref: International Journal of Computer Applications 185(29):31-39, August 2023

arXiv:2308.11384 [pdf, other]

Non-perturbative $\langle φ\rangle$, $\langle φ^2 \rangle$ and the dynamically generated scalar mass with Yukawa interaction in the inflationary de Sitter spacetime

Authors: Sourav Bhattacharya, Moutushi Dutta Choudhury

Abstract: We consider a massless minimally coupled self interacting quantum scalar field coupled to fermion via the Yukawa interaction, in the inflationary de Sitter background. The fermion is also taken to be massless and the scalar potential is taken to be a hybrid, $V(φ)= λφ^4/4!+ βφ^3/3!$ ($λ>0$). The chief physical motivation behind this choice of $V(φ)$ corresponds to, apart from its boundedness from below property, the fact that shape wise $V(φ)$ has qualitative similarity with standard inflationary classical slow roll potentials. Also, its vacuum expectation value can be negative, suggesting some screening of the inflationary cosmological constant. We choose that $\langle φ\rangle\sim 0$ at early times with respect to the Bunch-Davies vacuum, so that perturbation theory is valid initially. We consider the equations satisfied by $\langle φ(t) \rangle$ and $\langle φ^2(t) \rangle$, constructed from the coarse grained equation of motion for the slowly rolling $φ$. We then compute the vacuum diagrammes of various relevant operators using the in-in formalism up to three loop, in terms of the leading powers of the secular logarithms. For a closed fermion loop, we have restricted ourselves here to only the local contribution. These large temporal logarithms are then resummed by constructing suitable non-perturbative equations to compute $\langle φ\rangle$ and $\langle φ^2 \rangle$.
$\langle φ\rangle$ turns out to be at least approximately an order of magnitude less compared to the minimum of the classical potential, $-3β/λ$, owing to the strong quantum fluctuations. For $\langle φ^2 \rangle$, we have computed the dynamically generated scalar mass at late times, by taking the appropriate purely local contributions. Variations of these quantities with respect to different couplings have also been presented.

Submitted 2 January, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

Comments: v2; 36pp, 11 figs.; added references, discussions and clarifications; improved presentation; accepted in JCAP

arXiv:2308.09104 [pdf, other]

Spike-and-slab shrinkage priors for structurally sparse Bayesian neural networks

Authors: Sanket Jantre, Shrijita Bhattacharya, Tapabrata Maiti

Abstract: Network complexity and computational efficiency have become increasingly significant aspects of deep learning. Sparse deep learning addresses these challenges by recovering a sparse representation of the underlying target function by reducing heavily over-parameterized deep neural networks. Specifically, deep neural architectures compressed via structured sparsity (e.g. node sparsity) provide low latency inference, higher data throughput, and reduced energy consumption. In this paper, we explore two well-established shrinkage techniques, Lasso and Horseshoe, for model compression in Bayesian neural networks. To this end, we propose structurally sparse Bayesian neural networks which systematically prune excessive nodes with (i) Spike-and-Slab Group Lasso (SS-GL), and (ii) Spike-and-Slab Group Horseshoe (SS-GHS) priors, and develop computationally tractable variational inference including continuous relaxation of Bernoulli variables. We establish the contraction rates of the variational posterior of our proposed models as a function of the network topology, layer-wise node cardinalities, and bounds on the network weights. We empirically demonstrate the competitive performance of our models compared to the baseline models in prediction accuracy, model compression, and inference latency.

Submitted 21 August, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

arXiv:2308.08917 [pdf, other]

Unfolding for Joint Channel Estimation and Symbol Detection in MIMO Communication Systems

Authors: Swati Bhattacharya, K. V. S. Hari, Yonina C. Eldar

Abstract: This paper proposes a Joint Channel Estimation and Symbol Detection (JED) scheme for Multiple-Input Multiple-Output (MIMO) wireless communication systems. Our proposed method for JED using Alternating Direction Method of Multipliers (JED-ADMM) and its model-based neural network version JED using Unfolded ADMM (JED-U-ADMM) markedly improve the symbol detection performance over JED using Alternating Minimization (JED-AM) for a range of MIMO antenna configurations. Both proposed algorithms exploit the non-smooth constraint, that occurs as a result of the Quadrature Amplitude Modulation (QAM) data symbols, to effectively improve the performance using the ADMM iterations. The proposed unfolded network JED-U-ADMM consists of a few trainable parameters and requires a small training set. We show the efficacy of the proposed methods for both uncorrelated and correlated MIMO channels. For certain configurations, the gain in SNR for a desired BER of $10^{-2}$ for the proposed JED-ADMM and JED-U-ADMM is up to $4$ dB and is also accompanied by a significant reduction in computational complexity of up to $75\%$, depending on the MIMO configuration, as compared to the complexity of JED-AM.

Submitted 21 August, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

Comments: 14 pages, 19 figures, submitted to IEEE Transactions on Signal Processing

arXiv:2308.04178 [pdf, ps, other]

Assistive Chatbots for healthcare: a succinct review

Authors: Basabdatta Sen Bhattacharya, Vibhav Sinai Pissurlenkar

Abstract: Artificial Intelligence (AI) for supporting healthcare services has never been more necessitated than by the recent global pandemic. Here, we review the state-of-the-art in AI-enabled Chatbots in healthcare proposed during the last 10 years (2013-2023). The focus on AI-enabled technology is because of its potential for enhancing the quality of human-machine interaction via Chatbots, reducing dependence on human-human interaction and saving man-hours. Our review indicates that there are a handful of (commercial) Chatbots that are being used for patient support, while there are others (non-commercial) that are in the clinical trial phases. However, there is a lack of trust on this technology regarding patient safety and data protection, as well as a lack of wider awareness on its benefits among the healthcare workers and professionals. Also, patients have expressed dissatisfaction with Natural Language Processing (NLP) skills of the Chatbots in comparison to humans. Notwithstanding the recent introduction of ChatGPT that has raised the bar for the NLP technology, this Chatbot cannot be trusted with patient safety and medical ethics without thorough and rigorous checks to serve in the `narrow' domain of assistive healthcare.
Our review suggests that to enable deployment and integration of AI-enabled Chatbots in public health services, the need of the hour is: to build technology that is simple and safe to use; to build confidence on the technology among: (a) the medical community by focussed training and development; (b) the patients and wider community through outreach.

Submitted 8 August, 2023; originally announced August 2023.

arXiv:2308.03908 [pdf, other]

ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action Recognition

Authors: Soumyabrata Chaudhuri, Saumik Bhattacharya

Abstract: Video Action Recognition (VAR) is a challenging task due to its inherent complexities. Though different approaches have been explored in the literature, designing a unified framework to recognize a large number of human actions is still a challenging problem. Recently, Multi-Modal Learning (MML) has demonstrated promising results in this domain. In literature, 2D skeleton or pose modality has often been used for this task, either independently or in conjunction with the visual information (RGB modality) present in videos. However, the combination of pose, visual information, and text attributes has not been explored yet, though text and pose attributes independently have been proven to be effective in numerous computer vision tasks. In this paper, we present the first pose augmented Vision-language model (VLM) for VAR. Notably, our scheme achieves an accuracy of 92.81% and 73.02% on two popular human video action recognition benchmark datasets, UCF-101 and HMDB-51, respectively, even without any video data pre-training, and an accuracy of 96.11% and 75.75% after kinetics pre-training.

Submitted 7 August, 2023; originally announced August 2023.

Comments: 7 pages, 3 figures, 2 Tables

arXiv:2308.02905 [pdf, other]

FASTER: A Font-Agnostic Scene Text Editing and Rendering Framework

Authors: Alloy Das, Sanket Biswas, Prasun Roy, Subhankar Ghosh, Umapada Pal, Michael Blumenstein, Josep Lladós, Saumik Bhattacharya

Abstract: Scene Text Editing (STE) is a challenging research problem, that primarily aims towards modifying existing texts in an image while preserving the background and the font style of the original text. Despite its utility in numerous real-world applications, existing style-transfer-based approaches have shown sub-par editing performance due to (1) complex image backgrounds, (2) diverse font attributes, and (3) varying word lengths within the text. To address such limitations, in this paper, we propose a novel font-agnostic scene text editing and rendering framework, named FASTER, for simultaneously generating text in arbitrary styles and locations while preserving a natural and realistic appearance and structure. A combined fusion of target mask generation and style transfer units, with a cascaded self-attention mechanism has been proposed to focus on multi-level text region edits to handle varying word lengths. Extensive evaluation on a real-world database with further subjective human evaluation study indicates the superiority of FASTER in both scene text editing and rendering tasks, in terms of model performance and efficiency. Our code will be released upon acceptance.

Submitted 5 November, 2024; v1 submitted 5 August, 2023; originally announced August 2023.

Comments: Accepted in WACV 2025

Showing 201–250 of 1,252 results for author: Bhattacharya, S