-
Quantum channels that destroy negative conditional entropy
Authors:
PV Srinidhi,
Indranil Chakrabarty,
Samyadeb Bhattacharya,
Nirman Ganguly
Abstract:
Counter-intuitive to classical notions, quantum conditional entropy can be negative, playing a pivotal role in information-processing tasks. This article delves deeply into quantum channels, emphasizing negative conditional entropy breaking channels (NCEB) and introducing negative conditional entropy annihilating channels (NCEA). We characterize these channels from both topological and information-theoretic perspectives, examining their properties when combined serially and NCEB in parallel. Our exploration extends to complementary channels associated with NCEB, leading to the introduction of information-leaking channels. Utilizing the parameters of the standard depolarizing channel, we provide tangible examples and further characterization. We demonstrate the relationship of NCEB and NCEA with newly introduced channels like coherent information breaking (CIB) and mutual information breaking (MIB), along with standard channels like zero capacity channels. Preservation of quantum resources is an integral constituent of quantum information theory. Recognizing this, we lay out prescriptions to detect channels that do not break the negativity of conditional entropy, ensuring the conservation of this quantum resource.
Submitted 4 November, 2024; v1 submitted 27 November, 2023;
originally announced November 2023.
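As a small illustration of the negative conditional entropy mentioned in the abstract above, the following sketch evaluates $S(A|B) = S(AB) - S(B)$ for one half of a two-qubit Bell state sent through a depolarizing channel. The parameterization $ρ \to (1-p)ρ + p\,I/2$, the choice of input state, and the function names are assumptions made for illustration; they are not taken from the paper.

```python
import math


def entropy_bits(probs):
    """Shannon entropy (in bits) of a list of eigenvalues/probabilities."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def conditional_entropy_after_depolarizing(p):
    """S(A|B) for a Bell state whose qubit A passed through a depolarizing
    channel rho -> (1 - p) * rho + p * I/2 (assumed parameterization).

    The output state is (1 - p)|Phi+><Phi+| + p * I/4, whose eigenvalues
    are known in closed form, so no matrix library is needed.
    """
    eigs_ab = [1 - 3 * p / 4, p / 4, p / 4, p / 4]
    s_ab = entropy_bits(eigs_ab)
    s_b = 1.0  # the reduced state of qubit B stays maximally mixed
    return s_ab - s_b


for p in (0.0, 0.1, 0.3, 1.0):
    print(p, conditional_entropy_after_depolarizing(p))
```

For this particular input the conditional entropy is negative at small `p` and becomes positive as the noise grows; whether a channel breaks negativity for every input is the stronger property the abstract refers to.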
-
Rashba splitting in polar-nonpolar sandwich heterostructure: A DFT Study
Authors:
Sanchari Bhattacharya,
Sanjoy Datta
Abstract:
In this study, we employ density functional theory (DFT) based first-principles calculations to investigate the spin-orbit effects in the electronic structure of a polar-nonpolar sandwich heterostructure, namely LAO$_{2.5}$/STO$_{5.5}$/LAO$_{2.5}$. Our focus on the Ti-3d bands reveals an inverted ordering of the STO-$\rm t_{2g}$ orbitals near the n-type interface, consistent with earlier experimental work. In contrast, toward the p-type interface, the orbital ordering aligns with the natural ordering of STO orbitals, influenced by crystal field splitting. Interestingly, we have found a strong inter-orbital coupling between the $t_{2g}$ and $e_g$ orbitals, which has not been reported earlier in $\rm SrTiO_3$ based 2D systems. Additionally, our observations highlight that the cubic Rashba splitting in this system surpasses the linear Rashba splitting, contrary to experimental findings. This comprehensive analysis contributes to a refined understanding of the role of orbital mixing in Rashba splitting in sandwich oxide heterostructures.
Submitted 24 November, 2023;
originally announced November 2023.
-
LATIS: Lambda Abstraction-based Thermal Image Super-resolution
Authors:
Gargi Panda,
Soumitra Kundu,
Saumik Bhattacharya,
Aurobinda Routray
Abstract:
Single image super-resolution (SISR) is an effective technique to improve the quality of low-resolution thermal images. Recently, transformer-based methods have achieved significant performance in SISR. However, in the SR task, only a small number of pixels are involved in the transformer's self-attention (SA) mechanism due to the computational complexity of the attention mechanism. The lambda abstraction is a promising alternative to SA in modeling long-range interactions while being computationally more efficient. This paper presents lambda abstraction-based thermal image super-resolution (LATIS), a novel lightweight architecture for SISR of thermal images. LATIS sequentially captures local and global information using the local and global feature block (LGFB). In LGFB, we introduce a global feature extraction (GFE) module based on the lambda abstraction mechanism, and a channel-shuffle and convolution (CSConv) layer to encode local context. Besides, to improve the performance further, we propose a differentiable patch-wise histogram-based loss function. Experimental results demonstrate that our LATIS, with the least model parameters and complexity, achieves better or comparable performance to state-of-the-art methods across multiple datasets.
Submitted 17 November, 2023;
originally announced November 2023.
-
Low spin spectroscopy of neutron-rich 43,44,45Cl via β and βn decay
Authors:
V. Tripathi,
S. Bhattacharya,
E. Rubino,
C. Benetti,
J. F. Perello,
S. L. Tabor,
S. N. Liddick,
P. C. Bender,
M. P. Carpenter,
J. J. Carroll,
A. Chester,
C. J. Chiara,
K. Childers,
B. R. Clark,
B. P. Crider,
J. T. Harke,
R. Jain,
B. Longfellow,
S. Luitel,
M. Mogannam,
T. H. Ogunbeku,
A. L. Richard,
S. Saha,
N. Shimizu,
O. A. Shehu
, et al. (5 additional authors not shown)
Abstract:
β decay of the neutron-rich isotopes 43,45S, studied at the National Superconducting Cyclotron Laboratory, is reported here. β-delayed γ transitions were detected by an array of 16 clover detectors surrounding the Beta Counting Station, which consists of a 40x40 Double Sided Silicon Strip Detector followed by a Single Sided Silicon Strip Detector. β decay half-lives have been extracted for 43,45S by correlating implants and decays in the pixelated implant detector with further coincidence with γ transitions in the daughter nucleus. The level structure of 43,45Cl is expanded by the addition of 20 new γ transitions in 43Cl and 8 in 45Cl, with the observation of core-excited negative-parity states for the first time. For 45S decay, a large fraction of the β decay strength goes to delayed neutron emission populating states in 44Cl, which are also presented. Comparison of experimental observations is made to detailed shell-model calculations using the SDPFSDG-MU interaction to highlight the role of the diminished N = 28 neutron shell gap and the near degeneracy of the proton s1/2 and d3/2 orbitals on the structure of the neutron-rich Cl isotopes. The current work also provides further support to a ground state spin-parity assignment of 3/2+ in 45Cl.
Submitted 19 November, 2023;
originally announced November 2023.
-
Search for the origin of wobbling motion in the $ A \approx 130 $ region: The case of $^{131}$Xe
Authors:
S. Chakraborty,
S. Bhattacharyya,
R. Banik,
Soumik Bhattacharya,
G. Mukherjee,
C. Bhattacharya,
S. Biswas,
S. Rajbanshi,
Shabir Dar,
S. Nandi,
Sajad Ali,
S. Chatterjee,
S. Das,
S. Das Gupta,
S. S. Ghugre,
A. Goswami,
A. Lemasson,
Debasish Mondal,
S. Mukhopadhyay,
A. Navin,
H. Pai,
Surajit Pal,
Deepak Pandit,
R. Raut,
Prithwijita Ray
, et al. (2 additional authors not shown)
Abstract:
In-beam $γ$-ray spectroscopy of $^{131}$Xe has been carried out to study the structure of the intruder $νh_{11/2}$ band. Excited states were populated via an $α$-induced fusion-evaporation reaction at $E_α = 38$ MeV. Inspection of $γγ$-coincidence data resulted in the identification of a new rotational sequence. Based on the systematics of excitation energy, assigned spin-parity, decay pattern, and the electromagnetic character of the inter-band $ΔI = 1$ $γ$-transitions, this sequence is proposed as the unfavoured signature partner of the $νh_{11/2}$ band. The structure of this band is further illuminated in the light of the triaxial particle rotor model (TPRM). The possibility of wobbling excitation in $N = 77$ Xe-Ba-Ce isotones has been explored in a systematic manner.
Submitted 16 November, 2023;
originally announced November 2023.
-
Measurement of the Hoyle State Radiative Transition Width
Authors:
T. K. Rana,
Deepak Pandit,
S. Manna,
S. Kundu,
K. Banerjee,
A. Sen,
R. Pandey,
G. Mukherjee,
T. K. Ghosh,
S. S. Nayak,
R. Shil,
P. Karmakar,
K. Atreya,
K. Rani,
D. Paul,
Rajkumar Santra,
A. Sultana,
S. Basu,
S. Pal,
S. Sadhukhan,
Debasish Mondal,
S. Mukhopadhyay,
Srijit Bhattacharya,
Surajit Pal,
Pankaj Pant
, et al. (8 additional authors not shown)
Abstract:
The radiative decay of the Hoyle state is the doorway to the production of heavier elements in stellar environments. Here we report an exclusive measurement of electric quadrupole (E$_2$) transitions of the Hoyle state to the ground state of $^{12}$C through the $^{12}$C(p, p$^\prime$$γ$$γ$)$^{12}$C reaction. Triple coincidence measurement yields a value of radiative branching ratio $Γ_{rad}$/$Γ$ = 4.01 (30) $\times$ 10$^{-4}$. The result has been corroborated by an independent experiment based on the complete kinematical measurement via the $^{12}$C(p, p$^\prime$)$^{12}$C reaction ($Γ_{rad}$/$Γ$ = 4.04 (30) $\times$ 10$^{-4}$). Using our results together with the currently adopted values of $Γ_π$(E$_0$)/$Γ$ and $Γ_π$($E_0$), the radiative width of the Hoyle state is found to be 3.75 (40) $\times$ 10$^{-3}$ eV. We emphasize here that our result is not in agreement with the 34$\%$ increase in the radiative decay width of the Hoyle state measured recently, but is consistent with the currently adopted value.
Submitted 26 November, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Arboricity-Dependent Algorithms for Edge Coloring
Authors:
Sayan Bhattacharya,
Martín Costa,
Nadav Panski,
Shay Solomon
Abstract:
The problem of edge coloring has been extensively studied over the years. Recently, this problem has received significant attention in the dynamic setting, where we are given a dynamic graph evolving via a sequence of edge insertions and deletions and our objective is to maintain an edge coloring of the graph.
Currently, it is not known whether it is possible to maintain a $(Δ+ O(Δ^{1 - μ}))$-edge coloring in $\tilde{O}(1)$ update time, for any constant $μ> 0$, where $Δ$ is the maximum degree of the graph. In this paper, we show how to efficiently maintain a $(Δ+ O(α))$-edge coloring in $\tilde O(1)$ amortized update time, where $α$ is the arboricity of the graph. Thus, we answer this question in the affirmative for graphs of sufficiently small arboricity.
Submitted 7 February, 2024; v1 submitted 14 November, 2023;
originally announced November 2023.
-
A practical key-recovery attack on LWE-based key-encapsulation mechanism schemes using Rowhammer
Authors:
Puja Mondal,
Suparna Kundu,
Sarani Bhattacharya,
Angshuman Karmakar,
Ingrid Verbauwhede
Abstract:
Physical attacks are serious threats to cryptosystems deployed in the real world. In this work, we propose a microarchitectural end-to-end attack methodology on generic lattice-based post-quantum key encapsulation mechanisms to recover the long-term secret key. Our attack targets a critical component of a Fujisaki-Okamoto transform that is used in the construction of almost all lattice-based key encapsulation mechanisms. We demonstrate our attack model on practical schemes such as Kyber and Saber by using Rowhammer. We show that our attack is highly practical and imposes few preconditions on the attacker to succeed. As an additional contribution, we propose an improved version of the plaintext checking oracle, which is used by almost all physical attack strategies on lattice-based key-encapsulation mechanisms. Our improvement reduces the number of queries to the plaintext checking oracle by as much as $39\%$ for Saber and approximately $23\%$ for Kyber768. This can be of independent interest and can also be used to reduce the complexity of other attacks.
Submitted 14 November, 2023;
originally announced November 2023.
-
Particle Identification at VAMOS++ with Machine Learning Techniques
Authors:
Y. Cho,
Y. H. Kim,
S. Choi,
J. Park,
S. Bae,
K. I. Hahn,
Y. Son,
A. Navin,
A. Lemasson,
M. Rejmund,
D. Ramos,
D. Ackermann,
A. Utepov,
C. Fourgeres,
J. C. Thomas,
J. Goupil,
G. Fremont,
G. de France,
Y. X. Watanabe,
Y. Hirayama,
S. Jeong,
T. Niwase,
H. Miyatake,
P. Schury,
M. Rosenbusch
, et al. (23 additional authors not shown)
Abstract:
A multi-nucleon transfer reaction between a 136Xe beam and a 198Pt target was performed using the VAMOS++ spectrometer at GANIL to study the structure of n-rich nuclei around N=126. Unambiguous charge state identification was obtained by combining two supervised machine learning methods: a deep neural network (DNN) and positional correction using a gradient-boosting decision tree (GBDT). The new method reduced the complexity of the kinetic energy calibration and outperformed the conventional method, improving the charge state resolution by 8%.
Submitted 14 November, 2023; v1 submitted 13 November, 2023;
originally announced November 2023.
-
Gravitational memory signal from neutrino self-interactions in supernova
Authors:
Soumya Bhattacharya,
Debanjan Bose,
Indranil Chakraborty,
Arpan Hait,
Subhendra Mohanty
Abstract:
Neutrinos with large self-interactions, arising from exchange of light scalars or vectors with mass $M_φ\simeq 10\,{\rm MeV}$, can play a useful role in cosmology for structure formation and solving the Hubble tension. It has been proposed that large self-interactions of neutrinos may change the observed properties of supernovae, such as the neutrino luminosity or the duration of the neutrino burst. In this paper, we study the gravitational wave memory signal arising from supernova neutrinos. Our results reveal that the memory signal for self-interacting neutrinos is weaker than that for free-streaming neutrinos in the high frequency range. Implications for detecting and differentiating between such signals for planned space-borne detectors, DECIGO and BBO, are also discussed.
Submitted 4 September, 2024; v1 submitted 6 November, 2023;
originally announced November 2023.
-
Nibbling at Long Cycles: Dynamic (and Static) Edge Coloring in Optimal Time
Authors:
Sayan Bhattacharya,
Martín Costa,
Nadav Panski,
Shay Solomon
Abstract:
We consider the problem of maintaining a $(1+ε)Δ$-edge coloring in a dynamic graph $G$ with $n$ nodes and maximum degree at most $Δ$. The state-of-the-art update time is $O_ε(\text{polylog}(n))$, by Duan, He and Zhang [SODA'19] and by Christiansen [STOC'23], and more precisely $O(\log^7 n/ε^2)$, where $Δ= Ω(\log^2 n / ε^2)$.
The following natural question arises: What is the best possible update time of an algorithm for this task? More specifically, \textbf{ can we bring it all the way down to some constant} (for constant $ε$)? This question coincides with the \emph{static} time barrier for the problem: Even for $(2Δ-1)$-coloring, there is only a naive $O(m \log Δ)$-time algorithm.
We answer this fundamental question in the affirmative, by presenting a dynamic $(1+ε)Δ$-edge coloring algorithm with $O(\log^4 (1/ε)/ε^9)$ update time, provided $Δ= Ω_ε(\text{polylog}(n))$. As a corollary, we also get the first linear time (for constant $ε$) \emph{static} algorithm for $(1+ε)Δ$-edge coloring; in particular, we achieve a running time of $O(m \log (1/ε)/ε^2)$.
We obtain our results by carefully combining a variant of the \textsc{Nibble} algorithm from Bhattacharya, Grandoni and Wajc [SODA'21] with the subsampling technique of Kulkarni, Liu, Sah, Sawhney and Tarnawski [STOC'22].
Submitted 6 November, 2023;
originally announced November 2023.
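For context on the $(2Δ-1)$-coloring baseline mentioned in the abstract above, here is a minimal sketch of a simple greedy edge coloring. It is not the paper's algorithm and not the $O(m \log Δ)$-time procedure the abstract refers to; it is only the textbook greedy argument (each edge sees at most $2(Δ-1)$ blocked colors, so $2Δ-1$ colors always suffice). The function name and the edge-list input format are assumptions made for illustration.

```python
from collections import defaultdict


def greedy_edge_coloring(edges):
    """Properly color the edges of a simple graph with at most 2*Delta - 1 colors.

    `edges` is a list of (u, v) pairs. Colors are integers starting at 0.
    This naive version scans colors linearly, so it is slower than the
    algorithms discussed in the abstract.
    """
    used = defaultdict(set)  # vertex -> colors already present at that vertex
    coloring = {}
    for u, v in edges:
        c = 0
        # At most (deg(u) - 1) + (deg(v) - 1) <= 2*Delta - 2 colors are blocked.
        while c in used[u] or c in used[v]:
            c += 1
        coloring[(u, v)] = c
        used[u].add(c)
        used[v].add(c)
    return coloring


# Usage: the complete graph K4 has maximum degree 3.
k4 = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
print(greedy_edge_coloring(k4))
```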
-
A Simple yet Efficient Ensemble Approach for AI-generated Text Detection
Authors:
Harika Abburi,
Kalyani Roy,
Michael Suesserman,
Nirmala Pudota,
Balaji Veeramani,
Edward Bowen,
Sanmitra Bhattacharya
Abstract:
Recent Large Language Models (LLMs) have demonstrated remarkable capabilities in generating text that closely resembles human writing across a wide range of styles and genres. However, such capabilities are prone to potential abuse, such as fake news generation, spam email creation, and misuse in academic assignments. Hence, it is essential to build automated approaches capable of distinguishing between artificially generated text and human-authored text. In this paper, we propose a simple yet efficient solution to this problem by ensembling predictions from multiple constituent LLMs. Compared to previous state-of-the-art approaches, which are perplexity-based or use ensembles with a number of LLMs, our condensed ensembling approach uses only two constituent LLMs to achieve comparable performance. Experiments conducted on four benchmark datasets for generative text classification show performance improvements in the range of 0.5 to 100\% compared to previous state-of-the-art approaches. We also study the influence that the training data from individual LLMs have on model performance. We found that substituting commercially-restrictive Generative Pre-trained Transformer (GPT) data with data generated from other open language models such as Falcon, Large Language Model Meta AI (LLaMA2), and Mosaic Pretrained Transformers (MPT) is a feasible alternative when developing generative text detectors. Furthermore, to demonstrate zero-shot generalization, we experimented with an English essays dataset, and results suggest that our ensembling approach can handle new data effectively.
Submitted 7 November, 2023; v1 submitted 6 November, 2023;
originally announced November 2023.
-
Electromagnetic extension of Buchdahl bound in $f(R,T)$ gravity
Authors:
Soumik Bhattacharya,
Ranjan Sharma,
Sunil D. Maharaj
Abstract:
We develop a static charged stellar model in $f(R,T)$ gravity where the modification is assumed to be linear in $T$ which is the trace of the energy momentum tensor. The exterior spacetime of the charged object is described by the Reissner-Nordström metric. The interior solution is obtained by invoking the Buchdahl-Vaidya-Tikekar ansatz, for the metric potential $g_{rr}$, which has a clear geometric interpretation. A detailed physical analysis of the model clearly shows distinct physical features of the resulting stellar configuration under such a modification. We find the maximum compactness bound for such a class of compact stars which is a generalization of the Buchdahl bound for a charged sphere described in $f(R,T)$ gravity. Our result shows physical behaviour that is distinct from general relativity.
Submitted 6 November, 2023;
originally announced November 2023.
-
Magnetic properties and spin dynamics in a spin-orbit driven Jeff = 1/2 triangular lattice antiferromagnet
Authors:
J. Khatua,
S. Bhattacharya,
A. M. Strydom,
A. Zorko,
J. S. Lord,
A. Ozarowski,
E. Kermarrec,
P. Khuntia
Abstract:
Frustration-induced strong quantum fluctuations accompanied by spin-orbit coupling and crystal electric field can give rise to rich and diverse magnetic phenomena associated with unconventional low-energy excitations in rare-earth based quantum magnets. Herein, we present crystal structure, magnetic susceptibility, specific heat, muon spin relaxation (muSR), and electron spin resonance (ESR) studies on polycrystalline samples of Ba6Yb2Ti4O17, in which Yb3+ ions constitute a perfect triangular lattice in the ab-plane without detectable anti-site disorder between atomic sites. The Curie-Weiss fit of low-temperature magnetic susceptibility data suggests spin-orbit entangled Jeff = 1/2 degrees of freedom of the Yb3+ spin with weak antiferromagnetic exchange interactions in the Kramers doublet ground state. The zero-field specific heat data reveal the presence of long-range magnetic order at TN = 77 mK, which is suppressed in a magnetic field of 1 T. The broad maximum in specific heat is attributed to the Schottky anomaly, implying the Zeeman splitting of the Kramers doublet ground state. The ESR measurements suggest the presence of anisotropic exchange interaction between the moments of Yb3+ spins and the well separated Kramers doublet state. muSR experiments reveal a fluctuating state of Yb3+ spins in the temperature range 0.1 K-100 K owing to depopulation of crystal electric field levels, which suggests that the Kramers doublets are well separated, consistent with thermodynamic and ESR results. In addition to the intraplane nearest-neighbor superexchange interaction, the interplane exchange interaction and anisotropy are expected to stabilize the long-range ordered state in this triangular lattice antiferromagnet.
Submitted 3 November, 2023;
originally announced November 2023.
-
Approximate Multiagent Reinforcement Learning for On-Demand Urban Mobility Problem on a Large Map (extended version)
Authors:
Daniel Garces,
Sushmita Bhattacharya,
Dimitri Bertsekas,
Stephanie Gil
Abstract:
In this paper, we focus on the autonomous multiagent taxi routing problem for a large urban environment where the location and number of future ride requests are unknown a priori, but can be estimated by an empirical distribution. Recent theory has shown that a rollout algorithm with a stable base policy produces a near-optimal stable policy. In the routing setting, a policy is stable if its execution keeps the number of outstanding requests uniformly bounded over time. Although rollout-based approaches are well-suited for learning cooperative multiagent policies with considerations for future demand, applying such methods to a large urban environment can be computationally expensive due to the large number of taxis required for stability. In this paper, we aim to address the computational bottleneck of multiagent rollout by proposing an approximate multiagent rollout-based two-phase algorithm that reduces computational costs, while still achieving a stable near-optimal policy. Our approach partitions the graph into sectors based on the predicted demand and the maximum number of taxis that can run sequentially given the user's computational resources. The algorithm then applies instantaneous assignment (IA) for re-balancing taxis across sectors and a sector-wide multiagent rollout algorithm that is executed in parallel for each sector. We provide two main theoretical results: 1) we characterize the number of taxis $m$ that is sufficient for IA to be stable; 2) we derive a necessary condition on $m$ to maintain stability for IA as time goes to infinity. Our numerical results show that our approach achieves stability for an $m$ that satisfies the theoretical conditions. We also empirically demonstrate that our proposed two-phase algorithm has equivalent performance to the one-at-a-time rollout over the entire map, but with significantly lower runtimes.
Submitted 18 February, 2025; v1 submitted 2 November, 2023;
originally announced November 2023.
-
Histopathological Image Analysis with Style-Augmented Feature Domain Mixing for Improved Generalization
Authors:
Vaibhav Khamankar,
Sutanu Bera,
Saumik Bhattacharya,
Debashis Sen,
Prabir Kumar Biswas
Abstract:
Histopathological images are essential for medical diagnosis and treatment planning, but interpreting them accurately using machine learning can be challenging due to variations in tissue preparation, staining and imaging protocols. Domain generalization aims to address such limitations by enabling the learning models to generalize to new datasets or populations. Style transfer-based data augmentation is an emerging technique that can be used to improve the generalizability of machine learning models for histopathological images. However, existing style transfer-based methods can be computationally expensive, and they rely on artistic styles, which can negatively impact model accuracy. In this study, we propose a feature domain style mixing technique that uses adaptive instance normalization to generate style-augmented versions of images. We compare our proposed method with existing style transfer-based data augmentation methods and find that it performs similarly or better, despite requiring less computation and time. Our results demonstrate the potential of feature domain statistics mixing in the generalization of learning models for histopathological image analysis.
Submitted 31 October, 2023;
originally announced October 2023.
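The adaptive instance normalization (AdaIN) operation named in the abstract above can be sketched on a single feature channel as follows. This is the standard AdaIN formula, $σ(y)\,(x - μ(x))/σ(x) + μ(y)$, applied to plain Python lists; it is not the authors' feature-domain mixing pipeline, and the function name, the `eps` stabilizer, and the toy inputs are assumptions made for illustration.

```python
from statistics import fmean, pstdev


def adain(content, style, eps=1e-5):
    """Re-normalize one channel of content features to the style statistics.

    `content` and `style` are flat lists of floats for a single channel.
    `eps` (an assumed stabilizer) avoids division by zero for flat channels.
    """
    mu_c, sd_c = fmean(content), pstdev(content)
    mu_s, sd_s = fmean(style), pstdev(style)
    return [sd_s * (x - mu_c) / (sd_c + eps) + mu_s for x in content]


# Usage: the output keeps the content's ordering but takes on the
# mean and spread of the style features.
mixed = adain([1.0, 2.0, 3.0, 4.0], [10.0, 20.0, 30.0, 40.0])
print(mixed)
```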
-
Resummation of local and non-local scalar self energies via the Schwinger-Dyson equation in de Sitter spacetime
Authors:
Sourav Bhattacharya,
Nitin Joshi,
Kinsuk Roy
Abstract:
We consider a massless and minimally coupled self-interacting quantum scalar field in the inflationary de Sitter spacetime. The scalar potential is taken to be a hybrid, $V(φ)= λφ^4/4!+βφ^3/3!$ ($λ>0$). Compared to the earlier well studied $β=0$ case, the present potential has a rolling down effect due to the $φ^3$ term, along with the usual bounding effect due to the $φ^4$ term. We begin by constructing the Schwinger-Dyson equation for the scalar Feynman propagator up to two loops, at ${\cal O}(λ)$, ${\cal O}(β^2)$, ${\cal O}(λ^2)$ and ${\cal O}(λβ^2)$. We consider first the local part of the scalar self energy and compute the rest mass squared of the scalar field, dynamically generated via the late time non-perturbative secular logarithms, by resumming the daisy-like graphs. The logarithms associated here are sub-leading, compared to those associated with the non-local, leading terms. We also argue that unlike the quartic case, considering merely the one loop results for the purpose of resummation does not give us any sensible result here. We next construct the non-perturbative two particle irreducible effective action up to three loops and derive from it the Schwinger-Dyson equation once again. This equation is satisfied by the non-perturbative Feynman propagator. By series expanding this propagator, the resummed local part of the self energy is shown to yield the same dynamical mass as obtained above. We next use this equation to resum the effect of the non-local part of the scalar self energy, and show that even though the perturbatively corrected propagator shows secular growth at late times, there exists one resummed solution which is vanishing for large spatial separations, in qualitative agreement with that of the stochastic formalism.
Submitted 13 August, 2024; v1 submitted 30 October, 2023;
originally announced October 2023.
-
Nonrelativistic spin splittings and altermagnetism in twisted bilayers of centrosymmetric antiferromagnets
Authors:
Sajjan Sheoran,
Saswata Bhattacharya
Abstract:
Magnetism-driven nonrelativistic spin splittings (NRSS) are promising for highly efficient spintronics applications. Although 2D centrosymmetric (in four-dimensional spacetime) antiferromagnets are abundant, they have not received extensive research attention owing to symmetry-forbidden spin polarization and magnetization. Here, we demonstrate a paradigm to harness NRSS by twisting the bilayer of centrosymmetric antiferromagnets with commensurate twist angles. We observe $i$-wave altermagnetism and spin-momentum locking by first-principles simulations and symmetry analysis on prototypical MnPSe$_3$ and MnSe antiferromagnets. The strength of NRSS (up to 80 meVÅ) induced by twisting is comparable to SOC-induced linear Rashba-Dresselhaus effects. The results also demonstrate how applying biaxial strain and a vertical electric field tune the NRSS. The findings reveal the untapped potential of centrosymmetric antiferromagnets and thus expand the material's horizons in spintronics.
Submitted 1 April, 2024; v1 submitted 30 October, 2023;
originally announced October 2023.
-
In-Context Ability Transfer for Question Decomposition in Complex QA
Authors:
Venktesh V,
Sourangshu Bhattacharya,
Avishek Anand
Abstract:
Answering complex questions is a challenging task that requires question decomposition and multistep reasoning for arriving at the solution. While existing supervised and unsupervised approaches are specialized to a certain task and involve training, recently proposed prompt-based approaches offer generalizable solutions to tackle a wide variety of complex question-answering (QA) tasks. However, existing prompt-based approaches that are effective for complex QA tasks involve expensive hand annotations from experts in the form of rationales and are not generalizable to newer complex QA scenarios and tasks. We propose ICAT (In-Context Ability Transfer), which induces reasoning capabilities in LLMs without any LLM fine-tuning or manual annotation of in-context samples. We transfer the ability to decompose complex questions to simpler questions or generate step-by-step rationales to LLMs, by careful selection from available data sources of related tasks. We also propose an automated uncertainty-aware exemplar selection approach for selecting examples from transfer data sources. Finally, we conduct large-scale experiments on a variety of complex QA tasks involving numerical reasoning, compositional complex QA, and heterogeneous complex QA, which require decomposed reasoning. We show that ICAT convincingly outperforms existing prompt-based solutions without involving any model training, showcasing the benefits of re-using existing abilities.
Submitted 26 October, 2023;
originally announced October 2023.
-
On monic abelian trace-one cubic polynomials
Authors:
Shubhrajit Bhattacharya,
Andrew O'Desky
Abstract:
We compute the asymptotic number of monic trace-one integral polynomials with Galois group $C_3$ and bounded height. For such polynomials we compute a height function coming from toric geometry and introduce a parametrization using the quadratic cyclotomic field $\mathbb Q(\sqrt{-3})$. We also give a formula for the number of polynomials of the form $t^3 -t^2 + at + b \in \mathbb Z[t]$ with Galois group $C_3$ for a fixed integer $a$.
Submitted 26 October, 2023;
originally announced October 2023.
-
Fully Dynamic $k$-Clustering in $\tilde O(k)$ Update Time
Authors:
Sayan Bhattacharya,
Martín Costa,
Silvio Lattanzi,
Nikos Parotsidis
Abstract:
We present an $O(1)$-approximate fully dynamic algorithm for the $k$-median and $k$-means problems on metric spaces with amortized update time $\tilde O(k)$ and worst-case query time $\tilde O(k^2)$. We complement our theoretical analysis with the first in-depth experimental study for the dynamic $k$-median problem on general metrics, focusing on comparing our dynamic algorithm to the current state-of-the-art by Henzinger and Kale [ESA'20]. Finally, we also provide a lower bound for dynamic $k$-median which shows that any $O(1)$-approximate algorithm with $\tilde O(\text{poly}(k))$ query time must have $\tilde Ω(k)$ amortized update time, even in the incremental setting.
Submitted 26 October, 2023;
originally announced October 2023.
-
Unveiling Multilinguality in Transformer Models: Exploring Language Specificity in Feed-Forward Networks
Authors:
Sunit Bhattacharya,
Ondrej Bojar
Abstract:
Recent research suggests that the feed-forward module within Transformers can be viewed as a collection of key-value memories, where the keys learn to capture specific patterns from the input based on the training examples. The values then combine the output from the 'memories' of the keys to generate predictions about the next token. This leads to an incremental process of prediction that gradually converges towards the final token choice near the output layers. This interesting perspective raises questions about how multilingual models might leverage this mechanism. Specifically, for autoregressive models trained on two or more languages, do all neurons (across layers) respond equally to all languages? No! Our hypothesis centers around the notion that during pretraining, certain model parameters learn strong language-specific features, while others learn more language-agnostic (shared across languages) features. To validate this, we conduct experiments utilizing parallel corpora of two languages that the model was initially pretrained on. Our findings reveal that the layers closest to the network's input or output tend to exhibit more language-specific behaviour compared to the layers in the middle.
Submitted 24 October, 2023;
originally announced October 2023.
-
Generalized Parton Distributions from Lattice QCD with Asymmetric Momentum Transfer: Axial-vector case
Authors:
Shohini Bhattacharya,
Krzysztof Cichy,
Martha Constantinou,
Jack Dodson,
Xiang Gao,
Andreas Metz,
Joshua Miller,
Swagato Mukherjee,
Peter Petreczky,
Fernanda Steffens,
Yong Zhao
Abstract:
Recently, we made significant advancements in improving the computational efficiency of lattice QCD calculations for Generalized Parton Distributions (GPDs). This progress was achieved by adopting calculations of matrix elements in asymmetric frames, deviating from the computationally expensive symmetric frame typically used, and allowing freedom in the choice for the distribution of the momentum transfer between the initial and final states. A crucial aspect of this approach involves the adoption of a Lorentz covariant parameterization for the matrix elements, introducing Lorentz-invariant amplitudes. This approach also allows us to propose an alternative definition of quasi-GPDs, ensuring frame independence and potentially reducing power corrections in matching to light-cone GPDs. In our previous work, we presented lattice QCD results for twist-2 unpolarized GPDs ($H$ and $E$) of quarks obtained from calculations performed in asymmetric frames at zero skewness. Building upon this work, we now introduce a novel Lorentz covariant parameterization for the axial-vector matrix elements. We employ this parameterization to compute the axial-vector GPD $\widetilde{H}$ at zero skewness, using an $N_f=2+1+1$ ensemble of twisted mass fermions with clover improvement. The light-quark masses employed in our calculations correspond to a pion mass of approximately 260 MeV.
Submitted 29 February, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Photovoltaic grid-forming control strategy investigation using hardware-in-the-loop experiments
Authors:
Somesh Bhattacharya,
Chrysanthos Charalambous,
Anja Banjac,
Zoran Miletic,
Thomas Strasser,
Brian Azzopardi,
Christina Papadimitriou,
Venizelos Efthymiou,
Alexis Polycarpou
Abstract:
The frequency stability of a power system is of paramount importance, as fast frequency swings in the system can lead to oscillatory instability, and thereby blackouts. A grid-connected microgrid that can operate in islanded mode can also exhibit such deteriorating effects due to the higher share of converter-based sources. In this paper, coordinated frequency control within a distribution network with a higher share of photovoltaics (PV) is discussed. The main objective of this paper is to test the grid-forming capabilities of PVs without requiring energy storage in the network. The tests were carried out with the help of the Typhoon Hardware-in-the-loop (HIL) platform using a real Cypriot network feeder. The real-time results confirm the efficacy of the PV as a grid-forming inverter, provided it has sufficient input (irradiance) to supply the loads within the system of interest. The grid-forming PV can also reconnect with the utility grid through a synchronizer switch that requires minimal communication, which makes the overall control independent of any other power source, subject to certain irradiance and loading conditions.
Submitted 10 October, 2023;
originally announced October 2023.
-
Systematic Evaluation of Randomized Cache Designs against Cache Occupancy
Authors:
Anirban Chakraborty,
Nimish Mishra,
Sayandeep Saha,
Sarani Bhattacharya,
Debdeep Mukhopadhyay
Abstract:
Randomizing the address-to-set mapping and partitioning of the cache has been shown to be an effective mechanism in designing secured caches. Several designs have been proposed on a variety of rationales: (1) randomized design, (2) randomized-and-partitioned design, and (3) pseudo-fully associative design. This work fills in a crucial gap in current literature on randomized caches: currently most randomized cache designs defend only against contention-based attacks, and leave out considerations of cache occupancy. We perform a systematic evaluation of 5 randomized cache designs (CEASER, CEASER-S, MIRAGE, Scatter-Cache, and Sass-cache) against cache occupancy with respect to both performance and security.
With respect to performance, we first establish that benchmarking strategies used by contemporary designs are unsuitable for a fair evaluation (because of differing cache configurations, choice of benchmarking suites, additional implementation-specific assumptions). We thus propose a uniform benchmarking strategy, which allows us to perform a fair and comparative analysis across all designs under various replacement policies. Likewise, with respect to security against cache occupancy attacks, we evaluate the cache designs against various threat assumptions: (1) covert channels, (2) process fingerprinting, and (3) AES key recovery (to the best of our knowledge, this work is the first to demonstrate full AES key recovery on a randomized cache design using cache occupancy attack). Our results establish the need to also consider cache occupancy side-channel in randomized cache design considerations.
Submitted 30 January, 2025; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Breaking absolute separability with quantum switch
Authors:
Sravani Yanamandra,
P V Srinidhi,
Samyadeb Bhattacharya,
Indranil Chakrabarty,
Suchetana Goswami
Abstract:
Absolute separable (AS) quantum states are those states from which it is impossible to create entanglement, even under global unitary operations. It is known from the resource theory of non-absolute separability that the set of absolute separable states forms a convex and compact set, and global unitaries are free operations. We show that the action of a quantum switch controlled by an ancilla qubit over the global unitaries can break this robustness of AS states and produce ordinary separable states. First, we consider bipartite qubit systems and find the effect of quantum switch starting from the states sitting on the boundary of the set of absolute separable states. As particular examples, we illustrate what happens to modified Werner states and Bell diagonal (BD) states. For the Bell diagonal states, we provide the structure for the set of AS BD states and show how the structure changes under the influence of a switch. Further, we consider numerical generalisation of the global unitary operations and show that it is always possible to take AS states out of the convex set under switching operations. We also generalised our results in higher dimensions.
Submitted 7 October, 2023;
originally announced October 2023.
-
COVID-19 South African Vaccine Hesitancy Models Show Boost in Performance Upon Fine-Tuning on M-pox Tweets
Authors:
Nicholas Perikli,
Srimoy Bhattacharya,
Blessing Ogbuokiri,
Zahra Movahedi Nia,
Benjamin Lieberman,
Nidhi Tripathi,
Salah-Eddine Dahbi,
Finn Stevenson,
Nicola Bragazzi,
Jude Kong,
Bruce Mellado
Abstract:
Very large numbers of M-pox cases have, since the start of May 2022, been reported in non-endemic countries, leading many to fear that the M-pox outbreak would rapidly transition into another pandemic while the COVID-19 pandemic ravages on. Given the similarities of M-pox with COVID-19, we chose to test the performance of COVID-19 models trained on South African Twitter data on a hand-labelled M-pox dataset before and after fine-tuning. More than 20k M-pox-related tweets from South Africa were hand-labelled as being either positive, negative or neutral. After fine-tuning these COVID-19 models on the M-pox dataset, the F1-scores increased by more than 8%, falling just short of 70%, but still outperforming state-of-the-art models and well-known classification algorithms. An LDA-based topic modelling procedure was used to compare the misclassified M-pox tweets of the original COVID-19 RoBERTa model with its fine-tuned version, and from this analysis, we were able to draw conclusions on how to build more sophisticated models.
Submitted 4 October, 2023;
originally announced October 2023.
-
The AstroSat UV Deep Field North: Direct determination of the UV Luminosity Function and its evolution from z~0.8-0.4
Authors:
Souradeep Bhattacharya,
Kanak Saha,
Chayan Mondal
Abstract:
We characterize the evolution of the rest-frame 1500 Å UV luminosity function (UVLF) from AstroSat/UVIT F154W and N242W imaging in the Great Observatories Origins Survey North (GOODS-N) field. With deep FUV observations, we construct the UVLF for galaxies at z$<0.13$ and subsequently characterise it with a Schechter function fit. The fitted parameters are consistent with previous determinations. With deep NUV observations, we are able to construct the UVLF in seven redshift bins in the range z $\sim$ 0.8 - 0.4, with galaxies identified till $\sim$2 mag fainter than previous surveys, owing to the high angular-resolution of UVIT. The fitted Schechter function parameters are obtained for these UVLFs. At z $\sim$ 0.8 - 0.7, we also utilize Hubble Space Telescope (HST) F275W observations in the GOODS-N field to construct the UVLF in 2 redshift bins, whose fitted Schechter function parameters are then found to be consistent with that determined from UVIT at z $\sim$ 0.75. We thus probe the variation of the fitted UVLF parameters over z $\sim$ 0.8 - 0.4, a span of $\sim$2.7 Gyr in age. We find that the slope of the Schechter function, $α$, is at its steepest at z $\sim$ 0.65, implying highest star-formation at this instant with galaxies being relatively more passive before and after this time. We infer that this may be a short-lived instance of increased cosmic star-formation even though cosmic star-formation may be winding-down over longer timespan at this redshift range.
Submitted 25 June, 2024; v1 submitted 3 October, 2023;
originally announced October 2023.
-
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance
Authors:
Alloy Das,
Sanket Biswas,
Ayan Banerjee,
Josep Lladós,
Umapada Pal,
Saumik Bhattacharya
Abstract:
The adaptation capability to a wide range of domains is crucial for scene text spotting models when deployed to real-world conditions. However, existing state-of-the-art (SOTA) approaches usually incorporate scene text detection and recognition simply by pretraining on natural scene text datasets, which do not directly exploit the intermediate feature representations between multiple domains. Here, we investigate the problem of domain-adaptive scene text spotting, i.e., training a model on multi-domain source data such that it can directly adapt to target domains rather than being specialized for a specific domain or scenario. Further, we investigate a transformer baseline called Swin-TESTR to focus on solving scene-text spotting for both regular and arbitrary-shaped scene text along with an exhaustive evaluation. The results clearly demonstrate the potential of intermediate representations to achieve significant performance on text spotting benchmarks across multiple domains (e.g. language, synth-to-real, and documents), both in terms of accuracy and efficiency.
Submitted 1 November, 2023; v1 submitted 2 October, 2023;
originally announced October 2023.
-
Quantitative Analysis of Social Influence & Digital Piracy Contagion with Differential Equations on Networks
Authors:
Dibyajyoti Mallick,
Kumar Gaurav,
Saumik Bhattacharya,
Sayantari Ghosh
Abstract:
Studies of social contagion regularly borrow network models to include social heterogeneity when studying the propagation of social influences and opinions. Such studies provide valuable insights, but the social network structures are not well explored in them. In this research, we methodically study the trends in online piracy with a continuous ODE approach and differential equations on graphs, to have a clear comparative view. We first formulate a compartmental model to mathematically study bifurcations and thresholds, and later move on with a network-based analysis to illustrate the proliferation of online piracy dynamics with an epidemiological approach over a social network. We propose a solution to the online piracy problem: raising awareness among individuals through media campaigns, which could be a useful factor for the eradication and control of online piracy. Next, using degree-block approximation, network analysis has been performed to investigate the phenomena from a heterogeneous approach and to derive the threshold condition for the persistence of piracy in the population in a steady state. Based on the behavioral responses of individuals in a society due to the effect of media, we examine the system through the aid of realistic parameter selection to better understand the complexity of the dynamics and propose control strategies.
Submitted 27 September, 2023;
originally announced September 2023.
-
Characterization of LAPPD timing at CERN PS testbeam
Authors:
Deb Sankar Bhattacharya,
Andrea Bressan,
Chandradoy Chatterjee,
Silvia Dalla Torre,
Mauro Gregori,
Alexander Kiselev,
Stefano Levorato,
Anna Martin,
Saverio Minutoli,
Mikhail Osipenko,
Richa Rai,
Marco Ripani,
Fulvio Tessarotto,
Triloki Triloki
Abstract:
Large Area Picosecond PhotoDetectors (LAPPDs) are photosensors based on microchannel plate technology with about 400 cm$^2$ sensitive area. The external readout plane of a capacitively coupled LAPPD can be segmented into pads providing a spatial resolution down to 1 mm scale. The LAPPD signals have about 0.5 ns risetime followed by a slightly longer falltime and their amplitude reaches a few dozen mV per single photoelectron. In this article, we report on the measurement of the time resolution of an LAPPD prototype in a test beam exercise at CERN PS. Most of the previous measurements of LAPPD time resolution had been performed with laser sources. In this article we report time resolution measurements obtained through the detection of Cherenkov radiation emitted by high energy hadrons. Our approach has been demonstrated capable of measuring time resolutions as fine as 25-30 ps. The available prototype had performance limitations, which prevented us from applying the optimal high voltage setting. The measured time resolution for single photoelectrons is about 80 ps r.m.s.
Submitted 26 September, 2023;
originally announced September 2023.
-
Relating CP divisibility of dynamical maps with compatibility of channels
Authors:
Arindam Mitra,
Debashis Saha,
Samyadeb Bhattacharya,
A. S. Majumdar
Abstract:
The role of CP-indivisibility and incompatibility as valuable resources for various information-theoretic tasks is widely acknowledged. This study delves into the intricate relationship between CP-divisibility and channel compatibility. Our investigation focuses on the behaviour of incompatibility robustness of quantum channels for a pair of generic dynamical maps. We show that the incompatibility robustness of channels is monotonically non-increasing for a pair of generic CP-divisible dynamical maps. Further, our explicit study of the behaviour of incompatibility robustness with time for some specific dynamical maps reveals non-monotonic behaviour in the CP-indivisible regime. Additionally, we propose a measure of CP-indivisibility based on the incompatibility robustness of quantum channels. Our investigation provides valuable insights into the nature of quantum dynamical maps and their relevance in information-theoretic applications.
Submitted 1 May, 2024; v1 submitted 19 September, 2023;
originally announced September 2023.
-
Generative AI Text Classification using Ensemble LLM Approaches
Authors:
Harika Abburi,
Michael Suesserman,
Nirmala Pudota,
Balaji Veeramani,
Edward Bowen,
Sanmitra Bhattacharya
Abstract:
Large Language Models (LLMs) have shown impressive performance across a variety of Artificial Intelligence (AI) and natural language processing tasks, such as content creation, report generation, etc. However, unregulated malign application of these models can create undesirable consequences such as generation of fake news, plagiarism, etc. As a result, accurate detection of AI-generated language can be crucial in responsible usage of LLMs. In this work, we explore 1) whether a certain body of text is AI generated or written by human, and 2) attribution of a specific language model in generating a body of text. Texts in both English and Spanish are considered. The datasets used in this study are provided as part of the Automated Text Identification (AuTexTification) shared task. For each of the research objectives stated above, we propose an ensemble neural model that generates probabilities from different pre-trained LLMs which are used as features to a Traditional Machine Learning (TML) classifier following it. For the first task of distinguishing between AI and human generated text, our model ranked in fifth and thirteenth place (with macro $F1$ scores of 0.733 and 0.649) for English and Spanish texts, respectively. For the second task on model attribution, our model ranked in first place with macro $F1$ scores of 0.625 and 0.653 for English and Spanish texts, respectively.
Submitted 14 September, 2023;
originally announced September 2023.
-
Gravitational wave memory for a class of static and spherically symmetric spacetimes
Authors:
Soumya Bhattacharya,
Shramana Ghosh
Abstract:
This article aims at comparing the gravitational wave memory effect in a Schwarzschild spacetime with that of other compact objects with static and spherically symmetric spacetimes, with the purpose of proposing a procedure for differentiating between various compact object geometries. We do this by considering the relative evolution of two nearby test geodesics in different backgrounds in the presence and absence of a gravitational wave pulse and comparing them. Memory effect due to a gravitational wave would ensure that there is a permanent effect on each spacetime and the corresponding geodesic evolution, being metric dependent, would display distinct results in each case. For a complete picture, we have considered both displacement and velocity memory effect in each geometry.
Submitted 8 September, 2023;
originally announced September 2023.
-
Determination of dynamical ages of open clusters through the A$^+$ parameter -- II
Authors:
Khushboo K. Rao,
Kaushar Vaidya,
Manan Agarwal,
Shanmugha Balan,
Souradeep Bhattacharya
Abstract:
Blue straggler stars (BSS), among the most massive members of star clusters, have been used for over a decade to investigate mass segregation and estimate the dynamical ages of globular clusters (GCs) and open clusters (OCs). This work is an extension of our previous study, in which we investigated a correlation between theoretically estimated dynamical ages and the observed $A^+_{\mathrm{rh}}$ values, which represent the sedimentation level of BSS with respect to the reference population. Here, we use the ML-MOC algorithm on \textit{Gaia} EDR3 data to extend this analysis to 23 OCs. Using cluster properties and identified members, we estimate their dynamical and physical parameters. In order to estimate the $A^+_{\mathrm{rh}}$ values, we use the main sequence and main sequence turnoff stars as the reference population. OCs are observed to exhibit a wide range of degrees of dynamical evolution, ranging from dynamically young to late stages of intermediate dynamical age. Hence, we classify OCs into three distinct dynamical stages based on their relationship to $A^+_{\mathrm{rh}}$ and $N_{\text{relax}}$. NGC 2682 and King 2 are discovered to be the most evolved OCs, like Family III GCs, while Berkeley 18 is the least evolved OC. Melotte 66 and Berkeley 31 are peculiar OCs because none of their dynamical and physical parameters correlate with their BSS segregation levels.
Submitted 6 September, 2023;
originally announced September 2023.
-
Approximate recoverability and the quantum data processing inequality
Authors:
Saptak Bhattacharya
Abstract:
In this paper, we discuss the quantum data processing inequality and its refinements that are physically meaningful in the context of approximate recoverability. An important conjecture regarding this due to Seshadreesan et al. in J. Phys. A: Math. Theor. 48 (2015) is disproved. We prove some inequalities capturing universal approximate recoverability with the Petz recovery map for the sandwiched quasi and Rényi relative entropies for the parameter $t=2$. We also obtain convexity theorems on some parametrized versions of the relative entropy and fidelity, which can be of independent interest.
Submitted 16 April, 2025; v1 submitted 5 September, 2023;
originally announced September 2023.
-
On the $α$/Fe bimodality of the M31 disks
Authors:
Chiaki Kobayashi,
Souradeep Bhattacharya,
Magda Arnaboldi,
Ortwin Gerhard
Abstract:
An outstanding question is whether the $α$/Fe bimodality exists in disk galaxies other than the Milky Way. Here we present a bimodality using our state-of-the-art galactic chemical evolution models that can explain various observations in the Andromeda Galaxy (M31) disks, namely, elemental abundances both of planetary nebulae and of red-giant branch stars recently observed with the James Webb Space Telescope. We find that in M31 a high-$α$ thicker-disk population out to 30 kpc formed by a more intense initial starburst than in the Milky Way. We also find a young low-$α$ thin disk within 14 kpc, which is formed by a secondary star formation M31 underwent about 2-4.5 Gyr ago, probably triggered by a wet merger. In the outer disk, however, the planetary nebula observations indicate a slightly higher-$α$ young ($\sim$2.5 Gyr) population at a given metallicity, possibly formed by secondary star formation from almost pristine gas. Therefore, an $α$/Fe bimodality is seen in the inner disk ($<$14 kpc), while only a slight $α$/Fe offset of the young population is seen in the outer disk ($>$18 kpc). The appearance of the $α$/Fe bimodality depends on the merging history at various galactocentric radii, and wide-field multi-object spectroscopy is required for unveiling the history of M31.
Submitted 4 September, 2023;
originally announced September 2023.
-
Wall-attached convection under strong inclined magnetic fields
Authors:
Shashwat Bhattacharya,
Thomas Boeck,
Dmitry Krasnov,
Jörg Schumacher
Abstract:
We employ a linear stability analysis and direct numerical simulations to study the characteristics of wall-modes in thermal convection in a rectangular box under strong and inclined magnetic fields. The walls of the convection cell are electrically insulated. The stability analysis assumes periodicity in the spanwise direction perpendicular to the plane of the homogeneous magnetic field. Our study shows that for a fixed vertical magnetic field, the imposition of horizontal magnetic fields results in an increase of the critical Rayleigh number along with a decrease in the wavelength of the wall modes. The wall modes become tilted along the direction of the resulting magnetic fields and therefore extend further into the bulk as the horizontal magnetic field is increased. Once the modes localized on the opposite walls interact, the critical Rayleigh number decreases again and eventually drops below the value for onset with a purely vertical field. We find that for sufficiently strong horizontal magnetic fields, the steady wall modes occupy the entire bulk and therefore convection is no longer restricted to the sidewalls. The above results are confirmed by direct numerical simulations of the nonlinear evolution of magnetoconvection.
Submitted 1 September, 2023;
originally announced September 2023.
-
Formulation of Galilean relativistic Born-Infeld theory
Authors:
Rabin Banerjee,
Soumya Bhattacharya,
Bibhas Ranjan Majhi
Abstract:
In this paper, we formulate, for the first time and in a systematic manner, the Galilean relativistic Born-Infeld action in detail. Exploiting maps connecting Lorentz relativistic and Galilean relativistic vectors, we construct the two limits (electric and magnetic) of the Galilean relativistic Born-Infeld action from the usual relativistic Born-Infeld theory. An action formalism is thereby derived. From this action, equations of motion are obtained either in the potential or field formulation. Galilean versions of duality transformations involving the electric and magnetic fields are defined. They map the electric limit relations to the magnetic ones and vice versa, exactly as happens for Galilean relativistic Maxwell theory. We also explicitly show the Galilean boost and gauge invariances of the theory in both limits.
Submitted 9 February, 2024; v1 submitted 1 September, 2023;
originally announced September 2023.
-
Unraveling anomalies in Deeply Virtual Compton Scattering
Authors:
Shohini Bhattacharya,
Yoshitaka Hatta,
Werner Vogelsang
Abstract:
We calculate the one-loop quark box diagrams relevant to polarized and unpolarized Deeply Virtual Compton Scattering by introducing an off-forward momentum $l^μ$ as an infrared regulator. This regularization approach allows us to reveal the poles associated with the chiral anomaly in the polarized scenario, as well as the trace anomaly in the unpolarized case. We provide an interpretation of our findings in the context of pertinent Generalized Parton Distributions (GPDs). Furthermore, we discuss the implications of these poles on the QCD factorization pertaining to Compton amplitudes.
Submitted 29 August, 2023;
originally announced August 2023.
-
Inferences on Mixing Probabilities and Ranking in Mixed-Membership Models
Authors:
Sohom Bhattacharya,
Jianqing Fan,
Jikai Hou
Abstract:
Network data is prevalent in numerous big data applications, including economics and health networks, where it is of prime importance to understand the latent structure of the network. In this paper, we model the network using the Degree-Corrected Mixed Membership (DCMM) model. In the DCMM model, for each node $i$, there exists a membership vector $\boldsymbol{π}_i = (\boldsymbol{π}_i(1), \boldsymbol{π}_i(2),\ldots, \boldsymbol{π}_i(K))$, where $\boldsymbol{π}_i(k)$ denotes the weight that node $i$ puts in community $k$. We derive a novel finite-sample expansion for the $\boldsymbol{π}_i(k)$s, which allows us to obtain asymptotic distributions and confidence intervals of the membership mixing probabilities and other related population quantities. This fills an important gap in uncertainty quantification of the membership profile. We further develop a ranking scheme of the vertices based on the membership mixing probabilities on certain communities and perform relevant statistical inferences. A multiplier bootstrap method is proposed for ranking inference of an individual member's profile with respect to a given community. The validity of our theoretical results is further demonstrated via numerical experiments in both real and synthetic data examples.
Submitted 28 August, 2023;
originally announced August 2023.
-
Accelerated Neural Network Training through Dimensionality Reduction for High-Throughput Screening of Topological Materials
Authors:
Ruman Moulik,
Ankita Phutela,
Sajjan Sheoran,
Saswata Bhattacharya
Abstract:
Machine Learning facilitates building a large variety of models, starting from elementary linear regression models to very complex neural networks. Neural networks are currently limited by the size of data provided and the huge computational cost of training a model. This is especially problematic when dealing with a large set of features without much prior knowledge of how good or bad each individual feature is. We try tackling the problem using dimensionality reduction algorithms to construct more meaningful features. We also compare the accuracy and training times of raw data and data transformed after dimensionality reduction to deduce a sufficient number of dimensions without sacrificing accuracy. The indicated estimation is done using a lighter decision tree-based algorithm, AdaBoost, as it trains faster than neural networks. We have chosen the data from an online database of topological materials, Materiae. Our final goal is to construct a model to predict the topological properties of new materials from elementary properties.
Submitted 24 August, 2023;
originally announced August 2023.
-
High scale validity of two Higgs doublet scenarios with a real scalar singlet dark matter
Authors:
Subhaditya Bhattacharya,
Atri Dey,
Jayita Lahiri,
Biswarup Mukhopadhyaya
Abstract:
We study the high-scale validity of two kinds of two Higgs doublet models (2HDM), namely, Type-II and Type-X, but with a scalar SU(2) singlet dark matter (DM) candidate in addition in each case. The additional quartic couplings involving the DM particle in the scalar potential in both scenarios bring in additional constraints from the requirement of perturbative unitarity and vacuum stability. DM relic density and direct search constraints play a crucial role in this analysis, as the perturbative unitarity of the DM-Higgs portal couplings primarily decides the high-scale validity of the model. We find that, within the parameter regions thus restricted, the Type-II scenario must have a cut-off at around $10^6$ GeV, while the Type-X scenario admits validity up to the Planck scale. However, only those regions which are valid up to about $10^8$ GeV in Type-X 2HDM are amenable to detection at the High-luminosity LHC (up to 3000 fb$^{-1}$), while most of the parameter space of the Type-II scenario mentioned above is likely to be detectable.
Submitted 23 August, 2023;
originally announced August 2023.
-
Finding the Perfect Fit: Applying Regression Models to ClimateBench v1.0
Authors:
Anmol Chaure,
Ashok Kumar Behera,
Sudip Bhattacharya
Abstract:
Climate projection using data-driven machine learning models acting as emulators is one of the prevailing areas of research to enable policy makers to make informed decisions. Use of machine learning emulators as surrogates for computationally heavy GCM simulators reduces time and carbon footprints. In this direction, ClimateBench [1] is a recently curated benchmarking dataset for evaluating the performance of machine learning emulators designed for climate data. Recent studies have reported that despite being considered fundamental, regression models offer several advantages pertaining to climate emulations. In particular, by leveraging the kernel trick, regression models can capture complex relationships and improve their predictive capabilities. This study focuses on evaluating non-linear regression models using the aforementioned dataset. Specifically, we compare the emulation capabilities of three non-linear regression models. Among them, Gaussian Process Regressor demonstrates the best-in-class performance against standard evaluation metrics used for climate field emulation studies. However, Gaussian Process Regression is computationally resource-hungry in terms of space and time complexity. Alternatively, Support Vector and Kernel Ridge models also deliver competitive results, but there are certain trade-offs to be addressed. Additionally, we are actively investigating the performance of composite kernels and techniques such as variational inference to further enhance the performance of the regression models and effectively model complex non-linear patterns, including phenomena like precipitation.
Submitted 22 August, 2023;
originally announced August 2023.
-
Non-perturbative $\langle φ\rangle$, $\langle φ^2 \rangle$ and the dynamically generated scalar mass with Yukawa interaction in the inflationary de Sitter spacetime
Authors:
Sourav Bhattacharya,
Moutushi Dutta Choudhury
Abstract:
We consider a massless, minimally coupled, self-interacting quantum scalar field coupled to a fermion via the Yukawa interaction, in the inflationary de Sitter background. The fermion is also taken to be massless and the scalar potential is taken to be a hybrid, $V(φ)= λφ^4/4!+ βφ^3/3!$ ($λ>0$). The chief physical motivation behind this choice of $V(φ)$ is, apart from its boundedness from below, the fact that the shape of $V(φ)$ is qualitatively similar to standard inflationary classical slow-roll potentials. Also, its vacuum expectation value can be negative, suggesting some screening of the inflationary cosmological constant. We choose $\langle φ\rangle\sim 0$ at early times with respect to the Bunch-Davies vacuum, so that perturbation theory is valid initially. We consider the equations satisfied by $\langle φ(t) \rangle$ and $\langle φ^2(t) \rangle$, constructed from the coarse-grained equation of motion for the slowly rolling $φ$. We then compute the vacuum diagrams of various relevant operators using the in-in formalism up to three loops, in terms of the leading powers of the secular logarithms. For a closed fermion loop, we have restricted ourselves here to only the local contribution. These large temporal logarithms are then resummed by constructing suitable non-perturbative equations to compute $\langle φ\rangle$ and $\langle φ^2 \rangle$. $\langle φ\rangle$ turns out to be at least approximately an order of magnitude smaller than the minimum of the classical potential, $-3β/λ$, owing to the strong quantum fluctuations. For $\langle φ^2 \rangle$, we have computed the dynamically generated scalar mass at late times, by taking the appropriate purely local contributions. Variations of these quantities with respect to different couplings have also been presented.
Submitted 2 January, 2024; v1 submitted 22 August, 2023;
originally announced August 2023.
-
Spike-and-slab shrinkage priors for structurally sparse Bayesian neural networks
Authors:
Sanket Jantre,
Shrijita Bhattacharya,
Tapabrata Maiti
Abstract:
Network complexity and computational efficiency have become increasingly significant aspects of deep learning. Sparse deep learning addresses these challenges by recovering a sparse representation of the underlying target function by reducing heavily over-parameterized deep neural networks. Specifically, deep neural architectures compressed via structured sparsity (e.g. node sparsity) provide low latency inference, higher data throughput, and reduced energy consumption. In this paper, we explore two well-established shrinkage techniques, Lasso and Horseshoe, for model compression in Bayesian neural networks. To this end, we propose structurally sparse Bayesian neural networks which systematically prune excessive nodes with (i) Spike-and-Slab Group Lasso (SS-GL), and (ii) Spike-and-Slab Group Horseshoe (SS-GHS) priors, and develop computationally tractable variational inference including continuous relaxation of Bernoulli variables. We establish the contraction rates of the variational posterior of our proposed models as a function of the network topology, layer-wise node cardinalities, and bounds on the network weights. We empirically demonstrate the competitive performance of our models compared to the baseline models in prediction accuracy, model compression, and inference latency.
Submitted 21 August, 2024; v1 submitted 17 August, 2023;
originally announced August 2023.
-
Unfolding for Joint Channel Estimation and Symbol Detection in MIMO Communication Systems
Authors:
Swati Bhattacharya,
K. V. S. Hari,
Yonina C. Eldar
Abstract:
This paper proposes a Joint Channel Estimation and Symbol Detection (JED) scheme for Multiple-Input Multiple-Output (MIMO) wireless communication systems. Our proposed method for JED using Alternating Direction Method of Multipliers (JED-ADMM) and its model-based neural network version, JED using Unfolded ADMM (JED-U-ADMM), markedly improve the symbol detection performance over JED using Alternating Minimization (JED-AM) for a range of MIMO antenna configurations. Both proposed algorithms exploit the non-smooth constraint that occurs as a result of the Quadrature Amplitude Modulation (QAM) data symbols to effectively improve the performance using the ADMM iterations. The proposed unfolded network JED-U-ADMM consists of a few trainable parameters and requires a small training set. We show the efficacy of the proposed methods for both uncorrelated and correlated MIMO channels. For certain configurations, the gain in SNR for a desired BER of $10^{-2}$ for the proposed JED-ADMM and JED-U-ADMM is up to $4$ dB and is also accompanied by a significant reduction in computational complexity of up to $75\%$, depending on the MIMO configuration, as compared to the complexity of JED-AM.
Submitted 21 August, 2023; v1 submitted 17 August, 2023;
originally announced August 2023.
-
Assistive Chatbots for healthcare: a succinct review
Authors:
Basabdatta Sen Bhattacharya,
Vibhav Sinai Pissurlenkar
Abstract:
Artificial Intelligence (AI) for supporting healthcare services has never been more necessary than during the recent global pandemic. Here, we review the state-of-the-art in AI-enabled Chatbots in healthcare proposed during the last 10 years (2013-2023). The focus on AI-enabled technology is because of its potential for enhancing the quality of human-machine interaction via Chatbots, reducing dependence on human-human interaction and saving man-hours. Our review indicates that there are a handful of (commercial) Chatbots that are being used for patient support, while there are others (non-commercial) that are in the clinical trial phases. However, there is a lack of trust in this technology regarding patient safety and data protection, as well as a lack of wider awareness of its benefits among healthcare workers and professionals. Also, patients have expressed dissatisfaction with the Natural Language Processing (NLP) skills of the Chatbots in comparison to humans. Notwithstanding the recent introduction of ChatGPT, which has raised the bar for NLP technology, this Chatbot cannot be trusted with patient safety and medical ethics without thorough and rigorous checks to serve in the `narrow' domain of assistive healthcare. Our review suggests that to enable deployment and integration of AI-enabled Chatbots in public health services, the need of the hour is: to build technology that is simple and safe to use; and to build confidence in the technology among (a) the medical community, by focussed training and development, and (b) the patients and wider community, through outreach.
Submitted 8 August, 2023;
originally announced August 2023.
-
ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action Recognition
Authors:
Soumyabrata Chaudhuri,
Saumik Bhattacharya
Abstract:
Video Action Recognition (VAR) is a challenging task due to its inherent complexities. Though different approaches have been explored in the literature, designing a unified framework to recognize a large number of human actions is still a challenging problem. Recently, Multi-Modal Learning (MML) has demonstrated promising results in this domain. In the literature, 2D skeleton or pose modality has often been used for this task, either independently or in conjunction with the visual information (RGB modality) present in videos. However, the combination of pose, visual information, and text attributes has not been explored yet, though text and pose attributes independently have been proven to be effective in numerous computer vision tasks. In this paper, we present the first pose-augmented Vision-Language Model (VLM) for VAR. Notably, our scheme achieves an accuracy of 92.81% and 73.02% on two popular human video action recognition benchmark datasets, UCF-101 and HMDB-51, respectively, even without any video data pre-training, and an accuracy of 96.11% and 75.75% after Kinetics pre-training.
Submitted 7 August, 2023;
originally announced August 2023.
-
FASTER: A Font-Agnostic Scene Text Editing and Rendering Framework
Authors:
Alloy Das,
Sanket Biswas,
Prasun Roy,
Subhankar Ghosh,
Umapada Pal,
Michael Blumenstein,
Josep Lladós,
Saumik Bhattacharya
Abstract:
Scene Text Editing (STE) is a challenging research problem that primarily aims to modify existing texts in an image while preserving the background and the font style of the original text. Despite its utility in numerous real-world applications, existing style-transfer-based approaches have shown sub-par editing performance due to (1) complex image backgrounds, (2) diverse font attributes, and (3) varying word lengths within the text. To address such limitations, in this paper, we propose a novel font-agnostic scene text editing and rendering framework, named FASTER, for simultaneously generating text in arbitrary styles and locations while preserving a natural and realistic appearance and structure. A fusion of target mask generation and style transfer units with a cascaded self-attention mechanism is proposed to focus on multi-level text region edits to handle varying word lengths. Extensive evaluation on a real-world database, with a further subjective human evaluation study, indicates the superiority of FASTER in both scene text editing and rendering tasks, in terms of model performance and efficiency. Our code will be released upon acceptance.
Submitted 5 November, 2024; v1 submitted 5 August, 2023;
originally announced August 2023.