-
Imaging the Photochemistry of Cyclobutanone using Ultrafast Electron Diffraction: Experimental Results
Authors:
A. E. Green,
Y. Liu,
F. Allum,
M. Graßl,
P. Lenzen,
M. N. R. Ashfold,
S. Bhattacharyya,
X. Cheng,
M. Centurion,
S. W. Crane,
R. G. Forbes,
N. A. Goff,
L. Huang,
B. Kaufman,
M. F. Kling,
P. L. Kramer,
H. V. S. Lam,
K. A. Larsen,
R. Lemons,
M. -F. Lin,
A. J. Orr-Ewing,
D. Rolles,
A. Rudenko,
S. K. Saha,
J. Searles, et al. (5 additional authors not shown)
Abstract:
We investigated the ultrafast structural dynamics of cyclobutanone following photoexcitation at $λ=200$ nm using gas-phase megaelectronvolt ultrafast electron diffraction. Our investigation complements the simulation studies of the same process within this special issue. It provides information about both electronic state population and structural dynamics through well-separable inelastic and elastic electron scattering signatures. We observe the depopulation of the photoexcited S$_2$ state of cyclobutanone with n3s Rydberg character through its inelastic electron scattering signature with a time constant of $(0.29 \pm 0.2)$ ps towards the S$_1$ state. The S$_1$ state population undergoes ring-opening via a Norrish Type-I reaction, likely while passing through a conical intersection with S$_0$. The corresponding structural changes can be tracked by elastic electron scattering signatures. These changes appear with a delay of $(0.14 \pm 0.05)$ ps with respect to the initial photoexcitation, which is less than the S$_2$ depopulation time constant. This behavior provides evidence for the ballistic nature of the ring-opening once the S$_1$ state is reached. The resulting biradical species react further within $(1.2 \pm 0.2)$ ps via two rival fragmentation channels yielding ketene and ethylene, or propene and carbon monoxide. Our study showcases both the value of gas-phase ultrafast diffraction studies as an experimental benchmark for nonadiabatic dynamics simulation methods and the limits in the interpretation of such experimental data without comparison to such simulations.
Submitted 14 April, 2025; v1 submitted 19 February, 2025;
originally announced February 2025.
-
Enhanced Gilbert Damping via Cubic Spin-Orbit Coupling at 2DHG/Ferromagnetic Insulator Interface
Authors:
Sushmita Saha,
Alestin Mawrie
Abstract:
We investigate the enhancement of Gilbert damping at 2DHG/ferromagnetic insulator (FI) interfaces, where spin pumping from the FI layer injects spins into the 2DHG, and cubic Rashba spin-orbit coupling (RSOC) significantly boosts spin relaxation and spin-pumping efficiency compared to 2DEG systems. The dominant contribution to spin damping arises from interband transitions, which exhibit conductivity-like behavior as the temperature \( T \to 0 \). Our results reveal that damping remains stronger than in 2DEG due to the persistent influence of cubic RSOC. The interplay between RSOC and magnon absorption broadens the spectral response, with the damping peak shifting more notably at higher temperatures. Stronger RSOC expands the magnon interaction phase space, thus widening the damping spectrum. A key observation emerges with the Fermi level (\(E_f\)): a finite \(E_f\) sustains spin imbalance and enhances damping, whereas \(E_f = 0\) suppresses it, unlike in 2DEG. The electric field tunability of RSOC enables real-time control over spin relaxation and angular momentum transfer, offering a pathway toward voltage-controlled spintronic devices. These findings highlight the superior potential of 2DHG for tailoring spin dynamics via electric and thermal effects.
Submitted 17 February, 2025;
originally announced February 2025.
-
Meta-Cultural Competence: Climbing the Right Hill of Cultural Awareness
Authors:
Sougata Saha,
Saurabh Kumar Pandey,
Monojit Choudhury
Abstract:
Numerous recent studies have shown that Large Language Models (LLMs) are biased towards a Western and Anglo-centric worldview, which compromises their usefulness in non-Western cultural settings. However, "culture" is a complex, multifaceted topic, and its awareness, representation, and modeling in LLMs and LLM-based applications can be defined and measured in numerous ways. In this position paper, we ask what it means for an LLM to possess "cultural awareness", and through a thought experiment, which is an extension of the Octopus test proposed by Bender and Koller (2020), we argue that it is not cultural awareness or knowledge but rather meta-cultural competence that is required of an LLM or LLM-based AI system to make it useful across various, including completely unseen, cultures. We lay out the principles of meta-cultural competence for AI systems, and discuss ways to measure and model them.
Submitted 8 February, 2025;
originally announced February 2025.
-
Reading between the Lines: Can LLMs Identify Cross-Cultural Communication Gaps?
Authors:
Sougata Saha,
Saurabh Kumar Pandey,
Harshit Gupta,
Monojit Choudhury
Abstract:
In a rapidly globalizing and digital world, content such as book and product reviews created by people from diverse cultures is read and consumed by others from different corners of the world. In this paper, we investigate the extent and patterns of gaps in understandability of book reviews due to the presence of culturally-specific items and elements that might be alien to users from another culture. Our user study on 57 book reviews from Goodreads reveals that 83\% of the reviews had at least one culture-specific difficult-to-understand element. We also evaluate the efficacy of GPT-4o in identifying such items, given the cultural background of the reader; the results are mixed, implying a significant scope for improvement. Our datasets are available here: https://github.com/sougata-ub/reading_between_lines
Submitted 20 February, 2025; v1 submitted 8 February, 2025;
originally announced February 2025.
-
The Multilingual Mind : A Survey of Multilingual Reasoning in Language Models
Authors:
Akash Ghosh,
Debayan Datta,
Sriparna Saha,
Chirag Agarwal
Abstract:
While reasoning and multilingual capabilities in Language Models (LMs) have achieved remarkable progress in recent years, their integration into a unified paradigm, multilingual reasoning, is at a nascent stage. Multilingual reasoning requires language models to handle logical reasoning across languages while addressing misalignment, biases, and challenges in low-resource settings. This survey provides the first in-depth review of multilingual reasoning in LMs. In this survey, we provide a systematic overview of existing methods that leverage LMs for multilingual reasoning, specifically outlining the challenges, motivations, and foundational aspects of applying language models to reason across diverse languages. We provide an overview of the standard data resources used for training multilingual reasoning in LMs and the evaluation benchmarks employed to assess their multilingual capabilities. Next, we analyze various state-of-the-art methods and their performance on these benchmarks. Finally, we explore future research opportunities to improve multilingual reasoning in LMs, focusing on enhancing their ability to handle diverse languages and complex reasoning tasks.
Submitted 13 February, 2025;
originally announced February 2025.
-
Revealing isotropic abundant low-energy excitations in UTe$_2$ through complex microwave surface impedance
Authors:
Arthur Carlton-Jones,
Alonso Suarez,
Yun-Suk Eo,
Ian M. Hayes,
Shanta R. Saha,
Johnpierre Paglione,
Nicholas P. Butch,
Steven M. Anlage
Abstract:
The complex surface impedance is a well-established tool to study the super- and normal-fluid responses of superconductors. Fundamental properties of the superconductor, such as the pairing mechanism, Fermi surface, and topological properties, also influence the surface impedance. We explore the microwave surface impedance of spin-triplet UTe$_2$ single crystals as a function of temperature using resonant cavity perturbation measurements employing a novel multi-modal analysis to gain insight into these properties. We determine a composite surface impedance of the crystal for each mode using resonance data combined with the independently measured normal state dc resistivity tensor. The normal state surface impedance reveals the weighting of current flow directions in the crystal of each resonant mode. For UTe$_2$, we find an isotropic $Δλ(T) \sim T^α$ power-law temperature dependence for the magnetic penetration depth for $T\le T_c/3$ with $α< 2$, which is inconsistent with a single pair of point nodes on the Fermi surface under weak scattering. We also find a similar power-law temperature dependence for the low-temperature surface resistance $R_s(T) \sim T^{α_R}$ with $α_R < 2$. We observe a strong anisotropy of the residual microwave loss across these modes, with some modes showing loss below the universal line-nodal value and others showing substantially more. We compare to predictions for topological Weyl superconductivity in the context of the observed isotropic power laws and the anisotropy of the residual loss.
Submitted 4 June, 2025; v1 submitted 11 February, 2025;
originally announced February 2025.
-
Hydrodynamic stresses in a multi-species suspension of active Janus colloids
Authors:
Gennaro Tucci,
Giulia Pisegna,
Ramin Golestanian,
Suropriya Saha
Abstract:
A realistic description of active particles should include interactions with the medium, commonly a momentum-conserving simple fluid, in which they are suspended. In this work, we consider a multi-species suspension of self-diffusiophoretic Janus colloids interacting via chemical and hydrodynamic fields. Through a systematic coarse-graining of the microscopic dynamics, we calculate the multi-component contribution to the hydrodynamic stress tensor of the incompressible Stokesian fluid in which the particles are immersed. For a single species, we find that the strength of the stress produced by the gradients of the number density field is determined by the particles' self-propulsion and chemotactic alignment, and can be tuned to be either contractile or extensile. For a multi-species system, we unveil how different forms of activity modify the stress tensor, and how non-reciprocity in hydrodynamic interactions emerges in an active binary mixture.
Submitted 11 February, 2025;
originally announced February 2025.
-
Generating crossmodal gene expression from cancer histopathology improves multimodal AI predictions
Authors:
Samiran Dey,
Christopher R. S. Banerji,
Partha Basuchowdhuri,
Sanjoy K. Saha,
Deepak Parashar,
Tapabrata Chakraborti
Abstract:
Emerging research has highlighted that artificial intelligence based multimodal fusion of digital pathology and transcriptomic features can improve cancer diagnosis (grading/subtyping) and prognosis (survival risk) prediction. However, such direct fusion for joint decision is impractical in real clinical settings, where histopathology is still the gold standard for diagnosis and transcriptomic tests are rarely requested, at least in the public healthcare system. With our novel diffusion based crossmodal generative AI model PathGen, we show that genomic expressions synthesized from digital histopathology jointly predict cancer grading and patient survival risk with high accuracy (state-of-the-art performance), certainty (through conformal coverage guarantee) and interpretability (through distributed attention maps). PathGen code is available for open use by the research community through GitHub at https://github.com/Samiran-Dey/PathGen.
Submitted 11 February, 2025; v1 submitted 1 February, 2025;
originally announced February 2025.
-
Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge
Authors:
Swarnadeep Saha,
Xian Li,
Marjan Ghazvininejad,
Jason Weston,
Tianlu Wang
Abstract:
LLM-as-a-Judge models generate chain-of-thought (CoT) sequences intended to capture the step-by-step reasoning process that underlies the final evaluation of a response. However, due to the lack of human-annotated CoTs for evaluation, the required components and structure of effective reasoning traces remain understudied. Consequently, previous approaches often (1) constrain reasoning traces to hand-designed components, such as a list of criteria, reference answers, or verification questions and (2) structure them such that planning is intertwined with the reasoning for evaluation. In this work, we propose EvalPlanner, a preference optimization algorithm for Thinking-LLM-as-a-Judge that first generates an unconstrained evaluation plan, followed by its execution, and then the final judgment. In a self-training loop, EvalPlanner iteratively optimizes over synthetically constructed evaluation plans and executions, leading to better final verdicts. Our method achieves a new state-of-the-art performance for generative reward models on RewardBench (with a score of 93.9), despite being trained on fewer, and synthetically generated, preference pairs. Additional experiments on other benchmarks like RM-Bench, JudgeBench, and FollowBenchEval further highlight the utility of both planning and reasoning for building robust LLM-as-a-Judge reasoning models.
Submitted 29 January, 2025;
originally announced January 2025.
-
AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies
Authors:
Surojit Saha,
Ross Whitaker
Abstract:
Automated interpretation of seismic images using deep learning methods is challenging because of the limited availability of training data. Few-shot learning is a suitable learning paradigm in such scenarios due to its ability to adapt to a new task with limited supervision (small training budget). Existing few-shot semantic segmentation (FSSS) methods fix the number of target classes. Therefore, they do not support joint training on multiple datasets varying in the number of classes. In the context of the interpretation of seismic facies, fixing the number of target classes inhibits the generalization capability of a model trained on one facies dataset to another, which is likely to have a different number of facies. To address this shortcoming, we propose a few-shot semantic segmentation method for interpreting seismic facies that can adapt to the varying number of facies across the dataset, dubbed the AdaSemSeg. In general, the backbone network of FSSS methods is initialized with the statistics learned from the ImageNet dataset for better performance. The lack of such a huge annotated dataset for seismic images motivates using a self-supervised algorithm on seismic datasets to initialize the backbone network. We have trained the AdaSemSeg on three public seismic facies datasets with different numbers of facies and evaluated the proposed method on multiple metrics. The performance of the AdaSemSeg on unseen datasets (not used in training) is better than the prototype-based few-shot method and baselines.
Submitted 28 January, 2025;
originally announced January 2025.
-
Pair Wavefunction Symmetry in UTe2 from Zero-Energy Surface State Visualization
Authors:
Qiangqiang Gu,
Shuqiu Wang,
Joseph P. Carroll,
Kuanysh Zhussupbekov,
Christopher Broyles,
Sheng Ran,
Nicholas P. Butch,
Shanta Saha,
Johnpierre Paglione,
Xiaolong Liu,
J. C. Séamus Davis,
Dung-Hai Lee
Abstract:
Although nodal spin-triplet topological superconductivity appears probable in UTe2, its superconductive order-parameter $Δ_k$ remains unestablished. In theory, a distinctive identifier would be the existence of a superconductive topological surface band (TSB), which could facilitate zero-energy Andreev tunneling to an s-wave superconductor, and also distinguish a chiral from non-chiral $Δ_k$ via enhanced s-wave proximity. Here we employ s-wave superconductive scan-tips and detect intense zero-energy Andreev conductance at the UTe2 (0-11) termination surface. Imaging reveals sub-gap quasiparticle scattering interference signatures with a-axis orientation. The observed zero-energy Andreev peak splitting with enhanced s-wave proximity signifies that $Δ_k$ of UTe2 is a non-chiral state: B1u, B2u or B3u. However, if the quasiparticle scattering along the a-axis is internodal, then a non-chiral B3u state is the most consistent for UTe2.
Submitted 28 January, 2025; v1 submitted 27 January, 2025;
originally announced January 2025.
-
Disentanglement Analysis in Deep Latent Variable Models Matching Aggregate Posterior Distributions
Authors:
Surojit Saha,
Sarang Joshi,
Ross Whitaker
Abstract:
Deep latent variable models (DLVMs) are designed to learn meaningful representations in an unsupervised manner, such that the hidden explanatory factors are interpretable by independent latent variables (aka disentanglement). The variational autoencoder (VAE) is a popular DLVM widely studied in disentanglement analysis due to the modeling of the posterior distribution using a factorized Gaussian distribution that encourages the alignment of the latent factors with the latent axes. Several metrics have been proposed recently, assuming that the latent variables explaining the variation in data are aligned with the latent axes (cardinal directions). However, there are other DLVMs, such as the AAE and WAE-MMD (matching the aggregate posterior to the prior), where the latent variables might not be aligned with the latent axes. In this work, we propose a statistical method to evaluate disentanglement for any DLVMs in general. The proposed technique discovers the latent vectors representing the generative factors of a dataset that can be different from the cardinal latent axes. We empirically demonstrate the advantage of the method on two datasets.
Submitted 26 January, 2025;
originally announced January 2025.
-
A Note on the value distribution of some differential-difference monomials generated by a transcendental entire function of hyper-order less than one
Authors:
Soumon Roy,
Sudip Saha,
Ritam Sinha
Abstract:
Let $\mathfrak{f}$ be a transcendental entire function with hyper-order less than one. The aim of this note is to study the value distribution of the differential-difference monomials $α\mathfrak{f}(z)^{q_0}(\mathfrak{f}(z+c))^{q_1}$, where $c$ is a non-zero complex number, $q_0\geq2$ and $q_1\geq 1$ are non-negative integers, and $α(z)$ $(\not\equiv 0,\infty)$ is a small function of $\mathfrak{f}$.
Submitted 24 January, 2025;
originally announced January 2025.
-
ARD-VAE: A Statistical Formulation to Find the Relevant Latent Dimensions of Variational Autoencoders
Authors:
Surojit Saha,
Sarang Joshi,
Ross Whitaker
Abstract:
The variational autoencoder (VAE) is a popular, deep, latent-variable model (DLVM) due to its simple yet effective formulation for modeling the data distribution. Moreover, optimizing the VAE objective function is more manageable than other DLVMs. The bottleneck dimension of the VAE is a crucial design choice, and it has strong ramifications for the model's performance, such as finding the hidden explanatory factors of a dataset using the representations learned by the VAE. However, the size of the latent dimension of the VAE is often treated as a hyperparameter estimated empirically through trial and error. To this end, we propose a statistical formulation to discover the relevant latent factors required for modeling a dataset. In this work, we use a hierarchical prior in the latent space that estimates the variance of the latent axes using the encoded data, which identifies the relevant latent dimensions. For this, we replace the fixed prior in the VAE objective function with a hierarchical prior, keeping the remainder of the formulation unchanged. We call the proposed method the automatic relevancy detection in the variational autoencoder (ARD-VAE). We demonstrate the efficacy of the ARD-VAE on multiple benchmark datasets in finding the relevant latent dimensions and their effect on different evaluation metrics, such as FID score and disentanglement analysis.
Submitted 26 January, 2025; v1 submitted 18 January, 2025;
originally announced January 2025.
-
Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models
Authors:
Sofia Jamil,
Bollampalli Areen Reddy,
Raghvendra Kumar,
Sriparna Saha,
K J Joseph,
Koustava Goswami
Abstract:
The task of text-to-image generation has encountered significant challenges when applied to literary works, especially poetry. Poems are a distinct form of literature, with meanings that frequently transcend the literal words. To address this shortcoming, we propose a PoemToPixel framework designed to generate images that visually represent the inherent meanings of poems. Our approach incorporates the concept of prompt tuning in our image generation framework to ensure that the resulting images closely align with the poetic content. In addition, we propose the PoeKey algorithm, which extracts three key elements in the form of emotions, visual elements, and themes from poems to form instructions which are subsequently provided to a diffusion model for generating corresponding images. Furthermore, to expand the diversity of the poetry dataset across different genres and ages, we introduce MiniPo, a novel multimodal dataset comprising 1001 children's poems and images. Leveraging this dataset alongside PoemSum, we conducted both quantitative and qualitative evaluations of image generation using our PoemToPixel framework. This paper demonstrates the effectiveness of our approach and offers a fresh perspective on generating images from literary sources.
Submitted 10 January, 2025;
originally announced January 2025.
-
NGTS-EB-7, an eccentric, long-period, low-mass eclipsing binary
Authors:
Toby Rodel,
Christopher. A. Watson,
Solène Ulmer-Moll,
Samuel Gill,
Pierre F. L. Maxted,
Sarah L. Casewell,
Rafael Brahm,
Thomas G Wilson,
Jean C. Costes,
Yoshi Nike Emilia Eschen,
Lauren Doyle,
Alix V. Freckelton,
Douglas R. Alves,
Ioannis Apergis,
Daniel Bayliss,
Francois Bouchy,
Matthew R. Burleigh,
Xavier Dumusque,
Jan Eberhardt,
Jorge Fernández Fernández,
Edward Gillen,
Michael R. Goad,
Faith Hawthorn,
Ravit Helled,
Thomas Henning, et al. (13 additional authors not shown)
Abstract:
Despite late M dwarfs being the most common types of stars in the Galaxy, their physical properties are often poorly constrained. A trend of radius inflation compared to evolutionary models has been observed for earlier type M dwarfs in eclipsing binaries, possibly caused by magnetic activity. It is currently unclear whether this trend also extends to later type M dwarfs below the convective boundary. This makes the discovery of lower-mass, fully convective M dwarfs in eclipsing binaries valuable for testing evolutionary models, especially in longer-period binaries where tidal interaction between the primary and secondary is negligible. With this context, we present the discovery of the NGTS-EB-7 AB system, an eclipsing binary containing a late M dwarf secondary and an evolved G-type primary star. The secondary star has a radius of $0.125 \pm 0.006 R_\odot$, a mass of $0.096 \pm 0.004 M_\odot$ and follows a highly eccentric $(e=0.71436 \pm 0.00085)$ orbit every $193.35875 \pm 0.00034$ days. This makes NGTS-EB-7 AB the third longest-period eclipsing binary system with a secondary smaller than $200 M_J$ with the mass and radius constrained to better than $5 \%$. In addition, NGTS-EB-7 is situated near the centre of the proposed LOPS2 southern field of the upcoming PLATO mission, allowing for detection of the secondary eclipse and measurement of the companion's temperature. With its long period and well-constrained physical properties, NGTS-EB-7 B will make a valuable addition to the sample of M dwarfs in eclipsing binaries and help in determining accurate empirical mass/radius relations for later M dwarf stars.
Submitted 10 January, 2025; v1 submitted 8 January, 2025;
originally announced January 2025.
-
A Class of Non-Contracting Branch Groups with Non-Torsion Rigid Kernels
Authors:
Sagar Saha,
K. V. Krishna
Abstract:
In this work, we provide the first example of an infinite family of branch groups in the class of non-contracting self-similar groups. We show that these groups are very strongly fractal, not regular branch, and of exponential growth. Further, we prove that these groups do not have the congruence subgroup property by explicitly calculating the structure of their rigid kernels. This class of groups is also the first example of branch groups with non-torsion rigid kernels. As a consequence of these results, we also determine the Hausdorff dimension of these groups.
Submitted 7 January, 2025;
originally announced January 2025.
-
A Priori Log-Concavity Estimates for Dirichlet Eigenfunctions
Authors:
Gabriel Khan,
Soumyajit Saha,
Malik Tuerkoen
Abstract:
In this paper, we establish a priori log-concavity estimates for the first Dirichlet eigenfunction of convex domains of a Riemannian manifold. Specifically, we focus on cases where the principal eigenfunction $u$ is assumed to be log-concave and our primary goal is to obtain quantitative estimates for the Hessian of $\log u$.
Submitted 6 January, 2025;
originally announced January 2025.
-
Women, Infamous, and Exotic Beings: What Honorific Usages in Wikipedia Reveal about the Socio-Cultural Norms
Authors:
Sourabrata Mukherjee,
Soumya Teotia,
Sougata Saha,
Monojit Choudhury
Abstract:
Honorifics serve as powerful linguistic markers that reflect social hierarchies and cultural values. This paper presents a large-scale, cross-linguistic exploration of the usage of honorific pronouns in Bengali and Hindi Wikipedia articles, shedding light on how socio-cultural factors shape language. Using an LLM (GPT-4o), we annotated 10,000 articles of real and fictional beings in each language for several sociodemographic features such as gender, age, fame, and exoticness, and the use of honorifics. We find that across all feature combinations, use of honorifics is consistently more common in Bengali than Hindi. For both languages, the use of non-honorific pronouns is more commonly observed for infamous, juvenile, and exotic beings. Notably, we observe a gender bias in use of honorifics in Hindi, with men being more commonly referred to with honorifics than women.
Submitted 6 March, 2025; v1 submitted 6 January, 2025;
originally announced January 2025.
-
Large deviations of density in the non-equilibrium steady state of boundary-driven diffusive systems
Authors:
Soumyabrata Saha,
Tridib Sadhu
Abstract:
A diffusive system coupled to unequal boundary reservoirs eventually reaches a non-equilibrium steady state. While the full-counting-statistics of current in these non-equilibrium states are well understood for generic systems, results for steady-state density fluctuations remain limited to a few integrable models. By employing an exact solution of the Macroscopic Fluctuation Theory, we characterize steady-state density fluctuations in terms of large deviations for a class of systems. Additionally, we quantitatively describe the development of these rare fluctuations. For generic diffusive systems in arbitrary dimensions, we use a perturbation around the equilibrium state to solve for the large deviation functional and the corresponding path to fluctuations, up to non-trivial orders where the non-locality of fluctuations unveils.
Submitted 6 January, 2025;
originally announced January 2025.
-
Diabetic Retinopathy Detection Using CNN with Residual Block with DCGAN
Authors:
Debjany Ghosh Aronno,
Sumaiya Saeha
Abstract:
Diabetic Retinopathy (DR) is a major cause of blindness worldwide, caused by damage to the blood vessels in the retina due to diabetes. Early detection and classification of DR are crucial for timely intervention and preventing vision loss. This work proposes an automated system for DR detection using Convolutional Neural Networks (CNNs) with a residual block architecture, which enhances feature extraction and model performance. To further improve the model's robustness, we incorporate advanced data augmentation techniques, specifically leveraging a Deep Convolutional Generative Adversarial Network (DCGAN) for generating diverse retinal images. This approach increases the variability of training data, making the model more generalizable and capable of handling real-world variations in retinal images. The system is designed to classify retinal images into five distinct categories, from No DR to Proliferative DR, providing an efficient and scalable solution for early diagnosis and monitoring of DR progression. The proposed model aims to support healthcare professionals in large-scale DR screening, especially in resource-constrained settings.
Submitted 4 January, 2025;
originally announced January 2025.
-
Innate behavioural mechanisms and defensive traits in ecological models of predator-prey types
Authors:
Sangeeta Saha,
Swadesh Pal,
Roderick Melnik
Abstract:
There are various examples of phenotypic plasticity in ecosystems that serve as the basis for a wide range of inducible defences against predation. These strategies include camouflage, burrowing, mimicry, evasive actions, and even counterattacks that enhance survival under fluctuating predatory threats. Additionally, the ability to exhibit plastic responses often influences ecological balances, shaping predator-prey coexistence over time. This study introduces a predator-prey model where prey species show inducible defences, providing new insights into the role of adaptive strategies in these complex interactions. The stabilizing impact of the defensive mechanism is one of several intriguing outcomes produced by the dynamics. Moreover, the predator population rises when the interference rate increases to a moderate value even in the presence of lower prey defence but decreases monotonically for stronger defence levels. Furthermore, we identify a bistable domain when the handling rate is used as a control parameter, emphasizing the critical role of initial population sizes in determining system outcomes. By considering the species diffusion in a bounded region, the study is expanded into a spatio-temporal model. The numerical simulation reveals that the Turing domain decreases as the level of protection increases. The study is subsequently extended to incorporate taxis, known as the directed movement of species toward or away from another species. Our investigation identifies the conditions under which pattern formation emerges, driven by the interplay of inducible defences, taxis as well as species diffusion. Numerical simulations demonstrate that including taxis within the spatio-temporal model exerts a stabilizing influence, thereby diminishing the potential for pattern formation in the system.
Submitted 3 January, 2025;
originally announced January 2025.
-
Search for continuous gravitational waves from known pulsars in the first part of the fourth LIGO-Virgo-KAGRA observing run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
D. Agarwal,
M. Agathos,
M. Aghaei Abchouyeh,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné
, et al. (1794 additional authors not shown)
Abstract:
Continuous gravitational wave (CW) emission from neutron stars carries information about their internal structure and equation of state, and it can provide tests of General Relativity. We present a search for CWs from a set of 45 known pulsars in the first part of the fourth LIGO--Virgo--KAGRA observing run, known as O4a. We conducted a targeted search for each pulsar using three independent analysis methods considering the single-harmonic and the dual-harmonic emission models. We find no evidence of a CW signal in O4a data for either model and set upper limits on the signal amplitude and on the ellipticity, which quantifies the asymmetry in the neutron star mass distribution. For the single-harmonic emission model, 29 targets have the upper limit on the amplitude below the theoretical spin-down limit. The lowest upper limit on the amplitude is $6.4\!\times\!10^{-27}$ for the young energetic pulsar J0537-6910, while the lowest constraint on the ellipticity is $8.8\!\times\!10^{-9}$ for the bright nearby millisecond pulsar J0437-4715. Additionally, for a subset of 16 targets we performed a narrowband search that is more robust regarding the emission model, with no evidence of a signal. We also found no evidence of non-standard polarizations as predicted by the Brans-Dicke theory.
Submitted 2 January, 2025;
originally announced January 2025.
-
Non-reciprocal mixtures in suspension: the role of hydrodynamic interactions
Authors:
Giulia Pisegna,
Navdeep Rana,
Ramin Golestanian,
Suropriya Saha
Abstract:
The collective chasing dynamics of non-reciprocally coupled densities leads to stable travelling waves which can be mapped to a model for emergent flocking. In this work, we couple the non-reciprocal Cahn-Hilliard model (NRCH) to a fluid to minimally describe scalar active mixtures in a suspension, with the aim of exploring the stability of the waves, i.e. the emergent flock, in the presence of self-generated fluid flows. We show that the emergent polarity is linearly unstable to perturbations for a specific sign of the active stress, recalling instabilities of orientational order in a fluid. Using numerical simulations, we find, however, that non-reciprocity stabilizes the waves against the linear instability in a large region of the phase space.
Submitted 12 February, 2025; v1 submitted 2 January, 2025;
originally announced January 2025.
-
Defect-mediated electron-phonon coupling in halide double perovskite
Authors:
Aprajita Joshi,
Sajid Saikia,
Shalini Badola,
Angshuman Nag,
Surajit Saha
Abstract:
Optically active defects often play a crucial role in governing the light emission as well as the electronic properties of materials. Moreover, defect-mediated states in the mid-gap region can trap electrons, thus opening a path for the recombination of electrons and holes in lower energy states that may require phonons in the process. Considering this, we have probed electron-phonon interaction in halide perovskite systems with the introduction of defects and investigated the thermal effect on this interaction. Here, we report a Raman spectroscopy study of the thermal evolution of electron-phonon coupling, which is tunable with the crystal growth conditions, in the halide perovskite systems Cs$_2$AgInCl$_6$ and Cs$_2$NaInCl$_6$. The signature of electron-phonon coupling is observed as a Fano anomaly in the lowest frequency phonon mode (51 cm$^{-1}$) which evolves with temperature. In addition, we observe a broad band in the photoluminescence (PL) measurements for the defect-mediated systems, which is otherwise absent in defect-free halide perovskite. The simultaneous observation of the Fano anomaly in the Raman spectrum and the emergence of the PL band suggests the defect-mediated mid-gap states and the consequent existence of electron-phonon coupling in the double perovskite.
Submitted 31 December, 2024;
originally announced January 2025.
-
Dynamic magnetic response in ABA type trilayered systems and compensation phenomenon
Authors:
Enakshi Guru,
Sonali Saha,
Sankhasubhra Nag
Abstract:
Dynamic magnetic response in a trilayered structure with non-equivalent layers (ABA type) has been studied with Monte Carlo simulation using the Metropolis algorithm. In each layer, ferromagnetic (FM) nearest neighbour Ising interactions are present along with antiferromagnetic (AFM) nearest neighbour coupling across different layers. The system is studied under a harmonically oscillating external magnetic field. It is revealed that, along with a dynamic phase transition (DPT), a compensation phenomenon also emerges in this system in the dynamic scenario. This feature in the dynamic case is unique to such trilayered systems, in contrast to the bulk system reported earlier. The temporal behaviour of the magnetisation of each individual layer shows that the different magnetic responses of the non-equivalent layers result in this dynamic compensation phenomenon. The difference in response also results in warping of the dynamic hysteresis loops under various external parameter values, such as the amplitude of the oscillating field and the temperature.
Submitted 30 December, 2024;
originally announced December 2024.
-
KVC-onGoing: Keystroke Verification Challenge
Authors:
Giuseppe Stragapede,
Ruben Vera-Rodriguez,
Ruben Tolosana,
Aythami Morales,
Ivan DeAndres-Tame,
Naser Damer,
Julian Fierrez,
Javier Ortega-Garcia,
Alejandro Acien,
Nahuel Gonzalez,
Andrei Shadrikov,
Dmitrii Gordin,
Leon Schmitt,
Daniel Wimmer,
Christoph Großmann,
Joerdis Krieger,
Florian Heinz,
Ron Krestel,
Christoffer Mayer,
Simon Haberl,
Helena Gschrey,
Yosuke Yamagishi,
Sanjay Saha,
Sanka Rasnayaka,
Sandareka Wickramanayake
, et al. (5 additional authors not shown)
Abstract:
This article presents the Keystroke Verification Challenge - onGoing (KVC-onGoing), on which researchers can easily benchmark their systems in a common platform using large-scale public databases, the Aalto University Keystroke databases, and a standard experimental protocol. The keystroke data consist of tweet-long sequences of variable transcript text from over 185,000 subjects, acquired through desktop and mobile keyboards simulating real-life conditions. The results on the evaluation set of KVC-onGoing have proved the high discriminative power of keystroke dynamics, reaching values as low as 3.33% of Equal Error Rate (EER) and 11.96% of False Non-Match Rate (FNMR) @1% False Match Rate (FMR) in the desktop scenario, and 3.61% of EER and 17.44% of FNMR @1% FMR in the mobile scenario, significantly improving previous state-of-the-art results. Concerning demographic fairness, the analyzed scores reflect the subjects' age and gender to various extents, not negligible in a few cases. The framework runs on CodaLab.
Submitted 29 December, 2024;
originally announced December 2024.
-
Non-reciprocal interactions preserve the universality class of Potts model
Authors:
Soumya K. Saha,
P. K. Mohanty
Abstract:
We study the $q$-state Potts model on a square lattice with directed nearest-neighbor spin-spin interactions that are inherently non-reciprocal. Both equilibrium and non-equilibrium dynamics are investigated. Analytically, we demonstrate that non-reciprocal interactions do not alter the critical exponents of the model under equilibrium dynamics. In contrast, numerical simulations with selfish non-equilibrium dynamics reveal distinctive behavior. For $q=2$ (non-reciprocal non-equilibrium Ising model), the critical exponents remain consistent with those of the equilibrium Ising universality class. However, for $q=3$ and $q=4$, the critical exponents vary continuously. Remarkably, a super-universal scaling function -- Binder cumulant as a function of $ξ_2/ξ_0$, where $ξ_2$ is the second moment correlation length and $ξ_0$ its maximum value -- remains identical to that of the equilibrium $q=3,4$ Potts models. These findings indicate that non-reciprocal Potts models belong to the superuniversality class of their respective equilibrium counterparts.
Submitted 27 December, 2024;
originally announced December 2024.
-
Real-time classification of EEG signals using Machine Learning deployment
Authors:
Swati Chowdhuri,
Satadip Saha,
Samadrita Karmakar,
Ankur Chanda
Abstract:
The prevailing educational methods predominantly rely on traditional classroom instruction or online delivery, often limiting the teachers' ability to engage effectively with all the students simultaneously. A more intrinsic method of evaluating student attentiveness during lectures can enable the educators to tailor the course materials and their teaching styles in order to better meet the students' needs. The aim of this paper is to enhance teaching quality in real time, thereby fostering higher student engagement in the classroom activities. By monitoring the students' electroencephalography (EEG) signals and employing machine learning algorithms, this study proposes a comprehensive solution for addressing this challenge. Machine learning has emerged as a powerful tool for simplifying the analysis of complex variables, enabling the effective assessment of the students' concentration levels based on specific parameters. However, the real-time impact of machine learning models requires careful consideration where their deployment is concerned. This study proposes a machine learning-based approach for predicting the level of students' comprehension with regard to a certain topic. A browser interface was introduced that accesses the values of the system's parameters to determine a student's level of concentration on a chosen topic. The deployment of the proposed system made it necessary to address the real-time challenges faced by the students, consider the system's cost, and establish trust in its efficacy. This paper presents the efforts made for approaching this pertinent issue through the implementation of innovative technologies and provides a framework for addressing key considerations for future research directions.
Submitted 27 December, 2024;
originally announced December 2024.
-
Unveiling the Chiral States in Multi-Weyl Semimetals through Magneto-Optical Spectroscopy
Authors:
Sushmita Saha,
Deepannita Das,
Alestin Mawrie
Abstract:
This study investigates the transport parameters in multi-Weyl semimetals, focusing on their magneto-optical properties and the role of chiral states. The tilting parameter is identified as a key factor in higher-order Weyl nodes, significantly influencing the magneto-optical response. We obtain a generic Landau-level expression for multi-Weyl semimetals, establishing a robust framework for analyzing their quantum transport properties. A comprehensive expression for the conductivity tensor components is presented, uncovering distinctive low-frequency peaks and other features shaped by the tilting parameter. Our findings reveal that the signatures of chiral states in the conductivity tensors become increasingly pronounced with the Weyl node order. Particularly, the tilting parameter is shown to impact Faraday rotation, at energies near the tilted Dirac cone energies. These results provide critical insights into the magneto-optical behavior of multi-Weyl semimetals and their potential for exploring topological phenomena.
Submitted 26 December, 2024;
originally announced December 2024.
-
Power Law Behavior of Center-Like Decaying Oscillation : Exponent through Perturbation Theory and Optimization
Authors:
Sandip Saha
Abstract:
In dynamical systems theory, there is a lack of a straightforward rule to distinguish exact center solutions from decaying center-like solutions, as both require the damping force function to be zero [1, 2]. By adopting a multi-scale perturbative method, we have demonstrated a general rule for the decaying center-like power law behavior, characterized by an exponent of 1/3. The investigation began with a physical question about the higher-order nonlinearity in a damping force function, which exhibits birhythmic and trirhythmic behavior under a transition to a decaying center-type solution. Using numerical optimization algorithms, we identified the power law exponent for decaying center-type behavior across various rhythmic conditions. For all scenarios, we consistently observed a decaying power law with an exponent of 1/3. Our study aims to elucidate their dynamical differences, contributing to theoretical insights and practical applications where distinguishing between different types of center-like behavior is crucial. This key result would be beneficial for studying the multi-rhythmic nature of biological and engineering systems.
Submitted 5 April, 2025; v1 submitted 21 December, 2024;
originally announced December 2024.
-
Deep Learning Based Recalibration of SDSS and DESI BAO Alleviates Hubble and Clustering Tensions
Authors:
Rahul Shah,
Purba Mukherjee,
Soumadeep Saha,
Utpal Garain,
Supratik Pal
Abstract:
Conventional calibration of Baryon Acoustic Oscillations (BAO) data relies on estimation of the sound horizon at drag epoch $r_d$ from early universe observations by assuming a cosmological model. We present a recalibration of two independent BAO datasets, SDSS and DESI, by employing deep learning techniques for model-independent estimation of $r_d$, and explore the impacts on $Λ$CDM cosmological parameters. Significant reductions in both Hubble ($H_0$) and clustering ($S_8$) tensions are observed for both the recalibrated datasets. Moderate shifts in some other parameters hint towards further exploration of such data-driven approaches.
Submitted 19 December, 2024;
originally announced December 2024.
-
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Authors:
Mariam Hassan,
Sebastian Stapf,
Ahmad Rahimi,
Pedro M B Rezende,
Yasaman Haghighi,
David Brüggemann,
Isinsu Katircioglu,
Lin Zhang,
Xiaoran Chen,
Suman Saha,
Marco Cannici,
Elie Aljalbout,
Botao Ye,
Xi Wang,
Aram Davtyan,
Mathieu Salzmann,
Davide Scaramuzza,
Marc Pollefeys,
Paolo Favaro,
Alexandre Alahi
Abstract:
We present GEM, a Generalizable Ego-vision Multimodal world model that predicts future frames using a reference frame, sparse features, human poses, and ego-trajectories. Hence, our model has precise control over object dynamics, ego-agent motion and human poses. GEM generates paired RGB and depth outputs for richer spatial understanding. We introduce autoregressive noise schedules to enable stable long-horizon generations. Our dataset is comprised of 4000+ hours of multimodal data across domains like autonomous driving, egocentric human activities, and drone flights. Pseudo-labels are used to get depth maps, ego-trajectories, and human poses. We use a comprehensive evaluation framework, including a new Control of Object Manipulation (COM) metric, to assess controllability. Experiments show GEM excels at generating diverse, controllable scenarios and temporal consistency over long generations. Code, models, and datasets are fully open-sourced.
Submitted 15 December, 2024;
originally announced December 2024.
-
Optimizing CDN Architectures: Multi-Metric Algorithmic Breakthroughs for Edge and Distributed Performance
Authors:
Md Nurul Absur,
Sourya Saha,
Sifat Nawrin Nova,
Kazi Fahim Ahmad Nasif,
Md Rahat Ul Nasib
Abstract:
A Content Delivery Network (CDN) is a powerful system of distributed caching servers that aims to deliver content such as high-definition video, IoT applications, and ultra-low-latency services quickly and efficiently. This has become of paramount importance in the post-pandemic era. Challenges arise when exponential content volume growth and scalability across different geographic locations are required. This paper investigates data-driven evaluations of CDN algorithms in dynamic server selection for latency reduction, bandwidth throttling for efficient resource management, real-time Round Trip Time analysis for adaptive routing, and programmatic network delay simulation to emulate various conditions. Key performance metrics, such as round-trip time (RTT) and CPU usage, are carefully analyzed to evaluate scalability and algorithmic efficiency through two experimental setups: a constrained edge-like local system and a scalable FABRIC testbed. The statistical validation of RTT trends, alongside CPU utilization, is presented in the results. The optimization process reveals significant trade-offs between scalability and resource consumption, providing actionable insights for effectively deploying and enhancing CDN algorithms in edge and distributed computing environments.
Submitted 12 December, 2024;
originally announced December 2024.
-
Strong Coupling Theory of Superconductivity and Ferroelectric Quantum Criticality in metallic SrTiO$_3$
Authors:
Sudip Kumar Saha,
Maria N. Gastiasoro,
Jonathan Ruhman,
Avraham Klein
Abstract:
Superconductivity in doped SrTiO$_3$ has remained an enduring mystery for over 50 years. The material's status as a ``quantum'' ferroelectric metal, characterized by a soft polar mode, suggests that quantum criticality could play a pivotal role in the emergence of its superconducting state. We show that the system is amenable to a strong coupling (Eliashberg) pairing analysis, with the dominant coupling to the soft mode being a ``dynamical'' Rashba coupling. We compute the expected $T_c$ for the entire phase diagram, all the way to the quantum critical point and beyond. We demonstrate that the linear coupling is sufficient to obtain a rough approximation of the experimentally measured phase diagram, but that nonlinear coupling terms are crucial in reproducing the finer features in the ordered phase. The primary role of nonlinear terms at the peak of the superconducting dome is to enhance the effective linear coupling induced by the broken order, shifting the dome's maximum into the ordered phase. Our theory quantitatively reproduces the three-dimensional experimental phase diagram in the space of carrier density, distance from the quantum critical point and temperature, and allows us to estimate microscopic parameters from the experimental data.
Submitted 6 December, 2024;
originally announced December 2024.
-
Specification-Driven Code Translation Powered by Large Language Models: How Far Are We?
Authors:
Soumit Kanti Saha,
Fazle Rabbi,
Song Wang,
Jinqiu Yang
Abstract:
Large Language Models (LLMs) are increasingly being applied across various domains, including code-related tasks such as code translation. Previous studies have explored using LLMs for translating code between different programming languages. Since LLMs are more effective with natural language, using natural language as an intermediate representation in code translation tasks presents a promising approach. In this work, we investigate using NL-specification as an intermediate representation for code translation. We evaluate our method using three datasets, five popular programming languages, and 29 language pair permutations. Our results show that using NL-specification alone does not lead to performance improvements. However, when combined with source code, it provides a slight improvement over the baseline in certain language pairs. Besides analyzing the performance of code translation, we also investigate the quality of the translated code and provide insights into the issues present in the translated code.
Submitted 5 December, 2024;
originally announced December 2024.
-
FedDUAL: A Dual-Strategy with Adaptive Loss and Dynamic Aggregation for Mitigating Data Heterogeneity in Federated Learning
Authors:
Pranab Sahoo,
Ashutosh Tripathi,
Sriparna Saha,
Samrat Mondal
Abstract:
Federated Learning (FL) marks a transformative approach to distributed model training by combining locally optimized models from various clients into a unified global model. While FL preserves data privacy by eliminating centralized storage, it encounters significant challenges such as performance degradation, slower convergence, and reduced robustness of the global model due to the heterogeneity in client data distributions. Among the various forms of data heterogeneity, label skew emerges as a particularly formidable and prevalent issue, especially in domains such as image classification. To address these challenges, we begin with comprehensive experiments to pinpoint the underlying issues in the FL training process. Based on our findings, we then introduce an innovative dual-strategy approach designed to effectively resolve these issues. First, we introduce an adaptive loss function for client-side training, meticulously crafted to preserve previously acquired knowledge while maintaining an optimal equilibrium between local optimization and global model coherence. Secondly, we develop a dynamic aggregation strategy for aggregating client models at the server. This approach adapts to each client's unique learning patterns, effectively addressing the challenges of diverse data across the network. Our comprehensive evaluation, conducted across three diverse real-world datasets, coupled with theoretical convergence guarantees, demonstrates the superior efficacy of our method compared to several established state-of-the-art approaches.
Submitted 5 December, 2024;
originally announced December 2024.
-
A Granger-Causal Perspective on Gradient Descent with Application to Pruning
Authors:
Aditya Shah,
Aditya Challa,
Sravan Danda,
Archana Mathur,
Snehanshu Saha
Abstract:
Stochastic Gradient Descent (SGD) is the main approach to optimizing neural networks. Several generalization properties of deep networks, such as convergence to flatter minima, are believed to arise from SGD. This article explores the causality aspect of gradient descent. Specifically, we show that the gradient descent procedure has an implicit Granger-causal relationship between the reduction in loss and a change in parameters. By suitable modifications, we make this causal relationship explicit. A causal approach to gradient descent has many significant applications which allow greater control. In this article, we illustrate the significance of the causal approach using the application of pruning. The causal approach to pruning has several interesting properties: (i) We observe a phase shift as the percentage of pruned parameters increases. Such a phase shift is indicative of an optimal pruning strategy. (ii) After pruning, we see that the minima become "flatter", explaining the increase in accuracy after pruning weights.
Submitted 4 December, 2024;
originally announced December 2024.
-
Magnetic field tuned superconducting and normal phase magnetism in CeCo$_{0.5}$Rh$_{0.5}$In$_{5}$
Authors:
A. Howell,
M. Songvilay,
J. A. Rodriguez-Rivera,
Ch. Niedermayer,
Z. Husges,
P. Manuel,
S. Saha,
C. Eckberg,
J. Paglione,
C. Stock
Abstract:
By tuning superconductivity with an applied magnetic field, we use neutrons to compare the magnetically ordered phases in superconducting and normal states of CeCo$_{0.5}$Rh$_{0.5}$In$_{5}$. At zero field, CeCo$_{0.5}$Rh$_{0.5}$In$_{5}$ displays both superconductivity ($T_{c}$=1.3 K) and spatially long-ranged commensurate $\uparrow\downarrow\uparrow\downarrow$ antiferromagnetism ($T_{N}$=2.5 K, $\vec{Q}_{0}=({1\over 2}, {1\over 2}, {1\over 2})$). Neutron spectroscopy fails to measure propagating magnetic excitations, with only temporally overdamped fluctuations observable. On applying a magnetic field, we find anisotropic behavior in the static magnetism. When the field is along the crystallographic $c$-axis, no change in the static magnetic response is observable. However, when the field is oriented within the $a-b$ plane, an increase in $T_{N}$ and a change in the critical response are measured. At low temperatures in the superconducting phase, the elastic magnetic intensity increases linearly ($\propto |H|$) with small $a-b$ oriented fields. However, this trend is interrupted at intermediate fields where commensurate block $\uparrow\uparrow\downarrow\downarrow$ magnetism with propagation vector $\vec{Q}=({1\over 2}, {1\over 2}, {1\over 4})$ forms. For large applied fields in the [1 $\overline{1}$ 0] direction that completely suppress superconductivity, weakly incommensurate magnetic order along $L$ is observed to replace the commensurate response present in the superconducting and vortex phases. We suggest that field-induced incommensurate static magnetism is present in the normal state of superconducting and antiferromagnetic CeCo$_{0.5}$Rh$_{0.5}$In$_{5}$ for $a-b$ plane oriented magnetic fields. We speculate that these field-dependent properties are tied to the field-induced anisotropy associated with the local Ce$^{3+}$ crystal field environment of the tetragonal `115' structure.
Submitted 27 November, 2024;
originally announced November 2024.
-
Gluon contribution to the angular momentum distribution of a dressed quark state
Authors:
Asmita Mukherjee,
Sudeep Saha,
Ravi Singh
Abstract:
We compute the contribution of the gluonic component of the energy-momentum tensor (EMT) to the angular momentum density in various decompositions. We use the light-front Hamiltonian technique and a two-component formalism in light-front gauge, where the constrained degrees of freedom are eliminated. Instead of a nucleon, we consider a simple composite spin-$1/2$ state, namely a quark dressed with a gluon. We present two-dimensional light-front distributions in transverse impact parameter space, and compare the different angular momentum decompositions at the density level. Incorporating also the contribution coming from the quark part of the EMT, we verify the spin sum rule for such a state.
Submitted 27 November, 2024;
originally announced November 2024.
-
Machine Learning and Multi-source Remote Sensing in Forest Carbon Stock Estimation: A Review
Authors:
Autumn Nguyen,
Sulagna Saha
Abstract:
Quantifying forest carbon is crucial for informing decisions and policies that will protect the planet. Machine learning (ML) and remote sensing (RS) techniques have been used to do this task more effectively, yet a systematic review of the most recent ML methods and RS combinations, especially one that considers forest characteristics, is lacking. This study systematically analyzed 25 papers meeting strict inclusion criteria from over 80 related studies, identifying 28 ML methods and key combinations of RS data. Random Forest appeared most frequently (88\% of studies), while Extreme Gradient Boosting showed superior performance in 75\% of the studies in which it was compared with other methods. Sentinel-1 emerged as the most utilized remote sensing source, with multi-sensor approaches (e.g., Sentinel-1, Sentinel-2, and LiDAR) proving especially effective. Our findings provide grounds for recommending best practices in integrating machine learning and remote sensing for accurate and scalable forest carbon stock estimation.
Submitted 26 November, 2024;
originally announced November 2024.
-
Towards Robust Evaluation of Unlearning in LLMs via Data Transformations
Authors:
Abhinav Joshi,
Shaswati Saha,
Divyaksh Shukla,
Sriram Vema,
Harsh Jhamtani,
Manas Gaur,
Ashutosh Modi
Abstract:
Large Language Models (LLMs) have proven highly successful in a wide range of applications, from regular NLP-based use cases to AI agents. LLMs are trained on a vast corpus of texts from various sources; despite the best efforts during the data pre-processing stage of training, they may pick up some undesirable information such as personally identifiable information (PII). Consequently, research in the area of Machine Unlearning (MUL) has recently become active; the main idea is to force LLMs to forget (unlearn) certain information (e.g., PII) without suffering from performance loss on regular tasks. In this work, we examine the robustness of the existing MUL techniques for their ability to enable leakage-proof forgetting in LLMs. In particular, we examine the effect of data transformation on forgetting, i.e., is an unlearned LLM able to recall forgotten information if there is a change in the format of the input? Our findings on the TOFU dataset highlight the necessity of using diverse data formats to quantify unlearning in LLMs more reliably.
Submitted 23 November, 2024;
originally announced November 2024.
-
The role of inducible defence in ecological models: Effects of nonlocal intraspecific competitions
Authors:
Sangeeta Saha,
Swadesh Pal,
Roderick Melnik
Abstract:
Phenotypic plasticity is a key factor in driving the evolution of species in the predator-prey interaction. The natural environment is replete with phenotypic plasticity, which is the source of inducible defences against predators, including concealment, cave-dwelling, mimicry, evasion, and revenge. In this work, a predator-prey model is proposed where the prey species shows inducible defence against its predators. The dynamics produce a wide range of non-trivial and impactful results, including the stabilizing effect of the defence mechanism. The model is also analyzed in the presence of spatio-temporal diffusion in a bounded domain. The numerical simulation shows that the Turing domain shrinks as the defence level increases. The work is extended further by introducing a nonlocal term in the intra-specific competition of the prey species. The Turing instability condition has been studied for the local model around the coexisting steady state, followed by the Turing and non-Turing patterns in the presence of the nonlocal interaction term. The work reveals how an increase in inducible defence reduces the Turing domain in the local interaction model but expands it when the range of nonlocal interactions is extended, suggesting a higher likelihood of species colonization.
Submitted 15 November, 2024;
originally announced November 2024.
-
NGTS-33b: A Young Super-Jupiter Hosted by a Fast Rotating Massive Hot Star
Authors:
Douglas R. Alves,
James S. Jenkins,
Jose I. Vines,
Matthew P. Battley,
Monika Lendl,
François Bouchy,
Louise D. Nielsen,
Samuel Gill,
Maximiliano Moyano,
D. R. Anderson,
Matthew R. Burleigh,
Sarah L. Casewell,
Michael R. Goad,
Faith Hawthorn,
Alicia Kendall,
James McCormac,
Ares Osborn,
Alexis M. S. Smith,
Stephane Udry,
Peter J. Wheatley,
Suman Saha,
Lena Parc,
Arianna Nigioni,
Ioannis Apergis,
Gavin Ramsay
Abstract:
In the last few decades, planet search surveys have focused on solar-type stars, and only recently on the high-mass regime. This is mostly due to challenges arising from the lack of instrumental precision and, more importantly, the inherently active nature of fast rotating massive stars. Here we report NGTS-33b (TOI-6442b), a super-Jupiter planet with mass, radius and orbital period of 3.6 $\pm$ 0.3 M$_{\rm jup}$, 1.64 $\pm$ 0.07 R$_{\rm jup}$ and $2.827972 \pm 0.000001$ days, respectively. The host is a fast rotating ($0.6654 \pm 0.0006$ day) and hot (T$_{\rm eff}$ = 7437 $\pm$ 72 K) A9V type star, with a mass and radius of 1.60 $\pm$ 0.11 M$_{\odot}$ and 1.47 $\pm$ 0.06 R$_{\odot}$, respectively. Planet structure and gyrochronology models show that NGTS-33 is also very young, with age limits of 10-50 Myr. In addition, membership analysis points towards the star being part of the Vela OB2 association, which has an age of $\sim$ 20-35 Myr, thus providing further evidence of the young nature of NGTS-33. Its low bulk density of 0.19$\pm$0.03 g cm$^{-3}$ is 13$\%$ smaller than expected when compared to transiting hot Jupiters with similar masses. This cannot be explained solely by its age, given that planet structure models predict an atmosphere inflated by up to 15$\%$. Finally, we found that its emission spectroscopy metric is similar to JWST community targets, making the planet an interesting target for atmospheric follow-up. Therefore, NGTS-33b's discovery will not only add to the scarce population of young, massive and hot Jupiters, but will also help place further strong constraints on current formation and evolution models for such planetary systems.
Submitted 13 November, 2024;
originally announced November 2024.
-
Study of baryon-strangeness and charge-strangeness correlations in Pb$-$Pb collisions at $\sqrt{s_\mathrm{NN}}$ = 5.02 TeV with ALICE
Authors:
Swati Saha
Abstract:
In the quest to unravel the mysteries of the strong force and the underlying properties of the quark-gluon plasma, the ALICE collaboration at CERN has carried out a comprehensive study focusing on the correlations between net-conserved quantities such as net-baryon, net-charge and net-strangeness. These correlations play a crucial role in the study of QCD phase structure as they are closely related to the ratios of thermodynamic susceptibilities in lattice QCD calculations. This work mainly focuses on the correlations between net-kaon and net-proton, and net-kaon and net-charge in Pb$-$Pb collisions at $\sqrt{s_\mathrm{NN}} = 5.02$ TeV using data recorded during LHC Run 2. The net-proton and net-kaon serve as proxies for net-baryon and net-strangeness, respectively, with measurements analyzed as a function of collision centrality. Theoretical predictions from the Thermal-FIST model are compared with experimental results, providing insights into the effects of resonance decays and charge conservation laws on the correlations.
Submitted 11 November, 2024;
originally announced November 2024.
-
Machine learning for prediction of dose-volume histograms of organs-at-risk in prostate cancer from simple structure volume parameters
Authors:
Saheli Saha,
Debasmita Banerjee,
Rishi Ram,
Gowtham Reddy,
Debashree Guha,
Arnab Sarkar,
Bapi Dutta,
Moses ArunSingh S,
Suman Chakraborty,
Indranil Mallick
Abstract:
Dose prediction is an area of ongoing research that facilitates radiotherapy planning. Most commercial models utilise imaging data and intense computing resources. This study aimed to predict the dose-volume of rectum and bladder from volumes of target, at-risk structure organs and their overlap regions using machine learning. Dose-volume information of 94 patients with prostate cancer planned for 6000cGy in 20 fractions was exported from the treatment planning system as text files and mined to create a training dataset. Several statistical modelling and machine learning methods, and a new fuzzy rule-based prediction (FRBP) model, were explored and validated on an independent dataset of 39 patients. The median absolute error was 2.0%-3.7% for bladder and 1.7%-2.4% for rectum in the 4000-6420cGy range. For 5300cGy, 5600cGy and 6000cGy, the median difference was less than 2.5% for rectum and 3.8% for bladder. The FRBP model produced errors of 1.2%, 1.3%, 0.9% and 1.6%, 1.2%, 0.1% for the rectum and bladder, respectively, at these dose levels. These findings indicate the feasibility of obtaining accurate predictions of the clinically important dose-volume parameters for rectum and bladder using just the volumes of these structures.
Submitted 8 November, 2024;
originally announced November 2024.
-
Beyond Grid Data: Exploring Graph Neural Networks for Earth Observation
Authors:
Shan Zhao,
Zhaiyu Chen,
Zhitong Xiong,
Yilei Shi,
Sudipan Saha,
Xiao Xiang Zhu
Abstract:
Earth Observation (EO) data analysis has been significantly revolutionized by deep learning (DL), with applications typically limited to grid-like data structures. Graph Neural Networks (GNNs) emerge as an important innovation, propelling DL into the non-Euclidean domain. Naturally, GNNs can effectively tackle the challenges posed by diverse modalities, multiple sensors, and the heterogeneous nature of EO data. To introduce GNNs in the related domains, our review begins by offering fundamental knowledge on GNNs. Then, we summarize the generic problems in EO, to which GNNs can offer potential solutions. Following this, we explore a broad spectrum of GNNs' applications to scientific problems in Earth systems, covering areas such as weather and climate analysis, disaster management, air quality monitoring, agriculture, land cover classification, hydrological process modeling, and urban modeling. The rationale behind adopting GNNs in these fields is explained, alongside methodologies for organizing graphs and designing favorable architectures for various tasks. Furthermore, we highlight methodological challenges of implementing GNNs in these domains and possible solutions that could guide future research. While acknowledging that GNNs are not a universal solution, we conclude the paper by comparing them with other popular architectures like transformers and analyzing their potential synergies.
Submitted 6 November, 2024; v1 submitted 5 November, 2024;
originally announced November 2024.
-
Search for a Hidden Sector Scalar from Kaon Decay in the Di-Muon Final State at ICARUS
Authors:
ICARUS Collaboration,
F. Abd Alrahman,
P. Abratenko,
N. Abrego-Martinez,
A. Aduszkiewicz,
F. Akbar,
L. Aliaga Soplin,
R. Alvarez Garrote,
M. Artero Pons,
J. Asaadi,
W. F. Badgett,
B. Baibussinov,
B. Behera,
V. Bellini,
R. Benocci,
J. Berger,
S. Berkman,
S. Bertolucci,
M. Betancourt,
M. Bonesini,
T. Boone,
B. Bottino,
A. Braggiotti,
D. Brailsford,
S. J. Brice
, et al. (170 additional authors not shown)
Abstract:
We present a search for long-lived particles (LLPs) produced from kaon decay that decay to two muons inside the ICARUS neutrino detector. This channel would be a signal of hidden sector models that can address outstanding issues in particle physics such as the strong CP problem and the microphysical origin of dark matter. The search is performed with data collected in the Neutrinos at the Main Injector (NuMI) beam at Fermilab corresponding to $2.41\times 10^{20}$ protons-on-target. No new physics signal is observed, and we set world-leading limits on heavy QCD axions, as well as for the Higgs portal scalar among dedicated searches. Limits are also presented in a model-independent way applicable to any new physics model predicting the process $K\to π+S(\toμμ)$, for a long-lived particle S. This result is the first search for new physics performed with the ICARUS detector at Fermilab. It paves the way for the future program of long-lived particle searches at ICARUS.
Submitted 10 June, 2025; v1 submitted 4 November, 2024;
originally announced November 2024.
-
Proton induced reaction on $^{108}$Cd for astrophysical p-process studies
Authors:
Sukhendu Saha,
Dipali Basak,
Tanmoy Bar,
Lalit Kumar Sahoo,
Jagannath Datta,
Sandipan Dasgupta,
Norikazu Kinoshita,
Chinmay Basu
Abstract:
The proton capture cross-section of the least abundant proton-rich stable isotope of cadmium, $^{108}$Cd (abundance 0.89\%), has been measured near the Gamow window corresponding to a temperature range of 3-4 GK. The measurement of the $^{108}$Cd(p,$γ$)$^{109}$In reaction was carried out using the activation technique. The cross-section at the lowest energy point of 3T$_9$, E$_p$$^{lab}$= 2.28 MeV, has been reported for the first time. The astrophysical S-factor was measured in the energy range relevant to the astrophysical p-process, between E$_p$$^{cm}$= 2.29 and 6.79 MeV. The experimental results have been compared with theoretical predictions of Hauser-Feshbach statistical model calculations using TALYS-1.96. A calculated proton-optical potential was implemented to achieve better fitting, with different combinations of available nuclear level densities (NLDs) and $γ$-ray strength functions in TALYS-1.96. The calculations provided satisfactory agreement with the experimental results. The reaction rate was calculated using the calculated potential in TALYS-1.96 and compared with the values provided in the REACLIB database.
Submitted 2 November, 2024;
originally announced November 2024.
-
Face Anonymization Made Simple
Authors:
Han-Wei Kung,
Tuomas Varanka,
Sanjay Saha,
Terence Sim,
Nicu Sebe
Abstract:
Current face anonymization techniques often depend on identity loss calculated by face recognition models, which can be inaccurate and unreliable. Additionally, many methods require supplementary data such as facial landmarks and masks to guide the synthesis process. In contrast, our approach uses diffusion models with only a reconstruction loss, eliminating the need for facial landmarks or masks while still producing images with intricate, fine-grained details. We validated our results on two public benchmarks through both quantitative and qualitative evaluations. Our model achieves state-of-the-art performance in three key areas: identity anonymization, facial attribute preservation, and image quality. Beyond its primary function of anonymization, our model can also perform face swapping tasks by incorporating an additional facial image as input, demonstrating its versatility and potential for diverse applications. Our code and models are available at https://github.com/hanweikung/face_anon_simple .
Submitted 1 November, 2024;
originally announced November 2024.