-
Joint Audio-Visual Idling Vehicle Detection with Streamlined Input Dependencies
Authors:
Xiwen Li,
Rehman Mohammed,
Tristalee Mangin,
Surojit Saha,
Ross T Whitaker,
Kerry E. Kelly,
Tolga Tasdizen
Abstract:
Idling vehicle detection (IVD) can be helpful in monitoring and reducing unnecessary idling and can be integrated into real-time systems to address the resulting pollution and harmful products. The previous approach [13], a non-end-to-end model, requires extra user clicks to specify a part of the input, making system deployment more error-prone or even not feasible. In contrast, we introduce an en…
▽ More
Idling vehicle detection (IVD) can be helpful in monitoring and reducing unnecessary idling and can be integrated into real-time systems to address the resulting pollution and harmful products. The previous approach [13], a non-end-to-end model, requires extra user clicks to specify a part of the input, making system deployment more error-prone or even not feasible. In contrast, we introduce an end-to-end joint audio-visual IVD task designed to detect vehicles visually under three states: moving, idling and engine off. Unlike feature co-occurrence task such as audio-visual vehicle tracking, our IVD task addresses complementary features, where labels cannot be determined by a single modality alone. To this end, we propose AVIVD-Net, a novel network that integrates audio and visual features through a bidirectional attention mechanism. AVIVD-Net streamlines the input process by learning a joint feature space, reducing the deployment complexity of previous methods. Additionally, we introduce the AVIVD dataset, which is seven times larger than previous datasets, offering significantly more annotated samples to study the IVD problem. Our model achieves performance comparable to prior approaches, making it suitable for automated deployment. Furthermore, by evaluating AVIVDNet on the feature co-occurrence public dataset MAVD [23], we demonstrate its potential for extension to self-driving vehicle video-camera setups.
△ Less
Submitted 28 October, 2024;
originally announced October 2024.
-
Hidden Conformal Symmetry of the Discrete Series Scalars in dS$_2$
Authors:
Kara Farnsworth,
Kurt Hinterbichler,
Samanta Saha
Abstract:
In $D$ dimensional de Sitter space, a scalar field has an infinite tower of special tachyonic mass values at which enhanced shift symmetries appear. After modding out by these shift symmetries, these fields correspond to the unitary irreducible representations of the de Sitter group known as the discrete series. We show that in $D=2$ these theories have global conformal symmetry. In all but the ma…
▽ More
In $D$ dimensional de Sitter space, a scalar field has an infinite tower of special tachyonic mass values at which enhanced shift symmetries appear. After modding out by these shift symmetries, these fields correspond to the unitary irreducible representations of the de Sitter group known as the discrete series. We show that in $D=2$ these theories have global conformal symmetry. In all but the massless case, these theories have no stress tensor and the conformal symmetry does not act in the usual way on the scalar field. We find the conformal symmetry by explicitly computing the correlators of the shift invariant local operators and showing that they take conformally invariant forms. We also demonstrate how these fields are self dual in $D=2$, and dual to the shift invariant massive vector fields, which are therefore also conformally invariant.
△ Less
Submitted 24 October, 2024;
originally announced October 2024.
-
Shape evolution in even-mass $^{98-104}$Zr isotopes via lifetime measurements using the $γγ$-coincidence technique
Authors:
G. Pasqualato,
S. Ansari,
J. S. Heines,
V. Modamio,
A. Görgen,
W. Korten,
J. Ljungvall,
E. Clément,
J. Dudouet,
A. Lemasson,
T. R. Rodríguez,
J. M. Allmond,
T. Arici,
K. S. Beckmann,
A. M. Bruce,
D. Doherty,
A. Esmaylzadeh,
E. R. Gamba,
L. Gerhard,
J. Gerl,
G. Georgiev,
D. P. Ivanova,
J. Jolie,
Y. -H. Kim,
L. Knafla
, et al. (60 additional authors not shown)
Abstract:
The Zirconium (Z = 40) isotopic chain has attracted interest for more than four decades. The abrupt lowering of the energy of the first $2^+$ state and the increase in the transition strength B(E2; $2_1^\rightarrow 0_1^+$ going from $^{98}$Zr to $^{100}$Zr has been the first example of "quantum phase transition" in nuclear shapes, which has few equivalents in the nuclear chart. Although a multitud…
▽ More
The Zirconium (Z = 40) isotopic chain has attracted interest for more than four decades. The abrupt lowering of the energy of the first $2^+$ state and the increase in the transition strength B(E2; $2_1^\rightarrow 0_1^+$ going from $^{98}$Zr to $^{100}$Zr has been the first example of "quantum phase transition" in nuclear shapes, which has few equivalents in the nuclear chart. Although a multitude of experiments have been performed to measure nuclear properties related to nuclear shapes and collectivity in the region, none of the measured lifetimes were obtained using the Recoil Distance Doppler Shift method in the $γγ$-coincidence mode where a gate on the direct feeding transition of the state of interest allows a strict control of systematical errors. This work reports the results of lifetime measurements for the first yrast excited states in $^{98-104}$Zr carried out to extract reduced transition probabilities. The new lifetime values in $γγ$-coincidence and $γ$-single mode are compared with the results of former experiments. Recent predictions of the Interacting Boson Model with Configuration Mixing, the Symmetry Conserving Configuration Mixing model based on the Hartree-Fock-Bogoliubov approach and the Monte Carlo Shell Model are presented and compared with the experimental data.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Search for gravitational waves emitted from SN 2023ixf
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
D. Agarwal,
M. Agathos,
M. Aghaei Abchouyeh,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné,
A. Allocca
, et al. (1758 additional authors not shown)
Abstract:
We present the results of a search for gravitational-wave transients associated with core-collapse supernova SN 2023ixf, which was observed in the galaxy Messier 101 via optical emission on 2023 May 19th, during the LIGO-Virgo-KAGRA 15th Engineering Run. We define a five-day on-source window during which an accompanying gravitational-wave signal may have occurred. No gravitational waves have been…
▽ More
We present the results of a search for gravitational-wave transients associated with core-collapse supernova SN 2023ixf, which was observed in the galaxy Messier 101 via optical emission on 2023 May 19th, during the LIGO-Virgo-KAGRA 15th Engineering Run. We define a five-day on-source window during which an accompanying gravitational-wave signal may have occurred. No gravitational waves have been identified in data when at least two gravitational-wave observatories were operating, which covered $\sim 14\%$ of this five-day window. We report the search detection efficiency for various possible gravitational-wave emission models. Considering the distance to M101 (6.7 Mpc), we derive constraints on the gravitational-wave emission mechanism of core-collapse supernovae across a broad frequency spectrum, ranging from 50 Hz to 2 kHz where we assume the gravitational-wave emission occurred when coincident data are available in the on-source window. Considering an ellipsoid model for a rotating proto-neutron star, our search is sensitive to gravitational-wave energy $1 \times 10^{-4} M_{\odot} c^2$ and luminosity $2.6 \times 10^{-4} M_{\odot} c^2/s$ for a source emitting at 82 Hz. These constraints are around an order of magnitude more stringent than those obtained so far with gravitational-wave data. The constraint on the ellipticity of the proto-neutron star that is formed is as low as 1.08, at frequencies above 1200 Hz, surpassing past results.
△ Less
Submitted 11 March, 2025; v1 submitted 21 October, 2024;
originally announced October 2024.
-
Modeling the Human Visual System: Comparative Insights from Response-Optimized and Task-Optimized Vision Models, Language Models, and different Readout Mechanisms
Authors:
Shreya Saha,
Ishaan Chadha,
Meenakshi khosla
Abstract:
Over the past decade, predictive modeling of neural responses in the primate visual system has advanced significantly, largely driven by various DNN approaches. These include models optimized directly for visual recognition, cross-modal alignment through contrastive objectives, neural response prediction from scratch, and large language model embeddings.Likewise, different readout mechanisms, rang…
▽ More
Over the past decade, predictive modeling of neural responses in the primate visual system has advanced significantly, largely driven by various DNN approaches. These include models optimized directly for visual recognition, cross-modal alignment through contrastive objectives, neural response prediction from scratch, and large language model embeddings.Likewise, different readout mechanisms, ranging from fully linear to spatial-feature factorized methods have been explored for mapping network activations to neural responses. Despite the diversity of these approaches, it remains unclear which method performs best across different visual regions. In this study, we systematically compare these approaches for modeling the human visual system and investigate alternative strategies to improve response predictions. Our findings reveal that for early to mid-level visual areas, response-optimized models with visual inputs offer superior prediction accuracy, while for higher visual regions, embeddings from LLMs based on detailed contextual descriptions of images and task-optimized models pretrained on large vision datasets provide the best fit. Through comparative analysis of these modeling approaches, we identified three distinct regions in the visual cortex: one sensitive primarily to perceptual features of the input that are not captured by linguistic descriptions, another attuned to fine-grained visual details representing semantic information, and a third responsive to abstract, global meanings aligned with linguistic content. We also highlight the critical role of readout mechanisms, proposing a novel scheme that modulates receptive fields and feature maps based on semantic content, resulting in an accuracy boost of 3-23% over existing SOTAs for all models and brain regions. Together, these findings offer key insights into building more precise models of the visual system.
△ Less
Submitted 13 December, 2024; v1 submitted 17 October, 2024;
originally announced October 2024.
-
Markov Random Fields with Proximity Constraints for Spatial Data
Authors:
Sudipto Saha,
Jonathan R. Bradley
Abstract:
The conditional autoregressive (CAR) model, simultaneous autoregressive (SAR) model, and its variants have become the predominant strategies for modeling regional or areal-referenced spatial data. The overwhelming wide-use of the CAR/SAR model motivates the need for new classes of models for areal-referenced data. Thus, we develop a novel class of Markov random fields based on truncating the full-…
▽ More
The conditional autoregressive (CAR) model, simultaneous autoregressive (SAR) model, and its variants have become the predominant strategies for modeling regional or areal-referenced spatial data. The overwhelming wide-use of the CAR/SAR model motivates the need for new classes of models for areal-referenced data. Thus, we develop a novel class of Markov random fields based on truncating the full-conditional distribution. We define this truncation in two ways leading to versions of what we call the truncated autoregressive (TAR) model. First, we truncate the full conditional distribution so that a response at one location is close to the average of its neighbors. This strategy establishes relationships between TAR and CAR. Second, we truncate on the joint distribution of the data process in a similar way. This specification leads to connection between TAR and SAR model. Our Bayesian implementation does not use Markov chain Monte Carlo (MCMC) for Bayesian computation, and generates samples directly from the posterior distribution. Moreover, TAR does not have a range parameter that arises in the CAR/SAR models, which can be difficult to learn. We present the results of the proposed truncated autoregressive model on several simulated datasets and on a dataset of average property prices.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
A search using GEO600 for gravitational waves coincident with fast radio bursts from SGR 1935+2154
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
D. Agarwal,
M. Agathos,
M. Aghaei Abchouyeh,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné
, et al. (1758 additional authors not shown)
Abstract:
The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by…
▽ More
The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by CHIME/FRB, as well as X-ray glitches and X-ray bursts detected by NICER and NuSTAR close to the time of one of the FRBs. We do not detect any significant GW emission from any of the events. Instead, using a short-duration GW search (for bursts $\leq$ 1 s) we derive 50\% (90\%) upper limits of $10^{48}$ ($10^{49}$) erg for GWs at 300 Hz and $10^{49}$ ($10^{50}$) erg at 2 kHz, and constrain the GW-to-radio energy ratio to $\leq 10^{14} - 10^{16}$. We also derive upper limits from a long-duration search for bursts with durations between 1 and 10 s. These represent the strictest upper limits on concurrent GW emission from FRBs.
△ Less
Submitted 21 May, 2025; v1 submitted 11 October, 2024;
originally announced October 2024.
-
Towards Model Discovery Using Domain Decomposition and PINNs
Authors:
Tirtho S. Saha,
Alexander Heinlein,
Cordula Reisch
Abstract:
We enhance machine learning algorithms for learning model parameters in complex systems represented by ordinary differential equations (ODEs) with domain decomposition methods. The study evaluates the performance of two approaches, namely (vanilla) Physics-Informed Neural Networks (PINNs) and Finite Basis Physics-Informed Neural Networks (FBPINNs), in learning the dynamics of test models with a qu…
▽ More
We enhance machine learning algorithms for learning model parameters in complex systems represented by ordinary differential equations (ODEs) with domain decomposition methods. The study evaluates the performance of two approaches, namely (vanilla) Physics-Informed Neural Networks (PINNs) and Finite Basis Physics-Informed Neural Networks (FBPINNs), in learning the dynamics of test models with a quasi-stationary longtime behavior. We test the approaches for data sets in different dynamical regions and with varying noise level. As results, we find a better performance for the FBPINN approach compared to the vanilla PINN approach, even in cases with data from only a quasi-stationary time domain with few dynamics.
△ Less
Submitted 2 October, 2024;
originally announced October 2024.
-
The track-length extension fitting algorithm for energy measurement of interacting particles in liquid argon TPCs and its performance with ProtoDUNE-SP data
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
F. Akbar,
N. S. Alex,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
H. Amar,
P. Amedo,
J. Anderson,
C. Andreopoulos
, et al. (1348 additional authors not shown)
Abstract:
This paper introduces a novel track-length extension fitting algorithm for measuring the kinetic energies of inelastically interacting particles in liquid argon time projection chambers (LArTPCs). The algorithm finds the most probable offset in track length for a track-like object by comparing the measured ionization density as a function of position with a theoretical prediction of the energy los…
▽ More
This paper introduces a novel track-length extension fitting algorithm for measuring the kinetic energies of inelastically interacting particles in liquid argon time projection chambers (LArTPCs). The algorithm finds the most probable offset in track length for a track-like object by comparing the measured ionization density as a function of position with a theoretical prediction of the energy loss as a function of the energy, including models of electron recombination and detector response. The algorithm can be used to measure the energies of particles that interact before they stop, such as charged pions that are absorbed by argon nuclei. The algorithm's energy measurement resolutions and fractional biases are presented as functions of particle kinetic energy and number of track hits using samples of stopping secondary charged pions in data collected by the ProtoDUNE-SP detector, and also in a detailed simulation. Additional studies describe the impact of the dE/dx model on energy measurement performance. The method described in this paper to characterize the energy measurement performance can be repeated in any LArTPC experiment using stopping secondary charged pions.
△ Less
Submitted 26 December, 2024; v1 submitted 26 September, 2024;
originally announced September 2024.
-
For a flat Universe, $C_P/C_V=-q$ : another coincidence in Cosmology?
Authors:
Somnath Saha,
Subhajit Saha,
Nilanjana Mahata
Abstract:
This paper deals with gravitational thermodynamics on the dynamical apparent horizon of an FLRW universe with dissipation. The dissipation is assumed to arise due to adiabatic gravitational particle creation. For the thermodynamic study, we consider the Bekenstein-Hawking formalism and also assume a nonzero curvature $κ$ for a general study. In particular, we study the unified first law, the gener…
▽ More
This paper deals with gravitational thermodynamics on the dynamical apparent horizon of an FLRW universe with dissipation. The dissipation is assumed to arise due to adiabatic gravitational particle creation. For the thermodynamic study, we consider the Bekenstein-Hawking formalism and also assume a nonzero curvature $κ$ for a general study. In particular, we study the unified first law, the generalized second law, and thermodynamic stability in our model. The specific heat capacities are taken into account for the study of thermodynamic stability. Our study reveals a nice result! The ratio of the specific heat capacity at constant pressure and that at constant volume in a flat FLRW universe with dissipation is nothing but the negative of the deceleration parameter. In classical thermodynamics, this ratio is known as the isentropic expansion factor or (for ideal gases) the adiabatic index. A more interesting fact that has come to light is that this relation is independent of the cosmological model used. So, this is actually a generic result in Big Bang Cosmology. We discuss the implications of this result on the evolution of the Universe. Finally, we determine the constraints on the effective equation of state and the particle creation rate which guarantees thermodynamic stability in our model.
△ Less
Submitted 21 February, 2025; v1 submitted 23 September, 2024;
originally announced September 2024.
-
TCG CREST System Description for the Second DISPLACE Challenge
Authors:
Nikhil Raghav,
Subhajit Saha,
Md Sahidullah,
Swagatam Das
Abstract:
In this report, we describe the speaker diarization (SD) and language diarization (LD) systems developed by our team for the Second DISPLACE Challenge, 2024. Our contributions were dedicated to Track 1 for SD and Track 2 for LD in multilingual and multi-speaker scenarios. We investigated different speech enhancement techniques, voice activity detection (VAD) techniques, unsupervised domain categor…
▽ More
In this report, we describe the speaker diarization (SD) and language diarization (LD) systems developed by our team for the Second DISPLACE Challenge, 2024. Our contributions were dedicated to Track 1 for SD and Track 2 for LD in multilingual and multi-speaker scenarios. We investigated different speech enhancement techniques, voice activity detection (VAD) techniques, unsupervised domain categorization, and neural embedding extraction architectures. We also exploited the fusion of various embedding extraction models. We implemented our system with the open-source SpeechBrain toolkit. Our final submissions use spectral clustering for both the speaker and language diarization. We achieve about $7\%$ relative improvement over the challenge baseline in Track 1. We did not obtain improvement over the challenge baseline in Track 2.
△ Less
Submitted 16 September, 2024;
originally announced September 2024.
-
Intelligent Routing Algorithm over SDN: Reusable Reinforcement Learning Approach
Authors:
Wang Wumian,
Sajal Saha,
Anwar Haque,
Greg Sidebottom
Abstract:
Traffic routing is vital for the proper functioning of the Internet. As users and network traffic increase, researchers try to develop adaptive and intelligent routing algorithms that can fulfill various QoS requirements. Reinforcement Learning (RL) based routing algorithms have shown better performance than traditional approaches. We developed a QoS-aware, reusable RL routing algorithm, RLSR-Rout…
▽ More
Traffic routing is vital for the proper functioning of the Internet. As users and network traffic increase, researchers try to develop adaptive and intelligent routing algorithms that can fulfill various QoS requirements. Reinforcement Learning (RL) based routing algorithms have shown better performance than traditional approaches. We developed a QoS-aware, reusable RL routing algorithm, RLSR-Routing over SDN. During the learning process, our algorithm ensures loop-free path exploration. While finding the path for one traffic demand (a source destination pair with certain amount of traffic), RLSR-Routing learns the overall network QoS status, which can be used to speed up algorithm convergence when finding the path for other traffic demands. By adapting Segment Routing, our algorithm can achieve flow-based, source packet routing, and reduce communications required between SDN controller and network plane. Our algorithm shows better performance in terms of load balancing than the traditional approaches. It also has faster convergence than the non-reusable RL approach when finding paths for multiple traffic demands.
△ Less
Submitted 23 September, 2024;
originally announced September 2024.
-
Region Mixup
Authors:
Saptarshi Saha,
Utpal Garain
Abstract:
This paper introduces a simple extension of mixup (Zhang et al., 2018) data augmentation to enhance generalization in visual recognition tasks. Unlike the vanilla mixup method, which blends entire images, our approach focuses on combining regions from multiple images.
This paper introduces a simple extension of mixup (Zhang et al., 2018) data augmentation to enhance generalization in visual recognition tasks. Unlike the vanilla mixup method, which blends entire images, our approach focuses on combining regions from multiple images.
△ Less
Submitted 23 September, 2024;
originally announced September 2024.
-
Overcoming Data Limitations in Internet Traffic Forecasting: LSTM Models with Transfer Learning and Wavelet Augmentation
Authors:
Sajal Saha,
Anwar Haque,
Greg Sidebottom
Abstract:
Effective internet traffic prediction in smaller ISP networks is challenged by limited data availability. This paper explores this issue using transfer learning and data augmentation techniques with two LSTM-based models, LSTMSeq2Seq and LSTMSeq2SeqAtn, initially trained on a comprehensive dataset provided by Juniper Networks and subsequently applied to smaller datasets. The datasets represent rea…
▽ More
Effective internet traffic prediction in smaller ISP networks is challenged by limited data availability. This paper explores this issue using transfer learning and data augmentation techniques with two LSTM-based models, LSTMSeq2Seq and LSTMSeq2SeqAtn, initially trained on a comprehensive dataset provided by Juniper Networks and subsequently applied to smaller datasets. The datasets represent real internet traffic telemetry, offering insights into diverse traffic patterns across different network domains. Our study revealed that while both models performed well in single-step predictions, multi-step forecasts were challenging, particularly in terms of long-term accuracy. In smaller datasets, LSTMSeq2Seq generally outperformed LSTMSeq2SeqAtn, indicating that higher model complexity does not necessarily translate to better performance. The models' effectiveness varied across different network domains, reflecting the influence of distinct traffic characteristics. To address data scarcity, Discrete Wavelet Transform was used for data augmentation, leading to significant improvements in model performance, especially in shorter-term forecasts. Our analysis showed that data augmentation is crucial in scenarios with limited data. Additionally, the study included an analysis of the models' variability and consistency, with attention mechanisms in LSTMSeq2SeqAtn providing better short-term forecasting consistency but greater variability in longer forecasts. The results highlight the benefits and limitations of different modeling approaches in traffic prediction. Overall, this research underscores the importance of transfer learning and data augmentation in enhancing the accuracy of traffic prediction models, particularly in smaller ISP networks with limited data availability.
△ Less
Submitted 19 September, 2024;
originally announced September 2024.
-
ConvLSTMTransNet: A Hybrid Deep Learning Approach for Internet Traffic Telemetry
Authors:
Sajal Saha,
Saikat Das,
Glaucio H. S. Carvalho
Abstract:
In this paper, we present a novel hybrid deep learning model, named ConvLSTMTransNet, designed for time series prediction, with a specific application to internet traffic telemetry. This model integrates the strengths of Convolutional Neural Networks (CNNs), Long Short-Term Memory (LSTM) networks, and Transformer encoders to capture complex spatial-temporal relationships inherent in time series da…
▽ More
In this paper, we present a novel hybrid deep learning model, named ConvLSTMTransNet, designed for time series prediction, with a specific application to internet traffic telemetry. This model integrates the strengths of Convolutional Neural Networks (CNNs), Long Short-Term Memory (LSTM) networks, and Transformer encoders to capture complex spatial-temporal relationships inherent in time series data. The ConvLSTMTransNet model was evaluated against three baseline models: RNN, LSTM, and Gated Recurrent Unit (GRU), using real internet traffic data sampled from high-speed ports on a provider edge router. Performance metrics such as Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and Weighted Absolute Percentage Error (WAPE) were used to assess each model's accuracy. Our findings demonstrate that ConvLSTMTransNet significantly outperforms the baseline models by approximately 10% in terms of prediction accuracy. ConvLSTMTransNet surpasses traditional models due to its innovative architectural features, which enhance its ability to capture temporal dependencies and extract spatial features from internet traffic data. Overall, these findings underscore the importance of employing advanced architectures tailored to the complexities of internet traffic data for achieving more precise predictions.
△ Less
Submitted 19 September, 2024;
originally announced September 2024.
-
An Adaptive End-to-End IoT Security Framework Using Explainable AI and LLMs
Authors:
Sudipto Baral,
Sajal Saha,
Anwar Haque
Abstract:
The exponential growth of the Internet of Things (IoT) has significantly increased the complexity and volume of cybersecurity threats, necessitating the development of advanced, scalable, and interpretable security frameworks. This paper presents an innovative, comprehensive framework for real-time IoT attack detection and response that leverages Machine Learning (ML), Explainable AI (XAI), and La…
▽ More
The exponential growth of the Internet of Things (IoT) has significantly increased the complexity and volume of cybersecurity threats, necessitating the development of advanced, scalable, and interpretable security frameworks. This paper presents an innovative, comprehensive framework for real-time IoT attack detection and response that leverages Machine Learning (ML), Explainable AI (XAI), and Large Language Models (LLM). By integrating XAI techniques such as SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) with a model-independent architecture, we ensure our framework's adaptability across various ML algorithms. Additionally, the incorporation of LLMs enhances the interpretability and accessibility of detection decisions, providing system administrators with actionable, human-understandable explanations of detected threats. Our end-to-end framework not only facilitates a seamless transition from model development to deployment but also represents a real-world application capability that is often lacking in existing research. Based on our experiments with the CIC-IOT-2023 dataset \cite{neto2023ciciot2023}, Gemini and OPENAI LLMS demonstrate unique strengths in attack mitigation: Gemini offers precise, focused strategies, while OPENAI provides extensive, in-depth security measures. Incorporating SHAP and LIME algorithms within XAI provides comprehensive insights into attack detection, emphasizing opportunities for model improvement through detailed feature analysis, fine-tuning, and the adaptation of misclassifications to enhance accuracy.
△ Less
Submitted 19 September, 2024;
originally announced September 2024.
-
MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning
Authors:
Justin Chih-Yao Chen,
Archiki Prasad,
Swarnadeep Saha,
Elias Stengel-Eskin,
Mohit Bansal
Abstract:
Large Language Models' (LLM) reasoning can be improved using test-time aggregation strategies, i.e., generating multiple samples and voting among generated samples. While these improve performance, they often reach a saturation point. Refinement offers an alternative by using LLM-generated feedback to improve solution quality. However, refinement introduces 3 key challenges: (1) Excessive refineme…
▽ More
Large Language Models' (LLM) reasoning can be improved using test-time aggregation strategies, i.e., generating multiple samples and voting among generated samples. While these improve performance, they often reach a saturation point. Refinement offers an alternative by using LLM-generated feedback to improve solution quality. However, refinement introduces 3 key challenges: (1) Excessive refinement: Uniformly refining all instances can over-correct and reduce the overall performance. (2) Inability to localize and address errors: LLMs have a limited ability to self-correct and struggle to identify and correct their own mistakes. (3) Insufficient refinement: Deciding how many iterations of refinement are needed is non-trivial, and stopping too soon could leave errors unaddressed. To tackle these issues, we propose MAgICoRe, which avoids excessive refinement by categorizing problem difficulty as easy or hard, solving easy problems with coarse-grained aggregation and hard ones with fine-grained and iterative multi-agent refinement. To improve error localization, we incorporate external step-wise reward model (RM) scores. Moreover, to ensure effective refinement, we employ a multi-agent loop with three agents: Solver, Reviewer (which generates targeted feedback based on step-wise RM scores), and the Refiner (which incorporates feedback). To ensure sufficient refinement, we re-evaluate updated solutions, iteratively initiating further rounds of refinement. We evaluate MAgICoRe on Llama-3-8B and GPT-3.5 and show its effectiveness across 5 math datasets. Even one iteration of MAgICoRe beats Self-Consistency by 3.4%, Best-of-k by 3.2%, and Self-Refine by 4.0% while using less than half the samples. Unlike iterative refinement with baselines, MAgICoRe continues to improve with more iterations. Finally, our ablations highlight the importance of MAgICoRe's RMs and multi-agent communication.
△ Less
Submitted 18 September, 2024;
originally announced September 2024.
-
Investigating baryon-strangeness and charge-strangeness correlations in Pb$-$Pb collisions at $\sqrt{s_\mathrm{NN}}$ = 5.02 TeV with ALICE
Authors:
Swati Saha
Abstract:
To explore the quantum chromodynamics (QCD) phase transitions and the properties of quark$-$gluon plasma, the ALICE collaboration at CERN has conducted an extensive analysis of the correlations among net-conserved quantities, namely net-baryon, net-charge, and net-strangeness. These correlations are essential for understanding the QCD phase structure, as they are directly connected to ratios of th…
▽ More
To explore the quantum chromodynamics (QCD) phase transitions and the properties of quark$-$gluon plasma, the ALICE collaboration at CERN has conducted an extensive analysis of the correlations among net-conserved quantities, namely net-baryon, net-charge, and net-strangeness. These correlations are essential for understanding the QCD phase structure, as they are directly connected to ratios of thermodynamic susceptibilities calculated in lattice QCD. This analysis focuses on the correlations between net-kaon and net-proton, as well as net-kaon and net-charge, in Pb$-$Pb collisions at $\sqrt{s_\mathrm{NN}} = 5.02$ TeV, where net-proton and net-kaon serve as effective proxies for net-baryon and net-strangeness, respectively. A comparison with theoretical predictions from the Thermal-FIST model sheds light on the role of resonance decays and the effects of charge conservation laws in shaping these correlations. Furthermore, the measurements show sensitivity to the correlation volume in which these conservation laws are applied, underscoring the importance of modeling the underlying dynamics to fully understand the experimental results on fluctuations and correlations in heavy-ion collisions.
△ Less
Submitted 18 September, 2024; v1 submitted 17 September, 2024;
originally announced September 2024.
-
On interactive anisotropic walks in two dimensions generated from a three state opinion dynamics model
Authors:
Surajit Saha,
Parongama Sen
Abstract:
A system of interacting walkers is considered in a two-dimensional hypothetical space, where the dynamics of each walker are governed by the opinion states of the agents of a fully connected three-state opinion dynamics model. Such walks, studied in different models of statistical physics, are usually considered in one-dimensional virtual spaces. Here, the mapping is done in such a way that the wa…
▽ More
A system of interacting walkers is considered in a two-dimensional hypothetical space, where the dynamics of each walker are governed by the opinion states of the agents of a fully connected three-state opinion dynamics model. Such walks, studied in different models of statistical physics, are usually considered in one-dimensional virtual spaces. Here, the mapping is done in such a way that the walk is directed along the Y-axis while it can move either way along the X-axis. The walk shows that there are three distinct regions as the noise parameter, responsible for driving a continuous phase transition in the model, is varied. In absence of any noise, the scaling properties and the form of the distribution along either axis do not follow any conventional form. For any finite noise below the critical point the bivariate distribution of the displacements is found to be a modified biased Gaussian function while above it, only the marginal distribution along one direction is Gaussian. The marginal probability distributions can be extracted and the scaling forms of different quantities, showing power law behaviour, are obtained. The directed nature of the walk is reflected in the marginal distributions as well as in the exponents.
△ Less
Submitted 28 April, 2025; v1 submitted 16 September, 2024;
originally announced September 2024.
-
Exploring and Visualizing COVID-19 Trends in India: Vulnerabilities and Mitigation Strategies
Authors:
Swayamjit Saha,
Kuntal Ghosh,
Garga Chatterjee,
J. Edward Swan II
Abstract:
Visualizing data plays a pivotal role in portraying important scientific information. Hence, visualization techniques aid in displaying relevant graphical interpretations from the varied structures of data, which is found otherwise. In this paper, we explore the COVID-19 pandemic influence trends in the subcontinent of India in the context of how far the infection rate spiked in the year 2020 and…
▽ More
Visualizing data plays a pivotal role in portraying important scientific information. Hence, visualization techniques aid in displaying relevant graphical interpretations from the varied structures of data, which is found otherwise. In this paper, we explore the COVID-19 pandemic influence trends in the subcontinent of India in the context of how far the infection rate spiked in the year 2020 and how the public health division of the country India has helped to curb the spread of the novel virus by installing vaccination centers across the diaspora of the country. The paper contributes to the empirical study of understanding the impact caused by the novel virus to the country by doing extensive explanatory data analysis of the data collected from the official government portal. Our work contributes to the understanding that data visualization is prime in understanding public health problems and beyond and taking necessary measures to curb the existing pandemic.
△ Less
Submitted 25 August, 2024;
originally announced September 2024.
-
Cosmic topology. Part Ic. Limits on lens spaces from circle searches
Authors:
Samanta Saha,
Craig J. Copi,
Glenn D. Starkman,
Stefano Anselmi,
Javier Carrón Duque,
Mikel Martin Barandiaran,
Yashar Akrami,
Fernando Cornet-Gomez,
Andrew H. Jaffe,
Arthur Kosowsky,
Deyan P. Mihaylov,
Thiago S. Pereira,
Amirhossein Samandar,
Andrius Tamosiunas
Abstract:
Cosmic microwave background (CMB) temperature and polarization observations indicate that in the best-fit $Λ$ Cold Dark Matter model of the Universe, the local geometry is consistent with at most a small amount of positive or negative curvature, i.e., $\vertΩ_K\vert\ll1$. However, whether the geometry is flat ($E^3$), positively curved ($S^3$) or negatively curved ($H^3$), there are many possible…
▽ More
Cosmic microwave background (CMB) temperature and polarization observations indicate that in the best-fit $Λ$ Cold Dark Matter model of the Universe, the local geometry is consistent with at most a small amount of positive or negative curvature, i.e., $\vertΩ_K\vert\ll1$. However, whether the geometry is flat ($E^3$), positively curved ($S^3$) or negatively curved ($H^3$), there are many possible topologies. Among the topologies of $S^3$ geometry, the lens spaces $L(p,q)$, where $p$ and $q$ ($p>1$ and $0<q<p$) are positive integers, are quotients of the covering space of $S^3$ (the three-sphere) by ${\mathbb{Z}}_p$, the cyclic group of order $p$. We use the absence of any pair of circles on the CMB sky with matching patterns of temperature fluctuations to establish constraints on $p$ and $q$ as a function of the curvature scale that are considerably stronger than those previously asserted for most values of $p$ and $q$. The smaller the value of $\vertΩ_K\vert$, i.e., the larger the curvature radius, the larger the maximum allowed value of $p$. For example, if $\vertΩ_K\vert\simeq 0.05$ then $p\leq 9 $, while if $\vertΩ_K\vert\simeq 0.02$, $p$ can be as high as 24. Future work will extend these constraints to a wider set of $S^{3}$ topologies.
△ Less
Submitted 27 March, 2025; v1 submitted 3 September, 2024;
originally announced September 2024.
-
Signatures of topology in generic transport measurements for Rarita-Schwinger-Weyl semimetals
Authors:
Ipsita Mandal,
Shreya Saha,
Rahul Ghosh
Abstract:
We investigate how the signatures of the topological properties of the bandstructures for nodal-point semimetals are embedded in the response coefficients, arising in two distinct experimental set-ups, by taking the Rarita-Schwinger-Weyl (RSW) semimetal as an example. The first scenario involves the computation of third-rank tensors representing second-order response coefficients, relating the cha…
▽ More
We investigate how the signatures of the topological properties of the bandstructures for nodal-point semimetals are embedded in the response coefficients, arising in two distinct experimental set-ups, by taking the Rarita-Schwinger-Weyl (RSW) semimetal as an example. The first scenario involves the computation of third-rank tensors representing second-order response coefficients, relating the charge/thermal current densities to the combined effects of the gradient of the chemical potential and an external electric field/temperature gradient. On the premises that internode scatterings can be ignored, the relaxation-time approximation leads to a quantized value for the nonvanishing components of each of these nonlinear response tensors, characterizing a single untilted RSW node. Furthermore, the final expressions turn out to be insensitive to the specific values of the chemical potential and the temperature. The second scenario involves computing the magnetoelectric conductivity under the action of collinear electric ($\mathbf E$) and magnetic ($\mathbf B$) fields, representing a planar Hall set-up. In particular, our focus is in bringing out the dependence of the linear-in-$|\mathbf B|$ parts of the conductivity tensor on the intrinsic topological properties of the bandstructure, which are nonvanishing only in the presence of a nonzero tilt in the energy spectrum.
△ Less
Submitted 27 December, 2024; v1 submitted 30 August, 2024;
originally announced August 2024.
-
Emergent scalar-chirality \& colossal transverse-magnetoresponse in strongly correlated nodal-line half-metal
Authors:
Jyotirmoy Sau,
Sourav Chakraborty,
Sourabh Saha,
Kalpataru Pradhan,
Anamitra Mukherjee,
Manoranjan Kumar
Abstract:
Understanding the interplay of strong correlation and temperature in nodal-line semimetals can offer novel ways to control spin currents. Here we consider the 3d-5d double-perovskite Ba$_{2}$CoWO$_{6}$, which features mirror-symmetry-protected nodal-lines, strong Co-site interactions, and spin-orbit coupling (SOC) at W sites. Our first principles and exact diagonalization results reveal a half-met…
▽ More
Understanding the interplay of strong correlation and temperature in nodal-line semimetals can offer novel ways to control spin currents. Here we consider the 3d-5d double-perovskite Ba$_{2}$CoWO$_{6}$, which features mirror-symmetry-protected nodal-lines, strong Co-site interactions, and spin-orbit coupling (SOC) at W sites. Our first principles and exact diagonalization results reveal a half-metallic ground state with high-spin Co and topologically non-trivial bands. We demonstrate that SOC gaps out nodal points, causes band-inversion and generates anomalous Hall response. A semi-classical Monte Carlo finite-temperature simulation of five-orbital Hubbard model uncovers an emergent Co-spin scalar chirality and colossal positive transverse-magnetoresponse. We predict the temperature and magnetic field scales for the tunability of scalar-chirality and magnetoresponse.
△ Less
Submitted 28 August, 2024;
originally announced August 2024.
-
Nonzero-sum Discrete-time Stochastic Games with Risk-sensitive Ergodic Cost Criterion
Authors:
Bivakar Bose,
Chandan Pal,
Somnath Pradhan,
Subhamay Saha
Abstract:
In this paper we study infinite horizon nonzero-sum stochastic games for controlled discrete-time Markov chains on a Polish state space with risk-sensitive ergodic cost criterion. Under suitable assumptions we show that the associated ergodic optimality equations admit unique solutions. Finally, the existence of Nash-equilibrium in randomized stationary strategies is established by showing that an…
▽ More
In this paper we study infinite horizon nonzero-sum stochastic games for controlled discrete-time Markov chains on a Polish state space with risk-sensitive ergodic cost criterion. Under suitable assumptions we show that the associated ergodic optimality equations admit unique solutions. Finally, the existence of Nash-equilibrium in randomized stationary strategies is established by showing that an appropriate set-valued map has a fixed point.
△ Less
Submitted 23 August, 2024;
originally announced August 2024.
-
DUNE Phase II: Scientific Opportunities, Detector Concepts, Technological Solutions
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
H. Amar,
P. Amedo,
J. Anderson,
C. Andreopoulos,
M. Andreotti
, et al. (1347 additional authors not shown)
Abstract:
The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy toward the implementation of this leading-edge, large-scale science project. The 2023 report of the US Particle Physics Project Prioritization Panel (P5) reaffirmed this vision and strongly endorsed DUNE Phase I…
▽ More
The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy toward the implementation of this leading-edge, large-scale science project. The 2023 report of the US Particle Physics Project Prioritization Panel (P5) reaffirmed this vision and strongly endorsed DUNE Phase I and Phase II, as did the European Strategy for Particle Physics. While the construction of the DUNE Phase I is well underway, this White Paper focuses on DUNE Phase II planning. DUNE Phase-II consists of a third and fourth far detector (FD) module, an upgraded near detector complex, and an enhanced 2.1 MW beam. The fourth FD module is conceived as a "Module of Opportunity", aimed at expanding the physics opportunities, in addition to supporting the core DUNE science program, with more advanced technologies. This document highlights the increased science opportunities offered by the DUNE Phase II near and far detectors, including long-baseline neutrino oscillation physics, neutrino astrophysics, and physics beyond the standard model. It describes the DUNE Phase II near and far detector technologies and detector design concepts that are currently under consideration. A summary of key R&D goals and prototyping phases needed to realize the Phase II detector technical designs is also provided. DUNE's Phase II detectors, along with the increased beam power, will complete the full scope of DUNE, enabling a multi-decadal program of groundbreaking science with neutrinos.
△ Less
Submitted 22 August, 2024;
originally announced August 2024.
-
Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition -- And Ways to Overcome Them
Authors:
Harish Haresamudram,
Apoorva Beedu,
Mashfiqui Rabbi,
Sankalita Saha,
Irfan Essa,
Thomas Ploetz
Abstract:
Cross-modal contrastive pre-training between natural language and other modalities, e.g., vision and audio, has demonstrated astonishing performance and effectiveness across a diverse variety of tasks and domains. In this paper, we investigate whether such natural language supervision can be used for wearable sensor based Human Activity Recognition (HAR), and discover that-surprisingly-it performs…
▽ More
Cross-modal contrastive pre-training between natural language and other modalities, e.g., vision and audio, has demonstrated astonishing performance and effectiveness across a diverse variety of tasks and domains. In this paper, we investigate whether such natural language supervision can be used for wearable sensor based Human Activity Recognition (HAR), and discover that-surprisingly-it performs substantially worse than standard end-to-end training and self-supervision. We identify the primary causes for this as: sensor heterogeneity and the lack of rich, diverse text descriptions of activities. To mitigate their impact, we also develop strategies and assess their effectiveness through an extensive experimental evaluation. These strategies lead to significant increases in activity recognition, bringing performance closer to supervised and self-supervised training, while also enabling the recognition of unseen activities and cross modal retrieval of videos. Overall, our work paves the way for better sensor-language learning, ultimately leading to the development of foundational models for HAR using wearables.
△ Less
Submitted 21 August, 2024;
originally announced August 2024.
-
MIMA 2.0 -- Compact and Portable Multifunctional IoT integrated Menstrual Aid
Authors:
Kumar J. Jyothish,
Shreya Shivangi,
Amish Bibhu,
Subhankar Mishra,
Sulagna Saha
Abstract:
The shredding intrauterine lining or the endometrium is known as Menstruation. It occurs every month and causes several issues like Menstrual Cramps and aches in the abdominal region, stains, menstrual malodor, rashes in intimate areas, and many more. In our research, almost all of the products available in the market do not cater to these problems single-handedly. There are few remedies available…
▽ More
The shredding intrauterine lining or the endometrium is known as Menstruation. It occurs every month and causes several issues like Menstrual Cramps and aches in the abdominal region, stains, menstrual malodor, rashes in intimate areas, and many more. In our research, almost all of the products available in the market do not cater to these problems single-handedly. There are few remedies available to cater to the cramps, among which heat therapy is the most commonly used. Our methodology, involved surveys regarding problems and the solutions to these problems that are deemed optimal. This inclusive approach helped us infer about the gaps in available menstrual aids which has become our guide towards developing MIMA (Multifunctional IoT Integrated Menstrual Aid). In this paper, we have featured an IOT incorporated multifunctional smart intimate wear that aims to provide for the multiple necessities of women during menstruation like leakproof, antibacterial, anti-odor, rash-free experience along with an integrated Bluetooth-controlled intimate heat-pad for relieving abdominal cramps. The entire process of product development has been done in phases according to feedback from target users in each stage. This paper is an extension to our paper [1] which serves as the proof of concept for our approach. The development has led us towards MIMA 2.0 featuring a completely concealed and integrated design that includes a safe Bluetooth-controlled heating system for the intimate area. The product has received incredibly positive feedback from survey participants.
△ Less
Submitted 3 August, 2024;
originally announced August 2024.
-
Radiation Resilience of $β$-Ga$_2$O$_3$ Schottky Barrier Diodes Under High Dose Gamma Radiation
Authors:
Saleh Ahmed Khan,
Sudipto Saha,
Uttam Singisetti,
A F M Anhar Uddin Bhuiyan
Abstract:
A systematic investigation of the electrical characteristics of \b{eta}-Ga2O3 Schottky barrier diodes (SBDs) has been conducted under high-dose 60Co gamma radiation, with total cumulative doses reaching up to 5 Mrad (Si). Initial exposure of the diodes to 1 Mrad resulted in a significant decrease in on-current and an increase in on-resistance compared to the pre-radiation condition, likely due to…
▽ More
A systematic investigation of the electrical characteristics of \b{eta}-Ga2O3 Schottky barrier diodes (SBDs) has been conducted under high-dose 60Co gamma radiation, with total cumulative doses reaching up to 5 Mrad (Si). Initial exposure of the diodes to 1 Mrad resulted in a significant decrease in on-current and an increase in on-resistance compared to the pre-radiation condition, likely due to the generation of radiation-induced deep-level acceptor traps. However, upon exposure to higher gamma radiation doses of 3 and 5 Mrad, partial recovery of the device performance occurred, attributed to a radiation annealing effect. The capacitance-voltage (C-V) characterization revealed that the net carrier concentration in the $β$-Ga$_2$O$_3$ drift layer reduced from $\sim$3.19 $\times$ 10$^{16}$ cm$^{-3}$ to $\sim$3.05 $\times$ 10$^{16}$ cm$^{-3}$ after 5 Mrad (Si) irradiation. Temperature-dependent I-V characteristics showed that irradiation leads to a reduction in both forward and reverse current across all investigated temperatures ranging from 25 to 250$^\circ$C, accompanied by slight increases in on-resistance, ideality factors, and Schottky barrier heights. The reverse breakdown characteristics of the $β$-Ga$_2$O$_3$ SBDs showed a slight increase of the breakdown voltage after radiation. Overall, $β$-Ga$_2$O$_3$ Schottky diode exhibits high resilience to gamma irradiation, with performance degradation mitigated by radiation-induced self-recovery, highlighting its potential for radiation-hardened electronic applications in extreme environments.
△ Less
Submitted 11 January, 2025; v1 submitted 20 August, 2024;
originally announced August 2024.
-
Probing right-handed neutrinos via tri-lepton signals at the HL-LHC
Authors:
Manimala Mitra,
Subham Saha,
Michael Spannowsky,
Michihisa Takeuchi
Abstract:
Neutrino oscillation experiments have provided direct evidence for the existence of neutrino masses. The seesaw mechanism explains the smallness of these masses through the introduction of heavy right-handed neutrino (RHN) states. The RHN states can aslo generate Dirac neutrino masses at tree or loop level. These heavy states can exist at the electroweak scale, approximately in the…
▽ More
Neutrino oscillation experiments have provided direct evidence for the existence of neutrino masses. The seesaw mechanism explains the smallness of these masses through the introduction of heavy right-handed neutrino (RHN) states. The RHN states can aslo generate Dirac neutrino masses at tree or loop level. These heavy states can exist at the electroweak scale, approximately in the $\mathcal{O}(\mathrm{GeV})$ range, and can be investigated through current and future collider experiments. This scenario, where other new physics interactions occur at scales much higher than the RHN scale, can be described using an effective field theory (EFT) framework known as $N_R$-EFT. This study focuses on constraining the Wilson coefficients of $N_R$-EFT operators, which primarily contribute to tri-lepton production and missing energy signals at the LHC. We examine both the scenarios where the RHN mass $M_N$ is less than and greater than the $W$ boson mass $M_W$, and provide predictions for the High-Luminosity run of the LHC (HL-LHC).
△ Less
Submitted 17 September, 2024; v1 submitted 16 August, 2024;
originally announced August 2024.
-
Segment Using Just One Example
Authors:
Pratik Vora,
Sudipan Saha
Abstract:
Semantic segmentation is an important topic in computer vision with many relevant application in Earth observation. While supervised methods exist, the constraints of limited annotated data has encouraged development of unsupervised approaches. However, existing unsupervised methods resemble clustering and cannot be directly mapped to explicit target classes. In this paper, we deal with single sho…
▽ More
Semantic segmentation is an important topic in computer vision with many relevant application in Earth observation. While supervised methods exist, the constraints of limited annotated data has encouraged development of unsupervised approaches. However, existing unsupervised methods resemble clustering and cannot be directly mapped to explicit target classes. In this paper, we deal with single shot semantic segmentation, where one example for the target class is provided, which is used to segment the target class from query/test images. Our approach exploits recently popular Segment Anything (SAM), a promptable foundation model. We specifically design several techniques to automatically generate prompts from the only example/key image in such a way that the segmentation is successfully achieved on a stitch or concatenation of the example/key and query/test images. Proposed technique does not involve any training phase and just requires one example image to grasp the concept. Furthermore, no text-based prompt is required for the proposed method. We evaluated the proposed techniques on building and car classes.
△ Less
Submitted 14 August, 2024;
originally announced August 2024.
-
Specialized Change Detection using Segment Anything
Authors:
Tahir Ahmad,
Sudipan Saha
Abstract:
Change detection (CD) is a fundamental task in Earth observation. While most change detection methods detect all changes, there is a growing need for specialized methods targeting specific changes relevant to particular applications while discarding the other changes. For instance, urban management might prioritize detecting the disappearance of buildings due to natural disasters or other reasons.…
▽ More
Change detection (CD) is a fundamental task in Earth observation. While most change detection methods detect all changes, there is a growing need for specialized methods targeting specific changes relevant to particular applications while discarding the other changes. For instance, urban management might prioritize detecting the disappearance of buildings due to natural disasters or other reasons. Furthermore, while most supervised change detection methods require large-scale training datasets, in many applications only one or two training examples might be available instead of large datasets. Addressing such needs, we propose a focused CD approach using the Segment Anything Model (SAM), a versatile vision foundation model. Our method leverages a binary mask of the object of interest in pre-change images to detect their disappearance in post-change images. By using SAM's robust segmentation capabilities, we create prompts from the pre-change mask, use those prompts to segment the post-change image, and identify missing objects. This unsupervised approach demonstrated for building disappearance detection, is adaptable to various domains requiring specialized CD. Our contributions include defining a novel CD problem, proposing a method using SAM, and demonstrating its effectiveness. The proposed method also has benefits related to privacy preservation.
△ Less
Submitted 13 August, 2024;
originally announced August 2024.
-
Enhanced stability and chaotic condensates in multi-species non-reciprocal mixtures
Authors:
Laya Parkavousi,
Navdeep Rana,
Ramin Golestanian,
Suropriya Saha
Abstract:
Random non-reciprocal interactions between a large number of conserved densities are shown to enhance the stability of the system towards pattern formation. The enhanced stability is an exact result when the number of species approaches infinity and is confirmed numerically by simulations of the multi-species non-reciprocal Cahn-Hilliard model. Furthermore, the diversity in dynamical patterns incr…
▽ More
Random non-reciprocal interactions between a large number of conserved densities are shown to enhance the stability of the system towards pattern formation. The enhanced stability is an exact result when the number of species approaches infinity and is confirmed numerically by simulations of the multi-species non-reciprocal Cahn-Hilliard model. Furthermore, the diversity in dynamical patterns increases with increasing number of components and novel steady states such as pulsating or spatiotemporally chaotic condensates are observed. Our results may help to unravel the mechanisms by which living systems self-organise via metabolism.
△ Less
Submitted 13 August, 2024; v1 submitted 12 August, 2024;
originally announced August 2024.
-
Cluster-Segregate-Perturb (CSP): A Model-agnostic Explainability Pipeline for Spatiotemporal Land Surface Forecasting Models
Authors:
Tushar Verma,
Sudipan Saha
Abstract:
Satellite images have become increasingly valuable for modelling regional climate change effects. Earth surface forecasting represents one such task that integrates satellite images with meteorological data to capture the joint evolution of regional climate change effects. However, understanding the complex relationship between specific meteorological variables and land surface evolution poses a s…
▽ More
Satellite images have become increasingly valuable for modelling regional climate change effects. Earth surface forecasting represents one such task that integrates satellite images with meteorological data to capture the joint evolution of regional climate change effects. However, understanding the complex relationship between specific meteorological variables and land surface evolution poses a significant challenge. In light of this challenge, our paper introduces a pipeline that integrates principles from both perturbation-based explainability techniques like LIME and global marginal explainability techniques like PDP, besides addressing the constraints of using such techniques when applying them to high-dimensional spatiotemporal deep models. The proposed pipeline simplifies the undertaking of diverse investigative analyses, such as marginal sensitivity analysis, marginal correlation analysis, lag analysis, etc., on complex land surface forecasting models In this study we utilised Convolutional Long Short-Term Memory (ConvLSTM) as the surface forecasting model and did analyses on the Normalized Difference Vegetation Index (NDVI) of the surface forecasts, since meteorological variables like temperature, pressure, and precipitation significantly influence it. The study area encompasses various regions in Europe. Our analyses show that precipitation exhibits the highest sensitivity in the study area, followed by temperature and pressure. Pressure has little to no direct effect on NDVI. Additionally, interesting nonlinear correlations between meteorological variables and NDVI have been uncovered.
△ Less
Submitted 12 August, 2024;
originally announced August 2024.
-
Benchmark Computations of Nearly Degenerate Singlet and Triplet states of N-heterocyclic Chromophores : II. Density-based Methods
Authors:
Shamik Chanda,
Subhasish Saha,
Sangita Sen
Abstract:
In this paper we demonstrate the performance of several density-based methods in predicting the inversion of S$_1$ and T$_1$ states of a few N-heterocyclic fused ring molecules (popularly known as INVEST molecules) with an eye to identify a well performing but cheap preliminary screening method. Both conventional LR-TDDFT and $Δ$SCF methods (namely MOM, SGM, ROKS) are considered for excited state…
▽ More
In this paper we demonstrate the performance of several density-based methods in predicting the inversion of S$_1$ and T$_1$ states of a few N-heterocyclic fused ring molecules (popularly known as INVEST molecules) with an eye to identify a well performing but cheap preliminary screening method. Both conventional LR-TDDFT and $Δ$SCF methods (namely MOM, SGM, ROKS) are considered for excited state computations using exchange-correlation (XC) functionals from different rungs of the Jacob's ladder. A well-justified systematism is observed in the performance of the functionals when compared against FICMRCISD and/or EOM-CCSD, with the most important feature being the capture of spin-polarization in presence of correlation. A set of functionals with the least mean absolute error (MAE) is proposed for both the approaches, LR-TDDFT and $Δ$SCF, which can be cheaper alternatives for computations on synthesizable larger derivatives of the templates studied here. We have based our findings on extensive studies of three cyclazine-based molecular templates, with additional studies on a set of six related templates. Previous benchmark studies for subsets of the functionals were conducted against the DLPNO-STEOM-CCSD, which resulted in an inadequate evaluation due to deficiencies in the benchmark theory. The role of exact-exchange, spin-contamination and spin-polarization in the context of DFT comes to the forefront in our studies and supports the numerical evaluation of XC functionals for these applications. Suitable connections are drawn to two and three state exciton models which identify the minimal physics governing the interactions in these molecules.
△ Less
Submitted 9 August, 2024;
originally announced August 2024.
-
Implementation of cosmological bounce inflation with Nojiri-Odintsov generalized holographic dark fluid
Authors:
Sanghati Saha,
Surajit Chattopadhyay
Abstract:
The current work reports a study on bounce cosmology with a highly generalized holographic dark fluid inspired by S. Nojiri and S. D. Odintsov, 2017, European Physical Journal C, 77, pp.1-8. The holographic dark fluid that is mostly used for late-time acceleration has been implemented to reconstruct towards realisation of cosmological bounce. We first used the most generalized Nojiri-Odintsov(NO)…
▽ More
The current work reports a study on bounce cosmology with a highly generalized holographic dark fluid inspired by S. Nojiri and S. D. Odintsov, 2017, European Physical Journal C, 77, pp.1-8. The holographic dark fluid that is mostly used for late-time acceleration has been implemented to reconstruct towards realisation of cosmological bounce. We first used the most generalized Nojiri-Odintsov(NO) cutoff to implement the holographic dark fluid. Accordingly, we have reconstructed this dark fluid via some solutions of scale factors. With those solutions, we have explored the evolution of different cosmological parameters. We have examined the effects of each reconstructed parameter in the context of the realization of the cosmic bounce. Next, we use the analytical inferences of the scalar spectral index, tensor-to-scalar ratio, and slow-roll characteristics of the model to study a bounce inflationary scenario. Since inflation is usually associated with the existence of scalar fields, we looked at a possible relationship between NO generalized holographic dark energy and scalar field models. Plotting the evolution of the potential that results from the scalar fields against time. Finally, we investigated the GSL of thermodynamics in the pre-and post-bounce scenarios.
△ Less
Submitted 7 August, 2024;
originally announced August 2024.
-
TOI-2490b- The most eccentric brown dwarf transiting in the brown dwarf desert
Authors:
Beth A. Henderson,
Sarah L. Casewell,
Andrés Jordán,
Rafael Brahm,
Thomas Henning,
Samuel Gill,
L. C. Mayorga,
Carl Ziegler,
Keivan G. Stassun,
Michael R. Goad,
Jack Acton,
Douglas R. Alves,
David R. Anderson,
Ioannis Apergis,
David J. Armstrong,
Daniel Bayliss,
Matthew R. Burleigh,
Diana Dragomir,
Edward Gillen,
Maximilian N. Günther,
Christina Hedges,
Katharine M. Hesse,
Melissa J. Hobson,
James S. Jenkins,
Jon M. Jenkins
, et al. (18 additional authors not shown)
Abstract:
We report the discovery of the most eccentric transiting brown dwarf in the brown dwarf desert, TOI02490b. The brown dwarf desert is the lack of brown dwarfs around main sequence stars within $\sim3$~AU and is thought to be caused by differences in formation mechanisms between a star and planet. To date, only $\sim40$ transiting brown dwarfs have been confirmed. \systemt is a $73.6\pm2.4$ \mjupnos…
▽ More
We report the discovery of the most eccentric transiting brown dwarf in the brown dwarf desert, TOI02490b. The brown dwarf desert is the lack of brown dwarfs around main sequence stars within $\sim3$~AU and is thought to be caused by differences in formation mechanisms between a star and planet. To date, only $\sim40$ transiting brown dwarfs have been confirmed. \systemt is a $73.6\pm2.4$ \mjupnospace, $1.00\pm0.02$ \rjup brown dwarf orbiting a $1.004_{-0.022}^{+0.031}$ \msunnospace, $1.105_{-0.012}^{+0.012}$ \rsun sun-like star on a 60.33~d orbit with an eccentricity of $0.77989\pm0.00049$. The discovery was detected within \tess sectors 5 (30 minute cadence) and 32 (2 minute and 20 second cadence). It was then confirmed with 31 radial velocity measurements with \feros by the WINE collaboration and photometric observations with the Next Generation Transit Survey. Stellar modelling of the host star estimates an age of $\sim8$~Gyr, which is supported by estimations from kinematics likely placing the object within the thin disc. However, this is not consistent with model brown dwarf isochrones for the system age suggesting an inflated radius. Only one other transiting brown dwarf with an eccentricity higher than 0.6 is currently known in the brown dwarf desert. Demographic studies of brown dwarfs have suggested such high eccentricity is indicative of stellar formation mechanisms.
△ Less
Submitted 8 August, 2024;
originally announced August 2024.
-
The Wood-Saxon proton optical potential for p-nuclei
Authors:
Sukhendu Saha,
Dipali Basak,
Chinmay Basu
Abstract:
A phenomenological mass-energy dependent proton optical model potential has been computed for p-nuclei. The parameters of the Wood-Saxon optical potential are found to be a good fit for proton elastic scattering data involving p-nuclei and elements with mass numbers near p-nuclei (within the range of 74 < A < 148) at energies around the Coulomb barrier of the system. The elastic scattering data we…
▽ More
A phenomenological mass-energy dependent proton optical model potential has been computed for p-nuclei. The parameters of the Wood-Saxon optical potential are found to be a good fit for proton elastic scattering data involving p-nuclei and elements with mass numbers near p-nuclei (within the range of 74 < A < 148) at energies around the Coulomb barrier of the system. The elastic scattering data were meticulously fitted using the SFRESCO code, allowing for the calculation of the real and imaginary parts of the Wood Saxon optical potential. To validate the model, experimental proton capture cross-sections for 106Cd and 113In near the Coulomb barrier were compared with results obtained using the TALYS-1.96 code, showing better agreement than the available global proton optical model potential.
△ Less
Submitted 7 August, 2024;
originally announced August 2024.
-
Exploring magnetic and topological complexity in MgMn$_6$Sn$_6$: from frustrated ground states to nontrivial Hall conductivity
Authors:
Jyotirmoy Sau,
Hrishit Banerjee,
Sourabh Saha,
Nitesh Kumar,
Manoranjan Kumar
Abstract:
We explore the intriguing topological itinerant magnet MgMn$_6$Sn$_6$, characterized by bilayer kagome Mn layers encasing a hexagonal Sn layer. Using \textit{ab initio} Density functional theory and Dynamical mean-field theory calculations, we uncover the complex electronic properties and many-body configuration of its magnetic ground state. Mn d-orbital electrons form a frustrated many-body groun…
▽ More
We explore the intriguing topological itinerant magnet MgMn$_6$Sn$_6$, characterized by bilayer kagome Mn layers encasing a hexagonal Sn layer. Using \textit{ab initio} Density functional theory and Dynamical mean-field theory calculations, we uncover the complex electronic properties and many-body configuration of its magnetic ground state. Mn d-orbital electrons form a frustrated many-body ground state with significant quantum fluctuations, resulting in competing antiferromagnetic and ferromagnetic spin exchanges. Our band dispersion calculations reveal a mirror symmetry-protected nodal line in the \textit{k}$_z$ = 0 plane. When spin-orbit coupling (SOC) is introduced, the gap is formed along the nodal line lifted due to broken time-reversal symmetry with magnetic ordering, leading to substantial intrinsic Berry curvature. We identify Dirac fermions, van Hove singularities, and flat band near the Fermi energy (\textit{E}$_F$), with SOC introducing a finite gap at key points. The unique proximity of the flat band to \textit{E}$_F$ suggests potential instabilities. Spin-orbit coupling opens a 20 meV gap at the quadratic touching point between the Dirac and flat band, bestowing a nonzero Z$_2$ invariant. This leads to a significant spin Hall conductivity. Despite the presence of large incoherent scattering due to electronic interactions, band crossings and flat band features persist at finite temperatures. MgMn$_6$Sn$_6$ exhibits intriguing topological and magnetic properties, with promising applications in spintronics.
△ Less
Submitted 5 August, 2024;
originally announced August 2024.
-
First Measurement of the Total Inelastic Cross-Section of Positively-Charged Kaons on Argon at Energies Between 5.0 and 7.5 GeV
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
H. Amar,
P. Amedo,
J. Anderson,
C. Andreopoulos,
M. Andreotti
, et al. (1341 additional authors not shown)
Abstract:
ProtoDUNE Single-Phase (ProtoDUNE-SP) is a 770-ton liquid argon time projection chamber that operated in a hadron test beam at the CERN Neutrino Platform in 2018. We present a measurement of the total inelastic cross section of charged kaons on argon as a function of kaon energy using 6 and 7 GeV/$c$ beam momentum settings. The flux-weighted average of the extracted inelastic cross section at each…
▽ More
ProtoDUNE Single-Phase (ProtoDUNE-SP) is a 770-ton liquid argon time projection chamber that operated in a hadron test beam at the CERN Neutrino Platform in 2018. We present a measurement of the total inelastic cross section of charged kaons on argon as a function of kaon energy using 6 and 7 GeV/$c$ beam momentum settings. The flux-weighted average of the extracted inelastic cross section at each beam momentum setting was measured to be 380$\pm$26 mbarns for the 6 GeV/$c$ setting and 379$\pm$35 mbarns for the 7 GeV/$c$ setting.
△ Less
Submitted 1 August, 2024;
originally announced August 2024.
-
A Space-Time Knife-Edge In Epsilon-Near-Zero Films for Ultrafast Pulse Characterization
Authors:
Adam Ball,
Ray Secondo,
Dhruv Fomra,
Jingwei Wu,
Samprity Saha,
Amit Agrawal,
Henri Lezec,
Nathaniel Kinsey
Abstract:
Epsilon-near-zero (ENZ) materials have shown strong refractive nonlinearities that can be fast in an absolute sense. While continuing to advance fundamental science, such as time varying interactions, the community is still searching for an application that can effectively make use of the strong index modulation offered. Here we combine the effect of strong space-time index modulation in ENZ mater…
▽ More
Epsilon-near-zero (ENZ) materials have shown strong refractive nonlinearities that can be fast in an absolute sense. While continuing to advance fundamental science, such as time varying interactions, the community is still searching for an application that can effectively make use of the strong index modulation offered. Here we combine the effect of strong space-time index modulation in ENZ materials with the beam deflection technique to introduce a new approach to optical pulse characterization that we term a space-time knife edge. We show that in this approach, we are able to extract temporal and spatial information of a Gaussian beam with only two time resolved measurements. The approach achieves this without phase-matching requirements (<1 micron thick film) and can achieve a high signal to noise ratio by combining the system with lock-in detection, facilitating the measurement of weak refractive index changes (delta_n ~ 10^-5) for low intensity beams. Thus, the space-time knife edge can offer a new avenue for ultrafast light measurement and demonstrates a use cases of ENZ materials. In support of this, we outline temporal dynamics for refractive index changes in non-colinear experiments opening avenues for better theoretical understanding of both the spatial and temporal dynamics of emerging ENZ films.
△ Less
Submitted 1 August, 2024;
originally announced August 2024.
-
Precise Transit Photometry Using TESS II: Revisiting 28 Additional Transiting Systems With Updated Physical Properties
Authors:
Suman Saha
Abstract:
Precise physical properties of the known transiting exoplanets are essential for their precise atmospheric characterization using modern and upcoming instruments. Leveraging the large volume of high SNR photometric follow-up data from TESS, highly precise physical properties can be estimated for these systems, especially for those discovered using ground-based instruments prior to the TESS mission…
▽ More
Precise physical properties of the known transiting exoplanets are essential for their precise atmospheric characterization using modern and upcoming instruments. Leveraging the large volume of high SNR photometric follow-up data from TESS, highly precise physical properties can be estimated for these systems, especially for those discovered using ground-based instruments prior to the TESS mission. In this work, I have used the publicly available TESS follow-up data for 28 transiting systems with 10 $<$ V$_{mag}$ $<$ 10.5, with an aim to update their known physical properties. The observed lightcurves have been analysed by implementing a state-of-the-art critical noise treatment algorithm to effectively reduce both time-correlated and un-correlated noise components, using sophisticated techniques like wavelet denoising and Gaussian-process regression. Compared with the previous studies, the estimated transit parameters are found to be more precise for most of the targets, including a few cases where a larger space-based instrument like Spitzer, Kepler or CHEOPS has been used in the previous study. The large volume of transit observations used for each target has also resulted in a more accurate estimation of the physical properties, as this overcomes any error in parameter estimations from bias present in a smaller volume of data. Thus, comparing with the literature values, statistically significant improvements in the known physical properties of several targeted systems have been reported from this work. The large volume of transit timing information from the analyses was also used to search for Transit Timing Variation trends in these targets, which has resulted in no significant detection.
△ Less
Submitted 30 July, 2024;
originally announced July 2024.
-
Quasi-classical Trajectory Calculations on a Two-state Potential Energy Surface Including Nonadiabatic Coupling Terms as Friction for D+ + H2 Collisions
Authors:
Soumya Mukherjee,
Swagato Saha,
Sandip Ghosh,
Satrajit Adhikari,
Narayanasami Sathyamurthy,
Michael Baer
Abstract:
Akin to the traditional quasi-classical trajectory method for investigating the dynamics on a single adiabatic potential energy surface for an elementary chemical reaction, we carry out the dynamics on a 2-state ab initio potential energy surface including nonadiabatic coupling terms as friction terms for D+ + H2 collisions. It is shown that the resulting dynamics correctly accounts for nonreactiv…
▽ More
Akin to the traditional quasi-classical trajectory method for investigating the dynamics on a single adiabatic potential energy surface for an elementary chemical reaction, we carry out the dynamics on a 2-state ab initio potential energy surface including nonadiabatic coupling terms as friction terms for D+ + H2 collisions. It is shown that the resulting dynamics correctly accounts for nonreactive charge transfer, reactive non charge transfer and reactive charge transfer processes. In addition, it leads to the formation of triatomic DH2+ species as well.
△ Less
Submitted 22 July, 2024;
originally announced July 2024.
-
Two eyes, Two views, and finally, One summary! Towards Multi-modal Multi-tasking Knowledge-Infused Medical Dialogue Summarization
Authors:
Anisha Saha,
Abhisek Tiwari,
Sai Ruthvik,
Sriparna Saha
Abstract:
We often summarize a multi-party conversation in two stages: chunking with homogeneous units and summarizing the chunks. Thus, we hypothesize that there exists a correlation between homogeneous speaker chunking and overall summarization tasks. In this work, we investigate the effectiveness of a multi-faceted approach that simultaneously produces summaries of medical concerns, doctor impressions, a…
▽ More
We often summarize a multi-party conversation in two stages: chunking with homogeneous units and summarizing the chunks. Thus, we hypothesize that there exists a correlation between homogeneous speaker chunking and overall summarization tasks. In this work, we investigate the effectiveness of a multi-faceted approach that simultaneously produces summaries of medical concerns, doctor impressions, and an overall view. We introduce a multi-modal, multi-tasking, knowledge-infused medical dialogue summary generation (MMK-Summation) model, which is incorporated with adapter-based fine-tuning through a gated mechanism for multi-modal information integration. The model, MMK-Summation, takes dialogues as input, extracts pertinent external knowledge based on the context, integrates the knowledge and visual cues from the dialogues into the textual content, and ultimately generates concise summaries encompassing medical concerns, doctor impressions, and a comprehensive overview. The introduced model surpasses multiple baselines and traditional summarization models across all evaluation metrics (including human evaluation), which firmly demonstrates the efficacy of the knowledge-guided multi-tasking, multimodal medical conversation summarization. The code is available at https://github.com/NLP-RL/MMK-Summation.
△ Less
Submitted 21 July, 2024;
originally announced July 2024.
-
Ferrimagnetic hexagonal Mn$_2$CuGe Heusler alloy with a low-temperature spin-glass state
Authors:
Abhinav Kumar Khorwal,
Sonu Vishvakarma,
Sujoy Saha,
Debashish Patra,
Akriti Singh,
Surajit Saha,
V. Srinivas,
Ajit K. Patra
Abstract:
An extensive experimental investigation on the structural, static magnetic, and non-equilibrium dynamical properties of polycrystalline Mn$_2$CuGe Heusler alloy using powder X-ray diffraction, DC magnetization, magnetic relaxation, magnetic memory effect, and specific heat measurements is presented. Structural studies reveal that the alloy crystallizes in a mixed hexagonal crystal structure (space…
▽ More
An extensive experimental investigation on the structural, static magnetic, and non-equilibrium dynamical properties of polycrystalline Mn$_2$CuGe Heusler alloy using powder X-ray diffraction, DC magnetization, magnetic relaxation, magnetic memory effect, and specific heat measurements is presented. Structural studies reveal that the alloy crystallizes in a mixed hexagonal crystal structure (space groups P3c1 (no. 158) and P6$_3$/mmc (no. 194)) with lattice parameters a = b = 7.18(4) $\mathring{A}$ and c = 13.12(4) $\mathring{A}$ for the majority phase. The DC magnetization analysis reveals a paramagnetic to ferrimagnetic phase transition around T$_C$ $\approx$ 682 K with a compensation of magnetization at $\approx$ 250 K, and a spin-glass transition around T$_P$ $\approx$ 25.6 K. The Néel theory of ferrimagnets supports the ferrimagnetic nature of the studied alloy and the estimated T$_C$ ($\approx$ 687 K) from this theory is consistent with that obtained from the DC magnetization data. A detailed study of non-equilibrium spin dynamics via magnetic relaxation and memory effect experiments shows the evolution of the system through a number of intermediate states and striking magnetic memory effect. Furthermore, heat capacity measurements suggest a large electronic contribution to the specific heat capacity suggesting strong spin fluctuations, due to competing magnetic interactions. All the observations render a spin-glass behavior in Mn$_2$CuGe, attributed to the magnetic frustration possibly arising out of the competing ferromagnetic and antiferromagnetic interactions.
△ Less
Submitted 20 July, 2024;
originally announced July 2024.
-
System-1.x: Learning to Balance Fast and Slow Planning with Language Models
Authors:
Swarnadeep Saha,
Archiki Prasad,
Justin Chih-Yao Chen,
Peter Hase,
Elias Stengel-Eskin,
Mohit Bansal
Abstract:
Language models can be used to solve long-horizon planning problems in two distinct modes: a fast 'System-1' mode, directly generating plans without any explicit search or backtracking, and a slow 'System-2' mode, planning step-by-step by explicitly searching over possible actions. While System-2 is typically more effective, it is also more computationally expensive, making it infeasible for long…
▽ More
Language models can be used to solve long-horizon planning problems in two distinct modes: a fast 'System-1' mode, directly generating plans without any explicit search or backtracking, and a slow 'System-2' mode, planning step-by-step by explicitly searching over possible actions. While System-2 is typically more effective, it is also more computationally expensive, making it infeasible for long plans or large action spaces. Moreover, isolated System-1 or 2 ignores the user's end goals, failing to provide ways to control the model's behavior. To this end, we propose the System-1.x Planner, a controllable planning framework with LLMs that is capable of generating hybrid plans and balancing between the two planning modes based on the difficulty of the problem at hand. System-1.x consists of (i) a controller, (ii) a System-1 Planner, and (iii) a System-2 Planner. Based on a user-specified hybridization factor (x) governing the mixture between System-1 and 2, the controller decomposes a problem into sub-goals, and classifies them as easy or hard to be solved by either System-1 or 2, respectively. We fine-tune all three components on top of a single base LLM, requiring only search traces as supervision. Experiments with two diverse planning tasks -- Maze Navigation and Blocksworld -- show that our System-1.x Planner outperforms a System-1 Planner, a System-2 Planner trained to approximate A* search, and also a symbolic planner (A*). We demonstrate the following key properties of our planner: (1) controllability: increasing the hybridization factor (e.g., System-1.75 vs 1.5) performs more search, improving performance, (2) flexibility: by building a neuro-symbolic variant with a neural System-1 and a symbolic System-2, we can use existing symbolic methods, and (3) generalizability: by being able to learn from different search algorithms, our method is robust to the choice of search algorithm.
△ Less
Submitted 14 April, 2025; v1 submitted 19 July, 2024;
originally announced July 2024.
-
Angular momentum distribution for a quark dressed with a gluon: different decompositions
Authors:
Ravi Singh,
Sudeep Saha,
Asmita Mukherjee,
Nilmani Mathur
Abstract:
We present a recent calculation of the quark and gluon contributions to the angular momentum of a composite spin -$1/2$ state in QCD. The state we consider is a quark dressed with a gluon, and we use the two-component framework in light-front Hamiltonian QCD. We compare the results from different decompositions available in the literature. We also present the angular momentum distributions.
We present a recent calculation of the quark and gluon contributions to the angular momentum of a composite spin -$1/2$ state in QCD. The state we consider is a quark dressed with a gluon, and we use the two-component framework in light-front Hamiltonian QCD. We compare the results from different decompositions available in the literature. We also present the angular momentum distributions.
△ Less
Submitted 19 July, 2024; v1 submitted 18 July, 2024;
originally announced July 2024.
-
Angular dependent measurement of electron-ion recombination in liquid argon for ionization calorimetry in the ICARUS liquid argon time projection chamber
Authors:
ICARUS collaboration,
P. Abratenko,
N. Abrego-Martinez,
A. Aduszkiewic,
F. Akbar,
L. Aliaga Soplin,
M. Artero Pons,
J. Asaadi,
W. F. Badgett,
B. Baibussinov,
B. Behera,
V. Bellini,
R. Benocci,
J. Berger,
S. Berkman,
S. Bertolucci,
M. Betancourt,
M. Bonesini,
T. Boone,
B. Bottino,
A. Braggiotti,
D. Brailsford,
S. J. Brice,
V. Brio,
C. Brizzolari
, et al. (156 additional authors not shown)
Abstract:
This paper reports on a measurement of electron-ion recombination in liquid argon in the ICARUS liquid argon time projection chamber (LArTPC). A clear dependence of recombination on the angle of the ionizing particle track relative to the drift electric field is observed. An ellipsoid modified box (EMB) model of recombination describes the data across all measured angles. These measurements are us…
▽ More
This paper reports on a measurement of electron-ion recombination in liquid argon in the ICARUS liquid argon time projection chamber (LArTPC). A clear dependence of recombination on the angle of the ionizing particle track relative to the drift electric field is observed. An ellipsoid modified box (EMB) model of recombination describes the data across all measured angles. These measurements are used for the calorimetric energy scale calibration of the ICARUS TPC, which is also presented. The impact of the EMB model is studied on calorimetric particle identification, as well as muon and proton energy measurements. Accounting for the angular dependence in EMB recombination improves the accuracy and precision of these measurements.
△ Less
Submitted 9 August, 2024; v1 submitted 17 July, 2024;
originally announced July 2024.
-
Swift-BAT GUANO follow-up of gravitational-wave triggers in the third LIGO-Virgo-KAGRA observing run
Authors:
Gayathri Raman,
Samuele Ronchini,
James Delaunay,
Aaron Tohuvavohu,
Jamie A. Kennea,
Tyler Parsotan,
Elena Ambrosi,
Maria Grazia Bernardini,
Sergio Campana,
Giancarlo Cusumano,
Antonino D'Ai,
Paolo D'Avanzo,
Valerio D'Elia,
Massimiliano De Pasquale,
Simone Dichiara,
Phil Evans,
Dieter Hartmann,
Paul Kuin,
Andrea Melandri,
Paul O'Brien,
Julian P. Osborne,
Kim Page,
David M. Palmer,
Boris Sbarufatti,
Gianpiero Tagliaferri
, et al. (1797 additional authors not shown)
Abstract:
We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wav…
▽ More
We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wave Transient Catalogs (GWTC-3). Targeted searches were carried out on the entire GW sample using the maximum--likelihood NITRATES pipeline on the BAT data made available via the GUANO infrastructure. We do not detect any significant electromagnetic emission that is temporally and spatially coincident with any of the GW candidates. We report flux upper limits in the 15-350 keV band as a function of sky position for all the catalog candidates. For GW candidates where the Swift-BAT false alarm rate is less than 10$^{-3}$ Hz, we compute the GW--BAT joint false alarm rate. Finally, the derived Swift-BAT upper limits are used to infer constraints on the putative electromagnetic emission associated with binary black hole mergers.
△ Less
Submitted 27 March, 2025; v1 submitted 13 July, 2024;
originally announced July 2024.
-
Calibration and simulation of ionization signal and electronics noise in the ICARUS liquid argon time projection chamber
Authors:
ICARUS collaboration,
P. Abratenko,
N. Abrego-Martinez,
A. Aduszkiewic,
F. Akbar,
L. Aliaga Soplin,
M. Artero Pons,
J. Asaadi,
W. F. Badgett,
B. Baibussinov,
B. Behera,
V. Bellini,
R. Benocci,
J. Berger,
S. Berkman,
S. Bertolucci,
M. Betancourt,
M. Bonesini,
T. Boone,
B. Bottino,
A. Braggiotti,
D. Brailsford,
S. J. Brice,
V. Brio,
C. Brizzolari
, et al. (156 additional authors not shown)
Abstract:
The ICARUS liquid argon time projection chamber (LArTPC) neutrino detector has been taking physics data since 2022 as part of the Short-Baseline Neutrino (SBN) Program. This paper details the equalization of the response to charge in the ICARUS time projection chamber (TPC), as well as data-driven tuning of the simulation of ionization charge signals and electronics noise. The equalization procedu…
▽ More
The ICARUS liquid argon time projection chamber (LArTPC) neutrino detector has been taking physics data since 2022 as part of the Short-Baseline Neutrino (SBN) Program. This paper details the equalization of the response to charge in the ICARUS time projection chamber (TPC), as well as data-driven tuning of the simulation of ionization charge signals and electronics noise. The equalization procedure removes non-uniformities in the ICARUS TPC response to charge in space and time. This work leverages the copious number of cosmic ray muons available to ICARUS at the surface. The ionization signal shape simulation applies a novel procedure that tunes the simulation to match what is measured in data. The end result of the equalization procedure and simulation tuning allows for a comparison of charge measurements in ICARUS between Monte Carlo simulation and data, showing good performance with minimal residual bias between the two.
△ Less
Submitted 5 August, 2024; v1 submitted 16 July, 2024;
originally announced July 2024.
-
FedMRL: Data Heterogeneity Aware Federated Multi-agent Deep Reinforcement Learning for Medical Imaging
Authors:
Pranab Sahoo,
Ashutosh Tripathi,
Sriparna Saha,
Samrat Mondal
Abstract:
Despite recent advancements in federated learning (FL) for medical image diagnosis, addressing data heterogeneity among clients remains a significant challenge for practical implementation. A primary hurdle in FL arises from the non-IID nature of data samples across clients, which typically results in a decline in the performance of the aggregated global model. In this study, we introduce FedMRL,…
▽ More
Despite recent advancements in federated learning (FL) for medical image diagnosis, addressing data heterogeneity among clients remains a significant challenge for practical implementation. A primary hurdle in FL arises from the non-IID nature of data samples across clients, which typically results in a decline in the performance of the aggregated global model. In this study, we introduce FedMRL, a novel federated multi-agent deep reinforcement learning framework designed to address data heterogeneity. FedMRL incorporates a novel loss function to facilitate fairness among clients, preventing bias in the final global model. Additionally, it employs a multi-agent reinforcement learning (MARL) approach to calculate the proximal term $(μ)$ for the personalized local objective function, ensuring convergence to the global optimum. Furthermore, FedMRL integrates an adaptive weight adjustment method using a Self-organizing map (SOM) on the server side to counteract distribution shifts among clients' local data distributions. We assess our approach using two publicly available real-world medical datasets, and the results demonstrate that FedMRL significantly outperforms state-of-the-art techniques, showing its efficacy in addressing data heterogeneity in federated learning. The code can be found here~{\url{https://github.com/Pranabiitp/FedMRL}}.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.