-
Students' Reliance on AI in Higher Education: Identifying Contributing Factors
Authors:
Griffin Pitts,
Neha Rani,
Weedguet Mildort,
Eva-Marie Cook
Abstract:
The increasing availability and use of artificial intelligence (AI) tools in educational settings has raised concerns about students' overreliance on these technologies. Overreliance occurs when individuals accept incorrect AI-generated recommendations, often without critical evaluation, leading to flawed problem solutions and undermining learning outcomes. This study investigates potential factor…
▽ More
The increasing availability and use of artificial intelligence (AI) tools in educational settings has raised concerns about students' overreliance on these technologies. Overreliance occurs when individuals accept incorrect AI-generated recommendations, often without critical evaluation, leading to flawed problem solutions and undermining learning outcomes. This study investigates potential factors contributing to patterns of AI reliance among undergraduate students, examining not only overreliance but also appropriate reliance (correctly accepting helpful and rejecting harmful recommendations) and underreliance (incorrectly rejecting helpful recommendations). Our approach combined pre- and post-surveys with a controlled experimental task where participants solved programming problems with an AI assistant that provided both accurate and deliberately incorrect suggestions, allowing direct observation of students' reliance patterns when faced with varying AI reliability. We find that appropriate reliance is significantly related to students' programming self-efficacy, programming literacy, and need for cognition, while showing negative correlations with post-task trust and satisfaction. Overreliance showed significant correlations with post-task trust and satisfaction with the AI assistant. Underreliance was negatively correlated with programming literacy, programming self-efficacy, and need for cognition. Overall, the findings provide insights for developing targeted interventions that promote appropriate reliance on AI tools, with implications for the integration of AI in curriculum and educational technologies.
△ Less
Submitted 16 June, 2025;
originally announced June 2025.
-
The Amazon Nova Family of Models: Technical Report and Model Card
Authors:
Amazon AGI,
Aaron Langford,
Aayush Shah,
Abhanshu Gupta,
Abhimanyu Bhatter,
Abhinav Goyal,
Abhinav Mathur,
Abhinav Mohanty,
Abhishek Kumar,
Abhishek Sethi,
Abi Komma,
Abner Pena,
Achin Jain,
Adam Kunysz,
Adam Opyrchal,
Adarsh Singh,
Aditya Rawal,
Adok Achar Budihal Prasad,
Adrià de Gispert,
Agnika Kumar,
Aishwarya Aryamane,
Ajay Nair,
Akilan M,
Akshaya Iyengar,
Akshaya Vishnu Kudlu Shanbhogue
, et al. (761 additional authors not shown)
Abstract:
We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents…
▽ More
We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents and text. Amazon Nova Micro is a text-only model that delivers our lowest-latency responses at very low cost. Amazon Nova Canvas is an image generation model that creates professional grade images with rich customization controls. Amazon Nova Reel is a video generation model offering high-quality outputs, customization, and motion control. Our models were built responsibly and with a commitment to customer trust, security, and reliability. We report benchmarking results for core capabilities, agentic performance, long context, functional adaptation, runtime performance, and human evaluation.
△ Less
Submitted 17 March, 2025;
originally announced June 2025.
-
Patchy Helium and Hydrogen Reionization from the Kinetic Sunyaev-Zel'dovich Effect and Galaxies
Authors:
Neha Anil Kumar,
Mesut Çalışkan,
Selim C. Hotinli,
Marc Kamionkowski,
Simone Ferraro,
Kendrick Smith
Abstract:
Upcoming cosmic microwave background (CMB) experiments will measure temperature fluctuations on small angular scales with unprecedented precision, enabling improved measurements of the kinetic Sunyaev-Zel'dovich (kSZ) effect. This secondary anisotropy has emerged as a valuable probe of the distribution of ionized electrons in the post-recombination Universe. Although the sensitivity of the kSZ eff…
▽ More
Upcoming cosmic microwave background (CMB) experiments will measure temperature fluctuations on small angular scales with unprecedented precision, enabling improved measurements of the kinetic Sunyaev-Zel'dovich (kSZ) effect. This secondary anisotropy has emerged as a valuable probe of the distribution of ionized electrons in the post-recombination Universe. Although the sensitivity of the kSZ effect has recently been utilized to study the high-redshift epoch of hydrogen (H) reionization, its redshift-integrated nature -- combined with anticipated improvements in measurement precision -- suggests that accounting for the later epoch of helium (He) reionization will become increasingly important in the near future. Joint characterization of the epochs will allow for a more coherent understanding of early-star and -quasar formation, as these sources drive the ionization of H and He in the intergalactic medium. In this paper, we extend the kSZ higher-order statistic introduced by Smith \& Ferraro (2017) to forecast the ability of upcoming CMB surveys to probe the morphology of both H and He reionization. Moreover, given that upcoming large-scale structure surveys will trace density fluctuations at redshifts overlapping with the epoch of He reionization, we propose a novel cross-correlation between the kSZ higher-order statistic and galaxy survey measurements. Using a joint information-matrix analysis of H and He reionization, we show that next-generation CMB and galaxy surveys will have sufficient statistical power to characterize the patchy morphology of H reionization and set constraints on the redshift evolution of its He counterpart.
△ Less
Submitted 12 June, 2025;
originally announced June 2025.
-
Magistral
Authors:
Mistral-AI,
:,
Abhinav Rastogi,
Albert Q. Jiang,
Andy Lo,
Gabrielle Berrada,
Guillaume Lample,
Jason Rute,
Joep Barmentlo,
Karmesh Yadav,
Kartik Khandelwal,
Khyathi Raghavi Chandu,
Léonard Blier,
Lucile Saulnier,
Matthieu Dinot,
Maxime Darrin,
Neha Gupta,
Roman Soletskyi,
Sagar Vaze,
Teven Le Scao,
Yihan Wang,
Adam Yang,
Alexander H. Liu,
Alexandre Sablayrolles,
Amélie Héliou
, et al. (76 additional authors not shown)
Abstract:
We introduce Magistral, Mistral's first reasoning model and our own scalable reinforcement learning (RL) pipeline. Instead of relying on existing implementations and RL traces distilled from prior models, we follow a ground up approach, relying solely on our own models and infrastructure. Notably, we demonstrate a stack that enabled us to explore the limits of pure RL training of LLMs, present a s…
▽ More
We introduce Magistral, Mistral's first reasoning model and our own scalable reinforcement learning (RL) pipeline. Instead of relying on existing implementations and RL traces distilled from prior models, we follow a ground up approach, relying solely on our own models and infrastructure. Notably, we demonstrate a stack that enabled us to explore the limits of pure RL training of LLMs, present a simple method to force the reasoning language of the model, and show that RL on text data alone maintains most of the initial checkpoint's capabilities. We find that RL on text maintains or improves multimodal understanding, instruction following and function calling. We present Magistral Medium, trained for reasoning on top of Mistral Medium 3 with RL alone, and we open-source Magistral Small (Apache 2.0) which further includes cold-start data from Magistral Medium.
△ Less
Submitted 12 June, 2025;
originally announced June 2025.
-
First positronium imaging using $^{44}$Sc with the J-PET scanner: a case study on the NEMA-Image Quality phantom
Authors:
Manish Das,
Sushil Sharma,
Aleksander Bilewicz,
Jarosław Choiński,
Neha Chug,
Catalina Curceanu,
Eryk Czerwiński,
Jakub Hajduga,
Sharareh Jalali,
Krzysztof Kacprzak,
Tevfik Kaplanoglu,
Łukasz Kapłon,
Kamila Kasperska,
Aleksander Khreptak,
Grzegorz Korcyl,
Tomasz Kozik,
Karol Kubat,
Deepak Kumar,
Anoop Kunimmal Venadan,
Edward Lisowski,
Filip Lisowski,
Justyna Medrala-Sowa,
Simbarashe Moyo,
Wiktor Mryka,
Szymon Niedźwiecki
, et al. (19 additional authors not shown)
Abstract:
Positronium Lifetime Imaging (PLI), an emerging extension of conventional positron emission tomography (PET) imaging, offers a novel window for probing the submolecular properties of biological tissues by imaging the mean lifetime of the positronium atom. Currently, the method is under rapid development in terms of reconstruction and detection systems. Recently, the first in vivo PLI of the human…
▽ More
Positronium Lifetime Imaging (PLI), an emerging extension of conventional positron emission tomography (PET) imaging, offers a novel window for probing the submolecular properties of biological tissues by imaging the mean lifetime of the positronium atom. Currently, the method is under rapid development in terms of reconstruction and detection systems. Recently, the first in vivo PLI of the human brain was performed using the J-PET scanner utilizing the $^{68}$Ga isotope. However, this isotope has limitations due to its comparatively low prompt gamma yields, which is crucial for positronium lifetime measurement. Among alternative radionuclides, $^{44}$Sc stands out as a promising isotope for PLI, characterized by a clinically suitable half-life (4.04 hours) emitting 1157 keV prompt gamma in 100% cases after the emission of the positron. This study reports the first experimental demonstration of PLI with $^{44}$Sc, carried out on a NEMA-Image Quality (IQ) phantom using the Modular J-PET tomograph-the first plastic scintillators-based PET scanner.
△ Less
Submitted 8 June, 2025;
originally announced June 2025.
-
To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt
Authors:
Zhilong Wang,
Neha Nagaraja,
Lan Zhang,
Hayretdin Bahsi,
Pawan Patil,
Peng Liu
Abstract:
LLM agents are widely used as agents for customer support, content generation, and code assistance. However, they are vulnerable to prompt injection attacks, where adversarial inputs manipulate the model's behavior. Traditional defenses like input sanitization, guard models, and guardrails are either cumbersome or ineffective. In this paper, we propose a novel, lightweight defense mechanism called…
▽ More
LLM agents are widely used as agents for customer support, content generation, and code assistance. However, they are vulnerable to prompt injection attacks, where adversarial inputs manipulate the model's behavior. Traditional defenses like input sanitization, guard models, and guardrails are either cumbersome or ineffective. In this paper, we propose a novel, lightweight defense mechanism called Polymorphic Prompt Assembling (PPA), which protects against prompt injection with near-zero overhead. The approach is based on the insight that prompt injection requires guessing and breaking the structure of the system prompt. By dynamically varying the structure of system prompts, PPA prevents attackers from predicting the prompt structure, thereby enhancing security without compromising performance. We conducted experiments to evaluate the effectiveness of PPA against existing attacks and compared it with other defense methods.
△ Less
Submitted 6 June, 2025;
originally announced June 2025.
-
Interpreting the chromatic polynomial coefficients via hyperplane arrangements
Authors:
Neha Goregaokar
Abstract:
A recent result of Lofano and Paolini expresses the characteristic polynomial of a real hyperplane arrangement in terms of a projection statistic on the regions of the arrangement. We use this result to give an alternative proof for Greene and Zaslavsky's interpretation for the coefficients of the chromatic polynomial of a graph. We also show that this projection statistic has a nice combinatorial…
▽ More
A recent result of Lofano and Paolini expresses the characteristic polynomial of a real hyperplane arrangement in terms of a projection statistic on the regions of the arrangement. We use this result to give an alternative proof for Greene and Zaslavsky's interpretation for the coefficients of the chromatic polynomial of a graph. We also show that this projection statistic has a nice combinatorial interpretation in the case of the braid arrangement, which generalizes to graphical arrangements of natural unit interval graphs. We use this generalization to give a new proof of the formula for the chromatic polynomial of a natural unit interval graph.
△ Less
Submitted 1 June, 2025;
originally announced June 2025.
-
Estimating the binary neutron star merger rate density evolution with Einstein Telescope
Authors:
Neha Singh,
Tomasz Bulik,
Aleksandra Olejak
Abstract:
The Einstein Telescope (ET) is a proposed third-generation, wide-band gravitational wave (GW) detector which will have an improved detection sensitivity in low frequencies, leading to a longer observation time in the detection band and higher detection rate for binary neutron stars (BNSs). Despite the fact that ET will have a higher detection rate, a large fraction of BNSs will remain undetectable…
▽ More
The Einstein Telescope (ET) is a proposed third-generation, wide-band gravitational wave (GW) detector which will have an improved detection sensitivity in low frequencies, leading to a longer observation time in the detection band and higher detection rate for binary neutron stars (BNSs). Despite the fact that ET will have a higher detection rate, a large fraction of BNSs will remain undetectable. We present a scheme to estimate accurate detection efficiency and to reconstruct the true merger rate density of the population of the BNSs, as a function of redshift. We show that with ET as a single instrumnet, for a population of BNSs with $R_{mer} \sim 100 (300)$ $\rm Gpc^{-3} yr^{-1}$ at $z\sim 0(2)$, we can reconstruct the merger rate density uptil $z \sim 2$ , with a relative error of $12\%$ at ($z \sim 2$), despite the loss in detection of the bulk of the BNS population.
△ Less
Submitted 26 May, 2025;
originally announced May 2025.
-
Pre-Act: Multi-Step Planning and Reasoning Improves Acting in LLM Agents
Authors:
Mrinal Rawat,
Ambuje Gupta,
Rushil Goomer,
Alessandro Di Bari,
Neha Gupta,
Roberto Pieraccini
Abstract:
The ReAct (Reasoning + Action) capability in large language models (LLMs) has become the foundation of modern agentic systems. Recent LLMs, such as DeepSeek-R1 and OpenAI o1/o3, exemplify this by emphasizing reasoning through the generation of ample intermediate tokens, which help build a strong premise before producing the final output tokens. In this paper, we introduce Pre-Act, a novel approach…
▽ More
The ReAct (Reasoning + Action) capability in large language models (LLMs) has become the foundation of modern agentic systems. Recent LLMs, such as DeepSeek-R1 and OpenAI o1/o3, exemplify this by emphasizing reasoning through the generation of ample intermediate tokens, which help build a strong premise before producing the final output tokens. In this paper, we introduce Pre-Act, a novel approach that enhances the agent's performance by creating a multi-step execution plan along with the detailed reasoning for the given user input. This plan incrementally incorporates previous steps and tool outputs, refining itself after each step execution until the final response is obtained. Our approach is applicable to both conversational and non-conversational agents. To measure the performance of task-oriented agents comprehensively, we propose a two-level evaluation framework: (1) turn level and (2) end-to-end. Our turn-level evaluation, averaged across five models, shows that our approach, Pre-Act, outperforms ReAct by 70% in Action Recall on the Almita dataset. While this approach is effective for larger models, smaller models crucial for practical applications, where latency and cost are key constraints, often struggle with complex reasoning tasks required for agentic systems. To address this limitation, we fine-tune relatively small models such as Llama 3.1 (8B & 70B) using the proposed Pre-Act approach. Our experiments show that the fine-tuned 70B model outperforms GPT-4, achieving a 69.5% improvement in action accuracy (turn-level) and a 28% improvement in goal completion rate (end-to-end) on the Almita (out-of-domain) dataset.
△ Less
Submitted 18 May, 2025; v1 submitted 15 May, 2025;
originally announced May 2025.
-
Photoproduction and detection of $ρ'\rightarrowπ^+π^-π^+π^-$ decays in ultra-peripheral collisions and at an electron-ion collider
Authors:
Neha Devi,
Minjung Kim,
Spencer R. Klein,
Janet Seger
Abstract:
Vector meson photoproduction is an important probe of nuclear structure. Light vector mesons are most sensitive to low$-x$ structure, as long as they are not too light for perturbative QCD calculations. The $ρ'$ is of interest as an intermediate mass state (between the $ρ$ and $J/ψ$) that is easier to detect than the $φ$.
Using HERA data on proton targets, we make projections for lead/gold targe…
▽ More
Vector meson photoproduction is an important probe of nuclear structure. Light vector mesons are most sensitive to low$-x$ structure, as long as they are not too light for perturbative QCD calculations. The $ρ'$ is of interest as an intermediate mass state (between the $ρ$ and $J/ψ$) that is easier to detect than the $φ$.
Using HERA data on proton targets, we make projections for lead/gold targets in UPCs at the Large Hadron Collider and RHIC, and for $ep$ and $eA$ collisions at a future Electron-Ion Collider (EIC). These projections for ion targets depend on the largely-unknown $ρ'\rightarrowπ^+π^-π^+π^-$ branching ratio, and use existing data to constrain that branching ratio. Current data points to a relatively low branching ratio, less than 50\%. The HERA $ep$ and ALICE UPC $e$Pb data exhibit very similar $4π$ mass spectra, indicating that, if the system is composed of two resonances, the products of their photon couplings with their four-pion branching ratios are similar.
The predicted rates are high for both UPCs and the EIC. The $ρ'\rightarrowπ^+π^-π^+π^-$ decay can be observed at the EIC with high efficiency. In $ep$ collisions at the highest energy, the forward B0 detector is needed to observe this channel down to the lowest achievable Bjorken$-x$ values.
△ Less
Submitted 1 May, 2025;
originally announced May 2025.
-
LIFT: LLM-Based Pragma Insertion for HLS via GNN Supervised Fine-Tuning
Authors:
Neha Prakriya,
Zijian Ding,
Yizhou Sun,
Jason Cong
Abstract:
FPGAs are increasingly adopted in datacenter environments for their reconfigurability and energy efficiency. High-Level Synthesis (HLS) tools have eased FPGA programming by raising the abstraction level from RTL to untimed C/C++, yet attaining high performance still demands expert knowledge and iterative manual insertion of optimization pragmas to modify the microarchitecture. To address this chal…
▽ More
FPGAs are increasingly adopted in datacenter environments for their reconfigurability and energy efficiency. High-Level Synthesis (HLS) tools have eased FPGA programming by raising the abstraction level from RTL to untimed C/C++, yet attaining high performance still demands expert knowledge and iterative manual insertion of optimization pragmas to modify the microarchitecture. To address this challenge, we propose LIFT, a large language model (LLM)-based coding assistant for HLS that automatically generates performance-critical pragmas given a C/C++ design. We fine-tune the LLM by tightly integrating and supervising the training process with a graph neural network (GNN), combining the sequential modeling capabilities of LLMs with the structural and semantic understanding of GNNs necessary for reasoning over code and its control/data dependencies. On average, LIFT produces designs that improve performance by 3.52x and 2.16x than prior state-of the art AutoDSE and HARP respectively, and 66x than GPT-4o.
△ Less
Submitted 29 April, 2025;
originally announced April 2025.
-
Towards Long Context Hallucination Detection
Authors:
Siyi Liu,
Kishaloy Halder,
Zheng Qi,
Wei Xiao,
Nikolaos Pappas,
Phu Mon Htut,
Neha Anna John,
Yassine Benajiba,
Dan Roth
Abstract:
Large Language Models (LLMs) have demonstrated remarkable performance across various tasks. However, they are prone to contextual hallucination, generating information that is either unsubstantiated or contradictory to the given context. Although many studies have investigated contextual hallucinations in LLMs, addressing them in long-context inputs remains an open problem. In this work, we take a…
▽ More
Large Language Models (LLMs) have demonstrated remarkable performance across various tasks. However, they are prone to contextual hallucination, generating information that is either unsubstantiated or contradictory to the given context. Although many studies have investigated contextual hallucinations in LLMs, addressing them in long-context inputs remains an open problem. In this work, we take an initial step toward solving this problem by constructing a dataset specifically designed for long-context hallucination detection. Furthermore, we propose a novel architecture that enables pre-trained encoder models, such as BERT, to process long contexts and effectively detect contextual hallucinations through a decomposition and aggregation mechanism. Our experimental results show that the proposed architecture significantly outperforms previous models of similar size as well as LLM-based models across various metrics, while providing substantially faster inference.
△ Less
Submitted 27 April, 2025;
originally announced April 2025.
-
Noncentral moderate deviations for time-changed multivariate Lévy processes with linear combinations of inverse stable subordinators
Authors:
Neha Gupta,
Claudio Macci
Abstract:
The term noncentral moderate deviations is used in the literature to mean a class of large deviation principles that, in some sense, fills the gap between the convergence in probability to a constant (governed by a reference large deviation principle) and a weak convergence to a non-Gaussian (and non-degenerating) distribution. Some noncentral moderate deviation results in the literature concern t…
▽ More
The term noncentral moderate deviations is used in the literature to mean a class of large deviation principles that, in some sense, fills the gap between the convergence in probability to a constant (governed by a reference large deviation principle) and a weak convergence to a non-Gaussian (and non-degenerating) distribution. Some noncentral moderate deviation results in the literature concern time-changed univariate Lévy processes, where the time-changes are given by inverse stable subordinators. In this paper we present analogue results for multivariate Lévy processes; in particular the random time-changes are suitable linear combinations of independent inverse stable subordinators.
△ Less
Submitted 25 April, 2025;
originally announced April 2025.
-
S2Vec: Self-Supervised Geospatial Embeddings
Authors:
Shushman Choudhury,
Elad Aharoni,
Chandrakumari Suvarna,
Iveel Tsogsuren,
Abdul Rahman Kreidieh,
Chun-Ta Lu,
Neha Arora
Abstract:
Scalable general-purpose representations of the built environment are crucial for geospatial artificial intelligence applications. This paper introduces S2Vec, a novel self-supervised framework for learning such geospatial embeddings. S2Vec uses the S2 Geometry library to partition large areas into discrete S2 cells, rasterizes built environment feature vectors within cells as images, and applies…
▽ More
Scalable general-purpose representations of the built environment are crucial for geospatial artificial intelligence applications. This paper introduces S2Vec, a novel self-supervised framework for learning such geospatial embeddings. S2Vec uses the S2 Geometry library to partition large areas into discrete S2 cells, rasterizes built environment feature vectors within cells as images, and applies masked autoencoding on these rasterized images to encode the feature vectors. This approach yields task-agnostic embeddings that capture local feature characteristics and broader spatial relationships. We evaluate S2Vec on three large-scale socioeconomic prediction tasks, showing its competitive performance against state-of-the-art image-based embeddings. We also explore the benefits of combining S2Vec embeddings with image-based embeddings downstream, showing that such multimodal fusion can often improve performance. Our results highlight how S2Vec can learn effective general-purpose geospatial representations and how it can complement other data modalities in geospatial artificial intelligence.
△ Less
Submitted 10 April, 2025;
originally announced April 2025.
-
DataS^3: Dataset Subset Selection for Specialization
Authors:
Neha Hulkund,
Alaa Maalouf,
Levi Cai,
Daniel Yang,
Tsun-Hsuan Wang,
Abigail O'Neil,
Timm Haucke,
Sandeep Mukherjee,
Vikram Ramaswamy,
Judy Hansen Shen,
Gabriel Tseng,
Mike Walmsley,
Daniela Rus,
Ken Goldberg,
Hannah Kerner,
Irene Chen,
Yogesh Girdhar,
Sara Beery
Abstract:
In many real-world machine learning (ML) applications (e.g. detecting broken bones in x-ray images, detecting species in camera traps), in practice models need to perform well on specific deployments (e.g. a specific hospital, a specific national park) rather than the domain broadly. However, deployments often have imbalanced, unique data distributions. Discrepancy between the training distributio…
▽ More
In many real-world machine learning (ML) applications (e.g. detecting broken bones in x-ray images, detecting species in camera traps), in practice models need to perform well on specific deployments (e.g. a specific hospital, a specific national park) rather than the domain broadly. However, deployments often have imbalanced, unique data distributions. Discrepancy between the training distribution and the deployment distribution can lead to suboptimal performance, highlighting the need to select deployment-specialized subsets from the available training data. We formalize dataset subset selection for specialization (DS3): given a training set drawn from a general distribution and a (potentially unlabeled) query set drawn from the desired deployment-specific distribution, the goal is to select a subset of the training data that optimizes deployment performance.
We introduce DataS^3; the first dataset and benchmark designed specifically for the DS3 problem. DataS^3 encompasses diverse real-world application domains, each with a set of distinct deployments to specialize in. We conduct a comprehensive study evaluating algorithms from various families--including coresets, data filtering, and data curation--on DataS^3, and find that general-distribution methods consistently fail on deployment-specific tasks. Additionally, we demonstrate the existence of manually curated (deployment-specific) expert subsets that outperform training on all available data with accuracy gains up to 51.3 percent. Our benchmark highlights the critical role of tailored dataset curation in enhancing performance and training efficiency on deployment-specific distributions, which we posit will only become more important as global, public datasets become available across domains and ML models are deployed in the real world.
△ Less
Submitted 22 April, 2025;
originally announced April 2025.
-
Parental Imprints On Birth Weight: A Data-Driven Model For Neonatal Prediction In Low Resource Prenatal Care
Authors:
Rajeshwari Mistri,
Harsh Joshi,
Nachiket Kapure,
Parul Kumari,
Manasi Mali,
Seema Purohit,
Neha Sharma,
Mrityunjoy Panday,
Chittaranjan S. Yajnik
Abstract:
Accurate fetal birth weight prediction is a cornerstone of prenatal care, yet traditional methods often rely on imaging technologies that remain inaccessible in resource-limited settings. This study presents a novel machine learning-based framework that circumvents these conventional dependencies, using a diverse set of physiological, environmental, and parental factors to refine birth weight esti…
▽ More
Accurate fetal birth weight prediction is a cornerstone of prenatal care, yet traditional methods often rely on imaging technologies that remain inaccessible in resource-limited settings. This study presents a novel machine learning-based framework that circumvents these conventional dependencies, using a diverse set of physiological, environmental, and parental factors to refine birth weight estimation. A multi-stage feature selection pipeline filters the dataset into an optimized subset, demonstrating previously underexplored yet clinically relevant predictors of fetal growth. By integrating advanced regression architectures and ensemble learning strategies, the model captures non-linear relationships often overlooked by traditional approaches, offering a predictive solution that is both interpretable and scalable. Beyond predictive accuracy, this study addresses a question: whether birth weight can be reliably estimated without conventional diagnostic tools. The findings challenge entrenched methodologies by introducing an alternative pathway that enhances accessibility without compromising clinical utility. While limitations exist, the study lays the foundation for a new era in prenatal analytics, one where data-driven inference competes with, and potentially redefines, established medical assessments. By bridging computational intelligence with obstetric science, this research establishes a framework for equitable, technology-driven advancements in maternal-fetal healthcare.
△ Less
Submitted 7 April, 2025;
originally announced April 2025.
-
Input to the ESPPU 2026 update: Searching for millicharged particles with the FORMOSA experiment at the CERN LHC
Authors:
Matthew Citron,
Frank Golf,
Kranti Gunthoti,
Andrew Haas,
Christopher S. Hill,
Dariush Imani,
Samantha Kelly,
Ming Liu,
Steven Lowette,
Albert De Roeck,
Sai Neha Santpur,
Ryan Schmitz,
Jacob Steenis,
David Stuart,
Yu-Dai Tsai,
Juan Salvador Tafoya Vargas,
Tiepolo Wybouw,
Jaehyeok Yoo
Abstract:
In this contribution, we evaluate the sensitivity for particles with charges much smaller than the electron charge with a dedicated scintillator-based detector in the far forward region at the CERN LHC, FORMOSA. This contribution will outline the scientific case for this detector, its design and potential locations, and the sensitivity that can be achieved. The ongoing efforts to prove the feasibi…
▽ More
In this contribution, we evaluate the sensitivity for particles with charges much smaller than the electron charge with a dedicated scintillator-based detector in the far forward region at the CERN LHC, FORMOSA. This contribution will outline the scientific case for this detector, its design and potential locations, and the sensitivity that can be achieved. The ongoing efforts to prove the feasibility of the detector with the FORMOSA demonstrator will be discussed. Finally, possible upgrades to the detector through the use of high-performance scintillator will be discussed.
△ Less
Submitted 17 April, 2025;
originally announced April 2025.
-
Normal state and superconducting state properties of high entropy Ta0.2Nb0.2V0.2Ti0.2X0.2 (X = Zr and Hf )
Authors:
Nikita Sharma,
J. Link,
Kuldeep Kargeti,
Neha Sharma,
I. Heinmaa,
S. K. Panda,
R. Stern,
Tirthankar Chakraborty,
Tanmoy Chakrabarty,
Sourav Marik
Abstract:
High entropy alloy superconductors represent a unique blend of advanced material systems and quantum physics, offering significant potential for advancing superconducting technologies. In this study, we report a detailed theoretical and experimental investigation of high entropy alloy superconductors Ta0.2Nb0.2V0.2Ti0.2X0.2 (X = Zr and Hf). Our study unveils that both the materials crystallize in…
▽ More
High entropy alloy superconductors represent a unique blend of advanced material systems and quantum physics, offering significant potential for advancing superconducting technologies. In this study, we report a detailed theoretical and experimental investigation of high entropy alloy superconductors Ta0.2Nb0.2V0.2Ti0.2X0.2 (X = Zr and Hf). Our study unveils that both the materials crystallize in a body-centered cubic structure (space group: I m -3 m) and exhibit bulk superconductivity with a superconducting onset temperature of (Tonset C ) of 5 K for X = Hf and 6.19 K for X = Zr sample. Our detailed analysis, including magnetization, resistivity, heat capacity measurements, and density functional theory (DFT) calculations indicates moderately coupled isotropic s-wave superconductivity in these materials. Our DFT results find significant spectral weight at the Fermi energy and phonon spectra is free of imaginary modes, confirming the dynamical stability and metallic nature of these alloys. Remarkably, we have observed a high upper critical field (HC2(0)) surpassing the Pauli paramagnetic limit for the X = Hf sample and explained it on the basis of the increased spin-orbit coupling in the structure. Ta0.2Nb0.2V0.2Ti0.2Zr0.2, on the other hand, shows a conventional HC2 behaviour. With the dynamical stability of these alloys, excellent normal state metallic nature, high micro-hardness, and high upper critical field, these samples emerge as potential candidates for future applications in superconducting devices.
△ Less
Submitted 10 April, 2025;
originally announced April 2025.
-
Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi
Authors:
Monojit Choudhury,
Shivam Chauhan,
Rocktim Jyoti Das,
Dhruv Sahnan,
Xudong Han,
Haonan Li,
Aaryamonvikram Singh,
Alok Anil Jadhav,
Utkarsh Agarwal,
Mukund Choudhary,
Debopriyo Banerjee,
Fajri Koto,
Junaid Bhat,
Awantika Shukla,
Samujjwal Ghosh,
Samta Kamboj,
Onkar Pandit,
Lalit Pradhan,
Rahul Pal,
Sunil Sahu,
Soundar Doraiswamy,
Parvez Mullah,
Ali El Filali,
Neha Sengupta,
Gokul Ramakrishnan
, et al. (5 additional authors not shown)
Abstract:
Developing high-quality large language models (LLMs) for moderately resourced languages presents unique challenges in data availability, model adaptation, and evaluation. We introduce Llama-3-Nanda-10B-Chat, or Nanda for short, a state-of-the-art Hindi-centric instruction-tuned generative LLM, designed to push the boundaries of open-source Hindi language models. Built upon Llama-3-8B, Nanda incorp…
▽ More
Developing high-quality large language models (LLMs) for moderately resourced languages presents unique challenges in data availability, model adaptation, and evaluation. We introduce Llama-3-Nanda-10B-Chat, or Nanda for short, a state-of-the-art Hindi-centric instruction-tuned generative LLM, designed to push the boundaries of open-source Hindi language models. Built upon Llama-3-8B, Nanda incorporates continuous pre-training with expanded transformer blocks, leveraging the Llama Pro methodology. A key challenge was the limited availability of high-quality Hindi text data; we addressed this through rigorous data curation, augmentation, and strategic bilingual training, balancing Hindi and English corpora to optimize cross-linguistic knowledge transfer. With 10 billion parameters, Nanda stands among the top-performing open-source Hindi and multilingual models of similar scale, demonstrating significant advantages over many existing models. We provide an in-depth discussion of training strategies, fine-tuning techniques, safety alignment, and evaluation metrics, demonstrating how these approaches enabled Nanda to achieve state-of-the-art results. By open-sourcing Nanda, we aim to advance research in Hindi LLMs and support a wide range of real-world applications across academia, industry, and public services.
△ Less
Submitted 8 April, 2025;
originally announced April 2025.
-
Triple differential cross-section for Laser-assisted (e,2e) process on $H_2O$ molecule by Plane and Twisted electrons Impact
Authors:
Neha,
Rakesh Choubisa
Abstract:
Twisted electron-impact single ionization ((e,2e) process) of water molecule is theoretically studied in the presence of an external laser field. Calculations have been performed in the framework of the first Born approximation in coplanar asymmetric geometry. The wave functions for the fast (incident/scattered) electron and the ejected electron are described by the Volkov and Coulomb Volkov wave…
▽ More
Twisted electron-impact single ionization ((e,2e) process) of water molecule is theoretically studied in the presence of an external laser field. Calculations have been performed in the framework of the first Born approximation in coplanar asymmetric geometry. The wave functions for the fast (incident/scattered) electron and the ejected electron are described by the Volkov and Coulomb Volkov wave functions respectively which take into account the interaction of the laser field with incident/scattered and ejected electrons respectively. We calculate the triple differential cross section (TDCS) corresponding to the laser-assisted (e,2e) process for different orientations of the linearly polarised laser field. We describe the molecular state of $H_{2}O$ by the linear combination of atomic orbitals (self-consistent field LCAO method). The angular profile of the TDCS for plane wave electrons is strongly modified by the laser field compared to that of without laser field. However, for the twisted electrons with laser field the angular distribution is slightly changed from those of without laser field. We study the angular distribution of TDCS for laser field polarization parallel to the incident momentum ($\varepsilon_0$ $\parallel$ $k_i$), parallel to momentum transfer ($\varepsilon_0$ $\parallel$ $Δ$) and perpendicular to the momentum transfer ($\varepsilon_0$ $\perp$ $Δ$) and we observe that the $\varepsilon_0$ $\parallel$ $Δ$ has the highest magnitude of TDCS. For the orientations $\varepsilon_0$ $\parallel$ $k_i$ and $\varepsilon_0$ $\perp$ $Δ$, the laser-assisted plane wave studies shows oscillatory nature of TDCS but for the orientation $\varepsilon_0$ $\parallel$ $Δ$ we observe only recoil peak for $p-like$ character orbitals and dual peak, a recoil and binary peaks for the $s-like$ character orbital.
△ Less
Submitted 26 May, 2025; v1 submitted 4 April, 2025;
originally announced April 2025.
-
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection
Authors:
Divya Velayudhan,
Abdelfatah Ahmed,
Mohamad Alansari,
Neha Gour,
Abderaouf Behouch,
Taimur Hassan,
Syed Talal Wasim,
Nabil Maalej,
Muzammal Naseer,
Juergen Gall,
Mohammed Bennamoun,
Ernesto Damiani,
Naoufel Werghi
Abstract:
Advancements in Computer-Aided Screening (CAS) systems are essential for improving the detection of security threats in X-ray baggage scans. However, current datasets are limited in representing real-world, sophisticated threats and concealment tactics, and existing approaches are constrained by a closed-set paradigm with predefined labels. To address these challenges, we introduce STCray, the fir…
▽ More
Advancements in Computer-Aided Screening (CAS) systems are essential for improving the detection of security threats in X-ray baggage scans. However, current datasets are limited in representing real-world, sophisticated threats and concealment tactics, and existing approaches are constrained by a closed-set paradigm with predefined labels. To address these challenges, we introduce STCray, the first multimodal X-ray baggage security dataset, comprising 46,642 image-caption paired scans across 21 threat categories, generated using an X-ray scanner for airport security. STCray is meticulously developed with our specialized protocol that ensures domain-aware, coherent captions, that lead to the multi-modal instruction following data in X-ray baggage security. This allows us to train a domain-aware visual AI assistant named STING-BEE that supports a range of vision-language tasks, including scene comprehension, referring threat localization, visual grounding, and visual question answering (VQA), establishing novel baselines for multi-modal learning in X-ray baggage security. Further, STING-BEE shows state-of-the-art generalization in cross-domain settings. Code, data, and models are available at https://divs1159.github.io/STING-BEE/.
△ Less
Submitted 3 April, 2025;
originally announced April 2025.
-
Nanocrystal tuned ammonia gas sensing technique via impedance spectroscopy
Authors:
Neha Sharma,
Debanjan Bhattacharjee,
Sunita Kumari,
Sandip Paul Choudhury
Abstract:
Ammonia is a harmful chemical hazard known for its widespread industrial use. Exposure to ammonia can cause environmental damage, human health hazards, and huge economic losses. Therefore, ammonia gas sensors are essential for detecting ammonia leaks to avoid serious accidental injury and death. In this study, we synthesize a nanostructured (WO3) n-type metal oxide semiconductor doped with a rare…
▽ More
Ammonia is a harmful chemical hazard known for its widespread industrial use. Exposure to ammonia can cause environmental damage, human health hazards, and huge economic losses. Therefore, ammonia gas sensors are essential for detecting ammonia leaks to avoid serious accidental injury and death. In this study, we synthesize a nanostructured (WO3) n-type metal oxide semiconductor doped with a rare earth element-transition metal (Ce-Cu) via hydrothermal method for ammonia gas sensing application. Structural analysis was performed using XRD and FESEM. Further we investigate the optical properties via UV-visible spectroscopy, FTIR, and PL. We found that doping of (Ce-Cu) led to significant improvement in thermal stability for ammonia detection and selectivity performance compared to that pure one, across a wide frequency range. We believe that these studies will pave the way for exploring the use of Ce-Cu to improve the gas sensing properties of semiconductor-based gas sensors.
△ Less
Submitted 28 March, 2025;
originally announced March 2025.
-
Understanding and Improving Information Preservation in Prompt Compression for LLMs
Authors:
Weronika Łajewska,
Momchil Hardalov,
Laura Aina,
Neha Anna John,
Hang Su,
Lluís Màrquez
Abstract:
Recent advancements in large language models (LLMs) have enabled their successful application to a broad range of tasks. However, in information-intensive tasks, the prompt length can grow fast, leading to increased computational requirements, performance degradation, and induced biases from irrelevant or redundant information. Recently, various prompt compression techniques have been introduced t…
▽ More
Recent advancements in large language models (LLMs) have enabled their successful application to a broad range of tasks. However, in information-intensive tasks, the prompt length can grow fast, leading to increased computational requirements, performance degradation, and induced biases from irrelevant or redundant information. Recently, various prompt compression techniques have been introduced to optimize the trade-off between reducing input length and retaining performance. We propose a holistic evaluation framework that allows for in-depth analysis of prompt compression methods. We focus on three key aspects, besides compression ratio: (i) downstream task performance, (ii) grounding in the input context, and (iii) information preservation. Through this framework, we investigate state-of-the-art soft and hard compression methods, showing that they struggle to preserve key details from the original prompt, limiting their performance on complex tasks. We demonstrate that modifying soft prompting methods to control better the granularity of the compressed information can significantly improve their effectiveness -- up to +23\% in downstream task performance, more than +8 BERTScore points in grounding, and 2.7x more entities preserved in compression.
△ Less
Submitted 24 March, 2025;
originally announced March 2025.
-
Range Avoidance in Boolean Circuits via Turan-type Bounds
Authors:
Neha Kuntewar,
Jayalal Sarma
Abstract:
Given a circuit $C : \{0,1\}^n \to \{0,1\}^m$ from a circuit class $F$, with $m > n$, finding a $y \in \{0,1\}^m$ such that $\forall x \in \{0,1\}^n$, $C(x) \ne y$, is the range avoidance problem (denoted by $F$-$avoid$). Deterministic polynomial time algorithms (even with access to $NP$ oracles) solving this problem is known to imply explicit constructions of various pseudorandom objects like har…
▽ More
Given a circuit $C : \{0,1\}^n \to \{0,1\}^m$ from a circuit class $F$, with $m > n$, finding a $y \in \{0,1\}^m$ such that $\forall x \in \{0,1\}^n$, $C(x) \ne y$, is the range avoidance problem (denoted by $F$-$avoid$). Deterministic polynomial time algorithms (even with access to $NP$ oracles) solving this problem is known to imply explicit constructions of various pseudorandom objects like hard Boolean functions, linear codes, PRGs etc. Deterministic polynomial time algorithms are known for $NC^0_2$-$avoid$ when $m > n$, and for $NC^0_3$-$avoid$ when $m \ge \frac{n^2}{\log n}$, where $NC^0_k$ is the class of circuits with bounded fan-in which have constant depth and the output depends on at most $k$ of the input bits. On the other hand, it is also known that $NC^0_3$-$avoid$ when $m = n+O\left(n^{2/3}\right)$ is at least as hard as explicit construction of rigid matrices.
In this paper, we propose a new approach to solving range avoidance problem via hypergraphs. We formulate the problem in terms of Turan-type problems in hypergraphs of the following kind - for a fixed $k$-uniform hypergraph $H'$, what is the maximum number of edges that can exist in a $k$-uniform hypergraph $H$ which does not have a sub-hypergraph isomorphic to $H'$? We use our approach to show (using known Turan-type bounds) that there is a constant $c$ such that $mon$-$NC^0_3$-$avoid$ can be solved in deterministic polynomial time when $m > cn^2$. To improve the stretch constraint to linear, we show a new Turan-type theorem for a hypergraph structure (which we call the the loose $chi$-cycles) and use it to show that $mon$-$NC^0_3$-$avoid$ can be solved in deterministic polynomial time when $m > n$, thus improving the known bounds of $NC^0_3$-avoid for the case of monotone circuits.
△ Less
Submitted 21 March, 2025;
originally announced March 2025.
-
The Atacama Cosmology Telescope: DR6 Constraints on Extended Cosmological Models
Authors:
Erminia Calabrese,
J. Colin Hill,
Hidde T. Jense,
Adrien La Posta,
Irene Abril-Cabezas,
Graeme E. Addison,
Peter A. R. Ade,
Simone Aiola,
Tommy Alford,
David Alonso,
Mandana Amiri,
Rui An,
Zachary Atkins,
Jason E. Austermann,
Eleonora Barbavara,
Nicola Barbieri,
Nicholas Battaglia,
Elia Stefano Battistelli,
James A. Beall,
Rachel Bean,
Ali Beheshti,
Benjamin Beringue,
Tanay Bhandarkar,
Emily Biermann,
Boris Bolliet
, et al. (147 additional authors not shown)
Abstract:
We use new cosmic microwave background (CMB) primary temperature and polarization anisotropy measurements from the Atacama Cosmology Telescope (ACT) Data Release 6 (DR6) to test foundational assumptions of the standard cosmological model and set constraints on extensions to it. We derive constraints from the ACT DR6 power spectra alone, as well as in combination with legacy data from Planck. To br…
▽ More
We use new cosmic microwave background (CMB) primary temperature and polarization anisotropy measurements from the Atacama Cosmology Telescope (ACT) Data Release 6 (DR6) to test foundational assumptions of the standard cosmological model and set constraints on extensions to it. We derive constraints from the ACT DR6 power spectra alone, as well as in combination with legacy data from Planck. To break geometric degeneracies, we include ACT and Planck CMB lensing data and baryon acoustic oscillation data from DESI Year-1, and further add supernovae measurements from Pantheon+ for models that affect the late-time expansion history. We verify the near-scale-invariance (running of the spectral index $d n_s/d\ln k = 0.0062 \pm 0.0052$) and adiabaticity of the primordial perturbations. Neutrino properties are consistent with Standard Model predictions: we find no evidence for new light, relativistic species that are free-streaming ($N_{\rm eff} = 2.86 \pm 0.13$, which combined with external BBN data becomes $N_{\rm eff} = 2.89 \pm 0.11$), for non-zero neutrino masses ($\sum m_ν< 0.082$ eV at 95% CL), or for neutrino self-interactions. We also find no evidence for self-interacting dark radiation ($N_{\rm idr} < 0.134$), early-universe variation of fundamental constants, early dark energy, primordial magnetic fields, or modified recombination. Our data are consistent with standard BBN, the FIRAS-inferred CMB temperature, a dark matter component that is collisionless and with only a small fraction allowed as axion-like particles, a cosmological constant, and the late-time growth rate predicted by general relativity. We find no statistically significant preference for a departure from the baseline $Λ$CDM model. In general, models introduced to increase the Hubble constant or to decrease the amplitude of density fluctuations inferred from the primary CMB are not favored by our data.
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
The Atacama Cosmology Telescope: DR6 Power Spectra, Likelihoods and $Λ$CDM Parameters
Authors:
Thibaut Louis,
Adrien La Posta,
Zachary Atkins,
Hidde T. Jense,
Irene Abril-Cabezas,
Graeme E. Addison,
Peter A. R. Ade,
Simone Aiola,
Tommy Alford,
David Alonso,
Mandana Amiri,
Rui An,
Jason E. Austermann,
Eleonora Barbavara,
Nicholas Battaglia,
Elia Stefano Battistelli,
James A. Beall,
Rachel Bean,
Ali Beheshti,
Benjamin Beringue,
Tanay Bhandarkar,
Emily Biermann,
Boris Bolliet,
J Richard Bond,
Erminia Calabrese
, et al. (143 additional authors not shown)
Abstract:
We present power spectra of the cosmic microwave background (CMB) anisotropy in temperature and polarization, measured from the Data Release 6 maps made from Atacama Cosmology Telescope (ACT) data. These cover 19,000 deg$^2$ of sky in bands centered at 98, 150 and 220 GHz, with white noise levels three times lower than Planck in polarization. We find that the ACT angular power spectra estimated ov…
▽ More
We present power spectra of the cosmic microwave background (CMB) anisotropy in temperature and polarization, measured from the Data Release 6 maps made from Atacama Cosmology Telescope (ACT) data. These cover 19,000 deg$^2$ of sky in bands centered at 98, 150 and 220 GHz, with white noise levels three times lower than Planck in polarization. We find that the ACT angular power spectra estimated over 10,000 deg$^2$, and measured to arcminute scales in TT, TE and EE, are well fit by the sum of CMB and foregrounds, where the CMB spectra are described by the $Λ$CDM model. Combining ACT with larger-scale Planck data, the joint P-ACT dataset provides tight limits on the ingredients, expansion rate, and initial conditions of the universe. We find similar constraining power, and consistent results, from either the Planck power spectra or from ACT combined with WMAP data, as well as from either temperature or polarization in the joint P-ACT dataset. When combined with CMB lensing from ACT and Planck, and baryon acoustic oscillation data from the Dark Energy Spectroscopic Instrument (DESI Y1), we measure a baryon density of $Ω_b h^2=0.0226\pm0.0001$, a cold dark matter density of $Ω_c h^2=0.118\pm0.001$, a Hubble constant of $H_0=68.22\pm0.36$ km/s/Mpc, a spectral index of $n_s=0.974\pm0.003$, and an amplitude of density fluctuations of $σ_8=0.813\pm0.005$. We find no evidence for excess lensing in the power spectrum, and no departure from spatial flatness. The contribution from Sunyaev-Zel'dovich (SZ) anisotropy is detected at high significance; we find evidence for a tilt with suppressed small-scale power compared to our baseline SZ template spectrum, consistent with hydrodynamical simulations with feedback.
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
The Atacama Cosmology Telescope: DR6 Maps
Authors:
Sigurd Naess,
Yilun Guan,
Adriaan J. Duivenvoorden,
Matthew Hasselfield,
Yuhan Wang,
Irene Abril-Cabezas,
Graeme E. Addison,
Peter A. R. Ade,
Simone Aiola,
Tommy Alford,
David Alonso,
Mandana Amiri,
Rui An,
Zachary Atkins,
Jason E. Austermann,
Eleonora Barbavara,
Nicholas Battaglia,
Elia Stefano Battistelli,
James A. Beall,
Rachel Bean,
Ali Beheshti,
Benjamin Beringue,
Tanay Bhandarkar,
Emily Biermann,
Boris Bolliet
, et al. (141 additional authors not shown)
Abstract:
We present Atacama Cosmology Telescope (ACT) Data Release 6 (DR6) maps of the Cosmic Microwave Background temperature and polarization anisotropy at arcminute resolution over three frequency bands centered on 98, 150 and 220 GHz. The maps are based on data collected with the AdvancedACT camera over the period 2017--2022 and cover 19,000 square degrees with a median combined depth of 10 uK arcmin.…
▽ More
We present Atacama Cosmology Telescope (ACT) Data Release 6 (DR6) maps of the Cosmic Microwave Background temperature and polarization anisotropy at arcminute resolution over three frequency bands centered on 98, 150 and 220 GHz. The maps are based on data collected with the AdvancedACT camera over the period 2017--2022 and cover 19,000 square degrees with a median combined depth of 10 uK arcmin. We describe the instrument, mapmaking and map properties and illustrate them with a number of figures and tables. The ACT DR6 maps and derived products are available on LAMBDA at https://lambda.gsfc.nasa.gov/product/act/actadv_prod_table.html. We also provide an interactive web atlas at https://phy-act1.princeton.edu/public/snaess/actpol/dr6/atlas and HiPS data sets in Aladin (e.g. https://alasky.cds.unistra.fr/ACT/DR4DR6/color_CMB).
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
Multi-Parameter Analysis of Li-ion Battery Degradation: Integrating Optical Fiber Sensing with Differential State of Health Metrics
Authors:
Idris Temitope Bello,
Hassan Raza,
Madithedu Muneeswara,
Neha Tewari,
Yin Nee Cheung,
Tobi Alabi Michael,
Ridwan Taiwo,
Fiske Lin
Abstract:
The reliability and safety of Lithium-ion batteries (LiBs) are of great concern in the energy storage industry. Nevertheless, the real-time monitoring of their degradation remains challenging due to limited quantitative metrics available during cycling. This study addresses this limitation by employing a novel approach that combines external optical fiber sensing with advanced data analysis techni…
▽ More
The reliability and safety of Lithium-ion batteries (LiBs) are of great concern in the energy storage industry. Nevertheless, the real-time monitoring of their degradation remains challenging due to limited quantitative metrics available during cycling. This study addresses this limitation by employing a novel approach that combines external optical fiber sensing with advanced data analysis techniques to comprehensively assess battery health. We engineered a non-invasive optical sensing platform using tandem pairs of polymeric and silica-based fiber Bragg grating (FBG) sensors affixed to the external surface of commercial Li-ion button cells, enabling simultaneous, real-time monitoring of device-level volume changes and thermal events over 600 cycles. Our analysis incorporated differential techniques to estimate the battery's state of health (SOH) based on capacity, strain, and temperature variations with respect to voltage. Additionally, we implemented and compared three deep learning models - Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and Artificial Neural Network (ANN) - to predict battery SOH over cycles. We were able to capture both continuous and spontaneous degradation events and provide unique insights into battery behavior across its lifecycle through differential analysis and new SOH metrics demonstrating high correlation with conventional measures. This multi-parameter approach, combining advanced sensing techniques with innovative data analysis and deep learning methods, contributes significantly to battery diagnostics, potentially improving reliability assessment, enhancing safety standards, and accelerating the development of more sustainable energy storage solutions.
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
Illuminating Youth: Decades of Mid-Infrared Variability and Color Evolution of Young Stellar Objects
Authors:
Neha S.,
Saurabh
Abstract:
The variability of Young Stellar Objects (YSOs) is a crucial tool for understanding the mechanisms driving flux changes. In this study, we present an infrared variability analysis of a large sample of over 20,000 candidate YSOs, using data from the ALLWISE and NEOWISE surveys, which span around a decade with a 6-month cadence. We applied Lomb-Scargle Periodogram (LSP) analysis and linear fitting t…
▽ More
The variability of Young Stellar Objects (YSOs) is a crucial tool for understanding the mechanisms driving flux changes. In this study, we present an infrared variability analysis of a large sample of over 20,000 candidate YSOs, using data from the ALLWISE and NEOWISE surveys, which span around a decade with a 6-month cadence. We applied Lomb-Scargle Periodogram (LSP) analysis and linear fitting to the light curves, classifying them into distinct categories: {\it Secular} ({\it Linear}, {\it Curved}, and {\it Periodic}) and {\it Stochastic} ({\it Burst}, {\it Drop}, and {\it Irregular}). Our findings show that 5,467 (26.2$\pm$0.3\%) of the sources exhibit variability, with most (19.7$\pm$0.3\%) showing {\it Irregular} variations, followed by {\it Curved} and {\it Periodic} variations. In addition, 235 sources of {\it Bursts} and 122 {\it Drop} sources were identified. Variability is more pronounced in Class I sources with a higher fraction of variables (36.3$\pm$0.6\%) compared to Class II (22.1$\pm$0.4\%) and Class III (22.5$\pm$1.0\%) sources. The color (W1 $-$ W2) versus magnitude analysis (W2) using linear fitting shows that the trend ``redder-when-brighter" (RWB) is more prevalent (85.4$\pm$0.5\%) among YSOs. In contrast, the trend ``bluer-when-brighter" (BWB) is more common in younger sources compared to more evolved ones, having a BWB fraction of 29.0$\pm$1.1\% for Class I to 4.0$\pm$0.9\% for Class III.
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
Numerical Investigations of Electron Dynamics in a Linear Paul Trap
Authors:
Andris Huang,
Edith Hausten,
Qian Yu,
Kento Taniguchi,
Neha Yadav,
Isabel Sacksteder,
Atsushi Noguchi,
Ralf Schneider,
Hartmut Haeffner
Abstract:
Trapped electrons have emerged as an interesting platform for quantum information processing due to their light mass, two-level spin states, and potential for fully electronic manipulation. Previous experiments have demonstrated electron trapping in Penning traps, Paul traps, on solid neon, and superfluid films. In this work, we consider electrons confined in Paul traps, with their spin states as…
▽ More
Trapped electrons have emerged as an interesting platform for quantum information processing due to their light mass, two-level spin states, and potential for fully electronic manipulation. Previous experiments have demonstrated electron trapping in Penning traps, Paul traps, on solid neon, and superfluid films. In this work, we consider electrons confined in Paul traps, with their spin states as the qubits. For this approach, if the two electrons are trapped in the same potential well, they must form Wigner crystals and remain stable under a static magnetic field to enable two-qubit gates, achievable only within certain trapping parameters. To identify feasible operating conditions, we performed numerical simulations of electron dynamics in linear Paul traps, finding the threshold temperatures required to form two-electron Wigner crystals and studying how the thresholds scale with trap frequencies. In addition, we numerically verified the cooling methods required to reach the crystallization thresholds. Lastly, we examined the stability of electrons under various magnetic field strengths and identified stable regions of trap operation.
△ Less
Submitted 16 March, 2025;
originally announced March 2025.
-
Understanding Common Ground Misalignment in Goal-Oriented Dialog: A Case-Study with Ubuntu Chat Logs
Authors:
Rupak Sarkar,
Neha Srikanth,
Taylor Hudson,
Rachel Rudinger,
Claire Bonial,
Philip Resnik
Abstract:
While it is commonly accepted that maintaining common ground plays a role in conversational success, little prior research exists connecting conversational grounding to success in task-oriented conversations. We study failures of grounding in the Ubuntu IRC dataset, where participants use text-only communication to resolve technical issues. We find that disruptions in conversational flow often ste…
▽ More
While it is commonly accepted that maintaining common ground plays a role in conversational success, little prior research exists connecting conversational grounding to success in task-oriented conversations. We study failures of grounding in the Ubuntu IRC dataset, where participants use text-only communication to resolve technical issues. We find that disruptions in conversational flow often stem from a misalignment in common ground, driven by a divergence in beliefs and assumptions held by participants. These disruptions, which we call conversational friction, significantly correlate with task success. We find that although LLMs can identify overt cases of conversational friction, they struggle with subtler and more context-dependent instances requiring pragmatic or domain-specific reasoning.
△ Less
Submitted 16 March, 2025;
originally announced March 2025.
-
A high-speed heterogeneous lithium tantalate silicon photonics platform
Authors:
Margot Niels,
Tom Vanackere,
Ewoud Vissers,
Tingting Zhai,
Patrick Nenezic,
Jakob Declercq,
Cédric Bruynsteen,
Shengpu Niu,
Arno Moerman,
Olivier Caytan,
Nishant Singh,
Sam Lemey,
Xin Yin,
Sofie Janssen,
Peter Verheyen,
Neha Singh,
Dieter Bode,
Martin Davi,
Filippo Ferraro,
Philippe Absil,
Sadhishkumar Balakrishnan,
Joris Van Campenhout,
Günther Roelkens,
Bart Kuyken,
Maximilien Billet
Abstract:
The rapid expansion of cloud computing and artificial intelligence has driven the demand for faster optical components in data centres to unprecedented levels. A key advancement in this field is the integration of multiple photonic components onto a single chip, enhancing the performance of optical transceivers. Here, silicon photonics, benefiting from mature fabrication processes, has gained prom…
▽ More
The rapid expansion of cloud computing and artificial intelligence has driven the demand for faster optical components in data centres to unprecedented levels. A key advancement in this field is the integration of multiple photonic components onto a single chip, enhancing the performance of optical transceivers. Here, silicon photonics, benefiting from mature fabrication processes, has gained prominence. The platform combines modulators, switches, photodetectors and low-loss waveguides on a single chip. However, emerging standards like 1600ZR+ potentially exceed the capabilities of silicon-based modulators. To address these limitations, thin-film lithium niobate has been proposed as an alternative to silicon photonics, offering a low voltage-length product and exceptional high-speed modulation properties. More recently, the first demonstrations of thin-film lithium tantalate circuits have emerged, addressing some of the disadvantages of lithium niobate enabling a reduced bias drift and enhanced resistance to optical damage. As such, making it a promising candidate for next-generation photonic platforms. However, a persistent drawback of such platforms is the lithium contamination, which complicates integration with CMOS fabrication processes. Here, we present for the first time the integration of lithium tantalate onto a silicon photonics chip. This integration is achieved without modifying the standard silicon photonics process design kit. Our device achieves low half-wave voltage (3.5 V), low insertion loss (2.9 dB) and high-speed operation (> 70 GHz), paving the way for next-gen applications. By minimising lithium tantalate material use, our approach reduces costs while leveraging existing silicon photonics technology advancements, in particular supporting ultra-fast monolithic germanium photodetectors and established process design kits.
△ Less
Submitted 14 March, 2025; v1 submitted 13 March, 2025;
originally announced March 2025.
-
A scalable quadratic nonlinear silicon photonics platform with printable entangled photon-pair sources
Authors:
Tom Vandekerckhove,
Jasper De Witte,
Lisa De Jaeger,
Ewoud Vissers,
Sofie Janssen,
Peter Verheyen,
Neha Singh,
Dieter Bode,
Martin Davi,
Filippo Ferraro,
Philippe Absil,
Sadhishkumar Balakrishnan,
Joris Van Campenhout,
Dries Van Thourhout,
Günther Roelkens,
Stéphane Clemmen,
Bart Kuyken
Abstract:
The integration of second-order optical nonlinearities into scalable photonic platforms remains a key challenge due to their large sensitivity to fabrication variations. Here, we present a scalable quadratic nonlinear platform that harnesses the maturity and scalability of existing CMOS processes by heterogeneously integrating periodically poled lithium niobate (PPLN) onto a silicon photonics plat…
▽ More
The integration of second-order optical nonlinearities into scalable photonic platforms remains a key challenge due to their large sensitivity to fabrication variations. Here, we present a scalable quadratic nonlinear platform that harnesses the maturity and scalability of existing CMOS processes by heterogeneously integrating periodically poled lithium niobate (PPLN) onto a silicon photonics platform. A generic PPLN design enables frequency conversion on two distinct waveguide geometries with efficiencies comparable to LNOI rib waveguides. We achieve reproducible phase-matching across the full radius of a commercial 200 mm silicon photonics wafer, leveraging superior CMOS fabrication tolerances. Furthermore, we introduce a tuning mechanism for both blue- and red-shifting of the operating wavelength, fully compensating fabrication-induced offsets. This enables deterministic phase-matching over an entire wafer and yields a strategy for wafer-scale phase-matched quadratic nonlinearities. Finally, we realize printable photon-pair sources via spontaneous parametric down-conversion, highlighting the platform's potential for large-scale quantum optical circuits. These results pave the way for wafer-scale integration of second-order optical nonlinearities in large photonic systems.
△ Less
Submitted 11 March, 2025;
originally announced March 2025.
-
Highly Entangled Magnetodielectric and Magnetostriction effects, and Spin-Phonon coupling in the Antiferromagnetic Ni$_2$ScSbO$_6$
Authors:
Neha Patel,
Arkadeb Pal,
C. W. Wang,
G. R. Blake,
J. Khatua,
T. W. Yen,
Susaiammal Arokiasamy,
H. S. Kunwar,
Y. C. Lai,
Y. C. Chuang,
V. Sathe,
Kwang-Yong Choi,
H. D. Yang,
Sandip Chatterjee
Abstract:
Magnetic systems with noncentrosymmetric crystal structures are renowned for their complex magnetic ordering and diverse and fascinating physical properties. In this report, we provide a comprehensive study of the chiral magnetic system Ni$_2$ScSbO$_6$, which exhibits a robust incommensurate long-range antiferromagnetic spin ordering at a temperature of $T_N = 62$~K, as revealed by bulk magnetizat…
▽ More
Magnetic systems with noncentrosymmetric crystal structures are renowned for their complex magnetic ordering and diverse and fascinating physical properties. In this report, we provide a comprehensive study of the chiral magnetic system Ni$_2$ScSbO$_6$, which exhibits a robust incommensurate long-range antiferromagnetic spin ordering at a temperature of $T_N = 62$~K, as revealed by bulk magnetization, specific heat, and neutron diffraction studies. This magnetic ordering triggers a series of intriguing phenomena, including prominent magnetodielectric coupling manifested by a dielectric peak at $T_N$, significant spin-phonon coupling resulting in strong phonon renormalization characterized by anomalous softening of various Raman modes, and a remarkable volume magnetostriction effect probed by high-resolution synchrotron X-ray diffraction. These phenomena are intricately interlinked, positioning the present system as a rare and interesting material.
△ Less
Submitted 7 March, 2025;
originally announced March 2025.
-
Byzantine Distributed Function Computation
Authors:
Hari Krishnan P. Anilkumar,
Neha Sangwan,
Varun Narayanan,
Vinod M. Prabhakaran
Abstract:
We study the distributed function computation problem with $k$ users of which at most $s$ may be controlled by an adversary and characterize the set of functions of the sources the decoder can reconstruct robustly in the following sense -- if the users behave honestly, the function is recovered with high probability (w.h.p.); if they behave adversarially, w.h.p, either one of the adversarial users…
▽ More
We study the distributed function computation problem with $k$ users of which at most $s$ may be controlled by an adversary and characterize the set of functions of the sources the decoder can reconstruct robustly in the following sense -- if the users behave honestly, the function is recovered with high probability (w.h.p.); if they behave adversarially, w.h.p, either one of the adversarial users will be identified or the function is recovered with vanishingly small distortion.
△ Less
Submitted 10 March, 2025; v1 submitted 3 March, 2025;
originally announced March 2025.
-
Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh
Authors:
Fajri Koto,
Rituraj Joshi,
Nurdaulet Mukhituly,
Yuxia Wang,
Zhuohan Xie,
Rahul Pal,
Daniil Orel,
Parvez Mullah,
Diana Turmakhan,
Maiya Goloburda,
Mohammed Kamran,
Samujjwal Ghosh,
Bokang Jia,
Jonibek Mansurov,
Mukhammed Togmanov,
Debopriyo Banerjee,
Nurkhan Laiyk,
Akhmed Sakip,
Xudong Han,
Ekaterina Kochmar,
Alham Fikri Aji,
Aaryamonvikram Singh,
Alok Anil Jadhav,
Satheesh Katipomu,
Samta Kamboj
, et al. (10 additional authors not shown)
Abstract:
Llama-3.1-Sherkala-8B-Chat, or Sherkala-Chat (8B) for short, is a state-of-the-art instruction-tuned open generative large language model (LLM) designed for Kazakh. Sherkala-Chat (8B) aims to enhance the inclusivity of LLM advancements for Kazakh speakers. Adapted from the LLaMA-3.1-8B model, Sherkala-Chat (8B) is trained on 45.3B tokens across Kazakh, English, Russian, and Turkish. With 8 billion…
▽ More
Llama-3.1-Sherkala-8B-Chat, or Sherkala-Chat (8B) for short, is a state-of-the-art instruction-tuned open generative large language model (LLM) designed for Kazakh. Sherkala-Chat (8B) aims to enhance the inclusivity of LLM advancements for Kazakh speakers. Adapted from the LLaMA-3.1-8B model, Sherkala-Chat (8B) is trained on 45.3B tokens across Kazakh, English, Russian, and Turkish. With 8 billion parameters, it demonstrates strong knowledge and reasoning abilities in Kazakh, significantly outperforming existing open Kazakh and multilingual models of similar scale while achieving competitive performance in English. We release Sherkala-Chat (8B) as an open-weight instruction-tuned model and provide a detailed overview of its training, fine-tuning, safety alignment, and evaluation, aiming to advance research and support diverse real-world applications.
△ Less
Submitted 3 March, 2025;
originally announced March 2025.
-
Improving Simulation-Based Origin-Destination Demand Calibration Using Sample Segment Counts Data
Authors:
Arwa Alanqary,
Chao Zhang,
Yechen Li,
Neha Arora,
Carolina Osorio
Abstract:
This paper introduces a novel approach to demand estimation that utilizes partial observations of segment-level track counts. Building on established simulation-based demand estimation methods, we present a modified formulation that integrates sample track counts as a regularization term. This approach effectively addresses the underdetermination challenge in demand estimation, moving beyond the c…
▽ More
This paper introduces a novel approach to demand estimation that utilizes partial observations of segment-level track counts. Building on established simulation-based demand estimation methods, we present a modified formulation that integrates sample track counts as a regularization term. This approach effectively addresses the underdetermination challenge in demand estimation, moving beyond the conventional reliance on a prior OD matrix. The proposed formulation aims to preserve the distribution of the observed track counts while optimizing the demand to align with observed path-level travel times. We tested this approach on Seattle's highway network with various congestion levels. Our findings reveal significant enhancements in the solution quality, particularly in accurately recovering ground truth demand patterns at both the OD and segment levels.
△ Less
Submitted 26 February, 2025;
originally announced February 2025.
-
KAN-powered large-target detection for automotive radar
Authors:
Vinay Kulkarni,
V. V. Reddy,
Neha Maheshwari
Abstract:
This paper presents a novel radar signal detection pipeline focused on detecting large targets such as cars and SUVs. Traditional methods, such as Ordered-Statistic Constant False Alarm Rate (OS-CFAR), commonly used in automotive radar, are designed for point or isotropic target models. These may not adequately capture the Range-Doppler (RD) scattering patterns of larger targets, especially in hig…
▽ More
This paper presents a novel radar signal detection pipeline focused on detecting large targets such as cars and SUVs. Traditional methods, such as Ordered-Statistic Constant False Alarm Rate (OS-CFAR), commonly used in automotive radar, are designed for point or isotropic target models. These may not adequately capture the Range-Doppler (RD) scattering patterns of larger targets, especially in high-resolution radar systems. Additional modules such as association and tracking are necessary to refine and consolidate the detections over multiple dwells. To address these limitations, we propose a detection technique based on the probability density function (pdf) of RD segments, leveraging the Kolmogorov-Arnold neural network (KAN) to learn the data and generate interpretable symbolic expressions for binary hypotheses. Beside the Monte-Carlo study showing better performance for the proposed KAN expression over OS-CFAR, it is shown to exhibit a probability of detection (PD) of 96% when transfer learned with field data. The false alarm rate (PFA) is comparable with OS-CFAR designed with PFA = $10^{-6}$. Additionally, the study also examines impact of the number of pdf bins representing RD segment on performance of the KAN-based detection.
△ Less
Submitted 13 March, 2025; v1 submitted 26 February, 2025;
originally announced February 2025.
-
Geometrical subordinated Poisson processes and its extensions
Authors:
Neha Gupta,
Aditya Maheshwari,
Dheeraj Goyal
Abstract:
In this paper, we study a generalized version of the Poisson-type process by time-changing it with the geometric counting process. Our work generalizes the work done by Meoli (2023) \cite{meoli2023some}. We defined the geometric subordinated Poisson process (GSPP), the geometric subordinated compound Poisson process (GSCPP) and the geometric subordinated multiplicative Poisson process (GSMPP) by t…
▽ More
In this paper, we study a generalized version of the Poisson-type process by time-changing it with the geometric counting process. Our work generalizes the work done by Meoli (2023) \cite{meoli2023some}. We defined the geometric subordinated Poisson process (GSPP), the geometric subordinated compound Poisson process (GSCPP) and the geometric subordinated multiplicative Poisson process (GSMPP) by time-changing the subordinated Poisson process, subordinated compound Poisson process and subordinated multiplicative Poisson process with the geometric counting process, respectively. We derived several distributional properties and many special cases from the above-mentioned processes. We calculate the asymptotic behavior of the correlation structure. We have discussed applications of time-changed generalized compound Poisson in shock modelling.
△ Less
Submitted 26 February, 2025;
originally announced February 2025.
-
Exact Recovery of Sparse Binary Vectors from Generalized Linear Measurements
Authors:
Arya Mazumdar,
Neha Sangwan
Abstract:
We consider the problem of exact recovery of a $k$-sparse binary vector from generalized linear measurements (such as logistic regression). We analyze the linear estimation algorithm (Plan, Vershynin, Yudovina, 2017), and also show information theoretic lower bounds on the number of required measurements. As a consequence of our results, for noisy one bit quantized linear measurements (…
▽ More
We consider the problem of exact recovery of a $k$-sparse binary vector from generalized linear measurements (such as logistic regression). We analyze the linear estimation algorithm (Plan, Vershynin, Yudovina, 2017), and also show information theoretic lower bounds on the number of required measurements. As a consequence of our results, for noisy one bit quantized linear measurements ($\mathsf{1bCSbinary}$), we obtain a sample complexity of $O((k+σ^2)\log{n})$, where $σ^2$ is the noise variance. This is shown to be optimal due to the information theoretic lower bound. We also obtain tight sample complexity characterization for logistic regression.
Since $\mathsf{1bCSbinary}$ is a strictly harder problem than noisy linear measurements ($\mathsf{SparseLinearReg}$) because of added quantization, the same sample complexity is achievable for $\mathsf{SparseLinearReg}$. While this sample complexity can be obtained via the popular lasso algorithm, linear estimation is computationally more efficient. Our lower bound holds for any set of measurements for $\mathsf{SparseLinearReg}$, (similar bound was known for Gaussian measurement matrices) and is closely matched by the maximum-likelihood upper bound. For $\mathsf{SparseLinearReg}$, it was conjectured in Gamarnik and Zadik, 2017 that there is a statistical-computational gap and the number of measurements should be at least $(2k+σ^2)\log{n}$ for efficient algorithms to exist. It is worth noting that our results imply that there is no such statistical-computational gap for $\mathsf{1bCSbinary}$ and logistic regression.
△ Less
Submitted 21 February, 2025;
originally announced February 2025.
-
"Who Has the Time?": Understanding Receptivity to Health Chatbots among Underserved Women in India
Authors:
Manvi S,
Roshini Deva,
Neha Madhiwalla,
Azra Ismail
Abstract:
Access to health information and services among women continues to be a major challenge in many communities globally. In recent years, there has been a growing interest in the potential of chatbots to address this information and access gap. We conducted interviews and focus group discussions with underserved women in urban India to understand their receptivity towards the use of chatbots for mate…
▽ More
Access to health information and services among women continues to be a major challenge in many communities globally. In recent years, there has been a growing interest in the potential of chatbots to address this information and access gap. We conducted interviews and focus group discussions with underserved women in urban India to understand their receptivity towards the use of chatbots for maternal and child health, as well as barriers to their adoption. Our findings uncover gaps in digital access and literacies, and perceived conflict with various responsibilities that women are burdened with, which shape their interactions with digital technology. Our paper offers insights into the design of chatbots for community health that can meet the lived realities of women in underserved settings.
△ Less
Submitted 21 February, 2025;
originally announced February 2025.
-
Predicting Fetal Birthweight from High Dimensional Data using Advanced Machine Learning
Authors:
Nachiket Kapure,
Harsh Joshi,
Rajeshwari Mistri,
Parul Kumari,
Manasi Mali,
Seema Purohit,
Neha Sharma,
Mrityunjoy Panday,
Chittaranjan S. Yajnik
Abstract:
Birth weight serves as a fundamental indicator of neonatal health, closely linked to both early medical interventions and long-term developmental risks. Traditional predictive models, often constrained by limited feature selection and incomplete datasets, struggle to achieve overlooking complex maternal and fetal interactions in diverse clinical settings. This research explores machine learning to…
▽ More
Birth weight serves as a fundamental indicator of neonatal health, closely linked to both early medical interventions and long-term developmental risks. Traditional predictive models, often constrained by limited feature selection and incomplete datasets, struggle to achieve overlooking complex maternal and fetal interactions in diverse clinical settings. This research explores machine learning to address these limitations, utilizing a structured methodology that integrates advanced imputation strategies, supervised feature selection techniques, and predictive modeling. Given the constraints of the dataset, the research strengthens the role of data preprocessing in improving the model performance. Among the various methodologies explored, tree-based feature selection methods demonstrated superior capability in identifying the most relevant predictors, while ensemble-based regression models proved highly effective in capturing non-linear relationships and complex maternal-fetal interactions within the data. Beyond model performance, the study highlights the clinical significance of key physiological determinants, offering insights into maternal and fetal health factors that influence birth weight, offering insights that extend over statistical modeling. By bridging computational intelligence with perinatal research, this work underscores the transformative role of machine learning in enhancing predictive accuracy, refining risk assessment and informing data-driven decision-making in maternal and neonatal care. Keywords: Birth weight prediction, maternal-fetal health, MICE, BART, Gradient Boosting, neonatal outcomes, Clinipredictive.
△ Less
Submitted 8 April, 2025; v1 submitted 20 February, 2025;
originally announced February 2025.
-
NLI under the Microscope: What Atomic Hypothesis Decomposition Reveals
Authors:
Neha Srikanth,
Rachel Rudinger
Abstract:
Decomposition of text into atomic propositions is a flexible framework allowing for the closer inspection of input and output text. We use atomic decomposition of hypotheses in two natural language reasoning tasks, traditional NLI and defeasible NLI, to form atomic sub-problems, or granular inferences that models must weigh when solving the overall problem. These atomic sub-problems serve as a too…
▽ More
Decomposition of text into atomic propositions is a flexible framework allowing for the closer inspection of input and output text. We use atomic decomposition of hypotheses in two natural language reasoning tasks, traditional NLI and defeasible NLI, to form atomic sub-problems, or granular inferences that models must weigh when solving the overall problem. These atomic sub-problems serve as a tool to further understand the structure of both NLI and defeasible reasoning, probe a model's consistency and understanding of different inferences, and measure the diversity of examples in benchmark datasets. Our results indicate that LLMs still struggle with logical consistency on atomic NLI and defeasible NLI sub-problems. Lastly, we identify critical atomic sub-problems of defeasible NLI examples, or those that most contribute to the overall label, and propose a method to measure the inferential consistency of a model, a metric designed to capture the degree to which a model makes consistently correct or incorrect predictions about the same fact under different contexts.
△ Less
Submitted 7 March, 2025; v1 submitted 11 February, 2025;
originally announced February 2025.
-
Robotouille: An Asynchronous Planning Benchmark for LLM Agents
Authors:
Gonzalo Gonzalez-Pumariega,
Leong Su Yean,
Neha Sunkara,
Sanjiban Choudhury
Abstract:
Effective asynchronous planning, or the ability to efficiently reason and plan over states and actions that must happen in parallel or sequentially, is essential for agents that must account for time delays, reason over diverse long-horizon tasks, and collaborate with other agents. While large language model (LLM) agents show promise in high-level task planning, current benchmarks focus primarily…
▽ More
Effective asynchronous planning, or the ability to efficiently reason and plan over states and actions that must happen in parallel or sequentially, is essential for agents that must account for time delays, reason over diverse long-horizon tasks, and collaborate with other agents. While large language model (LLM) agents show promise in high-level task planning, current benchmarks focus primarily on short-horizon tasks and do not evaluate such asynchronous planning capabilities. We introduce Robotouille, a challenging benchmark environment designed to test LLM agents' ability to handle long-horizon asynchronous scenarios. Our synchronous and asynchronous datasets capture increasingly complex planning challenges that go beyond existing benchmarks, requiring agents to manage overlapping tasks and interruptions. Our results show that ReAct (gpt4-o) achieves 47% on synchronous tasks but only 11% on asynchronous tasks, highlighting significant room for improvement. We further analyze failure modes, demonstrating the need for LLM agents to better incorporate long-horizon feedback and self-audit their reasoning during task execution. Code is available at https://github.com/portal-cornell/robotouille.
△ Less
Submitted 6 February, 2025;
originally announced February 2025.
-
Dynamics of monitored SSH Model in Krylov Space: From Complexity to Quantum Fisher Information
Authors:
Nilachal Chakrabarti,
Neha Nirbhan,
Arpan Bhattacharyya
Abstract:
In this paper, we investigate the dynamics of a non-Hermitian SSH model that arises out of the no-click limit of a monitored SSH model in the Krylov space. We find that the saturation timescale of the complexity associated with the spread of the state in the Krylov subspace increases with the measurement rate, and late time behaviour differs across the $\mathrm{PT}$ symmetry transition point. Furt…
▽ More
In this paper, we investigate the dynamics of a non-Hermitian SSH model that arises out of the no-click limit of a monitored SSH model in the Krylov space. We find that the saturation timescale of the complexity associated with the spread of the state in the Krylov subspace increases with the measurement rate, and late time behaviour differs across the $\mathrm{PT}$ symmetry transition point. Furthermore, extending the notion of this complexity for subsystems in Krylov space, we find that the scaling of its late time value with subsystem size shows a discontinuous jump across the $\mathrm{PT}$ transition point, indicating that it can be used as a suitable order parameter for such transition but not for the measurement-induced transition. Finally, we show that the measurement-induced transition can be detected using a generalized measure in the Krylov subspace, which contains information about the correlation landscape, such as Quantum Fisher information, which also possesses some structural similarity with the complexity functional.
△ Less
Submitted 5 February, 2025;
originally announced February 2025.
-
Split-gasket approach to the integration of electrical leads into diamond anvil cells
Authors:
Neha Kondedan,
Ulrich Häussermann,
Andreas Rydh
Abstract:
Transport and heat capacity measurements under pressure must reconcile the limited available space and complicated geometry of a high-pressure cell with the need for multiple electrical connections. One solution for diamond anvil cells is to use customized diamonds with deposited electrical leads. Here, we instead address the problem through a split-gasket approach, intended for diamond anvil cell…
▽ More
Transport and heat capacity measurements under pressure must reconcile the limited available space and complicated geometry of a high-pressure cell with the need for multiple electrical connections. One solution for diamond anvil cells is to use customized diamonds with deposited electrical leads. Here, we instead address the problem through a split-gasket approach, intended for diamond anvil cells at moderate pressures and low temperature. A key component is the use of a substrate with lithographically defined leads, which enables connections to components such as thermometer, heater, and/or sample within the confined sample volume of the cell. The design includes an elaborate BeCu gasket sandwich with a preparation method that ensures electrical contact integrity. Using this configuration, we bring 12 leads to within 100 $μ$m of the center of the diamond anvil at a pressure of about 2 GPa, comparable to the pressure reached with a regular gasket, demonstrating the setup's capability for high-pressure experiments. The split-gasket approach may come at the cost of reduced maximum pressure, but brings versatility and reproducibility, and alleviates the experimental efforts of maintaining multiple electrical leads both intact and electrically isolated.
△ Less
Submitted 3 February, 2025;
originally announced February 2025.
-
Miniaturized chip calorimeter for high-pressure cells at low temperature
Authors:
Neha Kondedan,
Andreas Rydh
Abstract:
Heat capacity measurements under high pressure places high demands on the calorimeter. Here we describe the development of a miniaturized nanocalorimeter for high-pressure heat capacity measurements at low temperature. The device, fabricated on a silicon substrate, employs a high-frequency AC calorimetry technique and features a design with an outer diameter of 300 $μ$m and thickness of 25-40 $μ$m…
▽ More
Heat capacity measurements under high pressure places high demands on the calorimeter. Here we describe the development of a miniaturized nanocalorimeter for high-pressure heat capacity measurements at low temperature. The device, fabricated on a silicon substrate, employs a high-frequency AC calorimetry technique and features a design with an outer diameter of 300 $μ$m and thickness of 25-40 $μ$m, small enough to fit into high pressure diamond anvil cells. Miniaturization is achieved by stacking all components, including thermometer and heaters, within a central area. The thin-film calorimeter thermometer measures 40 $μ$m square and maintains the sensitivity and properties of larger thermometers. The fabrication process uses controlled anisotropic etch to produce calorimeter chips with a balance between robustness and thickness, suitable for experiments at high pressures and low temperatures. The calorimeter operates at a relatively high characteristic frequency between 10 Hz and 1 kHz, constraining the thermal oscillation to an effective volume dominated by the sample, thereby avoiding the use of a suspended membrane that is the basis for conventional nanocalorimeters.
△ Less
Submitted 29 January, 2025;
originally announced January 2025.
-
Reducing Size Bias in Epidemic Network Modelling
Authors:
Neha Bansal,
Katerina Kaouri,
Thomas E. Woolley
Abstract:
Epidemiological models help policymakers mitigate disease spread by predicting transmission metrics based on disease dynamics and contact networks. Calibrating these models requires representative network sampling. We investigate the Random Walk (RW) and Metropolis-Hastings Random Walk (MHRW) algorithms for three network types: Erdős-Rényi (ER), Small-world (SW), and Scale-free (SF). Disease trans…
▽ More
Epidemiological models help policymakers mitigate disease spread by predicting transmission metrics based on disease dynamics and contact networks. Calibrating these models requires representative network sampling. We investigate the Random Walk (RW) and Metropolis-Hastings Random Walk (MHRW) algorithms for three network types: Erdős-Rényi (ER), Small-world (SW), and Scale-free (SF). Disease transmission is simulated using a stochastic susceptible-infected-recovered (SIR) framework. For ER and SW networks, RW overestimates infected individuals and secondary infections by $25\%$ due to size bias, favouring highly connected nodes. MHRW, though more computationally intensive, reduces size bias and provides more representative samples. For time-to-infection, both algorithms provide representative estimates. However, neither algorithm samples SF networks representatively, exhibiting significant variability. Furthermore, removing duplicate sample nodes reduces MHRW's accuracy across three network types. We apply both algorithms to a cattle movement network of 46,512 farms combining ER, SW, and SF features. RW overestimates infected farms by about $100\%$ and secondary infections by over $900\%$, reflecting significant size bias, while MHRW estimates align within $1\%$ of the cattle network values. RW underestimates time-to-infection by about $40\%$, while MHRW overestimates it by $10\%$. Accuracy, again, deteriorates when duplicates nodes are removed. Our findings guide algorithm selection and intervention strategies based on network structure and disease severity; RW's conservative estimates suit high-mortality, fast-spreading epidemics, while MHRW enables more precise interventions for slower epidemics.
△ Less
Submitted 10 June, 2025; v1 submitted 22 January, 2025;
originally announced January 2025.
-
Low-Loss Superconducting Resonators Fabricated from Tantalum Films Grown at Room Temperature
Authors:
Guillaume Marcaud,
David Perello,
Cliff Chen,
Esha Umbarkar,
Conan Weiland,
Jiansong Gao,
Sandra Diez,
Victor Ly,
Neha Mahuli,
Nathan D'Souza,
Yuan He,
Shahriar Aghaeimeibodi,
Rachel Resnick,
Cherno Jaye,
Abdul K. Rumaiz,
Daniel A. Fischer,
Matthew Hunt,
Oskar Painter,
Ignace Jarrige
Abstract:
The use of $α$-tantalum in superconducting circuits has enabled a considerable improvement of the coherence time of transmon qubits. The standard approach to grow $α$-tantalum thin films on silicon involves heating the substrate, which takes several hours per deposition and prevents the integration of this material with wafers containing temperature-sensitive components. We report a detailed exper…
▽ More
The use of $α$-tantalum in superconducting circuits has enabled a considerable improvement of the coherence time of transmon qubits. The standard approach to grow $α$-tantalum thin films on silicon involves heating the substrate, which takes several hours per deposition and prevents the integration of this material with wafers containing temperature-sensitive components. We report a detailed experimental study of an alternative growth method of $α$-tantalum on silicon, which is achieved at room temperature through the use of a niobium seed layer. Despite a substantially higher density of oxygen-rich grain boundaries in the films sputtered at room temperature, resonators made from these films are found to have state-of-the-art quality factors, comparable to resonators fabricated from tantalum grown at high temperature. This finding challenges previous assumptions about correlations between material properties and microwave loss of superconducting thin films, and opens a new avenue for the integration of tantalum into fabrication flows with limited thermal budget.
△ Less
Submitted 16 January, 2025;
originally announced January 2025.
-
On How Traffic Signals Impact the Fundamental Diagrams of Urban Roads
Authors:
Chao Zhang,
Yechen Li,
Neha Arora,
Carolina Osorio
Abstract:
Being widely adopted by the transportation and planning practitioners, the fundamental diagram (FD) is the primary tool used to relate the key macroscopic traffic variables of speed, flow, and density. We empirically analyze the relation between vehicular space-mean speeds and flows given different signal settings and postulate a parsimonious parametric function form of the traditional FD where it…
▽ More
Being widely adopted by the transportation and planning practitioners, the fundamental diagram (FD) is the primary tool used to relate the key macroscopic traffic variables of speed, flow, and density. We empirically analyze the relation between vehicular space-mean speeds and flows given different signal settings and postulate a parsimonious parametric function form of the traditional FD where its function parameters are explicitly modeled as a function of the signal plan factors. We validate the proposed formulation using data from signalized urban road segments in Salt Lake City, Utah, USA. The proposed formulation builds our understanding of how changes to signal settings impact the FDs, and more generally the congestion patterns, of signalized urban segments.
△ Less
Submitted 10 January, 2025;
originally announced January 2025.