Search | arXiv e-print repository

Students' Reliance on AI in Higher Education: Identifying Contributing Factors

Authors: Griffin Pitts, Neha Rani, Weedguet Mildort, Eva-Marie Cook

Abstract: The increasing availability and use of artificial intelligence (AI) tools in educational settings has raised concerns about students' overreliance on these technologies. Overreliance occurs when individuals accept incorrect AI-generated recommendations, often without critical evaluation, leading to flawed problem solutions and undermining learning outcomes. This study investigates potential factor… ▽ More The increasing availability and use of artificial intelligence (AI) tools in educational settings has raised concerns about students' overreliance on these technologies. Overreliance occurs when individuals accept incorrect AI-generated recommendations, often without critical evaluation, leading to flawed problem solutions and undermining learning outcomes. This study investigates potential factors contributing to patterns of AI reliance among undergraduate students, examining not only overreliance but also appropriate reliance (correctly accepting helpful and rejecting harmful recommendations) and underreliance (incorrectly rejecting helpful recommendations). Our approach combined pre- and post-surveys with a controlled experimental task where participants solved programming problems with an AI assistant that provided both accurate and deliberately incorrect suggestions, allowing direct observation of students' reliance patterns when faced with varying AI reliability. We find that appropriate reliance is significantly related to students' programming self-efficacy, programming literacy, and need for cognition, while showing negative correlations with post-task trust and satisfaction. Overreliance showed significant correlations with post-task trust and satisfaction with the AI assistant. Underreliance was negatively correlated with programming literacy, programming self-efficacy, and need for cognition. Overall, the findings provide insights for developing targeted interventions that promote appropriate reliance on AI tools, with implications for the integration of AI in curriculum and educational technologies. △ Less

Submitted 16 June, 2025; originally announced June 2025.

ACM Class: K.3; K.4; I.2.6

arXiv:2506.12103 [pdf, other]

The Amazon Nova Family of Models: Technical Report and Model Card

Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents and text. Amazon Nova Micro is a text-only model that delivers our lowest-latency responses at very low cost. Amazon Nova Canvas is an image generation model that creates professional grade images with rich customization controls. Amazon Nova Reel is a video generation model offering high-quality outputs, customization, and motion control. Our models were built responsibly and with a commitment to customer trust, security, and reliability. We report benchmarking results for core capabilities, agentic performance, long context, functional adaptation, runtime performance, and human evaluation. △ Less

Submitted 17 March, 2025; originally announced June 2025.

Comments: 48 pages, 10 figures

Report number: 20250317

arXiv:2506.11188 [pdf, ps, other]

Patchy Helium and Hydrogen Reionization from the Kinetic Sunyaev-Zel'dovich Effect and Galaxies

Authors: Neha Anil Kumar, Mesut Çalışkan, Selim C. Hotinli, Marc Kamionkowski, Simone Ferraro, Kendrick Smith

Abstract: Upcoming cosmic microwave background (CMB) experiments will measure temperature fluctuations on small angular scales with unprecedented precision, enabling improved measurements of the kinetic Sunyaev-Zel'dovich (kSZ) effect. This secondary anisotropy has emerged as a valuable probe of the distribution of ionized electrons in the post-recombination Universe. Although the sensitivity of the kSZ eff… ▽ More Upcoming cosmic microwave background (CMB) experiments will measure temperature fluctuations on small angular scales with unprecedented precision, enabling improved measurements of the kinetic Sunyaev-Zel'dovich (kSZ) effect. This secondary anisotropy has emerged as a valuable probe of the distribution of ionized electrons in the post-recombination Universe. Although the sensitivity of the kSZ effect has recently been utilized to study the high-redshift epoch of hydrogen (H) reionization, its redshift-integrated nature -- combined with anticipated improvements in measurement precision -- suggests that accounting for the later epoch of helium (He) reionization will become increasingly important in the near future. Joint characterization of the epochs will allow for a more coherent understanding of early-star and -quasar formation, as these sources drive the ionization of H and He in the intergalactic medium. In this paper, we extend the kSZ higher-order statistic introduced by Smith \& Ferraro (2017) to forecast the ability of upcoming CMB surveys to probe the morphology of both H and He reionization. Moreover, given that upcoming large-scale structure surveys will trace density fluctuations at redshifts overlapping with the epoch of He reionization, we propose a novel cross-correlation between the kSZ higher-order statistic and galaxy survey measurements. Using a joint information-matrix analysis of H and He reionization, we show that next-generation CMB and galaxy surveys will have sufficient statistical power to characterize the patchy morphology of H reionization and set constraints on the redshift evolution of its He counterpart. △ Less

Submitted 12 June, 2025; originally announced June 2025.

Comments: 29 pages, 12 figures. Comments are welcome

arXiv:2506.10910 [pdf, ps, other]

Magistral

Authors: Mistral-AI, :, Abhinav Rastogi, Albert Q. Jiang, Andy Lo, Gabrielle Berrada, Guillaume Lample, Jason Rute, Joep Barmentlo, Karmesh Yadav, Kartik Khandelwal, Khyathi Raghavi Chandu, Léonard Blier, Lucile Saulnier, Matthieu Dinot, Maxime Darrin, Neha Gupta, Roman Soletskyi, Sagar Vaze, Teven Le Scao, Yihan Wang, Adam Yang, Alexander H. Liu, Alexandre Sablayrolles, Amélie Héliou , et al. (76 additional authors not shown)

Abstract: We introduce Magistral, Mistral's first reasoning model and our own scalable reinforcement learning (RL) pipeline. Instead of relying on existing implementations and RL traces distilled from prior models, we follow a ground up approach, relying solely on our own models and infrastructure. Notably, we demonstrate a stack that enabled us to explore the limits of pure RL training of LLMs, present a s… ▽ More We introduce Magistral, Mistral's first reasoning model and our own scalable reinforcement learning (RL) pipeline. Instead of relying on existing implementations and RL traces distilled from prior models, we follow a ground up approach, relying solely on our own models and infrastructure. Notably, we demonstrate a stack that enabled us to explore the limits of pure RL training of LLMs, present a simple method to force the reasoning language of the model, and show that RL on text data alone maintains most of the initial checkpoint's capabilities. We find that RL on text maintains or improves multimodal understanding, instruction following and function calling. We present Magistral Medium, trained for reasoning on top of Mistral Medium 3 with RL alone, and we open-source Magistral Small (Apache 2.0) which further includes cold-start data from Magistral Medium. △ Less

Submitted 12 June, 2025; originally announced June 2025.

arXiv:2506.07230 [pdf, ps, other]

First positronium imaging using $^{44}$Sc with the J-PET scanner: a case study on the NEMA-Image Quality phantom

Authors: Manish Das, Sushil Sharma, Aleksander Bilewicz, Jarosław Choiński, Neha Chug, Catalina Curceanu, Eryk Czerwiński, Jakub Hajduga, Sharareh Jalali, Krzysztof Kacprzak, Tevfik Kaplanoglu, Łukasz Kapłon, Kamila Kasperska, Aleksander Khreptak, Grzegorz Korcyl, Tomasz Kozik, Karol Kubat, Deepak Kumar, Anoop Kunimmal Venadan, Edward Lisowski, Filip Lisowski, Justyna Medrala-Sowa, Simbarashe Moyo, Wiktor Mryka, Szymon Niedźwiecki , et al. (19 additional authors not shown)

Abstract: Positronium Lifetime Imaging (PLI), an emerging extension of conventional positron emission tomography (PET) imaging, offers a novel window for probing the submolecular properties of biological tissues by imaging the mean lifetime of the positronium atom. Currently, the method is under rapid development in terms of reconstruction and detection systems. Recently, the first in vivo PLI of the human… ▽ More Positronium Lifetime Imaging (PLI), an emerging extension of conventional positron emission tomography (PET) imaging, offers a novel window for probing the submolecular properties of biological tissues by imaging the mean lifetime of the positronium atom. Currently, the method is under rapid development in terms of reconstruction and detection systems. Recently, the first in vivo PLI of the human brain was performed using the J-PET scanner utilizing the $^{68}$Ga isotope. However, this isotope has limitations due to its comparatively low prompt gamma yields, which is crucial for positronium lifetime measurement. Among alternative radionuclides, $^{44}$Sc stands out as a promising isotope for PLI, characterized by a clinically suitable half-life (4.04 hours) emitting 1157 keV prompt gamma in 100% cases after the emission of the positron. This study reports the first experimental demonstration of PLI with $^{44}$Sc, carried out on a NEMA-Image Quality (IQ) phantom using the Modular J-PET tomograph-the first plastic scintillators-based PET scanner. △ Less

Submitted 8 June, 2025; originally announced June 2025.

arXiv:2506.05739 [pdf, ps, other]

To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt

Authors: Zhilong Wang, Neha Nagaraja, Lan Zhang, Hayretdin Bahsi, Pawan Patil, Peng Liu

Abstract: LLM agents are widely used as agents for customer support, content generation, and code assistance. However, they are vulnerable to prompt injection attacks, where adversarial inputs manipulate the model's behavior. Traditional defenses like input sanitization, guard models, and guardrails are either cumbersome or ineffective. In this paper, we propose a novel, lightweight defense mechanism called… ▽ More LLM agents are widely used as agents for customer support, content generation, and code assistance. However, they are vulnerable to prompt injection attacks, where adversarial inputs manipulate the model's behavior. Traditional defenses like input sanitization, guard models, and guardrails are either cumbersome or ineffective. In this paper, we propose a novel, lightweight defense mechanism called Polymorphic Prompt Assembling (PPA), which protects against prompt injection with near-zero overhead. The approach is based on the insight that prompt injection requires guessing and breaking the structure of the system prompt. By dynamically varying the structure of system prompts, PPA prevents attackers from predicting the prompt structure, thereby enhancing security without compromising performance. We conducted experiments to evaluate the effectiveness of PPA against existing attacks and compared it with other defense methods. △ Less

Submitted 6 June, 2025; originally announced June 2025.

Comments: To appear in the Industry Track of the 55th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2025)

arXiv:2506.00941 [pdf, ps, other]

Interpreting the chromatic polynomial coefficients via hyperplane arrangements

Authors: Neha Goregaokar

Abstract: A recent result of Lofano and Paolini expresses the characteristic polynomial of a real hyperplane arrangement in terms of a projection statistic on the regions of the arrangement. We use this result to give an alternative proof for Greene and Zaslavsky's interpretation for the coefficients of the chromatic polynomial of a graph. We also show that this projection statistic has a nice combinatorial… ▽ More A recent result of Lofano and Paolini expresses the characteristic polynomial of a real hyperplane arrangement in terms of a projection statistic on the regions of the arrangement. We use this result to give an alternative proof for Greene and Zaslavsky's interpretation for the coefficients of the chromatic polynomial of a graph. We also show that this projection statistic has a nice combinatorial interpretation in the case of the braid arrangement, which generalizes to graphical arrangements of natural unit interval graphs. We use this generalization to give a new proof of the formula for the chromatic polynomial of a natural unit interval graph. △ Less

Submitted 1 June, 2025; originally announced June 2025.

Comments: 17 pages, 6 figures

arXiv:2505.19962 [pdf, other]

Estimating the binary neutron star merger rate density evolution with Einstein Telescope

Authors: Neha Singh, Tomasz Bulik, Aleksandra Olejak

Abstract: The Einstein Telescope (ET) is a proposed third-generation, wide-band gravitational wave (GW) detector which will have an improved detection sensitivity in low frequencies, leading to a longer observation time in the detection band and higher detection rate for binary neutron stars (BNSs). Despite the fact that ET will have a higher detection rate, a large fraction of BNSs will remain undetectable… ▽ More The Einstein Telescope (ET) is a proposed third-generation, wide-band gravitational wave (GW) detector which will have an improved detection sensitivity in low frequencies, leading to a longer observation time in the detection band and higher detection rate for binary neutron stars (BNSs). Despite the fact that ET will have a higher detection rate, a large fraction of BNSs will remain undetectable. We present a scheme to estimate accurate detection efficiency and to reconstruct the true merger rate density of the population of the BNSs, as a function of redshift. We show that with ET as a single instrumnet, for a population of BNSs with $R_{mer} \sim 100 (300)$ $\rm Gpc^{-3} yr^{-1}$ at $z\sim 0(2)$, we can reconstruct the merger rate density uptil $z \sim 2$ , with a relative error of $12\%$ at ($z \sim 2$), despite the loss in detection of the bulk of the BNS population. △ Less

Submitted 26 May, 2025; originally announced May 2025.

Comments: Contribution to the 2025 Gravitation session of the 59th Rencontres de Moriond

arXiv:2505.09970 [pdf, ps, other]

Pre-Act: Multi-Step Planning and Reasoning Improves Acting in LLM Agents

Authors: Mrinal Rawat, Ambuje Gupta, Rushil Goomer, Alessandro Di Bari, Neha Gupta, Roberto Pieraccini

Abstract: The ReAct (Reasoning + Action) capability in large language models (LLMs) has become the foundation of modern agentic systems. Recent LLMs, such as DeepSeek-R1 and OpenAI o1/o3, exemplify this by emphasizing reasoning through the generation of ample intermediate tokens, which help build a strong premise before producing the final output tokens. In this paper, we introduce Pre-Act, a novel approach… ▽ More The ReAct (Reasoning + Action) capability in large language models (LLMs) has become the foundation of modern agentic systems. Recent LLMs, such as DeepSeek-R1 and OpenAI o1/o3, exemplify this by emphasizing reasoning through the generation of ample intermediate tokens, which help build a strong premise before producing the final output tokens. In this paper, we introduce Pre-Act, a novel approach that enhances the agent's performance by creating a multi-step execution plan along with the detailed reasoning for the given user input. This plan incrementally incorporates previous steps and tool outputs, refining itself after each step execution until the final response is obtained. Our approach is applicable to both conversational and non-conversational agents. To measure the performance of task-oriented agents comprehensively, we propose a two-level evaluation framework: (1) turn level and (2) end-to-end. Our turn-level evaluation, averaged across five models, shows that our approach, Pre-Act, outperforms ReAct by 70% in Action Recall on the Almita dataset. While this approach is effective for larger models, smaller models crucial for practical applications, where latency and cost are key constraints, often struggle with complex reasoning tasks required for agentic systems. To address this limitation, we fine-tune relatively small models such as Llama 3.1 (8B & 70B) using the proposed Pre-Act approach. Our experiments show that the fine-tuned 70B model outperforms GPT-4, achieving a 69.5% improvement in action accuracy (turn-level) and a 28% improvement in goal completion rate (end-to-end) on the Almita (out-of-domain) dataset. △ Less

Submitted 18 May, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

arXiv:2505.00916 [pdf, other]

Photoproduction and detection of $ρ'\rightarrowπ^+π^-π^+π^-$ decays in ultra-peripheral collisions and at an electron-ion collider

Authors: Neha Devi, Minjung Kim, Spencer R. Klein, Janet Seger

Abstract: Vector meson photoproduction is an important probe of nuclear structure. Light vector mesons are most sensitive to low$-x$ structure, as long as they are not too light for perturbative QCD calculations. The $ρ'$ is of interest as an intermediate mass state (between the $ρ$ and $J/ψ$) that is easier to detect than the $φ$. Using HERA data on proton targets, we make projections for lead/gold targe… ▽ More Vector meson photoproduction is an important probe of nuclear structure. Light vector mesons are most sensitive to low$-x$ structure, as long as they are not too light for perturbative QCD calculations. The $ρ'$ is of interest as an intermediate mass state (between the $ρ$ and $J/ψ$) that is easier to detect than the $φ$. Using HERA data on proton targets, we make projections for lead/gold targets in UPCs at the Large Hadron Collider and RHIC, and for $ep$ and $eA$ collisions at a future Electron-Ion Collider (EIC). These projections for ion targets depend on the largely-unknown $ρ'\rightarrowπ^+π^-π^+π^-$ branching ratio, and use existing data to constrain that branching ratio. Current data points to a relatively low branching ratio, less than 50\%. The HERA $ep$ and ALICE UPC $e$Pb data exhibit very similar $4π$ mass spectra, indicating that, if the system is composed of two resonances, the products of their photon couplings with their four-pion branching ratios are similar. The predicted rates are high for both UPCs and the EIC. The $ρ'\rightarrowπ^+π^-π^+π^-$ decay can be observed at the EIC with high efficiency. In $ep$ collisions at the highest energy, the forward B0 detector is needed to observe this channel down to the lowest achievable Bjorken$-x$ values. △ Less

Submitted 1 May, 2025; originally announced May 2025.

Comments: 8 pages

arXiv:2504.21187 [pdf, other]

LIFT: LLM-Based Pragma Insertion for HLS via GNN Supervised Fine-Tuning

Authors: Neha Prakriya, Zijian Ding, Yizhou Sun, Jason Cong

Abstract: FPGAs are increasingly adopted in datacenter environments for their reconfigurability and energy efficiency. High-Level Synthesis (HLS) tools have eased FPGA programming by raising the abstraction level from RTL to untimed C/C++, yet attaining high performance still demands expert knowledge and iterative manual insertion of optimization pragmas to modify the microarchitecture. To address this chal… ▽ More FPGAs are increasingly adopted in datacenter environments for their reconfigurability and energy efficiency. High-Level Synthesis (HLS) tools have eased FPGA programming by raising the abstraction level from RTL to untimed C/C++, yet attaining high performance still demands expert knowledge and iterative manual insertion of optimization pragmas to modify the microarchitecture. To address this challenge, we propose LIFT, a large language model (LLM)-based coding assistant for HLS that automatically generates performance-critical pragmas given a C/C++ design. We fine-tune the LLM by tightly integrating and supervising the training process with a graph neural network (GNN), combining the sequential modeling capabilities of LLMs with the structural and semantic understanding of GNNs necessary for reasoning over code and its control/data dependencies. On average, LIFT produces designs that improve performance by 3.52x and 2.16x than prior state-of the art AutoDSE and HARP respectively, and 66x than GPT-4o. △ Less

Submitted 29 April, 2025; originally announced April 2025.

arXiv:2504.19457 [pdf, other]

Towards Long Context Hallucination Detection

Authors: Siyi Liu, Kishaloy Halder, Zheng Qi, Wei Xiao, Nikolaos Pappas, Phu Mon Htut, Neha Anna John, Yassine Benajiba, Dan Roth

Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across various tasks. However, they are prone to contextual hallucination, generating information that is either unsubstantiated or contradictory to the given context. Although many studies have investigated contextual hallucinations in LLMs, addressing them in long-context inputs remains an open problem. In this work, we take a… ▽ More Large Language Models (LLMs) have demonstrated remarkable performance across various tasks. However, they are prone to contextual hallucination, generating information that is either unsubstantiated or contradictory to the given context. Although many studies have investigated contextual hallucinations in LLMs, addressing them in long-context inputs remains an open problem. In this work, we take an initial step toward solving this problem by constructing a dataset specifically designed for long-context hallucination detection. Furthermore, we propose a novel architecture that enables pre-trained encoder models, such as BERT, to process long contexts and effectively detect contextual hallucinations through a decomposition and aggregation mechanism. Our experimental results show that the proposed architecture significantly outperforms previous models of similar size as well as LLM-based models across various metrics, while providing substantially faster inference. △ Less

Submitted 27 April, 2025; originally announced April 2025.

arXiv:2504.18674 [pdf, ps, other]

Noncentral moderate deviations for time-changed multivariate Lévy processes with linear combinations of inverse stable subordinators

Authors: Neha Gupta, Claudio Macci

Abstract: The term noncentral moderate deviations is used in the literature to mean a class of large deviation principles that, in some sense, fills the gap between the convergence in probability to a constant (governed by a reference large deviation principle) and a weak convergence to a non-Gaussian (and non-degenerating) distribution. Some noncentral moderate deviation results in the literature concern t… ▽ More The term noncentral moderate deviations is used in the literature to mean a class of large deviation principles that, in some sense, fills the gap between the convergence in probability to a constant (governed by a reference large deviation principle) and a weak convergence to a non-Gaussian (and non-degenerating) distribution. Some noncentral moderate deviation results in the literature concern time-changed univariate Lévy processes, where the time-changes are given by inverse stable subordinators. In this paper we present analogue results for multivariate Lévy processes; in particular the random time-changes are suitable linear combinations of independent inverse stable subordinators. △ Less

Submitted 25 April, 2025; originally announced April 2025.

MSC Class: 60F10; 60F05; 60G22; 33E12

arXiv:2504.16942 [pdf, other]

S2Vec: Self-Supervised Geospatial Embeddings

Authors: Shushman Choudhury, Elad Aharoni, Chandrakumari Suvarna, Iveel Tsogsuren, Abdul Rahman Kreidieh, Chun-Ta Lu, Neha Arora

Abstract: Scalable general-purpose representations of the built environment are crucial for geospatial artificial intelligence applications. This paper introduces S2Vec, a novel self-supervised framework for learning such geospatial embeddings. S2Vec uses the S2 Geometry library to partition large areas into discrete S2 cells, rasterizes built environment feature vectors within cells as images, and applies… ▽ More Scalable general-purpose representations of the built environment are crucial for geospatial artificial intelligence applications. This paper introduces S2Vec, a novel self-supervised framework for learning such geospatial embeddings. S2Vec uses the S2 Geometry library to partition large areas into discrete S2 cells, rasterizes built environment feature vectors within cells as images, and applies masked autoencoding on these rasterized images to encode the feature vectors. This approach yields task-agnostic embeddings that capture local feature characteristics and broader spatial relationships. We evaluate S2Vec on three large-scale socioeconomic prediction tasks, showing its competitive performance against state-of-the-art image-based embeddings. We also explore the benefits of combining S2Vec embeddings with image-based embeddings downstream, showing that such multimodal fusion can often improve performance. Our results highlight how S2Vec can learn effective general-purpose geospatial representations and how it can complement other data modalities in geospatial artificial intelligence. △ Less

Submitted 10 April, 2025; originally announced April 2025.

Comments: To be submitted to ACM Transactions on Spatial Algorithms and Systems

arXiv:2504.16277 [pdf, other]

DataS^3: Dataset Subset Selection for Specialization

Authors: Neha Hulkund, Alaa Maalouf, Levi Cai, Daniel Yang, Tsun-Hsuan Wang, Abigail O'Neil, Timm Haucke, Sandeep Mukherjee, Vikram Ramaswamy, Judy Hansen Shen, Gabriel Tseng, Mike Walmsley, Daniela Rus, Ken Goldberg, Hannah Kerner, Irene Chen, Yogesh Girdhar, Sara Beery

Abstract: In many real-world machine learning (ML) applications (e.g. detecting broken bones in x-ray images, detecting species in camera traps), in practice models need to perform well on specific deployments (e.g. a specific hospital, a specific national park) rather than the domain broadly. However, deployments often have imbalanced, unique data distributions. Discrepancy between the training distributio… ▽ More In many real-world machine learning (ML) applications (e.g. detecting broken bones in x-ray images, detecting species in camera traps), in practice models need to perform well on specific deployments (e.g. a specific hospital, a specific national park) rather than the domain broadly. However, deployments often have imbalanced, unique data distributions. Discrepancy between the training distribution and the deployment distribution can lead to suboptimal performance, highlighting the need to select deployment-specialized subsets from the available training data. We formalize dataset subset selection for specialization (DS3): given a training set drawn from a general distribution and a (potentially unlabeled) query set drawn from the desired deployment-specific distribution, the goal is to select a subset of the training data that optimizes deployment performance. We introduce DataS^3; the first dataset and benchmark designed specifically for the DS3 problem. DataS^3 encompasses diverse real-world application domains, each with a set of distinct deployments to specialize in. We conduct a comprehensive study evaluating algorithms from various families--including coresets, data filtering, and data curation--on DataS^3, and find that general-distribution methods consistently fail on deployment-specific tasks. Additionally, we demonstrate the existence of manually curated (deployment-specific) expert subsets that outperform training on all available data with accuracy gains up to 51.3 percent. Our benchmark highlights the critical role of tailored dataset curation in enhancing performance and training efficiency on deployment-specific distributions, which we posit will only become more important as global, public datasets become available across domains and ML models are deployed in the real world. △ Less

Submitted 22 April, 2025; originally announced April 2025.

arXiv:2504.15290 [pdf, other]

Parental Imprints On Birth Weight: A Data-Driven Model For Neonatal Prediction In Low Resource Prenatal Care

Authors: Rajeshwari Mistri, Harsh Joshi, Nachiket Kapure, Parul Kumari, Manasi Mali, Seema Purohit, Neha Sharma, Mrityunjoy Panday, Chittaranjan S. Yajnik

Abstract: Accurate fetal birth weight prediction is a cornerstone of prenatal care, yet traditional methods often rely on imaging technologies that remain inaccessible in resource-limited settings. This study presents a novel machine learning-based framework that circumvents these conventional dependencies, using a diverse set of physiological, environmental, and parental factors to refine birth weight esti… ▽ More Accurate fetal birth weight prediction is a cornerstone of prenatal care, yet traditional methods often rely on imaging technologies that remain inaccessible in resource-limited settings. This study presents a novel machine learning-based framework that circumvents these conventional dependencies, using a diverse set of physiological, environmental, and parental factors to refine birth weight estimation. A multi-stage feature selection pipeline filters the dataset into an optimized subset, demonstrating previously underexplored yet clinically relevant predictors of fetal growth. By integrating advanced regression architectures and ensemble learning strategies, the model captures non-linear relationships often overlooked by traditional approaches, offering a predictive solution that is both interpretable and scalable. Beyond predictive accuracy, this study addresses a question: whether birth weight can be reliably estimated without conventional diagnostic tools. The findings challenge entrenched methodologies by introducing an alternative pathway that enhances accessibility without compromising clinical utility. While limitations exist, the study lays the foundation for a new era in prenatal analytics, one where data-driven inference competes with, and potentially redefines, established medical assessments. By bridging computational intelligence with obstetric science, this research establishes a framework for equitable, technology-driven advancements in maternal-fetal healthcare. △ Less

Submitted 7 April, 2025; originally announced April 2025.

arXiv:2504.12973 [pdf, other]

Input to the ESPPU 2026 update: Searching for millicharged particles with the FORMOSA experiment at the CERN LHC

Authors: Matthew Citron, Frank Golf, Kranti Gunthoti, Andrew Haas, Christopher S. Hill, Dariush Imani, Samantha Kelly, Ming Liu, Steven Lowette, Albert De Roeck, Sai Neha Santpur, Ryan Schmitz, Jacob Steenis, David Stuart, Yu-Dai Tsai, Juan Salvador Tafoya Vargas, Tiepolo Wybouw, Jaehyeok Yoo

Abstract: In this contribution, we evaluate the sensitivity for particles with charges much smaller than the electron charge with a dedicated scintillator-based detector in the far forward region at the CERN LHC, FORMOSA. This contribution will outline the scientific case for this detector, its design and potential locations, and the sensitivity that can be achieved. The ongoing efforts to prove the feasibi… ▽ More In this contribution, we evaluate the sensitivity for particles with charges much smaller than the electron charge with a dedicated scintillator-based detector in the far forward region at the CERN LHC, FORMOSA. This contribution will outline the scientific case for this detector, its design and potential locations, and the sensitivity that can be achieved. The ongoing efforts to prove the feasibility of the detector with the FORMOSA demonstrator will be discussed. Finally, possible upgrades to the detector through the use of high-performance scintillator will be discussed. △ Less

Submitted 17 April, 2025; originally announced April 2025.

Comments: Contribution prepared for the 2026 update of the European Strategy for Particle Physics, 9 pages, 6 figures

arXiv:2504.08124 [pdf, other]

Normal state and superconducting state properties of high entropy Ta0.2Nb0.2V0.2Ti0.2X0.2 (X = Zr and Hf )

Authors: Nikita Sharma, J. Link, Kuldeep Kargeti, Neha Sharma, I. Heinmaa, S. K. Panda, R. Stern, Tirthankar Chakraborty, Tanmoy Chakrabarty, Sourav Marik

Abstract: High entropy alloy superconductors represent a unique blend of advanced material systems and quantum physics, offering significant potential for advancing superconducting technologies. In this study, we report a detailed theoretical and experimental investigation of high entropy alloy superconductors Ta0.2Nb0.2V0.2Ti0.2X0.2 (X = Zr and Hf). Our study unveils that both the materials crystallize in… ▽ More High entropy alloy superconductors represent a unique blend of advanced material systems and quantum physics, offering significant potential for advancing superconducting technologies. In this study, we report a detailed theoretical and experimental investigation of high entropy alloy superconductors Ta0.2Nb0.2V0.2Ti0.2X0.2 (X = Zr and Hf). Our study unveils that both the materials crystallize in a body-centered cubic structure (space group: I m -3 m) and exhibit bulk superconductivity with a superconducting onset temperature of (Tonset C ) of 5 K for X = Hf and 6.19 K for X = Zr sample. Our detailed analysis, including magnetization, resistivity, heat capacity measurements, and density functional theory (DFT) calculations indicates moderately coupled isotropic s-wave superconductivity in these materials. Our DFT results find significant spectral weight at the Fermi energy and phonon spectra is free of imaginary modes, confirming the dynamical stability and metallic nature of these alloys. Remarkably, we have observed a high upper critical field (HC2(0)) surpassing the Pauli paramagnetic limit for the X = Hf sample and explained it on the basis of the increased spin-orbit coupling in the structure. Ta0.2Nb0.2V0.2Ti0.2Zr0.2, on the other hand, shows a conventional HC2 behaviour. With the dynamical stability of these alloys, excellent normal state metallic nature, high micro-hardness, and high upper critical field, these samples emerge as potential candidates for future applications in superconducting devices. △ Less

Submitted 10 April, 2025; originally announced April 2025.

arXiv:2504.06011 [pdf, other]

Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi

Authors: Monojit Choudhury, Shivam Chauhan, Rocktim Jyoti Das, Dhruv Sahnan, Xudong Han, Haonan Li, Aaryamonvikram Singh, Alok Anil Jadhav, Utkarsh Agarwal, Mukund Choudhary, Debopriyo Banerjee, Fajri Koto, Junaid Bhat, Awantika Shukla, Samujjwal Ghosh, Samta Kamboj, Onkar Pandit, Lalit Pradhan, Rahul Pal, Sunil Sahu, Soundar Doraiswamy, Parvez Mullah, Ali El Filali, Neha Sengupta, Gokul Ramakrishnan , et al. (5 additional authors not shown)

Abstract: Developing high-quality large language models (LLMs) for moderately resourced languages presents unique challenges in data availability, model adaptation, and evaluation. We introduce Llama-3-Nanda-10B-Chat, or Nanda for short, a state-of-the-art Hindi-centric instruction-tuned generative LLM, designed to push the boundaries of open-source Hindi language models. Built upon Llama-3-8B, Nanda incorp… ▽ More Developing high-quality large language models (LLMs) for moderately resourced languages presents unique challenges in data availability, model adaptation, and evaluation. We introduce Llama-3-Nanda-10B-Chat, or Nanda for short, a state-of-the-art Hindi-centric instruction-tuned generative LLM, designed to push the boundaries of open-source Hindi language models. Built upon Llama-3-8B, Nanda incorporates continuous pre-training with expanded transformer blocks, leveraging the Llama Pro methodology. A key challenge was the limited availability of high-quality Hindi text data; we addressed this through rigorous data curation, augmentation, and strategic bilingual training, balancing Hindi and English corpora to optimize cross-linguistic knowledge transfer. With 10 billion parameters, Nanda stands among the top-performing open-source Hindi and multilingual models of similar scale, demonstrating significant advantages over many existing models. We provide an in-depth discussion of training strategies, fine-tuning techniques, safety alignment, and evaluation metrics, demonstrating how these approaches enabled Nanda to achieve state-of-the-art results. By open-sourcing Nanda, we aim to advance research in Hindi LLMs and support a wide range of real-world applications across academia, industry, and public services. △ Less

Submitted 8 April, 2025; originally announced April 2025.

arXiv:2504.03225 [pdf, ps, other]

Triple differential cross-section for Laser-assisted (e,2e) process on $H_2O$ molecule by Plane and Twisted electrons Impact

Authors: Neha, Rakesh Choubisa

Abstract: Twisted electron-impact single ionization ((e,2e) process) of water molecule is theoretically studied in the presence of an external laser field. Calculations have been performed in the framework of the first Born approximation in coplanar asymmetric geometry. The wave functions for the fast (incident/scattered) electron and the ejected electron are described by the Volkov and Coulomb Volkov wave… ▽ More Twisted electron-impact single ionization ((e,2e) process) of water molecule is theoretically studied in the presence of an external laser field. Calculations have been performed in the framework of the first Born approximation in coplanar asymmetric geometry. The wave functions for the fast (incident/scattered) electron and the ejected electron are described by the Volkov and Coulomb Volkov wave functions respectively which take into account the interaction of the laser field with incident/scattered and ejected electrons respectively. We calculate the triple differential cross section (TDCS) corresponding to the laser-assisted (e,2e) process for different orientations of the linearly polarised laser field. We describe the molecular state of $H_{2}O$ by the linear combination of atomic orbitals (self-consistent field LCAO method). The angular profile of the TDCS for plane wave electrons is strongly modified by the laser field compared to that of without laser field. However, for the twisted electrons with laser field the angular distribution is slightly changed from those of without laser field. We study the angular distribution of TDCS for laser field polarization parallel to the incident momentum ($\varepsilon_0$ $\parallel$ $k_i$), parallel to momentum transfer ($\varepsilon_0$ $\parallel$ $Δ$) and perpendicular to the momentum transfer ($\varepsilon_0$ $\perp$ $Δ$) and we observe that the $\varepsilon_0$ $\parallel$ $Δ$ has the highest magnitude of TDCS. For the orientations $\varepsilon_0$ $\parallel$ $k_i$ and $\varepsilon_0$ $\perp$ $Δ$, the laser-assisted plane wave studies shows oscillatory nature of TDCS but for the orientation $\varepsilon_0$ $\parallel$ $Δ$ we observe only recoil peak for $p-like$ character orbitals and dual peak, a recoil and binary peaks for the $s-like$ character orbital. △ Less

Submitted 26 May, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

Comments: 15 pages. arXiv admin note: text overlap with arXiv:2308.13845

arXiv:2504.02823 [pdf, other]

STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection

Authors: Divya Velayudhan, Abdelfatah Ahmed, Mohamad Alansari, Neha Gour, Abderaouf Behouch, Taimur Hassan, Syed Talal Wasim, Nabil Maalej, Muzammal Naseer, Juergen Gall, Mohammed Bennamoun, Ernesto Damiani, Naoufel Werghi

Abstract: Advancements in Computer-Aided Screening (CAS) systems are essential for improving the detection of security threats in X-ray baggage scans. However, current datasets are limited in representing real-world, sophisticated threats and concealment tactics, and existing approaches are constrained by a closed-set paradigm with predefined labels. To address these challenges, we introduce STCray, the fir… ▽ More Advancements in Computer-Aided Screening (CAS) systems are essential for improving the detection of security threats in X-ray baggage scans. However, current datasets are limited in representing real-world, sophisticated threats and concealment tactics, and existing approaches are constrained by a closed-set paradigm with predefined labels. To address these challenges, we introduce STCray, the first multimodal X-ray baggage security dataset, comprising 46,642 image-caption paired scans across 21 threat categories, generated using an X-ray scanner for airport security. STCray is meticulously developed with our specialized protocol that ensures domain-aware, coherent captions, that lead to the multi-modal instruction following data in X-ray baggage security. This allows us to train a domain-aware visual AI assistant named STING-BEE that supports a range of vision-language tasks, including scene comprehension, referring threat localization, visual grounding, and visual question answering (VQA), establishing novel baselines for multi-modal learning in X-ray baggage security. Further, STING-BEE shows state-of-the-art generalization in cross-domain settings. Code, data, and models are available at https://divs1159.github.io/STING-BEE/. △ Less

Submitted 3 April, 2025; originally announced April 2025.

Comments: Accepted at CVPR 2025

arXiv:2503.22845 [pdf]

Nanocrystal tuned ammonia gas sensing technique via impedance spectroscopy

Authors: Neha Sharma, Debanjan Bhattacharjee, Sunita Kumari, Sandip Paul Choudhury

Abstract: Ammonia is a harmful chemical hazard known for its widespread industrial use. Exposure to ammonia can cause environmental damage, human health hazards, and huge economic losses. Therefore, ammonia gas sensors are essential for detecting ammonia leaks to avoid serious accidental injury and death. In this study, we synthesize a nanostructured (WO3) n-type metal oxide semiconductor doped with a rare… ▽ More Ammonia is a harmful chemical hazard known for its widespread industrial use. Exposure to ammonia can cause environmental damage, human health hazards, and huge economic losses. Therefore, ammonia gas sensors are essential for detecting ammonia leaks to avoid serious accidental injury and death. In this study, we synthesize a nanostructured (WO3) n-type metal oxide semiconductor doped with a rare earth element-transition metal (Ce-Cu) via hydrothermal method for ammonia gas sensing application. Structural analysis was performed using XRD and FESEM. Further we investigate the optical properties via UV-visible spectroscopy, FTIR, and PL. We found that doping of (Ce-Cu) led to significant improvement in thermal stability for ammonia detection and selectivity performance compared to that pure one, across a wide frequency range. We believe that these studies will pave the way for exploring the use of Ce-Cu to improve the gas sensing properties of semiconductor-based gas sensors. △ Less

Submitted 28 March, 2025; originally announced March 2025.

arXiv:2503.19114 [pdf, other]

Understanding and Improving Information Preservation in Prompt Compression for LLMs

Authors: Weronika Łajewska, Momchil Hardalov, Laura Aina, Neha Anna John, Hang Su, Lluís Màrquez

Abstract: Recent advancements in large language models (LLMs) have enabled their successful application to a broad range of tasks. However, in information-intensive tasks, the prompt length can grow fast, leading to increased computational requirements, performance degradation, and induced biases from irrelevant or redundant information. Recently, various prompt compression techniques have been introduced t… ▽ More Recent advancements in large language models (LLMs) have enabled their successful application to a broad range of tasks. However, in information-intensive tasks, the prompt length can grow fast, leading to increased computational requirements, performance degradation, and induced biases from irrelevant or redundant information. Recently, various prompt compression techniques have been introduced to optimize the trade-off between reducing input length and retaining performance. We propose a holistic evaluation framework that allows for in-depth analysis of prompt compression methods. We focus on three key aspects, besides compression ratio: (i) downstream task performance, (ii) grounding in the input context, and (iii) information preservation. Through this framework, we investigate state-of-the-art soft and hard compression methods, showing that they struggle to preserve key details from the original prompt, limiting their performance on complex tasks. We demonstrate that modifying soft prompting methods to control better the granularity of the compressed information can significantly improve their effectiveness -- up to +23\% in downstream task performance, more than +8 BERTScore points in grounding, and 2.7x more entities preserved in compression. △ Less

Submitted 24 March, 2025; originally announced March 2025.

Comments: 21 pages, 6 figures, 23 tables

arXiv:2503.17114 [pdf, ps, other]

Range Avoidance in Boolean Circuits via Turan-type Bounds

Authors: Neha Kuntewar, Jayalal Sarma

Abstract: Given a circuit $C : \{0,1\}^n \to \{0,1\}^m$ from a circuit class $F$, with $m > n$, finding a $y \in \{0,1\}^m$ such that $\forall x \in \{0,1\}^n$, $C(x) \ne y$, is the range avoidance problem (denoted by $F$-$avoid$). Deterministic polynomial time algorithms (even with access to $NP$ oracles) solving this problem is known to imply explicit constructions of various pseudorandom objects like har… ▽ More Given a circuit $C : \{0,1\}^n \to \{0,1\}^m$ from a circuit class $F$, with $m > n$, finding a $y \in \{0,1\}^m$ such that $\forall x \in \{0,1\}^n$, $C(x) \ne y$, is the range avoidance problem (denoted by $F$-$avoid$). Deterministic polynomial time algorithms (even with access to $NP$ oracles) solving this problem is known to imply explicit constructions of various pseudorandom objects like hard Boolean functions, linear codes, PRGs etc. Deterministic polynomial time algorithms are known for $NC^0_2$-$avoid$ when $m > n$, and for $NC^0_3$-$avoid$ when $m \ge \frac{n^2}{\log n}$, where $NC^0_k$ is the class of circuits with bounded fan-in which have constant depth and the output depends on at most $k$ of the input bits. On the other hand, it is also known that $NC^0_3$-$avoid$ when $m = n+O\left(n^{2/3}\right)$ is at least as hard as explicit construction of rigid matrices. In this paper, we propose a new approach to solving range avoidance problem via hypergraphs. We formulate the problem in terms of Turan-type problems in hypergraphs of the following kind - for a fixed $k$-uniform hypergraph $H'$, what is the maximum number of edges that can exist in a $k$-uniform hypergraph $H$ which does not have a sub-hypergraph isomorphic to $H'$? We use our approach to show (using known Turan-type bounds) that there is a constant $c$ such that $mon$-$NC^0_3$-$avoid$ can be solved in deterministic polynomial time when $m > cn^2$. To improve the stretch constraint to linear, we show a new Turan-type theorem for a hypergraph structure (which we call the the loose $chi$-cycles) and use it to show that $mon$-$NC^0_3$-$avoid$ can be solved in deterministic polynomial time when $m > n$, thus improving the known bounds of $NC^0_3$-avoid for the case of monotone circuits. △ Less

Submitted 21 March, 2025; originally announced March 2025.

Comments: 31 pages, abstract shortened to fit in arxiv requirements

arXiv:2503.14454 [pdf, other]

The Atacama Cosmology Telescope: DR6 Constraints on Extended Cosmological Models

Authors: Erminia Calabrese, J. Colin Hill, Hidde T. Jense, Adrien La Posta, Irene Abril-Cabezas, Graeme E. Addison, Peter A. R. Ade, Simone Aiola, Tommy Alford, David Alonso, Mandana Amiri, Rui An, Zachary Atkins, Jason E. Austermann, Eleonora Barbavara, Nicola Barbieri, Nicholas Battaglia, Elia Stefano Battistelli, James A. Beall, Rachel Bean, Ali Beheshti, Benjamin Beringue, Tanay Bhandarkar, Emily Biermann, Boris Bolliet , et al. (147 additional authors not shown)

Abstract: We use new cosmic microwave background (CMB) primary temperature and polarization anisotropy measurements from the Atacama Cosmology Telescope (ACT) Data Release 6 (DR6) to test foundational assumptions of the standard cosmological model and set constraints on extensions to it. We derive constraints from the ACT DR6 power spectra alone, as well as in combination with legacy data from Planck. To br… ▽ More We use new cosmic microwave background (CMB) primary temperature and polarization anisotropy measurements from the Atacama Cosmology Telescope (ACT) Data Release 6 (DR6) to test foundational assumptions of the standard cosmological model and set constraints on extensions to it. We derive constraints from the ACT DR6 power spectra alone, as well as in combination with legacy data from Planck. To break geometric degeneracies, we include ACT and Planck CMB lensing data and baryon acoustic oscillation data from DESI Year-1, and further add supernovae measurements from Pantheon+ for models that affect the late-time expansion history. We verify the near-scale-invariance (running of the spectral index $d n_s/d\ln k = 0.0062 \pm 0.0052$) and adiabaticity of the primordial perturbations. Neutrino properties are consistent with Standard Model predictions: we find no evidence for new light, relativistic species that are free-streaming ($N_{\rm eff} = 2.86 \pm 0.13$, which combined with external BBN data becomes $N_{\rm eff} = 2.89 \pm 0.11$), for non-zero neutrino masses ($\sum m_ν< 0.082$ eV at 95% CL), or for neutrino self-interactions. We also find no evidence for self-interacting dark radiation ($N_{\rm idr} < 0.134$), early-universe variation of fundamental constants, early dark energy, primordial magnetic fields, or modified recombination. Our data are consistent with standard BBN, the FIRAS-inferred CMB temperature, a dark matter component that is collisionless and with only a small fraction allowed as axion-like particles, a cosmological constant, and the late-time growth rate predicted by general relativity. We find no statistically significant preference for a departure from the baseline $Λ$CDM model. In general, models introduced to increase the Hubble constant or to decrease the amplitude of density fluctuations inferred from the primary CMB are not favored by our data. △ Less