Search | arXiv e-print repository

An Interval Hessian-based line-search method for unconstrained nonconvex optimization

Authors: Ashutosh Sharma, Gauransh Dingwani, Nikhil Gupta, Vaishnavi Gupta, Ishan Bajaj

Abstract: Second-order Newton-type algorithms that leverage the exact Hessian or its approximation are central to solving nonlinear optimization problems. These algorithms have been proven to achieve a faster convergence rate than the first-order methods and can find second-order stationary points. However, their applications in solving large-scale nonconvex problems are hindered by three primary challenges… ▽ More Second-order Newton-type algorithms that leverage the exact Hessian or its approximation are central to solving nonlinear optimization problems. These algorithms have been proven to achieve a faster convergence rate than the first-order methods and can find second-order stationary points. However, their applications in solving large-scale nonconvex problems are hindered by three primary challenges: (1) the high computational cost associated with Hessian evaluations, (2) its inversion, and (3) ensuring descent direction at points where the Hessian becomes indefinite. We propose INTHOP, an interval Hessian-based optimization algorithm for nonconvex problems. Specifically, we propose a new search direction guaranteed to be descent and requiring Hessian evaluations and inversion only at specific iterations. The proposed search direction is based on approximating the original Hessian matrix by a positive-definite matrix. We prove that the difference between the approximate and exact Hessian is bounded within an interval. Accordingly, the approximate Hessian matrix is reused if the iterates are in the interval while computing the gradients at each iteration. We develop various algorithm variants based on the interval size updating methods and minimum eigenvalue computation methods. We apply the algorithm to an extensive set of test problems and compare its performance with steepest descent, quasi-Newton, and the Newton methods. We show empirically that our method solves more problems in fewer function and gradient evaluations than steepest descent and the quasi-Newton method. Compared to the Newton method, we illustrate that for nonconvex problems, we require substantially less O(n3) operations. △ Less

Submitted 25 October, 2025; originally announced October 2025.

arXiv:2510.17995 [pdf, ps, other]

FABRIC: Framework for Agent-Based Realistic Intelligence Creation

Authors: Abhigya Verma, Seganrasan Subramanian, Nandhakumar Kandasamy, Naman Gupta

Abstract: Large language models (LLMs) are increasingly deployed as agents, expected to decompose goals, invoke tools, and verify results in dynamic environments. Realizing these capabilities requires access to agentic data-structured interaction records that couple user intents with tool specifications, argument-grounded calls, and verifiable execution traces. However, collecting such data from human annot… ▽ More Large language models (LLMs) are increasingly deployed as agents, expected to decompose goals, invoke tools, and verify results in dynamic environments. Realizing these capabilities requires access to agentic data-structured interaction records that couple user intents with tool specifications, argument-grounded calls, and verifiable execution traces. However, collecting such data from human annotators is costly, time-consuming, and difficult to scale. We present a unified framework for synthesizing agentic data using only LLMs, without any human-in-the-loop supervision. This framework decomposes generation into modular pipelines that produce complete interaction records spanning task specifications, tool definitions, policy pseudocode, natural language exchanges, and execution traces. Records conform to strict syntactic and semantic constraints, ensuring machine-parseability and faithful alignment across inputs, outputs, and tool calls. Beyond single tasks, there is support for both multi-task and multi-turn agent interactions, enabling the construction of datasets that reflect the full spectrum of tool-use competencies. To ensure quality and consistency, the framework integrates constrained generation formats, JSON-schema validation, and judge-based filtering. This paper formalizes the schema for agentic records, details the prompt design principles that guide generation, and introduces scalable pipelines for high-quality synthetic data. By providing a reproducible, LLM-only alternative to manual collection, hence advancing the development of agentic LLMs capable of robust tool use. △ Less

Submitted 20 October, 2025; originally announced October 2025.

Comments: 51 Pages, 38 Listings, 5 Figures

arXiv:2510.17487 [pdf, ps, other]

Directional Search for Persistent Gravitational Waves: Results from the First Part of LIGO-Virgo-KAGRA's Fourth Observing Run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu , et al. (1743 additional authors not shown)

Abstract: The angular distribution of gravitational-wave power from persistent sources may exhibit anisotropies arising from the large-scale structure of the Universe. This motivates directional searches for astrophysical and cosmological gravitational-wave backgrounds, as well as continuous-wave emitters. We present results of such a search using data from the first observing run through the first portion… ▽ More The angular distribution of gravitational-wave power from persistent sources may exhibit anisotropies arising from the large-scale structure of the Universe. This motivates directional searches for astrophysical and cosmological gravitational-wave backgrounds, as well as continuous-wave emitters. We present results of such a search using data from the first observing run through the first portion of the fourth observing run of the LIGO-Virgo-KAGRA Collaborations. We apply gravitational-wave radiometer techniques to generate skymaps and search for both narrowband and broadband persistent gravitational-wave sources. Additionally, we use spherical harmonic decomposition to probe spatially extended sources. No evidence of persistent gravitational-wave signals is found, and we set the most stringent constraints to date on such emissions. For narrowband point sources, our sensitivity estimate to effective strain amplitude lies in the range $(0.03 - 8.4) \times 10^{-24}$ across all sky and frequency range $(20 - 160)$ Hz. For targeted sources -- Scorpius X-1, SN 1987A, the Galactic Center, Terzan 5, and NGC 6397 -- we constrain the strain amplitude with best limits ranging from $\sim 1.1 \times 10^{-25}$ to $6.5 \times 10^{-24}$. For persistent broadband sources, we constrain the gravitational-wave flux $F_{α, \hat{n}}^{95\%, \mathrm{UL}}(25\, \mathrm{Hz}) < (0.008 - 5.5) \times 10^{-8}\, \mathrm{erg\, cm^{-2}\, s^{-1}\, Hz^{-1}}$, depending on the sky direction $\hat{n}$ and spectral index $α=0,\,2/3,\,3$. Finally, for extended sources, we place upper limits on the strain angular power spectrum $C_\ell^{1/2} < (0.63 - 17) \times 10^{-10} \,\mathrm{sr}^{-1}$. △ Less

Submitted 20 October, 2025; originally announced October 2025.

Comments: Main paper: 11 pages and 4 figures; Total with appendices: 39 pages and 12 figures

Report number: LIGO-P250038

arXiv:2510.16092 [pdf, ps, other]

Compressing Many-Shots in In-Context Learning

Authors: Devvrit Khatri, Pranamya Kulkarni, Nilesh Gupta, Yerram Varun, Liqian Peng, Jay Yagnik, Praneeth Netrapalli, Cho-Jui Hsieh, Alec Go, Inderjit S Dhillon, Aditya Kusupati, Prateek Jain

Abstract: Large Language Models (LLMs) have been shown to be able to learn different tasks without explicit finetuning when given many input-output examples / demonstrations through In-Context Learning (ICL). Increasing the number of examples, called ``shots'', improves downstream task performance but incurs higher memory and computational costs. In this work, we study an approach to improve the memory and… ▽ More Large Language Models (LLMs) have been shown to be able to learn different tasks without explicit finetuning when given many input-output examples / demonstrations through In-Context Learning (ICL). Increasing the number of examples, called ``shots'', improves downstream task performance but incurs higher memory and computational costs. In this work, we study an approach to improve the memory and computational efficiency of ICL inference by compressing the many-shot prompts. Given many shots comprising t tokens, our goal is to generate a m soft-token summary, where m < t. We first show that existing prompt compression methods are ineffective for many-shot compression, and simply using fewer shots as a baseline is surprisingly strong. To achieve effective compression, we find that: (a) a stronger compressor model with more trainable parameters is necessary, and (b) compressing many-shot representations at each transformer layer enables more fine-grained compression by providing each layer with its own compressed representation. Based on these insights, we propose MemCom, a layer-wise compression method. We systematically evaluate various compressor models and training approaches across different model sizes (2B and 7B), architectures (Gemma and Mistral), many-shot sequence lengths (3k-6k tokens), and compression ratios (3x to 8x). MemCom outperforms strong baselines across all compression ratios on multiple classification tasks with large label sets. Notably, while baseline performance degrades sharply at higher compression ratios, often by over 20-30%, MemCom maintains high accuracy with minimal degradation, typically dropping by less than 10%. △ Less

Submitted 17 October, 2025; originally announced October 2025.

arXiv:2510.13217 [pdf, ps, other]

LLM-guided Hierarchical Retrieval

Authors: Nilesh Gupta, Wei-Cheng Chang, Ngot Bui, Cho-Jui Hsieh, Inderjit S. Dhillon

Abstract: Modern IR systems are increasingly tasked with answering complex, multi-faceted queries that require deep reasoning rather than simple keyword or semantic matching. While LLM-based IR has shown great promise, the prevailing retrieve-then-rerank paradigm inherits the limitations of embedding-based retrieval; parametric generative approaches are difficult to update with new information; and long-con… ▽ More Modern IR systems are increasingly tasked with answering complex, multi-faceted queries that require deep reasoning rather than simple keyword or semantic matching. While LLM-based IR has shown great promise, the prevailing retrieve-then-rerank paradigm inherits the limitations of embedding-based retrieval; parametric generative approaches are difficult to update with new information; and long-context methods that place the entire corpus in context are computationally infeasible for large document collections. To address these challenges, we introduce LATTICE, a hierarchical retrieval framework that enables an LLM to reason over and navigate large corpora with logarithmic search complexity by imposing a semantic tree structure on the corpus. Our approach consists of two stages: (1) an offline phase that organizes the corpus into a semantic hierarchy via either a bottom-up agglomerative strategy or a top-down divisive strategy using multi-level summaries and (2) an online traversal phase where a search LLM navigates this tree. A central challenge in such LLM-guided search is that the model's relevance judgments are noisy, context-dependent, and unaware of the hierarchy, making cross-branch and cross-level comparisons difficult. To overcome this, we propose a traversal algorithm that estimates calibrated latent relevance scores from local LLM outputs and aggregates them into a global path relevance metric. Our training-free framework achieves state-of-the-art zero-shot performance on the reasoning-intensive BRIGHT benchmark, demonstrating up to 9% improvement in Recall@100 and 5% in nDCG@10 over the next best zero-shot baseline. Furthermore, compared to the fine-tuned SOTA method DIVER-v2, LATTICE attains comparable results on BRIGHT subsets that use a static corpus for evaluation. △ Less

Submitted 15 October, 2025; originally announced October 2025.

arXiv:2510.12825 [pdf, ps, other]

Classifier-Augmented Generation for Structured Workflow Prediction

Authors: Thomas Gschwind, Shramona Chakraborty, Nitin Gupta, Sameep Mehta

Abstract: ETL (Extract, Transform, Load) tools such as IBM DataStage allow users to visually assemble complex data workflows, but configuring stages and their properties remains time consuming and requires deep tool knowledge. We propose a system that translates natural language descriptions into executable workflows, automatically predicting both the structure and detailed configuration of the flow. At its… ▽ More ETL (Extract, Transform, Load) tools such as IBM DataStage allow users to visually assemble complex data workflows, but configuring stages and their properties remains time consuming and requires deep tool knowledge. We propose a system that translates natural language descriptions into executable workflows, automatically predicting both the structure and detailed configuration of the flow. At its core lies a Classifier-Augmented Generation (CAG) approach that combines utterance decomposition with a classifier and stage-specific few-shot prompting to produce accurate stage predictions. These stages are then connected into non-linear workflows using edge prediction, and stage properties are inferred from sub-utterance context. We compare CAG against strong single-prompt and agentic baselines, showing improved accuracy and efficiency, while substantially reducing token usage. Our architecture is modular, interpretable, and capable of end-to-end workflow generation, including robust validation steps. To our knowledge, this is the first system with a detailed evaluation across stage prediction, edge layout, and property generation for natural-language-driven ETL authoring. △ Less

Submitted 10 October, 2025; originally announced October 2025.

Comments: Accepted at EMNLP 2025

MSC Class: 68T50; 68T05; 68T09 ACM Class: I.2.7; I.2.6; H.2.5

arXiv:2510.11145 [pdf, ps, other]

doi 10.46620/URSIAPRASC25/ORRW4934

Machine Learning Frameworks for Large-Scale Radio Surveys: A Summary of Recent Studies

Authors: Nikhel Gupta

Abstract: The rapid growth of large-scale radio surveys, generating over 100 petabytes of data annually, has created a pressing need for automated data analysis methods. Recent research has explored the application of machine learning techniques to address the challenges associated with detecting and classifying radio galaxies, as well as discovering peculiar radio sources. This paper provides an overview o… ▽ More The rapid growth of large-scale radio surveys, generating over 100 petabytes of data annually, has created a pressing need for automated data analysis methods. Recent research has explored the application of machine learning techniques to address the challenges associated with detecting and classifying radio galaxies, as well as discovering peculiar radio sources. This paper provides an overview of our investigations with the Evolutionary Map of the Universe (EMU) survey, detailing the methodologies employed-including supervised, unsupervised, self-supervised, and weakly supervised learning approaches -- and their implications for ongoing and future radio astronomical surveys. △ Less

Submitted 13 October, 2025; originally announced October 2025.

Comments: 7 pages, 1 figure, URSI AP-RASC 2025

Journal ref: Published in IEEE for URSI AP-RASC 2025 Proceedings

arXiv:2510.05396 [pdf, ps, other]

Scalable In-context Ranking with Generative Models

Authors: Nilesh Gupta, Chong You, Srinadh Bhojanapalli, Sanjiv Kumar, Inderjit Dhillon, Felix Yu

Abstract: In-context Ranking (ICR) is an emerging paradigm for Information Retrieval (IR), which leverages contextual understanding of LLMs by directly incorporating the task description, candidate documents, and the query into the model's input prompt and tasking the LLM to identify relevant document(s). While it is effective, efficiency is a significant challenge in this paradigm, especially as the candid… ▽ More In-context Ranking (ICR) is an emerging paradigm for Information Retrieval (IR), which leverages contextual understanding of LLMs by directly incorporating the task description, candidate documents, and the query into the model's input prompt and tasking the LLM to identify relevant document(s). While it is effective, efficiency is a significant challenge in this paradigm, especially as the candidate list grows due to quadratic/super-linear scaling of attention operation with context length. To this end, this paper first identifies inherent and exploitable structures in the attention of LLMs finetuned for ICR: (1) inter-document block sparsity: attention is dense within each document block but sparse across different documents in the context; and (2) query-document block relevance: the attention scores from certain query tokens to a document block in middle layers strongly correlate with that document's actual relevance. Motivated by these observations, we introduce BlockRank (Blockwise In-context Ranking), a novel method that adapts the attention operation in an LLM by (a) architecturally enforcing the observed inter-document block sparsity, reducing attention complexity from quadratic to linear without loss in performance, and (b) optimizing query-document block relevance for true relevant documents during fine-tuning using an auxiliary contrastive training objective, improving retrieval in attention. Experiments on BEIR, MSMarco and NQ with Mistral-7B demonstrate that BlockRank Mistral matches or outperforms existing SOTA listwise rankers and controlled fine-tuned baseline while being significantly more efficient at inference (4.7x for 100 MSMarco documents in context) and scaling gracefully to long-context shortlists, around 500 documents in-context (approximately 100K context length) within a second, presenting a scalable and effective solution for ICR. △ Less

Submitted 7 October, 2025; v1 submitted 6 October, 2025; originally announced October 2025.

Journal ref: Neurips 2025

arXiv:2510.04568 [pdf, ps, other]

COSMIR: Chain Orchestrated Structured Memory for Iterative Reasoning over Long Context

Authors: Naman Gupta, Shreeyash Gowaikar, Arun Iyer, Kirankumar Shiragur, Ramakrishna B Bairi, Rishikesh Maurya, Ritabrata Maiti, Sankarshan Damle, Shachee Mishra Gupta

Abstract: Reasoning over very long inputs remains difficult for large language models (LLMs). Common workarounds either shrink the input via retrieval (risking missed evidence), enlarge the context window (straining selectivity), or stage multiple agents to read in pieces. In staged pipelines (e.g., Chain of Agents, CoA), free-form summaries passed between agents can discard crucial details and amplify earl… ▽ More Reasoning over very long inputs remains difficult for large language models (LLMs). Common workarounds either shrink the input via retrieval (risking missed evidence), enlarge the context window (straining selectivity), or stage multiple agents to read in pieces. In staged pipelines (e.g., Chain of Agents, CoA), free-form summaries passed between agents can discard crucial details and amplify early mistakes. We introduce COSMIR (Chain Orchestrated Structured Memory for Iterative Reasoning), a chain-style framework that replaces ad hoc messages with a structured memory. A Planner agent first turns a user query into concrete, checkable sub-questions. worker agents process chunks via a fixed micro-cycle: Extract, Infer, Refine, writing all updates to the shared memory. A Manager agent then Synthesizes the final answer directly from the memory. This preserves step-wise read-then-reason benefits while changing both the communication medium (structured memory) and the worker procedure (fixed micro-cycle), yielding higher faithfulness, better long-range aggregation, and auditability. On long-context QA from the HELMET suite, COSMIR reduces propagation-stage information loss and improves accuracy over a CoA baseline. △ Less

Submitted 6 October, 2025; originally announced October 2025.

arXiv:2510.03400 [pdf, ps, other]

doi 10.3847/2041-8213/ae0d8b

H I Properties of Field Galaxies at $\boldsymbol{z\approx 0.2}$-0.6: Insights into Declining Cosmic Star Formation

Authors: David DePalma, Neeraj Gupta, Hsiao-Wen Chen, Robert A. Simcoe, Sergei Balashev, Erin Boettcher, Sebastiano Cantalupo, Mandy C. Chen, Françoise Combes, Claude-André Faucher-Giguère, Sean D. Johnson, Hans-Rainer Klöckner, Jens-Kristian Krogager, Jennifer I-Hsiu Li, Sebastián López, Pasquier Noterdaeme, Patrick Petitjean, Zhijie Qu, Gwen C. Rudie, Joop Schaye, Fakhri Zahedy

Abstract: We report statistically significant detection of H I 21-cm emission from intermediate-redshift ($z\approx0.2$-0.6) galaxies. By leveraging multi-sightline galaxy survey data from the Cosmic Ultraviolet Baryon Survey (CUBS) and deep radio observations from the MeerKAT Absorption Line Survey (MALS), we have established a sample of $\approx6000$ spectroscopically identified galaxies in 11 distinct fi… ▽ More We report statistically significant detection of H I 21-cm emission from intermediate-redshift ($z\approx0.2$-0.6) galaxies. By leveraging multi-sightline galaxy survey data from the Cosmic Ultraviolet Baryon Survey (CUBS) and deep radio observations from the MeerKAT Absorption Line Survey (MALS), we have established a sample of $\approx6000$ spectroscopically identified galaxies in 11 distinct fields to constrain the neutral gas content at intermediate redshifts. The galaxies sample a broad range in stellar mass -- $8\lesssim\log{M_\rm{star}/\rm{M}_\odot}\lesssim11$ with a median of $\langle\log{M_\rm{star}/\rm{M}_\odot}\rangle_\rm{med}\approx10$ -- and a wide range in redshift -- $0.24\lesssim z\lesssim0.63$ with a median of $\langle z\rangle_\rm{med}=0.44$. Our detected emission-line signal exceeds $4\,σ$ significance in the stacked spectra of all subsamples, and the observed total H I 21-cm line flux translates to a H I mass $M_\rm{H\;I}\approx10^{10}\rm{M}_\odot$. We find a high H I-to-stellar mass ratio of $M_\mathrm{H\;I}/M_\rm{star}\approx6$ for low-mass galaxies with $\langle\log{M_\rm{star}/\rm{M}_\odot}\rangle \approx9.3$ ($>3.7\,σ$). For galaxies with $\langle\log{M_\rm{star}/\rm{M}_\odot}\rangle\approx10.6$, we find $M_\mathrm{H\;I}/M_\rm{star}\approx0.3$ ($>4.7\,σ$). Additionally, the redshift evolution of H I mass in both low- and high-mass field galaxies, inferred from the stacked emission-line signal, aligns well with the expectation from the cosmic star formation history. This suggests that the overall decline in the cosmic star formation activity across the general galaxy population may be connected to a decreasing supply of neutral hydrogen. Finally, our analysis has revealed significant 21-cm signals at distances greater than 75 kpc from these intermediate-redshift galaxies, indicating a substantial reservoir of H I gas in their extended surroundings. △ Less

Submitted 3 October, 2025; originally announced October 2025.

Comments: 14 pages, 6 figures, forthcoming in The Astrophysical Journal Letters

arXiv:2510.00310 [pdf, ps, other]

Robust Federated Inference

Authors: Akash Dhasade, Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Maxime Jacovella, Anne-Marie Kermarrec, Rafael Pinot

Abstract: Federated inference, in the form of one-shot federated learning, edge ensembles, or federated ensembles, has emerged as an attractive solution to combine predictions from multiple models. This paradigm enables each model to remain local and proprietary while a central server queries them and aggregates predictions. Yet, the robustness of federated inference has been largely neglected, leaving them… ▽ More Federated inference, in the form of one-shot federated learning, edge ensembles, or federated ensembles, has emerged as an attractive solution to combine predictions from multiple models. This paradigm enables each model to remain local and proprietary while a central server queries them and aggregates predictions. Yet, the robustness of federated inference has been largely neglected, leaving them vulnerable to even simple attacks. To address this critical gap, we formalize the problem of robust federated inference and provide the first robustness analysis of this class of methods. Our analysis of averaging-based aggregators shows that the error of the aggregator is small either when the dissimilarity between honest responses is small or the margin between the two most probable classes is large. Moving beyond linear averaging, we show that problem of robust federated inference with non-linear aggregators can be cast as an adversarial machine learning problem. We then introduce an advanced technique using the DeepSet aggregation model, proposing a novel composition of adversarial training and test-time robust aggregation to robustify non-linear aggregators. Our composition yields significant improvements, surpassing existing robust aggregation methods by 4.7 - 22.2% in accuracy points across diverse benchmarks. △ Less

Submitted 17 October, 2025; v1 submitted 30 September, 2025; originally announced October 2025.

arXiv:2509.25193 [pdf, ps, other]

Devstral: Fine-tuning Language Models for Coding Agent Applications

Authors: Abhinav Rastogi, Adam Yang, Albert Q. Jiang, Alexander H. Liu, Alexandre Sablayrolles, Amélie Héliou, Amélie Martin, Anmol Agarwal, Andy Ehrenberg, Andy Lo, Antoine Roux, Arthur Darcet, Arthur Mensch, Baptiste Bout, Baptiste Rozière, Baudouin De Monicault, Chris Bamford, Christian Wallenwein, Christophe Renaudin, Clémence Lanfranchi, Clément Denoix, Corentin Barreau, Darius Dabert Devon Mizelle, Diego de las Casas, Elliot Chane-Sane , et al. (78 additional authors not shown)

Abstract: We introduce Devstral-Small, a lightweight open source model for code agents with the best performance among models below 100B size. In this technical report, we give an overview of how we design and develop a model and craft specializations in agentic software development. The resulting model, Devstral-Small is a small 24B model, fast and easy to serve. Despite its size, Devstral-Small still atta… ▽ More We introduce Devstral-Small, a lightweight open source model for code agents with the best performance among models below 100B size. In this technical report, we give an overview of how we design and develop a model and craft specializations in agentic software development. The resulting model, Devstral-Small is a small 24B model, fast and easy to serve. Despite its size, Devstral-Small still attains competitive performance compared to models more than an order of magnitude larger. △ Less

Submitted 8 August, 2025; originally announced September 2025.

arXiv:2509.25155 [pdf, ps, other]

Context-Driven Performance Modeling for Causal Inference Operators on Neural Processing Units

Authors: Neelesh Gupta, Rakshith Jayanth, Dhruv Parikh, Viktor Prasanna

Abstract: The proliferation of large language models (LLMs) has driven demand for long context inference on resource constrained edge devices. However, deploying these models on Neural Processing Units (NPUs) presents significant challenges due to the architectural mismatch: quadratic complexity of standard attention mechanisms conflicts with memory and compute patterns of edge accelerators. This paper pres… ▽ More The proliferation of large language models (LLMs) has driven demand for long context inference on resource constrained edge devices. However, deploying these models on Neural Processing Units (NPUs) presents significant challenges due to the architectural mismatch: quadratic complexity of standard attention mechanisms conflicts with memory and compute patterns of edge accelerators. This paper presents a comprehensive performance analysis of various causal inference operators on a modern NPU. We benchmark standard quadratic attention against several sub-quadratic alternatives, including structured state-space and linear attention models. Our analysis reveals that while sub-quadratic methods offer superior scalability, they introduce distinct computational bottlenecks on the NPU's specialized execution units. We identify that quadratic attention becomes severely memory-bound, suffering from cache inefficiency and pipeline stalls exceeding 95% at long contexts. In contrast, sub-quadratic models can become compute-bound on programmable vector cores. These findings provide critical insights for the co-design of hardware-aware models and optimization strategies to enable on-device AI inference with long-contexts. △ Less

Submitted 29 September, 2025; originally announced September 2025.

Comments: IEEE HiPC 2025

arXiv:2509.21970 [pdf, ps, other]

doi 10.1051/0004-6361/202555015

Origin of gas in the Magellanic Bridge: MeerKAT detection of HI 21-cm absorption

Authors: A. P. M. Morelli, J. Kerp, N. Gupta, F. Combes, S. A. Balashev, P. Noterdaeme, H. Chen, K. L. Emig, E. Momjian

Abstract: HI 21-cm absorption lines are investigated to determine the origin of the neutral atomic hydrogen (HI) of the Magellanic Bridge (MB). Using the MeerKat Absorption Line Survey (MALS) data we report the detection of an HI absorption line at a peak signal-to-noise ratio of 10 caused by MB gas against the radio source J033242.97-724904.5. In combination with earlier data obtained with the Australia Te… ▽ More HI 21-cm absorption lines are investigated to determine the origin of the neutral atomic hydrogen (HI) of the Magellanic Bridge (MB). Using the MeerKat Absorption Line Survey (MALS) data we report the detection of an HI absorption line at a peak signal-to-noise ratio of 10 caused by MB gas against the radio source J033242.97-724904.5. In combination with earlier data obtained with the Australia Telescope Compact Array (ATCA) our new detected HI line permits the exploration of the MB atomic hydrogen gas across 4-6 kpc. The radial velocity profiles from the ATCA data and new data from MALS are analysed. Apart from the excitation conditions, the radial velocity structure of the HI gas seen in emission and absorption is investigated. Eventually the gas-to-dust ratio is quantified to identify the origin of the MB gas being either from the SMC (Small Magellanic Cloud) or the LMC (Large Magellanic Cloud). The HI absorption lines towards lines of sight separated by several kpc consistently coincide with the densest and perhaps coolest gas at the lower radial-velocity limit of the corresponding HI emission profiles. The gas-to-dust ratio is found to be consistent with an origin of the MB gas from the LMC. The large scale velocity distribution as seen from the HI absorption features favors the LMC-SMC direct collision scenario over the close fly-by scenario, as also currently found by numerical simulations. △ Less

Submitted 26 September, 2025; originally announced September 2025.

Comments: accepted for publication in Astronomy & Astrophysics Letters

Journal ref: A&A 702, L11 (2025)

arXiv:2509.21314 [pdf, ps, other]

Explaining the Origin of TeV Gamma Rays from M87 During High and Low States

Authors: Nibedita Mondal, Sandeep Kumar Mondal, Nayantara Gupta

Abstract: The detection of very high-energy gamma-rays from M87 can provide crucial insights into particle acceleration and radiation mechanisms in jets. The recent observations by the Large High Altitude Air Shower Observatory (LHAASO) detector extend the energy range of TeV gamma-ray astronomy, and also the variability study to the TeV energy domain. We have modelled the low state and flare state multi-wa… ▽ More The detection of very high-energy gamma-rays from M87 can provide crucial insights into particle acceleration and radiation mechanisms in jets. The recent observations by the Large High Altitude Air Shower Observatory (LHAASO) detector extend the energy range of TeV gamma-ray astronomy, and also the variability study to the TeV energy domain. We have modelled the low state and flare state multi-wavelength spectral energy distributions of M87 within a time-dependent framework. In our model, the low state gamma-ray flux results from the emissions from the sub-parsec and the kilo-parsec scale jets of M87, whereas the flare state gamma-ray flux is mainly produced in the sub-parsec scale jet. We have shown that the spectral and temporal features of the TeV gamma-ray spectrum of M87 are consistent with this two-zone model, where the contribution from the sub-parsec scale jet significantly increases during the flare state. △ Less

Submitted 25 September, 2025; originally announced September 2025.

Comments: 18 pages, 4 figures

arXiv:2509.21026 [pdf, ps, other]

A Novel Integrated Architecture for Intent Based Approach and Zero Touch Networks

Authors: Neelam Gupta, Dibakar Das, Tamizhelakkiya K, Uma Maheswari Natarajan, Sharvari Ravindran, Komal Sharma, Jyotsna Bapat, Debabrata Das

Abstract: The transition to Sixth Generation (6G) networks presents challenges in managing quality of service (QoS) of diverse applications and achieving Service Level Agreements (SLAs) under varying network conditions. Hence, network management must be automated with the help of Machine Learning (ML) and Artificial Intelligence (AI) to achieve real-time requirements. Zero touch network (ZTN) is one of the… ▽ More The transition to Sixth Generation (6G) networks presents challenges in managing quality of service (QoS) of diverse applications and achieving Service Level Agreements (SLAs) under varying network conditions. Hence, network management must be automated with the help of Machine Learning (ML) and Artificial Intelligence (AI) to achieve real-time requirements. Zero touch network (ZTN) is one of the frameworks to automate network management with mechanisms such as closed loop control to ensure that the goals are met perpetually. Intent- Based Networking (IBN) specifies the user intents with diverse network requirements or goals which are then translated into specific network configurations and actions. This paper presents a novel architecture for integrating IBN and ZTN to serve the intent goals. Users provides the intent in the form of natural language, e.g., English, which is then translated using natural language processing (NLP) techniques (e.g., retrieval augmented generation (RAG)) into Network Intent LanguagE (Nile). The Nile intent is then passed on to the BiLSTM and Q-learning based ZTN closed loop framework as a goal which maintains the intent under varying network conditions. Thus, the proposed architecture can work autonomously to ensure the network performance goal is met by just specifying the user intent in English. The integrated architecture is also implemented on a testbed using OpenAirInterface (OAI). Additionally, to evaluate the architecture, an optimization problem is formulated which evaluated with Monte Carlo simulations. Results demonstrate how ZTN can help achieve the bandwidth goals autonomously set by user intent. The simulation and the testbed results are compared and they show similar trend. Mean Opinion Score (MOS) for Quality of Experience (QoE) is also measured to indicate the user satisfaction of the intent. △ Less

Submitted 25 September, 2025; originally announced September 2025.

arXiv:2509.20547 [pdf, ps, other]

Realization of Graphene Quantum Dots for Innovative Biosensor Development and Diverse Applications

Authors: Kumar Gautam, Kumar Shubham, Hitesh Sharma, Divya Punia, Ajay K Sharma, Namisha Gupta, Varun Rathor, Vishakha Singh

Abstract: This paper investigates quantum dots (QDs), which are miniature semiconductor structures with remarkable optical and electrical properties due to quantum confinement processes. Traditional QDs, such as CdTe, have been extensively investigated; however, they frequently exhibit toxicity and stability issues. Graphene quantum dots (GQDs) are emerging as a safer and more stable alternative to traditio… ▽ More This paper investigates quantum dots (QDs), which are miniature semiconductor structures with remarkable optical and electrical properties due to quantum confinement processes. Traditional QDs, such as CdTe, have been extensively investigated; however, they frequently exhibit toxicity and stability issues. Graphene quantum dots (GQDs) are emerging as a safer and more stable alternative to traditional QDs. GQDs are honeycomb-lattice carbon atoms with unique electronic and optical properties that make them promising candidates for biomedical, electronic, and energy storage applications. GQD synthesis methods (top-down and bottom-up) and their advantages over standard QDs include better photostability, biocompatibility, and configurable band gaps. GQDs are perfect for real-world uses like sensitive biosensing, real-time food safety monitoring, and smart packaging because of their low toxicity, high sensitivity, and affordability. These uses are all essential for cutting down on food grain waste. This emphasizes the growing significance of GQDs in advancing nanotechnology and their potential integration with quantum technologies, paving the door for creative solutions in biosensing, food safety, environmental monitoring, and future quantum electronics. △ Less

Submitted 24 September, 2025; originally announced September 2025.

arXiv:2509.08207 [pdf, ps, other]

Aurora: Architecting Argonne's First Exascale Supercomputer for Accelerated Scientific Discovery

Authors: Benjamin S. Allen, James Anchell, Victor Anisimov, Thomas Applencourt, Abhishek Bagusetty, Ramesh Balakrishnan, Riccardo Balin, Solomon Bekele, Colleen Bertoni, Cyrus Blackworth, Renzo Bustamante, Kevin Canada, John Carrier, Christopher Chan-nui, Lance C. Cheney, Taylor Childers, Paul Coffman, Susan Coghlan, Michael D'Mello, Murali Emani, Kyle G. Felker, Sam Foreman, Olivier Franza, Longfei Gao, Marta García , et al. (72 additional authors not shown)

Abstract: Aurora is Argonne National Laboratory's pioneering Exascale supercomputer, designed to accelerate scientific discovery with cutting-edge architectural innovations. Key new technologies include the Intel(TM) Xeon(TM) Data Center GPU Max Series (code-named Sapphire Rapids) with support for High Bandwidth Memory (HBM), alongside the Intel(TM) Data Center GPU Max Series (code-named Ponte Vecchio) on e… ▽ More Aurora is Argonne National Laboratory's pioneering Exascale supercomputer, designed to accelerate scientific discovery with cutting-edge architectural innovations. Key new technologies include the Intel(TM) Xeon(TM) Data Center GPU Max Series (code-named Sapphire Rapids) with support for High Bandwidth Memory (HBM), alongside the Intel(TM) Data Center GPU Max Series (code-named Ponte Vecchio) on each compute node. Aurora also integrates the Distributed Asynchronous Object Storage (DAOS), a novel exascale storage solution, and leverages Intel's oneAPI programming environment. This paper presents an in-depth exploration of Aurora's node architecture, the HPE Slingshot interconnect, the supporting software ecosystem, and DAOS. We provide insights into standard benchmark performance and applications readiness efforts via Aurora's Early Science Program and the Exascale Computing Project. △ Less

Submitted 9 September, 2025; originally announced September 2025.

Comments: 40 pages, 10 figures. Submitted to J. Supercomputing

ACM Class: C.0; C.4; C.5.1; B.8.0; D.1.3

arXiv:2509.08054 [pdf, ps, other]

doi 10.1103/kw5g-d732

GW250114: testing Hawking's area law and the Kerr nature of black holes

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu , et al. (1763 additional authors not shown)

Abstract: The gravitational-wave signal GW250114 was observed by the two LIGO detectors with a network matched-filter signal-to-noise ratio of 80. The signal was emitted by the coalescence of two black holes with near-equal masses $m_1 = 33.6^{+1.2}_{-0.8}\,M_\odot$ and $m_2 = 32.2^{+0.8}_{-1.3}\,M_\odot$, and small spins $χ_{1,2} \leq 0.26$ (90% credibility) and negligible eccentricity $e \leq 0.03$. Post-… ▽ More The gravitational-wave signal GW250114 was observed by the two LIGO detectors with a network matched-filter signal-to-noise ratio of 80. The signal was emitted by the coalescence of two black holes with near-equal masses $m_1 = 33.6^{+1.2}_{-0.8}\,M_\odot$ and $m_2 = 32.2^{+0.8}_{-1.3}\,M_\odot$, and small spins $χ_{1,2} \leq 0.26$ (90% credibility) and negligible eccentricity $e \leq 0.03$. Post-merger data excluding the peak region are consistent with the dominant quadrupolar $(\ell = |m| = 2)$ mode of a Kerr black hole and its first overtone. We constrain the modes' frequencies to $\pm 30\%$ of the Kerr spectrum, providing a test of the remnant's Kerr nature. We also examine Hawking's area law, also known as the second law of black hole mechanics, which states that the total area of the black hole event horizons cannot decrease with time. A range of analyses that exclude up to 5 of the strongest merger cycles confirm that the remnant area is larger than the sum of the initial areas to high credibility. △ Less

Submitted 9 September, 2025; originally announced September 2025.

Comments: 6 pages, 5 figures (plus supplement)

Report number: LIGO-P2500421

arXiv:2509.07352 [pdf, ps, other]

Directed searches for gravitational waves from ultralight vector boson clouds around merger remnant and galactic black holes during the first part of the fourth LIGO-Virgo-KAGRA observing run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu , et al. (1747 additional authors not shown)

Abstract: We present the first directed searches for long-transient and continuous gravitational waves from ultralight vector boson clouds around known black holes (BHs). We use LIGO data from the first part of the fourth LIGO-Virgo-KAGRA observing run. The searches target two distinct types of BHs and use two new semicoherent methods: hidden Markov model (HMM) tracking for the remnant BHs of the mergers GW… ▽ More We present the first directed searches for long-transient and continuous gravitational waves from ultralight vector boson clouds around known black holes (BHs). We use LIGO data from the first part of the fourth LIGO-Virgo-KAGRA observing run. The searches target two distinct types of BHs and use two new semicoherent methods: hidden Markov model (HMM) tracking for the remnant BHs of the mergers GW230814_230901 and GW231123_135430 (referred to as GW230814 and GW231123 in this study), and a dedicated method using the Band Sampled Data (BSD) framework for the galactic BH in the Cygnus X-1 binary system. Without finding evidence of a signal from vector bosons in the data, we estimate the mass range that can be constrained. For the HMM searches targeting the remnants from GW231123 and GW230814, we disfavor vector boson masses in the ranges $[0.94, 1.08]$ and $[2.75, 3.28] \times 10^{-13}$ eV, respectively, at 30% confidence, assuming a 1% false alarm probability. Although these searches are only marginally sensitive to signals from merger remnants at relatively large distances, future observations are expected to yield more stringent constraints with high confidence. For the BSD search targeting the BH in Cygnus X-1, we exclude vector boson masses in the range $[0.85, 1.59] \times 10^{-13}$ eV at 95% confidence, assuming an initial BH spin larger than 0.5. △ Less

Submitted 14 September, 2025; v1 submitted 8 September, 2025; originally announced September 2025.

Comments: 33 pages, 4 figures

Report number: LIGO-P2500256

arXiv:2509.06272 [pdf, ps, other]

An Explainable Framework for Particle Swarm Optimization using Landscape Analysis and Machine Learning

Authors: Nitin Gupta, Bapi Dutta, Anupam Yadav

Abstract: Swarm intelligence algorithms have demonstrated remarkable success in solving complex optimization problems across diverse domains. However, their widespread adoption is often hindered by limited transparency in how algorithmic components influence performance. This work presents a multi-faceted investigation of Particle Swarm Optimization (PSO) to further understand the key role of different topo… ▽ More Swarm intelligence algorithms have demonstrated remarkable success in solving complex optimization problems across diverse domains. However, their widespread adoption is often hindered by limited transparency in how algorithmic components influence performance. This work presents a multi-faceted investigation of Particle Swarm Optimization (PSO) to further understand the key role of different topologies for better interpretability and explainability. To achieve this objective, we first develop a comprehensive landscape characterization framework using Exploratory Landscape Analysis (ELA) to quantify problem difficulty and identify critical features affecting the optimization performance of PSO. Next, we conduct a rigorous empirical study comparing three fundamental swarm communication architectures -- Ring, Star, and Von Neumann topologies -- analysing their distinct impacts on exploration-exploitation balance, convergence behaviour, and solution quality and eventually develop an explainable benchmarking framework for PSO, to decode how swarm topologies affects information flow, diversity, and convergence. Based on this, a novel machine learning approach for automated algorithm configuration is introduced for training predictive models on extensive Area over the Convergence Curve (AOCC) data to recommend optimal settings based on problem characteristics. Through systematic experimentation across twenty four benchmark functions in multiple dimensions, we establish practical guidelines for topology selection and parameter configuration. These findings advance the development of more transparent and reliable swarm intelligence systems. The source codes of this work can be accessed at https://github.com/GitNitin02/ioh_pso. △ Less

Submitted 7 September, 2025; originally announced September 2025.

arXiv:2509.04981 [pdf, ps, other]

Deep polarimetry study reveals double ring ORC-like structures

Authors: Sam Taziaux, Dominik J. Bomans, Christopher J. Riseley, Alec J. M. Thomson, Ray P. Norris, Aritra Basu, George H. Heald, Timothy J. Galvin, Björn Adebahr, Miroslav D. Filipović, Nikhel Gupta, Stas Shabala, Tayyaba Zafar

Abstract: New observations with the current generation of advanced radio interferometers, such as ASKAP and MeerKAT, have led to the discovery of new classes of extended radio sources of unknown origin, including the so-called Odd Radio Circles (ORCs). These phenomena are detected exclusively in the radio continuum, with no clear counterparts at other wavelengths, making their physical nature and origin a s… ▽ More New observations with the current generation of advanced radio interferometers, such as ASKAP and MeerKAT, have led to the discovery of new classes of extended radio sources of unknown origin, including the so-called Odd Radio Circles (ORCs). These phenomena are detected exclusively in the radio continuum, with no clear counterparts at other wavelengths, making their physical nature and origin a subject of ongoing investigation. To better understand these objects, we study their radio continuum emission, spectral characteristics, and magnetic field properties. In this work, we present a radio spectropolarimetry analysis of a newly discovered ORC (ORC J0356-4216) that exhibits a rare double-ring morphology. We use data from the MeerKAT L-band and from the ASKAP Evolutionary Map of the Universe (EMU) at 943 MHz. ORC J0356-4216 shows a symmetric double-ring structure with a diameter of approximately 2 arcminutes, corresponding to a physical size of 668 kpc based on the redshift ($0.494 \pm 0.068$) of its apparent host galaxy WISEA J035609.67-421603.5. The radio spectra of both rings are steep, with spectral indices of $-1.18 \pm 0.03$ and $-1.12 \pm 0.05$, and show no significant substructure. Equipartition magnetic field strengths (assuming K0 = 1) are estimated to be 1.82 microGauss and 1.65 microGauss for the respective rings. The degree of polarisation across the object ranges between 20-30%, consistent with a non-thermal synchrotron origin. The morphology and polarisation are broadly consistent with large-scale shocks driven by powerful starburst outflows. However, the high degree of symmetry, the coherent double-ring structure, and the absence of internal substructure are features commonly associated with relic AGN lobes, making this scenario particularly compatible with the observed characteristics. △ Less

Submitted 5 September, 2025; originally announced September 2025.

Comments: 11 pages, 7 figures

arXiv:2509.04348 [pdf, ps, other]

GWTC-4.0: Constraints on the Cosmic Expansion Rate and Modified Gravitational-wave Propagation

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu , et al. (1750 additional authors not shown)

Abstract: We analyze data from 142 of the 218 gravitational-wave (GW) sources in the fourth LIGO-Virgo-KAGRA Collaboration (LVK) Gravitational-Wave Transient Catalog (GWTC-4.0) to estimate the Hubble constant $H_0$ jointly with the population properties of merging compact binaries. We measure the luminosity distance and redshifted masses of GW sources directly; in contrast, we infer GW source redshifts stat… ▽ More We analyze data from 142 of the 218 gravitational-wave (GW) sources in the fourth LIGO-Virgo-KAGRA Collaboration (LVK) Gravitational-Wave Transient Catalog (GWTC-4.0) to estimate the Hubble constant $H_0$ jointly with the population properties of merging compact binaries. We measure the luminosity distance and redshifted masses of GW sources directly; in contrast, we infer GW source redshifts statistically through i) location of features in the compact object mass spectrum and merger rate evolution, and ii) identifying potential host galaxies in the GW localization volume. Probing the relationship between source luminosity distances and redshifts obtained in this way yields constraints on cosmological parameters. We also constrain parameterized deviations from general relativity which affect GW propagation, specifically those modifying the dependence of a GW signal on the source luminosity distance. Assuming our fiducial model for the source-frame mass distribution and using GW candidates detected up to the end of the fourth observing run (O4a), together with the GLADE+ all-sky galaxy catalog, we estimate $H_0 = 76.6^{+13.0}_{-9.5} (76.6^{+25.2}_{-14.0})$ km s$^{-1}$ Mpc$^{-1}$. This value is reported as a median with 68.3% (90%) symmetric credible interval, and includes combination with the $H_0$ measurement from GW170817 and its electromagnetic counterpart. Using a parametrization of modified GW propagation in terms of the magnitude parameter $Ξ_0$, we estimate $Ξ_0 = 1.2^{+0.8}_{-0.4} (1.2^{+2.4}_{-0.5})$, where $Ξ_0 = 1$ recovers the behavior of general relativity. △ Less

Submitted 7 October, 2025; v1 submitted 4 September, 2025; originally announced September 2025.

Comments: As part of the Astrophysical Journal Letters Focus Issue on the Gravitational Wave Transient Catalog

Report number: LIGO-P2400152

arXiv:2508.20721 [pdf, ps, other]

Upper Limits on the Isotropic Gravitational-Wave Background from the first part of LIGO, Virgo, and KAGRA's fourth Observing Run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu , et al. (1751 additional authors not shown)

Abstract: We present results from the search for an isotropic gravitational-wave background using Advanced LIGO and Advanced Virgo data from O1 through O4a, the first part of the fourth observing run. This background is the accumulated signal from unresolved sources throughout cosmic history and encodes information about the merger history of compact binaries throughout the Universe, as well as exotic physi… ▽ More We present results from the search for an isotropic gravitational-wave background using Advanced LIGO and Advanced Virgo data from O1 through O4a, the first part of the fourth observing run. This background is the accumulated signal from unresolved sources throughout cosmic history and encodes information about the merger history of compact binaries throughout the Universe, as well as exotic physics and potentially primordial processes from the early cosmos. Our cross-correlation analysis reveals no statistically significant background signal, enabling us to constrain several theoretical scenarios. For compact binary coalescences which approximately follow a 2/3 power-law spectrum, we constrain the fractional energy density to $Ω_{\rm GW}(25{\rm Hz})\leq 2.0\times 10^{-9}$ (95% cred.), a factor of 1.7 improvement over previous results. Scale-invariant backgrounds are constrained to $Ω_{\rm GW}(25{\rm Hz})\leq 2.8\times 10^{-9}$, representing a 2.1x sensitivity gain. We also place new limits on gravity theories predicting non-standard polarization modes and confirm that terrestrial magnetic noise sources remain below detection threshold. Combining these spectral limits with population models for GWTC-4, the latest gravitational-wave event catalog, we find our constraints remain above predicted merger backgrounds but are approaching detectability. The joint analysis combining the background limits shown here with the GWTC-4 catalog enables improved inference of the binary black hole merger rate evolution across cosmic time. Employing GWTC-4 inference results and standard modeling choices, we estimate that the total background arising from compact binary coalescences is $Ω_{\rm CBC}(25{\rm Hz})={0.9^{+1.1}_{-0.5}\times 10^{-9}}$ at 90% confidence, where the largest contribution is due to binary black holes only, $Ω_{\rm BBH}(25{\rm Hz})=0.8^{+1.1}_{-0.5}\times 10^{-9}$. △ Less

Submitted 28 August, 2025; originally announced August 2025.

Comments: 31 pages, 7 figures

Report number: LIGO-P2500349

arXiv:2508.18083 [pdf, ps, other]

GWTC-4.0: Population Properties of Merging Compact Binaries

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, S. Ahmadzadeh, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi , et al. (1783 additional authors not shown)

Abstract: We detail the population properties of merging compact objects using 158 mergers from the cumulative Gravitational-Wave Transient Catalog 4.0, which includes three types of binary mergers: binary neutron star, neutron star--black hole binary, and binary black hole mergers. We resolve multiple over- and under-densities in the black hole mass distribution: features persist at primary masses of… ▽ More We detail the population properties of merging compact objects using 158 mergers from the cumulative Gravitational-Wave Transient Catalog 4.0, which includes three types of binary mergers: binary neutron star, neutron star--black hole binary, and binary black hole mergers. We resolve multiple over- and under-densities in the black hole mass distribution: features persist at primary masses of $10\,M_\odot$ and $35\,M_\odot$ with a possible third feature at $\sim 20\,M_\odot$. These are departures from an otherwise power-law-like continuum that steepens above $35\,M_\odot$. Binary black holes with primary masses near $10\,M_\odot$ are more likely to have less massive secondaries, with a mass ratio distribution peaking at $q = 0.74^{+0.13}_{-0.13}$, potentially a signature of stable mass transfer during binary evolution. Black hole spins are inferred to be non-extremal, with 90\% of black holes having $χ< 0.57$, and preferentially aligned with binary orbits, implying many merging binaries form in isolation. However, we find a significant fraction, 0.24-0.42, of binaries have negative effective inspiral spins, suggesting many could be formed dynamically in gas-free environments. We find evidence for correlation between effective inspiral spin and mass ratio, though it is unclear if this is driven by variation in the mode of the distribution or the width. (Abridged) △ Less

Submitted 17 September, 2025; v1 submitted 25 August, 2025; originally announced August 2025.

Comments: As part of the Astrophysical Journal Letters Focus Issue on the Gravitational Wave Transient Catalog

Report number: LIGO-P2400004

arXiv:2508.18082 [pdf]

GWTC-4.0: Updating the Gravitational-Wave Transient Catalog with Observations from the First Part of the Fourth LIGO-Virgo-KAGRA Observing Run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu , et al. (1748 additional authors not shown)

Abstract: Version 4.0 of the Gravitational-Wave Transient Catalog (GWTC-4.0) adds new candidates detected by the LIGO, Virgo, and KAGRA observatories through the first part of the fourth observing run (O4a: 2023 May 24 15:00:00 to 2024 January 16 16:00:00 UTC) and a preceding engineering run. In this new data, we find 128 new compact binary coalescence candidates that are identified by at least one of our s… ▽ More Version 4.0 of the Gravitational-Wave Transient Catalog (GWTC-4.0) adds new candidates detected by the LIGO, Virgo, and KAGRA observatories through the first part of the fourth observing run (O4a: 2023 May 24 15:00:00 to 2024 January 16 16:00:00 UTC) and a preceding engineering run. In this new data, we find 128 new compact binary coalescence candidates that are identified by at least one of our search algorithms with a probability of astrophysical origin $p_{\rm astro} \geq 0.5$ and that are not vetoed during event validation. We also provide detailed source property measurements for 86 of these that have a false alarm rate $< 1 \rm{yr}^{-1}$. Based on the inferred component masses, these new candidates are consistent with signals from binary black holes and neutron star-black hole binaries (GW230518_125908 and GW230529_181500). Median inferred component masses of binary black holes in the catalog now range from $5.79\,M_\odot$ (GW230627_015337) to $137\,M_\odot$ (GW231123_135430), while GW231123_135430 was probably produced by the most massive binary observed in the catalog. For the first time we have discovered binary black hole signals with network signal-to-noise ratio exceeding 30, GW230814_230901 and GW231226_01520, enabling high-fidelity studies of the waveforms and astrophysical properties of these systems. Combined with the 90 candidates included in GWTC-3.0, the catalog now contains 218 candidates with $p_{\rm astro} \geq 0.5$ and not otherwise vetoed, doubling the size of the catalog and further opening our view of the gravitational-wave Universe. △ Less

Submitted 8 September, 2025; v1 submitted 25 August, 2025; originally announced August 2025.

Comments: As part of the Astrophysical Journal Letters Focus Issue on the Gravitational Wave Transient Catalog

Report number: LIGO-P2400386

arXiv:2508.18081 [pdf, ps, other]

GWTC-4.0: Methods for Identifying and Characterizing Gravitational-wave Transients

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, S. Ahmadzadeh, L. Aiello, A. Ain, P. Ajith, S. Akcay, T. Akutsu, S. Albanesi, R. A. Alfaidi , et al. (1787 additional authors not shown)

Abstract: The Gravitational-Wave Transient Catalog (GWTC) is a collection of candidate gravitational-wave transient signals identified and characterized by the LIGO-Virgo-KAGRA Collaboration. Producing the contents of the GWTC from detector data requires complex analysis methods. These comprise techniques to model the signal; identify the transients in the data; evaluate the quality of the data and mitigate… ▽ More The Gravitational-Wave Transient Catalog (GWTC) is a collection of candidate gravitational-wave transient signals identified and characterized by the LIGO-Virgo-KAGRA Collaboration. Producing the contents of the GWTC from detector data requires complex analysis methods. These comprise techniques to model the signal; identify the transients in the data; evaluate the quality of the data and mitigate possible instrumental issues; infer the parameters of each transient; compare the data with the waveform models for compact binary coalescences; and handle the large amount of results associated with all these different analyses. In this paper, we describe the methods employed to produce the catalog's fourth release, GWTC-4.0, focusing on the analysis of the first part of the fourth observing run of Advanced LIGO, Advanced Virgo and KAGRA. △ Less

Submitted 25 August, 2025; originally announced August 2025.

Comments: As part of the Astrophysical Journal Letters Focus Issue on the Gravitational Wave Transient Catalog

Report number: LIGO-P2400300

arXiv:2508.18080 [pdf, ps, other]

GWTC-4.0: An Introduction to Version 4.0 of the Gravitational-Wave Transient Catalog

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, S. Ahmadzadeh, L. Aiello, A. Ain, P. Ajith, S. Akcay, T. Akutsu, S. Albanesi, R. A. Alfaidi , et al. (1786 additional authors not shown)

Abstract: The Gravitational-Wave Transient Catalog (GWTC) is a collection of short-duration (transient) gravitational wave signals identified by the LIGO-Virgo-KAGRA Collaboration in gravitational-wave data produced by the eponymous detectors. The catalog provides information about the identified candidates, such as the arrival time and amplitude of the signal and properties of the signal's source as inferr… ▽ More The Gravitational-Wave Transient Catalog (GWTC) is a collection of short-duration (transient) gravitational wave signals identified by the LIGO-Virgo-KAGRA Collaboration in gravitational-wave data produced by the eponymous detectors. The catalog provides information about the identified candidates, such as the arrival time and amplitude of the signal and properties of the signal's source as inferred from the observational data. GWTC is the data release of this dataset and version 4.0 extends the catalog to include observations made during the first part of the fourth LIGO-Virgo-KAGRA observing run up until 2024 January 31. This paper marks an introduction to a collection of articles related to this version of the catalog, GWTC-4.0. The collection of articles accompanying the catalog provides documentation of the methods used to analyze the data, summaries of the catalog of events, observational measurements drawn from the population, and detailed discussions of selected candidates △ Less

Submitted 23 September, 2025; v1 submitted 25 August, 2025; originally announced August 2025.

Comments: As part of the Astrophysical Journal Letters Focus Issue on the Gravitational Wave Transient Catalog. Update following peer review

Report number: LIGO-P2400293

arXiv:2508.18079 [pdf, ps, other]

Open Data from LIGO, Virgo, and KAGRA through the First Part of the Fourth Observing Run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu , et al. (1746 additional authors not shown)

Abstract: LIGO, Virgo, and KAGRA form a network of gravitational-wave observatories. Data and analysis results from this network are made publicly available through the Gravitational Wave Open Science Center. This paper describes open data from this network, including the addition of data from the first part of the fourth observing run (O4a) and selected periods from the preceding engineering run, collected… ▽ More LIGO, Virgo, and KAGRA form a network of gravitational-wave observatories. Data and analysis results from this network are made publicly available through the Gravitational Wave Open Science Center. This paper describes open data from this network, including the addition of data from the first part of the fourth observing run (O4a) and selected periods from the preceding engineering run, collected from May 2023 to January 2024. The public data set includes calibrated strain time series for each instrument, data from additional channels used for noise subtraction and detector characterization, and analysis data products from version 4.0 of the Gravitational-Wave Transient Catalog. △ Less

Submitted 3 September, 2025; v1 submitted 25 August, 2025; originally announced August 2025.

Comments: 27 pages

Report number: LIGO-P2500167

arXiv:2508.17129 [pdf, ps, other]

Reconciling Communication Compression and Byzantine-Robustness in Distributed Learning

Authors: Diksha Gupta, Nirupam Gupta, Chuan Xu, Giovanni Neglia

Abstract: Distributed learning (DL) enables scalable model training over decentralized data, but remains challenged by Byzantine faults and high communication costs. While both issues have been studied extensively in isolation, their interaction is less explored. Prior work shows that naively combining communication compression with Byzantine-robust aggregation degrades resilience to faulty nodes (or worker… ▽ More Distributed learning (DL) enables scalable model training over decentralized data, but remains challenged by Byzantine faults and high communication costs. While both issues have been studied extensively in isolation, their interaction is less explored. Prior work shows that naively combining communication compression with Byzantine-robust aggregation degrades resilience to faulty nodes (or workers). The state-of-the-art algorithm, namely Byz-DASHA-PAGE [29], makes use of the momentum variance reduction scheme to mitigate the detrimental impact of compression noise on Byzantine-robustness. We propose a new algorithm, named RoSDHB, that integrates the classic Polyak's momentum with a new coordinated compression mechanism. We show that RoSDHB performs comparably to Byz-DASHA-PAGE under the standard (G, B)-gradient dissimilarity heterogeneity model, while it relies on fewer assumptions. In particular, we only assume Lipschitz smoothness of the average loss function of the honest workers, in contrast to [29]that additionally assumes a special smoothness of bounded global Hessian variance. Empirical results on benchmark image classification task show that RoSDHB achieves strong robustness with significant communication savings. △ Less

Submitted 23 August, 2025; originally announced August 2025.

Comments: 78 Pages, 1 figure

ACM Class: I.2.11; G.1.6

arXiv:2508.16753 [pdf, ps, other]

GAICo: A Deployed and Extensible Framework for Evaluating Diverse and Multimodal Generative AI Outputs

Authors: Nitin Gupta, Pallav Koppisetti, Kausik Lakkaraju, Biplav Srivastava

Abstract: The rapid proliferation of Generative AI (GenAI) into diverse, high-stakes domains necessitates robust and reproducible evaluation methods. However, practitioners often resort to ad-hoc, non-standardized scripts, as common metrics are often unsuitable for specialized, structured outputs (e.g., automated plans, time-series) or holistic comparison across modalities (e.g., text, audio, and image). Th… ▽ More The rapid proliferation of Generative AI (GenAI) into diverse, high-stakes domains necessitates robust and reproducible evaluation methods. However, practitioners often resort to ad-hoc, non-standardized scripts, as common metrics are often unsuitable for specialized, structured outputs (e.g., automated plans, time-series) or holistic comparison across modalities (e.g., text, audio, and image). This fragmentation hinders comparability and slows AI system development. To address this challenge, we present GAICo (Generative AI Comparator): a deployed, open-source Python library that streamlines and standardizes GenAI output comparison. GAICo provides a unified, extensible framework supporting a comprehensive suite of reference-based metrics for unstructured text, specialized structured data formats, and multimedia (images, audio). Its architecture features a high-level API for rapid, end-to-end analysis, from multi-model comparison to visualization and reporting, alongside direct metric access for granular control. We demonstrate GAICo's utility through a detailed case study evaluating and debugging complex, multi-modal AI Travel Assistant pipelines. GAICo empowers AI researchers and developers to efficiently assess system performance, make evaluation reproducible, improve development velocity, and ultimately build more trustworthy AI systems, aligning with the goal of moving faster and safer in AI deployment. Since its release on PyPI in Jun 2025, the tool has been downloaded over 13K times, across versions, by Aug 2025, demonstrating growing community interest. △ Less

Submitted 24 October, 2025; v1 submitted 22 August, 2025; originally announced August 2025.

Comments: 11 pages, 7 figures, accepted at IAAI/AAAI 2026; updated with figures, captions, and acknowledgments

arXiv:2508.09495 [pdf, ps, other]

Stingrays in the radio sky: Two unusual diffuse radio relic sources in the direction of the Magellanic Stream

Authors: Zachary J Smeaton, Miroslav D Filipovic, Barbel S Koribalski, Manami Sasaki, Rami Z E Alsaberi, Aaron C Bradley, Evan J Crawford, Shi Dai, Nikhel Gupta, Frank Haberl, Andrew M Hopkins, Thomas H Jarrett, Sanja Lazarević, Denis Leahy, Peter Macgregor, Gavin Rowell, Stanislav S Shabala, Dejan Urosevic, Jacco Th van Loon, Tessa Vernstrom

Abstract: We present the discovery of two extended, low surface brightness radio continuum sources, each consisting of a near-circular body and an extended tail of emission, nicknamed Stingray 1 (ASKAP J0129-5350) and Stingray 2 (ASKAP J0245-5642). Both are found in the direction of the Magellanic Stream (MS) and were discovered in the Australian Square Kilometre Array Pathfinder (ASKAP) Evolutionary Map of… ▽ More We present the discovery of two extended, low surface brightness radio continuum sources, each consisting of a near-circular body and an extended tail of emission, nicknamed Stingray 1 (ASKAP J0129-5350) and Stingray 2 (ASKAP J0245-5642). Both are found in the direction of the Magellanic Stream (MS) and were discovered in the Australian Square Kilometre Array Pathfinder (ASKAP) Evolutionary Map of the Universe (EMU) survey at 944 MHz. We combine the ASKAP data with low-frequency radio observations from the GaLactic and Extragalactic All-sky MWA Survey (GLEAM) to conduct a radio continuum analysis. We explore both Galactic/near Galactic scenarios, including runaway or circumgalactic supernova remnants (SNRs) and parentless pulsar-wind nebulae (PWNe), and extragalactic scenarios including radio active galactic nuclei (AGNs), dying radio galaxies, galaxy clusters, galaxy pairs or groups, head-tail radio galaxies, and Odd Radio Circles (ORCs), as well as the possibility that the morphology is due to a chance alignment. The Stingrays exhibit non-thermal emission with spectral indices of $α$ = -0.89 $\pm$ 0.09 for Stingray 1 and $α$ = -1.77 $\pm$ 0.06 for Stingray 2. We find that none of the proposed scenarios can explain all of the observed properties, however we determine it most likely that their shape is caused by some kind of complex environmental interaction. The most likely scenario from the available data is that of a head-tail radio galaxy, but more data is required for a definitive classification. △ Less

Submitted 13 August, 2025; originally announced August 2025.

arXiv:2508.07185 [pdf, ps, other]

DySK-Attn: A Framework for Efficient, Real-Time Knowledge Updating in Large Language Models via Dynamic Sparse Knowledge Attention

Authors: Kabir Khan, Priya Sharma, Arjun Mehta, Neha Gupta, Ravi Narayanan

Abstract: Large Language Models (LLMs) suffer from a critical limitation: their knowledge is static and quickly becomes outdated. Retraining these massive models is computationally prohibitive, while existing knowledge editing techniques can be slow and may introduce unforeseen side effects. To address this, we propose DySK-Attn, a novel framework that enables LLMs to efficiently integrate real-time knowled… ▽ More Large Language Models (LLMs) suffer from a critical limitation: their knowledge is static and quickly becomes outdated. Retraining these massive models is computationally prohibitive, while existing knowledge editing techniques can be slow and may introduce unforeseen side effects. To address this, we propose DySK-Attn, a novel framework that enables LLMs to efficiently integrate real-time knowledge from a dynamic external source. Our approach synergizes an LLM with a dynamic Knowledge Graph (KG) that can be updated instantaneously. The core of our framework is a sparse knowledge attention mechanism, which allows the LLM to perform a coarse-to-fine grained search, efficiently identifying and focusing on a small, highly relevant subset of facts from the vast KG. This mechanism avoids the high computational cost of dense attention over the entire knowledge base and mitigates noise from irrelevant information. We demonstrate through extensive experiments on time-sensitive question-answering tasks that DySK-Attn significantly outperforms strong baselines, including standard Retrieval-Augmented Generation (RAG) and model editing techniques, in both factual accuracy for updated knowledge and computational efficiency. Our framework offers a scalable and effective solution for building LLMs that can stay current with the ever-changing world. △ Less

Submitted 10 August, 2025; originally announced August 2025.

Comments: Preprint; 7 figures, 3 tables, 1 algorithm; v1. Code and data will be released

ACM Class: I.2.7; H.3.3; H.2.8

arXiv:2507.23337 [pdf, ps, other]

doi 10.1017/pasa.2025.10076

EMU and the DRAGNs I: A Catalogue of DRAGNs

Authors: Ray P. Norris, Miranda Yew, Evan Crawford, Nikhel Gupta, Lawrence Rudnick, H. Andernach, Miroslav D. Filipović, Yjan A. Gordon, Andrew M. Hopkins, Laurence Park, Michael J. I. Brown, Ana Jimenez-Gallardo, S. S. Shabala

Abstract: We present a catalogue of 3557 Double Radio sources associated with Active Galactic Nuclei (DRAGNs) from the First Pilot Survey of the Evolutionary Map of the Universe (EMU), observed at 944 MHz with the Australian Square Kilometre Array Pathfinder (ASKAP) telescope, covering 270 deg^2. We have extracted and identified each source by eye, tagged it with a morphological type and measured its parame… ▽ More We present a catalogue of 3557 Double Radio sources associated with Active Galactic Nuclei (DRAGNs) from the First Pilot Survey of the Evolutionary Map of the Universe (EMU), observed at 944 MHz with the Australian Square Kilometre Array Pathfinder (ASKAP) telescope, covering 270 deg^2. We have extracted and identified each source by eye, tagged it with a morphological type and measured its parameters. The resulting catalogue will be used in subsequent papers to explore the properties of these sources, to train machine-learning algorithms for the detection of these sources in larger fields, and to compare with the results of Citizen Science projects, with the ultimate goal of understanding the physical processes that drive DRAGNs. Compared with earlier, lower sensitivity, catalogues, we find more diffuse structure and a plethora of more complex structures, ranging from wings of radio emission on the side of the jets, to types of object which have not been seen in earlier observations. As well as the well-known FR1 and FR2 sources, we find significant numbers of rare types of radio source such as Hybrid Morphology Radio Sources and one-sided jets, as well as a wide range of bent-tail and head-tail sources. △ Less

Submitted 31 July, 2025; originally announced July 2025.

Comments: Accepted by PASA

Journal ref: Publ. Astron. Soc. Aust. 42 (2025) e124

arXiv:2507.13264 [pdf, ps, other]

Voxtral

Authors: Alexander H. Liu, Andy Ehrenberg, Andy Lo, Clément Denoix, Corentin Barreau, Guillaume Lample, Jean-Malo Delignon, Khyathi Raghavi Chandu, Patrick von Platen, Pavankumar Reddy Muddireddy, Sanchit Gandhi, Soham Ghosh, Srijan Mishra, Thomas Foubert, Abhinav Rastogi, Adam Yang, Albert Q. Jiang, Alexandre Sablayrolles, Amélie Héliou, Amélie Martin, Anmol Agarwal, Antoine Roux, Arthur Darcet, Arthur Mensch, Baptiste Bout , et al. (81 additional authors not shown)

Abstract: We present Voxtral Mini and Voxtral Small, two multimodal audio chat models. Voxtral is trained to comprehend both spoken audio and text documents, achieving state-of-the-art performance across a diverse range of audio benchmarks, while preserving strong text capabilities. Voxtral Small outperforms a number of closed-source models, while being small enough to run locally. A 32K context window enab… ▽ More We present Voxtral Mini and Voxtral Small, two multimodal audio chat models. Voxtral is trained to comprehend both spoken audio and text documents, achieving state-of-the-art performance across a diverse range of audio benchmarks, while preserving strong text capabilities. Voxtral Small outperforms a number of closed-source models, while being small enough to run locally. A 32K context window enables the model to handle audio files up to 40 minutes in duration and long multi-turn conversations. We also contribute three benchmarks for evaluating speech understanding models on knowledge and trivia. Both Voxtral models are released under Apache 2.0 license. △ Less

Submitted 17 July, 2025; originally announced July 2025.

Comments: 17 pages

arXiv:2507.12282 [pdf, ps, other]

All-sky search for long-duration gravitational-wave transients in the first part of the fourth LIGO-Virgo-KAGRA Observing run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu , et al. (1750 additional authors not shown)

Abstract: We present an all-sky search for long-duration gravitational waves (GWs) from the first part of the LIGO-Virgo-KAGRA fourth observing run (O4), called O4a and comprising data taken between 24 May 2023 and 16 January 2024. The GW signals targeted by this search are the so-called "long-duration" (> 1 s) transients expected from a variety of astrophysical processes, including non-axisymmetric deforma… ▽ More We present an all-sky search for long-duration gravitational waves (GWs) from the first part of the LIGO-Virgo-KAGRA fourth observing run (O4), called O4a and comprising data taken between 24 May 2023 and 16 January 2024. The GW signals targeted by this search are the so-called "long-duration" (> 1 s) transients expected from a variety of astrophysical processes, including non-axisymmetric deformations in magnetars or eccentric binary coalescences. We make minimal assumptions on the emitted GW waveforms in terms of morphologies and durations. Overall, our search targets signals with durations ~1-1000 s and frequency content in the range 16-2048 Hz. In the absence of significant detections, we report the sensitivity limits of our search in terms of root-sum-square signal amplitude (hrss) of reference waveforms. These limits improve upon the results from the third LIGO-Virgo-KAGRA observing run (O3) by about 30% on average. Moreover, this analysis demonstrates substantial progress in our ability to search for long-duration GW signals owing to enhancements in pipeline detection efficiencies. As detector sensitivities continue to advance and observational runs grow longer, unmodeled long-duration searches will increasingly be able to explore a range of compelling astrophysical scenarios involving neutron stars and black holes. △ Less

Submitted 23 July, 2025; v1 submitted 16 July, 2025; originally announced July 2025.

Report number: LIGO-P2500090-v6

arXiv:2507.10775 [pdf, ps, other]

A New Dataset and Performance Benchmark for Real-time Spacecraft Segmentation in Onboard Flight Computers

Authors: Jeffrey Joan Sam, Janhavi Sathe, Nikhil Chigali, Naman Gupta, Radhey Ruparel, Yicheng Jiang, Janmajay Singh, James W. Berck, Arko Barman

Abstract: Spacecraft deployed in outer space are routinely subjected to various forms of damage due to exposure to hazardous environments. In addition, there are significant risks to the subsequent process of in-space repairs through human extravehicular activity or robotic manipulation, incurring substantial operational costs. Recent developments in image segmentation could enable the development of reliab… ▽ More Spacecraft deployed in outer space are routinely subjected to various forms of damage due to exposure to hazardous environments. In addition, there are significant risks to the subsequent process of in-space repairs through human extravehicular activity or robotic manipulation, incurring substantial operational costs. Recent developments in image segmentation could enable the development of reliable and cost-effective autonomous inspection systems. While these models often require large amounts of training data to achieve satisfactory results, publicly available annotated spacecraft segmentation data are very scarce. Here, we present a new dataset of nearly 64k annotated spacecraft images that was created using real spacecraft models, superimposed on a mixture of real and synthetic backgrounds generated using NASA's TTALOS pipeline. To mimic camera distortions and noise in real-world image acquisition, we also added different types of noise and distortion to the images. Finally, we finetuned YOLOv8 and YOLOv11 segmentation models to generate performance benchmarks for the dataset under well-defined hardware and inference time constraints to mimic real-world image segmentation challenges for real-time onboard applications in space on NASA's inspector spacecraft. The resulting models, when tested under these constraints, achieved a Dice score of 0.92, Hausdorff distance of 0.69, and an inference time of about 0.5 second. The dataset and models for performance benchmark are available at https://github.com/RiceD2KLab/SWiM. △ Less

Submitted 14 July, 2025; originally announced July 2025.

arXiv:2507.09244 [pdf, ps, other]

Deep Learning for sub-THz Radio Unit Selection using sub-10 GHz Channel Information and Inferred Device Beamforming

Authors: Nishant Gupta, Muris Sarajlic, Erik G. Larsson

Abstract: The dense and distributed deployment of sub-THz radio units (RUs) alongside sub-10 GHz access point (AP) is a promising approach to provide high data rate and reliable coverage for future 6G applications. However, beam search or RU selection for the sub-THz RUs incurs significant overhead and high power consumption. To address this, we introduce a method that leverages deep learning to infer a sui… ▽ More The dense and distributed deployment of sub-THz radio units (RUs) alongside sub-10 GHz access point (AP) is a promising approach to provide high data rate and reliable coverage for future 6G applications. However, beam search or RU selection for the sub-THz RUs incurs significant overhead and high power consumption. To address this, we introduce a method that leverages deep learning to infer a suitable sub-THz RU candidate from a set of sub-THz RUs using the sub-10 GHz channel characteristics. A novel aspect of this work is the consideration of inter-band beam configuration (IBBC), defined as the broadside angle between the low-band and high-band antenna patterns of the user equipment (UE). Since IBBC indicates the beamforming information or UE's orientation, it is typically not shared with the network as a part of signalling. Therefore, we propose a solution strategy to infer a suitable sub-THz RU even when UEs do not share their IBBC information. Simulation results illustrate the performance of the inferred sub-THz RU and highlights the detrimental impact of neglecting UE orientation on the systems performance. △ Less

Submitted 12 July, 2025; originally announced July 2025.

Comments: Accepted for Publication in IEEE VTC-Spring 2025, held at Oslo, Norway

arXiv:2507.08219 [pdf, ps, other]

GW231123: a Binary Black Hole Merger with Total Mass 190-265 $M_{\odot}$

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, D. Adhikari, N. Adhikari, R. X. Adhikari, V. K. Adkins, S. Afroz, A. Agapito, D. Agarwal, M. Agathos, N. Aggarwal, S. Aggarwal, O. D. Aguiar, I. -L. Ahrend, L. Aiello, A. Ain, P. Ajith, T. Akutsu , et al. (1763 additional authors not shown)

Abstract: On 2023 November 23 the two LIGO observatories both detected GW231123, a gravitational-wave signal consistent with the merger of two black holes with masses $137^{+22}_{-17}\, M_\odot$ and $103^{+20}_{-52}\, M_\odot$ (90\% credible intervals), at luminosity distance 0.7-4.1 Gpc and redshift of $0.39^{+0.27}_{-0.24}$, and a network signal-to-noise ratio of $\sim$22.5. Both black holes exhibit high… ▽ More On 2023 November 23 the two LIGO observatories both detected GW231123, a gravitational-wave signal consistent with the merger of two black holes with masses $137^{+22}_{-17}\, M_\odot$ and $103^{+20}_{-52}\, M_\odot$ (90\% credible intervals), at luminosity distance 0.7-4.1 Gpc and redshift of $0.39^{+0.27}_{-0.24}$, and a network signal-to-noise ratio of $\sim$22.5. Both black holes exhibit high spins, $0.9^{+0.10}_{-0.19}$ and $0.80^{+0.20}_{-0.51}$ respectively. A massive black hole remnant is supported by an independent ringdown analysis. Some properties of GW231123 are subject to large systematic uncertainties, as indicated by differences in inferred parameters between signal models. The primary black hole lies within or above the theorized mass gap where black holes between 60-130 $M_\odot$ should be rare due to pair instability mechanisms, while the secondary spans the gap. The observation of GW231123 therefore suggests the formation of black holes from channels beyond standard stellar collapse, and that intermediate-mass black holes of mass $\sim$200 $M_\odot$ form through gravitational-wave driven mergers. △ Less

Submitted 11 August, 2025; v1 submitted 10 July, 2025; originally announced July 2025.

Comments: 27 pages, 10 figures

Report number: DCC: P2500026-v6

arXiv:2507.06692 [pdf, ps, other]

A Unified Approach to Calculating Sylvester Sums

Authors: Neha Gupta, Manoj Upreti

Abstract: In the context of the Frobenius coin problem, given two relatively prime positive integers $a$ and $b$, the set of nonrepresentable numbers consists of positive integers that cannot be expressed as nonnegative integer combination of $a$ and $b$. This work provides a formula for calculating the power sums of all nonrepresentable numbers, also known as the Sylvester sums. Although alternative formul… ▽ More In the context of the Frobenius coin problem, given two relatively prime positive integers $a$ and $b$, the set of nonrepresentable numbers consists of positive integers that cannot be expressed as nonnegative integer combination of $a$ and $b$. This work provides a formula for calculating the power sums of all nonrepresentable numbers, also known as the Sylvester sums. Although alternative formulas exist in the literature, our approach is based on an elementary observation. We consider the set of natural numbers from $1$ to $ab - 1$ and compute their total sum in two distinct ways, which leads naturally to the desired Sylvester sums. This method connects an analytic identity with a combinatorial viewpoint, giving a new way to understand these classical quantities. Furthermore, in this paper, we establish a criterion using the division algorithm to determine whether a given positive integer is nonrepresentable. △ Less

Submitted 9 July, 2025; originally announced July 2025.

arXiv:2507.06261 [pdf, ps, other]

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3410 additional authors not shown)

Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal understanding and it is now able to process up to 3 hours of video content. Its unique combination of long context, multimodal and reasoning capabilities can be combined to unlock new agentic workflows. Gemini 2.5 Flash provides excellent reasoning abilities at a fraction of the compute and latency requirements and Gemini 2.0 Flash and Flash-Lite provide high performance at low latency and cost. Taken together, the Gemini 2.X model generation spans the full Pareto frontier of model capability vs cost, allowing users to explore the boundaries of what is possible with complex agentic problem solving. △ Less

Submitted 16 October, 2025; v1 submitted 7 July, 2025; originally announced July 2025.

Comments: 72 pages, 17 figures

arXiv:2506.21906 [pdf, ps, other]

doi 10.1017/pasa.2025.10067

Quantifying Radio Source Morphology

Authors: Lachlan J. Barnes, Andrew M. Hopkins, Lawrence Rudnick, Heinz Andernach, Michael Cowley, Nikhel Gupta, Ray P. Norris, Stanislav S. Shabala, Tayyaba Zafar

Abstract: The advent of next-generation telescope facilities brings with it an unprecedented amount of data, and the demand for effective tools to process and classify this information has become increasingly important. This work proposes a novel approach to quantify the radio galaxy morphology, through the development of a series of algorithmic metrics that can quantitatively describe the structure of radi… ▽ More The advent of next-generation telescope facilities brings with it an unprecedented amount of data, and the demand for effective tools to process and classify this information has become increasingly important. This work proposes a novel approach to quantify the radio galaxy morphology, through the development of a series of algorithmic metrics that can quantitatively describe the structure of radio source, and can be applied to radio images in an automatic way. These metrics are intuitive in nature and are inspired by the intrinsic structural differences observed between the existing Fanaroff-Riley (FR) morphology types. The metrics are defined in categories of asymmetry, blurriness, concentration, disorder, and elongation ($ABCDE$/single-lobe metrics), as well as the asymmetry and angle between lobes (source metrics). We apply these metrics to a sample of $480$ sources from the Evolutionary Map of the Universe Pilot Survey (EMU-PS) and $72$ well resolved extensively studied sources from An Atlas of DRAGNs, a subset of the revised Third Cambridge Catalogue of Radio Sources (3CRR). We find that these metrics are relatively robust to resolution changes, independent of each other, and measure fundamentally different structural components of radio galaxy lobes. These metrics work particularly well for sources with reasonable signal-to-noise and well separated lobes. We also find that we can recover the original FR classification using probabilistic combinations of our metrics, highlighting the usefulness of our approach for future large data sets from radio sky surveys. △ Less

Submitted 27 June, 2025; originally announced June 2025.

Comments: 22 pages, 28 figures

Journal ref: Publ. Astron. Soc. Aust. 42 (2025) e105

arXiv:2506.21518 [pdf, ps, other]

New plasmon-like mode in PdTe$_{2}$: Raman scattering and memory function study

Authors: Bharathiganesh Devanarayanan, Sahil Rathi, Jalaja Pandya, Sonika, C. S. Yadav, Navinder Singh, Satyendra Nath Gupta

Abstract: PdTe$_2$ is a type II Dirac semimetal that has garnered significant attention due to its intriguing electronic and topological properties. Here, we report temperature dependent Raman scattering study of PdTe$_2$ in the temperature range from 10 K to 300 K. Our study reveals emergence of a new unreported peak below 100 K, centered around 250 cm$^{-1}$. We argue that the new mode is not a phonon mod… ▽ More PdTe$_2$ is a type II Dirac semimetal that has garnered significant attention due to its intriguing electronic and topological properties. Here, we report temperature dependent Raman scattering study of PdTe$_2$ in the temperature range from 10 K to 300 K. Our study reveals emergence of a new unreported peak below 100 K, centered around 250 cm$^{-1}$. We argue that the new mode is not a phonon mode because the Raman spectra calculated using Density Functional Theory shows only two intense peaks at 85 $ cm^{-1}$ and 128 $cm^{-1}$. To ascertain the origin of this new peak, we constructed a microscopic model of electrons coupling to a single plasmon mode at 250 $cm^{-1}$ and using the memory function formalism, we obtained that the Raman relaxation rate is linear in frequency. We also performed phenomenological analysis of the Raman response from the experimental data and computed frequency dependent Raman relaxation rate, which is also found to exhibit a linear dependence on frequency. With the congruence of our theoretical and phenomenological results we could ascertain that the new mode observed at low temperatures is indeed a plasmon-like mode. Further, phonon frequencies and line widths of the two phonon modes exhibit anomalous behavior above 100 K. △ Less

Submitted 26 June, 2025; originally announced June 2025.

Comments: 15 pages, 7 figures

arXiv:2506.18020 [pdf, ps, other]

Byzantine Failures Harm the Generalization of Robust Distributed Learning Algorithms More Than Data Poisoning

Authors: Thomas Boudou, Batiste Le Bars, Nirupam Gupta, Aurélien Bellet

Abstract: Robust distributed learning algorithms aim to maintain reliable performance despite the presence of misbehaving workers. Such misbehaviors are commonly modeled as Byzantine failures, allowing arbitrarily corrupted communication, or as data poisoning, a weaker form of corruption restricted to local training data. While prior work shows similar optimization guarantees for both models, an important q… ▽ More Robust distributed learning algorithms aim to maintain reliable performance despite the presence of misbehaving workers. Such misbehaviors are commonly modeled as Byzantine failures, allowing arbitrarily corrupted communication, or as data poisoning, a weaker form of corruption restricted to local training data. While prior work shows similar optimization guarantees for both models, an important question remains: How do these threat models impact generalization? Empirical evidence suggests a gap, yet it remains unclear whether it is unavoidable or merely an artifact of suboptimal attacks. We show, for the first time, a fundamental gap in generalization guarantees between the two threat models: Byzantine failures yield strictly worse rates than those achievable under data poisoning. Our findings leverage a tight algorithmic stability analysis of robust distributed learning. Specifically, we prove that: (i) under data poisoning, the uniform algorithmic stability of an algorithm with optimal optimization guarantees degrades by an additive factor of $\varTheta ( \frac{f}{n-f} )$, with $f$ out of $n$ workers misbehaving; whereas $\textit{(ii)}$ under Byzantine failures, the degradation is in $Ω\big( \sqrt{ \frac{f}{n-2f}} \big)$. △ Less

Submitted 16 October, 2025; v1 submitted 22 June, 2025; originally announced June 2025.

arXiv:2506.15090 [pdf, ps, other]

EMUSE: Evolutionary Map of the Universe Search Engine

Authors: Nikhel Gupta, Zeeshan Hayder, Minh Huynh, Ray P. Norris, Lars Petersson, Andrew M. Hopkins, Simone Riggi, Bärbel S. Koribalski, Miroslav D. Filipović

Abstract: We present EMUSE (Evolutionary Map of the Universe Search Engine), a tool designed for searching specific radio sources within the extensive datasets of the EMU (Evolutionary Map of the Universe) survey, with potential applications to other Big Data challenges in astronomy. Built on a multimodal approach to radio source classification and retrieval, EMUSE fine-tunes the OpenCLIP model on curated r… ▽ More We present EMUSE (Evolutionary Map of the Universe Search Engine), a tool designed for searching specific radio sources within the extensive datasets of the EMU (Evolutionary Map of the Universe) survey, with potential applications to other Big Data challenges in astronomy. Built on a multimodal approach to radio source classification and retrieval, EMUSE fine-tunes the OpenCLIP model on curated radio galaxy datasets. Leveraging the power of foundation models, our work integrates visual and textual embeddings to enable efficient and flexible searches within large radio astronomical datasets. We fine-tune OpenCLIP using a dataset of 2,900 radio galaxies, encompassing various morphological classes, including FR-I, FR-II, FR-x, R-type, and other rare and peculiar sources. The model is optimized using adapter-based fine-tuning, ensuring computational efficiency while capturing the unique characteristics of radio sources. The fine-tuned model is then deployed in EMUSE, allowing for seamless image- and text-based queries over the EMU survey dataset. Our results demonstrate the model's effectiveness in retrieving and classifying radio sources, particularly in recognizing distinct morphological features. However, challenges remain in identifying rare or previously unseen radio sources, highlighting the need for expanded datasets and continuous refinement. This study showcases the potential of multimodal machine learning in radio astronomy, paving the way for more scalable and accurate search tools in the field. The search engine is accessible at https://askap-emuse.streamlit.app/ and can be used locally by cloning the repository at https://github.com/Nikhel1/EMUSE. △ Less

Submitted 17 June, 2025; originally announced June 2025.

Comments: 19 pages, 9 figures, accepted for publication in PASA

arXiv:2506.14891 [pdf, ps, other]

Worldsheet CFT$_2$ and Celestial CFT$_2$ : An AdS$_3$-CFT$_2$ perspective

Authors: Shamik Banerjee, Nishant Gupta, Sagnik Misra

Abstract: Celestial CFT$_d$ is the putative dual of quantum gravity in asymptotically flat $(d+2)$ dimensional space time. We argue that a class of Celestial CFT$_d$ can be engineered via AdS$_{d+1}$-CFT$_d$ correspondence. Our argument is based on the observation that if we zoom in near the boundary of (Euclidean) AdS$_{d+1}$ then the conformal isometry group of EAdS$_{d+1}$, which is SO$(d+2,1)$, contract… ▽ More Celestial CFT$_d$ is the putative dual of quantum gravity in asymptotically flat $(d+2)$ dimensional space time. We argue that a class of Celestial CFT$_d$ can be engineered via AdS$_{d+1}$-CFT$_d$ correspondence. Our argument is based on the observation that if we zoom in near the boundary of (Euclidean) AdS$_{d+1}$ then the conformal isometry group of EAdS$_{d+1}$, which is SO$(d+2,1)$, contracts to the Poincare group ISO$(d+1,1)$. This suggests that the near boundary scaling limit of a theory of \textit{conformal} gravity on EAdS$_{d+1}$ should be dual to a boundary CFT$_d$ with ISO$(d+1,1)$ symmetry. This dual CFT$_d$, since the symmetries match, is an example of a Celestial CFT$_d$. Similarly, if we have a \textit{non-conformal} theory of gravity on EAdS$_{d+1}$ then the near boundary scaling limit of such a theory is dual to a (boundary) Celestial CFT$_d$ with \textit{only} (SO$(d+1,1)$) Lorentz invariance. Celestial CFTs with only Lorentz invariance have been recently studied in the literature. Now following this logic we discuss, among other things, the near boundary scaling limit of the bosonic string theory on Euclidean AdS$_3$ in the presence of the NS-NS B field. The AdS$_3$ part of the worldsheet theory is free in this limit and has been studied in the literature in different contexts. This limit describes a ``long string'' which wraps the (Euclidean) AdS$_3$ boundary and it has been argued that the space-time CFT$_2$ which describes the radial fluctuations of a long string is a Liouville CFT. According to our proposal, the dual CFT$_2$ which describes the \textit{long string sector} is an example of a \textit{Celestial} CFT$_2$ with \textit{only} (SO$(3,1)$)Lorentz invariance. We do not get a full ISO$(3,1)$ invariant Celestial CFT$_2$ in this way because the string theory does not have target space conformal invariance. △ Less

Submitted 17 June, 2025; originally announced June 2025.

Comments: Latex, 23 Pages

arXiv:2506.12103 [pdf, other]

The Amazon Nova Family of Models: Technical Report and Model Card

Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents and text. Amazon Nova Micro is a text-only model that delivers our lowest-latency responses at very low cost. Amazon Nova Canvas is an image generation model that creates professional grade images with rich customization controls. Amazon Nova Reel is a video generation model offering high-quality outputs, customization, and motion control. Our models were built responsibly and with a commitment to customer trust, security, and reliability. We report benchmarking results for core capabilities, agentic performance, long context, functional adaptation, runtime performance, and human evaluation. △ Less

Submitted 17 March, 2025; originally announced June 2025.

Comments: 48 pages, 10 figures

Report number: 20250317

arXiv:2506.10910 [pdf, ps, other]

Magistral

Authors: Mistral-AI, :, Abhinav Rastogi, Albert Q. Jiang, Andy Lo, Gabrielle Berrada, Guillaume Lample, Jason Rute, Joep Barmentlo, Karmesh Yadav, Kartik Khandelwal, Khyathi Raghavi Chandu, Léonard Blier, Lucile Saulnier, Matthieu Dinot, Maxime Darrin, Neha Gupta, Roman Soletskyi, Sagar Vaze, Teven Le Scao, Yihan Wang, Adam Yang, Alexander H. Liu, Alexandre Sablayrolles, Amélie Héliou , et al. (76 additional authors not shown)

Abstract: We introduce Magistral, Mistral's first reasoning model and our own scalable reinforcement learning (RL) pipeline. Instead of relying on existing implementations and RL traces distilled from prior models, we follow a ground up approach, relying solely on our own models and infrastructure. Notably, we demonstrate a stack that enabled us to explore the limits of pure RL training of LLMs, present a s… ▽ More We introduce Magistral, Mistral's first reasoning model and our own scalable reinforcement learning (RL) pipeline. Instead of relying on existing implementations and RL traces distilled from prior models, we follow a ground up approach, relying solely on our own models and infrastructure. Notably, we demonstrate a stack that enabled us to explore the limits of pure RL training of LLMs, present a simple method to force the reasoning language of the model, and show that RL on text data alone maintains most of the initial checkpoint's capabilities. We find that RL on text maintains or improves multimodal understanding, instruction following and function calling. We present Magistral Medium, trained for reasoning on top of Mistral Medium 3 with RL alone, and we open-source Magistral Small (Apache 2.0) which further includes cold-start data from Magistral Medium. △ Less

Submitted 12 June, 2025; originally announced June 2025.

arXiv:2506.08439 [pdf, ps, other]

doi 10.1017/pasa.2025.10061

Discovery of Odd Radio Circles and Other Peculiars in the First Year of the EMU Survey using Object Detection

Authors: Nikhel Gupta, Ray P. Norris, Zeeshan Hayder, Minh Huynh, Heinz Andernach, Andrew M. Hopkins, Stanislav Shabala, Lawrence Rudnick, Miroslav D. Filipović, Bärbel S. Koribalski, Lars Petersson, X. Rosalind Wang

Abstract: We present a systematic search for Odd Radio Circles (ORCs) and other unusual radio morphologies using data from the first year of the EMU (Evolutionary Map of the Universe) survey. ORCs are rare, enigmatic objects characterized by edge-brightened rings of radio emission, often found in association with distant galaxies. To identify these objects, we employ a hybrid methodology combining supervise… ▽ More We present a systematic search for Odd Radio Circles (ORCs) and other unusual radio morphologies using data from the first year of the EMU (Evolutionary Map of the Universe) survey. ORCs are rare, enigmatic objects characterized by edge-brightened rings of radio emission, often found in association with distant galaxies. To identify these objects, we employ a hybrid methodology combining supervised object detection techniques and visual inspection of radio source candidates. This approach leads to the discovery of five new ORCs and two additional candidate ORCs, expanding the known population of these objects. In addition to ORCs, we also identify 55 Galaxies with Large-scale Ambient Radio Emission (GLAREs), which feature irregular, rectangular, or circular shapes of diffuse radio emission mostly surrounding central host galaxies. These GLAREs may represent different evolutionary stages of ORCs, and studying them could offer valuable insights into their evolutionary processes. We also highlight a subset of Starburst Radio Ring Galaxies (SRRGs), which are star-forming galaxies exhibiting edge-brightened radio rings surrounding their central star-forming regions. We emphasize the importance of multi-wavelength follow-up observations to better understand the physical properties, host galaxy characteristics, and evolutionary pathways of these radio sources. △ Less

Submitted 10 June, 2025; originally announced June 2025.

Comments: 19 pages, 7 figures, 6 tables. Accepted for publication in PASA

Journal ref: Publ. Astron. Soc. Aust. 42 (2025) e097

arXiv:2506.07400 [pdf, ps, other]

MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models

Authors: Philip R. Liu, Sparsh Bansal, Jimmy Dinh, Aditya Pawar, Ramani Satishkumar, Shail Desai, Neeraj Gupta, Xin Wang, Shu Hu

Abstract: The integration of deep learning-based glaucoma detection with large language models (LLMs) presents an automated strategy to mitigate ophthalmologist shortages and improve clinical reporting efficiency. However, applying general LLMs to medical imaging remains challenging due to hallucinations, limited interpretability, and insufficient domain-specific medical knowledge, which can potentially red… ▽ More The integration of deep learning-based glaucoma detection with large language models (LLMs) presents an automated strategy to mitigate ophthalmologist shortages and improve clinical reporting efficiency. However, applying general LLMs to medical imaging remains challenging due to hallucinations, limited interpretability, and insufficient domain-specific medical knowledge, which can potentially reduce clinical accuracy. Although recent approaches combining imaging models with LLM reasoning have improved reporting, they typically rely on a single generalist agent, restricting their capacity to emulate the diverse and complex reasoning found in multidisciplinary medical teams. To address these limitations, we propose MedChat, a multi-agent diagnostic framework and platform that combines specialized vision models with multiple role-specific LLM agents, all coordinated by a director agent. This design enhances reliability, reduces hallucination risk, and enables interactive diagnostic reporting through an interface tailored for clinical review and educational use. Code available at https://github.com/Purdue-M2/MedChat. △ Less

Submitted 11 June, 2025; v1 submitted 8 June, 2025; originally announced June 2025.

Comments: 7 pages, 6 figures. Accepted to the 2025 IEEE 8th International Conference on Multimedia Information Processing and Retrieval (MIPR)

Showing 1–50 of 678 results for author: Gupta, N