-
CHIME/FRB Outriggers: Design Overview
Authors:
The CHIME/FRB Collaboration,
Mandana Amiri,
Bridget C. Andersen,
Shion Andrew,
Kevin Bandura,
Mohit Bhardwaj,
Kalyani Bhopi,
Vadym Bidula,
P. J. Boyle,
Charanjot Brar,
Mark Carlson,
Tomas Cassanelli,
Alyssa Cassity,
Shami Chatterjee,
Jean-François Cliche,
Alice P. Curtin,
Rachel Darlinger,
David R. DeBoer,
Matt Dobbs,
Fengqiu Adam Dong,
Gwendolyn Eadie,
Emmanuel Fonseca,
B. M. Gaensler,
Nina Gusinskaia,
Mark Halpern,
et al. (44 additional authors not shown)
Abstract:
The Canadian Hydrogen Intensity Mapping Experiment (CHIME) has emerged as the world's premier facility for studying fast radio bursts (FRBs) through its fast transient search backend CHIME/FRB. The CHIME/FRB Outriggers project will augment this high detection rate of 2-3 FRBs per day with the ability to precisely localize them using very long baseline interferometry (VLBI). Using three strategically located stations in North America and deploying recently developed synoptic VLBI observing techniques, the Outriggers will provide $\sim 50$ milliarcsecond localization precision for the majority of detected FRBs. This paper presents an overview of the design and implementation of the Outriggers, covering their geographic distribution, structural design, and observational capabilities. We detail the scientific objectives driving the project, including the characterization of FRB populations, host galaxy demographics, and the use of FRBs as cosmological probes. We also discuss the calibration strategies available to mitigate ionospheric and instrumental effects, ensuring high-precision localization. With two stations currently in science operations, and the third in commissioning, the CHIME/FRB Outriggers project is poised to become a cornerstone of the FRB field, offering unprecedented insights into this enigmatic cosmic phenomenon.
Submitted 7 April, 2025;
originally announced April 2025.
-
Underreporting of Intimate Partner Violence in Brazil
Authors:
Diego de Maria André,
José Raimundo Carvalho
Abstract:
According to WHO (2013), about 30% of all women worldwide who have been in a relationship have experienced physical and/or sexual violence by their intimate partner. However, only a small percentage of intimate partner violence (IPV) victims report it to the police. This phenomenon of under-reporting is known as the "dark figure". This paper investigates the factors associated with the decision of IPV victims in Brazil to report to the police, using the third wave of the "Pesquisa de Condições Socioeconômicas e Violência Doméstica e Familiar contra a Mulher ($PCSVDF^{Mulher}$)". Using a bivariate probit regression model with sample selection, we found that older white women, those who do not tolerate domestic violence, and women who have experienced physical violence are more likely to report IPV to the police. In contrast, married women, those with partners who abuse alcohol, and those who witnessed or knew that their mothers had experienced IPV are less likely to report it to law enforcement.
Submitted 7 April, 2025;
originally announced April 2025.
-
Randomized block Krylov method for approximation of truncated tensor SVD
Authors:
Malihe Nobakht Kooshkghazi,
Salman Ahmadi-Asl,
Andre L. F. de Almeida
Abstract:
This paper studies the application of the block Krylov subspace method to the approximation of the truncated tensor SVD (T-SVD). The theoretical results of the proposed randomized approach are presented. Several experiments using synthetic and real-world data are conducted to verify the efficiency and feasibility of the proposed randomized approach, and the numerical results show that the proposed method provides promising results. Applications of the proposed approach to data completion and data compression are presented.
Submitted 7 April, 2025;
originally announced April 2025.
-
M-Prometheus: A Suite of Open Multilingual LLM Judges
Authors:
José Pombal,
Dongkeun Yoon,
Patrick Fernandes,
Ian Wu,
Seungone Kim,
Ricardo Rei,
Graham Neubig,
André F. T. Martins
Abstract:
The use of language models for automatically evaluating long-form text (LLM-as-a-judge) is becoming increasingly common, yet most LLM judges are optimized exclusively for English, with strategies for enhancing their multilingual evaluation capabilities remaining largely unexplored in the current literature. This has created a disparity in the quality of automatic evaluation methods for non-English languages, ultimately hindering the development of models with better multilingual capabilities. To bridge this gap, we introduce M-Prometheus, a suite of open-weight LLM judges ranging from 3B to 14B parameters that can provide both direct assessment and pairwise comparison feedback on multilingual outputs. M-Prometheus models outperform state-of-the-art open LLM judges on multilingual reward benchmarks spanning more than 20 languages, as well as on literary machine translation (MT) evaluation covering 4 language pairs. Furthermore, M-Prometheus models can be leveraged at decoding time to significantly improve generated outputs across all 3 tested languages, showcasing their utility for the development of better multilingual models. Lastly, through extensive ablations, we identify the key factors for obtaining an effective multilingual judge, including backbone model selection and training on natively multilingual feedback data instead of translated data. We release our models, training dataset, and code.
Submitted 7 April, 2025;
originally announced April 2025.
-
A Llama walks into the 'Bar': Efficient Supervised Fine-Tuning for Legal Reasoning in the Multi-state Bar Exam
Authors:
Rean Fernandes,
André Biedenkapp,
Frank Hutter,
Noor Awad
Abstract:
Legal reasoning tasks present unique challenges for large language models (LLMs) due to the complexity of domain-specific knowledge and reasoning processes. This paper investigates how effectively smaller language models (Llama 2 7B and Llama 3 8B) can be fine-tuned with a limited dataset of 1,514 Multi-state Bar Examination (MBE) questions to improve legal question answering accuracy. We evaluate these models on the 2022 MBE questions licensed from JD Advising, the same dataset used in the 'GPT-4 passes the Bar exam' study. Our methodology involves collecting approximately 200 questions per legal domain across 7 domains. We distill the dataset using Llama 3 (70B) to transform explanations into a structured IRAC (Issue, Rule, Application, Conclusion) format as a guided reasoning process to see if it results in better performance over the non-distilled dataset. We compare the non-fine-tuned models against their supervised fine-tuned (SFT) counterparts, trained for different sample sizes per domain, to study the effect on accuracy and prompt adherence. We also analyse option selection biases and their mitigation following SFT. In addition, we consolidate the performance across multiple variables: prompt type (few-shot vs zero-shot), answer ordering (chosen-option first vs generated-explanation first), response format (Numbered list vs Markdown vs JSON), and different decoding temperatures. Our findings show that domain-specific SFT helps some model configurations achieve close to human baseline performance, despite limited computational resources and a relatively small dataset. We release both the gathered SFT dataset and the family of Supervised Fine-tuned (SFT) adapters optimised for MBE performance. This establishes a practical lower bound on resources needed towards achieving effective legal question answering in smaller LLMs.
Submitted 7 April, 2025;
originally announced April 2025.
-
Roadmap for Photonics with 2D Materials
Authors:
F. Javier García de Abajo,
D. N. Basov,
Frank H. L. Koppens,
Lorenzo Orsini,
Matteo Ceccanti,
Sebastián Castilla,
Lorenzo Cavicchi,
Marco Polini,
P. A. D. Gonçalves,
A. T. Costa,
N. M. R. Peres,
N. Asger Mortensen,
Sathwik Bharadwaj,
Zubin Jacob,
P. J. Schuck,
A. N. Pasupathy,
Milan Delor,
M. K. Liu,
Aitor Mugarza,
Pablo Merino,
Marc G. Cuxart,
Emigdio Chávez-Angel,
Martin Svec,
Luiz H. G. Tizei,
Florian Dirnberger,
et al. (123 additional authors not shown)
Abstract:
Triggered by the development of exfoliation and the identification of a wide range of extraordinary physical properties in self-standing films consisting of one or few atomic layers, two-dimensional (2D) materials such as graphene, transition metal dichalcogenides (TMDs), and other van der Waals (vdW) crystals currently constitute a wide research field protruding in multiple directions in combination with layer stacking and twisting, nanofabrication, surface-science methods, and integration into nanostructured environments. Photonics encompasses a multidisciplinary collection of those directions, where 2D materials contribute with polaritons of unique characteristics such as strong spatial confinement, large optical-field enhancement, long lifetimes, high sensitivity to external stimuli (e.g., electric and magnetic fields, heating, and strain), a broad spectral range from the far infrared to the ultraviolet, and hybridization with spin and momentum textures of electronic band structures. The explosion of photonics with 2D materials as a vibrant research area is producing breakthroughs, including the discovery and design of new materials and metasurfaces with unprecedented properties as well as applications in integrated photonics, light emission, optical sensing, and exciting prospects for applications in quantum information, and nanoscale thermal transport. This Roadmap summarizes the state of the art in the field, identifies challenges and opportunities, and discusses future goals and how to meet them through a wide collection of topical sections prepared by leading practitioners.
Submitted 14 April, 2025; v1 submitted 6 April, 2025;
originally announced April 2025.
-
PEIRCE: Unifying Material and Formal Reasoning via LLM-Driven Neuro-Symbolic Refinement
Authors:
Xin Quan,
Marco Valentino,
Danilo S. Carvalho,
Dhairya Dalal,
André Freitas
Abstract:
A persistent challenge in AI is the effective integration of material and formal inference - the former concerning the plausibility and contextual relevance of arguments, the latter focusing on their logical and structural validity. Large Language Models (LLMs), by virtue of their extensive pre-training on large textual corpora, exhibit strong capabilities in material inference. However, their reasoning often lacks formal rigour and verifiability. At the same time, LLMs' linguistic competence positions them as a promising bridge between natural and formal languages, opening up new opportunities for combining these two modes of reasoning. In this paper, we introduce PEIRCE, a neuro-symbolic framework designed to unify material and formal inference through an iterative conjecture-criticism process. Within this framework, LLMs play the central role of generating candidate solutions in natural and formal languages, which are then evaluated and refined via interaction with external critique models. These critiques include symbolic provers, which assess formal validity, as well as soft evaluators that measure the quality of the generated arguments along linguistic and epistemic dimensions such as plausibility, coherence, and parsimony. While PEIRCE is a general-purpose framework, we demonstrate its capabilities in the domain of natural language explanation generation - a setting that inherently demands both material adequacy and formal correctness.
Submitted 5 April, 2025;
originally announced April 2025.
-
A posteriori closure of turbulence models: are symmetries preserved?
Authors:
André Freitas,
Kiwon Um,
Mathieu Desbrun,
Michele Buzzicotti,
Luca Biferale
Abstract:
Turbulence modeling remains a longstanding challenge in fluid dynamics. Recent advances in data-driven methods have led to a surge of novel approaches aimed at addressing this problem. This work builds upon our previous study (arXiv:2411.13194), where we introduced a new closure for a shell model of turbulence using an a posteriori (or solver-in-the-loop) approach. Unlike most deep learning-based models, our method explicitly incorporates physical equations into the neural network framework, ensuring that the closure remains constrained by the underlying physics and benefits from enhanced stability and generalizability. In this paper, we further analyze the learned closure, probing its capabilities and limitations. In particular, we look at joint probability density functions to assess whether cross-correlations are well preserved or whether only the mean behavior is captured. Additionally, we investigate the scale invariance of multipliers - ratios between adjacent shells - within the inertial range. Although our model excels in reproducing high-order observables such as flatness, it breaks this known symmetry near the cutoff, indicating a fundamental limitation. We discuss the implications of these findings for subgrid-scale modeling in 3D turbulence and outline directions for future research.
Submitted 4 April, 2025;
originally announced April 2025.
-
Complete First-Order Game Logic
Authors:
Noah Abou El Wafa,
André Platzer
Abstract:
First-order game logic GL and the first-order modal mu-calculus Lmu are proved to be equiexpressive and equivalent, thereby fully aligning their expressive and deductive power. That is, there is a semantics-preserving translation from GL to Lmu, and vice versa. Both translations are provability-preserving, and equivalence with the there-and-back-again roundtrip translations is provable in both calculi. This is to be contrasted with the propositional case, where game logic is strictly less expressive than the modal mu-calculus (without adding sabotage games).
The extensions with differential equations, differential game logic (dGL) and differential modal mu-calculus are also proved equiexpressive and equivalent. Moreover, as the continuous dynamics are definable by fixpoints or via games, ODEs can be axiomatized completely. Rational gameplay provably collapses the games into single-player games to yield a strong arithmetical completeness theorem for dGL with rational-time ODEs.
Submitted 4 April, 2025;
originally announced April 2025.
-
Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency
Authors:
Erik Johannes Husom,
Arda Goknil,
Merve Astekin,
Lwin Khin Shar,
Andre Kåsen,
Sagar Sen,
Benedikt Andreas Mithassel,
Ahmet Soylu
Abstract:
Deploying Large Language Models (LLMs) on edge devices presents significant challenges due to computational constraints, memory limitations, inference speed, and energy consumption. Model quantization has emerged as a key technique to enable efficient LLM inference by reducing model size and computational overhead. In this study, we conduct a comprehensive analysis of 28 quantized LLMs from the Ollama library, which applies by default Post-Training Quantization (PTQ) and weight-only quantization techniques, deployed on an edge device (Raspberry Pi 4 with 4GB RAM). We evaluate energy efficiency, inference performance, and output accuracy across multiple quantization levels and task types. Models are benchmarked on five standardized datasets (CommonsenseQA, BIG-Bench Hard, TruthfulQA, GSM8K, and HumanEval), and we employ a high-resolution, hardware-based energy measurement tool to capture real-world power consumption. Our findings reveal the trade-offs between energy efficiency, inference speed, and accuracy in different quantization settings, highlighting configurations that optimize LLM deployment for resource-constrained environments. By integrating hardware-level energy profiling with LLM benchmarking, this study provides actionable insights for sustainable AI, bridging a critical gap in existing research on energy-aware LLM deployment.
Submitted 4 April, 2025;
originally announced April 2025.
-
Verification of Autonomous Neural Car Control with KeYmaera X
Authors:
Enguerrand Prebet,
Samuel Teuber,
André Platzer
Abstract:
This article presents a formal model and formal safety proofs for the ABZ'25 case study in differential dynamic logic (dL). The case study considers an autonomous car driving on a highway avoiding collisions with neighbouring cars. Using KeYmaera X's dL implementation, we prove absence of collision on an infinite time horizon which ensures that safety is preserved independently of trip length. The safety guarantees hold for time-varying reaction time and brake force. Our dL model considers the single lane scenario with cars ahead or behind. We demonstrate that dL with its tools is a rigorous foundation for runtime monitoring, shielding, and neural network verification. Doing so sheds light on inconsistencies between the provided specification and simulation environment highway-env of the ABZ'25 study. We attempt to fix these inconsistencies and uncover numerous counterexamples which also indicate issues in the provided reinforcement learning environment.
Submitted 4 April, 2025;
originally announced April 2025.
-
Unlocking the AMD Neural Processing Unit for ML Training on the Client Using Bare-Metal-Programming Tools
Authors:
André Rösti,
Michael Franz
Abstract:
There has been a growing interest in executing machine learning (ML) workloads on the client side for reasons of customizability, privacy, performance, and availability. In response, hardware manufacturers have begun to incorporate so-called Neural Processing Units (NPUs) into their processors for consumer devices. Such dedicated hardware optimizes both power efficiency and throughput for common machine learning tasks. AMD's NPU, part of their Ryzen AI processors, is one of the first such accelerators integrated into a chip with an x86 processor. AMD supports bare-metal programming of their NPU rather than limiting programmers to pre-configured libraries.
In this paper, we explore the potential of using a bare-metal toolchain to accelerate the weight fine-tuning of a large language model, GPT-2, entirely on the client side using the AMD NPU. Fine-tuning on the edge allows for private customization of a model to a specific use case. To the best of our knowledge, this is the first time such an accelerator has been used to perform training on the client side. We offload time-intensive matrix multiplication operations from the CPU onto the NPU, achieving a speedup of over 2.8x for these operations. This improves end-to-end performance of the model in terms of throughput (1.7x and 1.2x speedup in FLOPS/s on mains and battery power, respectively) and energy efficiency (1.4x improvement in FLOPS/Ws on battery power). We detail our implementation approach and present an in-depth exploration of the NPU hardware and bare-metal tool-flow.
Submitted 3 April, 2025;
originally announced April 2025.
-
Distributed Locking: Performance Analysis and Optimization Strategies
Authors:
Andre Rodriguez,
William Osborn
Abstract:
Distributed locking mechanisms are fundamental to ensuring data consistency and integrity in distributed systems. This paper presents a comprehensive analysis of distributed locking algorithms, focusing on their performance characteristics under various workload conditions. We compare traditional centralized locking approaches with modern distributed protocols, evaluating them based on throughput, latency, and scalability metrics. Our experimental results demonstrate that optimized distributed locking protocols can achieve up to 68% better performance compared to centralized approaches in high-contention scenarios, while maintaining strong consistency guarantees. Furthermore, we propose novel optimizations for distributed locking that significantly reduce coordination overhead in geo-distributed deployments. The findings contribute to the growing body of knowledge on designing efficient concurrency control mechanisms for modern distributed systems.
Submitted 3 April, 2025;
originally announced April 2025.
-
Galerkin reduced order model for two-dimensional Rayleigh-Bénard convection
Authors:
Enrique Flores-Montoya,
André V. G. Cavalieri
Abstract:
In this work, Galerkin projection is used to build Reduced Order Models (ROM) for two-dimensional Rayleigh-Bénard (RB) convection with no-slip walls. We compare an uncoupled projection approach that uses separate orthonormal bases for velocity and temperature with a coupled formalism where the equations are projected onto a single basis combining velocity and temperature components. Orthonormal bases for modal projection are obtained as the eigenvectors of the controllability Gramian of the linearized RB equations. Various coupled and uncoupled ROMs with different numbers of modes are generated and validated against Direct Numerical Simulations (DNS) over a wide range of Rayleigh numbers, $Ra$. DNS and ROM results are compared in terms of mean vertical profiles, heat flux, flow structures, bifurcation diagrams and energy spectra. Coupled ROMs are found to be unstable at high $Ra$ numbers with a stability limit that depends on the basis $Ra$. Uncoupled models show an increasing agreement with DNS as a function of the system dimension. It is found that for the system truncations investigated here, a quantitative agreement with DNS can be obtained up to $Ra\simeq 4\times 10^5$. ROMs are used to perform a bifurcation analysis for $Pr=10$ and the results are compared to DNS. They qualitatively predict the transitions between periodic, quasiperiodic and chaotic states as well as the spectral characteristics over a wide range of $Ra$ numbers. Overall, these results show that these ROMs reproduce the main flow features of RB convection and could be used as DNS-surrogates for the development of active control strategies and state estimation applications.
Submitted 3 April, 2025;
originally announced April 2025.
-
Anomalous vortex Hall effect in a ferromagnet/superconductor heterostructure
Authors:
Weideng Sun,
Przemyslaw Swatek,
Yihong Fan,
Hwanhui Yun,
Deyuan Lyu,
K. Andre Mkhoyan,
Jian-Ping Wang,
Gang Qiu
Abstract:
The coexistence of superconductivity and ferromagnetism is a fascinating and complex phenomenon in condensed matter physics, as these two states are typically mutually exclusive due to their competing spin configurations. However, the interplay between these two orders through the proximity effect has been a subject of intense research as it opens up possibilities for novel technological applications. Here, we report the coexistence of superconductivity and ferromagnetism in superconducting δ-TaN/ferromagnetic CoFeB heterostructures grown by facing-target sputtering. Superconducting states are comprehensively investigated, with evidence of strong correlation between the superconducting and ferromagnetic order parameters. In particular, we observed an anomalous Hall signal without the presence of the magnetic field in the mixed state of the superconducting transition near the critical temperature. Systematic characterizations of the Hall resistance under varying temperatures and magnetic fields attribute this behavior to the vortex Hall effect (VHE), whereby superconducting vortices in the mixed state undergo transverse motions near the critical temperature. Unlike previously reported VHEs in conventional type-II superconductors, the anomalous VHE in TaN is induced by the stray field in the underlying CoFeB layers. The concurrency of strong spin-orbit coupling, the superconductivity in the TaN layer, and the highly spin-polarized ferromagnetic ordering in the CoFeB layer offers new insights into proximity-induced vortex dynamics and the design of novel superconducting spintronic devices.
Submitted 3 April, 2025;
originally announced April 2025.
-
Orbit Determination through Cosmic Microwave Background Radiation
Authors:
Pedro K de Albuquerque,
Andre R Kuroswiski,
Annie S. Wu,
Willer G. dos Santos,
Paulo Costa
Abstract:
This research explores the use of Cosmic Microwave Background (CMB) radiation as a reference signal for Initial Orbit Determination (IOD). By leveraging the unique properties of CMB, this study introduces a novel method for estimating spacecraft velocity and position with minimal reliance on pre-existing environmental data, offering significant advantages for space missions independent of Earth-specific conditions. Using Machine Learning (ML) regression models, this approach demonstrates the capability to determine velocity from CMB signals and subsequently determine the satellite's position. The results indicate that CMB has the potential to enhance the autonomy and flexibility of spacecraft operations.
Submitted 2 April, 2025;
originally announced April 2025.
-
A thorough benchmark of automatic text classification: From traditional approaches to large language models
Authors:
Washington Cunha,
Leonardo Rocha,
Marcos André Gonçalves
Abstract:
Automatic text classification (ATC) has experienced remarkable advancements in the past decade, best exemplified by recent small and large language models (SLMs and LLMs), leveraged by Transformer architectures. Despite recent effectiveness improvements, a comprehensive cost-benefit analysis investigating whether the effectiveness gains of these recent approaches compensate for their much higher costs when compared to more traditional text classification approaches such as SVMs and Logistic Regression is still missing in the literature. In this context, this work's main contributions are twofold: (i) we provide a scientifically sound comparative analysis of the cost-benefit of twelve traditional and recent ATC solutions including five open LLMs, and (ii) a large benchmark comprising 22 datasets, including sentiment analysis and topic classification, with their (train-validation-test) partitions based on folded cross-validation procedures, along with documentation, and code. The release of code, data, and documentation enables the community to replicate experiments and advance the field in a more scientifically sound manner. Our comparative experimental results indicate that LLMs outperform traditional approaches (up to 26%-7.1% on average) and SLMs (up to 4.9%-1.9% on average) in terms of effectiveness. However, LLMs incur significantly higher computational costs due to fine-tuning, being, on average, 590x and 8.5x slower than traditional methods and SLMs, respectively. Results suggest the following recommendations: (1) LLMs for applications that require the best possible effectiveness and can afford the costs; (2) traditional methods such as Logistic Regression and SVM for resource-limited applications or those that cannot afford the cost of tuning large LLMs; and (3) SLMs like RoBERTa for near-optimal effectiveness-efficiency trade-off.
Submitted 2 April, 2025;
originally announced April 2025.
-
Gaze-Hand Steering for Travel and Multitasking in Virtual Environments
Authors:
Mona Zavichi,
André Santos,
Catarina Moreira,
Anderson Maciel,
Joaquim Jorge
Abstract:
As head-mounted displays (HMDs) with eye-tracking become increasingly accessible, the need for effective gaze-based interfaces in virtual reality (VR) grows. Traditional gaze- or hand-based navigation often limits user precision or impairs free viewing, making multitasking difficult. We present a gaze-hand steering technique that combines eye-tracking with hand-pointing: users steer only when gaze aligns with a hand-defined target, reducing unintended actions and enabling free look. Speed is controlled via either a joystick or a waist-level speed circle. We evaluated our method in a user study (N=20) across multitasking and single-task scenarios, comparing it to a similar technique. Results show that gaze-hand steering maintains performance and enhances user comfort and spatial awareness during multitasking. Our findings support the use of gaze-hand steering in gaze-dominant VR applications requiring precision and simultaneous interaction. Our method significantly improves VR navigation in gaze-dominant, multitasking-intensive applications, supporting immersion and efficient control.
Submitted 2 April, 2025;
originally announced April 2025.
-
US National Input to the European Strategy Update for Particle Physics
Authors:
André de Gouvêa,
Hitoshi Murayama,
Mark Palmer,
Heidi Schellman
Abstract:
In this document we summarize the output of the US community planning exercises for particle physics that were performed between 2020 and 2023 and comment upon progress made since then towards our common scientific goals. This document leans heavily on the formal report of the Particle Physics Project Prioritization Panel and other recent US planning documents, often quoting them verbatim to retain the community consensus.
Submitted 2 April, 2025;
originally announced April 2025.
-
Adaptation of Moreau-Yosida regularization to the modulus of convexity
Authors:
Markus Penz,
Andre Laestadius
Abstract:
We study a generalization of Moreau-Yosida regularization that is adapted to the geometry of Banach spaces where the dual space is uniformly convex with modulus of convexity of power type. Important properties for regularized convex functions are given, in particular strong monotonicity of the subdifferential of their convex conjugate and Hölder-continuity of their gradient.
Submitted 2 April, 2025;
originally announced April 2025.
-
Minimal pole representation for spectral functions
Authors:
Lei Zhang,
André Erpenbeck,
Yang Yu,
Emanuel Gull
Abstract:
Representing spectral densities, real-frequency, and real-time Green's functions of continuous systems by a small discrete set of complex poles is a ubiquitous problem in condensed matter physics, with applications ranging from quantum transport simulations to the simulation of strongly correlated electron systems. This paper introduces a method for obtaining a compact, approximate representation of these functions, based on their parameterization on the real axis and a given approximate precision. We show applications to typical spectral functions and results for structured and unstructured correlation functions of model systems.
Submitted 30 May, 2025; v1 submitted 1 April, 2025;
originally announced April 2025.
-
Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models
Authors:
José Pombal,
Nuno M. Guerreiro,
Ricardo Rei,
André F. T. Martins
Abstract:
As language models improve and become capable of performing more complex tasks across modalities, evaluating them automatically becomes increasingly challenging. Developing strong and robust task-specific automatic metrics gets harder, and human-annotated test sets -- which are expensive to create -- saturate more quickly. A compelling alternative is to design reliable strategies to automate the creation of test data and evaluation, but previous attempts either rely on pre-existing data, or focus solely on individual tasks. We present Zero-shot Benchmarking (ZSB), a framework for creating high-quality benchmarks for any task by leveraging language models for both synthetic test data creation and evaluation. ZSB is simple and flexible: it requires only the creation of a prompt for data generation and one for evaluation; it is scalable to tasks and languages where collecting real-world data is costly or impractical; it is model-agnostic, allowing the creation of increasingly challenging benchmarks as models improve. To assess the effectiveness of our framework, we create benchmarks for five text-only tasks and a multi-modal one: general capabilities in four languages (English, Chinese, French, and Korean), translation, and general vision-language capabilities in English. We then rank a broad range of open and closed systems on our benchmarks. ZSB rankings consistently correlate strongly with human rankings, outperforming widely-adopted standard benchmarks. Through ablations, we find that strong benchmarks can be created with open models, and that judge model size and dataset variety are crucial drivers of performance. We release all our benchmarks, and code to reproduce our experiments and to produce new benchmarks.
Submitted 1 April, 2025;
originally announced April 2025.
-
An Investigation into the Causal Mechanism of Political Opinion Dynamics: A Model of Hierarchical Coarse-Graining with Community-Bounded Social Influence
Authors:
Valeria Widler,
Barbara Kaminska,
Andre C. R. Martins,
Ivan Puga-Gonzalez
Abstract:
The increasing polarization in democratic societies is an emergent outcome of political opinion dynamics. Yet, the fundamental mechanisms behind the formation of political opinions, from individual beliefs to collective consensus, remain unknown. Understanding that a causal mechanism must account for both bottom-up and top-down influences, we conceptualize political opinion dynamics as hierarchical coarse-graining, where microscale opinions integrate into a macro-scale state variable. Using the CODA (Continuous Opinions Discrete Actions) model, we simulate Bayesian opinion updating, social identity-based information integration, and migration between social identity groups to represent higher-level connectivity. This results in coarse-graining across micro, meso, and macro levels. Our findings show that higher-level connectivity shapes information integration, yielding three regimes: independent (disconnected, local convergence), parallel (fast, global convergence), and iterative (slow, stepwise convergence). In the iterative regime, low connectivity fosters transient diversity, indicating an informed consensus. In all regimes, time-scale separation leads to downward causation, where agents converge on the aggregate majority choice, driving consensus. Critically, any degree of coherent higher-level information integration can overcome misalignment via global downward causation. The results highlight how emergent properties of the causal mechanism, such as downward causation, are essential for consensus and may inform more precise investigations into polarized political discourse.
Submitted 2 April, 2025; v1 submitted 1 April, 2025;
originally announced April 2025.
-
Chemical and Morphological Transformations of a Ag-Cu Nanocatalyst During CO2 Reduction Reaction
Authors:
Gustavo Zottis Girotto,
Maximilian Jaugstetter,
Dongwoo Kim,
Ruan M. Martins,
André R. Muniz,
Miquel Salmeron,
Slavomir Nemsak,
Fabiano Bernardi
Abstract:
The conversion of CO2 into high-value chemicals through a photoreduction reaction in water is a promising route to reduce the dependence on fossil fuels. Ag nanoparticles can drive this reaction via localized surface plasmon resonance, but their low selectivity limits usage in industry. Enhancing selectivity toward hydrocarbons or alcohols requires the addition of a co-catalyst such as Cu. However, the stabilized surface state created by Ag-Cu interactions is still poorly understood. In this work, soft x-ray Ambient-Pressure X-ray Photoelectron Spectroscopy (AP-XPS) and Grazing-Incidence X-ray Scattering (AP-GIXS) were used to investigate the evolution of Ag-Cu nanoparticles under CO2RR-like conditions. AP-XPS revealed Ag and Cu surface and sub-surface diffusion, while AP-GIXS tracked changes in the shape and size of the nanoparticles induced by diffusion mechanics. Under 532 nm laser irradiation, further oxidation of Cu and Ag sub-surface diffusion were observed, providing invaluable insights into the dynamic restructuring of the catalyst under reaction conditions.
Submitted 31 March, 2025;
originally announced April 2025.
-
Karabo: A versatile SKA Observation Simulation Framework
Authors:
Rohit Sharma,
Simon Felix,
Luis Fernando Machado Poletti Valle,
Vincenzo Timmel,
Lukas Gehrig,
Andreas Wassmer,
Jennifer Studer,
Pascal Hitz,
Filip Schramka,
Michele Bianco,
Devin Crichton,
Marta Spinelli,
André Csillaghy,
Stefan Kögel,
Alexandre Réfrégier
Abstract:
Karabo is a versatile Python-based software framework simplifying research with radio astronomy data. It bundles existing software packages into a coherent whole to improve the ease of use of its components. Karabo includes useful abstractions, like strategies to scale and parallelize typical workloads or science-specific Python modules. The framework includes functionality to access datasets and mock observations to study the Square Kilometer Array (SKA) instruments and their expected accuracy. SKA will address problems in a wide range of fields of astronomy. We demonstrate the application of Karabo to some of the SKA science cases from HI intensity mapping, mock radio surveys, radio source detection, the epoch of re-ionisation and heliophysics. We discuss the capabilities and challenges of simulating large radio datasets in the context of SKA.
Submitted 1 April, 2025; v1 submitted 31 March, 2025;
originally announced April 2025.
-
Reinterpretation and preservation of data and analyses in HEP
Authors:
Jon Butterworth,
Sabine Kraml,
Harrison Prosper,
Andy Buckley,
Louie Corpe,
Cristinel Diaconu,
Mark Goodsell,
Philippe Gras,
Martin Habedank,
Clemens Lange,
Kati Lassila-Perini,
André Lessa,
Rakhi Mahbubani,
Judita Mamužić,
Zach Marshall,
Thomas McCauley,
Humberto Reyes-Gonzalez,
Krzysztof Rolbiecki,
Sezen Sekmen,
Giordon Stark,
Graeme Watt,
Jonas Würzinger,
Shehu AbdusSalam,
Aytul Adiguzel,
Amine Ahriche
, et al. (123 additional authors not shown)
Abstract:
Data from particle physics experiments are unique and are often the result of a very large investment of resources. Given the potential scientific impact of these data, which goes far beyond the immediate priorities of the experimental collaborations that obtain them, it is imperative that the collaborations and the wider particle physics community publish and preserve sufficient information to ensure that this impact can be realised, now and into the future. The information to be published and preserved includes the algorithms, statistical information, simulations and the recorded data. This publication and preservation requires significant resources, and should be a strategic priority with commensurate planning and resource allocation from the earliest stages of future facilities and experiments.
Submitted 31 March, 2025;
originally announced April 2025.
-
Variational Perturbation Theory in Open Quantum Systems for Efficient Steady State Computation
Authors:
André Melo,
Gaspard Beugnot,
Fabrizio Minganti
Abstract:
Determining the steady state of an open quantum system is crucial for characterizing quantum devices and studying various physical phenomena. Often, computing a single steady state is insufficient, and it is necessary to explore its dependence on multiple external parameters. In such cases, calculating the steady state independently for each combination of parameters quickly becomes intractable. Perturbation theory (PT) can mitigate this challenge by expanding steady states around reference parameters, minimizing redundant computations across neighboring parameter values. However, PT has two significant limitations: it relies on the pseudo-inverse -- a numerically costly operation -- and has a limited radius of convergence. In this work, we remove both of these roadblocks. First, we introduce a variational perturbation theory (VPT) and its multipoint generalization that significantly extends the radius of convergence even in the presence of non-analytic effects such as dissipative phase transitions. Then, we develop two numerical strategies that eliminate the need to compute pseudo-inverses. The first relies on a single LU decomposition to efficiently construct the steady state within the convergence region, while the second reformulates VPT as a Krylov space recycling problem and uses preconditioned iterative methods. We benchmark these approaches across various models, demonstrating their broad applicability and significant improvements over standard PT.
Submitted 31 March, 2025;
originally announced April 2025.
-
Neutrino Theory in the Precision Era
Authors:
Asmaa Abada,
Gabriela Barenboim,
Toni Bertólez-Martínez,
Sandipan Bhattacherjee,
Sara Bolognesi,
Patrick D. Bolton,
Nilay Bostan,
Gustavo C. Branco,
Sabya Sachi Chatterjee,
Adriano Cherchiglia,
Marco Chianese,
B. A. Couto e Silva,
Peter B. Denton,
Stephen Dolan,
Marco Drewes,
Ilham El Atmani,
Miguel Escudero,
Ivan Esteban,
Manuel Ettengruber,
Enrique Fernández-Martínez,
Julien Froustey,
Raj Gandhi,
Julia Gehrlein,
Srubabati Goswami,
André de Gouvêa
, et al. (54 additional authors not shown)
Abstract:
This document summarises discussions on future directions in theoretical neutrino physics, which are the outcome of a neutrino theory workshop held at CERN in February 2025. The starting point is the realisation that neutrino physics offers unique opportunities to address some of the most fundamental questions in physics. This motivates a vigorous experimental programme which the theory community fully supports. A strong effort in theoretical neutrino physics is paramount to optimally take advantage of upcoming neutrino experiments and to explore the synergies with other areas of particle, astroparticle, and nuclear physics, as well as cosmology. Progress on the theory side has the potential to significantly boost the physics reach of experiments, as well as go well beyond their original scope. Strong collaboration between theory and experiment is essential in the precision era. To foster such collaboration, we propose to establish a CERN Neutrino Physics Centre. Taking inspiration from the highly successful LHC Physics Center at Fermilab, the CERN Neutrino Physics Centre would be the European hub of the neutrino community, covering experimental and theoretical activities.
Submitted 27 March, 2025;
originally announced April 2025.
-
Structure and Fragmentation Scale of a Massive Star-Forming Filament in NGC6334: High-Resolution Mid-Infrared Absorption Imaging with JWST
Authors:
Philippe André,
Michael Mattern,
Doris Arzoumanian,
Yoshito Shimajiri,
Annie Zavagno,
Daisei Abe,
Delphine Russeil
Abstract:
Dense filaments are believed to be representative of the initial conditions of star formation in molecular clouds. We have used the MIRI instrument on JWST to image the massive filament NGC6334M at d~1.3 kpc with unprecedented resolution and dynamic range at 7.7 and 25.5 microns. Our observations reveal the fine structure of the filament in absorption against mid-infrared background emission. From the absorption data, we derive high-resolution column density maps and perform a detailed analysis of the filament structure. We find a median filament width of 0.12+/-0.02 pc at both wavelengths, resolved by almost two orders of magnitude by MIRI, and consistent with the typical half-power width of Herschel filaments in nearby (d<0.5 kpc) clouds. The JWST data also reveal the presence of a quasi-periodic series of side filaments with a similar projected spacing of 0.125+/-0.015 pc. Combining our JWST results with Spitzer and APEX/Herschel data, we perform a study of cloud structure over four orders of magnitude in linear scale. A convergence test shows that our width estimates for NGC6334M are robust and reflect the presence of a true characteristic scale. While there is evidence of a Kolmogorov-like spectrum of small-scale fluctuations down to the 1.6x10^-3 pc resolution of the JWST observations, we identify a break in the power spectrum of column density fluctuations at a scale ~0.1-0.4 pc comparable to the width of NGC6334M and its side filaments. This characteristic scale ~0.1 pc has important implications for the origin of the star formation efficiency in dense gas and the IMF.
Submitted 31 March, 2025;
originally announced March 2025.
-
Intersection of linear and multi-twisted codes with applications
Authors:
Ramy Takieldin,
André Leroy
Abstract:
In this paper, we derive a formula for constructing a generator matrix for the intersection of any pair of linear codes over a finite field. Consequently, we establish a condition under which a linear code has a trivial intersection with another linear code (or its Galois dual). Furthermore, we provide a condition for reversibility and propose a generator matrix formula for the largest reversible subcode of any linear code. We then focus on the comprehensive class of multi-twisted (MT) codes, which are naturally and more effectively represented using generator polynomial matrices (GPMs). We prove that the reversed code of an MT code remains MT and derive an explicit formula for its GPM. Additionally, we examine the intersection of a pair of MT codes, possibly with different shift constants, and demonstrate that this intersection is not necessarily MT. However, when the intersection admits an MT structure, we propose the corresponding shift constants. We also establish a GPM formula for the intersection of a pair of MT codes with the same shift constants. This result enables us to derive a GPM formula for the intersection of an MT code and the Galois dual of another MT code. Finally, we examine conditions for various properties of MT codes, most importantly the necessary and sufficient conditions for an MT code to be Galois self-orthogonal, Galois dual-containing, Galois linear complementary dual (LCD), or reversible.
Submitted 10 April, 2025; v1 submitted 31 March, 2025;
originally announced March 2025.
-
Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach
Authors:
Francesco Pio Ramunno,
Paolo Massa,
Vitaliy Kinakh,
Brandon Panos,
André Csillaghy,
Slava Voloshynovskiy
Abstract:
The spatial properties of the solar magnetic field are crucial to decoding the physical processes in the solar interior and their interplanetary effects. However, observations from older instruments, such as the Michelson Doppler Imager (MDI), have limited spatial or temporal resolution, which hinders the ability to study small-scale solar features in detail. Super resolving these older datasets is essential for uniform analysis across different solar cycles, enabling better characterization of solar flares, active regions, and magnetic network dynamics. In this work, we introduce a novel diffusion model approach for Super-Resolution and we apply it to MDI magnetograms to match the higher-resolution capabilities of the Helioseismic and Magnetic Imager (HMI). By training a Latent Diffusion Model (LDM) with residuals on downscaled HMI data and fine-tuning it with paired MDI/HMI data, we can enhance the resolution of MDI observations from 2"/pixel to 0.5"/pixel. We evaluate the quality of the reconstructed images by means of classical metrics (e.g., PSNR, SSIM, FID and LPIPS) and we check if physical properties, such as the unsigned magnetic flux or the size of an active region, are preserved. We compare our model with different variations of LDM and Denoising Diffusion Probabilistic models (DDPMs), but also with two deterministic architectures already used in the past for performing the Super-Resolution task. Furthermore, we show with an analysis in the Fourier domain that the LDM with residuals can resolve features smaller than 2", and due to the probabilistic nature of the LDM, we can assess their reliability, in contrast with the deterministic models. Future studies aim to super-resolve the temporal scale of the solar MDI instrument so that we can also have a better overview of the dynamics of the old events.
Submitted 31 March, 2025;
originally announced March 2025.
-
The Compact Linear e$^+$e$^-$ Collider (CLIC)
Authors:
Erik Adli,
Gerardo D'Auria,
Nuria Catalan Lasheras,
Vera Cilento,
Roberto Corsini,
Dominik Dannheim,
Steffen Doebert,
Mick Draper,
Angeles Faus-Golfe,
Edward Fraser Mactavish,
Alexej Grudiev,
Andrea Latina,
Lucie Linssen,
John Andrew Osborne,
Yannis Papaphilippou,
Philipp Roloff,
Aidan Robson,
Carlo Rossi,
Andre Sailer,
Daniel Schulte,
Eva Sicking,
Steinar Stapnes,
Igor Syratchev,
Rogelio Tomas Garcia,
Walter Wuensch
Abstract:
The Compact Linear Collider (CLIC) is a TeV-scale high-luminosity linear e$^+$e$^-$ collider studied by the international CLIC and CLICdp collaborations. CLIC uses a two-beam acceleration scheme, in which normal-conducting high-gradient 12 GHz accelerating structures are powered via a high-current drive beam. CLIC is foreseen to be built and operated in stages. The initial 380 GeV stage, with a site length of 11 km, optimally combines the exploration of Higgs and top-quark physics, including a top threshold scan near 350 GeV. A higher-energy stage, still using the initial single drive-beam complex, can be optimised for any energy up to 2 TeV. Parameters are presented in detail for a 1.5 TeV stage, with a site length of 29 km. Since the 2018 ESPPU reporting, significant effort was invested in CLIC accelerator optimisation, technology developments and system tests, including collaboration with new-generation light sources and free-electron lasers. CLIC implementation aspects at CERN have covered detailed studies of civil engineering, electrical networks, cooling and ventilation, scheduling, and costing. The CLIC baseline at 380 GeV is now 100 Hz operation, with a luminosity of 4.5$\times 10^{34}$\,cm$^{-2}$s$^{-1}$ and a power consumption of 166 MW. Compared to the 2018 design, this gives three times higher luminosity-per-power. The new baseline has two beam-delivery systems, allowing for two detectors operating in parallel. The cost estimate of the 380 GeV baseline is approximately 7.17 billion CHF. The construction of the first CLIC energy stage could start as early as 2033 with first beams available by 2041. This report summarises the CLIC project, its implementation and running scenarios, with emphasis on new developments and recent progress. It concludes with an update on the CLIC detector studies and on the physics potential in light of the improved accelerator performance.
Submitted 31 March, 2025;
originally announced March 2025.
-
Detecting Localized Density Anomalies in Multivariate Data via Coin-Flip Statistics
Authors:
Sebastian Springer,
Andre Scaffidi,
Maximilian Autenrieth,
Gabriella Contardo,
Alessandro Laio,
Roberto Trotta,
Heikki Haario
Abstract:
Detecting localized density differences in multivariate data is a crucial task in computational science. Such anomalies can indicate a critical system failure, lead to a groundbreaking scientific discovery, or reveal unexpected changes in data distribution. We introduce EagleEye, an anomaly detection method to compare two multivariate datasets with the aim of identifying local density anomalies, namely over- or under-densities affecting only localised regions of the feature space. Anomalies are detected by modelling, for each point, the ordered sequence of its neighbours' membership label as a coin-flipping process and monitoring deviations from the expected behaviour of such process. A unique advantage of our method is its ability to provide an accurate, entirely unsupervised estimate of the local signal purity. We demonstrate its effectiveness through experiments on both synthetic and real-world datasets. In synthetic data, EagleEye accurately detects anomalies in multiple dimensions even when they affect a tiny fraction of the data. When applied to a challenging resonant anomaly detection benchmark task in simulated Large Hadron Collider data, EagleEye successfully identifies particle decay events present in just 0.3% of the dataset. In global temperature data, EagleEye uncovers previously unidentified, geographically localised changes in temperature fields that occurred in the most recent years. Thanks to its key advantages of conceptual simplicity, computational efficiency, trivial parallelisation, and scalability, EagleEye is widely applicable across many fields.
Submitted 2 April, 2025; v1 submitted 31 March, 2025;
originally announced March 2025.
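One simple way to read the coin-flipping idea in the abstract above is sketched here in plain Python. The functions `binom_upper_tail` and `overdensity_score` are hypothetical names, and the statistic (the largest negative log binomial tail probability over the first k neighbours) is an assumption chosen for illustration; it is not necessarily the statistic EagleEye uses, and it makes no correction for scanning over k.

```python
import math


def binom_upper_tail(successes, trials, p):
    """P[X >= successes] for X ~ Binomial(trials, p)."""
    return sum(
        math.comb(trials, j) * p**j * (1 - p) ** (trials - j)
        for j in range(successes, trials + 1)
    )


def overdensity_score(point, reference, test, k_max=50):
    """Score how strongly `point` sits in a local over-density of `test`.

    Neighbours are drawn from the pooled reference and test sets. Under the
    null hypothesis of equal local densities, each neighbour label is a coin
    flip with success probability p = share of test points in the pool.
    """
    others = [q for q in test if q is not point]  # drop the point itself
    p = len(others) / (len(others) + len(reference))

    # Pool both sets, tag each neighbour (0 = reference, 1 = test), sort by distance.
    pooled = [(math.dist(point, q), 0) for q in reference]
    pooled += [(math.dist(point, q), 1) for q in others]
    pooled.sort()

    best, ones = 0.0, 0
    for k, (_, label) in enumerate(pooled[:k_max], start=1):
        ones += label
        # How surprising is it to see at least `ones` test labels in k flips?
        best = max(best, -math.log(binom_upper_tail(ones, k, p)))
    return best
```

A point inside a compact cluster that exists only in the test set should receive a much larger score than a typical background point, because its nearest neighbours are almost all test-set members.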
-
Least-Squares Khatri-Rao Factorization of a Polynomial Matrix
Authors:
Faizan A. Khattak,
Fazal-E-Asim,
Stephan Weiss,
Andre L. F. de Almeida
Abstract:
The Khatri-Rao product is extensively used in array processing, tensor decomposition, and multi-way data analysis. Many applications require a least-squares (LS) Khatri-Rao factorization. In broadband sensor array problems, polynomial matrices effectively model frequency-dependent behaviors, necessitating extensions of conventional linear algebra techniques. This paper generalizes LS Khatri-Rao factorization from ordinary to polynomial matrices by applying it to the discrete Fourier transform (DFT) samples of polynomial matrices. Phase coherence across bin-wise Khatri-Rao factors is ensured via a phase-smoothing algorithm. The proposed method is validated through broadband angle-of-arrival (AoA) estimation for uniform planar arrays (UPAs), where the steering matrix is a polynomial matrix that can be represented as a Khatri-Rao product between the steering matrices in the azimuth and elevation directions.
Submitted 29 March, 2025;
originally announced March 2025.
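For an ordinary (non-polynomial) matrix, the LS Khatri-Rao factorization mentioned in the abstract above is commonly computed column by column with a rank-one SVD. The NumPy sketch below shows that textbook approach for a single matrix, such as one DFT bin. The function names `khatri_rao` and `ls_khatri_rao_factorize` are hypothetical, and the sketch does not include the paper's polynomial extension or its phase-smoothing step.

```python
import numpy as np


def khatri_rao(A, B):
    """Column-wise Kronecker product: column r equals kron(A[:, r], B[:, r])."""
    I, R = A.shape
    J, R_b = B.shape
    if R != R_b:
        raise ValueError("A and B must have the same number of columns")
    return np.einsum("ir,jr->ijr", A, B).reshape(I * J, R)


def ls_khatri_rao_factorize(X, I, J):
    """Least-squares factors A (I x R), B (J x R) such that X ~ khatri_rao(A, B).

    Each column of X is reshaped into an I x J matrix whose best rank-one
    approximation (leading singular triplet) gives a_r b_r^T.
    """
    R = X.shape[1]
    A = np.zeros((I, R), dtype=complex)
    B = np.zeros((J, R), dtype=complex)
    for r in range(R):
        M = X[:, r].reshape(I, J)          # M[i, j] = x[i * J + j]
        U, s, Vh = np.linalg.svd(M, full_matrices=False)
        scale = np.sqrt(s[0])              # split the singular value evenly
        A[:, r] = scale * U[:, 0]
        B[:, r] = scale * Vh[0, :]
    return A, B
```

Each recovered column pair is determined only up to a complex scalar (a_r can be multiplied by c and b_r divided by c without changing the product). When the factorization is repeated independently in every frequency bin, this ambiguity is presumably why some form of phase alignment across bins is needed.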
-
Neural Bayes inference for complex bivariate extremal dependence models
Authors:
Lídia M. André,
Jennifer L. Wadsworth,
Raphaël Huser
Abstract:
Likelihood-free approaches are appealing for performing inference on complex dependence models, either because it is not possible to formulate a likelihood function, or its evaluation is very computationally costly. This is the case for several models available in the multivariate extremes literature, particularly for the most flexible tail models, including those that interpolate between the two key dependence classes of `asymptotic dependence' and `asymptotic independence'. We focus on approaches that leverage neural networks to approximate Bayes estimators. In particular, we explore the properties of neural Bayes estimators for parameter inference for several flexible but computationally expensive models to fit, with a view to aiding their routine implementation. Owing to the absence of likelihood evaluation in the inference procedure, classical information criteria such as the Bayesian information criterion cannot be used to select the most appropriate model. Instead, we propose using neural networks as neural Bayes classifiers for model selection. Our goal is to provide a toolbox for simple, fast fitting and comparison of complex extreme-value dependence models, where the best model is selected for a given data set and its parameters subsequently estimated using neural Bayes estimation. We apply our classifiers and estimators to analyse the pairwise extremal behaviour of changes in horizontal geomagnetic field fluctuations at three different locations.
Submitted 29 March, 2025;
originally announced March 2025.
-
Improved Motion Plane Adaptive 360-Degree Video Compression Using Affine Motion Models
Authors:
Marina Ritthaler,
Andy Regensky,
André Kaup
Abstract:
Efficient compression of 360-degree video content requires the application of advanced motion models for interframe prediction. The Motion Plane Adaptive (MPA) motion model projects the frames on multiple perspective planes in the 3D space. It improves the motion compensation by estimating the motion on those planes with a translational diamond search. In this work, we enhance this motion model with an affine parameterization and motion estimation method. Thereby, we find a feasible trade-off between the quality of the reconstructed frames and the computational cost. The affine motion estimation is hereby done with the inverse compositional Lucas-Kanade algorithm. With the proposed method, it is possible to improve the motion compensation significantly, so that the motion compensated frame has a Weighted-to-Spherically-uniform Peak Signal-to-Noise Ratio (WS-PSNR) which is about 1.6 dB higher than with the conventional MPA. In a basic video codec, the improved inter prediction can lead to Bjøntegaard Delta (BD) rate savings between 9 % and 35 % depending on the block size (BS) and number of motion parameters.
Submitted 29 March, 2025;
originally announced March 2025.
-
AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion
Authors:
Liuyue Xie,
Jiancong Guo,
Ozan Cakmakci,
Andre Araujo,
Laszlo A. Jeni,
Zhiheng Jia
Abstract:
Accurate camera calibration is a fundamental task for 3D perception, especially when dealing with real-world, in-the-wild environments where complex optical distortions are common. Existing methods often rely on pre-rectified images or calibration patterns, which limits their applicability and flexibility. In this work, we introduce a novel framework that addresses these challenges by jointly modeling camera intrinsic and extrinsic parameters using a generic ray camera model. Unlike previous approaches, AlignDiff shifts focus from semantic to geometric features, enabling more accurate modeling of local distortions. We propose AlignDiff, a diffusion model conditioned on geometric priors, enabling the simultaneous estimation of camera distortions and scene geometry. To enhance distortion prediction, we incorporate edge-aware attention, focusing the model on geometric features around image edges, rather than semantic content. Furthermore, to enhance generalizability to real-world captures, we incorporate a large database of ray-traced lenses containing over three thousand samples. This database characterizes the distortion inherent in a diverse variety of lens forms. Our experiments demonstrate that the proposed method significantly reduces the angular error of estimated ray bundles by ~8.2 degrees and improves overall calibration accuracy, outperforming existing approaches on challenging, real-world datasets.
Submitted 27 March, 2025;
originally announced March 2025.
-
Convergence of a Stochastic Particle System to the Continuous Generalized Exchange-Driven Growth Model
Authors:
Chun Yin Lam,
André Schlichting
Abstract:
The continuous generalized exchange-driven growth model (CGEDG) is a system of integro-differential equations describing the evolution of cluster mass under mass exchange. The rate of exchange depends on the masses of the clusters involved and the mass being exchanged. This can be viewed as both a continuous generalization of the exchange-driven growth model and a coagulation-fragmentation equation that generalizes the continuous Smoluchowski equation.
Starting from a Markov jump process that describes a finite stochastic interacting particle system with exchange dynamics, we prove the weak law of large numbers for this process for sublinearly growing kernels in the mean-field limit. We establish the tightness of the stochastic process on a measure-valued Skorokhod space induced by the $1$-Wasserstein metric, from which we deduce the existence of solutions to the (CGEDG) system. The solution is shown to have a Lebesgue density under suitable assumptions on the initial data. Moreover, within the class of solutions with density, we establish the uniqueness under slightly more restrictive conditions on the kernel.
Submitted 31 May, 2025; v1 submitted 27 March, 2025;
originally announced March 2025.
-
Long-Baseline Atom Interferometry
Authors:
Antun Balaz,
Diego Blas,
Oliver Buchmueller,
Sergio Calatroni,
Laurentiu-Ioan Caramete,
David Cerdeno,
Maria Luisa Chiofalo,
Fabio Di Pumpo,
Goran Djordjevic,
John Ellis,
Pierre Fayet,
Chris Foot,
Naceur Gaaloul,
Susan Gardner,
Barry M Garraway,
Alexandre Gauguet,
Enno Giese,
Jason M. Hogan,
Onur Hosten,
Alex Kehagias,
Eva Kilian,
Tim Kovachy,
Carlos Lacasta,
Marek Lewicki,
Elias Lopez Asamar
, et al. (28 additional authors not shown)
Abstract:
Long-baseline atom interferometry is a promising technique for probing various aspects of fundamental physics, astrophysics and cosmology, including searches for ultralight dark matter (ULDM) and for gravitational waves (GWs) in the frequency range around 1~Hz that is not covered by present and planned detectors using laser interferometry. The MAGIS detector is under construction at Fermilab, as is the MIGA detector in France. The PX46 access shaft to the LHC has been identified as a very suitable site for an atom interferometer of height $\sim 100$~m; sites at the Boulby mine in the UK and the Canfranc Laboratory are also under investigation, and possible sites for km-class detectors have been suggested. The Terrestrial Very-Long-Baseline Atom Interferometry (TVLBAI) Proto-Collaboration proposes a coordinated programme of interferometers of increasing baselines.
Submitted 6 April, 2025; v1 submitted 27 March, 2025;
originally announced March 2025.
-
Less Noise, More Signal: DRR for Better Optimizations of SE Tasks
Authors:
Andre Lustosa,
Tim Menzies
Abstract:
SE analytics problems do not always need complex AI. Better and faster solutions can sometimes be obtained by matching the complexity of the problem to the complexity of the solution. This paper introduces the Dimensionality Reduction Ratio (DRR), a new metric for predicting when lightweight algorithms suffice. Analyzing SE optimization problems from software configuration to process decisions and open-source project health, we show that DRR pinpoints "simple" tasks where costly methods like DEHB (a state-of-the-art evolutionary optimizer) are overkill. For high-DRR problems, simpler methods can be just as effective and run two orders of magnitude faster.
Submitted 26 March, 2025;
originally announced March 2025.
-
Causal consistency requirements for gravity-induced entanglement in near-relativistic systems with internal energy
Authors:
Linda M. van Manen,
M. Kemal Döner,
André Großardt
Abstract:
We reconsider a thought experiment that employs the entanglement of the gravitational field with position space quantum states as a means for faster-than-light signaling. We present a protocol that includes the excitation to a higher internal energy level to increase sensitivity to gravitational phase shifts. We report that the explanations why previous versions of the thought experiment remain causally consistent are insufficient to avoid any possibility for faster-than-light signals in this case. An alternative resolution to prevent faster-than-light signaling is most reasonably the requirement for a (near) relativistic treatment. One such effect could be a decoherence channel unobserved in a nonrelativistic treatment.
Submitted 26 March, 2025;
originally announced March 2025.
-
A brief introduction to fluctuation theorems: from theory to experiments
Authors:
Thalyta T. Martins,
André H. A. Malavazi,
Lucas P. Kamizaki,
Artyom Petrosyan,
Benjamin Besga,
Sergio Ciliberto,
Sérgio R. Muniz
Abstract:
Thermodynamics is a fundamental branch of physics, and over the years, it has evolved to include mesoscopic and out-of-equilibrium systems driven by theoretical and experimental advances at the micro- and nanoscale. This development has led to \textit{stochastic thermodynamics}, a framework that connects microscopic fluctuations with macroscopic laws. Despite their significance, fundamental ideas, such as fluctuation theorems, are frequently not covered in current curricula, leaving them largely unknown across many disciplines. Here, we present the core results of stochastic thermodynamics, particularly the Jarzynski equality and the Crooks theorem, using an integrated approach combining theoretical foundations with experimental verification using optical tweezers. This approach helps to clarify the fundamentals, linking theoretical ideas to real elements in the lab, showing the simplicity of the apparatus, and presenting detailed procedures for calibration and calculation of relevant quantities to enable the implementation of these experiments in new research and teaching laboratories. The goal is to enrich thermodynamics education and to stimulate exploration in this evolving field.
Submitted 26 March, 2025;
originally announced March 2025.
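For reference, the two fluctuation theorems named in the abstract above are usually stated as follows. The notation is an assumption, since the listing gives no formulas: $W$ is the work done on the system in one realization of the protocol, $\Delta F$ is the equilibrium free-energy difference between the final and initial states, $\beta = 1/(k_B T)$, and $P_F$, $P_R$ are the work distributions of the forward and time-reversed protocols.

```latex
% Jarzynski equality: an average over repeated nonequilibrium realizations
\left\langle e^{-\beta W} \right\rangle = e^{-\beta \Delta F}

% Crooks fluctuation theorem: ratio of forward and reverse work distributions
\frac{P_F(W)}{P_R(-W)} = e^{\beta \left( W - \Delta F \right)}

% Consequence of the Jarzynski equality via Jensen's inequality
\langle W \rangle \ge \Delta F
```

Multiplying the Crooks relation by $P_R(-W)\,e^{-\beta W}$ and integrating over $W$ recovers the Jarzynski equality, so the second statement implies the first.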
-
Robust Federated Learning Against Poisoning Attacks: A GAN-Based Defense Framework
Authors:
Usama Zafar,
André Teixeira,
Salman Toor
Abstract:
Federated Learning (FL) enables collaborative model training across decentralized devices without sharing raw data, but it remains vulnerable to poisoning attacks that compromise model integrity. Existing defenses often rely on external datasets or predefined heuristics (e.g. number of malicious clients), limiting their effectiveness and scalability. To address these limitations, we propose a privacy-preserving defense framework that leverages a Conditional Generative Adversarial Network (cGAN) to generate synthetic data at the server for authenticating client updates, eliminating the need for external datasets. Our framework is scalable, adaptive, and seamlessly integrates into FL workflows. Extensive experiments on benchmark datasets demonstrate its robust performance against a variety of poisoning attacks, achieving high True Positive Rate (TPR) and True Negative Rate (TNR) of malicious and benign clients, respectively, while maintaining model accuracy. The proposed framework offers a practical and effective solution for securing federated learning systems.
Submitted 26 March, 2025;
originally announced March 2025.
-
An electron-hadron collider at the high-luminosity LHC
Authors:
Kevin David J André,
Bernhard Holzer,
Laurent Forthomme,
Krzysztof Piotrzkowski
Abstract:
We discuss a concept of a lower-energy version of the Large Hadron-electron Collider (LHeC), delivering electron-hadron collisions concurrently to the hadron-hadron collisions at the high-luminosity LHC at CERN. Assuming the use of a 20 GeV electron Energy Recovery Linac (ERL), we describe the optimised beam dynamics, accelerator technologies, and detector constraints required for such a "phase-one" LHeC. Finally, we also discuss the ERL configurations, the possibility of delivering electron-hadron collisions during the planned LHC Run5 and briefly outline the scientific potential of this proposal.
Submitted 26 March, 2025;
originally announced March 2025.
-
Thin-film superconducting NbTi microwave resonators for cryogenic thermometry
Authors:
André Chatel,
Roberto Russo,
Luca Mazzone,
Quentin Boinay,
Reza Farsi,
Jürgen Brugger,
Giovanni Boero,
Hernan Furci
Abstract:
Superconducting microwave resonators have recently gained primary importance in the development of cryogenic applications, such as circuit quantum electrodynamics, electron spin resonance spectroscopy and particle detection for high-energy physics and astrophysics. In this work, we investigate the influence of the film thickness on the temperature response of microfabricated Nb50Ti50 superconducting resonators. S-shaped split ring resonators (S-SRRs), 20 nm to 150 nm thick, are designed to be electromagnetically coupled with standard Cu coplanar waveguides (CPWs) and their microwave properties are characterized at temperatures below 10 K. The combined contributions of the kinetic inductance LK(T) increase and the decreasing loaded quality factor QL induce an optimum condition on the temperature sensitivity and resolution of the resonators, for thinner films. A noise equivalent temperature (NET) as low as 0.5 uK/Hz^(1/2), at 1 Hz, is reported for 100 nm thick resonators at 4.2 K. We also assess the possibility of implementing a multiplexed frequency readout, allowing for the simultaneous temperature tracking of several sensors along a single CPW. Such results demonstrate the possibility to perform a distributed cryogenic temperature monitoring, with a sub-uK resolution. In such a perspective, superconducting S-SRRs, eventually benefiting from an even higher LK(T), might be exploited to accurately monitor the on-chip temperature of devices operating in cryogenic conditions.
Submitted 26 March, 2025;
originally announced March 2025.
-
Gemma 3 Technical Report
Authors:
Gemma Team,
Aishwarya Kamath,
Johan Ferret,
Shreya Pathak,
Nino Vieillard,
Ramona Merhej,
Sarah Perrin,
Tatiana Matejovicova,
Alexandre Ramé,
Morgane Rivière,
Louis Rouillard,
Thomas Mesnard,
Geoffrey Cideron,
Jean-bastien Grill,
Sabela Ramos,
Edouard Yvinec,
Michelle Casbon,
Etienne Pot,
Ivo Penchev,
Gaël Liu,
Francesco Visin,
Kathleen Kenealy,
Lucas Beyer,
Xiaohai Zhai,
Anton Tsitsulin
, et al. (191 additional authors not shown)
Abstract:
We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context - at least 128K tokens. We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achieved by increasing the ratio of local to global attention layers, and keeping the span on local attention short. The Gemma 3 models are trained with distillation and achieve superior performance to Gemma 2 for both pre-trained and instruction finetuned versions. In particular, our novel post-training recipe significantly improves the math, chat, instruction-following and multilingual abilities, making Gemma3-4B-IT competitive with Gemma2-27B-IT and Gemma3-27B-IT comparable to Gemini-1.5-Pro across benchmarks. We release all our models to the community.
Submitted 25 March, 2025;
originally announced March 2025.
-
Prospects and Opportunities with an upgraded FASER Neutrino Detector during the HL-LHC era: Input to the EPPSU
Authors:
FASER Collaboration,
Roshan Mammen Abraham,
Xiaocong Ai,
Saul Alonso-Monsalve,
John Anders,
Claire Antel,
Akitaka Ariga,
Tomoko Ariga,
Jeremy Atkinson,
Florian U. Bernlochner,
Tobias Boeckh,
Jamie Boyd,
Lydia Brenner,
Angela Burger,
Franck Cadoux,
Roberto Cardella,
David W. Casper,
Charlotte Cavanagh,
Xin Chen,
Dhruv Chouhan,
Sebastiani Christiano,
Andrea Coccaro,
Stephane Débieux,
Monica D'Onofrio,
Ansh Desai
, et al. (93 additional authors not shown)
Abstract:
The FASER experiment at CERN has opened a new window in collider neutrino physics by detecting TeV-energy neutrinos produced in the forward direction at the LHC. Building on this success, this document outlines the scientific case and design considerations for an upgraded FASER neutrino detector to operate during LHC Run 4 and beyond. The proposed detector will significantly enhance the neutrino physics program by increasing event statistics, improving flavor identification, and enabling precision measurements of neutrino interactions at the highest man-made energies. Key objectives include measuring neutrino cross sections, probing proton structure and forward QCD dynamics, testing lepton flavor universality, and searching for beyond-the-Standard Model physics. Several detector configurations are under study, including high-granularity scintillator-based tracking calorimeters, high-precision silicon tracking layers, and advanced emulsion-based detectors for exclusive event reconstruction. These upgrades will maximize the physics potential of the HL-LHC, contribute to astroparticle physics and QCD studies, and serve as a stepping stone toward future neutrino programs at the Forward Physics Facility.
Submitted 25 March, 2025;
originally announced March 2025.
-
Leveraging Cognitive States for Adaptive Scaffolding of Understanding in Explanatory Tasks in HRI
Authors:
André Groß,
Birte Richter,
Bjarne Thomzik,
Britta Wrede
Abstract:
Understanding how scaffolding strategies influence human understanding in human-robot interaction is important for developing effective assistive systems. This empirical study investigates linguistic scaffolding strategies based on negation, an important means that de-biases the user from potential errors but increases processing costs, and on hesitations, a means to ameliorate processing costs. In an adaptive strategy, the user state with respect to the current state of understanding and processing capacity was estimated via a scoring scheme based on task performance, prior scaffolding strategy, and current eye gaze behavior. In the study, the adaptive strategy of providing negations and hesitations was compared with a non-adaptive strategy of providing only affirmations. The adaptive scaffolding strategy was generated using the computational model SHIFT. Our findings indicate that using adaptive scaffolding strategies with SHIFT tends to (1) increase processing costs, as reflected in longer reaction times, but (2) improve task understanding, evidenced by a lower error rate of almost 23%. We assessed the efficiency of SHIFT's selected scaffolding strategies across different cognitive states, finding that in three out of five states, the error rate was lower compared to the baseline condition. We discuss how these results align with the assumptions of the SHIFT model and highlight areas for refinement. Moreover, we demonstrate how scaffolding strategies, such as negation and hesitation, contribute to more effective human-robot explanatory dialogues.
Submitted 25 March, 2025;
originally announced March 2025.
-
Perturbative aspects of the supersymmetric three-dimensional massive QED
Authors:
A. C. Lehum,
J. R. Nascimento,
A. C. Pina Neto,
A. Yu. Petrov
Abstract:
We perform the study of perturbative aspects of a three-dimensional supersymmetric Maxwell-Chern-Simons-Proca theory minimally coupled to scalar superfields. Using the superfield formalism, we derive the propagators for both gauge and matter superfields and compute the leading quantum corrections to the effective action. The presence of the Proca-like term explicitly breaks gauge invariance, modifying the structure of the gauge superfield propagator and leading to an essentially new form of quantum contributions in comparison with the usual QED. We analyze the Feynman diagrams that contribute to the quadratic part of the effective action, obtaining corrections to both the kinetic and mass terms of the scalar superfields. Furthermore, we discuss the UV behavior of the model, considering its renormalization properties and the possibility of perturbative finiteness to all loop orders, similar to supersymmetric QED$_3$. Finally, we highlight potential applications of this model in condensed matter systems and possible connections with modified supersymmetric electrodynamics and dualities in lower-dimensional theories.
Submitted 24 May, 2025; v1 submitted 25 March, 2025;
originally announced March 2025.
-
Average consensus with resilience and privacy guarantees without losing accuracy
Authors:
Guilherme Ramos,
Daniel Silvestre,
André M. H. Teixeira,
Sérgio Pequito
Abstract:
This paper addresses the challenge of achieving private and resilient average consensus among a group of discrete-time networked agents without compromising accuracy. State-of-the-art solutions to attain privacy and resilient consensus entail an explicit trade-off between the two with an implicit compromise on accuracy. In contrast, in the present work, we propose a methodology that avoids trade-offs between privacy, resilience, and accuracy. We design a methodology that, under certain conditions, enables non-faulty agents, i.e., agents complying with the established protocol, to reach average consensus in the presence of faulty agents, while keeping the non-faulty agents' initial states private. For privacy, agents strategically add noise to obscure their original state, while later withdrawing a function of it to ensure accuracy. Besides, and unlike many consensus methods, our approach does not require each agent to compute the left-eigenvector of the dynamics matrix associated with the eigenvalue one. Moreover, the proposed framework has a polynomial time complexity relative to the number of agents and the maximum quantity of faulty agents. Finally, we illustrate our method with examples covering diverse faulty agents scenarios.
Submitted 25 March, 2025;
originally announced March 2025.