-
One Giant Leap for Womankind: First Menstrual Cups Tested in Space Flight Conditions
Authors:
Ligia F. Coelho,
Catarina Miranda,
Joao Canas,
Miguel Morgado,
Diogo Nunes,
Andre F. Henriques,
Adam B. Langeveld
Abstract:
In the early days of space exploration, when Sally Ride was offered 100 tampons for a week-long mission, menstrual medical devices first began to be used in space conditions. Since then, hormonal menstrual suppression has become the preferred method for managing menstruation in space, offering significant advantages. However, this is not an option for astronauts who choose to menstruate. The lack of sustainable menstrual technologies will pose challenges for long-duration missions to the Moon and Mars, where astronauts may spend years in space. The AstroCup mission represents the first effort to test menstrual cups in spaceflight, evaluating their durability and functionality. Through material integrity tests and functional assessments using a rheological analogue of human blood, we demonstrate the resilience of menstrual cups and discuss their implications for sustainable menstrual management in future lunar and Martian missions.
Submitted 25 June, 2025;
originally announced June 2025.
-
Sparse Infrared Spectroscopy for Detection of Volatile Organic Compounds
Authors:
Mira Welner,
Andre Hazbun,
Thomas Beechem
Abstract:
To reduce the complexity of infrared spectroscopy hardware while maintaining performance, a data-informed, task-specific spectral collection approach termed Sparse Infrared Spectroscopy (SIRS) is developed. Using a numerical virtual experiment built on a quantitatively accurate infrared database, non-negative matrix factorization is used to identify the spectral pass bands of a minimal number of filters necessary to identify volatile organic compounds (VOCs) within either an inert background or a mixture of gases. The data-driven approach is found capable of identifying contaminants at the 1-10 parts per million (ppm) level with $\sim 20$-$50$ spectral samples, as opposed to the more than 1,000 typical of a traditional infrared spectrum. Reasonably robust to both noise and the characteristics of the base compound in a mixture, the task-specific spectral sampling points to simplified hardware design that maintains performance.
Submitted 22 June, 2025;
originally announced June 2025.
-
The mass of the exo-Venus Gliese 12 b, as revealed by HARPS-N, ESPRESSO, and CARMENES
Authors:
Daisy A. Turner,
Yoshi Nike Emilia Eschen,
Felipe Murgas,
Annelies Mortier,
Thomas G Wilson,
Jorge Fernández Fernández,
Giuseppe Morello,
Shreyas Vissapragada,
José A. Caballero,
Stefan Dreizler,
Jo Ann Egger,
Alix Violet Freckelton,
Nicole Gromek,
Artie P. Hatzes,
Ben Scott Lakeland,
Evangelos Nagel,
Luca Naponiello,
Hugo M. Tabernero,
Siegfried Vanaverbeke,
Alexander Venner,
María Rosa Zapatero Osorio,
Pedro J. Amado,
Víctor J. S. Béjar,
Aldo Stefano Bonomo,
Lars A. Buchhave
, et al. (38 additional authors not shown)
Abstract:
Small temperate planets are prime targets for exoplanet studies due to their possible similarities with the rocky planets in the Solar System. M dwarfs are promising hosts since the planetary signals are within our current detection capabilities. Gliese 12 b is a Venus-sized temperate planet orbiting a quiet M dwarf. We present here the first precise mass measurement of this small exoplanet. We performed a detailed analysis using HARPS-N, ESPRESSO, and CARMENES radial velocities, along with new and archival TESS, CHEOPS, and MuSCAT2/3 photometry data. From fitting the available data, we find that the planet has a radius of $R_\mathrm{p} = 0.904^{+0.037}_{-0.034} \,\mathrm{R_\oplus}$ and a mass of $M_\mathrm{p} = 0.95^{+0.26}_{-0.27} \,\mathrm{M_\oplus}$ (a $3.6σ$ measurement of the semi-amplitude $K=0.70^{+0.19}_{-0.20}\,\mathrm{m\,s^{-1}}$), and is on an orbit with a period of $12.761421 \pm 0.000047\,\mathrm{d}$. A variety of techniques were utilised to attenuate stellar activity signals. Gliese 12 b has an equilibrium temperature of $T_\mathrm{eq}=315 \pm 7\,\mathrm{K}$, assuming an albedo of zero, and a density consistent with that of Earth and Venus ($ρ_\mathrm{p}=7.0^{+2.3}_{-2.1}\,\mathrm{g\,cm^{-3}}$). We find that Gliese 12 b has a predominantly rocky interior and simulations indicate that it is unlikely to have retained any of its primordial gaseous envelope. The bulk properties of Gliese 12 b place it in an extremely sparsely-populated region of both mass--radius and density--$T_\mathrm{eq}$ parameter space, making it a prime target for follow-up observations, including Lyman-$α$ studies.
Submitted 25 June, 2025;
originally announced June 2025.
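The bulk density quoted in the abstract can be cross-checked from the quoted mass and radius. The following is a minimal sketch, not the authors' analysis: it assumes a uniform sphere, plugs in only the median values from the abstract (ignoring the uncertainties), and uses nominal Earth constants. The function name `bulk_density` is hypothetical.

```python
import math

# Nominal Earth values (assumed here for the unit conversion).
EARTH_MASS_G = 5.972e27     # grams
EARTH_RADIUS_CM = 6.371e8   # centimetres


def bulk_density(mass_earth: float, radius_earth: float) -> float:
    """Mean density in g/cm^3 of a sphere given in Earth masses and Earth radii."""
    mass_g = mass_earth * EARTH_MASS_G
    radius_cm = radius_earth * EARTH_RADIUS_CM
    volume_cm3 = 4.0 / 3.0 * math.pi * radius_cm ** 3
    return mass_g / volume_cm3


# Median mass and radius of Gliese 12 b as quoted in the abstract.
rho = bulk_density(0.95, 0.904)
print(f"point-estimate density: {rho:.2f} g/cm^3")
```

The point estimate lands close to the quoted $7.0\,\mathrm{g\,cm^{-3}}$; a small offset is expected because the abstract's value comes from the full fit rather than from the ratio of medians.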
-
On plane cycles in geometric multipartite graphs
Authors:
Marco Ricci,
Jonathan Rollin,
André Schulz,
Alexandra Weinberger
Abstract:
A geometric graph is a drawing of a graph in the plane where the vertices are drawn as points in general position and the edges as straight-line segments connecting their endpoints. It is plane if it contains no crossing edges. We study plane cycles in geometric complete multipartite graphs. We prove that if a geometric complete multipartite graph contains a plane cycle of length $t$, with $t \geq 6$, it also contains a smaller plane cycle of length at least $\lfloor t/2\rfloor + 1$. We further give a characterization of geometric complete multipartite graphs that contain plane cycles with a color class appearing at least twice. For geometric drawings of $K_{n,n}$, we give a sufficient condition under which they have, for each $s \leq n$, a plane cycle of length $2s$. We also provide an algorithm to decide whether a given geometric drawing of $K_{n,n}$ contains a plane Hamiltonian cycle in time $O(n \log n + nk^2) + O(k^{5k})$, where $k$ is the number of vertices inside the convex hull of all vertices. Finally, we prove that it is NP-complete to decide if a subset of edges of a geometric complete bipartite graph $H$ is contained in a plane Hamiltonian cycle in $H$.
Submitted 25 June, 2025;
originally announced June 2025.
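The definition of a plane cycle in a geometric complete multipartite graph can be made concrete with a small checker. This is only an illustrative sketch of the definition, not any algorithm from the paper: it assumes points in general position, tests every pair of non-adjacent cycle edges with the standard orientation predicate, and requires consecutive vertices to lie in different color classes. The names `orient`, `segments_cross`, and `is_plane_cycle` are hypothetical.

```python
def orient(a, b, c):
    """Twice the signed area of triangle abc (positive if counter-clockwise)."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def segments_cross(p, q, r, s):
    """True if segments pq and rs cross properly (general position assumed)."""
    return (orient(p, q, r) * orient(p, q, s) < 0
            and orient(r, s, p) * orient(r, s, q) < 0)


def is_plane_cycle(points, colors):
    """Check that the closed polygon through `points` is a plane cycle.

    Consecutive vertices must have different colors (so every edge exists in
    the complete multipartite graph) and no two edges may cross.
    """
    n = len(points)
    if n < 3 or len(colors) != n:
        return False
    if any(colors[i] == colors[(i + 1) % n] for i in range(n)):
        return False
    edges = [(points[i], points[(i + 1) % n]) for i in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            # Edges that share an endpoint cannot cross properly; skip them.
            if j == i + 1 or (i == 0 and j == n - 1):
                continue
            if segments_cross(*edges[i], *edges[j]):
                return False
    return True


square = [(0, 0), (1, 0), (1, 1), (0, 1)]
bowtie = [(0, 0), (1, 1), (1, 0), (0, 1)]
print(is_plane_cycle(square, ["red", "blue", "red", "blue"]))
print(is_plane_cycle(bowtie, ["red", "blue", "red", "blue"]))
```

The quadratic pairwise test is chosen for clarity; it is not meant to reflect the running times stated in the abstract.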
-
Evolving resilience in bus routes: A proposed method for maximizing accessibility under uncertainty scenarios using a genetic algorithm
Authors:
Andre Borgato Morelli,
André Luiz Cunha
Abstract:
Resilience has raised interest in transport planning as rare phenomena, such as fuel supply crises, have recently shown their potential to destabilize transport systems. However, the proposed methods for planning resilience in transit systems fail to consider the impact that bus frequency has on user accessibility. To address this gap, this paper proposes a bus allocation method aimed at maximizing accessibility in impact scenarios - where some bus routes have their frequency reduced - making use of a genetic algorithm. The method is applied in the city of São Paulo and the results show that evolving the system foreseeing moderate impacts not only contributes to reducing the negative effects of lower route frequency, but also improves its efficiency in normal conditions, showing the importance of the contribution of this research to the planning of efficient systems.
Submitted 25 June, 2025;
originally announced June 2025.
-
Bridging Compositional and Distributional Semantics: A Survey on Latent Semantic Geometry via AutoEncoder
Authors:
Yingji Zhang,
Danilo S. Carvalho,
André Freitas
Abstract:
Integrating compositional and symbolic properties into current distributional semantic spaces can enhance the interpretability, controllability, compositionality, and generalisation capabilities of Transformer-based auto-regressive language models (LMs). In this survey, we offer a novel perspective on latent space geometry through the lens of compositional semantics, a direction we refer to as semantic representation learning. This direction enables a bridge between symbolic and distributional semantics, helping to mitigate the gap between them. We review and compare three mainstream autoencoder architectures, namely the Variational AutoEncoder (VAE), the Vector Quantised VAE (VQVAE), and the Sparse AutoEncoder (SAE), and examine the distinctive latent geometries they induce in relation to semantic structure and interpretability.
Submitted 24 June, 2025;
originally announced June 2025.
-
Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic Study
Authors:
Yingji Zhang,
Marco Valentino,
Danilo S. Carvalho,
André Freitas
Abstract:
Incorporating explicit reasoning rules within the latent space of language models (LMs) offers a promising pathway to enhance generalisation, interpretability, and controllability. While current Transformer-based language models have shown strong performance on Natural Language Inference (NLI) tasks, they often rely on memorisation rather than rule-based inference. This work investigates how reasoning rules can be explicitly embedded and memorised within the LMs through Language Variational Autoencoders (VAEs). We propose a complete pipeline for learning reasoning rules within Transformer-based language VAEs. This pipeline encompasses three rule-based reasoning tasks, a supporting theoretical framework, and a practical end-to-end architecture. The experiments illustrate the following findings. Disentangled reasoning: under explicit signal supervision, reasoning rules, viewed as functional mappings, can be disentangled within the encoder's parametric space. This separation results in distinct clustering of rules in the output feature space. Prior knowledge injection: injecting reasoning information into the Query enables the model to more effectively retrieve the stored Value from memory based on the Key. This approach offers a simple method for integrating prior knowledge into decoder-only language models. Performance bottleneck: in mathematical reasoning tasks using Qwen2.5 (0.5B), increasing the sample count does not improve performance beyond a certain point. Moreover, FFN layers are better than attention layers at preserving the separation of reasoning rules in the model's parameters.
Submitted 24 June, 2025;
originally announced June 2025.
-
Shaping Boundaries to Control and Transport Topological Defects in Colloidal Nematic Liquid Crystals
Authors:
Gerardo Campos-Villalobos,
André F. V. Matias,
Ethan I. L. Jull,
Lisa Tran,
Marjolein Dijkstra
Abstract:
Anisotropic rod-like particles form liquid crystalline phases with varying degrees of orientational and translational order. When confined geometrically, these phases can give rise to topological defects, which can be selected and controlled by tuning how the rods align near boundaries, known as anchoring. While anchoring in molecular liquid crystals can be controlled through surface functionalization, this approach is not easily applicable to microscale colloidal systems, which have so far been limited to planar anchoring. Here, using particle-based simulations, Landau-de Gennes theory, and experiments on colloidal rods, we demonstrate that topographical patterning of the boundary can effectively control the anchoring type and, in turn, the defect state in two-dimensional confined nematics. Building on this, we show that dynamically shape-shifting the boundaries can transform and transport topological defects, enabling the design of liquid crystal analogs for binary information storage.
Submitted 23 June, 2025;
originally announced June 2025.
-
Sizing Antenna Arrays for Near-field Communication and Sensing
Authors:
Marcin Wachowiak,
André Bourdoux,
Sofie Pollin
Abstract:
This paper presents key performance metrics for near-field communication and sensing systems with a focus on their scaling behavior as a function of the antenna array aperture. Analytical expressions are derived for several standard array geometries to enable the design of large antenna arrays for given system requirements. First, the near-field beam focusing is analyzed and the minimum beamdepth is observed to rapidly saturate to a low asymptotic limit as the array aperture increases. In contrast, the near-field region span is shown to scale quadratically with the array aperture. Based on these two metrics, the maximum number of resolvable beamspots at 3 dB separation is derived analytically, exhibiting a linear dependence on the array aperture. Finally, the number of significant singular values of a channel observed at the array's broadside is estimated, showing a power-law dependence on the aperture. The resulting expressions provide practical design guidelines for evaluating aperture requirements in near-field communication and sensing applications.
Submitted 23 June, 2025;
originally announced June 2025.
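The quadratic scaling of the near-field span with aperture can be illustrated with the classical Fraunhofer boundary $d_F = 2D^2/\lambda$. This is a hedged sketch only: it assumes the textbook Fraunhofer distance as the outer edge of the near field, which may differ from the exact metric derived in the paper, and the aperture and wavelength values below are arbitrary illustrations. The function name `fraunhofer_distance` is hypothetical.

```python
def fraunhofer_distance(aperture_m: float, wavelength_m: float) -> float:
    """Classical far-field boundary 2 * D^2 / lambda, in metres."""
    return 2.0 * aperture_m ** 2 / wavelength_m


wavelength = 0.01  # metres (illustrative value, roughly a 30 GHz carrier)
for aperture in (0.5, 1.0, 2.0):
    distance = fraunhofer_distance(aperture, wavelength)
    print(f"aperture {aperture:.1f} m -> near field extends to about {distance:.0f} m")
```

Doubling the aperture quadruples the boundary distance, which is the quadratic dependence the abstract refers to.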
-
NIKA2 Cosmological Legacy Survey: Blind detection of galaxy clusters in the COSMOS field via the Sunyaev-Zel'dovich effect
Authors:
D. Chérouvrier,
J. F. Macias-Perez,
F. X. Désert,
R. Adam,
P. Ade,
H. Ajeddig,
S. Amarantidis,
P. André,
H. Aussel,
R. Barrena,
A. Beelen,
A. Benoit,
S. Berta,
M. Béthermin,
A. Bongiovanni,
J. Bounmy,
O. Bourrion,
L. -J. Bing,
M. Calvo,
A. Catalano,
M. De Petris,
S. Doyle,
E. F. C. Driessen,
G. Ejlali,
A. Ferragamo
, et al. (37 additional authors not shown)
Abstract:
(Abridged) Clusters of galaxies, formed in the latest stages of structure formation, are unique cosmological probes. With the advent of large CMB surveys like those from the Planck satellite and the ACT and SPT telescopes, we now have access to a large number of galaxy clusters detected at millimeter wavelengths via the thermal Sunyaev-Zel'dovich (tSZ) effect. Nevertheless, it is interesting to complement them with high-angular-resolution (tens of arcseconds) observations to target the lowest-mass and highest-redshift clusters. This is the case of observations with the NIKA2 camera, which is installed on the IRAM 30-m telescope in Pico Veleta, Spain. We used the existing 150 GHz (2 mm) data from the NIKA2 Cosmological Legacy Survey (N2CLS) Large Program to blindly search for galaxy clusters in the well-known COSMOS field, across an 877 arcmin$^2$ region centered on (R.A., Dec.)$_{J2000}$ = (10h00m28.81s, +02d17m30.44s). We first developed a dedicated data reduction pipeline to construct NIKA2 maps at 2 mm. We then used a matched-filter algorithm to extract cluster candidates, assuming a universal pressure profile to model the expected cluster tSZ signal. We computed the purity and completeness of the sample by applying the same algorithm to simulated maps of the sky signal in the COSMOS field. We find a total of 16 cluster candidates at S/N > 4, of which eight have either an optical or X-ray cluster (or group of galaxies) counterpart. This is the first blind detection of clusters of galaxies at mm wavelengths at 18" angular resolution. From this analysis, we confirm that NIKA2 and the IRAM 30-m telescope should be sensitive to low-mass clusters at intermediate and high redshift, complementing current and planned large tSZ-based cluster surveys.
Submitted 22 June, 2025;
originally announced June 2025.
-
Variational quantum algorithms with exact geodesic transport
Authors:
André J. Ferreira-Martins,
Renato M. S. Farias,
Giancarlo Camilo,
Thiago O. Maciel,
Allan Tosta,
Ruge Lin,
Abdulla Alhajri,
Tobias Haug,
Leandro Aolita
Abstract:
Variational quantum algorithms (VQAs) are promising candidates for near-term applications of quantum computers, but their training represents a major challenge in practice. We introduce exact-geodesic VQAs, a curvature-aware framework that enables analytic Riemannian optimization of variational quantum circuits through a convenient choice of circuit ansatz. Our method exploits the exact metric to find a parameter optimization path based on exact geodesic transport with conjugate gradients (EGT-CG). This supersedes the quantum natural gradient method, in fact recovering it as its first-order approximation. Further, the exact-geodesic updates for our circuit ansatz have the same measurement cost as standard gradient descent. This contrasts with previous metric-aware methods, which require resource-intensive estimations of the metric tensor using quantum hardware. In numerical simulations for electronic structure problems of up to 14 spin-orbitals, our framework allows us to achieve up to a 20x reduction in the number of iterations over Adam or quantum natural gradient methods. Moreover, for degenerate cases, which are notoriously difficult to optimize with conventional methods, we achieve rapid convergence to the global minima. Our work demonstrates that the cost of VQA optimization can be drastically reduced by harnessing the Riemannian geometry of the manifold expressed by the circuit ansatz, with potential implications at the interface between quantum machine learning, differential geometry, and optimal control theory.
Submitted 20 June, 2025;
originally announced June 2025.
-
Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs
Authors:
Ricardo Rei,
Nuno M. Guerreiro,
José Pombal,
João Alves,
Pedro Teixeirinha,
Amin Farajian,
André F. T. Martins
Abstract:
Fine-tuning pretrained LLMs has been shown to be an effective strategy for reaching state-of-the-art performance on specific tasks like machine translation. However, this process of adaptation often implies sacrificing general-purpose capabilities, such as conversational reasoning and instruction-following, hampering the utility of the system in real-world applications that require a mixture of skills. In this paper, we introduce Tower+, a suite of models designed to deliver strong performance across both translation and multilingual general-purpose text capabilities. We achieve a Pareto frontier between translation specialization and multilingual general-purpose capabilities by introducing a novel training recipe that builds on Tower (Alves et al., 2024), comprising continued pretraining, supervised fine-tuning, preference optimization, and reinforcement learning with verifiable rewards. At each stage of training, we carefully generate and curate data to strengthen performance on translation as well as general-purpose tasks involving code generation, mathematics problem solving, and general instruction-following. We develop models at multiple scales: 2B, 9B, and 72B. Our smaller models often outperform larger general-purpose open-weight and proprietary LLMs (e.g., Llama 3.3 70B, GPT-4o). Our largest model delivers best-in-class translation performance for high-resource languages and top results in multilingual Arena Hard evaluations and in IF-MT, a benchmark we introduce for evaluating both translation and instruction-following. Our findings highlight that it is possible to rival frontier models in general capabilities, while optimizing for specific business domains, such as translation and localization.
Submitted 20 June, 2025;
originally announced June 2025.
-
Instituto de Telecomunicações at IWSLT 2025: Aligning Small-Scale Speech and Language Models for Speech-to-Text Learning
Authors:
Giuseppe Attanasio,
Sonal Sannigrahi,
Ben Peters,
André F. T. Martins
Abstract:
This paper presents the IT-IST submission to the IWSLT 2025 Shared Task on Instruction Following Speech Processing. We submit results for the Short Track, i.e., speech recognition, translation, and spoken question answering. Our model is a unified speech-to-text model that integrates a pre-trained continuous speech encoder and text decoder through a first phase of modality alignment and a second phase of instruction fine-tuning. Crucially, we focus on using small-scale language model backbones (< 2B) and restrict to high-quality, CC-BY data along with synthetic data generation to supplement existing resources.
Submitted 20 June, 2025;
originally announced June 2025.
-
Long-Context Generalization with Sparse Attention
Authors:
Pavlo Vasylenko,
Marcos Treviso,
André F. T. Martins
Abstract:
Transformer-based architectures traditionally employ softmax to compute attention weights, which produces dense distributions over all tokens in a sequence. While effective in many settings, this density has been shown to be detrimental for tasks that demand precise focus on fixed-size patterns: as sequence length increases, non-informative tokens accumulate attention probability mass, leading to dispersion and representational collapse. We show in this paper that sparse attention mechanisms using $α$-entmax can avoid these issues, due to their ability to assign exact zeros to irrelevant tokens. Furthermore, we introduce Adaptive-Scalable Entmax (ASEntmax), which endows $α$-entmax with a learnable temperature parameter, allowing the attention distribution to interpolate between sparse (pattern-focused) and dense (softmax-like) regimes. Finally, we show that the ability to locate and generalize fixed-size patterns can be further improved through a careful design of position encodings, which impacts both dense and sparse attention methods. By integrating ASEntmax into standard transformer layers alongside proper positional encodings, we show that our models greatly outperform softmax, scalable softmax, and fixed-temperature $α$-entmax baselines on long-context generalization.
Submitted 24 June, 2025; v1 submitted 19 June, 2025;
originally announced June 2025.
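The contrast between dense softmax weights and sparse attention weights with exact zeros can be sketched with sparsemax, which is the $α = 2$ special case of $α$-entmax. This is a minimal pure-Python illustration under that assumption; it does not implement the paper's ASEntmax or its learnable temperature, and the score values are made up for the example.

```python
import math


def softmax(scores):
    """Dense attention weights: every token receives some probability mass."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sparsemax(scores):
    """Euclidean projection onto the probability simplex (alpha = 2 entmax).

    Scores below the threshold tau receive a weight of exactly zero.
    """
    ordered = sorted(scores, reverse=True)
    running_sum = 0.0
    tau = 0.0
    for k, value in enumerate(ordered, start=1):
        running_sum += value
        # The support condition holds for a prefix of the sorted scores,
        # so the last k that satisfies it determines the threshold.
        if 1.0 + k * value > running_sum:
            tau = (running_sum - 1.0) / k
    return [max(s - tau, 0.0) for s in scores]


scores = [2.0, 1.0, -1.0]
print("softmax  :", [round(w, 3) for w in softmax(scores)])
print("sparsemax:", sparsemax(scores))
```

With these scores, softmax leaves nonzero mass on every token, whereas sparsemax puts all the mass on the highest-scoring token and assigns exact zeros to the rest.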
-
Can GPT-4o Evaluate Usability Like Human Experts? A Comparative Study on Issue Identification in Heuristic Evaluation
Authors:
Guilherme Guerino,
Luiz Rodrigues,
Bruna Capeleti,
Rafael Ferreira Mello,
André Freire,
Luciana Zaina
Abstract:
Heuristic evaluation is a widely used method in Human-Computer Interaction (HCI) to inspect interfaces and identify issues based on heuristics. Recently, Large Language Models (LLMs), such as GPT-4o, have been applied in HCI to assist in persona creation, the ideation process, and the analysis of semi-structured interviews. However, considering the need to understand heuristics and the high degree of abstraction required to evaluate them, LLMs may have difficulty conducting heuristic evaluation. Moreover, prior research has not investigated GPT-4o's performance in heuristic evaluation compared to HCI experts in web-based systems. In this context, this study aims to compare the results of a heuristic evaluation performed by GPT-4o and human experts. To this end, we selected a set of screenshots from a web system and asked GPT-4o to perform a heuristic evaluation based on Nielsen's Heuristics from a literature-grounded prompt. Our results indicate that only 21.2% of the issues identified by human experts were also identified by GPT-4o, although it found 27 new issues. We also found that GPT-4o performed better for heuristics related to aesthetic and minimalist design and the match between the system and the real world, whereas it had difficulty identifying issues in heuristics related to flexibility, control, and user efficiency. Additionally, we noticed that GPT-4o generated several false positives due to hallucinations and attempts to predict issues. Finally, we highlight five takeaways for the conscious use of GPT-4o in heuristic evaluations.
Submitted 19 June, 2025;
originally announced June 2025.
-
Local Routing on Ordered $Θ$-graphs
Authors:
André van Renssen,
Shuei Sakaguchi
Abstract:
The problem of locally routing on geometric networks using limited memory is extensively studied in computational geometry. We consider one particular graph, the ordered $Θ$-graph, which is significantly harder to route on than the $Θ$-graph, for which a number of routing algorithms are known. Currently, no local routing algorithm is known for the ordered $Θ$-graph.
We prove that, unfortunately, there does not exist a deterministic memoryless local routing algorithm that works on the ordered $Θ$-graph. This motivates us to consider allowing a small amount of memory, and we present a deterministic $O(1)$-memory local routing algorithm that successfully routes from the source to the destination on the ordered $Θ$-graph. We show that our local routing algorithm converges to the destination in $O(n)$ hops, where $n$ is the number of vertices. To the best of our knowledge, our algorithm is the first deterministic local routing algorithm that is guaranteed to reach the destination on the ordered $Θ$-graph.
Submitted 19 June, 2025;
originally announced June 2025.
-
Scalable quantum current source on commercial 22-nm CMOS process technology
Authors:
Ajit Dash,
Suyash Pati Tripathi,
Dimitrios Georgakopoulos,
MengKe Feng,
Steve Yianni,
Ensar Vahapoglu,
Md Mamunur Rahman,
Shai Bonen,
Owen Brace,
Jonathan Y. Huang,
Wee Han Lim,
Kok Wai Chan,
Will Gilbert,
Arne Laucht,
Andrea Morello,
Andre Saraiva,
Christopher C. Escott,
Sorin P. Voinigescu,
Andrew S. Dzurak,
Tuomo Tanttu
Abstract:
Utilizing quantum effects in nanoscopic devices has in the past mostly been accessible through academic cleanrooms and research foundries. Opening the quantum frontier for wider industrial applications likely requires the scale of well-established complementary metal-oxide-semiconductor (CMOS) foundries for manufacturing transistor-based quantum devices operable above subkelvin temperatures. Here, we operate a commercial 22-nm-node fully depleted silicon-on-insulator (FDSOI) CMOS device as dual parallel-connected charge pumps for the implementation of a quantum current standard in the International System of Units (SI). We measure an accuracy of $(1.2 \pm 0.1) \times 10^{-3}$ A/A for this scalable architecture at 50 MHz with reference to SI-traceable voltage and resistance standards in a pumped helium system. Looking ahead, we propose a practical monolithic CMOS chip that incorporates one million parallel-connected charge pumps along with on-chip control electronics. This can be operated as a table-top primary standard, generating quantum currents up to microampere levels.
Submitted 18 June, 2025;
originally announced June 2025.
-
Approximate Ricci-flat Metrics for Calabi-Yau Manifolds
Authors:
Seung-Joo Lee,
Andre Lukas
Abstract:
We outline a method to determine analytic Kähler potentials with associated approximately Ricci-flat Kähler metrics on Calabi-Yau manifolds. Key ingredients are numerically calculating Ricci-flat Kähler potentials via machine learning techniques and fitting the numerical results to Donaldson's Ansatz. We apply this method to the Dwork family of quintic hypersurfaces in $\mathbb{P}^4$ and an analogous one-parameter family of bi-cubic CY hypersurfaces in $\mathbb{P}^2\times\mathbb{P}^2$. In each case, a relatively simple analytic expression is obtained for the approximately Ricci-flat Kähler potentials, including the explicit dependence on the complex structure parameter. We find that these Kähler potentials only depend on the modulus of the complex structure parameter.
Submitted 18 June, 2025;
originally announced June 2025.
-
Creating User-steerable Projections with Interactive Semantic Mapping
Authors:
Artur André Oliveira,
Mateus Espadoto,
Roberto Hirata Jr.,
Roberto M. Cesar Jr.,
Alex C. Telea
Abstract:
Dimensionality reduction (DR) techniques map high-dimensional data into lower-dimensional spaces. Yet, current DR techniques are not designed to explore semantic structure that is not directly available in the form of variables or class labels. We introduce a novel user-guided projection framework for image and text data that enables customizable, interpretable, data visualizations via zero-shot classification with Multimodal Large Language Models (MLLMs). We enable users to steer projections dynamically via natural-language guiding prompts, to specify high-level semantic relationships of interest to the users which are not explicitly present in the data dimensions. We evaluate our method across several datasets and show that it not only enhances cluster separation, but also transforms DR into an interactive, user-driven process. Our approach bridges the gap between fully automated DR techniques and human-centered data exploration, offering a flexible and adaptive way to tailor projections to specific analytical needs.
Submitted 18 June, 2025;
originally announced June 2025.
-
Overdense fireworks in GOODS-N: Unveiling a record number of massive dusty star forming galaxies at z$\sim$5.2 with the N2CLS
Authors:
G. Lagache,
M. Xiao,
A. Beelen,
S. Berta,
L. Ciesla,
R. Neri,
R. Pello,
R. Adam,
P. Ade,
H. Ajeddig,
S. Amarantidis,
P. André,
H. Aussel,
A. Benoît,
M. Béthermin,
L. -J. Bing,
A. Bongiovanni,
J. Bounmy,
O. Bourrion,
M. Calvo,
A. Catalano,
D. Chérouvrier,
U. Chowdhury,
M. De Petris,
F. -X. Désert
, et al. (37 additional authors not shown)
Abstract:
As part of the N2CLS Survey, we have identified a remarkable overdensity of ten bright dusty star-forming galaxies at z$\sim$5.2 in the GOODS-N field. Three of these galaxies, N2GN_1_01, 06, and 23 (known as GN10, HDF850.1, and S3, respectively), had previously been spectroscopically confirmed as members of the exceptional large-scale structure at z$\sim$5.1-5.3, which is notably elongated along the line of sight, spanning 30 cMpc. We present the spectroscopic confirmation of N2GN_1_13 at z$_{\rm spec}$=5.182, a massive dusty star-forming galaxy identified through targeted NOEMA observations, and N2GN_1_61 at z$_{\rm spec}$=5.201, revealed using JWST/FRESCO data. In addition to these five spectroscopically confirmed members, we identify five further candidates with photometric redshifts consistent with the overdense structure. These galaxies are massive (with a median stellar mass of 10$^{11}$ M$_{\odot}$) and highly obscured (with a median A$_V$ of 2.9), caught in a short-lived yet extreme starburst phase at z$\sim$5.2. Their high SFRs (with a median of 680 M$_{\odot}$ yr$^{-1}$), efficient baryon to stellar mass conversion ($ε_{\star}>$20%), substantial gas reservoir and dust content, suggest rapid evolution and imminent quenching. Six of these galaxies reside in overdense filaments, while the remaining four may trace new distinct structures which will have to be spectroscopically confirmed. These few dusty galaxies dominate the star formation within the overdensity, contributing more than the numerous H$_α$ emitters, and surpassing the cosmic average star formation rate density for this epoch. Their properties suggest an accelerated evolution that current models and simulations have difficulty reproducing.
Submitted 18 June, 2025;
originally announced June 2025.
-
Compact representation and long-time extrapolation of real-time data for quantum systems
Authors:
Andre Erpenbeck,
Yuanran Zhu,
Yang Yu,
Lei Zhang,
Richard Gerum,
Olga Goulko,
Chao Yang,
Guy Cohen,
Emanuel Gull
Abstract:
Representing real-time data as a sum of complex exponentials provides a compact form that enables both denoising and extrapolation. As a fully data-driven method, the Estimation of Signal Parameters via Rotational Invariance Techniques (ESPRIT) algorithm is agnostic to the underlying physical equations, making it broadly applicable to various observables and experimental or numerical setups. In this work, we consider applications of the ESPRIT algorithm primarily to extend real-time dynamical data from simulations of quantum systems. We evaluate ESPRIT's performance in the presence of noise and compare it to other extrapolation methods. We demonstrate its ability to extract information from short-time dynamics to reliably predict long-time behavior and determine the minimum time interval required for accurate results. We discuss how this insight can be leveraged in numerical methods that propagate quantum systems in time, and show how ESPRIT can predict infinite-time values of dynamical observables, offering a purely data-driven approach to characterizing quantum phases.
Submitted 16 June, 2025;
originally announced June 2025.
-
Global hypoellipticity on time-periodic Gelfand-Shilov spaces via non-discrete Fourier analysis
Authors:
André Pedroso Kowacs,
Pedro Meyer Tokoro
Abstract:
In this paper, we provide a characterization of the time-periodic Gelfand-Shilov spaces, as introduced by F. de Ávila Silva and M. Cappiello [J. Funct. Anal., 282(9):29, 2022], through the asymptotic behaviour of both the Euclidean and periodic partial Fourier transforms of their elements. As an application, we establish necessary and sufficient conditions for global regularity -- within this framework -- for a broad class of constant-coefficient differential operators, as well as for first-order tube-type operators.
Submitted 16 June, 2025;
originally announced June 2025.
-
An Interdisciplinary Approach to Human-Centered Machine Translation
Authors:
Marine Carpuat,
Omri Asscher,
Kalika Bali,
Luisa Bentivogli,
Frédéric Blain,
Lynne Bowker,
Monojit Choudhury,
Hal Daumé III,
Kevin Duh,
Ge Gao,
Alvin Grissom II,
Marzena Karpinska,
Elaine C. Khoong,
William D. Lewis,
André F. T. Martins,
Mary Nurminen,
Douglas W. Oard,
Maja Popovic,
Michel Simard,
François Yvon
Abstract:
Machine Translation (MT) tools are widely used today, often in contexts where professional translators are not present. Despite progress in MT technology, a gap persists between system development and real-world usage, particularly for non-expert users who may struggle to assess translation reliability. This paper advocates for a human-centered approach to MT, emphasizing the alignment of system design with diverse communicative goals and contexts of use. We survey the literature in Translation Studies and Human-Computer Interaction to recontextualize MT evaluation and design to address the diverse real-world scenarios in which MT is used today.
Submitted 16 June, 2025;
originally announced June 2025.
-
Fast and Furious Symmetric Learning in Zero-Sum Games: Gradient Descent as Fictitious Play
Authors:
John Lazarsfeld,
Georgios Piliouras,
Ryann Sim,
Andre Wibisono
Abstract:
This paper investigates the sublinear regret guarantees of two non-no-regret algorithms in zero-sum games: Fictitious Play, and Online Gradient Descent with constant stepsizes. In general adversarial online learning settings, both algorithms may exhibit instability and linear regret due to no regularization (Fictitious Play) or small amounts of regularization (Gradient Descent). However, their ability to obtain tighter regret bounds in two-player zero-sum games is less understood. In this work, we obtain strong new regret guarantees for both algorithms on a class of symmetric zero-sum games that generalize the classic three-strategy Rock-Paper-Scissors to a weighted, n-dimensional regime. Under symmetric initializations of the players' strategies, we prove that Fictitious Play with any tiebreaking rule has $O(\sqrt{T})$ regret, establishing a new class of games for which Karlin's Fictitious Play conjecture holds. Moreover, by leveraging a connection between the geometry of the iterates of Fictitious Play and Gradient Descent in the dual space of payoff vectors, we prove that Gradient Descent, for almost all symmetric initializations, obtains a similar $O(\sqrt{T})$ regret bound when its stepsize is a sufficiently large constant. For Gradient Descent, this establishes the first "fast and furious" behavior (i.e., sublinear regret without time-vanishing stepsizes) for zero-sum games larger than 2x2.
Submitted 16 June, 2025;
originally announced June 2025.
-
Semi-Blind Channel Estimation for Downlink Communications Based on Dynamic Metasurface Antennas
Authors:
Amarilton L. Magalhães,
André L. F. de Almeida,
A. Lee Swindlehurst
Abstract:
Dynamic metasurface antennas (DMAs) are emerging as a promising technology to enable energy-efficient, large array-based multi-antenna systems. This paper presents a simple channel estimation scheme for the downlink of a multiple-input single-output orthogonal frequency division multiplexing (MISO-OFDM) communication system exploiting DMAs. The proposed scheme extracts separate estimates of the wireless channel and the unknown waveguide propagation vector using a simple iterative algorithm based on the parallel factor (PARAFAC) decomposition. Obtaining decoupled estimates of the wireless channel and inner waveguide vector enables the isolation and compensation for its effect when designing the DMA beamformer, regardless of the wireless channel state, which evolves much faster due to its shorter coherence time and bandwidth. Additionally, our solution operates in a data-aided manner, delivering estimates of useful data symbols jointly with channel estimates, without requiring sequential pilot and data stages. To the best of our knowledge, this is the first work to explore this CE approach. Numerical results corroborate the notable performance of the proposed scheme.
Submitted 14 June, 2025;
originally announced June 2025.
-
Feeling Machines: Ethics, Culture, and the Rise of Emotional AI
Authors:
Vivek Chavan,
Arsen Cenaj,
Shuyuan Shen,
Ariane Bar,
Srishti Binwani,
Tommaso Del Becaro,
Marius Funk,
Lynn Greschner,
Roberto Hung,
Stina Klein,
Romina Kleiner,
Stefanie Krause,
Sylwia Olbrych,
Vishvapalsinhji Parmar,
Jaleh Sarafraz,
Daria Soroko,
Daksitha Withanage Don,
Chang Zhou,
Hoang Thuy Duong Vu,
Parastoo Semnani,
Daniel Weinhardt,
Elisabeth Andre,
Jörg Krüger,
Xavier Fresquet
Abstract:
This paper explores the growing presence of emotionally responsive artificial intelligence through a critical and interdisciplinary lens. Bringing together the voices of early-career researchers from multiple fields, it explores how AI systems that simulate or interpret human emotions are reshaping our interactions in areas such as education, healthcare, mental health, caregiving, and digital life. The analysis is structured around four central themes: the ethical implications of emotional AI, the cultural dynamics of human-machine interaction, the risks and opportunities for vulnerable populations, and the emerging regulatory, design, and technical considerations. The authors highlight the potential of affective AI to support mental well-being, enhance learning, and reduce loneliness, as well as the risks of emotional manipulation, over-reliance, misrepresentation, and cultural bias. Key challenges include simulating empathy without genuine understanding, encoding dominant sociocultural norms into AI systems, and insufficient safeguards for individuals in sensitive or high-risk contexts. Special attention is given to children, elderly users, and individuals with mental health challenges, who may interact with AI in emotionally significant ways. However, there remains a lack of cognitive or legal protections which are necessary to navigate such engagements safely. The report concludes with ten recommendations, including the need for transparency, certification frameworks, region-specific fine-tuning, human oversight, and longitudinal research. A curated supplementary section provides practical tools, models, and datasets to support further work in this domain.
Submitted 14 June, 2025;
originally announced June 2025.
-
Enhancing Privacy: The Utility of Stand-Alone Synthetic CT and MRI for Tumor and Bone Segmentation
Authors:
André Ferreira,
Kunpeng Xie,
Caroline Wilpert,
Gustavo Correia,
Felix Barajas Ordonez,
Tiago Gil Oliveira,
Maike Bode,
Robert Siepmann,
Frank Hölzle,
Rainer Röhrig,
Jens Kleesiek,
Daniel Truhn,
Jan Egger,
Victor Alves,
Behrus Puladi
Abstract:
AI requires extensive datasets, while medical data is subject to high data protection. Anonymization is essential, but poses a challenge for some regions, such as the head, as identifying structures overlap with regions of clinical interest. Synthetic data offers a potential solution, but studies often lack rigorous evaluation of realism and utility. Therefore, we investigate to what extent synthetic data can replace real data in segmentation tasks. We employed head and neck cancer CT scans and brain glioma MRI scans from two large datasets. Synthetic data were generated using generative adversarial networks and diffusion models. We evaluated the quality of the synthetic data using MAE, MS-SSIM, Radiomics and a Visual Turing Test (VTT) performed by 5 radiologists and their usefulness in segmentation tasks using DSC. Radiomics indicates high fidelity of synthetic MRIs, but the generative models fall short in producing highly realistic CT tissue, with correlation coefficients of 0.8784 and 0.5461 for MRI and CT tumors, respectively. DSC results indicate limited utility of synthetic data: tumor segmentation achieved DSC=0.064 on CT and 0.834 on MRI, while bone segmentation achieved a mean DSC=0.841. A relation between DSC and correlation is observed, but is limited by the complexity of the task. VTT results show synthetic CTs' utility, but with limited educational applications. Synthetic data can be used independently for the segmentation task, although limited by the complexity of the structures to segment. Advancing generative models to better tolerate heterogeneous inputs and learn subtle details is essential for enhancing their realism and expanding their application potential.
Submitted 13 June, 2025;
originally announced June 2025.
-
The Amazon Nova Family of Models: Technical Report and Model Card
Authors:
Amazon AGI,
Aaron Langford,
Aayush Shah,
Abhanshu Gupta,
Abhimanyu Bhatter,
Abhinav Goyal,
Abhinav Mathur,
Abhinav Mohanty,
Abhishek Kumar,
Abhishek Sethi,
Abi Komma,
Abner Pena,
Achin Jain,
Adam Kunysz,
Adam Opyrchal,
Adarsh Singh,
Aditya Rawal,
Adok Achar Budihal Prasad,
Adrià de Gispert,
Agnika Kumar,
Aishwarya Aryamane,
Ajay Nair,
Akilan M,
Akshaya Iyengar,
Akshaya Vishnu Kudlu Shanbhogue
, et al. (761 additional authors not shown)
Abstract:
We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents and text. Amazon Nova Micro is a text-only model that delivers our lowest-latency responses at very low cost. Amazon Nova Canvas is an image generation model that creates professional grade images with rich customization controls. Amazon Nova Reel is a video generation model offering high-quality outputs, customization, and motion control. Our models were built responsibly and with a commitment to customer trust, security, and reliability. We report benchmarking results for core capabilities, agentic performance, long context, functional adaptation, runtime performance, and human evaluation.
Submitted 17 March, 2025;
originally announced June 2025.
-
Evaluating Privacy-Utility Tradeoffs in Synthetic Smart Grid Data
Authors:
Andre Catarino,
Rui Melo,
Rui Abreu,
Luis Cruz
Abstract:
The widespread adoption of dynamic Time-of-Use (dToU) electricity tariffs requires accurately identifying households that would benefit from such pricing structures. However, the use of real consumption data poses serious privacy concerns, motivating the adoption of synthetic alternatives. In this study, we conduct a comparative evaluation of four synthetic data generation methods, Wasserstein-GP Generative Adversarial Networks (WGAN), Conditional Tabular GAN (CTGAN), Diffusion Models, and Gaussian noise augmentation, under different synthetic regimes. We assess classification utility, distribution fidelity, and privacy leakage. Our results show that architectural design plays a key role: diffusion models achieve the highest utility (macro-F1 up to 88.2%), while CTGAN provides the strongest resistance to reconstruction attacks. These findings highlight the potential of structured generative models for developing privacy-preserving, data-driven energy systems.
Submitted 20 May, 2025;
originally announced June 2025.
-
Beyond Gold Standards: Epistemic Ensemble of LLM Judges for Formal Mathematical Reasoning
Authors:
Lan Zhang,
Marco Valentino,
Andre Freitas
Abstract:
Autoformalization plays a crucial role in formal mathematical reasoning by enabling the automatic translation of natural language statements into formal languages. While recent advances using large language models (LLMs) have shown promising results, methods for automatically evaluating autoformalization remain underexplored. As one moves to more complex domains (e.g., advanced mathematics), human evaluation requires significant time and domain expertise, especially as the complexity of the underlying statements and background knowledge increases. LLM-as-a-judge presents a promising approach for automating such evaluation. However, existing methods typically employ coarse-grained and generic evaluation criteria, which limit their effectiveness for advanced formal mathematical reasoning, where quality hinges on nuanced, multi-granular dimensions. In this work, we take a step toward addressing this gap by introducing a systematic, automatic method to evaluate autoformalization tasks. The proposed method is based on an epistemically and formally grounded ensemble (EFG) of LLM judges, defined on criteria encompassing logical preservation (LP), mathematical consistency (MC), formal validity (FV), and formal quality (FQ), resulting in a transparent assessment that accounts for different contributing factors. We validate the proposed framework to serve as a proxy for autoformalization assessment within the domain of formal mathematics. Overall, our experiments demonstrate that the EFG ensemble of LLM judges is a suitable emerging proxy for evaluation, more strongly correlating with human assessments than a coarse-grained model, especially when assessing formal qualities. These findings suggest that LLM-as-judges, especially when guided by a well-defined set of atomic properties, could offer a scalable, interpretable, and reliable support for evaluating formal mathematical reasoning.
Submitted 12 June, 2025;
originally announced June 2025.
-
Predicting air flow in calendered paper sheets from $μ$-CT data: combining physics with morphology
Authors:
Phillip Gräfensteiner,
Andoni Rodriguez,
Peter Leitl,
Ekaterina Baikova,
Maximilian Fuchs,
Eduardo Machado Charry,
Ulrich Hirn,
André Hilger,
Ingo Manke,
Robert Schennach,
Matthias Neumann,
Volker Schmidt,
Karin Zojer
Abstract:
Predicting the macroscopic properties of thin fiber-based porous materials from their microscopic morphology remains challenging because of the structural heterogeneity of these materials. In this study, computational fluid dynamics simulations were performed to compute volume air flow based on tomographic image data of uncompressed and compressed paper sheets. To reduce computational demands, a pore network model was employed, allowing volume air flow to be approximated with less computational effort. To improve prediction accuracy, geometric descriptors of the pore space, such as porosity, surface area, median pore radius, and geodesic tortuosity, were combined with predictions of the pore network model. This integrated approach significantly improves the predictive power of the pore network model and indicates which aspects of the pore space morphology are not accurately represented within the pore network model. In particular, we illustrate that a high correlation among descriptors does not necessarily imply redundancy in a combined prediction.
Submitted 12 June, 2025;
originally announced June 2025.
-
Size-adaptive Hypothesis Testing for Fairness
Authors:
Antonio Ferrara,
Francesco Cozzi,
Alan Perotti,
André Panisson,
Francesco Bonchi
Abstract:
Determining whether an algorithmic decision-making system discriminates against a specific demographic typically involves comparing a single point estimate of a fairness metric against a predefined threshold. This practice is statistically brittle: it ignores sampling error and treats small demographic subgroups the same as large ones. The problem intensifies in intersectional analyses, where multiple sensitive attributes are considered jointly, giving rise to a larger number of smaller groups. As these groups become more granular, the data representing them becomes too sparse for reliable estimation, and fairness metrics yield excessively wide confidence intervals, precluding meaningful conclusions about potential unfair treatments.
In this paper, we introduce a unified, size-adaptive, hypothesis-testing framework that turns fairness assessment into an evidence-based statistical decision. Our contribution is twofold. (i) For sufficiently large subgroups, we prove a Central-Limit result for the statistical parity difference, leading to analytic confidence intervals and a Wald test whose type-I (false positive) error is guaranteed at level $α$. (ii) For the long tail of small intersectional groups, we derive a fully Bayesian Dirichlet-multinomial estimator; Monte-Carlo credible intervals are calibrated for any sample size and naturally converge to Wald intervals as more data becomes available. We validate our approach empirically on benchmark datasets, demonstrating how our tests provide interpretable, statistically rigorous decisions under varying degrees of data availability and intersectionality.
Submitted 12 June, 2025;
originally announced June 2025.
-
Graph Neural Networks for Automatic Addition of Optimizing Components in Printed Circuit Board Schematics
Authors:
Pascal Plettenberg,
André Alcalde,
Bernhard Sick,
Josephine M. Thomas
Abstract:
The design and optimization of Printed Circuit Board (PCB) schematics is crucial for the development of high-quality electronic devices. An important task in this process is to optimize drafts by adding components that improve the robustness and reliability of the circuit, e.g., pull-up resistors or decoupling capacitors. Since there is a shortage of skilled engineers and manual optimizations are very time-consuming, these best practices are often neglected. However, this typically leads to higher costs for troubleshooting in later development stages as well as shortened product life cycles, resulting in an increased amount of electronic waste that is difficult to recycle. Here, we present an approach for automating the addition of new components into PCB schematics by representing them as bipartite graphs and utilizing a node pair prediction model based on Graph Neural Networks (GNNs). We apply our approach to three highly relevant PCB design optimization tasks and compare the performance of several popular GNN architectures on real-world datasets labeled by human experts. We show that GNNs can solve these problems with high accuracy and demonstrate that our approach offers the potential to automate PCB design optimizations in a time- and cost-efficient manner.
Submitted 12 June, 2025;
originally announced June 2025.
-
Stability analysis of the free-surface Stokes problem and an unconditionally stable explicit scheme
Authors:
Igor Tominec,
Lukas Lundgren,
André Löfgren,
Josefin Ahlkrona
Abstract:
Accurate simulations of ice sheet dynamics, mantle convection, lava flow, and other highly viscous free-surface flows involve solving the coupled Stokes/free-surface equations. In this paper, we theoretically analyze the stability and conservation properties of the weak form of this system for Newtonian fluids and non-Newtonian fluids, at both the continuous and discrete levels. We perform the fully discrete stability analysis for finite element methods used in space with explicit and implicit Euler time-stepping methods used in time. Motivated by the theory, we propose a stabilization term designed for the explicit Euler discretization, which ensures unconditional time stability and permits conservation of the domain volume. Numerical experiments validate and support our theoretical findings.
Submitted 12 June, 2025;
originally announced June 2025.
-
Constructive interference at the edge of quantum ergodic dynamics
Authors:
Dmitry A. Abanin,
Rajeev Acharya,
Laleh Aghababaie-Beni,
Georg Aigeldinger,
Ashok Ajoy,
Ross Alcaraz,
Igor Aleiner,
Trond I. Andersen,
Markus Ansmann,
Frank Arute,
Kunal Arya,
Abraham Asfaw,
Nikita Astrakhantsev,
Juan Atalaya,
Ryan Babbush,
Dave Bacon,
Brian Ballard,
Joseph C. Bardin,
Christian Bengs,
Andreas Bengtsson,
Alexander Bilmes,
Sergio Boixo,
Gina Bortoli,
Alexandre Bourassa,
Jenna Bovaird
, et al. (240 additional authors not shown)
Abstract:
Quantum observables in the form of few-point correlators are the key to characterizing the dynamics of quantum many-body systems. In dynamics with fast entanglement generation, quantum observables generally become insensitive to the details of the underlying dynamics at long times due to the effects of scrambling. In experimental systems, repeated time-reversal protocols have been successfully implemented to restore sensitivities of quantum observables. Using a 103-qubit superconducting quantum processor, we characterize ergodic dynamics using the second-order out-of-time-order correlators, OTOC$^{(2)}$. In contrast to dynamics without time reversal, OTOC$^{(2)}$ are observed to remain sensitive to the underlying dynamics at long time scales. Furthermore, by inserting Pauli operators during quantum evolution and randomizing the phases of Pauli strings in the Heisenberg picture, we observe substantial changes in OTOC$^{(2)}$ values. This indicates that OTOC$^{(2)}$ is dominated by constructive interference between Pauli strings that form large loops in configuration space. The observed interference mechanism endows OTOC$^{(2)}$ with a high degree of classical simulation complexity, which culminates in a set of large-scale OTOC$^{(2)}$ measurements exceeding the simulation capacity of known classical algorithms. Further supported by an example of Hamiltonian learning through OTOC$^{(2)}$, our results indicate a viable path to practical quantum advantage.
Submitted 11 June, 2025;
originally announced June 2025.
-
Interfacial deformation and energy exchange in strong free-surface turbulence
Authors:
Andre Calado,
Elias Balaras
Abstract:
This study investigates the dynamics of strong free-surface turbulence (SFST) using direct numerical simulations (DNS). We focus on the energy exchange between the deformed free-surface and underlying turbulence, examining the influence of Reynolds ($Re$), Froude ($Fr$), and Weber ($We$) numbers. The two-fluid DNS of SFST at high $Fr$ and $We$ is able to incorporate air entrainment effects in a statistically steady state. Results reveal that high $We$ primarily affects entrained bubble shapes (sphericity), while $Fr$ significantly alters free-surface deformation, two-dimensional compressibility, and turbulent kinetic energy (TKE) modulation. Vorticity flux across the interface occurs from viscous diffusion of surface-parallel structures. At lower $Fr$, kinetic energy is redistributed between horizontal and vertical components, aligning with rapid distortion theory (RDT), whereas higher $Fr$ preserves isotropy near the surface. Evidence of a reverse or dual energy cascade is verified through third-order structure functions, with a strong net reverse cascade near the integral length scale, and enhanced vertical kinetic energy in upwelling eddies. Discrete wavelet transforms (DWT) of TKE show weaker decay at the smallest scales near the interface, suggesting contributions from gravitational energy conversion and reduced dissipation. The wavelet energy spectra also exhibit different scaling laws across the wavenumber range, with a $-3$ slope within the inertial subrange. These findings highlight scale- and proximity-dependent effects on two-phase TKE transport, with implications for sub-grid modeling. This work underscores the need for advanced analytical tools that allow for localization in both physical and spectral domains to further elucidate the complex energy cascade mechanisms in SFST.
Submitted 11 June, 2025;
originally announced June 2025.
-
Precision $e^+e^-$ Hemisphere Masses in the Dijet Region with Power Corrections
Authors:
Andre H. Hoang,
Vicent Mateu,
Matthew D. Schwartz,
Iain W. Stewart
Abstract:
We derive high-precision results for the $e^+e^-$ heavy jet mass (HJM) $d σ/d ρ$ and dihemisphere mass (DHM) $d^2σ/(d s_1 d s_2)$ distributions, for $s_1\sim s_2$, in the dijet region. New results include: i) the N$^3$LL resummation for HJM of large logarithms $\ln^n(ρ)$ at small $ρ$ including the exact two-loop non-global hemisphere soft function, the 4-loop cusp anomalous dimension and the 3-loop hard and jet functions, ii) N$^3$LL results for DHM with resummation of logarithms $\ln(s_{1,2}/Q^2)$ when there is no large separation between $s_1$ and $s_2$, iii) profile functions for HJM to give results simultaneously valid in the peak and tail regions, iv) a complete two-dimensional basis of non-perturbative functions which can be used for double differential observables, that are needed for both HJM and DHM in the peak region, and v) an implementation of renormalon subtractions for large-angle soft radiation to ${\cal O}(α_s^3)$ together with a resummation of the additional large $\ln(Qρ/Λ_{QCD})$ logarithms. Here $Q$ is the $e^+e^-$ center-of-mass energy. Our resummation results are combined with known fixed-order ${\cal O}(α_s^3)$ results and we discuss the convergence and remaining perturbative uncertainty in the cross section. We also prove that, at order $1/Q$, the first moment of the HJM distribution involves an additional non-perturbative parameter compared to the power correction that shifts the tail of the spectrum (where $1\gg ρ\gg Λ_{QCD}/Q$). This differs from thrust where a single non-perturbative parameter at order $1/Q$ describes both the first moment and the tail, and it disfavors models of power corrections employing a single non-perturbative parameter, such as the low-scale effective coupling model. In this paper we focus only on the dijet region, not the far-tail distribution for $ρ\gtrsim 0.2$.
Submitted 10 June, 2025;
originally announced June 2025.
-
The Fourier transform in variable exponent Lebesgue spaces
Authors:
André Pedroso Kowacs,
Wagner Augusto Almeida de Moraes
Abstract:
In this work we define a Fourier transform for each $f\in L^{p(\cdot)}(\mathbb{R})$, for a large class of exponent functions $p(\cdot)$, as the distributional derivative of a Hölder continuous function. A norm is defined in the space of such Fourier transforms so that it is isometrically isomorphic to $L^{p(\cdot)}(\mathbb{R})$. We also prove several properties of this Fourier transform, such as inversion in norm and an exchange theorem.
Submitted 10 June, 2025;
originally announced June 2025.
-
Communicating Through Avatars in Industry 5.0: A Focus Group Study on Human-Robot Collaboration
Authors:
Stina Klein,
Pooja Prajod,
Katharina Weitz,
Matteo Lavit Nicora,
Dimitra Tsovaltzi,
Elisabeth André
Abstract:
The integration of collaborative robots (cobots) in industrial settings raises concerns about worker well-being, particularly due to reduced social interactions. Avatars - designed to facilitate worker interactions and engagement - are promising solutions to enhance the human-robot collaboration (HRC) experience. However, real-world perspectives on avatar-supported HRC remain unexplored. To address this gap, we conducted a focus group study with employees from a German manufacturing company that uses cobots. Before the discussion, participants engaged with a scripted, industry-like HRC demo in a lab setting. This qualitative approach provided valuable insights into the avatar's potential roles, improvements to its behavior, and practical considerations for deploying avatars in industrial workcells. Our findings also emphasize the importance of personalized communication and task assistance. Although our study's limitations restrict its generalizability, it serves as an initial step in recognizing the potential of adaptive, context-aware avatar interactions in real-world industrial environments.
Submitted 10 June, 2025;
originally announced June 2025.
-
Integrating positive energy representations of the Virasoro algebra
Authors:
André G. Henriques,
James E. Tener
Abstract:
We show that every unitary positive energy representation W of the Virasoro algebra exponentiates to a holomorphic *-representation of the semigroup of annuli by bounded operators on the Hilbert space completion of W. We use this to show that every representation of the Virasoro conformal net also carries a representation of the semigroup of annuli of the same kind.
Submitted 10 June, 2025;
originally announced June 2025.
-
The Impact of Feature Scaling In Machine Learning: Effects on Regression and Classification Tasks
Authors:
João Manoel Herrera Pinheiro,
Suzana Vilas Boas de Oliveira,
Thiago Henrique Segreto Silva,
Pedro Antonio Rabelo Saraiva,
Enzo Ferreira de Souza,
Leonardo André Ambrosio,
Marcelo Becker
Abstract:
This research addresses the critical lack of comprehensive studies on feature scaling by systematically evaluating 12 scaling techniques - including several less common transformations - across 14 different Machine Learning algorithms and 16 datasets for classification and regression tasks. We meticulously analyzed impacts on predictive performance (using metrics such as accuracy, MAE, MSE, and $R^2$) and computational costs (training time, inference time, and memory usage). Key findings reveal that while ensemble methods (such as Random Forest and gradient boosting models like XGBoost, CatBoost and LightGBM) demonstrate robust performance largely independent of scaling, other widely used models such as Logistic Regression, SVMs, TabNet, and MLPs show significant performance variations highly dependent on the chosen scaler. This extensive empirical analysis, with all source code, experimental results, and model parameters made publicly available to ensure complete transparency and reproducibility, offers practitioners crucial model-specific guidance on selecting feature scaling techniques.
Submitted 11 June, 2025; v1 submitted 9 June, 2025;
originally announced June 2025.
-
Approximate Axiomatization for Differentially-Defined Functions
Authors:
André Platzer,
Long Qian
Abstract:
This article establishes a complete approximate axiomatization for the real-closed field $\mathbb{R}$ expanded with all differentially-defined functions, including special functions such as $\sin(x), \cos(x), e^x, \dots$. Every true sentence is provable up to some numerical approximation, and the truth of such approximations converges under mild conditions. Such an axiomatization is a fragment of the axiomatization for differential dynamic logic, and is therefore a finite extension of the axiomatization of real-closed fields. Furthermore, the numerical approximations approximate formulas containing special function symbols by $\text{FOL}_{\mathbb{R}}$ formulas, improving upon earlier decidability results only concerning closed sentences.
Submitted 9 June, 2025;
originally announced June 2025.
-
Spatio-Temporal Weak Measurement of Chiral Ultra short Laser Pulse
Authors:
Sahil Sahoo,
Andre Yaroshevsky,
Dima Cheskis,
Yuri Gorodetski
Abstract:
We present a comprehensive study on the spatio-temporal weak measurement of a chiral ultrafast optical pulse. We create a chiral vector wave packet by transmitting an ultrashort laser pulse through a birefringent or magneto-optic medium. Employing time-resolved leakage radiation microscopy, we examine how the real and imaginary components of the weak value parameter ($ε$) influence pulse propagation over time. Our technique allows us to detect and categorize the temporal polarization fluctuation in a $75$ fs pulse with excellent repeatability. The experimental results are consistent with the theoretical predictions.
Submitted 9 June, 2025;
originally announced June 2025.
-
Performance Evaluation of Beyond Diagonal RIS under Hardware Impairments
Authors:
Jose Carlos da Silva Filho,
Josué V. de Araújo,
Bruno Sokal,
André L. F. de Almeida
Abstract:
Beyond diagonal reconfigurable intelligent surface (BD-RIS) improves the traditional reconfigurable intelligent surface (RIS) architecture functionality by interconnecting elements for advanced wave control. However, real-world implementations face hardware imperfections, such as impedance mismatches and varactor nonidealities, which can degrade overall system performance. In this paper, we propose three hardware impairment models that directly affect the BD-RIS scattering matrix structure and evaluate their impact on the channel estimation accuracy using the normalized mean square error (NMSE) as a performance metric. The proposed impairment models consider imperfections affecting self-impedances, mutual impedances, or both. Our results reveal how each impairment type degrades the system performance, allowing us to identify scenarios where the traditional RIS can outperform the BD-RIS.
Submitted 8 June, 2025;
originally announced June 2025.
-
BIMgent: Towards Autonomous Building Modeling via Computer-use Agents
Authors:
Zihan Deng,
Changyu Du,
Stavros Nousias,
André Borrmann
Abstract:
Existing computer-use agents primarily focus on general-purpose desktop automation tasks, with limited exploration of their application in highly specialized domains. In particular, the 3D building modeling process in the Architecture, Engineering, and Construction (AEC) sector involves open-ended design tasks and complex interaction patterns within Building Information Modeling (BIM) authoring software, which has yet to be thoroughly addressed by current studies. In this paper, we propose BIMgent, an agentic framework powered by multimodal large language models (LLMs), designed to enable autonomous building model authoring via graphical user interface (GUI) operations. BIMgent automates the architectural building modeling process, including multimodal input for conceptual design, planning of software-specific workflows, and efficient execution of the authoring GUI actions. We evaluate BIMgent on real-world building modeling tasks, including both text-based conceptual design generation and reconstruction from existing building design. The design quality achieved by BIMgent was found to be reasonable. Its operations achieved a 32% success rate, whereas all baseline models failed to complete the tasks (0% success rate). Results demonstrate that BIMgent effectively reduces manual workload while preserving design intent, highlighting its potential for practical deployment in real-world architectural modeling scenarios.
Submitted 8 June, 2025;
originally announced June 2025.
-
Joint Channel and Symbol Estimation for Communication Systems with Movable Antennas
Authors:
Josué V. de Araújo,
Jose Carlos da Silva Filho,
Gilderlan T. de Araújo,
Paulo R. B. Gomes,
André L. F. de Almeida
Abstract:
Communication systems aided by movable antennas have been the subject of recent research due to the potentially increased spatial degrees of freedom offered by optimizing the antenna positioning at the transmitter and/or receiver. In this context, a topic that deserves attention is channel estimation. Conventional methods reported recently rely on pilot-assisted strategies to estimate the channel coefficients. In this work, we address the joint channel and symbol estimation problem for an uplink multi-user communication system, where the base station is equipped with a movable antenna array. A semi-blind receiver based on the PARAFAC2 model is formulated to exploit the tensor decomposition structure for the received signals, from which channel and symbol estimates can be jointly obtained via an alternating estimation algorithm. Compared with reference schemes, our preliminary numerical simulations yield remarkable results for the proposed method.
Submitted 8 June, 2025;
originally announced June 2025.
-
CTDGSI: A comprehensive exploitation of instance selection methods for automatic text classification. VII Concurso de Teses, Dissertações e Trabalhos de Graduação em SI -- XXI Simpósio Brasileiro de Sistemas de Informação
Authors:
Washington Cunha,
Leonardo Rocha,
Marcos André Gonçalves
Abstract:
Progress in Natural Language Processing (NLP) has been dictated by the rule of more: more data, more computing power and more complexity, best exemplified by the Large Language Models. However, training (or fine-tuning) large dense models for specific applications usually requires significant amounts of computing resources. This Ph.D. dissertation focuses on an under-investigated NLP data engineering technique known as Instance Selection (IS), whose potential is enormous in the current scenario. The goal of IS is to reduce the training set size by removing noisy or redundant instances while maintaining the effectiveness of the trained models and reducing the training process cost. We provide a comprehensive and scientifically sound comparison of IS methods applied to an essential NLP task -- Automatic Text Classification (ATC), considering several classification solutions and many datasets. Our findings reveal a significant untapped potential for IS solutions. We also propose two novel IS solutions that are noise-oriented and redundancy-aware, specifically designed for large datasets and transformer architectures. Our final solution achieved an average reduction of 41% in training sets, while maintaining the same levels of effectiveness in all datasets. Importantly, our solutions demonstrated speedup improvements of 1.67x (up to 2.46x), making them scalable for datasets with hundreds of thousands of documents.
Submitted 8 June, 2025;
originally announced June 2025.
-
Circuit-Based Modeling Approach for Channel Estimation in RIS-Assisted Communications
Authors:
Daniel C. Alcantara,
Daniel V. C. de Oliveira,
Gilderlan T. de Araújo,
Paulo R. B. Gomes,
André L. F. de Almeida
Abstract:
Reconfigurable intelligent surface (RIS) has been explored as a supportive technology for wireless communication since around 2019. While the literature highlights the potential of RIS in different modern applications, two key issues have gained significant attention from the research community: channel estimation and phase shift optimization. The performance gains of RIS-assisted systems rely heavily on optimal phase shifts, which, in turn, depend on accurate channel estimation. Several studies have addressed these challenges under different assumptions. Some works consider a range of continuous phase shifts, while others propose a limited number of discrete phase values for the RIS elements. Many studies present an idealized perspective, whereas others aim to approximate more practical aspects by considering circuit system responses and employing phase shifts derived from a Discrete Fourier Transform (DFT) or other lookup tables. However, to our knowledge, no study has examined the influence of circuit system parameters on channel estimation and subsequent phase shift optimization. This paper models each RIS element as an equivalent resonant circuit composed of resistance, capacitance, and inductance. We propose that resistance and capacitance parameters can be dynamically and independently configured, leading to the formulation of an impedance matrix. Furthermore, we construct a circuit-based RIS phase shift matrix that accounts for the response of the resonant circuit, which changes with variations in the physical parameters of resistance and capacitance. We investigate the impact of this circuit-based RIS phase shift within a tensor-based channel estimation approach. Our results indicate a performance loss compared to ideal scenarios, such as those using the DFT design. However, we found that increasing the training time can mitigate this performance degradation.
Submitted 8 June, 2025;
originally announced June 2025.
-
On the randomized SVD in infinite dimensions
Authors:
Daniel Kressner,
David Persson,
André Uschmajew
Abstract:
Randomized methods, such as the randomized SVD (singular value decomposition) and Nyström approximation, are an effective way to compute low-rank approximations of large matrices. Motivated by applications to operator learning, Boullé and Townsend (FoCM, 2023) recently proposed an infinite-dimensional extension of the randomized SVD for a Hilbert--Schmidt operator $A$ that invokes randomness through a Gaussian process with a covariance operator $K$. While the non-isotropy introduced by $K$ allows one to incorporate prior information on $A$, an unfortunate choice may lead to unfavorable performance and large constants in the error bounds. In this work, we introduce a novel infinite-dimensional extension of the randomized SVD that does not require such a choice and enjoys error bounds that match those for the finite-dimensional case. Moreover, it reflects the common practice of using the randomized SVD with isotropic random vectors, also when approximating discretized operators. In fact, the theoretical results of this work show how the usual randomized SVD applied to a discretization of $A$ approaches our infinite-dimensional extension as the discretization gets refined, both in terms of error bounds and the Wasserstein distance. We also present and analyze a novel extension of the Nyström approximation for self-adjoint positive semi-definite trace class operators.
Submitted 7 June, 2025;
originally announced June 2025.
-
Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding
Authors:
Emmanouil Zaranis,
António Farinhas,
Saul Santos,
Beatriz Canaverde,
Miguel Moura Ramos,
Aditya K Surikuchi,
André Viveiros,
Baohao Liao,
Elena Bueno-Benito,
Nithin Sivakumaran,
Pavlo Vasylenko,
Shoubin Yu,
Sonal Sannigrahi,
Wafaa Mohammed,
Ben Peters,
Danae Sánchez Villegas,
Elias Stengel-Eskin,
Giuseppe Attanasio,
Jaehong Yoon,
Stella Frank,
Alessandro Suglia,
Chrysoula Zerva,
Desmond Elliott,
Mariella Dimiccoli,
Mohit Bansal
, et al. (6 additional authors not shown)
Abstract:
Despite recent progress in vision-language models (VLMs), holistic understanding of long-form video content remains a significant challenge, partly due to limitations in current benchmarks. Many focus on peripheral, "needle-in-a-haystack" details, encouraging context-insensitive retrieval over deep comprehension. Others rely on large-scale, semi-automatically generated questions (often produced by language models themselves) that are easier for models to answer but fail to reflect genuine understanding. In this paper, we introduce MF$^2$, a new benchmark for evaluating whether models can comprehend, consolidate, and recall key narrative information from full-length movies (50-170 minutes long). MF$^2$ includes over 50 full-length, open-licensed movies, each paired with manually constructed sets of claim pairs -- one true (fact) and one plausible but false (fib), totalling over 850 pairs. These claims target core narrative elements such as character motivations and emotions, causal chains, and event order, and refer to memorable moments that humans can recall without rewatching the movie. Instead of multiple-choice formats, we adopt a binary claim evaluation protocol: for each pair, models must correctly identify both the true and false claims. This reduces biases like answer ordering and enables a more precise assessment of reasoning. Our experiments demonstrate that both open-weight and closed state-of-the-art models fall well short of human performance, underscoring the relative ease of the task for humans and their superior ability to retain and reason over critical narrative information -- an ability current VLMs lack.
Submitted 6 June, 2025;
originally announced June 2025.