-
CPN-Py: A Python-Based Tool for Modeling and Analyzing Colored Petri Nets
Authors:
Alessandro Berti,
Wil M. P. van der Aalst
Abstract:
Colored Petri Nets (CPNs) are an established formalism for modeling processes where tokens carry data. Although tools like CPN Tools and CPN IDE excel at CPN-based simulation, they are often separate from modern data science ecosystems. Meanwhile, Python has become the de facto language for process mining, machine learning, and data analytics. In this paper, we introduce CPN-Py, a Python library that faithfully preserves the core concepts of Colored Petri Nets -- including color sets, timed tokens, guard logic, and hierarchical structures -- while providing seamless integration with the Python environment. We discuss its design, highlight its synergy with PM4Py (including stochastic replay, process discovery, and decision mining functionalities), and illustrate how the tool supports state space analysis and hierarchical CPNs. We also outline how CPN-Py accommodates large language models, which can generate or refine CPN models through a dedicated JSON-based format.
Submitted 27 March, 2025;
originally announced June 2025.
-
Multimode and Random-Access Optical Quantum Memory via Adiabatic Phase Imprinting
Authors:
Nasser Gohari Kamel,
Sourabh Kumar,
Ujjwal Gautam,
Erhan Saglamyurek,
Vahid Salari,
Daniel Oblak
Abstract:
A photonic quantum memory capable of simultaneously storing multiple qubits and subsequently recalling any randomly selected subset of the qubits is essential for large-scale quantum networking and computing. Such functionality, akin to classical Random-Access Memory (RAM), has proven difficult to implement due to the absence of a versatile random-access mechanism and limited multimode capacity in existing quantum memory protocols. A potential path to developing the quantum analog to RAM is offered by photon-echo protocols in rare-earth ion-doped materials, such as Revival Of Silenced Echo. These can utilize optical rephasing pulses to selectively read out frequency-multiplexed photonic qubits within an inhomogeneously broadened optical transition. However, the conventional non-adiabatic nature of the rephasing pulses requires intense, short-duration pulses, impeding their fidelity and multimode capacity. To address these critical limitations, we introduce an alternate protocol that employs Rapid Adiabatic Passage (RAP) rephasing pulses to realize a quantum memory and invokes phase imprints to suppress undesirable echoes. Using the optical transitions of a $^{171}{\rm Yb}^{3+}$:${\rm Y}_2{\rm SiO}_5$ crystal, we demonstrate the storage and retrieval of multiple intricate spectro-temporal photonic modes and achieve optical random access memory across eight distinct spectral modes. This protocol yields greatly enhanced mode-mapping versatility while substantially lowering the required rephasing pulse intensity, providing a more efficient and reliable approach for high-fidelity qubit storage and retrieval.
Submitted 13 June, 2025;
originally announced June 2025.
-
Updated line list for the principal isotopologue of carbon monoxide
Authors:
Vladimir G. Ushakov,
Emile S. Medvedev
Abstract:
The line list for the principal isotopologue of CO calculated earlier by the present authors [1, 2] with the irregular dipole-moment function (DMF) is updated using the recent high-precision measurements in the 3-0 [3, 4] (Bielska et al. 2022, Hodges et al. 2025) and 7-0 [5] (Balashov et al. 2023) bands. The new data contradict the experimental data on the 1-0 band [6, 7]. Therefore, we fitted several model DMFs to the modified original data set of Meshkov et al. [8] by including the new above-referenced data and by excluding the data for the 1-0 band. The updated line list is calculated with the irregular DMF. In particular, excellent agreement with recent high-level ab initio calculations on the 3-0 band [3] is emphasized and predictions for the 1-0 and 8-0 bands are outlined. In the new update of the HITRAN database [9], new high-precision measurements in the cold and hot fundamental bands are announced. When these data are published, they will be compared with the predictions of our new line list.
Submitted 13 June, 2025;
originally announced June 2025.
-
Morphology across cosmic time: assessing the evolution and interplay of disk and bulge-dominated galaxies in the CANDELS survey
Authors:
Vitor M. Sampaio,
Igor Kolesnikov,
Reinaldo R. de Carvalho,
Ignacio Ferreras,
Joseph Silk
Abstract:
We investigate the redshift evolution of disk and bulge-dominated galaxies using a mass-complete sample of $\sim$14,000 galaxies from the CANDELS survey, selected with $H_{\rm mag} \leq 24$, $M_{\rm stellar} \geq 10^9\,{\rm M}_\odot$, and spanning $0.2 \leq z \leq 2.4$. Adopting an unbiased morphological classification, free from visual inspection or parametric assumptions, we explore the evolution of specific star formation rate (sSFR), stellar mass, structural properties, and galaxy fractions as a function of redshift and morphology. We find that while disk and bulge-dominated galaxies exhibit similar sSFR distributions at $z \sim 2.4$, bulge-dominated systems develop a redshift-dependent bimodality below $z < 1.6$, unlike the unimodal behaviour of disks. This bimodality correlates with stellar mass: bulge-dominated galaxies with lower sSFR are significantly more massive and exhibit higher Sérsic indices than their star-forming counterparts, despite having similar effective radii. Based on a Gaussian mixture decomposition, we identify two evolutionary tracks for bulge-dominated galaxies: G1, a long-lived, star-forming population with disk-like properties; and G2, a quenched, massive population whose prominence increases with decreasing redshift. The evolution of the star formation main sequence and morphology--mass fractions support a scenario in which G2 systems form through merger-driven transformations of massive disks. Our results indicate that bulge-dominated galaxies are not a homogeneous population, but instead follow divergent evolutionary paths driven by distinct physical mechanisms.
Submitted 13 June, 2025;
originally announced June 2025.
-
Remote sensing of tectonic induced stress across faults using high energy muon beams
Authors:
L. Serafini,
G. Muttoni,
A. Bacci,
F. Broggi,
L. Giuliano,
A. M. Marotta,
V. Petrillo,
E. Puppin,
M. Rossetti Conti,
A. R. Rossi,
S. Samsam,
M. Voltolini,
M. Zucali
Abstract:
We illustrate a theoretical study of a newly conceived technique using high-energy muon beams (TeV-class) propagating through thick (km-long) crystalline rock layers subject to tectonic-induced stress, potentially capable of actively monitoring the temporal evolution of the pressure rise in seismic fault zones associated with earthquake triggering when the induced tectonic pressure reaches and overcomes the rock elasto-plastic deformation limit. This technique could contribute to improving earthquake forecasting statistics in seismically active regions, offering support for seismic hazard assessment and prevention strategies.
Active monitoring of the induced tectonic stress and its time evolution is achieved by remote sensing of the electric field generated in quartz crystals embedded in crystalline rocks by piezoelectric effects. In this context, tectonic pressure refers to the time-dependent stress field acting on the rock body due to tectonic forces, which adds to the time-independent lithostatic pressure resulting from the weight of overlying materials. High-energy muon beams transmitted through a rock layer subject to tectonic pressure will be affected in their transverse phase space distributions by the piezoelectric fields, therefore transferring to a detector the information on the applied tectonic stress.
Finally, we illustrate the design of a proof-of-principle experiment to be conducted in a standard accelerator laboratory, using moderate-energy muons (GeV-class) propagating through granite slabs subject to a press-induced stress reaching the rupture limit. A zero-generation proof-of-principle test can also be performed using 20-150\,MeV electron beams transmitted through single quartz crystals subject to variable pressure.
Submitted 13 June, 2025;
originally announced June 2025.
-
SPLATART: Articulated Gaussian Splatting with Estimated Object Structure
Authors:
Stanley Lewis,
Vishal Chandra,
Tom Gao,
Odest Chadwicke Jenkins
Abstract:
Representing articulated objects remains a difficult problem within the field of robotics. Objects such as pliers, clamps, or cabinets require representations that capture not only geometry and color information, but also part separation, connectivity, and joint parametrization. Furthermore, learning these representations becomes even more difficult with each additional degree of freedom. Complex articulated objects such as robot arms may have seven or more degrees of freedom, and the depth of their kinematic tree may be notably greater than that of the tools, drawers, and cabinets that are the typical subjects of articulated object research. To address these concerns, we introduce SPLATART - a pipeline for learning Gaussian splat representations of articulated objects from posed images, of which a subset contains image space part segmentations. SPLATART disentangles the part separation task from the articulation estimation task, allowing for post-facto determination of joint estimation and representation of articulated objects with deeper kinematic trees than previously exhibited. In this work, we present data on the SPLATART pipeline as applied to the synthetic Paris dataset objects, and qualitative results on a real-world object under sparse segmentation supervision. We additionally present results on articulated serial chain manipulators to demonstrate usage on deeper kinematic tree structures.
Submitted 13 June, 2025;
originally announced June 2025.
-
Generative or Discriminative? Revisiting Text Classification in the Era of Transformers
Authors:
Siva Rajesh Kasa,
Karan Gupta,
Sumegh Roychowdhury,
Ashutosh Kumar,
Yaswanth Biruduraju,
Santhosh Kumar Kasa,
Nikhil Priyatam Pattisapu,
Arindam Bhattacharya,
Shailendra Agarwal,
Vijay huddar
Abstract:
The comparison between discriminative and generative classifiers has intrigued researchers since Efron's seminal analysis of logistic regression versus discriminant analysis. While early theoretical work established that generative classifiers exhibit lower sample complexity but higher asymptotic error in simple linear settings, these trade-offs remain unexplored in the transformer era. We present the first comprehensive evaluation of modern generative and discriminative architectures - Auto-regressive modeling, Masked Language Modeling, Discrete Diffusion, and Encoders for text classification. Our study reveals that the classical 'two regimes' phenomenon manifests distinctly across different architectures and training paradigms. Beyond accuracy, we analyze sample efficiency, calibration, noise robustness, and ordinality across diverse scenarios. Our findings offer practical guidance for selecting the most suitable modeling approach based on real-world constraints such as latency and data limitations.
Submitted 13 June, 2025;
originally announced June 2025.
-
Synthesis and anisotropic magnetism of singlecrystalline GdPt2Si2
Authors:
Gustavo Gomes Vasques,
Mateus Dutra,
Pedro Caetano Sabino,
Juliana Gonçalves Dias,
Julian Andrés Munévar Cagigas,
Adriano Reinaldo Viçoto Benvenho,
Marcos A. Avila
Abstract:
Single crystals of GdPt$_2$Si$_2$ were grown using the Sn flux method, crystallizing in the CaBe$_2$Ge$_2$-type tetragonal structure with space group $P4/nmm$. Electrical resistivity, specific heat, and magnetization data revealed the presence of a double magnetic transition with $T_N \approx 8.4$~K and $T_0 \approx 6.8$~K. Analysis of the specific heat data suggests amplitude-modulated and equal-moment antiferromagnetic orderings, respectively. Field-induced magnetization and magnetic susceptibility data show a metamagnetic transition in the $H \parallel a$ direction at 2~K, as well as the suppression of the magnetic transition located at $T_0$ with increasing external magnetic field. Electron Spin Resonance (ESR) shows the Gd$^{3+}$ resonance followed by a small second resonance. Peak-to-peak linewidth ($ΔH_{pp}$) analysis reveals slight broadening at $T \sim 120$~K, indicating an increase in magnetic fluctuations at high temperatures. Ferromagnetic (FM) local polarization at high temperatures is also observed through the $g$-factor analysis, which shows a notable positive shift ($Δg$). Our results establish the fundamental physical properties of this material to aid in further understanding of the magnetism in the RPt$_2$Si$_2$ series and related non-centrosymmetric systems.
Submitted 13 June, 2025;
originally announced June 2025.
-
Adapting Whisper for Streaming Speech Recognition via Two-Pass Decoding
Authors:
Haoran Zhou,
Xingchen Song,
Brendan Fahy,
Qiaochu Song,
Binbin Zhang,
Zhendong Peng,
Anshul Wadhawan,
Denglin Jiang,
Apurv Verma,
Vinay Ramesh,
Srivas Prasad,
Michele M. Franceschini
Abstract:
OpenAI Whisper is a family of robust Automatic Speech Recognition (ASR) models trained on 680,000 hours of audio. However, its encoder-decoder architecture, trained with a sequence-to-sequence objective, lacks native support for streaming ASR. In this paper, we fine-tune Whisper for streaming ASR using the WeNet toolkit by adopting a Unified Two-pass (U2) structure. We introduce an additional Connectionist Temporal Classification (CTC) decoder trained with causal attention masks to generate streaming partial transcripts, while the original Whisper decoder reranks these partial outputs. Our experiments on LibriSpeech and an earnings call dataset demonstrate that, with adequate fine-tuning data, Whisper can be adapted into a capable streaming ASR model. We also introduce a hybrid tokenizer approach, which uses a smaller token space for the CTC decoder while retaining Whisper's original token space for the attention decoder, resulting in improved data efficiency and generalization.
Submitted 13 June, 2025;
originally announced June 2025.
-
Functional central limit theorem for dependent models with finite memory
Authors:
Víctor Hugo Vázquez Guevara,
Manuel González-Navarrete
Abstract:
We provide complementary results for a family of models with dependence on their previous $k$-sum. Using a martingale-based approach, we establish a functional central limit theorem and analyze the limiting behavior of the center of mass. Additionally, we explore the connection between our findings and the study of certain reinforced random walks in the literature.
Submitted 13 June, 2025;
originally announced June 2025.
-
Hubble's Multi-Year Search for Exospheres in the TRAPPIST-1 System Reveals Frequent Microflares
Authors:
David Berardo,
Julien de Wit,
Michael Gillon,
Ward S. Howard,
Vincent Bourrier,
Matthew W. Cotton,
Florian Quatresooz,
Léonie Hoerner,
Emeline Bolmont,
Artem Burdanov,
Adam J. Burgasser,
Brice-Olivier Demory,
David Enhrenreich,
Susan M. Lederer,
Benjamin V. Rackham,
Sara Seager,
Amaury Triaud
Abstract:
Ly-$α$ observations provide a powerful probe of stellar activity and atmospheric escape in exoplanetary systems. We present here an analysis of 104 HST/STIS orbits monitoring the TRAPPIST-1 system between 2017 and 2022, covering 3--5 transits for each of its seven planets. We rule out transit depths $\gtrsim20\%$, which translates into an upper limit on the escape rate of $1064~EO_H$/Gyr for planet b ($1~EO_H$ is the Earth-ocean-equivalent hydrogen content), in agreement with recent claims that planet b should be airless. These upper limits are $\sim$3 times larger than expected from the photon noise due to a large baseline scatter, which we ultimately link to TRAPPIST-1's intrinsic Ly-$α$ variability from frequent ``microflares.'' While JWST observations of TRAPPIST-1 in the near infrared have shown that $\sim10^{30}$-erg flares occur every $\sim$6 hours, we report here $\sim10^{29}$-erg flares on sub-hour timescales in the HST/STIS and also Very Large Telescope (VLT) $g^{'}$ observations. The FUV and optical amplitudes ($\sim$400$\%$ vs $\sim$3$\%$, respectively) for flares with similar waiting-times indicate flare temperatures of 11000$^{+4200}_{-3100}$~K over 0.011$^{+0.03}_{-0.01}$\% of the stellar disk. Finally, our multi-year baseline reveals a variability with $P = 3.27 \pm 0.04$ days, providing further validation of the previously reported 3.295-day rotation period for TRAPPIST-1. These results highlight the importance of accounting for stellar microvariability when searching for exospheres around active M dwarfs.
Submitted 13 June, 2025;
originally announced June 2025.
-
Luminous, rapidly declining supernovae as stripped transitional objects in low metallicity environments: the case of SN 2022lxg
Authors:
P. Charalampopoulos,
R. Kotak,
J. Sollerman,
C. P. Gutiérrez,
M. Pursiainen,
T. L. Killestein,
S. Schulze,
P. J. Pessi,
K. Maeda,
T. Kangas,
Y. -Z. Cai,
C. Fremling,
K. R. Hinds,
T. Jegou du Laz,
E. Kankare,
M. M. Kasliwal,
H. Kuncarayakti,
P. Lundqvist,
F. J. Masci,
S. Mattila,
D. A. Perley,
A. Reguitti,
T. M. Reynolds,
M. Stritzinger,
L. Tartaglia
, et al. (2 additional authors not shown)
Abstract:
We present an analysis of the optical and near-infrared properties of SN 2022lxg, a bright ($\rm M_{g\, \mathrm{peak}}=-19.41$ mag) and rapidly evolving SN. It was discovered within a day of explosion, and rose to peak brightness in 10 d. Two distinct phases of circumstellar interaction are evident in the data. The first is marked by a steep blue continuum (T $>15,000$ K) with flash-ionisation features due to hydrogen and He II. The second, weaker phase is marked by a change in the colour evolution accompanied by changes in the shapes and velocities of the spectral line profiles. Narrow P-Cygni profiles (~ $150$ km s$^{-1}$) of He I further indicate the presence of slow-moving unshocked material and suggest partial stripping of the progenitor. The fast decline of the light curve from peak (3.48$\pm$ 0.26 mag $\rm (50\,d)^{-1}$ in $g$-band) implies that the ejecta mass must be low. Spectroscopically, until $+35$ d there are similarities to some Type IIb SNe but then there is a transition to spectra that are more reminiscent of an interacting SN II. However, metal lines are largely absent in the spectra, even at epochs of 80 d. Its remote location from the presumed host galaxy, a dwarf with $\rm M_B$ ~ $-14.4$ mag, is consistent with our metallicity estimate - close to the SMC value - obtained from scaling relations. Furthermore, several lines of evidence (including intrinsic polarisation of $p$ ~ (0.5-1.0) %) point to deviations from spherical symmetry. We suggest that a plausible way of uniting the observational clues is to consider a binary system that underwent case C mass transfer. This failed to remove the entire H-envelope of the progenitor before it underwent core-collapse. In this scenario, the progenitor itself would be more compact and perhaps straddle the boundary between blue and yellow supergiants, tying in with the early spectroscopic similarity to Type IIb SNe.
Submitted 13 June, 2025;
originally announced June 2025.
-
Improved Ground State Estimation in Quantum Field Theories via Normalising Flow-Assisted Neural Quantum States
Authors:
Vishal S. Ngairangbam,
Michael Spannowsky,
Timur Sypchenko
Abstract:
We propose a hybrid variational framework that enhances Neural Quantum States (NQS) with a Normalising Flow-based sampler to improve the expressivity and trainability of quantum many-body wavefunctions. Our approach decouples the sampling task from the variational ansatz by learning a continuous flow model that targets a discretised, amplitude-supported subspace of the Hilbert space. This overcomes limitations of Markov Chain Monte Carlo (MCMC) and autoregressive methods, especially in regimes with long-range correlations and volume-law entanglement. Applied to the transverse-field Ising model with both short- and long-range interactions, our method achieves ground state energy errors comparable to those of state-of-the-art matrix product states and lower energies than autoregressive NQS. For systems up to 50 spins, we demonstrate high accuracy and robust convergence across a wide range of coupling strengths, including regimes where competing methods fail. Our results showcase the utility of flow-assisted sampling as a scalable tool for quantum simulation and offer a new approach toward learning expressive quantum states in high-dimensional Hilbert spaces.
Submitted 13 June, 2025;
originally announced June 2025.
-
pop-cosmos: Insights from generative modeling of a deep, infrared-selected galaxy population
Authors:
Stephen Thorp,
Hiranya V. Peiris,
Gurjeet Jagwani,
Sinan Deger,
Justin Alsing,
Boris Leistedt,
Daniel J. Mortlock,
Anik Halder,
Joel Leja
Abstract:
We present an extension of the pop-cosmos model for the evolving galaxy population up to redshift $z\sim6$. The model is trained on distributions of observed colors and magnitudes, from 26-band photometry of $\sim420,000$ galaxies in the COSMOS2020 catalog with Spitzer IRAC $\textit{Ch. 1}<26$. The generative model includes a flexible distribution over 16 stellar population synthesis (SPS) parameters, and a depth-dependent photometric uncertainty model, both represented using score-based diffusion models. We use the trained model to predict scaling relationships for the galaxy population, such as the stellar mass function, star-forming main sequence, and gas-phase and stellar metallicity vs. mass relations, demonstrating reasonable-to-excellent agreement with previously published results. We explore the connection between mid-infrared emission from active galactic nuclei (AGN) and star-formation rate, finding high AGN activity for galaxies above the star-forming main sequence at $1\lesssim z\lesssim 2$. Using the trained population model as a prior distribution, we perform inference of the redshifts and SPS parameters for 429,669 COSMOS2020 galaxies, including 39,588 with publicly available spectroscopic redshifts. The resulting redshift estimates exhibit minimal bias ($\text{median}[Δ_z]=-8\times10^{-4}$), scatter ($σ_\text{MAD}=0.0132$), and outlier fraction ($6.19\%$) for the full $0<z<6$ spectroscopic compilation. These results establish that pop-cosmos can achieve the accuracy and realism needed to forward-model modern wide--deep surveys for Stage IV cosmology. We publicly release pop-cosmos software, mock galaxy catalogs, and COSMOS2020 redshift and SPS parameter posteriors.
Submitted 13 June, 2025;
originally announced June 2025.
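The redshift summary statistics quoted in the abstract above (median bias, $σ_\text{MAD}$, outlier fraction) can be illustrated with a minimal sketch. The definitions used here are assumptions based on common photometric-redshift conventions, not confirmed from the paper: $Δ_z = (z_\text{est} - z_\text{spec})/(1+z_\text{spec})$, $σ_\text{MAD} = 1.4826 \times \text{median}(|Δ_z - \text{median}(Δ_z)|)$, and outliers defined by $|Δ_z| > 0.15$. The function name `redshift_metrics` and its parameters are hypothetical.

```python
from statistics import median


def redshift_metrics(z_est, z_spec, outlier_threshold=0.15):
    """Summarise redshift accuracy (assumed conventions, see lead-in).

    Returns (bias, sigma_mad, outlier_fraction).
    """
    if len(z_est) != len(z_spec) or not z_est:
        raise ValueError("need two non-empty sequences of equal length")

    # Normalised residual for each galaxy (assumed definition of Delta_z).
    dz = [(ze - zs) / (1.0 + zs) for ze, zs in zip(z_est, z_spec)]

    # Bias: median of the normalised residuals.
    bias = median(dz)

    # Scatter: median absolute deviation, scaled so that it matches the
    # standard deviation for Gaussian-distributed residuals.
    sigma_mad = 1.4826 * median([abs(d - bias) for d in dz])

    # Outlier fraction: share of galaxies beyond the (assumed) threshold.
    outlier_fraction = sum(1 for d in dz if abs(d) > outlier_threshold) / len(dz)

    return bias, sigma_mad, outlier_fraction
```

The 0.15 threshold is only a default; a catalog comparison would use whatever cut the survey team specifies.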
-
Enhancing Privacy: The Utility of Stand-Alone Synthetic CT and MRI for Tumor and Bone Segmentation
Authors:
André Ferreira,
Kunpeng Xie,
Caroline Wilpert,
Gustavo Correia,
Felix Barajas Ordonez,
Tiago Gil Oliveira,
Maike Bode,
Robert Siepmann,
Frank Hölzle,
Rainer Röhrig,
Jens Kleesiek,
Daniel Truhn,
Jan Egger,
Victor Alves,
Behrus Puladi
Abstract:
AI requires extensive datasets, while medical data is subject to high data protection. Anonymization is essential, but poses a challenge for some regions, such as the head, as identifying structures overlap with regions of clinical interest. Synthetic data offers a potential solution, but studies often lack rigorous evaluation of realism and utility. Therefore, we investigate to what extent synthetic data can replace real data in segmentation tasks. We employed head and neck cancer CT scans and brain glioma MRI scans from two large datasets. Synthetic data were generated using generative adversarial networks and diffusion models. We evaluated the quality of the synthetic data using MAE, MS-SSIM, Radiomics and a Visual Turing Test (VTT) performed by 5 radiologists, and its usefulness in segmentation tasks using DSC. Radiomics indicates high fidelity of synthetic MRIs, but the synthetic CTs fall short of highly realistic tissue, with correlation coefficients of 0.8784 and 0.5461 for MRI and CT tumors, respectively. DSC results indicate limited utility of synthetic data: tumor segmentation achieved DSC=0.064 on CT and 0.834 on MRI, while bone segmentation achieved a mean DSC=0.841. A relation between DSC and correlation is observed, but is limited by the complexity of the task. VTT results show synthetic CTs' utility, but with limited educational applications. Synthetic data can be used independently for the segmentation task, although limited by the complexity of the structures to segment. Advancing generative models to better tolerate heterogeneous inputs and learn subtle details is essential for enhancing their realism and expanding their application potential.
Submitted 13 June, 2025;
originally announced June 2025.
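For readers unfamiliar with the DSC values reported above, the Dice similarity coefficient between a predicted and a reference mask is $2|A \cap B| / (|A| + |B|)$. The following is a minimal sketch on flattened binary masks, not the authors' evaluation code; the function name `dice_coefficient` and the convention of returning 1.0 when both masks are empty are assumptions made for illustration.

```python
def dice_coefficient(pred, target):
    """Dice similarity coefficient for two flat binary masks (0/1 sequences)."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of voxels")

    # Voxels labelled foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)

    # Total foreground voxels across the two masks.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)

    # Assumed convention: two empty masks count as perfect agreement.
    if total == 0:
        return 1.0

    return 2.0 * intersection / total
```

A value near 0 (as reported for CT tumors) means almost no overlap between prediction and reference, while a value above 0.8 (MRI tumors, bone) indicates substantial overlap.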
-
The Amazon Nova Family of Models: Technical Report and Model Card
Authors:
Amazon AGI,
Aaron Langford,
Aayush Shah,
Abhanshu Gupta,
Abhimanyu Bhatter,
Abhinav Goyal,
Abhinav Mathur,
Abhinav Mohanty,
Abhishek Kumar,
Abhishek Sethi,
Abi Komma,
Abner Pena,
Achin Jain,
Adam Kunysz,
Adam Opyrchal,
Adarsh Singh,
Aditya Rawal,
Adok Achar Budihal Prasad,
Adrià de Gispert,
Agnika Kumar,
Aishwarya Aryamane,
Ajay Nair,
Akilan M,
Akshaya Iyengar,
Akshaya Vishnu Kudlu Shanbhogue
, et al. (761 additional authors not shown)
Abstract:
We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents and text. Amazon Nova Micro is a text-only model that delivers our lowest-latency responses at very low cost. Amazon Nova Canvas is an image generation model that creates professional grade images with rich customization controls. Amazon Nova Reel is a video generation model offering high-quality outputs, customization, and motion control. Our models were built responsibly and with a commitment to customer trust, security, and reliability. We report benchmarking results for core capabilities, agentic performance, long context, functional adaptation, runtime performance, and human evaluation.
Submitted 17 March, 2025;
originally announced June 2025.
-
UCD: Unlearning in LLMs via Contrastive Decoding
Authors:
Vinith M. Suriyakumar,
Ayush Sekhari,
Ashia Wilson
Abstract:
Machine unlearning aims to remove specific information, e.g. sensitive or undesirable content, from large language models (LLMs) while preserving overall performance. We propose an inference-time unlearning algorithm that uses contrastive decoding, leveraging two auxiliary smaller models, one trained without the forget set and one trained with it, to guide the outputs of the original model using their difference during inference. Our strategy substantially improves the tradeoff between unlearning effectiveness and model utility. We evaluate our approach on two unlearning benchmarks, TOFU and MUSE. Results show notable gains in both forget quality and retained performance in comparison to prior approaches, suggesting that incorporating contrastive decoding can offer an efficient, practical avenue for unlearning concepts in large-scale models.
Submitted 12 June, 2025;
originally announced June 2025.
-
Continuously Updating Digital Twins using Large Language Models
Authors:
Harry Amad,
Nicolás Astorga,
Mihaela van der Schaar
Abstract:
Digital twins are models of real-world systems that can simulate their dynamics in response to potential actions. In complex settings, the state and action variables, and available data and knowledge relevant to a system can constantly change, requiring digital twins to continuously update with these changes to remain relevant. Current approaches struggle in this regard, as they require fixed, well-defined modelling environments, and they cannot adapt to novel variables without re-designs, or incorporate new information without re-training. To address this, we frame digital twinning as an in-context learning problem using large language models, enabling seamless updates to the twin at inference time. We develop CALM-DT, a Context-Adaptive Language Model-based Digital Twin that can accurately simulate across diverse state-action spaces using in-context learning alone by utilising fine-tuned encoders for sample retrieval. We empirically demonstrate CALM-DT's competitive performance with existing digital twin approaches, and its unique ability to adapt to changes in its modelling environment without parameter updates.
Submitted 11 June, 2025;
originally announced June 2025.
-
Latency Optimization for Wireless Federated Learning in Multihop Networks
Authors:
Shaba Shaon,
Van-Dinh Nguyen,
Dinh C. Nguyen
Abstract:
In this paper, we study a novel latency minimization problem in wireless federated learning (FL) across multi-hop networks. The system comprises multiple routes, each integrating leaf and relay nodes for FL model training. We explore a personalized learning and adaptive aggregation-aware FL (PAFL) framework that effectively addresses data heterogeneity across participating nodes by harmonizing individual and collective learning objectives. We formulate an optimization problem aimed at minimizing system latency through the joint optimization of leaf and relay nodes, as well as the relay routing indicator. We also incorporate an additional energy harvesting scheme for the relay nodes to help with their relay tasks. This formulation presents a computationally demanding challenge, and thus we develop a simple yet efficient algorithm based on block coordinate descent and successive convex approximation (SCA) techniques. Simulation results illustrate the efficacy of our proposed joint optimization approach for leaf and relay nodes with the relay routing indicator. We observe significant latency savings in the wireless multi-hop PAFL system, with reductions of up to 69.37% compared to schemes optimizing only one node type, a traditional greedy algorithm, and a scheme without the relay routing indicator.
Submitted 8 June, 2025;
originally announced June 2025.
-
Rényi-Induced Information Geometry and Hartigan's Prior Family
Authors:
Rebecca Maria Kuntz,
Heinrich von Campe,
Björn Malte Schäfer
Abstract:
We derive the information geometry induced by the statistical Rényi divergence, namely its metric tensor, its dual parametrized connections, as well as its dual Laplacians. Based on these results, we demonstrate that the Rényi-geometry, though closely related, differs in structure from Amari's well-known $α$-geometry. Subsequently, we derive the canonical uniform prior distributions for a statistical manifold endowed with a Rényi-geometry, namely the dual Rényi-covolumes. We find that the Rényi-priors can be made to coincide with Takeuchi and Amari's $α$-priors by a reparameterization, which is itself of particular significance in statistics. Herewith, we demonstrate that Hartigan's parametrized ($α_H$) family of priors is precisely the parametrized ($ρ$) family of Rényi-priors ($α_H = ρ$).
Submitted 22 May, 2025;
originally announced June 2025.
-
Sign-Rank of $k$-Hamming Distance is Constant
Authors:
Mika Göös,
Nathaniel Harms,
Valentin Imbach,
Dmitry Sokolov
Abstract:
We prove that the sign-rank of the $k$-Hamming Distance matrix on $n$ bits is $2^{O(k)}$, independent of the number of bits $n$. This strongly refutes the conjecture of Hatami, Hatami, Pires, Tao, and Zhao (RANDOM 2022), and Hatami, Hosseini, and Meng (STOC 2023), repeated in several other papers, that the sign-rank should depend on $n$. This conjecture would have qualitatively separated margin from sign-rank (or, equivalently, bounded-error from unbounded-error randomized communication). In fact, our technique gives constant sign-rank upper bounds for all matrices which reduce to $k$-Hamming Distance, as well as large-margin matrices recently shown to be irreducible to $k$-Hamming Distance.
Submitted 1 May, 2025;
originally announced June 2025.
-
The Limits of Tractable Marginalization
Authors:
Oliver Broadrick,
Sanyam Agarwal,
Guy Van den Broeck,
Markus Bläser
Abstract:
Marginalization -- summing a function over all assignments to a subset of its inputs -- is a fundamental computational problem with applications from probabilistic inference to formal verification. Despite its computational hardness in general, there exist many classes of functions (e.g., probabilistic models) for which marginalization remains tractable, and they can be commonly expressed by polynomial size arithmetic circuits computing multilinear polynomials. This raises the question, can all functions with polynomial time marginalization algorithms be succinctly expressed by such circuits? We give a negative answer, exhibiting simple functions with tractable marginalization yet no efficient representation by known models, assuming $\textsf{FP}\neq\#\textsf{P}$ (an assumption implied by $\textsf{P} \neq \textsf{NP}$). To this end, we identify a hierarchy of complexity classes corresponding to stronger forms of marginalization, all of which are efficiently computable on the known circuit models. We conclude with a completeness result, showing that whenever there is an efficient real RAM performing virtual evidence marginalization for a function, then there are small circuits for that function's multilinear representation.
Submitted 17 April, 2025;
originally announced June 2025.
-
crossMoDA Challenge: Evolution of Cross-Modality Domain Adaptation Techniques for Vestibular Schwannoma and Cochlea Segmentation from 2021 to 2023
Authors:
Navodini Wijethilake,
Reuben Dorent,
Marina Ivory,
Aaron Kujawa,
Stefan Cornelissen,
Patrick Langenhuizen,
Mohamed Okasha,
Anna Oviedova,
Hexin Dong,
Bogyeong Kang,
Guillaume Sallé,
Luyi Han,
Ziyuan Zhao,
Han Liu,
Tao Yang,
Shahad Hardan,
Hussain Alasmawi,
Santosh Sanjeev,
Yuzhou Zhuang,
Satoshi Kondo,
Maria Baldeon Calisto,
Shaikh Muhammad Uzair Noman,
Cancan Chen,
Ipek Oguz,
Rongguo Zhang
, et al. (14 additional authors not shown)
Abstract:
The cross-Modality Domain Adaptation (crossMoDA) challenge series, initiated in 2021 in conjunction with the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), focuses on unsupervised cross-modality segmentation, learning from contrast-enhanced T1 (ceT1) and transferring to T2 MRI. The task is an extreme example of domain shift chosen to serve as a meaningful and illustrative benchmark. From a clinical application perspective, it aims to automate Vestibular Schwannoma (VS) and cochlea segmentation on T2 scans for more cost-effective VS management. Over time, the challenge objectives have evolved to enhance its clinical relevance. The challenge evolved from using single-institutional data and basic segmentation in 2021 to incorporating multi-institutional data and Koos grading in 2022, and by 2023, it included heterogeneous routine data and sub-segmentation of intra- and extra-meatal tumour components. In this work, we report the findings of the 2022 and 2023 editions and perform a retrospective analysis of the challenge progression over the years. The observations from the successive challenge contributions indicate that the number of outliers decreases with an expanding dataset. This is notable since the diversity of scanning protocols of the datasets concurrently increased. The winning approach of the 2023 edition reduced the number of outliers on the 2021 and 2022 testing data, demonstrating how increased data heterogeneity can enhance segmentation performance even on homogeneous data. However, the cochlea Dice score declined in 2023, likely due to the added complexity from tumour sub-annotations affecting overall segmentation performance. While progress is still needed for clinically acceptable VS segmentation, the plateauing performance suggests that a more challenging cross-modal task may better serve future benchmarking.
Submitted 13 June, 2025;
originally announced June 2025.
-
Spectral Estimation with Free Decompression
Authors:
Siavash Ameli,
Chris van der Heide,
Liam Hodgkinson,
Michael W. Mahoney
Abstract:
Computing eigenvalues of very large matrices is a critical task in many machine learning applications, including the evaluation of log-determinants, the trace of matrix functions, and other important metrics. As datasets continue to grow in scale, the corresponding covariance and kernel matrices become increasingly large, often reaching magnitudes that make their direct formation impractical or impossible. Existing techniques typically rely on matrix-vector products, which can provide efficient approximations, if the matrix spectrum behaves well. However, in settings like distributed learning, or when the matrix is defined only indirectly, access to the full data set can be restricted to only very small sub-matrices of the original matrix. In these cases, the matrix of nominal interest is not even available as an implicit operator, meaning that even matrix-vector products may not be available. In such settings, the matrix is "impalpable," in the sense that we have access to only masked snapshots of it. We draw on principles from free probability theory to introduce a novel method of "free decompression" to estimate the spectrum of such matrices. Our method can be used to extrapolate from the empirical spectral densities of small submatrices to infer the eigenspectrum of extremely large (impalpable) matrices (that we cannot form or even evaluate with full matrix-vector products). We demonstrate the effectiveness of this approach through a series of examples, comparing its performance against known limiting distributions from random matrix theory in synthetic settings, as well as applying it to submatrices of real-world datasets, matching them with their full empirical eigenspectra.
Submitted 13 June, 2025;
originally announced June 2025.
-
Sturmian basis set for the Dirac equation with finite nuclear size: Application to polarizability, Zeeman and hyperfine splitting, and vacuum polarization
Authors:
V. K. Ivanov,
D. A. Glazov,
A. V. Volotka
Abstract:
We investigate the application of the Sturmian basis set in relativistic atomic structure calculations. We propose a simple implementation of this approach and demonstrate its ability to provide various quantities for hydrogen-like ions, including binding energies, static dipole polarizability, $g$ factor, hyperfine splitting, and nuclear magnetic shielding. Finally, we calculate the all-order (Wichmann-Kroll) vacuum polarization charge density, which was a challenge for the finite-basis-set approach until recently. Comparison of the obtained results with the previously published numerical and analytical calculations is presented. All calculations are performed with the finite size of the nucleus and can in principle be extended to arbitrary binding potentials.
Submitted 13 June, 2025;
originally announced June 2025.
-
A triple torsion linking form and 3-manifolds in $S^4$
Authors:
Michael Freedman,
Vyacheslav Krushkal
Abstract:
Given a rational homology 3-sphere $M$, we introduce a triple linking form on $H_1(M; \mathbb{Z})$, defined when the classical torsion linking pairing of three homology classes vanishes pairwise. If $M$ is the boundary of a simply-connected 4-manifold $N$, the triple linking form can be computed in terms of the higher order intersection form on $N$, introduced by Matsumoto. We use these methods to formulate an embedding obstruction for rational homology spheres in $S^4$, extending a 1938 theorem of Hantzsche.
Submitted 13 June, 2025;
originally announced June 2025.
-
Towards 4D modelisation of thermal-field emission from semiconductors
Authors:
Salvador Barranco Cárceles,
Aquila Mavalankar,
Veronika Zadin,
Ian Underwood,
Andreas Kyritsakis
Abstract:
The theoretical picture of thermal-field emission (TFE) from semiconductors has been limited to 1D and 2D models. This can be attributed to the complex and interdependent phenomena involved in TFE from semiconductors, which make the calculations cumbersome. Such limitations result in a partial understanding of the underlying physics of semiconducting surfaces under high electrical fields, which requires the addition of the temporal dimension (4D) to yield a realistic model. Here we develop a 3D model of TFE from semiconductors that can take arbitrary geometries and doping levels. Our model successfully reproduces the characteristic saturation plateau of some semiconductors, as well as its dependence on temperature. The model is found to be in good agreement with experimental data from n-type germanium at a qualitative level. We propose this model as a platform for future extensions into the full 4D framework, incorporating temporal dynamics for a more complete and predictive description of thermal-field emission from semiconductors.
Submitted 13 June, 2025;
originally announced June 2025.
-
mimic-one: a Scalable Model Recipe for General Purpose Robot Dexterity
Authors:
Elvis Nava,
Victoriano Montesinos,
Erik Bauer,
Benedek Forrai,
Jonas Pai,
Stefan Weirich,
Stephan-Daniel Gravert,
Philipp Wand,
Stephan Polinski,
Benjamin F. Grewe,
Robert K. Katzschmann
Abstract:
We present a diffusion-based model recipe for real-world control of a highly dexterous humanoid robotic hand, designed for sample-efficient learning and smooth fine-motor action inference. Our system features a newly designed 16-DoF tendon-driven hand, equipped with wide angle wrist cameras and mounted on a Franka Emika Panda arm. We develop a versatile teleoperation pipeline and data collection protocol using both glove-based and VR interfaces, enabling high-quality data collection across diverse tasks such as pick and place, item sorting and assembly insertion. Leveraging high-frequency generative control, we train end-to-end policies from raw sensory inputs, enabling smooth, self-correcting motions in complex manipulation scenarios. Real-world evaluations demonstrate up to 93.3% out of distribution success rates, with up to a +33.3% performance boost due to emergent self-correcting behaviors, while also revealing scaling trends in policy performance. Our results advance the state-of-the-art in dexterous robotic manipulation through a fully integrated, practical approach to hardware, learning, and real-world deployment.
Submitted 13 June, 2025;
originally announced June 2025.
-
A Hyperactive FRB Pinpointed in an SMC-Like Satellite Host Galaxy
Authors:
M. Bhardwaj,
M. P. Snelders,
J. W. T. Hessels,
A. Gil de Paz,
S. Bhandari,
B. Marcote,
A. Kirichenko,
O. S. Ould-Boukattine,
F. Kirsten,
E. K. Bempong-Manful,
V. Bezrukovs,
J. D. Bray,
S. Buttaccio,
A. Corongiu,
R. Feiler,
M. P. Gawronski,
M. Giroletti,
D. M. Hewitt,
M. Lindqvist,
G. Maccaferri,
A. Moroianu,
K. Nimmo,
Z. Paragi,
W. Puchalska,
N. Wang
, et al. (2 additional authors not shown)
Abstract:
Precise localizations of fast radio bursts (FRBs) are essential for uncovering their host galaxies and immediate environments. We present the milliarcsecond-precision European VLBI Network (EVN) localization of FRB 20240114A, a hyper-active repeating FRB, achieving <90x30 mas (1-sigma) accuracy. This precision places the burst 0.5 kpc from the nucleus of its low-metallicity star-forming dwarf host at a spectroscopic redshift of z = 0.1300. Our Gran Telescopio CANARIAS (GTC) spectroscopic follow-up reveals that the dwarf FRB host is gravitationally bound to a more massive, star-forming spiral galaxy. This establishes the first known instance of an FRB residing in a satellite galaxy within a larger galactic system. This configuration, analogous to the Small Magellanic Cloud orbiting the Milky Way (but at a lower overall mass scale), expands the known diversity of FRB host environments and offers important insights for interpreting seemingly "hostless" or highly offset FRBs. Furthermore, our detailed dispersion measure (DM) budget analysis indicates that the dominant contribution to FRB 20240114A's DM likely originates from the foreground halo of the central galaxy. This finding addresses the anomalously high DM observed for this FRB and underscores the significant role of intervening foreground structures in shaping observed FRB DMs, which is important for accurate FRB-based cosmological measurements. Our results highlight the importance of deep, high-resolution optical/infrared observations (e.g., with Hubble or James Webb Space Telescopes) to fully leverage our precise radio localization and probe the immediate astrophysical birthplaces of FRB progenitors within these complex galactic systems.
Submitted 13 June, 2025;
originally announced June 2025.
-
DMRS-Based Uplink Channel Estimation for MU-MIMO Systems with Location-Specific SCSI Acquisition
Authors:
Jiawei Zhuang,
Hongwei Hou,
Minjie Tang,
Wenjin Wang,
Shi Jin,
Vincent K. N. Lau
Abstract:
With the growing number of users in multi-user multiple-input multiple-output (MU-MIMO) systems, demodulation reference signals (DMRSs) are efficiently multiplexed in the code domain via orthogonal cover codes (OCC) to ensure orthogonality and minimize pilot interference. In this paper, we investigate uplink DMRS-based channel estimation for MU-MIMO systems with Type II OCC pattern standardized in 3GPP Release 18, leveraging location-specific statistical channel state information (SCSI) to enhance performance. Specifically, we propose a SCSI-assisted Bayesian channel estimator (SA-BCE) based on the minimum mean square error criterion to suppress the pilot interference and noise, albeit at the cost of cubic computational complexity due to matrix inversions. To reduce this complexity while maintaining performance, we extend the scheme to a windowed version (SA-WBCE), which incorporates antenna-frequency domain windowing and beam-delay domain processing to exploit asymptotic sparsity and mitigate energy leakage in practical systems. To avoid the frequent real-time SCSI acquisition, we construct a grid-based location-specific SCSI database based on the principle of spatial consistency, and subsequently leverage the uplink received signals within each grid to extract the SCSI. Facilitated by the multilinear structure of wireless channels, we formulate the SCSI acquisition problem within each grid as a tensor decomposition problem, where the factor matrices are parameterized by the multi-path powers, delays, and angles. The computational complexity of SCSI acquisition can be significantly reduced by exploiting the Vandermonde structure of the factor matrices. Simulation results demonstrate that the proposed location-specific SCSI database construction method achieves high accuracy, while the SA-BCE and SA-WBCE significantly outperform state-of-the-art benchmarks in MU-MIMO systems.
Submitted 13 June, 2025;
originally announced June 2025.
-
Measurement-aligned Flow for Inverse Problem
Authors:
Shaorong Zhang,
Rob Brekelmans,
Yunshu Wu,
Greg Ver Steeg
Abstract:
Diffusion models provide a powerful way to incorporate complex prior information for solving inverse problems. However, existing methods struggle to correctly incorporate guidance from conflicting signals in the prior and measurement, especially in the challenging setting of non-Gaussian or unknown noise. To bridge these gaps, we propose Measurement-Aligned Sampling (MAS), a novel framework for linear inverse problem solving that can more flexibly balance prior and measurement information. MAS unifies and extends existing approaches like DDNM and DAPS, and offers a new optimization perspective. MAS can generalize to handle known Gaussian noise as well as unknown or non-Gaussian noise types. Extensive experiments show that MAS consistently outperforms state-of-the-art methods across a range of tasks.
Submitted 13 June, 2025;
originally announced June 2025.
-
Towards a Cascaded LLM Framework for Cost-effective Human-AI Decision-Making
Authors:
Claudio Fanconi,
Mihaela van der Schaar
Abstract:
Effective human-AI decision-making balances three key factors: the \textit{correctness} of predictions, the \textit{cost} of knowledge and reasoning complexity, and the confidence about whether to \textit{abstain} from automated answers or involve human experts. In this work, we present a cascaded LLM decision framework that adaptively delegates tasks across multiple tiers of expertise -- a base model for initial candidate answers, a more capable and knowledgeable (but costlier) large model, and a human expert for when the model cascade abstains. Our method proceeds in two stages. First, a deferral policy determines whether to accept the base model's answer or regenerate it with the large model based on the confidence score. Second, an abstention policy decides whether the cascade model response is sufficiently certain or requires human intervention. Moreover, we incorporate an online learning mechanism in the framework that can leverage human feedback to improve decision quality over time. We demonstrate this approach on general question-answering (ARC-Easy and ARC-Challenge) and medical question-answering (MedQA and MedMCQA). Our results show that our cascaded strategy outperforms single-model baselines in accuracy in most cases while reducing cost and providing a principled way to handle abstentions.
Submitted 16 June, 2025; v1 submitted 13 June, 2025;
originally announced June 2025.
-
A relation between k-symplectic and k-contact Hamiltonian systems
Authors:
S. Vilariño
Abstract:
Systems of partial differential equations which appear in classical field theories can be studied geometrically using different geometrical structures, for example, k-symplectic geometry, k-cosymplectic geometry, multisymplectic geometry, etc. In recent years, there has been a notable increase in the study of k-contact Hamiltonian systems. These are based on the description of the dynamics of field theories using the so-called k-contact manifolds. Such structures are generalizations of contact structures and k-symplectic structures. The relation between k-symplectic manifolds and k-contact manifolds was previously established. In light of the above relation, this work seeks to explore the relationship between k-symplectic Hamiltonian systems and k-contact Hamiltonian systems.
Submitted 13 June, 2025;
originally announced June 2025.
-
Parenthood Penalty in Russia: Evidence from Exogenous Variation in Family Size
Authors:
Vadim Ustyuzhanin
Abstract:
The present study aimed to improve upon the existing correlational literature on the parenthood penalty in Russia. An instrumental variables approach based on sibling sex composition and multiple births was employed alongside difference-in-differences designs to analyze rich census and longitudinal datasets. To the best of the authors' knowledge, this is the first study to provide causal estimates of the effect of fertility decisions on subsequent labor market outcomes for mothers and fathers in contemporary Russia. The study's primary finding is that, in contrast to the approximately 10 percent long-term motherhood penalty observed in developed countries, the causal impact of childbearing on women's employment in Russia is most significant in the first year after birth, reducing employment by around 15 percent. This penalty then rapidly declines to a modest 3 percent once children reach school age. The analysis indicates an absence of a systematic fatherhood penalty in terms of employment, although a modest increase in labor supply is observed.
Submitted 13 June, 2025;
originally announced June 2025.
-
Exact requirements for battery-assisted qubit gates
Authors:
Riccardo Castellano,
Vasco Cavina,
Martí Perarnau-Llobet,
Pavel Sekatski,
Vittorio Giovannetti
Abstract:
We consider the implementation of a unitary gate on a qubit system S via a global energy-preserving operation acting on S and an auxiliary system B that can be seen as a battery. We derive a simple, asymptotically exact expression for the implementation error as a function of the battery state, which we refer to as the Unitary Defect. Remarkably, this quantity is independent of the specific gate being implemented, highlighting a universal property of the battery itself. We show that minimizing the unitary defect, under given physical constraints on the battery state, is mathematically equivalent to solving a Lagrangian optimization problem, often corresponding to finding the ground state of a one-dimensional quantum system. Using this mapping, we identify optimal battery states that achieve the highest precision under constraints on energy, squared energy, and Quantum Fisher Information. Overall, our results provide an efficient method for establishing bounds on the physical requirements needed to implement a unitary gate via energy-preserving operations and for determining the corresponding optimal protocols.
Submitted 13 June, 2025;
originally announced June 2025.
-
Infinitely many solutions for nonlinear superposition operators of mixed fractional order involving critical exponent
Authors:
Souvik Bhowmick,
Sekhar Ghosh,
Vishvesh Kumar
Abstract:
This paper addresses elliptic problems involving the superposition of nonlinear fractional operators with the critical Sobolev exponent in sublinear regimes. We establish the existence of infinitely many nontrivial weak solutions using a variational framework that combines a truncation argument with the notion of genus. A central part of our analysis is the verification of the Palais--Smale $(\mathrm{PS})_c$ condition for every $q \in (1, p_{s_\sharp}^*)$, despite the challenges posed by the lack of compactness due to the critical exponent.
On the one hand, our approach extends and generalizes earlier results by García Azorero and Peral Alonso [Trans. Amer. Math. Soc., 1991] and by Da Silva, Fiscella, and Viloria [J. Differential Equations, 2024]; on the other hand, the results obtained here complement the study of the Brezis--Nirenberg-type problem by Dipierro, Perera, Sportelli, and Valdinoci [Commun. Contemp. Math., 2024] for $q = p$ and by Aikyn, Ghosh, Kumar, and Ruzhansky [arXiv:2504.05105, 2025] for $p < q < p_{s_\sharp}^*$. Notably, our results are new even in the classical case $p = 2$, highlighting the broader applicability of the methods developed here.
Submitted 13 June, 2025;
originally announced June 2025.
-
The Central Emission Vertex of two gravitons
Authors:
Damiano Barcaro,
Vittorio Del Duca
Abstract:
It has recently been shown that there exists an $s$-channel sequence of classical corrections to the $H$ diagram computed long ago by Amati, Ciafaloni and Veneziano. At leading logarithmic accuracy, those corrections feature the gravity BFKL kernel as a crucial element, and may be computed through either rapidity renormalisation group equations or amplitudes built through $s$-channel unitarity cuts. In this paper, we evaluate six-graviton amplitudes in next-to-multi-Regge kinematics, and compute for the first time the Central Emission Vertex for the emission of two gravitons, which is relevant to evaluate the corrections to the gravity BFKL kernel, and thus to go beyond the leading logarithmic accuracy.
Submitted 13 June, 2025;
originally announced June 2025.
-
Optimal Execution under Liquidity Uncertainty
Authors:
Etienne Chevalier,
Yadh Hafsi,
Vathana Ly Vath,
Sergio Pulido
Abstract:
We study an optimal execution strategy for purchasing a large block of shares over a fixed time horizon. The execution problem is subject to a general price impact that gradually dissipates due to market resilience. This resilience is modeled through a potentially arbitrary limit-order book shape. To account for liquidity dynamics, we introduce a stochastic volume effect governing the recovery of the deviation process, which represents the difference between the impacted and unaffected price. Additionally, we incorporate stochastic liquidity variations through a regime-switching Markov chain to capture abrupt shifts in market conditions. We study this singular control problem, where the trader optimally determines the timing and rate of purchases to minimize execution costs. The associated value function to this optimization problem is shown to satisfy a system of variational Hamilton-Jacobi-Bellman inequalities. Moreover, we establish that it is the unique viscosity solution to this HJB system and study the analytical properties of the free boundary separating the execution and continuation regions. To illustrate our results, we present numerical examples under different limit-order book configurations, highlighting the interplay between price impact, resilience dynamics, and stochastic liquidity regimes in shaping the optimal execution strategy.
Submitted 13 June, 2025;
originally announced June 2025.
-
On the Performance of LLMs for Real Estate Appraisal
Authors:
Margot Geerts,
Manon Reusens,
Bart Baesens,
Seppe vanden Broucke,
Jochen De Weerdt
Abstract:
The real estate market is vital to global economies but suffers from significant information asymmetry. This study examines how Large Language Models (LLMs) can democratize access to real estate insights by generating competitive and interpretable house price estimates through optimized In-Context Learning (ICL) strategies. We systematically evaluate leading LLMs on diverse international housing datasets, comparing zero-shot, few-shot, market report-enhanced, and hybrid prompting techniques. Our results show that LLMs effectively leverage hedonic variables, such as property size and amenities, to produce meaningful estimates. While traditional machine learning models remain strong for pure predictive accuracy, LLMs offer a more accessible, interactive, and interpretable alternative. Although self-explanations require cautious interpretation, we find that LLMs explain their predictions in agreement with state-of-the-art models, confirming their trustworthiness. Carefully selected in-context examples based on feature similarity and geographic proximity significantly enhance LLM performance, yet LLMs struggle with overconfidence in price intervals and limited spatial reasoning. We offer practical guidance for structured prediction tasks through prompt optimization. Our findings highlight LLMs' potential to improve transparency in real estate appraisal and provide actionable insights for stakeholders.
Submitted 13 June, 2025;
originally announced June 2025.
-
A retrospective on DISPEED -- Leveraging heterogeneity in a drone swarm for IDS execution
Authors:
Vincent Lannurien,
Camélia Slimani,
Louis Morge-Rollet,
Laurent Lemarchand,
David Espes,
Frédéric Le Roy,
Jalil Boukhobza
Abstract:
Swarms of drones are gaining more and more autonomy and efficiency during their missions. However, security threats can disrupt their missions' progression. To overcome this problem, Network Intrusion Detection Systems ((N)IDS) are promising solutions to detect malicious behavior on network traffic. However, modern NIDS rely on resource-hungry machine learning techniques that can be difficult to deploy on a swarm of drones. The goal of the DISPEED project is to leverage the heterogeneity (execution platforms, memory) of the drones composing a swarm to deploy NIDS. It is divided into two phases: (1) a characterization phase that consists of characterizing various IDS implementations on diverse embedded platforms, and (2) an IDS implementation mapping phase that seeks to develop selection strategies to choose the most relevant NIDS depending on the context. On the one hand, the characterization phase allowed us to identify 36 relevant IDS implementations on three different embedded platforms: a Raspberry Pi 4B, a Jetson Xavier, and a Pynq-Z2. On the other hand, the IDS implementation mapping phase allowed us to design both standalone and distributed strategies to choose the best NIDSs to deploy depending on the context. The results of the project have led to three publications in international conferences and one publication in a journal.
Submitted 13 June, 2025;
originally announced June 2025.
-
The Karl G. Jansky Very Large Array Local Group L-band Survey (LGLBS)
Authors:
Eric W. Koch,
Adam K. Leroy,
Erik W. Rosolowsky,
Laura Chomiuk,
Julianne J. Dalcanton,
Nickolas M. Pingel,
Sumit K. Sarbadhicary,
Snežana Stanimirović,
Fabian Walter,
Haylee N. Archer,
Alberto D. Bolatto,
Michael P. Busch,
Hongxing Chen,
Ryan Chown,
Harrisen Corbould,
Serena A. Cronin,
Jeremy Darling,
Thomas Do,
Jennifer Donovan Meyer,
Cosima Eibensteiner,
Deidre Hunter,
Rémy Indebetouw,
Preshanth Jagannathan,
Amanda A. Kepley,
Chang-Goo Kim
, et al. (23 additional authors not shown)
Abstract:
We present the Local Group L-Band Survey (LGLBS), a Karl G. Jansky Very Large Array (VLA) survey producing the highest quality 21-cm and 1-2 GHz radio continuum images to date for the six VLA-accessible, star-forming, Local Group galaxies. Leveraging the VLA's spectral multiplexing power, we simultaneously survey the 21-cm line at high 0.4 km/s velocity resolution, the 1-2 GHz polarized continuum, and four OH lines. For the massive spiral M31, the dwarf spiral M33, and the dwarf irregular galaxies NGC6822, IC10, IC1613, and the Wolf-Lundmark-Melotte Galaxy (WLM), we use all four VLA configurations and the Green Bank Telescope to reach angular resolutions of $< 5''$ ($10{-}20$~pc) for the 21-cm line with $<10^{20}$~cm$^{-2}$ column density sensitivity, and even sharper views ($< 2''$; $5{-}10$~pc) of the continuum. Targeting these nearby galaxies ($D\lesssim1$ Mpc) reveals a sharp, resolved view of the atomic gas, including 21-cm absorption, and continuum emission from supernova remnants and HII regions. These datasets can be used to test theories of the abundance and formation of cold clouds, the driving and dissipation of interstellar turbulence, and the impact of feedback from massive stars and supernovae. Here, we describe the survey design and execution, scientific motivation, data processing, and quality assurance. We provide a first look at and publicly release the wide-field 21-cm HI data products for M31, M33, and four dwarf irregular targets in the survey, which represent some of the highest physical resolution 21-cm observations of any external galaxies beyond the LMC and SMC.
Submitted 13 June, 2025;
originally announced June 2025.
-
Why Do Class-Dependent Evaluation Effects Occur with Time Series Feature Attributions? A Synthetic Data Investigation
Authors:
Gregor Baer,
Isel Grau,
Chao Zhang,
Pieter Van Gorp
Abstract:
Evaluating feature attribution methods represents a critical challenge in explainable AI (XAI), as researchers typically rely on perturbation-based metrics when ground truth is unavailable. However, recent work demonstrates that these evaluation metrics can show different performance across predicted classes within the same dataset. These "class-dependent evaluation effects" raise questions about whether perturbation analysis reliably measures attribution quality, with direct implications for XAI method development and the trustworthiness of evaluation techniques. We investigate under which conditions these class-dependent effects arise by conducting controlled experiments with synthetic time series data where ground truth feature locations are known. We systematically vary feature types and class contrasts across binary classification tasks, then compare perturbation-based degradation scores with ground truth-based precision-recall metrics using multiple attribution methods. Our experiments demonstrate that class-dependent effects emerge with both evaluation approaches even in simple scenarios with temporally localized features, triggered by basic variations in feature amplitude or temporal extent between classes. Most critically, we find that perturbation-based and ground truth metrics frequently yield contradictory assessments of attribution quality across classes, with weak correlations between evaluation approaches. These findings suggest that researchers should interpret perturbation-based metrics with care, as they may not always align with whether attributions correctly identify discriminating features. These findings reveal opportunities to reconsider what attribution evaluation actually measures and to develop more comprehensive evaluation frameworks that capture multiple dimensions of attribution quality.
Submitted 13 June, 2025;
originally announced June 2025.
-
Bias and Identifiability in the Bounded Confidence Model
Authors:
Claudio Borile,
Jacopo Lenti,
Valentina Ghidini,
Corrado Monti,
Gianmarco De Francisci Morales
Abstract:
Opinion dynamics models such as the bounded confidence models (BCMs) describe how a population can reach consensus, fragmentation, or polarization, depending on a few parameters. Connecting such models to real-world data could help to understand such phenomena and to test model assumptions. To this end, estimation of model parameters is a key aspect, and maximum likelihood estimation provides a principled way to tackle it. Here, our goal is to outline the properties of statistical estimators of the two key BCM parameters: the confidence bound and the convergence rate. We find that their maximum likelihood estimators present different characteristics: the one for the confidence bound presents a small-sample bias but is consistent, while the estimator of the convergence rate shows a persistent bias. Moreover, the joint parameter estimation is affected by identifiability issues for specific regions of the parameter space, as several local maxima are present in the likelihood function. Our results show how the analysis of the likelihood function is a fruitful approach for better understanding the pitfalls and possibilities of estimating the parameters of opinion dynamics models and, more generally, agent-based models, and for offering formal guarantees for their calibration.
Submitted 13 June, 2025;
originally announced June 2025.
-
Practical colinear chaining on sequences revisited
Authors:
Nicola Rizzo,
Manuel Cáceres,
Veli Mäkinen
Abstract:
Colinear chaining is a classical heuristic for sequence alignment and is widely used in modern practical aligners. Jain et al. (J. Comput. Biol. 2022) proposed an $O(n \log^3 n)$ time algorithm to chain a set of $n$ anchors so that the chaining cost matches the edit distance of the input sequences, when anchors are maximal exact matches. Moreover, assuming a uniform and sparse distribution of anchors, they provided a practical solution ($\mathtt{ChainX}$) working in $O(n \cdot \mathsf{SOL} + n \log n)$ average-case time, where $\mathsf{SOL}$ is the cost of the output chain and $n$ is the number of anchors in the input. This practical solution is not guaranteed to be optimal: we study the failing cases, introduce the anchor diagonal distance, and find and implement an optimal algorithm working in the same $O(n \cdot \mathsf{OPT} + n \log n)$ average-case time, where $\mathsf{OPT}$ is the optimal chaining cost; then, we validate the results by Jain et al., show that $\mathtt{ChainX}$ can be suboptimal with a realistic long read dataset, and show minimal computational slowdown for our solution.
Submitted 13 June, 2025;
originally announced June 2025.
-
Quizzard@INOVA Challenge 2025 -- Track A: Plug-and-Play Technique in Interleaved Multi-Image Model
Authors:
Dinh Viet Cuong,
Hoang-Bao Le,
An Pham Ngoc Nguyen,
Liting Zhou,
Cathal Gurrin
Abstract:
This paper addresses two main objectives. Firstly, we demonstrate the impressive performance of LLaVA-NeXT-Interleave on 22 datasets across three different tasks: Multi-Image Reasoning, Documents and Knowledge-Based Understanding, and Interactive Multi-Modal Communication. Secondly, we add the Dense Channel Integration (DCI) connector to LLaVA-NeXT-Interleave and compare its performance against the standard model. We find that the standard model achieves the highest overall accuracy, excelling in vision-heavy tasks like VISION, NLVR2, and Fashion200K. Meanwhile, the DCI-enhanced version shows particular strength on datasets requiring deeper semantic coherence or structured change understanding such as MIT-States_PropertyCoherence and SlideVQA. Our results highlight the potential of combining powerful foundation models with plug-and-play techniques for Interleave tasks. The code is available at https://github.com/dinhvietcuong1996/icme25-inova.
Submitted 13 June, 2025;
originally announced June 2025.
-
ALMA millimetre-wavelength imaging of HD 138965: New constraints on the debris dust composition and presence of planetary companions
Authors:
J. P. Marshall,
S. Hengst,
A. Trejo-Cruz,
C. del Burgo,
J. Milli,
M. Booth,
J. C. Augereau,
E. Choquet,
F. Y. Morales,
P. Thébault,
F. Kemper,
V. Faramaz-Gorka,
G. Bryden
Abstract:
HD 138965 is a young A type star and member of the nearby young Argus association. This star is surrounded by a broad, bright debris disc with two temperature components that was spatially resolved at far-infrared wavelengths by Herschel. Here we present ALMA millimetre-wavelength imaging of the cool outer belt. These reveal its radial extent to be $150^{+10}_{-7}$ au with a width ($σ$) of $49^{+7}_{-6}$ au ($ΔR/R$ = 0.77), at a moderate inclination of $49.9^{+3.3}_{-3.7}$ degrees. Due to the limited angular resolution, signal-to-noise, and inclination we have no constraint on the disc's vertical scale height. We modelled the disc emission with both gravitational and radiation forces acting on the dust grains. As the inner belt has not been spatially resolved, we fixed its radius and width prior to modelling the outer belt. We find astronomical silicate is the best fit for the dust composition. However, we could not reject possible scenarios where there are at least 10% water-ice inclusions. Combining the spatially resolved imaging by ALMA with non-detection at optical wavelengths by HST, we obtain a limit on the scattering albedo $ω\leq 0.09$ for the debris dust in the outer belt. Analysis of the outer belt's architecture in conjunction with simple stirring models places a mass limit of $2.3~\pm~0.4 M_{\rm Jup}$ on a companion interior to the belt ($a \leq 78$ au), a factor of two improvement over constraints from high contrast imaging.
Submitted 13 June, 2025;
originally announced June 2025.
-
Classification of Quality Characteristics in Online User Feedback using Linguistic Analysis, Crowdsourcing and LLMs
Authors:
Eduard C. Groen,
Fabiano Dalpiaz,
Martijn van Vliet,
Boris Winter,
Joerg Doerr,
Sjaak Brinkkemper
Abstract:
Software qualities such as usability or reliability are among the strongest determinants of mobile app user satisfaction and constitute a significant portion of online user feedback on software products, making it a valuable source of quality-related feedback to guide the development process. The abundance of online user feedback warrants the automated identification of quality characteristics, but the online user feedback's heterogeneity and the lack of appropriate training corpora limit the applicability of supervised machine learning. We therefore investigate the viability of three approaches that could be effective in low-data settings: language patterns (LPs) based on quality-related keywords, instructions for crowdsourced micro-tasks, and large language model (LLM) prompts. We determined the feasibility of each approach and then compared their accuracy. For the complex multiclass classification of quality characteristics, the LP-based approach achieved a varied precision (0.38-0.92) depending on the quality characteristic, and low recall; crowdsourcing achieved the best average accuracy in two consecutive phases (0.63, 0.72), which could be matched by the best-performing LLM condition (0.66) and a prediction based on the LLMs' majority vote (0.68). Our findings show that in this low-data setting, the two approaches that use crowdsourcing or LLMs instead of involving experts achieve accurate classifications, while the LP-based approach has only limited potential. The promise of crowdsourcing and LLMs in this context might even extend to building training corpora.
Submitted 13 June, 2025;
originally announced June 2025.
-
Automatic differentiation for Lax-Wendroff-type discretizations
Authors:
Arpit Babbar,
Valentin Churavy,
Michael Schlottke Lakemper,
Hendrik Ranocha
Abstract:
Lax-Wendroff methods combined with discontinuous Galerkin/flux reconstruction spatial discretization provide a high-order, single-stage, quadrature-free method for solving hyperbolic conservation laws. In this work, we introduce automatic differentiation (AD) in the element-local time average flux computation step (the predictor step) of Lax-Wendroff methods. The application of AD is similar for methods of any order and does not need positivity corrections during the predictor step. This contrasts with the approximate Lax-Wendroff procedure, which requires different finite difference formulas for different orders of the method and positivity corrections in the predictor step for fluxes that can only be computed on admissible states. The method is Jacobian-free and problem-independent, allowing direct application to any physical flux function. Numerical experiments demonstrate the order and positivity preservation of the method. Additionally, performance comparisons indicate that the wall-clock time of automatic differentiation is always on par with the approximate Lax-Wendroff method.
Submitted 13 June, 2025;
originally announced June 2025.
-
Simulating realistic radio continuum survey maps with diffusion models
Authors:
Tobias Vičánek Martínez,
Henrik W. Edler,
Marcus Brüggen
Abstract:
The next generation of radio surveys is going to be transformative for cosmology and other aspects of our understanding of astrophysics. Realistic simulations of radio observations are essential for the design and planning of radio surveys. They are employed in the development of methods for tasks such as data calibration and reduction, automated analysis, and statistical studies in cosmology. We implemented software for machine learning-assisted simulations of realistic surveys with the LOFAR telescope, resulting in a synthetic radio sky model and a corresponding artificial telescope observation. We employed a diffusion model trained on LoTSS observations to generate individual radio galaxy images with control over the angular size. Single sources are assembled into a radio sky model, using an input catalog from cosmological simulations. We then transformed this sky model into visibilities corresponding to a typical LoTSS pointing. We added realistic noise to this synthetic measurement and obtained our final simulated sky maps through deconvolution. We explored different ways to evaluate our resulting sky model. We were able to simulate realistic LOFAR observations, covering a sky patch of 5x5 degrees at an effective resolution of 8.5 arcseconds. The simulated sources have flux and size distributions that match real observations, and the resulting maps have sensitivities compatible with LoTSS observations. Our diffusion model is able to synthesize high-quality realistic radio galaxy images with precise control over the source sizes. This software can readily be applied to other instruments.
Submitted 13 June, 2025;
originally announced June 2025.
-
Heavy-ball dynamics with Hessian-driven damping for non-convex optimization under the Łojasiewicz condition
Authors:
Vassilis Apidopoulos,
Vasiliki Mavrogeorgou,
Theodoros G. Tsironis
Abstract:
In this paper, we examine the convergence properties of heavy-ball dynamics with Hessian-driven damping in smooth non-convex optimization problems satisfying a Łojasiewicz condition. In this general setting, we provide a series of tight, worst-case optimal convergence rate guarantees as a function of the dynamics' friction coefficients and the Łojasiewicz exponent of the problem's objective function. Importantly, the linear rates that we obtain improve on previously available rates, and they suggest a different tuning of the dynamics' damping terms, even in the strongly convex regime. We complement our analysis with a range of stability estimates in the presence of perturbation errors and inexact gradient input, as well as an avoidance result showing that the dynamics under study avoid strict saddle points from almost every initial condition.
Submitted 13 June, 2025;
originally announced June 2025.