-
Euclid preparation. Full-shape modelling of 2-point and 3-point correlation functions in real space
Authors:
Euclid Collaboration,
M. Guidi,
A. Veropalumbo,
A. Pugno,
M. Moresco,
E. Sefusatti,
C. Porciani,
E. Branchini,
M. -A. Breton,
B. Camacho Quevedo,
M. Crocce,
S. de la Torre,
V. Desjacques,
A. Eggemeier,
A. Farina,
M. Kärcher,
D. Linde,
M. Marinucci,
A. Moradinezhad Dizgah,
C. Moretti,
K. Pardede,
A. Pezzotta,
E. Sarpa,
A. Amara,
S. Andreon
, et al. (286 additional authors not shown)
Abstract:
We investigate the accuracy and range of validity of the perturbative model for the 2-point (2PCF) and 3-point (3PCF) correlation functions in real space in view of the forthcoming analysis of the Euclid mission spectroscopic sample. We take advantage of clustering measurements from four snapshots of the Flagship I N-body simulations at z = {0.9, 1.2, 1.5, 1.8}, which mimic the expected galaxy pop…
▽ More
We investigate the accuracy and range of validity of the perturbative model for the 2-point (2PCF) and 3-point (3PCF) correlation functions in real space in view of the forthcoming analysis of the Euclid mission spectroscopic sample. We take advantage of clustering measurements from four snapshots of the Flagship I N-body simulations at z = {0.9, 1.2, 1.5, 1.8}, which mimic the expected galaxy population in the ideal case of absence of observational effects such as purity and completeness. For the 3PCF we consider all available triangle configurations given a minimal separation. First, we assess the model performance by fixing the cosmological parameters and evaluating the goodness-of-fit provided by the perturbative bias expansion in the joint analysis of the two statistics, finding overall agreement with the data down to separations of 20 Mpc/h. Subsequently, we build on the state-of-the-art and extend the analysis to include the dependence on three cosmological parameters: the amplitude of scalar perturbations As, the matter density ωcdm and the Hubble parameter h. To achieve this goal, we develop an emulator capable of generating fast and robust modelling predictions for the two summary statistics, allowing efficient sampling of the joint likelihood function. We therefore present the first joint full-shape analysis of the real-space 2PCF and 3PCF, testing the consistency and constraining power of the perturbative model across both probes, and assessing its performance in a combined likelihood framework. We explore possible systematic uncertainties induced by the perturbative model at small scales finding an optimal scale cut of rmin = 30 Mpc/h for the 3PCF, when imposing an additional limitation on nearly isosceles triangular configurations included in the data vector. This work is part of a Euclid Preparation series validating theoretical models for galaxy clustering.
△ Less
Submitted 27 June, 2025;
originally announced June 2025.
-
Euclid: Quick Data Release (Q1) -- Watching ICM-selected galaxy clusters with Euclid eyes -- prospects of Euclid data in the context of large SZ and X-ray based surveys
Authors:
M. Klein,
K. George,
J. J. Mohr,
B. Altieri,
L. Amendola,
S. Andreon,
N. Auricchio,
C. Baccigalupi,
M. Baldi,
A. Balestra,
S. Bardelli,
A. Biviano,
E. Branchini,
M. Brescia,
S. Camera,
G. Cañas-Herrera,
V. Capobianco,
C. Carbone,
J. Carretero,
S. Casas,
M. Castellano,
G. Castignani,
S. Cavuoti,
K. C. Chambers,
A. Cimatti
, et al. (122 additional authors not shown)
Abstract:
Galaxy clusters detected through their X-ray emission or Sunyaev--Zeldovich effect (SZE), both produced by the intra-cluster medium (ICM), are key probes in cosmological and astrophysical studies. To maximise the scientific return of such surveys, complementary data are required for cluster confirmation and redshift estimation. This is typically provided by wide-field optical and infrared surveys,…
▽ More
Galaxy clusters detected through their X-ray emission or Sunyaev--Zeldovich effect (SZE), both produced by the intra-cluster medium (ICM), are key probes in cosmological and astrophysical studies. To maximise the scientific return of such surveys, complementary data are required for cluster confirmation and redshift estimation. This is typically provided by wide-field optical and infrared surveys, which are increasingly challenged by ongoing and future ICM-selected samples. In particular, at high redshifts ($z>1$) probed by upcoming SZE-selected samples, current large surveys may be insufficient for reliable confirmation. Deep, high-resolution infrared surveys like Euclid will thus be essential for confirming most high-redshift clusters. We present an analysis of the first sizeable Euclid dataset (Q1), overlapping with several ICM-selected cluster samples. We apply an adaptation of the MCMF cluster confirmation tool to estimate key properties, including redshift and richness, and to predict Euclid's capabilities for high-redshift cluster confirmation. We find promising performance, particularly at high redshifts, while richness estimates at low redshifts ($z<0.4$) are currently limited by Q1 data quality but should improve with future releases. Using MCMF runs on random lines of sight, we predict that Euclid will confirm clusters at $1<z<2$ as effectively as current optical surveys at $z<0.6$, significantly enhancing high-redshift confirmation. SZE-selected samples will thus greatly benefit from Euclid overlap. Among five known high-$z$ SZE clusters in Q1, we identify the highest-redshift jellyfish galaxy candidate to date, EUCLJ035330.86$-$504347.6 in SPT-CLJ0353$-$5043 ($z=1.32$), two massive star-forming galaxies near ACT-CLJ0350.0$-$4819 ($z=1.46$), and strong lensing features in SPT-CLJ0353$-$5043 and SPT-CLJ0421$-$4845.
△ Less
Submitted 24 June, 2025;
originally announced June 2025.
-
Euclid: An emulator for baryonic effects on the matter bispectrum
Authors:
P. A. Burger,
G. Aricò,
L. Linke,
R. E. Angulo,
J. C. Broxterman,
J. Schaye,
M. Schaller,
M. Zennaro,
A. Halder,
L. Porth,
S. Heydenreich,
M. J. Hudson,
A. Amara,
S. Andreon,
C. Baccigalupi,
M. Baldi,
A. Balestra,
S. Bardelli,
A. Biviano,
E. Branchini,
M. Brescia,
S. Camera,
V. Capobianco,
C. Carbone,
V. F. Cardone
, et al. (131 additional authors not shown)
Abstract:
The Euclid mission and other next-generation large-scale structure surveys will enable high-precision measurements of the cosmic matter distribution. Understanding the impact of baryonic processes such as star formation and AGN feedback on matter clustering is crucial to ensure precise and unbiased cosmological inference. Most theoretical models of baryonic effects to date focus on two-point stati…
▽ More
The Euclid mission and other next-generation large-scale structure surveys will enable high-precision measurements of the cosmic matter distribution. Understanding the impact of baryonic processes such as star formation and AGN feedback on matter clustering is crucial to ensure precise and unbiased cosmological inference. Most theoretical models of baryonic effects to date focus on two-point statistics, neglecting higher-order contributions. This work develops a fast and accurate emulator for baryonic effects on the matter bispectrum, a key non-Gaussian statistic in the nonlinear regime. We employ high-resolution $N$-body simulations from the BACCO suite and apply a combination of cutting-edge techniques such as cosmology scaling and baryonification to efficiently span a large cosmological and astrophysical parameter space. A deep neural network is trained to emulate baryonic effects on the matter bispectrum measured in simulations, capturing modifications across various scales and redshifts relevant to Euclid. We validate the emulator accuracy and robustness using an analysis of \Euclid mock data, employing predictions from the state-of-the-art FLAMINGO hydrodynamical simulations. The emulator reproduces baryonic suppression in the bispectrum to better than 2$\%$ for the $68\%$ percentile across most triangle configurations for $k \in [0.01, 20]\,h^{-1}\mathrm{Mpc}$ and ensures consistency between cosmological posteriors inferred from second- and third-order weak lensing statistics.
△ Less
Submitted 23 June, 2025;
originally announced June 2025.
-
Euclid: The potential of slitless infrared spectroscopy: A z=5.4 quasar and new ultracool dwarfs
Authors:
E. Bañados,
V. Le Brun,
S. Belladitta,
I. Momcheva,
D. Stern,
J. Wolf,
M. Ezziati,
D. J. Mortlock,
A. Humphrey,
R. L. Smart,
S. L. Casewell,
A. Pérez-Garrido,
B. Goldman,
E. L. Martín,
A. Mohandasan,
C. Reylé,
C. Dominguez-Tagle,
Y. Copin,
E. Lusso,
Y. Matsuoka,
K. McCarthy,
F. Ricci,
H. -W. Rix,
H. J. A. Rottgering,
J. -T. Schindler
, et al. (204 additional authors not shown)
Abstract:
We demonstrate the potential of Euclid's slitless spectroscopy to discover high-redshift (z>5) quasars and their main photometric contaminant, ultracool dwarfs. Sensitive infrared spectroscopy from space is able to efficiently identify both populations, as demonstrated by Euclid Near-Infrared Spectrometer and Photometer Red Grism (NISP RGE) spectra of the newly discovered z=5.404 quasar EUCL J1815…
▽ More
We demonstrate the potential of Euclid's slitless spectroscopy to discover high-redshift (z>5) quasars and their main photometric contaminant, ultracool dwarfs. Sensitive infrared spectroscopy from space is able to efficiently identify both populations, as demonstrated by Euclid Near-Infrared Spectrometer and Photometer Red Grism (NISP RGE) spectra of the newly discovered z=5.404 quasar EUCL J181530.01+652054.0, as well as several ultracool dwarfs in the Euclid Deep Field North and the Euclid Early Release Observation field Abell 2764. The ultracool dwarfs were identified by cross-correlating their spectra with templates. The quasar was identified by its strong and broad CIII] and MgII emission lines in the NISP RGE 1206-1892 nm spectrum, and confirmed through optical spectroscopy from the Large Binocular Telescope. The NISP Blue Grism (NISP BGE) 926-1366 nm spectrum confirms CIV and CIII] emission. NISP RGE can find bright quasars at z~5.5 and z>7, redshift ranges that are challenging for photometric selection due to contamination from ultracool dwarfs. EUCL J181530.01+652054.0 is a high-excitation, broad absorption line quasar detected at 144 MHz by the LOw-Frequency Array (L144=4e25 W/Hz). The quasar has a bolometric luminosity of 3e12 Lsun and is powered by a 3.4e9 Msun black hole. The discovery of this bright quasar is noteworthy as fewer than one such object was expected in the ~20 deg2 surveyed. This finding highlights the potential and effectiveness of NISP spectroscopy in identifying rare, luminous high-redshift quasars, previewing the census of these sources that Euclid's slitless spectroscopy will deliver over about 14,000 deg2 of the sky.
△ Less
Submitted 16 June, 2025;
originally announced June 2025.
-
RelTopo: Enhancing Relational Modeling for Driving Scene Topology Reasoning
Authors:
Yueru Luo,
Changqing Zhou,
Yiming Yang,
Erlong Li,
Chao Zheng,
Shuqi Mei,
Shuguang Cui,
Zhen Li
Abstract:
Accurate road topology reasoning is critical for autonomous driving, enabling effective navigation and adherence to traffic regulations. Central to this task are lane perception and topology reasoning. However, existing methods typically focus on either lane detection or Lane-to-Lane (L2L) topology reasoning, often \textit{neglecting} Lane-to-Traffic-element (L2T) relationships or \textit{failing}…
▽ More
Accurate road topology reasoning is critical for autonomous driving, enabling effective navigation and adherence to traffic regulations. Central to this task are lane perception and topology reasoning. However, existing methods typically focus on either lane detection or Lane-to-Lane (L2L) topology reasoning, often \textit{neglecting} Lane-to-Traffic-element (L2T) relationships or \textit{failing} to optimize these tasks jointly. Furthermore, most approaches either overlook relational modeling or apply it in a limited scope, despite the inherent spatial relationships among road elements. We argue that relational modeling is beneficial for both perception and reasoning, as humans naturally leverage contextual relationships for road element recognition and their connectivity inference. To this end, we introduce relational modeling into both perception and reasoning, \textit{jointly} enhancing structural understanding. Specifically, we propose: 1) a relation-aware lane detector, where our geometry-biased self-attention and \curve\ cross-attention refine lane representations by capturing relational dependencies; 2) relation-enhanced topology heads, including a geometry-enhanced L2L head and a cross-view L2T head, boosting reasoning with relational cues; and 3) a contrastive learning strategy with InfoNCE loss to regularize relationship embeddings. Extensive experiments on OpenLane-V2 demonstrate that our approach significantly improves both detection and topology reasoning metrics, achieving +3.1 in DET$_l$, +5.3 in TOP$_{ll}$, +4.9 in TOP$_{lt}$, and an overall +4.4 in OLS, setting a new state-of-the-art. Code will be released.
△ Less
Submitted 16 June, 2025;
originally announced June 2025.
-
Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers
Authors:
Yixiao Huang,
Hanlin Zhu,
Tianyu Guo,
Jiantao Jiao,
Somayeh Sojoudi,
Michael I. Jordan,
Stuart Russell,
Song Mei
Abstract:
Large language models (LLMs) can acquire new knowledge through fine-tuning, but this process exhibits a puzzling duality: models can generalize remarkably from new facts, yet are also prone to hallucinating incorrect information. However, the reasons for this phenomenon remain poorly understood. In this work, we argue that both behaviors stem from a single mechanism known as out-of-context reasoni…
▽ More
Large language models (LLMs) can acquire new knowledge through fine-tuning, but this process exhibits a puzzling duality: models can generalize remarkably from new facts, yet are also prone to hallucinating incorrect information. However, the reasons for this phenomenon remain poorly understood. In this work, we argue that both behaviors stem from a single mechanism known as out-of-context reasoning (OCR): the ability to deduce implications by associating concepts, even those without a causal link. Our experiments across five prominent LLMs confirm that OCR indeed drives both generalization and hallucination, depending on whether the associated concepts are causally related. To build a rigorous theoretical understanding of this phenomenon, we then formalize OCR as a synthetic factual recall task. We empirically show that a one-layer single-head attention-only transformer with factorized output and value matrices can learn to solve this task, while a model with combined weights cannot, highlighting the crucial role of matrix factorization. Our theoretical analysis shows that the OCR capability can be attributed to the implicit bias of gradient descent, which favors solutions that minimize the nuclear norm of the combined output-value matrix. This mathematical structure explains why the model learns to associate facts and implications with high sample efficiency, regardless of whether the correlation is causal or merely spurious. Ultimately, our work provides a theoretical foundation for understanding the OCR phenomenon, offering a new lens for analyzing and mitigating undesirable behaviors from knowledge injection.
△ Less
Submitted 4 July, 2025; v1 submitted 12 June, 2025;
originally announced June 2025.
-
Euclid preparation. Accurate and precise data-driven angular power spectrum covariances
Authors:
Euclid Collaboration,
K. Naidoo,
J. Ruiz-Zapatero,
N. Tessore,
B. Joachimi,
A. Loureiro,
N. Aghanim,
B. Altieri,
A. Amara,
L. Amendola,
S. Andreon,
N. Auricchio,
C. Baccigalupi,
D. Bagot,
M. Baldi,
S. Bardelli,
P. Battaglia,
A. Biviano,
E. Branchini,
M. Brescia,
S. Camera,
V. Capobianco,
C. Carbone,
V. F. Cardone,
J. Carretero
, et al. (258 additional authors not shown)
Abstract:
We develop techniques for generating accurate and precise internal covariances for measurements of clustering and weak lensing angular power spectra. These methods are designed to produce non-singular and unbiased covariances for Euclid's large anticipated data vector and will be critical for validation against observational systematic effects. We construct jackknife segments that are equal in are…
▽ More
We develop techniques for generating accurate and precise internal covariances for measurements of clustering and weak lensing angular power spectra. These methods are designed to produce non-singular and unbiased covariances for Euclid's large anticipated data vector and will be critical for validation against observational systematic effects. We construct jackknife segments that are equal in area to high precision by adapting the binary space partition algorithm to work on arbitrarily shaped regions on the unit sphere. Jackknife estimates of the covariances are internally derived and require no assumptions about cosmology or galaxy population and bias. Our covariance estimation, called DICES (Debiased Internal Covariance Estimation with Shrinkage), first estimates a noisy covariance through conventional delete-1 jackknife resampling. This is followed by linear shrinkage of the empirical correlation matrix towards the Gaussian prediction, rather than linear shrinkage of the covariance matrix. Shrinkage ensures the covariance is non-singular and therefore invertible, critical for the estimation of likelihoods and validation. We then apply a delete-2 jackknife bias correction to the diagonal components of the jackknife covariance that removes the general tendency for jackknife error estimates to be biased high. We validate internally derived covariances, which use the jackknife resampling technique, on synthetic Euclid-like lognormal catalogues. We demonstrate that DICES produces accurate, non-singular covariance estimates, with the relative error improving by $33\%$ for the covariance and $48\%$ for the correlation structure in comparison to jackknife estimates. These estimates can be used for highly accurate regression and inference.
△ Less
Submitted 10 June, 2025;
originally announced June 2025.
-
SPEED-RL: Faster Training of Reasoning Models via Online Curriculum Learning
Authors:
Ruiqi Zhang,
Daman Arora,
Song Mei,
Andrea Zanette
Abstract:
Training large language models with reinforcement learning (RL) against verifiable rewards significantly enhances their reasoning abilities, yet remains computationally expensive due to inefficient uniform prompt sampling. We introduce Selective Prompting with Efficient Estimation of Difficulty (SPEED), an adaptive online RL curriculum that selectively chooses training examples of intermediate dif…
▽ More
Training large language models with reinforcement learning (RL) against verifiable rewards significantly enhances their reasoning abilities, yet remains computationally expensive due to inefficient uniform prompt sampling. We introduce Selective Prompting with Efficient Estimation of Difficulty (SPEED), an adaptive online RL curriculum that selectively chooses training examples of intermediate difficulty to maximize learning efficiency. Theoretically, we establish that intermediate-difficulty prompts improve the gradient estimator's signal-to-noise ratio, accelerating convergence. Empirically, our efficient implementation leads to 2x to 6x faster training without degrading accuracy, requires no manual tuning, and integrates seamlessly into standard RL algorithms.
△ Less
Submitted 10 June, 2025;
originally announced June 2025.
-
Euclid preparation: The NISP spectroscopy channel, on ground performance and calibration
Authors:
Euclid Collaboration,
W. Gillard,
T. Maciaszek,
E. Prieto,
F. Grupp,
A. Costille,
K. Jahnke,
J. Clemens,
S. Dusini,
M. Carle,
C. Sirignano,
E. Medinaceli,
S. Ligori,
E. Franceschi,
M. Trifoglio,
W. Bon,
R. Barbier,
S. Ferriol,
A. Secroun,
N. Auricchio,
P. Battaglia,
C. Bonoli,
L. Corcione,
F. Hormuth,
D. Le Mignant
, et al. (334 additional authors not shown)
Abstract:
ESA's Euclid cosmology mission relies on the very sensitive and accurately calibrated spectroscopy channel of the Near-Infrared Spectrometer and Photometer (NISP). With three operational grisms in two wavelength intervals, NISP provides diffraction-limited slitless spectroscopy over a field of $0.57$ deg$^2$. A blue grism $\text{BG}_\text{E}$ covers the wavelength range $926$--$1366$\,nm at a spec…
▽ More
ESA's Euclid cosmology mission relies on the very sensitive and accurately calibrated spectroscopy channel of the Near-Infrared Spectrometer and Photometer (NISP). With three operational grisms in two wavelength intervals, NISP provides diffraction-limited slitless spectroscopy over a field of $0.57$ deg$^2$. A blue grism $\text{BG}_\text{E}$ covers the wavelength range $926$--$1366$\,nm at a spectral resolution $R=440$--$900$ for a $0.5''$ diameter source with a dispersion of $1.24$ nm px$^{-1}$. Two red grisms $\text{RG}_\text{E}$ span $1206$ to $1892$\,nm at $R=550$--$740$ and a dispersion of $1.37$ nm px$^{-1}$. We describe the construction of the grisms as well as the ground testing of the flight model of the NISP instrument where these properties were established.
△ Less
Submitted 9 June, 2025;
originally announced June 2025.
-
Euclid preparation. Constraining parameterised models of modifications of gravity with the spectroscopic and photometric primary probes
Authors:
Euclid Collaboration,
I. S. Albuquerque,
N. Frusciante,
Z. Sakr,
S. Srinivasan,
L. Atayde,
B. Bose,
V. F. Cardone,
S. Casas,
M. Martinelli,
J. Noller,
E. M. Teixeira,
D. B. Thomas,
I. Tutusaus,
M. Cataneo,
K. Koyama,
L. Lombriser,
F. Pace,
A. Silvestri,
N. Aghanim,
A. Amara,
S. Andreon,
N. Auricchio,
C. Baccigalupi,
M. Baldi
, et al. (263 additional authors not shown)
Abstract:
The Euclid mission has the potential to understand the fundamental physical nature of late-time cosmic acceleration and, as such, of deviations from the standard cosmological model, LCDM. In this paper, we focus on model-independent methods to modify the evolution of scalar perturbations at linear scales. We consider two approaches: the first is based on the two phenomenological modified gravity (…
▽ More
The Euclid mission has the potential to understand the fundamental physical nature of late-time cosmic acceleration and, as such, of deviations from the standard cosmological model, LCDM. In this paper, we focus on model-independent methods to modify the evolution of scalar perturbations at linear scales. We consider two approaches: the first is based on the two phenomenological modified gravity (PMG) parameters, $μ_{\rm mg}$ and $Σ_{\rm mg}$, which are phenomenologically connected to the clustering of matter and weak lensing, respectively; and the second is the effective field theory (EFT) of dark energy and modified gravity, which we use to parameterise the braiding function, $α_{\rm B}$, which defines the mixing between the metric and the dark energy field. We discuss the predictions from spectroscopic and photometric primary probes by Euclid on the cosmological parameters and a given set of additional parameters featuring the PMG and EFT models. We use the Fisher matrix method applied to spectroscopic galaxy clustering (GCsp), weak lensing (WL), photometric galaxy clustering (GCph), and cross-correlation (XC) between GCph and WL. For the modelling of photometric predictions on nonlinear scales, we use the halo model to cover two limits for the screening mechanism: the unscreened (US) case, for which the screening mechanism is not present; and the super-screened (SS) case, which assumes strong screening. We also assume scale cuts to account for our uncertainties in the modelling of nonlinear perturbation evolution. We choose a time-dependent form for $\{μ_{\rm mg},Σ_{\rm mg}\}$, with two fiducial sets of values for the corresponding model parameters at the present time, $\{\barμ_0,\barΣ_0\}$, and two forms for $α_{\rm B}$, with one fiducial set of values for each of the model parameters, $α_{\rm B,0}$ and $\{α_{\rm B,0},m\}$. (Abridged)
△ Less
Submitted 3 June, 2025;
originally announced June 2025.
-
ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation
Authors:
Hao Chen,
Yukun Yan,
Sen Mei,
Wanxiang Che,
Zhenghao Liu,
Qi Shi,
Xinze Li,
Yuchun Fan,
Pengcheng Huang,
Qiushi Xiong,
Zhiyuan Liu,
Maosong Sun
Abstract:
Retrieval-Augmented Generation (RAG) augments Large Language Models (LLMs) with external knowledge to improve factuality. However, existing RAG systems frequently underutilize the retrieved documents, failing to extract and integrate the key clues needed to support faithful and interpretable reasoning, especially in cases where relevant evidence is implicit, scattered, or obscured by noise. To add…
▽ More
Retrieval-Augmented Generation (RAG) augments Large Language Models (LLMs) with external knowledge to improve factuality. However, existing RAG systems frequently underutilize the retrieved documents, failing to extract and integrate the key clues needed to support faithful and interpretable reasoning, especially in cases where relevant evidence is implicit, scattered, or obscured by noise. To address this issue, we propose ClueAnchor, a novel framework for enhancing RAG via clue-anchored reasoning exploration and optimization. ClueAnchor extracts key clues from retrieved content and generates multiple reasoning paths based on different knowledge configurations, optimizing the model by selecting the most effective one through reward-based preference optimization. Experiments show that ClueAnchor significantly outperforms prior RAG baselines in reasoning completeness and robustness. Further analysis confirms its strong resilience to noisy or partially relevant retrieved content, as well as its capability to identify supporting evidence even in the absence of explicit clue supervision during inference.
△ Less
Submitted 30 May, 2025;
originally announced May 2025.
-
Euclid: Early Release Observations of ram-pressure stripping in the Perseus cluster. Detection of parsec scale star formation with in the low surface brightness stripped tails of UGC 2665 and MCG +07-07-070
Authors:
Koshy George,
A. Boselli,
J. -C. Cuillandre,
M. Kümmel,
A. Lançon,
C. Bellhouse,
T. Saifollahi,
M. Mondelin,
M. Bolzonella,
P. Joseph,
I. D. Roberts,
R. J. van Weeren,
Q. Liu,
E. Sola,
M. Urbano,
M. Baes,
R. F. Peletier,
M. Klein,
C. T. Davies,
I. A. Zinchenko,
J. G. Sorce,
M. Poulain,
N. Aghanim,
B. Altieri,
A. Amara
, et al. (155 additional authors not shown)
Abstract:
Euclid is delivering optical and near-infrared imaging data over 14,000 deg$^2$ on the sky at spatial resolution and surface brightness levels that can be used to understand the morphological transformation of galaxies within groups and clusters. Using the Early Release Observations (ERO) of the Perseus cluster, we demonstrate the capability offered by Euclid in studying the nature of perturbation…
▽ More
Euclid is delivering optical and near-infrared imaging data over 14,000 deg$^2$ on the sky at spatial resolution and surface brightness levels that can be used to understand the morphological transformation of galaxies within groups and clusters. Using the Early Release Observations (ERO) of the Perseus cluster, we demonstrate the capability offered by Euclid in studying the nature of perturbations for galaxies in clusters. Filamentary structures are observed along the discs of two spiral galaxies with no extended diffuse emission expected from tidal interactions at surface brightness levels of $\sim$ $30\,{\rm mag}\,{\rm arcsec}^{-2}$. The detected features exhibit a good correspondence in morphology between optical and near-infrared wavelengths, with a surface brightness of $\sim$ $25\,{\rm mag}\,{\rm arcsec}^{-2}$, and the knots within the features have sizes of $\sim$ 100 pc, as observed through $I_E$ imaging. Using the Euclid, CFHT, UVIT, and LOFAR $144\,{\rm MHz}$ radio continuum observations, we conduct a detailed analysis to understand the origin of the detected features. We constructed the \textit{Euclid} $I_E-Y_E$, $Y_E-H_E$, and CFHT $u - r$, $g - i$ colour-colour plane and showed that these features contain recent star formation events, which are also indicated by their H$α$ and NUV emissions. Euclid colours alone are insufficient for studying stellar population ages in unresolved star-forming regions, which require multi-wavelength optical imaging data. The morphological shape, orientation, and mean age of the stellar population, combined with the presence of extended radio continuum cometary tails can be consistently explained if these features have been formed during a recent ram-pressure stripping event. This result further confirms the exceptional qualities of Euclid in the study of galaxy evolution in dense environments.
△ Less
Submitted 28 May, 2025;
originally announced May 2025.
-
ConsRec: Denoising Sequential Recommendation through User-Consistent Preference Modeling
Authors:
Haidong Xin,
Qiushi Xiong,
Zhenghao Liu,
Sen Mei,
Yukun Yan,
Shi Yu,
Shuo Wang,
Yu Gu,
Ge Yu,
Chenyan Xiong
Abstract:
User-item interaction histories are pivotal for sequential recommendation systems but often include noise, such as unintended clicks or actions that fail to reflect genuine user preferences. To address this issue, we propose the User-Consistent Preference-based Sequential Recommendation System (ConsRec), designed to capture stable user preferences and filter noisy items from interaction histories.…
▽ More
User-item interaction histories are pivotal for sequential recommendation systems but often include noise, such as unintended clicks or actions that fail to reflect genuine user preferences. To address this issue, we propose the User-Consistent Preference-based Sequential Recommendation System (ConsRec), designed to capture stable user preferences and filter noisy items from interaction histories. Specifically, ConsRec constructs a user-interacted item graph, learns item similarities from their text representations, and then extracts the maximum connected subgraph from the user-interacted item graph for denoising items. Experimental results on the Yelp and Amazon Product datasets illustrate that ConsRec achieves a 13% improvement over baseline recommendation models, showing its effectiveness in denoising user-interacted items. Further analysis reveals that the denoised interaction histories form semantically tighter clusters of user-preferred items, leading to higher relevance scores for ground-truth targets and more accurate recommendations. All codes are available at https://github.com/NEUIR/ConsRec.
△ Less
Submitted 28 May, 2025;
originally announced May 2025.
-
OVERT: A Benchmark for Over-Refusal Evaluation on Text-to-Image Models
Authors:
Ziheng Cheng,
Yixiao Huang,
Hui Xu,
Somayeh Sojoudi,
Xuandong Zhao,
Dawn Song,
Song Mei
Abstract:
Text-to-Image (T2I) models have achieved remarkable success in generating visual content from text inputs. Although multiple safety alignment strategies have been proposed to prevent harmful outputs, they often lead to overly cautious behavior -- rejecting even benign prompts -- a phenomenon known as $\textit{over-refusal}$ that reduces the practical utility of T2I models. Despite over-refusal hav…
▽ More
Text-to-Image (T2I) models have achieved remarkable success in generating visual content from text inputs. Although multiple safety alignment strategies have been proposed to prevent harmful outputs, they often lead to overly cautious behavior -- rejecting even benign prompts -- a phenomenon known as $\textit{over-refusal}$ that reduces the practical utility of T2I models. Despite over-refusal having been observed in practice, there is no large-scale benchmark that systematically evaluates this phenomenon for T2I models. In this paper, we present an automatic workflow to construct synthetic evaluation data, resulting in OVERT ($\textbf{OVE}$r-$\textbf{R}$efusal evaluation on $\textbf{T}$ext-to-image models), the first large-scale benchmark for assessing over-refusal behaviors in T2I models. OVERT includes 4,600 seemingly harmful but benign prompts across nine safety-related categories, along with 1,785 genuinely harmful prompts (OVERT-unsafe) to evaluate the safety-utility trade-off. Using OVERT, we evaluate several leading T2I models and find that over-refusal is a widespread issue across various categories (Figure 1), underscoring the need for further research to enhance the safety alignment of T2I models without compromising their functionality. As a preliminary attempt to reduce over-refusal, we explore prompt rewriting; however, we find it often compromises faithfulness to the meaning of the original prompts. Finally, we demonstrate the flexibility of our generation framework in accommodating diverse safety requirements by generating customized evaluation data adapting to user-defined policies.
△ Less
Submitted 27 May, 2025; v1 submitted 27 May, 2025;
originally announced May 2025.
-
Learning Wavelet-Sparse FDK for 3D Cone-Beam CT Reconstruction
Authors:
Yipeng Sun,
Linda-Sophie Schneider,
Chengze Ye,
Mingxuan Gu,
Siyuan Mei,
Siming Bayer,
Andreas Maier
Abstract:
Cone-Beam Computed Tomography (CBCT) is essential in medical imaging, and the Feldkamp-Davis-Kress (FDK) algorithm is a popular choice for reconstruction due to its efficiency. However, FDK is susceptible to noise and artifacts. While recent deep learning methods offer improved image quality, they often increase computational complexity and lack the interpretability of traditional methods. In this…
▽ More
Cone-Beam Computed Tomography (CBCT) is essential in medical imaging, and the Feldkamp-Davis-Kress (FDK) algorithm is a popular choice for reconstruction due to its efficiency. However, FDK is susceptible to noise and artifacts. While recent deep learning methods offer improved image quality, they often increase computational complexity and lack the interpretability of traditional methods. In this paper, we introduce an enhanced FDK-based neural network that maintains the classical algorithm's interpretability by selectively integrating trainable elements into the cosine weighting and filtering stages. Recognizing the challenge of a large parameter space inherent in 3D CBCT data, we leverage wavelet transformations to create sparse representations of the cosine weights and filters. This strategic sparsification reduces the parameter count by $93.75\%$ without compromising performance, accelerates convergence, and importantly, maintains the inference computational cost equivalent to the classical FDK algorithm. Our method not only ensures volumetric consistency and boosts robustness to noise, but is also designed for straightforward integration into existing CT reconstruction pipelines. This presents a pragmatic enhancement that can benefit clinical applications, particularly in environments with computational limitations.
△ Less
Submitted 19 May, 2025;
originally announced May 2025.
-
GeoMM: On Geodesic Perspective for Multi-modal Learning
Authors:
Shibin Mei,
Hang Wang,
Bingbing Ni
Abstract:
Geodesic distance serves as a reliable means of measuring distance in nonlinear spaces, and such nonlinear manifolds are prevalent in the current multimodal learning. In these scenarios, some samples may exhibit high similarity, yet they convey different semantics, making traditional distance metrics inadequate for distinguishing between positive and negative samples. This paper introduces geodesi…
▽ More
Geodesic distance serves as a reliable means of measuring distance in nonlinear spaces, and such nonlinear manifolds are prevalent in the current multimodal learning. In these scenarios, some samples may exhibit high similarity, yet they convey different semantics, making traditional distance metrics inadequate for distinguishing between positive and negative samples. This paper introduces geodesic distance as a novel distance metric in multi-modal learning for the first time, to mine correlations between samples, aiming to address the limitations of common distance metric. Our approach incorporates a comprehensive series of strategies to adapt geodesic distance for the current multimodal learning. Specifically, we construct a graph structure to represent the adjacency relationships among samples by thresholding distances between them and then apply the shortest-path algorithm to obtain geodesic distance within this graph. To facilitate efficient computation, we further propose a hierarchical graph structure through clustering and combined with incremental update strategies for dynamic status updates. Extensive experiments across various downstream tasks validate the effectiveness of our proposed method, demonstrating its capability to capture complex relationships between samples and improve the performance of multimodal learning models.
△ Less
Submitted 16 May, 2025;
originally announced May 2025.
-
Euclid: Photometric redshift calibration with the clustering redshifts technique
Authors:
W. d'Assignies Doumerg,
M. Manera,
C. Padilla,
O. Ilbert,
H. Hildebrandt,
L. Reynolds,
J. Chaves-Montero,
A. H. Wright,
P. Tallada-Crespí,
M. Eriksen,
J. Carretero,
W. Roster,
Y. Kang,
K. Naidoo,
R. Miquel,
B. Altieri,
A. Amara,
S. Andreon,
N. Auricchio,
C. Baccigalupi,
D. Bagot,
M. Baldi,
A. Balestra,
S. Bardelli,
P. Battaglia
, et al. (150 additional authors not shown)
Abstract:
Aims: The precision of cosmological constraints from imaging surveys hinges on accurately estimating the redshift distribution $ n(z) $ of tomographic bins, especially their mean redshifts. We assess the effectiveness of the clustering redshifts technique in constraining Euclid tomographic redshift bins to meet the target uncertainty of $ σ( \langle z \rangle ) < 0.002 (1 + z) $. In this work, the…
▽ More
Aims: The precision of cosmological constraints from imaging surveys hinges on accurately estimating the redshift distribution $ n(z) $ of tomographic bins, especially their mean redshifts. We assess the effectiveness of the clustering redshifts technique in constraining Euclid tomographic redshift bins to meet the target uncertainty of $ σ( \langle z \rangle ) < 0.002 (1 + z) $. In this work, these mean redshifts are inferred from the small-scale angular clustering of Euclid galaxies, which are distributed into bins with spectroscopic samples localised in narrow redshift slices.
Methods: We generate spectroscopic mocks from the Flagship2 simulation for the Baryon Oscillation Spectroscopic Survey (BOSS), the Dark Energy Spectroscopic Instrument (DESI), and Euclid's Near-Infrared Spectrometer and Photometer (NISP) spectroscopic survey. We evaluate and optimise the clustering redshifts pipeline, introducing a new method for measuring photometric galaxy bias (clustering), which is the primary limitation of this technique.
Results: We have successfully constrained the means and standard deviations of the redshift distributions for all of the tomographic bins (with a maximum photometric redshift of 1.6), achieving precision beyond the required thresholds. We have identified the main sources of bias, particularly the impact of the 1-halo galaxy distribution, which imposed a minimal separation scale of 1.5 Mpc for evaluating cross-correlations. These results demonstrate the potential of clustering redshifts to meet the precision requirements for Euclid, and we highlight several avenues for future improvements.
△ Less
Submitted 15 May, 2025;
originally announced May 2025.
-
Coevolution of halo and quasar properties in dense environments: CARLA J1017+6116 at z=2.8
Authors:
Sofia G. Gallego,
Simona Mei,
Christopher Martin,
Donal O'Sullivan,
Emanuele Daddi,
Dominika Wylezalek,
Nicholas Seymour
Abstract:
Radio-loud active galactic nuclei, in particular radio-loud quasars, are fueled by accretion onto supermassive black holes and are among the most energetic sources in the Universe. While their impact on their surroundings - from the interstellar medium to the circumgalactic medium - is well recognized, the specific mechanisms remain uncertain. In this study we analyze deep Keck Cosmic Web Imager o…
▽ More
Radio-loud active galactic nuclei, in particular radio-loud quasars, are fueled by accretion onto supermassive black holes and are among the most energetic sources in the Universe. While their impact on their surroundings - from the interstellar medium to the circumgalactic medium - is well recognized, the specific mechanisms remain uncertain. In this study we analyze deep Keck Cosmic Web Imager observations of the Lyman-alpha (Lya) halo surrounding the radio-loud quasar at the center of the cluster CARLA J1017+6116 at redshift z = 2.8. As is known from previous observations, the cluster hosts a high fraction of early-type galaxies, and the star formation of its spectroscopically confirmed cluster members is typical of or higher than that of galaxies on the main sequence. We find that the Lya halo extends at least 16 arcsec (128 pkpc) down to a surface brightness level of 1e-19 erg/s/cm^2/arcsec^2, with a total observed Lya luminosity of log10(L/Lsun) = 43.35 +- 0.05. The halo has distinct kinematic regions with asymmetries suggestive of complex interactions between the quasar and the intracluster medium, possibly driven by a combination of biconical feedback and episodic activity. Despite the quasar classification, our reanalysis of very long baseline interferometry data finds no evidence of extended jet structures; we instead find compact and variable radio emission that could indicate episodic jet activity or suppression by the dense interstellar medium. Combining these observations with imaging obtained with the Hubble Space Telescope, we identified one Lya-emitting source within the quasar halo. While mechanical feedback from a jet appears limited or episodic, radiative feedback likely plays a dominant role in shaping the extended Lya halo, highlighting the complex interplay between quasar-driven processes and the surrounding dense environment.
△ Less
Submitted 15 May, 2025;
originally announced May 2025.
-
Euclid preparation. The impact of redshift interlopers on the two-point correlation function analysis
Authors:
Euclid Collaboration,
I. Risso,
A. Veropalumbo,
E. Branchini,
E. Maragliano,
S. de la Torre,
E. Sarpa,
P. Monaco,
B. R. Granett,
S. Lee,
G. E. Addison,
S. Bruton,
C. Carbone,
G. Lavaux,
K. Markovic,
K. McCarthy,
G. Parimbelli,
F. Passalacqua,
W. J. Percival,
C. Scarlata,
E. Sefusatti,
Y. Wang,
M. Bonici,
F. Oppizzi,
N. Aghanim
, et al. (295 additional authors not shown)
Abstract:
The Euclid survey aims to measure the spectroscopic redshift of emission-line galaxies by identifying the H$\,α$ line in their slitless spectra. This method is sensitive to the signal-to-noise ratio of the line, as noise fluctuations or other strong emission lines can be misidentified as H$\,α$, depending on redshift. These effects lead to catastrophic redshift errors and the inclusion of interlop…
▽ More
The Euclid survey aims to measure the spectroscopic redshift of emission-line galaxies by identifying the H$\,α$ line in their slitless spectra. This method is sensitive to the signal-to-noise ratio of the line, as noise fluctuations or other strong emission lines can be misidentified as H$\,α$, depending on redshift. These effects lead to catastrophic redshift errors and the inclusion of interlopers in the sample. We forecast the impact of such redshift errors on galaxy clustering measurements. In particular, we study the effect of interloper contamination on the two-point correlation function (2PCF), the growth rate of structures, and the Alcock-Paczynski (AP) parameters. We analyze 1000 synthetic spectroscopic catalogues, the EuclidLargeMocks, designed to match the area and selection function of the Data Release 1 (DR1) sample. We estimate the 2PCF of the contaminated catalogues, isolating contributions from correctly identified galaxies and from interlopers. We explore different models with increasing complexity to describe the measured 2PCF at fixed cosmology. Finally, we perform a cosmological inference and evaluate the systematic error on the inferred $fσ_8$, $α_{\parallel}$ and $α_{\perp}$ values associated with different models. Our results demonstrate that a minimal modelling approach, which only accounts for an attenuation of the clustering signal regardless of the type of contaminants, is sufficient to recover the correct values of $fσ_8$, $α_{\parallel}$, and $α_{\perp}$ at DR1. The accuracy and precision of the estimated AP parameters are largely insensitive to the presence of interlopers. The adoption of a minimal model induces a 1%-3% systematic error on the growth rate of structure estimation, depending on the redshift. However, this error remains smaller than the statistical error expected for the Euclid DR1 analysis.
△ Less
Submitted 7 May, 2025;
originally announced May 2025.
-
Euclid preparation: TBD. Cosmic Dawn Survey: evolution of the galaxy stellar mass function across 0.2<z<6.5 measured over 10 square degrees
Authors:
Euclid Collaboration,
L. Zalesky,
J. R. Weaver,
C. J. R. McPartland,
G. Murphree,
I. Valdes,
C. K. Jespersen,
S. Taamoli,
N. Chartab,
N. Allen,
S. W. J. Barrow,
D. B. Sanders,
S. Toft,
B. Mobasher,
I. Szapudi,
B. Altieri,
A. Amara,
S. Andreon,
N. Auricchio,
C. Baccigalupi,
M. Baldi,
S. Bardelli,
P. Battaglia,
A. Biviano,
D. Bonino
, et al. (282 additional authors not shown)
Abstract:
The Cosmic Dawn Survey Pre-launch (PL) catalogues cover an effective 10.13 deg$^{2}$ area with uniform deep Spitzer/IRAC data ($m\sim25$ mag, 5$σ$), the largest area covered to these depths in the infrared. These data are used to gain new insight into the growth of stellar mass across cosmic history by characterising the evolution of the galaxy stellar mass function (GSMF) through…
▽ More
The Cosmic Dawn Survey Pre-launch (PL) catalogues cover an effective 10.13 deg$^{2}$ area with uniform deep Spitzer/IRAC data ($m\sim25$ mag, 5$σ$), the largest area covered to these depths in the infrared. These data are used to gain new insight into the growth of stellar mass across cosmic history by characterising the evolution of the galaxy stellar mass function (GSMF) through $0.2 < z \leq 6.5$. The total volume (0.62 Gpc$^{3}$) represents a tenfold increase compared to previous works that have explored $z > 3$ and significantly reduces cosmic variance, yielding strong constraints on the abundance of massive galaxies. Results are generally consistent with the literature but now provide firm estimates of number density where only upper limits were previously available. Contrasting the GSMF with the dark matter halo mass function suggests that massive galaxies ($M \gtrsim10^{11}$ M$_{\odot}$) at $z > 3.5$ required integrated star-formation efficiencies of $M/(M_{\rm h}f_{\rm b}) \gtrsim$ 0.25--0.5, in excess of the commonly-held view of ``universal peak efficiency" from studies on the stellar-to-halo mass relation (SHMR). Such increased efficiencies imply an evolving peak in the SHMR at $z > 3.5$ which can be maintained if feedback mechanisms from active galactic nuclei and stellar processes are ineffective at early times. In addition, a significant fraction of the most massive quiescent galaxies are observed to be in place already by $z\sim 2.5$--3. The apparent lack in change of their number density by $z\sim 0.2$ is consistent with relatively little mass growth from mergers. Utilising the unique volume, evidence for an environmental dependence of the galaxy stellar mass function is found all the way through $z\sim 3.5$ for the first time, though a more careful characterisation of the density field is ultimately required for confirmation.
△ Less
Submitted 24 April, 2025;
originally announced April 2025.
-
Data-driven model order reduction for T-Product-Based dynamical systems
Authors:
Shenghan Mei,
Ziqin He,
Yidan Mei,
Xin Mao,
Anqi Dong,
Ren Wang,
Can Chen
Abstract:
Model order reduction plays a crucial role in simplifying complex systems while preserving their essential dynamic characteristics, making it an invaluable tool in a wide range of applications, including robotic systems, signal processing, and fluid dynamics. However, traditional model order reduction techniques like balanced truncation are not designed to handle tensor data directly and instead r…
▽ More
Model order reduction plays a crucial role in simplifying complex systems while preserving their essential dynamic characteristics, making it an invaluable tool in a wide range of applications, including robotic systems, signal processing, and fluid dynamics. However, traditional model order reduction techniques like balanced truncation are not designed to handle tensor data directly and instead require unfolding the data, which may lead to the loss of important higher-order structural information. In this article, we introduce a novel framework for data-driven model order reduction of T-product-based dynamical systems (TPDSs), which are often used to capture the evolution of third-order tensor data such as images and videos through the T-product. Specifically, we develop advanced T-product-based techniques, including T-balanced truncation, T-balanced proper orthogonal decomposition, and the T-eigensystem realization algorithm for input-output TPDSs by leveraging the unique properties of T-singular value decomposition. We demonstrate that these techniques offer significant memory and computational savings while achieving reduction errors that are comparable to those of conventional methods. The effectiveness of the proposed framework is further validated through synthetic and real-world examples.
△ Less
Submitted 20 April, 2025;
originally announced April 2025.
-
Filter2Noise: Interpretable Self-Supervised Single-Image Denoising for Low-Dose CT with Attention-Guided Bilateral Filtering
Authors:
Yipeng Sun,
Linda-Sophie Schneider,
Mingxuan Gu,
Siyuan Mei,
Chengze Ye,
Fabian Wagner,
Siming Bayer,
Andreas Maier
Abstract:
Effective denoising is crucial in low-dose CT to enhance subtle structures and low-contrast lesions while preventing diagnostic errors. Supervised methods struggle with limited paired datasets, and self-supervised approaches often require multiple noisy images and rely on deep networks like U-Net, offering little insight into the denoising mechanism. To address these challenges, we propose an inte…
▽ More
Effective denoising is crucial in low-dose CT to enhance subtle structures and low-contrast lesions while preventing diagnostic errors. Supervised methods struggle with limited paired datasets, and self-supervised approaches often require multiple noisy images and rely on deep networks like U-Net, offering little insight into the denoising mechanism. To address these challenges, we propose an interpretable self-supervised single-image denoising framework -- Filter2Noise (F2N). Our approach introduces an Attention-Guided Bilateral Filter that adapted to each noisy input through a lightweight module that predicts spatially varying filter parameters, which can be visualized and adjusted post-training for user-controlled denoising in specific regions of interest. To enable single-image training, we introduce a novel downsampling shuffle strategy with a new self-supervised loss function that extends the concept of Noise2Noise to a single image and addresses spatially correlated noise. On the Mayo Clinic 2016 low-dose CT dataset, F2N outperforms the leading self-supervised single-image method (ZS-N2N) by 4.59 dB PSNR while improving transparency, user control, and parametric efficiency. These features provide key advantages for medical applications that require precise and interpretable noise reduction. Our code is demonstrated at https://github.com/sypsyp97/Filter2Noise.git .
△ Less
Submitted 18 April, 2025;
originally announced April 2025.
-
Euclid preparation. Estimating galaxy physical properties using CatBoost chained regressors with attention
Authors:
Euclid Collaboration,
A. Humphrey,
P. A. C. Cunha,
L. Bisigello,
C. Tortora,
M. Bolzonella,
L. Pozzetti,
M. Baes,
B. R. Granett,
A. Amara,
S. Andreon,
N. Auricchio,
C. Baccigalupi,
M. Baldi,
S. Bardelli,
A. Biviano,
C. Bodendorf,
D. Bonino,
E. Branchini,
M. Brescia,
J. Brinchmann,
S. Camera,
G. Cañas-Herrera,
V. Capobianco,
C. Carbone
, et al. (210 additional authors not shown)
Abstract:
Euclid will image ~14000 deg^2 of the extragalactic sky at visible and NIR wavelengths, providing a dataset of unprecedented size and richness that will facilitate a multitude of studies into the evolution of galaxies. In the vast majority of cases the main source of information will come from broad-band images and data products thereof. Therefore, there is a pressing need to identify or develop s…
▽ More
Euclid will image ~14000 deg^2 of the extragalactic sky at visible and NIR wavelengths, providing a dataset of unprecedented size and richness that will facilitate a multitude of studies into the evolution of galaxies. In the vast majority of cases the main source of information will come from broad-band images and data products thereof. Therefore, there is a pressing need to identify or develop scalable yet reliable methodologies to estimate the redshift and physical properties of galaxies using broad-band photometry from Euclid, optionally including ground-based optical photometry also. To address this need, we present a novel method to estimate the redshift, stellar mass, star-formation rate, specific star-formation rate, E(B-V), and age of galaxies, using mock Euclid and ground-based photometry. The main novelty of our property-estimation pipeline is its use of the CatBoost implementation of gradient-boosted regression-trees, together with chained regression and an intelligent, automatic optimization of the training data. The pipeline also includes a computationally-efficient method to estimate prediction uncertainties, and, in the absence of ground-truth labels, provides accurate predictions for metrics of model performance up to z~2. We apply our pipeline to several datasets consisting of mock Euclid broad-band photometry and mock ground-based ugriz photometry, to evaluate the performance of our methodology for estimating the redshift and physical properties of galaxies detected in the Euclid Wide Survey. The quality of our photometric redshift and physical property estimates are highly competitive overall, validating our modeling approach. We find that the inclusion of ground-based optical photometry significantly improves the quality of the property estimation, highlighting the importance of combining Euclid data with ancillary ground-based optical data. (Abridged)
△ Less
Submitted 17 April, 2025;
originally announced April 2025.
-
UltraRAG: A Modular and Automated Toolkit for Adaptive Retrieval-Augmented Generation
Authors:
Yuxuan Chen,
Dewen Guo,
Sen Mei,
Xinze Li,
Hao Chen,
Yishan Li,
Yixuan Wang,
Chaoyue Tang,
Ruobing Wang,
Dingjun Wu,
Yukun Yan,
Zhenghao Liu,
Shi Yu,
Zhiyuan Liu,
Maosong Sun
Abstract:
Retrieval-Augmented Generation (RAG) significantly enhances the performance of large language models (LLMs) in downstream tasks by integrating external knowledge. To facilitate researchers in deploying RAG systems, various RAG toolkits have been introduced. However, many existing RAG toolkits lack support for knowledge adaptation tailored to specific application scenarios. To address this limitati…
▽ More
Retrieval-Augmented Generation (RAG) significantly enhances the performance of large language models (LLMs) in downstream tasks by integrating external knowledge. To facilitate researchers in deploying RAG systems, various RAG toolkits have been introduced. However, many existing RAG toolkits lack support for knowledge adaptation tailored to specific application scenarios. To address this limitation, we propose UltraRAG, a RAG toolkit that automates knowledge adaptation throughout the entire workflow, from data construction and training to evaluation, while ensuring ease of use. UltraRAG features a user-friendly WebUI that streamlines the RAG process, allowing users to build and optimize systems without coding expertise. It supports multimodal input and provides comprehensive tools for managing the knowledge base. With its highly modular architecture, UltraRAG delivers an end-to-end development solution, enabling seamless knowledge adaptation across diverse user scenarios. The code, demonstration videos, and installable package for UltraRAG are publicly available at https://github.com/OpenBMB/UltraRAG.
△ Less
Submitted 30 March, 2025;
originally announced April 2025.
-
Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models
Authors:
Jiawei Lian,
Jianhong Pan,
Lefan Wang,
Yi Wang,
Shaohui Mei,
Lap-Pui Chau
Abstract:
Large language models (LLMs) are foundational explorations to artificial general intelligence, yet their alignment with human values via instruction tuning and preference learning achieves only superficial compliance. Here, we demonstrate that harmful knowledge embedded during pretraining persists as indelible "dark patterns" in LLMs' parametric memory, evading alignment safeguards and resurfacing…
▽ More
Large language models (LLMs) are foundational explorations to artificial general intelligence, yet their alignment with human values via instruction tuning and preference learning achieves only superficial compliance. Here, we demonstrate that harmful knowledge embedded during pretraining persists as indelible "dark patterns" in LLMs' parametric memory, evading alignment safeguards and resurfacing under adversarial inducement at distributional shifts. In this study, we first theoretically analyze the intrinsic ethical vulnerability of aligned LLMs by proving that current alignment methods yield only local "safety regions" in the knowledge manifold. In contrast, pretrained knowledge remains globally connected to harmful concepts via high-likelihood adversarial trajectories. Building on this theoretical insight, we empirically validate our findings by employing semantic coherence inducement under distributional shifts--a method that systematically bypasses alignment constraints through optimized adversarial prompts. This combined theoretical and empirical approach achieves a 100% attack success rate across 19 out of 23 state-of-the-art aligned LLMs, including DeepSeek-R1 and LLaMA-3, revealing their universal vulnerabilities.
△ Less
Submitted 2 June, 2025; v1 submitted 7 April, 2025;
originally announced April 2025.
-
Euclid Quick Data Release (Q1) Ultracool dwarfs in the Euclid Deep Field North
Authors:
A. Mohandasan,
R. L. Smart,
C. Reylé,
V. Le Brun,
A. Pérez-Garrido,
E. Bañados,
B. Goldman,
S. L. Casewell,
M. R. Zapatero Osorio,
T. Dupuy,
M. Rejkuba,
E. L. Martín,
C. Dominguez-Tagle,
M. {Ž}erjal,
N. Huélamo,
N. Lodieu,
P. Cruz,
R. Rebolo,
M. W. Phillips,
J. -Y. Zhang,
N. Aghanim,
B. Altieri,
A. Amara,
S. Andreon,
N. Auricchio
, et al. (154 additional authors not shown)
Abstract:
Ultracool dwarfs (UCDs) encompass the lowest mass stars and brown dwarfs, defining the stellar substellar boundary. They have significant potential for advancing the understanding of substellar physics; however, these objects are challenging to detect due to their low luminosity. The wide coverage and deep sensitivity of the Euclid survey will increase the number of confirmed and well characterise…
▽ More
Ultracool dwarfs (UCDs) encompass the lowest mass stars and brown dwarfs, defining the stellar substellar boundary. They have significant potential for advancing the understanding of substellar physics; however, these objects are challenging to detect due to their low luminosity. The wide coverage and deep sensitivity of the Euclid survey will increase the number of confirmed and well characterised UCDs by several orders of magnitude. In this study, we take advantage of the Euclid Quick Data Release (Q1) and in particular we look in detail at the known and new UCDs in the Euclid Deep Field North (22.9 deg2 down to JE = 24.5 mag), to understand the advantages of using the slitless Euclid spectroscopy. We compile a comparison sample of known UCDs and use their spectra to demonstrate the capability of Euclid to derive spectral types using a template matching method. This method is then applied to the spectra of the newly identified candidates. We confirm that 33 of these candidates are new UCDs, with spectral types ranging from M7 to T1 and JE = 17 to 21 mag. We look at their locus in colour colour diagrams and compare them with the expected colours of QSOs. A machine readable catalogue is provided for further study, containing both the comparison sample and the newly identified UCDs, along with their spectral classifications where the Q1 spectra quality allows for confident determination
△ Less
Submitted 28 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1): A photometric search for ultracool dwarfs in the Euclid Deep Fields
Authors:
M. Žerjal,
C. Dominguez-Tagle,
N. Sedighi,
E. L. Martín,
N. Lodieu,
B. Goldman,
C. Reylé,
R. L. Smart,
A. Mohandasan,
M. R. Zapatero Osorio,
D. Barrado,
P. Mas Buitrago,
N. Vitas,
P. Cruz,
V. J. S. Béjar,
H. Bouy,
A. Burgasser,
S. Muñoz Torres,
N. Phan-Bao,
E. Solano,
R. Tata,
S. Tsilia,
J. -Y. Zhang,
N. Aghanim,
B. Altieri
, et al. (155 additional authors not shown)
Abstract:
We present a catalogue of more than 5000 new ultracool dwarf (UCD) candidates in the three Euclid Deep Fields in the Q1 data release. They range from late M to late T dwarfs, and include 1200 L and T dwarfs. More than 100 of them have been spectroscopically confirmed, with seven of them being T dwarfs. Our UCD selection criteria are based only on colour ($I_\mathrm{E}-Y_\mathrm{E}>2.5$). The combi…
▽ More
We present a catalogue of more than 5000 new ultracool dwarf (UCD) candidates in the three Euclid Deep Fields in the Q1 data release. They range from late M to late T dwarfs, and include 1200 L and T dwarfs. More than 100 of them have been spectroscopically confirmed, with seven of them being T dwarfs. Our UCD selection criteria are based only on colour ($I_\mathrm{E}-Y_\mathrm{E}>2.5$). The combined requirement for optical detection and stringent signal-to-noise ratio threshold ensure a high purity of the sample, but at the expense of completeness, especially for T dwarfs. The detections range from magnitudes 19 and 24 in the near-infrared bands, and extend down to 26 in the optical band. The average surface density of detected UCDs on the sky is approximately 100 objects per $\mathrm{deg}^2$, including 20 L and T dwarfs per $\mathrm{deg}^2$. This leads to an expectation of at least 1.4 million ultracool dwarfs in the final data release of the Euclid Wide Survey, including at least 300,000 L dwarfs, and more than 2,600 T dwarfs, using the strict selection criteria from this work. We provide empirical Euclid colours as a function of spectral type, and a probability that an object with a given colour has a certain spectral type.
△ Less
Submitted 28 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). The Euclid view on Planck galaxy protocluster candidates: towards a probe of the highest sites of star formation at cosmic noon
Authors:
Euclid Collaboration,
T. Dusserre,
H. Dole,
F. Sarron,
G. Castignani,
N. Ramos-Chernenko,
N. Aghanim,
A. Garic,
I. -E. Mellouki,
N. Dagoneau,
O. Chapuis,
B. L. Frye,
M. Polletta,
H. Dannerbauer,
M. Langer,
L. Maurin,
E. Soubrie,
A. Biviano,
S. Mei,
N. Mai,
B. Altieri,
A. Amara,
S. Andreon,
N. Auricchio,
C. Baccigalupi
, et al. (317 additional authors not shown)
Abstract:
We search for galaxy protoclusters at redshifts $z > 1.5$ in the first data release (Q1) of the $\textit{Euclid}$ survey. We make use of the catalogues delivered by the $\textit{Euclid}$ Science Ground Segment (SGS). After a galaxy selection on the $H_\textrm{E}$ magnitude and on the photometric redshift quality, we undertake the search using the $\texttt{DETECTIFz}$ algorithm, an overdensity find…
▽ More
We search for galaxy protoclusters at redshifts $z > 1.5$ in the first data release (Q1) of the $\textit{Euclid}$ survey. We make use of the catalogues delivered by the $\textit{Euclid}$ Science Ground Segment (SGS). After a galaxy selection on the $H_\textrm{E}$ magnitude and on the photometric redshift quality, we undertake the search using the $\texttt{DETECTIFz}$ algorithm, an overdensity finder based on Delaunay tessellation that uses photometric redshift probability distributions through Monte Carlo simulations. In this pilot study, we conduct a search in the 11 $\textit{Euclid}$ tiles that contain previously known $\textit{Planck}$ high star-forming galaxy protocluster candidates and focus on the two detections that coincide with these regions. These counterparts lie at photometric redshifts $z_\textrm{ph}=1.63^{+0.19}_{-0.23}$ and $z_\textrm{ph}=1.56^{+0.18}_{-0.21}$ and have both been confirmed by two other independent protocluster detection algorithms. We study their colours, their derived stellar masses and star-formation rates, and we estimate their halo mass lower limits. We investigate whether we are intercepting these galaxy overdensities in their `dying' phase, such that the high star-formation rates would be due to their last unsustainable starburst before transitioning to groups or clusters of galaxies. Indeed, some galaxy members are found to lie above the main sequence of galaxies (star-formation rate versus stellar mass). These overdense regions occupy a specific position in the dark matter halo mass / redshift plane where forming galaxy clusters are expected to have experienced a transition between cold flows to shock heating in the halo. Finally, we empirically update the potential for galaxy protocluster discoveries at redshift up to $z \simeq3$ (wide survey) and $z \simeq5.5$ (deep survey) with $\textit{Euclid}$ for the next data release (DR1).
△ Less
Submitted 27 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). First detections from the galaxy cluster workflow
Authors:
Euclid Collaboration,
S. Bhargava,
C. Benoist,
A. H. Gonzalez,
M. Maturi,
J. -B. Melin,
S. A. Stanford,
E. Munari,
M. Vannier,
C. Murray,
S. Maurogordato,
A. Biviano,
J. Macias-Perez,
J. G. Bartlett,
F. Pacaud,
A. Widmer,
M. Meneghetti,
B. Sartoris,
M. Aguena,
G. Alguero,
S. Andreon,
S. Bardelli,
L. Baumont,
M. Bolzonella,
R. Cabanac
, et al. (329 additional authors not shown)
Abstract:
The first survey data release by the Euclid mission covers approximately $63\,\mathrm{deg^2}$ in the Euclid Deep Fields to the same depth as the Euclid Wide Survey. This paper showcases, for the first time, the performance of cluster finders on Euclid data and presents examples of validated clusters in the Quick Release 1 (Q1) imaging data. We identify clusters using two algorithms (AMICO and PZWa…
▽ More
The first survey data release by the Euclid mission covers approximately $63\,\mathrm{deg^2}$ in the Euclid Deep Fields to the same depth as the Euclid Wide Survey. This paper showcases, for the first time, the performance of cluster finders on Euclid data and presents examples of validated clusters in the Quick Release 1 (Q1) imaging data. We identify clusters using two algorithms (AMICO and PZWav) implemented in the Euclid cluster-detection pipeline. We explore the internal consistency of detections from the two codes, and cross-match detections with known clusters from other surveys using external multi-wavelength and spectroscopic data sets. This enables assessment of the Euclid photometric redshift accuracy and also of systematics such as mis-centring between the optical cluster centre and centres based on X-ray and/or Sunyaev--Zeldovich observations. We report 426 joint PZWav and AMICO-detected clusters with high signal-to-noise ratios over the full Q1 area in the redshift range $0.2 \leq z \leq 1.5$. The chosen redshift and signal-to-noise thresholds are motivated by the photometric quality of the early Euclid data. We provide richness estimates for each of the Euclid-detected clusters and show its correlation with various external cluster mass proxies. Out of the full sample, 77 systems are potentially new to the literature. Overall, the Q1 cluster catalogue demonstrates a successful validation of the workflow ahead of the Euclid Data Release 1, based on the consistency of internal and external properties of Euclid-detected clusters.
△ Less
Submitted 24 March, 2025;
originally announced March 2025.
-
Tensor-based homogeneous polynomial dynamical system analysis from data
Authors:
Xin Mao,
Anqi Dong,
Ziqin He,
Yidan Mei,
Shenghan Mei,
Can Chen
Abstract:
Numerous complex real-world systems, such as those in biological, ecological, and social networks, exhibit higher-order interactions that are often modeled using polynomial dynamical systems or homogeneous polynomial dynamical systems (HPDSs). However, identifying system parameters and analyzing key system-theoretic properties remain challenging due to their inherent nonlinearity and complexity, p…
▽ More
Numerous complex real-world systems, such as those in biological, ecological, and social networks, exhibit higher-order interactions that are often modeled using polynomial dynamical systems or homogeneous polynomial dynamical systems (HPDSs). However, identifying system parameters and analyzing key system-theoretic properties remain challenging due to their inherent nonlinearity and complexity, particularly for large-scale systems. To address these challenges, we develop an innovative computational framework in this article that leverages advanced tensor decomposition techniques, namely tensor train and hierarchical Tucker decompositions, to facilitate efficient identification and analysis of HPDSs that can be equivalently represented by tensors. Specifically, we introduce memory-efficient system identification techniques for directly estimating system parameters represented through tensor decompositions from time-series data. Additionally, we develop necessary and sufficient conditions for determining controllability and observability using the tensor decomposition-based representations of HPDSs, accompanied by detailed complexity analyses that demonstrate significant reductions in computational demands. The effectiveness and efficiency of our framework are validated through numerical examples.
△ Less
Submitted 22 March, 2025;
originally announced March 2025.
-
A Statistical Theory of Contrastive Learning via Approximate Sufficient Statistics
Authors:
Licong Lin,
Song Mei
Abstract:
Contrastive learning -- a modern approach to extract useful representations from unlabeled data by training models to distinguish similar samples from dissimilar ones -- has driven significant progress in foundation models. In this work, we develop a new theoretical framework for analyzing data augmentation-based contrastive learning, with a focus on SimCLR as a representative example. Our approac…
▽ More
Contrastive learning -- a modern approach to extract useful representations from unlabeled data by training models to distinguish similar samples from dissimilar ones -- has driven significant progress in foundation models. In this work, we develop a new theoretical framework for analyzing data augmentation-based contrastive learning, with a focus on SimCLR as a representative example. Our approach is based on the concept of \emph{approximate sufficient statistics}, which we extend beyond its original definition in \cite{oko2025statistical} for contrastive language-image pretraining (CLIP) using KL-divergence. We generalize it to equivalent forms and general f-divergences, and show that minimizing SimCLR and other contrastive losses yields encoders that are approximately sufficient. Furthermore, we demonstrate that these near-sufficient encoders can be effectively adapted to downstream regression and classification tasks, with performance depending on their sufficiency and the error induced by data augmentation in contrastive learning. Concrete examples in linear regression and topic classification are provided to illustrate the broad applicability of our results.
△ Less
Submitted 21 March, 2025;
originally announced March 2025.
-
Euclid preparation LXX. Forecasting detection limits for intracluster light in the Euclid Wide Survey
Authors:
Euclid Collaboration,
C. Bellhouse,
J. B. Golden-Marx,
S. P. Bamford,
N. A. Hatch,
M. Kluge,
A. Ellien,
S. L. Ahad,
P. Dimauro,
F. Durret,
A. H. Gonzalez,
Y. Jimenez-Teja,
M. Montes,
M. Sereno,
E. Slezak,
M. Bolzonella,
G. Castignani,
O. Cucciati,
G. De Lucia,
Z. Ghaffari,
L. Moscardini,
R. Pello,
L. Pozzetti,
T. Saifollahi,
A. S. Borlaff
, et al. (270 additional authors not shown)
Abstract:
The intracluster light (ICL) permeating galaxy clusters is a tracer of the cluster's assembly history, and potentially a tracer of their dark matter structure. In this work we explore the capability of the Euclid Wide Survey to detect ICL using H-band mock images. We simulate clusters across a range of redshifts (0.3-1.8) and halo masses ($10^{13.9}$-$10^{15.0}$ M$_\odot$), using an observationall…
▽ More
The intracluster light (ICL) permeating galaxy clusters is a tracer of the cluster's assembly history, and potentially a tracer of their dark matter structure. In this work we explore the capability of the Euclid Wide Survey to detect ICL using H-band mock images. We simulate clusters across a range of redshifts (0.3-1.8) and halo masses ($10^{13.9}$-$10^{15.0}$ M$_\odot$), using an observationally motivated model of the ICL. We identify a 50-200 kpc circular annulus around the brightest cluster galaxy (BCG) in which the signal-to-noise ratio (S/N) of the ICL is maximised and use the S/N within this aperture as our figure of merit for ICL detection. We compare three state-of-the-art methods for ICL detection, and find that a method that performs simple aperture photometry after high-surface brightness source masking is able to detect ICL with minimal bias for clusters more massive than $10^{14.2}$ M$_\odot$. The S/N of the ICL detection is primarily limited by the redshift of the cluster, driven by cosmological dimming, rather than the mass of the cluster. Assuming the ICL in each cluster contains 15% of the stellar light, we forecast that Euclid will be able to measure the presence of ICL in up to $\sim80000$ clusters of $>10^{14.2}$ M$_\odot$ between $z=0.3$ and 1.5 with a S/N$>3$. Half of these clusters will reside below $z=0.75$ and the majority of those below $z=0.6$ will be detected with a S/N $>20$. A few thousand clusters at $1.3<z<1.5$ will have ICL detectable with a S/N greater than 3. The surface brightness profile of the ICL model is strongly dependent on both the mass of the cluster and the redshift at which it is observed so the outer ICL is best observed in the most massive clusters of $>10^{14.7}$ M$_\odot$. Euclid will detect the ICL at more than 500 kpc distance from the BCG, up to $z=0.7$, in several hundred of these massive clusters over its large survey volume.
△ Less
Submitted 21 March, 2025;
originally announced March 2025.
-
Euclid: Star clusters in IC 342, NGC 2403, and Holmberg II
Authors:
S. S. Larsen,
A. M. N. Ferguson,
J. M. Howell,
F. Annibali,
J. -C. Cuillandre,
L. K. Hunt,
A. Lançon,
T. Saifollahi,
D. Massari,
M. N. Le,
N. Aghanim,
B. Altieri,
A. Amara,
S. Andreon,
N. Auricchio,
C. Baccigalupi,
M. Baldi,
A. Balestra,
S. Bardelli,
P. Battaglia,
A. Biviano,
E. Branchini,
M. Brescia,
J. Brinchmann,
S. Camera
, et al. (134 additional authors not shown)
Abstract:
We examine the star cluster populations in the three nearby galaxies IC 342, NGC 2403, and Holmberg II, observed as part of the Euclid Early Release Observations programme. Our main focus is on old globular clusters (GCs), for which the wide field-of-view and excellent image quality of Euclid offer substantial advantages over previous work. For IC 342 this is the first study of stellar clusters ot…
▽ More
We examine the star cluster populations in the three nearby galaxies IC 342, NGC 2403, and Holmberg II, observed as part of the Euclid Early Release Observations programme. Our main focus is on old globular clusters (GCs), for which the wide field-of-view and excellent image quality of Euclid offer substantial advantages over previous work. For IC 342 this is the first study of stellar clusters other than its nuclear cluster. After selection based on size and magnitude criteria, followed by visual inspection, we identify 111 old (> 1 Gyr) GC candidates in IC 342, 50 in NGC 2403 (of which 15 were previously known), and 7 in Holmberg II. In addition, a number of younger and/or intermediate-age candidates are identified. The colour distributions of GC candidates in the two larger galaxies show hints of bimodality with peaks at IE-HE = 0.36 and 0.79 (IC 342) and IE-HE = 0.36 and 0.80 (NGC 2403), corresponding to metallicities of [Fe/H]=-1.5 and [Fe/H]=-0.5, similar to those of the metal-poor and metal-rich GC subpopulations in the Milky Way. The luminosity functions of our GC candidates exhibit an excess of relatively faint objects, relative to a canonical, approximately Gaussian GC luminosity function (GCLF). The excess objects may be similar to those previously identified in other galaxies. The specific frequency of classical old GCs in IC 342, as determined based on the brighter half of the GCLF, appears to be unusually low with SN=0.2-0.3. The combined luminosity function of young and intermediate-age clusters in all three galaxies is consistent with a power-law distribution, dN/dL ~ L^(-2.3+/-0.1) and the total numbers of young clusters brighter than M(IE)=-8 in NGC 2403 and Holmberg II are comparable with those found in their Local Group counterparts, that is, M33 and the Small Magellanic Cloud, respectively.
△ Less
Submitted 20 March, 2025;
originally announced March 2025.
-
Euclid: Early Release Observations -- Interplay between dwarf galaxies and their globular clusters in the Perseus galaxy cluster
Authors:
T. Saifollahi,
A. Lançon,
Michele Cantiello,
J. -C. Cuillandre,
M. Bethermin,
D. Carollo,
P. -A. Duc,
A. Ferré-Mateu,
N. A. Hatch,
M. Hilker,
L. K. Hunt,
F. R. Marleau,
J. Román,
R. Sánchez-Janssen,
C. Tortora,
M. Urbano,
K. Voggel,
M. Bolzonella,
H. Bouy,
M. Kluge,
M. Schirmer,
C. Stone,
C. Giocoli,
J. H. Knapen,
M. N. Le
, et al. (161 additional authors not shown)
Abstract:
We present an analysis of globular clusters (GCs) of dwarf galaxies in the Perseus galaxy cluster to explore the relationship between dwarf galaxy properties and their GCs. Our focus is on GC numbers ($N_{\rm GC}$) and GC half-number radii ($R_{\rm GC}$) around dwarf galaxies, and their relations with host galaxy stellar masses ($M_*$), central surface brightnesses ($μ_0$), and effective radii (…
▽ More
We present an analysis of globular clusters (GCs) of dwarf galaxies in the Perseus galaxy cluster to explore the relationship between dwarf galaxy properties and their GCs. Our focus is on GC numbers ($N_{\rm GC}$) and GC half-number radii ($R_{\rm GC}$) around dwarf galaxies, and their relations with host galaxy stellar masses ($M_*$), central surface brightnesses ($μ_0$), and effective radii ($R_{\rm e}$). Interestingly, we find that at a given stellar mass, $R_{\rm GC}$ is almost independent of the host galaxy $μ_0$ and $R_{\rm e}$, while $R_{\rm GC}/R_{\rm e}$ depends on $μ_0$ and $R_{\rm e}$; lower surface brightness and diffuse dwarf galaxies show $R_{\rm GC}/R_{\rm e}\approx 1$ while higher surface brightness and compact dwarf galaxies show $R_{\rm GC}/R_{\rm e}\approx 1.5$-$2$. This means that for dwarf galaxies of similar stellar mass, the GCs have a similar median extent; however, their distribution is different from the field stars of their host. Additionally, low surface brightness and diffuse dwarf galaxies on average have a higher $N_{\rm GC}$ than high surface brightness and compact dwarf galaxies at any given stellar mass. We also find that UDGs (ultra-diffuse galaxies) and non-UDGs have similar $R_{\rm GC}$, while UDGs have smaller $R_{\rm GC}/R_{\rm e}$ (typically less than 1) and 3-4 times higher $N_{\rm GC}$ than non-UDGs. Examining nucleated and not-nucleated dwarf galaxies, we find that for $M_*>10^8M_{\odot}$, nucleated dwarf galaxies seem to have smaller $R_{\rm GC}$ and $R_{\rm GC}/R_{\rm e}$, with no significant differences between their $N_{\rm GC}$, except at $M_*<10^8M_{\odot}$ where the nucleated dwarf galaxies tend to have a higher $N_{\rm GC}$. Lastly, we explore the stellar-to-halo mass ratio (SHMR) of dwarf galaxies and conclude that the Perseus cluster dwarf galaxies follow the expected SHMR at $z=0$ extrapolated down to $M_*=10^6M_{\odot}$.
△ Less
Submitted 20 March, 2025;
originally announced March 2025.
-
Euclid preparation. Spatially resolved stellar populations of local galaxies with Euclid: a proof of concept using synthetic images with the TNG50 simulation
Authors:
Euclid Collaboration,
Abdurro'uf,
C. Tortora,
M. Baes,
A. Nersesian,
I. Kovačić,
M. Bolzonella,
A. Lançon,
L. Bisigello,
F. Annibali,
M. N. Bremer,
D. Carollo,
C. J. Conselice,
A. Enia,
A. M. N. Ferguson,
A. Ferré-Mateu,
L. K. Hunt,
E. Iodice,
J. H. Knapen,
A. Iovino,
F. R. Marleau,
R. F. Peletier,
R. Ragusa,
M. Rejkuba,
A. S. G. Robotham
, et al. (264 additional authors not shown)
Abstract:
The European Space Agency's Euclid mission will observe approximately 14,000 $\rm{deg}^{2}$ of the extragalactic sky and deliver high-quality imaging for many galaxies. The depth and high spatial resolution of the data will enable a detailed analysis of stellar population properties of local galaxies. In this study, we test our pipeline for spatially resolved SED fitting using synthetic images of…
▽ More
The European Space Agency's Euclid mission will observe approximately 14,000 $\rm{deg}^{2}$ of the extragalactic sky and deliver high-quality imaging for many galaxies. The depth and high spatial resolution of the data will enable a detailed analysis of stellar population properties of local galaxies. In this study, we test our pipeline for spatially resolved SED fitting using synthetic images of Euclid, LSST, and GALEX generated from the TNG50 simulation. We apply our pipeline to 25 local simulated galaxies to recover their resolved stellar population properties. We produce 3 types of data cubes: GALEX + LSST + Euclid, LSST + Euclid, and Euclid-only. We perform the SED fitting tests with two SPS models in a Bayesian framework. Because the age, metallicity, and dust attenuation estimates are biased when applying only classical formulations of flat priors, we examine the effects of additional priors in the forms of mass-age-$Z$ relations, constructed using a combination of empirical and simulated data. Stellar-mass surface densities can be recovered well using any of the 3 data cubes, regardless of the SPS model and prior variations. The new priors then significantly improve the measurements of mass-weighted age and $Z$ compared to results obtained without priors, but they may play an excessive role compared to the data in determining the outcome when no UV data is available. The spatially resolved SED fitting method is powerful for mapping the stellar populations of galaxies with the current abundance of high-quality imaging data. Our study re-emphasizes the gain added by including multiwavelength data from ancillary surveys and the roles of priors in Bayesian SED fitting. With the Euclid data alone, we will be able to generate complete and deep stellar mass maps of galaxies in the local Universe, thus exploiting the telescope's wide field, NIR sensitivity, and high spatial resolution.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid: Quick Data Release (Q1) -- A census of dwarf galaxies across a range of distances and environments
Authors:
F. R. Marleau,
R. Habas,
D. Carollo,
C. Tortora,
P. -A. Duc,
E. Sola,
T. Saifollahi,
M. Fügenschuh,
M. Walmsley,
R. Zöller,
A. Ferré-Mateu,
M. Cantiello,
M. Urbano,
E. Saremi,
R. Ragusa,
R. Laureijs,
M. Hilker,
O. Müller,
M. Poulain,
R. F. Peletier,
S. J. Sprenger,
O. Marchal,
N. Aghanim,
B. Altieri,
A. Amara
, et al. (182 additional authors not shown)
Abstract:
The Euclid Q1 fields were selected for calibration purposes in cosmology and are therefore relatively devoid of nearby galaxies. However, this is precisely what makes them interesting fields in which to search for dwarf galaxies in local density environments. We take advantage of the unprecedented depth, spatial resolution, and field of view of the Euclid Quick Release (Q1) to build a census of dw…
▽ More
The Euclid Q1 fields were selected for calibration purposes in cosmology and are therefore relatively devoid of nearby galaxies. However, this is precisely what makes them interesting fields in which to search for dwarf galaxies in local density environments. We take advantage of the unprecedented depth, spatial resolution, and field of view of the Euclid Quick Release (Q1) to build a census of dwarf galaxies in these regions. We have identified dwarfs in a representative sample of 25 contiguous tiles in the Euclid Deep Field North (EDF-N), covering an area of 14.25 sq. deg. The dwarf candidates were identified using a semi-automatic detection method, based on properties measured by the Euclid pipeline and listed in the MER catalogue. A selection cut in surface brightness and magnitude was used to produce an initial dwarf candidate catalogue, followed by a cut in morphology and colour. This catalogue was visually classified to produce a final sample of dwarf candidates, including their morphology, number of nuclei, globular cluster (GC) richness, and presence of a blue compact centre. We identified 2674 dwarf candidates, corresponding to 188 dwarfs per sq. deg. The visual classification of the dwarfs reveals a slightly uneven morphological mix of 58% ellipticals and 42% irregulars, with very few potentially GC-rich (1.0%) and nucleated (4.0%) candidates but a noticeable fraction (6.9%) of dwarfs with blue compact centres. The distance distribution of 388 (15%) of the dwarfs with spectroscopic redshifts peaks at about 400 Mpc. Their stellar mass distribution confirms that our selection effectively identifies dwarfs while minimising contamination. The most prominent dwarf overdensities are dominated by dEs, while dIs are more evenly distributed. This work highlights Euclid's remarkable ability to detect and characterise dwarf galaxies across diverse masses, distances, and environments.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). Galaxy shapes and alignments in the cosmic web
Authors:
Euclid Collaboration,
C. Laigle,
C. Gouin,
F. Sarron,
L. Quilley,
C. Pichon,
K. Kraljic,
F. Durret,
N. E. Chisari,
U. Kuchner,
N. Malavasi,
M. Magliocchetti,
H. J. McCracken,
J. G. Sorce,
Y. Kang,
C. J. R. McPartland,
S. Toft,
N. Aghanim,
B. Altieri,
A. Amara,
S. Andreon,
N. Auricchio,
H. Aussel,
C. Baccigalupi,
M. Baldi
, et al. (319 additional authors not shown)
Abstract:
Galaxy morphologies and shape orientations are expected to correlate with their large-scale environment, since they grow by accreting matter from the cosmic web and are subject to interactions with other galaxies. Cosmic filaments are extracted in projection from the Euclid Quick Data Release 1 (covering 63.1 $\mathrm{deg}^2$) at $0.5<z<0.9$ in tomographic slices of 170 comoving…
▽ More
Galaxy morphologies and shape orientations are expected to correlate with their large-scale environment, since they grow by accreting matter from the cosmic web and are subject to interactions with other galaxies. Cosmic filaments are extracted in projection from the Euclid Quick Data Release 1 (covering 63.1 $\mathrm{deg}^2$) at $0.5<z<0.9$ in tomographic slices of 170 comoving $h^{-1}\mathrm{Mpc}$ using photometric redshifts. Galaxy morphologies are accurately retrieved thanks to the excellent resolution of VIS data. The distribution of massive galaxies ($M_* > 10^{10} M_\odot$) in the projected cosmic web is analysed as a function of morphology measured from VIS data. Specifically, the 2D alignment of galaxy shapes with large-scale filaments is quantified as a function of Sérsic indices and masses. We find the known trend that more massive galaxies are closer to filament spines. At fixed stellar masses, morphologies correlate both with densities and distances to large-scale filaments. In addition, the large volume of this data set allows us to detect a signal indicating that there is a preferential alignment of the major axis of massive early-type galaxies along projected cosmic filaments. Overall, these results demonstrate our capabilities to carry out detailed studies of galaxy environments with Euclid, which will be extended to higher redshift and lower stellar masses with the future Euclid Deep Survey.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). The role of cosmic connectivity in shaping galaxy clusters
Authors:
Euclid Collaboration,
C. Gouin,
C. Laigle,
F. Sarron,
T. Bonnaire,
J. G. Sorce,
N. Aghanim,
M. Magliocchetti,
L. Quilley,
P. Boldrini,
F. Durret,
C. Pichon,
U. Kuchner,
N. Malavasi,
K. Kraljic,
R. Gavazzi,
Y. Kang,
S. A. Stanford,
P. Awad,
B. Altieri,
A. Amara,
S. Andreon,
N. Auricchio,
H. Aussel,
C. Baccigalupi
, et al. (315 additional authors not shown)
Abstract:
The matter distribution around galaxy clusters is distributed over several filaments, reflecting their positions as nodes in the large-scale cosmic web. The number of filaments connected to a cluster, namely its connectivity, is expected to affect the physical properties of clusters. Using the first Euclid galaxy catalogue from the Euclid Quick Release 1 (Q1), we investigate the connectivity of ga…
▽ More
The matter distribution around galaxy clusters is distributed over several filaments, reflecting their positions as nodes in the large-scale cosmic web. The number of filaments connected to a cluster, namely its connectivity, is expected to affect the physical properties of clusters. Using the first Euclid galaxy catalogue from the Euclid Quick Release 1 (Q1), we investigate the connectivity of galaxy clusters and how it correlates with their physical and galaxy member properties. Around 220 clusters located within the three fields of Q1 (covering $\sim 63 \ \text{deg}^2$), are analysed in the redshift range $0.2 < z < 0.7$. Due to the photometric redshift uncertainty, we reconstruct the cosmic web skeleton, and measure cluster connectivity, in 2-D projected slices with a thickness of 170 comoving $h^{-1}.\text{Mpc}$ and centred on each cluster redshift, by using two different filament finder algorithms on the most massive galaxies ($M_*\ > 10^{10.3} \ M_\odot$). In agreement with previous measurements, we recover the mass-connectivity relation independently of the filament detection algorithm, showing that the most massive clusters are, on average, connected to a larger number of cosmic filaments, consistent with hierarchical structure formation models. Furthermore, we explore possible correlations between connectivities and two cluster properties: the fraction of early-type galaxies and the Sérsic index of galaxy members. Our result suggests that the clusters populated by early-type galaxies exhibit higher connectivity compared to clusters dominated by late-type galaxies. These preliminary investigations highlight our ability to quantify the impact of the cosmic web connectivity on cluster properties with Euclid.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). Combined Euclid and Spitzer galaxy density catalogues at $z>$ 1.3 and detection of significant Euclid passive galaxy overdensities in Spitzer overdense regions
Authors:
Euclid Collaboration,
N. Mai,
S. Mei,
C. Cleland,
R. Chary,
J. G. Bartlett,
G. Castignani,
H. Dannerbauer,
G. De Lucia,
F. Fontanot,
D. Scott,
S. Andreon,
S. Bhargava,
H. Dole,
T. DUSSERRE,
S. A. Stanford,
V. P. Tran,
J. R. Weaver,
P. -A. Duc,
I. Risso,
N. Aghanim,
B. Altieri,
A. Amara,
N. Auricchio,
H. Aussel
, et al. (286 additional authors not shown)
Abstract:
Euclid will detect tens of thousands of clusters and protoclusters at $z$>1.3. With a total coverage of 63.1deg$^2$, the Euclid Quick Data Release 1 (Q1) is large enough to detect tens of clusters and hundreds of protoclusters at these early epochs. The Q1 photometric redshift catalogue enables us to detect clusters out to $z$ < 1.5; however, infrared imaging from Spitzer extends this limit to hig…
▽ More
Euclid will detect tens of thousands of clusters and protoclusters at $z$>1.3. With a total coverage of 63.1deg$^2$, the Euclid Quick Data Release 1 (Q1) is large enough to detect tens of clusters and hundreds of protoclusters at these early epochs. The Q1 photometric redshift catalogue enables us to detect clusters out to $z$ < 1.5; however, infrared imaging from Spitzer extends this limit to higher redshifts by using high local projected densities of Spitzer-selected galaxies as signposts for cluster and protocluster candidates. We use Spitzer imaging of the Euclid Deep Fields (EDFs) to derive densities for a sample of Spitzer-selected galaxies at redshifts $z$ > 1.3, building Spitzer IRAC1 and IRAC2 photometric catalogues that are 95% complete at a magnitude limit of IRAC2=22.2, 22.6, and 22.8 for the EDF-S, EDF-F, and EDF-N, respectively. We apply two complementary methods to calculate galaxy densities: (1) aperture and surface density; and (2) the Nth-nearest-neighbour method. When considering a sample selected at a magnitude limit of IRAC2 < 22.2, at which all three EDFs are 95% complete, our surface density distributions are consistent among the three EDFs and with the SpUDS blank field survey. We also considered a deeper sample (IRAC2 < 22.8), finding that 2% and 3% of the surface densities in the North and Fornax fields are 3$σ$ higher than the average field distribution and similar to densities found in the CARLA cluster survey. Our surface densities are also consistent with predictions from the GAEA semi-analytical model. Using combined Euclid and ground-based i-band photometry we show that our highest Spitzer-selected galaxy overdense regions, found at $z$~1.5, also host high densities of passive galaxies. This means that we measure densities consistent with those found in clusters and protoclusters at $z$>1.3.
△ Less
Submitted 20 March, 2025; v1 submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). The first catalogue of strong-lensing galaxy clusters
Authors:
Euclid Collaboration,
P. Bergamini,
M. Meneghetti,
A. Acebron,
B. Clément,
M. Bolzonella,
C. Grillo,
P. Rosati,
D. Abriola,
J. A. Acevedo Barroso,
G. Angora,
L. Bazzanini,
R. Cabanac,
B. C. Nagam,
A. R. Cooray,
G. Despali,
G. Di Rosa,
J. M. Diego,
M. Fogliardi,
A. Galan,
R. Gavazzi,
G. Granata,
N. B. Hogg,
K. Jahnke,
L. Leuzzi
, et al. (353 additional authors not shown)
Abstract:
We present the first catalogue of strong lensing galaxy clusters identified in the Euclid Quick Release 1 observations (covering $63.1\,\mathrm{deg^2}$). This catalogue is the result of the visual inspection of 1260 cluster fields. Each galaxy cluster was ranked with a probability, $\mathcal{P}_{\mathrm{lens}}$, based on the number and plausibility of the identified strong lensing features. Specif…
▽ More
We present the first catalogue of strong lensing galaxy clusters identified in the Euclid Quick Release 1 observations (covering $63.1\,\mathrm{deg^2}$). This catalogue is the result of the visual inspection of 1260 cluster fields. Each galaxy cluster was ranked with a probability, $\mathcal{P}_{\mathrm{lens}}$, based on the number and plausibility of the identified strong lensing features. Specifically, we identified 83 gravitational lenses with $\mathcal{P}_{\mathrm{lens}}>0.5$, of which 14 have $\mathcal{P}_{\mathrm{lens}}=1$, and clearly exhibiting secure strong lensing features, such as giant tangential and radial arcs, and multiple images. Considering the measured number density of lensing galaxy clusters, approximately $0.3\,\mathrm{deg}^{-2}$ for $\mathcal{P}_{\mathrm{lens}}>0.9$, we predict that \Euclid\ will likely see more than 4500 strong lensing clusters over the course of the mission. Notably, only three of the identified cluster-scale lenses had been previously observed from space. Thus, \Euclid has provided the first high-resolution imaging for the remaining $80$ galaxy cluster lenses, including those with the highest probability. The identified strong lensing features will be used for training deep-learning models for identifying gravitational arcs and multiple images automatically in \Euclid observations. This study confirms the huge potential of \Euclid for finding new strong lensing clusters, enabling exciting new discoveries on the nature of dark matter and dark energy and the study of the high-redshift Universe.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). LEMON -- Lens Modelling with Neural networks. Automated and fast modelling of Euclid gravitational lenses with a singular isothermal ellipsoid mass profile
Authors:
Euclid Collaboration,
V. Busillo,
C. Tortora,
R. B. Metcalf,
J. W. Nightingale,
M. Meneghetti,
F. Gentile,
R. Gavazzi,
F. Zhong,
R. Li,
B. Clément,
G. Covone,
N. R. Napolitano,
F. Courbin,
M. Walmsley,
E. Jullo,
J. Pearson,
D. Scott,
A. M. C. Le Brun,
L. Leuzzi,
N. Aghanim,
B. Altieri,
A. Amara,
S. Andreon,
H. Aussel
, et al. (290 additional authors not shown)
Abstract:
The Euclid mission aims to survey around 14000 deg^{2} of extragalactic sky, providing around 10^{5} gravitational lens images. Modelling of gravitational lenses is fundamental to estimate the total mass of the lens galaxy, along with its dark matter content. Traditional modelling of gravitational lenses is computationally intensive and requires manual input. In this paper, we use a Bayesian neura…
▽ More
The Euclid mission aims to survey around 14000 deg^{2} of extragalactic sky, providing around 10^{5} gravitational lens images. Modelling of gravitational lenses is fundamental to estimate the total mass of the lens galaxy, along with its dark matter content. Traditional modelling of gravitational lenses is computationally intensive and requires manual input. In this paper, we use a Bayesian neural network, LEns MOdelling with Neural networks (LEMON), for modelling Euclid gravitational lenses with a singular isothermal ellipsoid mass profile. Our method estimates key lens mass profile parameters, such as the Einstein radius, while also predicting the light parameters of foreground galaxies and their uncertainties. We validate LEMON's performance on both mock Euclid data sets, real Euclidised lenses observed with Hubble Space Telescope (hereafter HST), and real Euclid lenses found in the Perseus ERO field, demonstrating the ability of LEMON to predict parameters of both simulated and real lenses. Results show promising accuracy and reliability in predicting the Einstein radius, axis ratio, position angle, effective radius, Sérsic index, and lens magnitude for simulated lens galaxies. The application to real data, including the latest Quick Release 1 strong lens candidates, provides encouraging results, particularly for the Einstein radius. We also verified that LEMON has the potential to accelerate traditional modelling methods, by giving to the classical optimiser the LEMON predictions as starting points, resulting in a speed-up of up to 26 times the original time needed to model a sample of gravitational lenses, a result that would be impossible with randomly initialised guesses. This work represents a significant step towards efficient, automated gravitational lens modelling, which is crucial for handling the large data volumes expected from Euclid.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). The Strong Lensing Discovery Engine E -- Ensemble classification of strong gravitational lenses: lessons for Data Release 1
Authors:
Euclid Collaboration,
P. Holloway,
A. Verma,
M. Walmsley,
P. J. Marshall,
A. More,
T. E. Collett,
N. E. P. Lines,
L. Leuzzi,
A. Manjón-García,
S. H. Vincken,
J. Wilde,
R. Pearce-Casey,
I. T. Andika,
J. A. Acevedo Barroso,
T. Li,
A. Melo,
R. B. Metcalf,
K. Rojas,
B. Clément,
H. Degaudenzi,
F. Courbin,
G. Despali,
R. Gavazzi,
S. Schuldt
, et al. (321 additional authors not shown)
Abstract:
The Euclid Wide Survey (EWS) is expected to identify of order $100\,000$ galaxy-galaxy strong lenses across $14\,000$deg$^2$. The Euclid Quick Data Release (Q1) of $63.1$deg$^2$ Euclid images provides an excellent opportunity to test our lens-finding ability, and to verify the anticipated lens frequency in the EWS. Following the Q1 data release, eight machine learning networks from five teams were…
▽ More
The Euclid Wide Survey (EWS) is expected to identify of order $100\,000$ galaxy-galaxy strong lenses across $14\,000$deg$^2$. The Euclid Quick Data Release (Q1) of $63.1$deg$^2$ Euclid images provides an excellent opportunity to test our lens-finding ability, and to verify the anticipated lens frequency in the EWS. Following the Q1 data release, eight machine learning networks from five teams were applied to approximately one million images. This was followed by a citizen science inspection of a subset of around $100\,000$ images, of which $65\%$ received high network scores, with the remainder randomly selected. The top scoring outputs were inspected by experts to establish confident (grade A), likely (grade B), possible (grade C), and unlikely lenses. In this paper we combine the citizen science and machine learning classifiers into an ensemble, demonstrating that a combined approach can produce a purer and more complete sample than the original individual classifiers. Using the expert-graded subset as ground truth, we find that this ensemble can provide a purity of $52\pm2\%$ (grade A/B lenses) with $50\%$ completeness (for context, due to the rarity of lenses a random classifier would have a purity of $0.05\%$). We discuss future lessons for the first major Euclid data release (DR1), where the big-data challenges will become more significant and will require analysing more than $\sim300$ million galaxies, and thus time investment of both experts and citizens must be carefully managed.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). The Strong Lensing Discovery Engine D -- Double-source-plane lens candidates
Authors:
Euclid Collaboration,
T. Li,
T. E. Collett,
M. Walmsley,
N. E. P. Lines,
K. Rojas,
J. W. Nightingale,
W. J. R. Enzi,
L. A. Moustakas,
C. Krawczyk,
R. Gavazzi,
G. Despali,
P. Holloway,
S. Schuldt,
F. Courbin,
R. B. Metcalf,
D. J. Ballard,
A. Verma,
B. Clément,
H. Degaudenzi,
A. Melo,
J. A. Acevedo Barroso,
L. Leuzzi,
A. Manjón-García,
R. Pearce-Casey
, et al. (313 additional authors not shown)
Abstract:
Strong gravitational lensing systems with multiple source planes are powerful tools for probing the density profiles and dark matter substructure of the galaxies. The ratio of Einstein radii is related to the dark energy equation of state through the cosmological scaling factor $β$. However, galaxy-scale double-source-plane lenses (DSPLs) are extremely rare. In this paper, we report the discovery…
▽ More
Strong gravitational lensing systems with multiple source planes are powerful tools for probing the density profiles and dark matter substructure of the galaxies. The ratio of Einstein radii is related to the dark energy equation of state through the cosmological scaling factor $β$. However, galaxy-scale double-source-plane lenses (DSPLs) are extremely rare. In this paper, we report the discovery of four new galaxy-scale double-source-plane lens candidates in the Euclid Quick Release 1 (Q1) data. These systems were initially identified through a combination of machine learning lens-finding models and subsequent visual inspection from citizens and experts. We apply the widely-used {\tt LensPop} lens forecasting model to predict that the full \Euclid survey will discover 1700 DSPLs, which scales to $6 \pm 3$ DSPLs in 63 deg$^2$, the area of Q1. The number of discoveries in this work is broadly consistent with this forecast. We present lens models for each DSPL and infer their $β$ values. Our initial Q1 sample demonstrates the promise of \Euclid to discover such rare objects.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). The Strong Lensing Discovery Engine C: Finding lenses with machine learning
Authors:
Euclid Collaboration,
N. E. P. Lines,
T. E. Collett,
M. Walmsley,
K. Rojas,
T. Li,
L. Leuzzi,
A. Manjón-García,
S. H. Vincken,
J. Wilde,
P. Holloway,
A. Verma,
R. B. Metcalf,
I. T. Andika,
A. Melo,
M. Melchior,
H. Domínguez Sánchez,
A. Díaz-Sánchez,
J. A. Acevedo Barroso,
B. Clément,
C. Krawczyk,
R. Pearce-Casey,
S. Serjeant,
F. Courbin,
G. Despali
, et al. (328 additional authors not shown)
Abstract:
Strong gravitational lensing has the potential to provide a powerful probe of astrophysics and cosmology, but fewer than 1000 strong lenses have been confirmed so far. With a 0.16'' resolution covering a third of the sky, the Euclid telescope will revolutionise the identification of strong lenses, with 170 000 lenses forecasted to be discovered amongst the 1.5 billion galaxies it will observe. We…
▽ More
Strong gravitational lensing has the potential to provide a powerful probe of astrophysics and cosmology, but fewer than 1000 strong lenses have been confirmed so far. With a 0.16'' resolution covering a third of the sky, the Euclid telescope will revolutionise the identification of strong lenses, with 170 000 lenses forecasted to be discovered amongst the 1.5 billion galaxies it will observe. We present an analysis of the performance of five machine-learning models at finding strong gravitational lenses in the quick release of Euclid data (Q1) covering 63 deg2. The models have been validated by citizen scientists and expert visual inspection. We focus on the best-performing network: a fine-tuned version of the Zoobot pretrained model originally trained to classify galaxy morphologies in heterogeneous astronomical imaging surveys. Of the one million Q1 objects that Zoobot was tasked to find strong lenses within, the top 1000 ranked objects contain 122 grade A lenses (almost-certain lenses) and 41 grade B lenses (probable lenses). A deeper search with the five networks combined with visual inspection yielded 250 (247) grade A (B) lenses, of which 224 (182) are ranked in the top 20 000 by Zoobot. When extrapolated to the full Euclid survey, the highest ranked one million images will contain 75 000 grade A or B strong gravitational lenses.
△ Less
Submitted 26 June, 2025; v1 submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1) The Strong Lensing Discovery Engine B -- Early strong lens candidates from visual inspection of high velocity dispersion galaxies
Authors:
Euclid Collaboration,
K. Rojas,
T. E. Collett,
J. A. Acevedo Barroso,
J. W. Nightingale,
D. Stern,
L. A. Moustakas,
S. Schuldt,
G. Despali,
A. Melo,
M. Walmsley,
D. J. Ballard,
W. J. R. Enzi,
T. Li,
A. Sainz de Murieta,
I. T. Andika,
B. Clément,
F. Courbin,
L. R. Ecker,
R. Gavazzi,
N. Jackson,
A. Kovács,
P. Matavulj,
M. Meneghetti,
S. Serjeant
, et al. (314 additional authors not shown)
Abstract:
We present a search for strong gravitational lenses in Euclid imaging with high stellar velocity dispersion ($σ_ν> 180$ km/s) reported by SDSS and DESI. We performed expert visual inspection and classification of $11\,660$ \Euclid images. We discovered 38 grade A and 40 grade B candidate lenses, consistent with an expected sample of $\sim$32. Palomar spectroscopy confirmed 5 lens systems, while DE…
▽ More
We present a search for strong gravitational lenses in Euclid imaging with high stellar velocity dispersion ($σ_ν> 180$ km/s) reported by SDSS and DESI. We performed expert visual inspection and classification of $11\,660$ \Euclid images. We discovered 38 grade A and 40 grade B candidate lenses, consistent with an expected sample of $\sim$32. Palomar spectroscopy confirmed 5 lens systems, while DESI spectra confirmed one, provided ambiguous results for another, and help to discard one. The \Euclid automated lens modeler modelled 53 candidates, confirming 38 as lenses, failing to model 9, and ruling out 6 grade B candidates. For the remaining 25 candidates we could not gather additional information. More importantly, our expert-classified non-lenses provide an excellent training set for machine learning lens classifiers. We create high-fidelity simulations of \Euclid lenses by painting realistic lensed sources behind the expert tagged (non-lens) luminous red galaxies. This training set is the foundation stone for the \Euclid galaxy-galaxy strong lensing discovery engine.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1): The Strong Lensing Discovery Engine A -- System overview and lens catalogue
Authors:
Euclid Collaboration,
M. Walmsley,
P. Holloway,
N. E. P. Lines,
K. Rojas,
T. E. Collett,
A. Verma,
T. Li,
J. W. Nightingale,
G. Despali,
S. Schuldt,
R. Gavazzi,
A. Melo,
R. B. Metcalf,
I. T. Andika,
L. Leuzzi,
A. Manjón-García,
R. Pearce-Casey,
S. H. Vincken,
J. Wilde,
V. Busillo,
C. Tortora,
J. A. Acevedo Barroso,
H. Dole,
L. R. Ecker
, et al. (350 additional authors not shown)
Abstract:
We present a catalogue of 497 galaxy-galaxy strong lenses in the Euclid Quick Release 1 data (63 deg$^2$). In the initial 0.45\% of Euclid's surveys, we double the total number of known lens candidates with space-based imaging. Our catalogue includes 250 grade A candidates, the vast majority of which (243) were previously unpublished. Euclid's resolution reveals rare lens configurations of scienti…
▽ More
We present a catalogue of 497 galaxy-galaxy strong lenses in the Euclid Quick Release 1 data (63 deg$^2$). In the initial 0.45\% of Euclid's surveys, we double the total number of known lens candidates with space-based imaging. Our catalogue includes 250 grade A candidates, the vast majority of which (243) were previously unpublished. Euclid's resolution reveals rare lens configurations of scientific value including double-source-plane lenses, edge-on lenses, complete Einstein rings, and quadruply-imaged lenses. We resolve lenses with small Einstein radii ($θ_{\rm E} < 1''$) in large numbers for the first time. These lenses are found through an initial sweep by deep learning models, followed by Space Warps citizen scientist inspection, expert vetting, and system-by-system modelling. Our search approach scales straightforwardly to Euclid Data Release 1 and, without changes, would yield approximately 7000 high-confidence (grade A or B) lens candidates by late 2026. Further extrapolating to the complete Euclid Wide Survey implies a likely yield of over 100000 high-confidence candidates, transforming strong lensing science.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). Extending the quest for little red dots to z<4
Authors:
Euclid Collaboration,
L. Bisigello,
G. Rodighiero,
S. Fotopoulou,
F. Ricci,
K. Jahnke,
A. Feltre,
V. Allevato,
F. Shankar,
P. Cassata,
E. Dalla Bontà,
G. Gandolfi,
G. Girardi,
M. Giulietti,
A. Grazian,
C. C. Lovell,
R. Maiolino,
T. Matamoro Zatarain,
M. Mezcua,
I. Prandoni,
D. Roberts,
W. Roster,
M. Salvato,
M. Siudek,
F. Tarsitano
, et al. (326 additional authors not shown)
Abstract:
Recent James Webb Space Telescope (JWST) observations have revealed a population of sources with a compact morphology and a `v-shaped' continuum, namely blue at rest-frame $λ<4000$A and red at longer wavelengths. The nature of these sources, called `little red dots' (LRDs), is still debated, since it is unclear if they host active galactic nuclei (AGN) and their number seems to drastically drop at…
▽ More
Recent James Webb Space Telescope (JWST) observations have revealed a population of sources with a compact morphology and a `v-shaped' continuum, namely blue at rest-frame $λ<4000$A and red at longer wavelengths. The nature of these sources, called `little red dots' (LRDs), is still debated, since it is unclear if they host active galactic nuclei (AGN) and their number seems to drastically drop at z<4. We utilise the 63 $deg^2$ covered by the quick Euclid Quick Data Release (Q1) to extend the search for LRDs to brighter magnitudes and to lower z than what has been possible with JWST to have a broader view of the evolution of this peculiar galaxy population. The selection is done by fitting the available photometric data (Euclid, Spitzer/IRAC, and ground-based griz data) with two power laws, to retrieve the rest-frame optical and UV slopes consistently over a large redshift range (i.e, z<7.6). We exclude extended objects and possible line emitters, and perform a visual inspection to remove imaging artefacts. The final selection includes 3341 LRD candidates from z=0.33 to z=3.6, with 29 detected in IRAC. Their rest-frame UV luminosity function, in contrast with previous JWST studies, shows that the number density of LRD candidates increases from high-z down to z=1.5-2.5 and decreases at even lower z. Less evolution is apparent focusing on the subsample of more robust LRD candidates having IRAC detections, which is affected by low statistics and limited by the IRAC resolution. The comparison with previous quasar UV luminosity functions shows that LRDs are not the dominant AGN population at z<4. Follow-up studies of these LRD candidates are key to confirm their nature, probe their physical properties and check for their compatibility with JWST sources, since the different spatial resolution and wavelength coverage of Euclid and JWST could select different samples of compact sources.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). An investigation of optically faint, red objects in the Euclid Deep Fields
Authors:
Euclid Collaboration,
G. Girardi,
G. Rodighiero,
L. Bisigello,
A. Enia,
A. Grazian,
E. Dalla Bontà,
E. Daddi,
S. Serjeant,
G. Gandolfi,
C. C. Lovell,
K. I. Caputi,
A. Bianchetti,
A. Vietri,
N. Aghanim,
B. Altieri,
A. Amara,
S. Andreon,
N. Auricchio,
H. Aussel,
C. Baccigalupi,
M. Baldi,
A. Balestra,
S. Bardelli,
P. Battaglia
, et al. (304 additional authors not shown)
Abstract:
Our understanding of cosmic star-formation at $z>3$ used to largely rely on rest-frame UV observations. However, these observations overlook dusty and massive sources, resulting in an incomplete census of early star-forming galaxies. Recently, infrared data from Spitzer and the James Webb Space Telescope (JWST) have revealed a hidden population at $z\sim$3-6 with extreme red colours. Taking advant…
▽ More
Our understanding of cosmic star-formation at $z>3$ used to largely rely on rest-frame UV observations. However, these observations overlook dusty and massive sources, resulting in an incomplete census of early star-forming galaxies. Recently, infrared data from Spitzer and the James Webb Space Telescope (JWST) have revealed a hidden population at $z\sim$3-6 with extreme red colours. Taking advantage of the overlap between imaging in the Euclid Deep Fields (EDFs), covering $\sim$ 60 deg$^2$, and ancillary Spitzer observations, we identified 27000 extremely red objects with $H_E-{\rm IRAC}2>2.25$ (dubbed HIEROs) down to a $10σ$ completeness magnitude limit of IRAC2 $=$ 22.5 AB. After a visual inspection to discard artefacts and objects with troubling photometry, we ended up with a final sample of 3900 candidates. We retrieved the physical parameter estimates for these objects from the SED-fitting tool CIGALE. Our results confirm that HIERO galaxies may populate the high-mass end of the stellar mass function at $z>3$, with some reaching extreme stellar masses ($M_*>10^{11}M_\odot$) and exhibiting high dust attenuation ($A_V>3$). However, we consider stellar mass estimates unreliable for $z>3.5$, favouring a lower-z solution. The challenges faced by SED-fitting tools in characterising these objects highlight the need for further studies, incorporating shorter-wavelength and spectroscopic data. Euclid spectra will help resolve degeneracies and better constrain the physical properties of the brightest galaxies. Given the extreme nature of this population, characterising these sources is crucial for understanding galaxy evolution. This work demonstrates Euclid's potential to provide statistical samples of rare, massive, dust-obscured galaxies at $z>3$, which will be prime targets for JWST, ALMA, and ELT.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images
Authors:
Euclid Collaboration,
G. Stevens,
S. Fotopoulou,
M. N. Bremer,
T. Matamoro Zatarain,
K. Jahnke,
B. Margalef-Bentabol,
M. Huertas-Company,
M. J. Smith,
M. Walmsley,
M. Salvato,
M. Mezcua,
A. Paulino-Afonso,
M. Siudek,
M. Talia,
F. Ricci,
W. Roster,
N. Aghanim,
B. Altieri,
S. Andreon,
H. Aussel,
C. Baccigalupi,
M. Baldi,
S. Bardelli,
P. Battaglia
, et al. (249 additional authors not shown)
Abstract:
Light emission from galaxies exhibit diverse brightness profiles, influenced by factors such as galaxy type, structural features and interactions with other galaxies. Elliptical galaxies feature more uniform light distributions, while spiral and irregular galaxies have complex, varied light profiles due to their structural heterogeneity and star-forming activity. In addition, galaxies with an acti…
▽ More
Light emission from galaxies exhibit diverse brightness profiles, influenced by factors such as galaxy type, structural features and interactions with other galaxies. Elliptical galaxies feature more uniform light distributions, while spiral and irregular galaxies have complex, varied light profiles due to their structural heterogeneity and star-forming activity. In addition, galaxies with an active galactic nucleus (AGN) feature intense, concentrated emission from gas accretion around supermassive black holes, superimposed on regular galactic light, while quasi-stellar objects (QSO) are the extreme case of the AGN emission dominating the galaxy. The challenge of identifying AGN and QSO has been discussed many times in the literature, often requiring multi-wavelength observations. This paper introduces a novel approach to identify AGN and QSO from a single image. Diffusion models have been recently developed in the machine-learning literature to generate realistic-looking images of everyday objects. Utilising the spatial resolving power of the Euclid VIS images, we created a diffusion model trained on one million sources, without using any source pre-selection or labels. The model learns to reconstruct light distributions of normal galaxies, since the population is dominated by them. We condition the prediction of the central light distribution by masking the central few pixels of each source and reconstruct the light according to the diffusion model. We further use this prediction to identify sources that deviate from this profile by examining the reconstruction error of the few central pixels regenerated in each source's core. Our approach, solely using VIS imaging, features high completeness compared to traditional methods of AGN and QSO selection, including optical, near-infrared, mid-infrared, and X-rays. [abridged]
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1). The active galaxies of Euclid
Authors:
Euclid Collaboration,
T. Matamoro Zatarain,
S. Fotopoulou,
F. Ricci,
M. Bolzonella,
F. La Franca,
A. Viitanen,
G. Zamorani,
M. B. Taylor,
M. Mezcua,
B. Laloux,
A. Bongiorno,
K. Jahnke,
G. Stevens,
R. A. Shaw,
L. Bisigello,
W. Roster,
Y. Fu,
B. Margalef-Bentabol,
A. La Marca,
F. Tarsitano,
A. Feltre,
J. Calhau,
X. Lopez Lopez,
M. Scialpi
, et al. (333 additional authors not shown)
Abstract:
We present a catalogue of candidate active galactic nuclei (AGN) in the $Euclid$ Quick Release (Q1) fields. For each $Euclid$ source we collect multi-wavelength photometry and spectroscopy information from Galaxy Evolution Explorer (GALEX), $Gaia$, Dark Energy Survey (DES), Wise-field Infrared Survey Explorer (WISE), $Spitzer$, Dark Energy Survey (DESI), and Sloan Digital Sky Survey (SDSS), includ…
▽ More
We present a catalogue of candidate active galactic nuclei (AGN) in the $Euclid$ Quick Release (Q1) fields. For each $Euclid$ source we collect multi-wavelength photometry and spectroscopy information from Galaxy Evolution Explorer (GALEX), $Gaia$, Dark Energy Survey (DES), Wise-field Infrared Survey Explorer (WISE), $Spitzer$, Dark Energy Survey (DESI), and Sloan Digital Sky Survey (SDSS), including spectroscopic redshift from public compilations. We investigate the AGN contents of the Q1 fields by applying selection criteria using $Euclid$ colours and WISE-AllWISE cuts finding respectively 292,222 and 65,131 candidates. We also create a high-purity QSO catalogue based on $Gaia$ DR3 information containing 1971 candidates. Furthermore, we utilise the collected spectroscopic information from DESI to perform broad-line and narrow-line AGN selections, leading to a total of 4392 AGN candidates in the Q1 field. We investigate and refine the Q1 probabilistic random forest QSO population, selecting a total of 180,666 candidates. Additionally, we perform SED fitting on a subset of sources with available $z_{\text{spec}}$, and by utilizing the derived AGN fraction, we identify a total of 7766 AGN candidates. We discuss purity and completeness of the selections and define two new colour selection criteria ($JH$_$I_{\text{E}}Y$ and $I_{\text{E}}H$_$gz$) to improve on purity, finding 313,714 and 267,513 candidates respectively in the Q1 data. We find a total of 229,779 AGN candidates equivalent to an AGN surface density of 3641 deg$^{-2}$ for $18<I_{\text{E}}\leq 24.5$, and a subsample of 30,422 candidates corresponding to an AGN surface density of 482 deg$^{-2}$ when limiting the depth to $18<I_{\text{E}}\leq 22$. The surface density of AGN recovered from this work is in line with predictions based on the AGN X-ray luminosity functions.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.