-
AnomalyMatch: Discovering Rare Objects of Interest with Semi-supervised and Active Learning
Authors:
Pablo Gómez,
David O'Ryan
Abstract:
Anomaly detection in large datasets is essential in fields such as astronomy and computer vision; however, supervised methods typically require extensive anomaly labelling, which is often impractical. We present AnomalyMatch, an anomaly detection framework combining the semi-supervised FixMatch algorithm using EfficientNet classifiers with active learning. By treating anomaly detection as a semi-supervised binary classification problem, we efficiently utilise limited labelled and abundant unlabelled images. We allow iterative model refinement in a user interface for expert verification of high-confidence anomalies and correction of false positives. Built for astronomical data, AnomalyMatch generalises readily to other domains facing similar data challenges. Evaluations on the GalaxyMNIST astronomical dataset and the miniImageNet natural-image benchmark under severe class imbalance (1% anomalies for miniImageNet) display strong performance: starting from five to ten labelled anomalies and after three active learning cycles, we achieve an average AUROC of 0.95 (miniImageNet) and 0.86 (GalaxyMNIST), with respective AUPRC of 0.77 and 0.71. After active learning cycles, anomalies are ranked with 71% (miniImageNet) to 93% precision in the 1% of the highest-ranked images. AnomalyMatch is tailored for large-scale applications, efficiently processing predictions for 100 million images within three days on a single GPU. Integrated into ESA's Datalabs platform, AnomalyMatch facilitates targeted discovery of scientifically valuable anomalies in vast astronomical datasets. Our results underscore the exceptional utility and scalability of this approach for anomaly discovery, highlighting the value of specialised approaches for domains characterised by severe label scarcity.
Submitted 6 May, 2025;
originally announced May 2025.
-
Identifying Astrophysical Anomalies in 99.6 Million Cutouts from the Hubble Legacy Archive Using AnomalyMatch
Authors:
David O'Ryan,
Pablo Gómez
Abstract:
Astronomical archives contain vast quantities of unexplored data that potentially harbour rare and scientifically valuable cosmic phenomena. We leverage new semi-supervised methods to extract such objects from the Hubble Legacy Archive. We have systematically searched approximately 100 million image cutouts from the entire Hubble Legacy Archive using the recently developed AnomalyMatch method, which combines semi-supervised and active learning techniques for the efficient detection of astrophysical anomalies. This comprehensive search rapidly uncovered a multitude of astrophysical anomalies presented here that significantly expand the inventory of known rare objects. Among our discoveries are 138 new candidate gravitational lenses, 18 jellyfish galaxies, and 417 mergers or interacting galaxies. The efficiency and accuracy of our iterative detection strategy allow us to trawl the complete archive within just 2-3 days, highlighting its potential for large-scale astronomical surveys. We present a detailed overview of these newly identified objects, discuss their astrophysical significance, and demonstrate the considerable potential of AnomalyMatch to efficiently explore extensive astronomical datasets, including, e.g., upcoming Euclid data releases.
Submitted 14 May, 2025; v1 submitted 6 May, 2025;
originally announced May 2025.
-
Galaxy Zoo CEERS: Bar fractions up to z~4.0
Authors:
Tobias Géron,
R. J. Smethurst,
Hugh Dickinson,
L. F. Fortson,
Izzy L. Garland,
Sandor Kruk,
Chris Lintott,
Jason Shingirai Makechemu,
Kameswara Bharadwaj Mantha,
Karen L. Masters,
David O'Ryan,
Hayley Roberts,
B. D. Simmons,
Mike Walmsley,
Antonello Calabrò,
Rimpei Chiba,
Luca Costantin,
Maria R. Drout,
Francesca Fragkoudi,
Yuchen Guo,
B. W. Holwerda,
Shardha Jogee,
Anton M. Koekemoer,
Ray A. Lucas,
Fabio Pacucci
Abstract:
We study the evolution of the bar fraction in disc galaxies between $0.5 < z < 4.0$ using multi-band coloured images from JWST CEERS. These images were classified by citizen scientists in a new phase of the Galaxy Zoo project called GZ CEERS. Citizen scientists were asked whether a strong or weak bar was visible in the host galaxy. After considering multiple corrections for observational biases, we find that the bar fraction decreases with redshift in our volume-limited sample (n = 398); from $25^{+6}_{-4}$% at $0.5 < z < 1.0$ to $3^{+6}_{-1}$% at $3.0 < z < 4.0$. However, we argue it is appropriate to interpret these fractions as lower limits. Disentangling real changes in the bar fraction from detection biases remains challenging. Nevertheless, we find a significant number of bars up to $z = 2.5$. This implies that discs are dynamically cool or baryon-dominated, enabling them to host bars. This also suggests that bar-driven secular evolution likely plays an important role at higher redshifts. When we distinguish between strong and weak bars, we find that the weak bar fraction decreases with increasing redshift. In contrast, the strong bar fraction is constant between $0.5 < z < 2.5$. This implies that the strong bars found in this work are robust long-lived structures, unless the rate of bar destruction is similar to the rate of bar formation. Finally, our results are consistent with disc instabilities being the dominant mode of bar formation at lower redshifts, while bar formation through interactions and mergers is more common at higher redshifts.
Submitted 2 May, 2025;
originally announced May 2025.
-
Timescales for the Effects of Interactions on Galaxy Properties and SMBH Growth
Authors:
D. O'Ryan,
B. D. Simmons,
A. L. Faisst,
I. L. Garland,
T. Géron,
G. Gozaliasl,
S. Gillman,
S. G. V. Pinto,
W. C. Keel,
A. M. Koekemoer,
S. Kruk,
K. L. Masters,
O. Montoya C.,
M. Redden,
M. R. Thorne,
E. R. Walls,
D. Weerasinghe,
J. R. Weaver
Abstract:
Galaxy interaction and merging have clear effects on the systems involved. We find an increase in the star formation rate (SFR), potential ignition of active galactic nuclei (AGN) and significant morphology changes. However, at what stage during interactions or mergers these changes begin to occur remains an open question. With a combination of machine learning and visual classification, we select a sample of 3,162 interacting and merging galaxies in the Cosmic Evolution Survey (COSMOS) field across a redshift range of 0.0 - 1.2. We divide this sample into four distinct stages of interaction based on their morphology, each stage representing a different phase of the dynamical timescale. We use the rich ancillary data available in COSMOS to probe the relation between interaction stage, stellar mass, SFR, and AGN fraction. We find that the distribution of SFRs changes rapidly with stage for mass distributions consistent with being drawn from the same parent sample. This is driven by a decrease in the fraction of red sequence galaxies (from 17% as close pairs to 1.4% during merging) and an increase in the fraction of starburst galaxies (from 7% to 32%). We find the AGN fraction increases by a factor of 1.2 only at coalescence. We find the effects of interaction peak at the point of closest approach and coalescence of the two systems. We show that the point in time of the underlying dynamical timescale - and its related morphology - is as important to consider as its projected separation.
Submitted 31 March, 2025;
originally announced April 2025.
-
Galaxy Zoo JWST: Up to 75% of discs are featureless at $3<z<7$
Authors:
R. J. Smethurst,
B. D. Simmons,
T. Géron,
H. Dickinson,
L. Fortson,
I. L. Garland,
S. Kruk,
S. M. Jewell,
C. J. Lintott,
J. S. Makechemu,
K. B. Mantha,
K. L. Masters,
D. O'Ryan,
H. Roberts,
M. R. Thorne,
M. Walmsley,
M. Calabrò,
B. Holwerda,
J. S. Kartaltepe,
A. M. Koekemoer,
Y. Lyu,
R. Lucas,
F. Pacucci,
M. Tarrasse
Abstract:
We have not yet observed the epoch at which disc galaxies emerge in the Universe. While high-$z$ measurements of large-scale features such as bars and spiral arms trace the evolution of disc galaxies, such methods cannot directly quantify featureless discs in the early Universe. Here we identify a substantial population of apparently featureless disc galaxies in the Cosmic Evolution Early Release Science (CEERS) survey by combining quantitative visual morphologies of $\sim 7,000$ galaxies from the Galaxy Zoo JWST CEERS project with a public catalogue of expert visual and parametric morphologies. While the highest-redshift featured disc we identify is at $z_{\rm{phot}}=5.5$, the highest-redshift featureless disc we identify is at $z_{\rm{phot}}=7.4$. The distribution of Sérsic indices for these featureless systems suggests that they truly are dynamically cold: disc-dominated systems have existed since at least $z\sim 7.4$. We place upper limits on the featureless disc fraction as a function of redshift, and show that up to $75\%$ of discs are featureless at $3.0<z<7.4$. This is a conservative limit assuming all galaxies in the sample truly lack features. With further consideration of redshift effects and observational constraints, we find the featureless disc fraction in CEERS imaging at these redshifts is more likely $\sim29-38\%$. We hypothesise that the apparent lack of features in a third of high-redshift discs is due to a higher gas fraction in the early Universe, which allows the discs to be resistant to buckling and instabilities.
Submitted 27 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1) -- First Euclid statistical study of galaxy mergers and their connection to active galactic nuclei
Authors:
Euclid Collaboration,
A. La Marca,
L. Wang,
B. Margalef-Bentabol,
L. Gabarra,
Y. Toba,
M. Mezcua,
V. Rodriguez-Gomez,
F. Ricci,
S. Fotopoulou,
T. Matamoro Zatarain,
V. Allevato,
F. La Franca,
F. Shankar,
L. Bisigello,
G. Stevens,
M. Siudek,
W. Roster,
M. Salvato,
C. Tortora,
L. Spinoglio,
A. W. S. Man,
J. H. Knapen,
M. Baes,
D. O'Ryan
, et al. (312 additional authors not shown)
Abstract:
Galaxy major mergers are a key pathway to trigger AGN. We present the first detection of major mergers in the Euclid Deep Fields and analyse their connection with AGN. We constructed a stellar-mass-complete ($M_*>10^{9.8}\,M_{\odot}$) sample of galaxies from the first quick data release (Q1), in the redshift range z=0.5-2. We selected AGN using X-ray data, optical spectroscopy, mid-infrared colours, and processing $I_E$ observations with an image decomposition algorithm. We used CNNs trained on cosmological simulations to classify galaxies as mergers and non-mergers. We found a larger fraction of AGN in mergers compared to the non-merger controls for all AGN selections, with AGN excess factors ranging from 2 to 6. Likewise, a generally larger merger fraction ($f_{merg}$) is seen in active galaxies than in the non-active controls. We analysed $f_{merg}$ as a function of the AGN bolometric luminosity ($L_{bol}$) and the contribution of the point-source to the total galaxy light in the $I_E$ band ($f_{PSF}$) as a proxy for the relative AGN contribution fraction. We uncovered a rising $f_{merg}$ with increasing $f_{PSF}$ up to $f_{PSF}=0.55$, after which we observed a decreasing trend. We then derived the point-source luminosity ($L_{PSF}$) and showed that $f_{merg}$ monotonically increases as a function of $L_{PSF}$ at z<0.9, with $f_{merg}>$50% for $L_{PSF}>2\times10^{43}$ erg/s. At z>0.9, $f_{merg}$ rises as a function of $L_{PSF}$, though mergers do not dominate until $L_{PSF}=10^{45}$ erg/s. For X-ray and spectroscopic AGN, we computed $L_{bol}$, which has a positive correlation with $f_{merg}$ for X-ray AGN, while it shows a less pronounced trend for spectroscopic AGN due to the smaller sample size. At $L_{bol}>10^{45}$ erg/s, AGN mostly reside in mergers. We concluded that mergers are strongly linked to the most powerful, dust-obscured AGN, associated with rapid supermassive black hole growth.
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1), A first look at the fraction of bars in massive galaxies at $z<1$
Authors:
Euclid Collaboration,
M. Huertas-Company,
M. Walmsley,
M. Siudek,
P. Iglesias-Navarro,
J. H. Knapen,
S. Serjeant,
H. J. Dickinson,
L. Fortson,
I. Garland,
T. Géron,
W. Keel,
S. Kruk,
C. J. Lintott,
K. Mantha,
K. Masters,
D. O'Ryan,
J. J. Popp,
H. Roberts,
C. Scarlata,
J. S. Makechemu,
B. Simmons,
R. J. Smethurst,
A. Spindler,
M. Baes
, et al. (314 additional authors not shown)
Abstract:
Stellar bars are key structures in disc galaxies, driving angular momentum redistribution and influencing processes such as bulge growth and star formation. Quantifying the bar fraction as a function of redshift and stellar mass is therefore important for constraining the physical processes that drive disc formation and evolution across the history of the Universe. Leveraging the unprecedented resolution and survey area of the Euclid Q1 data release combined with the Zoobot deep-learning model trained on citizen-science labels, we identify 7711 barred galaxies with $M_* \gtrsim 10^{10}M_\odot$ in a magnitude-selected sample $I_E < 20.5$ spanning $63.1\,\mathrm{deg}^2$. We measure a mean bar fraction of $0.2-0.4$, consistent with prior studies. At fixed redshift, massive galaxies exhibit higher bar fractions, while lower-mass systems show a steeper decline with redshift, suggesting earlier disc assembly in massive galaxies. Comparisons with cosmological simulations (e.g., TNG50, Auriga) reveal a broadly consistent bar fraction, but highlight overpredictions for high-mass systems, pointing to potential over-efficiency in central stellar mass build-up in simulations. These findings demonstrate Euclid's transformative potential for galaxy morphology studies and underscore the importance of refining theoretical models to better reproduce observed trends. Future work will explore finer mass bins, environmental correlations, and additional morphological indicators.
Submitted 19 March, 2025;
originally announced March 2025.
-
Euclid Quick Data Release (Q1): First visual morphology catalogue
Authors:
Euclid Collaboration,
M. Walmsley,
M. Huertas-Company,
L. Quilley,
K. L. Masters,
S. Kruk,
K. A. Remmelgas,
J. J. Popp,
E. Romelli,
D. O'Ryan,
H. J. Dickinson,
C. J. Lintott,
S. Serjeant,
R. J. Smethurst,
B. Simmons,
J. Shingirai Makechemu,
I. L. Garland,
H. Roberts,
K. Mantha,
L. F. Fortson,
T. Géron,
W. Keel,
E. M. Baeten,
C. Macmillan,
J. Bovy
, et al. (330 additional authors not shown)
Abstract:
We present a detailed visual morphology catalogue for Euclid's Quick Release 1 (Q1). Our catalogue includes galaxy features such as bars, spiral arms, and ongoing mergers, for the 378000 bright ($I_E < 20.5$) or extended (area $\geq 700\,$pixels) galaxies in Q1. The catalogue was created by finetuning the Zoobot galaxy foundation models on annotations from an intensive one-month campaign by Galaxy Zoo volunteers. Our measurements are fully automated and hence fully scalable. This catalogue is the first 0.4% of the approximately 100 million galaxies where Euclid will ultimately resolve detailed morphology.
Submitted 19 March, 2025;
originally announced March 2025.
-
Galaxy Zoo DESI: large-scale bars as a secular mechanism for triggering AGN
Authors:
Izzy L. Garland,
Mike Walmsley,
Maddie S. Silcock,
Leah M. Potts,
Josh Smith,
Brooke D. Simmons,
Chris J. Lintott,
Rebecca J. Smethurst,
James M. Dawson,
William C. Keel,
Sandor Kruk,
Kameswara Bharadwaj Mantha,
Karen L. Masters,
David O'Ryan,
Jürgen J. Popp,
Matthew R. Thorne
Abstract:
Despite the evidence that supermassive black holes (SMBHs) co-evolve with their host galaxy, and that most of the growth of these SMBHs occurs via merger-free processes, the underlying mechanisms which drive this secular co-evolution are poorly understood. We investigate the role that both strong and weak large-scale galactic bars play in mediating this relationship. Using 72,940 disc galaxies in a volume-limited sample from Galaxy Zoo DESI, we analyse the active galactic nucleus (AGN) fraction in strongly barred, weakly barred, and unbarred galaxies up to z = 0.1 over a range of stellar masses and colours. After controlling for stellar mass and colour, we find that the optically selected AGN fraction is 31.6 +/- 0.9 per cent in strongly barred galaxies, 23.3 +/- 0.8 per cent in weakly barred galaxies, and 14.2 +/- 0.6 per cent in unbarred disc galaxies. These are highly statistically robust results, strengthening the tantalising results in earlier works. Strongly barred galaxies have a higher fraction of AGNs than weakly barred galaxies, which in turn have a higher fraction than unbarred galaxies. Thus, while bars are not required in order to grow a SMBH in a disc galaxy, large-scale galactic bars appear to facilitate AGN fuelling, and the presence of a strong bar makes a disc galaxy more than twice as likely to host an AGN as an unbarred galaxy at all galaxy stellar masses and colours.
Submitted 28 June, 2024;
originally announced June 2024.
-
The effects of bar strength and kinematics on galaxy evolution: slow strong bars affect their hosts the most
Authors:
Tobias Géron,
R. J. Smethurst,
Chris Lintott,
Karen L. Masters,
I. L. Garland,
Petra Mengistu,
David O'Ryan,
B. D. Simmons
Abstract:
We study how bar strength and bar kinematics affect star formation in different regions of the bar by creating radial profiles of EW[H$α$] and D$_{\rm n}$4000 using data from SDSS-IV MaNGA. Bars in galaxies are classified as strong or weak using Galaxy Zoo DESI, and they are classified as fast and slow bars using the Tremaine-Weinberg method on stellar kinematic data from the MaNGA survey. In agreement with previous studies, we find that strong bars in star forming galaxies have enhanced star formation in their centre and beyond the bar-end region, while star formation is suppressed in the arms of the bar. This is not found for weakly barred galaxies, which have very similar radial profiles to unbarred galaxies. In addition, we find that slow bars in star forming galaxies have significantly higher star formation along the bar than fast bars. However, the global star formation rate is not significantly different between galaxies with fast and slow bars. This suggests that the kinematics of the bar do not affect star formation globally, but change where star formation occurs in the galaxy. Thus, we find that a bar will influence its host the most if it is both strong and slow.
Submitted 9 May, 2024;
originally announced May 2024.
-
Scaling Laws for Galaxy Images
Authors:
Mike Walmsley,
Micah Bowles,
Anna M. M. Scaife,
Jason Shingirai Makechemu,
Alexander J. Gordon,
Annette M. N. Ferguson,
Robert G. Mann,
James Pearson,
Jürgen J. Popp,
Jo Bovy,
Josh Speagle,
Hugh Dickinson,
Lucy Fortson,
Tobias Géron,
Sandor Kruk,
Chris J. Lintott,
Kameswara Mantha,
Devina Mohan,
David O'Ryan,
Inigo V. Slijepevic
Abstract:
We present the first systematic investigation of supervised scaling laws outside of an ImageNet-like context - on images of galaxies. We use 840k galaxy images and over 100M annotations by Galaxy Zoo volunteers, comparable in scale to ImageNet-1K. We find that adding annotated galaxy images provides a power law improvement in performance across all architectures and all tasks, while adding trainable parameters is effective only for some (typically more subjectively challenging) tasks. We then compare the downstream performance of finetuned models pretrained on either ImageNet-12k alone vs. additionally pretrained on our galaxy images. We achieve an average relative error rate reduction of 31% across 5 downstream tasks of scientific interest. Our finetuned models are more label-efficient and, unlike their ImageNet-12k-pretrained equivalents, often achieve linear transfer performance equal to that of end-to-end finetuning. We find relatively modest additional downstream benefits from scaling model size, implying that scaling alone is not sufficient to address our domain gap, and suggest that practitioners with qualitatively different images might benefit more from in-domain adaption followed by targeted downstream labelling.
Submitted 3 April, 2024;
originally announced April 2024.
-
Galaxy merger challenge: A comparison study between machine learning-based detection methods
Authors:
B. Margalef-Bentabol,
L. Wang,
A. La Marca,
C. Blanco-Prieto,
D. Chudy,
H. Domínguez-Sánchez,
A. D. Goulding,
A. Guzmán-Ortega,
M. Huertas-Company,
G. Martin,
W. J. Pearson,
V. Rodriguez-Gomez,
M. Walmsley,
R. W. Bickley,
C. Bottrell,
C. Conselice,
D. O'Ryan
Abstract:
Various galaxy merger detection methods have been applied to diverse datasets. However, it is difficult to understand how they compare. We aim to benchmark the relative performance of machine learning (ML) merger detection methods. We explore six leading ML methods using three main datasets. The first one (the training data) consists of mock observations from the IllustrisTNG simulations and allows us to quantify the performance metrics of the detection methods. The second one consists of mock observations from the Horizon-AGN simulations, introduced to evaluate the performance of classifiers trained on different, but comparable data. The third one consists of real observations from the Hyper Suprime-Cam Subaru Strategic Program (HSC-SSP) survey. For the binary classification task (mergers vs. non-mergers), all methods perform reasonably well in the domain of the training data. At $0.1<z<0.3$, precision and recall range between $\sim$70% and 80%, both of which decrease with increasing $z$ as expected (by $\sim$5% for precision and $\sim$10% for recall at $0.76<z<1.0$). When transferred to a different domain, the precision of all classifiers is only slightly reduced, but the recall is significantly worse (by $\sim$20-40% depending on the method). Zoobot offers the best overall performance in terms of precision and F1 score. When applied to real HSC observations, all methods agree well with visual labels of clear mergers but can differ by more than an order of magnitude in predicting the overall fraction of major mergers. For the multi-class classification task to distinguish pre-, post- and non-mergers, none of the methods offer a good performance, which could be partly due to limitations in resolution and depth of the data. With the advent of better quality data (e.g. JWST and Euclid), it is important to improve our ability to detect mergers and distinguish between merger stages.
Submitted 15 April, 2024; v1 submitted 22 March, 2024;
originally announced March 2024.
-
Galaxy Zoo DESI: Detailed Morphology Measurements for 8.7M Galaxies in the DESI Legacy Imaging Surveys
Authors:
Mike Walmsley,
Tobias Géron,
Sandor Kruk,
Anna M. M. Scaife,
Chris Lintott,
Karen L. Masters,
James M. Dawson,
Hugh Dickinson,
Lucy Fortson,
Izzy L. Garland,
Kameswara Mantha,
David O'Ryan,
Jürgen Popp,
Brooke Simmons,
Elisabeth M. Baeten,
Christine Macmillan
Abstract:
We present detailed morphology measurements for 8.67 million galaxies in the DESI Legacy Imaging Surveys (DECaLS, MzLS, and BASS, plus DES). These are automated measurements made by deep learning models trained on Galaxy Zoo volunteer votes. Our models typically predict the fraction of volunteers selecting each answer to within 5-10% for every answer to every GZ question. The models are trained on newly-collected votes for DESI-LS DR8 images as well as historical votes from GZ DECaLS. We also release the newly-collected votes. Extending our morphology measurements outside of the previously-released DECaLS/SDSS intersection increases our sky coverage by a factor of 4 (5,000 to 19,000 deg$^2$) and allows for full overlap with complementary surveys including ALFALFA and MaNGA.
Submitted 20 September, 2023;
originally announced September 2023.
-
The most luminous, merger-free AGN show only marginal correlation with bar presence
Authors:
Izzy L. Garland,
Matthew J. Fahey,
Brooke D. Simmons,
Rebecca J. Smethurst,
Chris J. Lintott,
Jesse Shanahan,
Maddie S. Silcock,
Joshua Smith,
William C. Keel,
Alison Coil,
Tobias Géron,
Sandor Kruk,
Karen L. Masters,
David O'Ryan,
Matthew R. Thorne,
Klaas Wiersema
Abstract:
The role of large-scale bars in the fuelling of active galactic nuclei (AGN) is still debated, even as evidence mounts that black hole growth in the absence of galaxy mergers cumulatively dominated and may substantially influence disc (i.e., merger-free) galaxy evolution. We investigate whether large-scale galactic bars are a good candidate for merger-free AGN fuelling. Specifically, we combine slit spectroscopy and Hubble Space Telescope imagery to characterise star formation rates (SFRs) and stellar masses of the unambiguously disc-dominated host galaxies of a sample of luminous, Type-1 AGN with 0.02 < z 0.024. After carefully correcting for AGN signal, we find no clear difference in SFR between AGN hosts and a stellar mass-matched sample of galaxies lacking an AGN (0.013 < z < 0.19), although this could be due to a small sample size ($n_{\rm AGN} = 34$). We correct for SFR and stellar mass to minimise selection biases, and compare the bar fraction in the two samples. We find that AGN are marginally (1.7$σ$) more likely to host a bar than inactive galaxies, with AGN hosts having a bar fraction $f_{\rm bar} = 0.59^{+0.08}_{-0.09}$ and inactive galaxies having a bar fraction $f_{\rm bar} = 0.44^{+0.08}_{-0.09}$. However, we find no further differences between SFR- and mass-matched AGN and inactive samples. While bars could potentially trigger AGN activity, they appear to have no further, unique effect on a galaxy's stellar mass or SFR.
Submitted 3 April, 2023;
originally announced April 2023.
-
Harnessing the Hubble Space Telescope Archives: A Catalogue of 21,926 Interacting Galaxies
Authors:
David O'Ryan,
Bruno Merín,
Brooke D. Simmons,
Antónia Vojteková,
Anna Anku,
Mike Walmsley,
Izzy L. Garland,
Tobias Géron,
William Keel,
Sandor Kruk,
Chris J. Lintott,
Kameswara Bharadwaj Mantha,
Karen L. Masters,
Jan Reerink,
Rebecca J. Smethurst,
Matthew R. Thorne
Abstract:
Mergers play a complex role in galaxy formation and evolution. Continuing to improve our understanding of these systems requires ever larger samples, which can be difficult (even impossible) to select from individual surveys. We use the new platform ESA Datalabs to assemble a catalogue of interacting galaxies from the Hubble Space Telescope science archives; this catalogue is larger than previously published catalogues by nearly an order of magnitude. In particular, we apply the Zoobot convolutional neural network directly to the entire public archive of HST $F814W$ images and make probabilistic interaction predictions for 126 million sources from the Hubble Source Catalogue. We employ a combination of automated visual representation and visual analysis to identify a clean sample of 21,926 interacting galaxy systems, mostly with $z < 1$. Sixty-five percent of these systems have no previous references in either the NASA Extragalactic Database or Simbad. In the process of removing contamination, we also discover many other objects of interest, such as gravitational lenses, edge-on protoplanetary disks, and 'backlit' overlapping galaxies. We briefly investigate the basic properties of this sample, and we make our catalogue publicly available for use by the community. In addition to providing a new catalogue of scientifically interesting objects imaged by HST, this work also demonstrates the power of the ESA Datalabs tool to facilitate substantial archival analysis without placing a high computational or storage burden on the end user.
Submitted 1 March, 2023;
originally announced March 2023.
-
Galaxy And Mass Assembly: Galaxy Morphology in the Green Valley, Prominent rings and looser Spiral Arms
Authors:
Dominic Smith,
Lutz Haberzettl,
L. E. Porter,
Ren Porter-Temple,
Christopher P. A. Henry,
Benne Holwerda,
A. R. Lopez-Sanchez,
Steven Phillipps,
Alister W. Graham,
Sarah Brough,
Kevin A. Pimbblet,
Jochen Liske,
Lee S. Kelvin,
Clayton D. Robertson,
Wade Roemer,
Michael Walmsley,
David O'Ryan,
Tobias Geron
Abstract:
Galaxies broadly fall into two categories: star-forming (blue) galaxies and quiescent (red) galaxies. In between, one finds the less populated "green valley". Some of these galaxies are suspected to be in the process of ceasing their star formation through a gradual exhaustion of gas supply, while others may already be dead and experiencing a rejuvenation of star formation through fuel injection. We use the Galaxy And Mass Assembly database and the Galaxy Zoo citizen science morphological estimates to compare the morphology of galaxies in the green valley against those in the red sequence and blue cloud.
Our goal is to examine the structural differences within galaxies that fall in the green valley, and what brings them there. Previous results found that disc features such as rings and lenses are more prominently represented in the green valley population. We revisit this with a similar-sized data set of galaxies with morphology labels provided by Galaxy Zoo for the GAMA fields based on new KiDS images. Our aim is to compare qualitatively the results from expert classification with those of citizen science.
We observe that ring structures are indeed found more commonly in green valley galaxies compared to their red and blue counterparts. We suggest that ring structures arise as disc galaxies in the green valley fade and their disc morphology evolves. We note that the progression from blue to red correlates with loosening spiral arm structure.
Submitted 15 November, 2022;
originally announced November 2022.
-
Preparing for low surface brightness science with the Vera C. Rubin Observatory: characterisation of tidal features from mock images
Authors:
G. Martin,
A. E. Bazkiaei,
M. Spavone,
E. Iodice,
J. C. Mihos,
M. Montes,
J. A. Benavides,
S. Brough,
J. L. Carlin,
C. A. Collins,
P. A. Duc,
F. A. Gómez,
G. Galaz,
H. M. Hernández-Toledo,
R. A. Jackson,
S. Kaviraj,
J. H. Knapen,
C. Martínez-Lombilla,
S. McGee,
D. O'Ryan,
D. J. Prole,
R. M. Rich,
J. Román,
E. A. Shah,
T. K. Starkenburg
, et al. (28 additional authors not shown)
Abstract:
Tidal features in the outskirts of galaxies yield unique information about their past interactions and are a key prediction of the hierarchical structure formation paradigm. The Vera C. Rubin Observatory is poised to deliver deep observations for potentially millions of objects with visible tidal features, but the inference of galaxy interaction histories from such features is not straightforward. Utilising automated techniques and human visual classification in conjunction with realistic mock images produced using the NEWHORIZON cosmological simulation, we investigate the nature, frequency and visibility of tidal features and debris across a range of environments and stellar masses. In our simulated sample, around 80 per cent of the flux in the tidal features around Milky Way or greater mass galaxies is detected at the 10-year depth of the Legacy Survey of Space and Time (30-31 mag / sq. arcsec), falling to 60 per cent assuming a shallower final depth of 29.5 mag / sq. arcsec. The fraction of total flux found in tidal features increases towards higher masses, rising to 10 per cent for the most massive objects in our sample ($M_\star \sim 10^{11.5}\,M_\odot$). When observed at sufficient depth, such objects frequently exhibit many distinct tidal features with complex shapes. The interpretation and characterisation of such features vary significantly with image depth and object orientation, introducing significant biases in their classification. Assuming the data reduction pipeline is properly optimised, we expect the Rubin Observatory to be capable of recovering much of the flux found in the outskirts of Milky Way mass galaxies, even at intermediate redshifts (z<0.2).
Submitted 7 May, 2022; v1 submitted 15 March, 2022;
originally announced March 2022.
-
Gems of the Galaxy Zoos -- a Wide-Ranging Hubble Space Telescope Gap-Filler Program
Authors:
William C. Keel,
Jean Tate,
O. Ivy Wong,
Julie K. Banfield,
Chris J. Lintott,
Karen L. Masters,
Brooke D. Simmons,
Claudia Scarlata,
Carolin Cardamone,
Rebecca Smethurst,
Lucy Fortson,
Jesse Shanahan,
Sandor Kruk,
Izzy L. Garland,
Colin Hancock,
David O'Ryan
Abstract:
We describe the Gems of the Galaxy Zoos (Zoo Gems) project, a gap-filler project using short windows in the Hubble Space Telescope's schedule. As with previous snapshot programs, targets are taken from a pool based on position; we combine objects selected by volunteers in both the Galaxy Zoo and Radio Galaxy Zoo citizen-science projects. Zoo Gems uses exposures with the Advanced Camera for Surveys (ACS) to address a broad range of topics in galaxy morphology, interstellar-medium content, host galaxies of active galactic nuclei, and galaxy evolution. Science cases include studying galaxy interactions, backlit dust in galaxies, post-starburst systems, rings and peculiar spiral patterns, outliers from the usual color-morphology relation, Green Pea compact starburst systems, double radio sources with spiral host galaxies, and extended emission-line regions around active galactic nuclei. For many of these science categories, final selection of targets from a larger list used public input via a voting process. Highlights to date include the prevalence of tightly-wound spiral structure in blue, apparently early-type galaxies, a nearly complete Einstein ring from a group lens, redder components at lower surface brightness surrounding compact Green Pea starbursts, and high-probability examples of spiral galaxies hosting large double radio sources.
Submitted 15 February, 2022; v1 submitted 2 February, 2022;
originally announced February 2022.
-
Quantifying the Poor Purity and Completeness of Morphological Samples Selected by Galaxy Colour
Authors:
Rebecca J. Smethurst,
Karen L. Masters,
Brooke D. Simmons,
Izzy L. Garland,
Tobias Géron,
Boris Häußler,
Sandor Kruk,
Chris J. Lintott,
David O'Ryan,
Mike Walmsley
Abstract:
The galaxy population is strongly bimodal in both colour and morphology, and the two measures correlate strongly, with most blue galaxies being late-types (spirals) and most early-types, typically ellipticals, being red. This observation has led to the use of colour as a convenient selection criterion to make samples which are then labelled by morphology. Such use of colour as a proxy for morphology results in necessarily impure and incomplete samples. In this paper, we make use of the morphological labels produced by Galaxy Zoo to measure how incomplete and impure such samples are, considering optical (ugriz), NUV and NIR (JHK) bands. The best single colour optical selection is found using a threshold of g-r = 0.742, but this still results in a sample where only 56% of red galaxies are smooth and 56% of smooth galaxies are red. Use of the NUV gives some improvement over purely optical bands, particularly for late-types, but still results in low purity/completeness for early-types. No significant improvement is found by adding NIR bands. With any two bands, including NUV, a sample of early-types with greater than two-thirds purity cannot be constructed. Advances in quantitative galaxy morphologies have made colour-morphology proxy selections largely unnecessary going forward; where such assumptions are still required, we recommend studies carefully consider the implications of sample incompleteness/impurity.
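As an illustration of the two quantities in the abstract above, the following minimal sketch computes purity (the fraction of colour-selected red galaxies that are smooth) and completeness (the fraction of smooth galaxies that are selected as red) for a single colour cut. It is not the authors' code: the function name `purity_completeness` and the eight toy galaxies are hypothetical, and only the g-r = 0.742 threshold is taken from the abstract.

```python
def purity_completeness(selected, target):
    """Return (purity, completeness) of a boolean selection against a target class.

    purity       = selected objects that are in the target class / all selected
    completeness = selected objects that are in the target class / all targets
    Assumes at least one selected object and at least one target object.
    """
    true_pos = sum(1 for s, t in zip(selected, target) if s and t)
    n_selected = sum(1 for s in selected if s)
    n_target = sum(1 for t in target if t)
    return true_pos / n_selected, true_pos / n_target


# Hypothetical toy sample: g-r colours and morphology labels (True = smooth).
g_r = [0.80, 0.75, 0.90, 0.60, 0.50, 0.78, 0.30, 0.85]
is_smooth = [True, False, True, True, False, False, False, True]

# Colour cut from the abstract: galaxies redder than g-r = 0.742 count as "red".
THRESHOLD = 0.742
is_red = [colour > THRESHOLD for colour in g_r]

purity, completeness = purity_completeness(is_red, is_smooth)
print(f"purity = {purity:.2f}, completeness = {completeness:.2f}")
```

In this toy sample, five galaxies pass the cut and three of them are smooth, while four galaxies are smooth in total, so the selection is 60% pure and 75% complete. The numbers are illustrative only and say nothing about the real Galaxy Zoo sample.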
Submitted 8 December, 2021;
originally announced December 2021.
-
Origin of the Local Group satellite planes
Authors:
Indranil Banik,
David O'Ryan,
Hongsheng Zhao
Abstract:
We attempt to understand the planes of satellite galaxies orbiting the Milky Way (MW) and M31 in the context of Modified Newtonian Dynamics (MOND), which implies a close MW-M31 flyby occurred ${\approx 8}$ Gyr ago. Using the timing argument, we obtain MW-M31 trajectories consistent with cosmological initial conditions and present observations. We adjust the present M31 proper motion within its uncertainty in order to simulate a range of orbital geometries and closest approach distances. Treating the MW and M31 as point masses, we follow the trajectories of surrounding test particle disks, thereby mapping out the tidal debris distribution.
Around each galaxy, the resulting tidal debris tends to cluster around a particular orbital pole. We find some models in which these preferred spin vectors align fairly well with those of the corresponding observed satellite planes. The radial distributions of material in the simulated satellite planes are similar to what we observe. Around the MW, our best-fitting model yields a significant fraction (0.22) of counter-rotating material, perhaps explaining why Sculptor counter-rotates within the MW satellite plane. In contrast, our model yields no counter-rotating material around M31. This is testable with proper motions of M31 satellites.
In our best model, the MW disk is thickened by the flyby 7.65 Gyr ago to a root mean square height of 0.75 kpc. This is similar to the observed age and thickness of the Galactic thick disk. Thus, the MW thick disk may have formed together with the MW and M31 satellite planes during a past MW-M31 flyby.
Submitted 8 April, 2018; v1 submitted 1 February, 2018;
originally announced February 2018.