-
Unveiling the main sequence of galaxies at $z \geq 5$ with the James Webb Space Telescope: predictions from simulations
Authors:
Jordan C. J. D'Silva,
Claudia D. P. Lagos,
Luke J. M. Davies,
Christopher C. Lovell,
Aswin P. Vijayan
Abstract:
We use two independent, galaxy formation simulations, FLARES, a cosmological hydrodynamical simulation, and SHARK, a semi-analytic model, to explore how well the James Webb Space Telescope (JWST) will be able to uncover the existence and parameters of the star-forming main sequence (SFS) at $z=5\to10$, i.e. shape, scatter, normalisation. Using two independent simulations allows us to isolate predi…
▽ More
We use two independent, galaxy formation simulations, FLARES, a cosmological hydrodynamical simulation, and SHARK, a semi-analytic model, to explore how well the James Webb Space Telescope (JWST) will be able to uncover the existence and parameters of the star-forming main sequence (SFS) at $z=5\to10$, i.e. shape, scatter, normalisation. Using two independent simulations allows us to isolate predictions (e.g., stellar mass, star formation rate, SFR, luminosity functions) that are robust to or highly dependent on the implementation of the physics of galaxy formation. Both simulations predict that JWST can observe $\ge 70-90\%$ (for SHARK and FLARES respectively) of galaxies up to $z\sim10$ (down to stellar masses of $\approx 10^{8.3}\,\rm M_{\odot}$ and SFRs of $\approx 10^{0.5}\,\rm M_{\odot}\, yr^{-1}$) in modest integration times and given current proposed survey areas (e.g. the Web COSMOS $0.6\,\rm deg^2$) to accurately constrain the parameters of the SFS. Although both simulations predict qualitatively similar distributions of stellar mass and SFR, there are important quantitative differences, such as the abundance of massive, star-forming galaxies, with FLARES predicting a higher abundance than SHARK; the early onset of quenching as a result of black hole growth in FLARES (at $z\approx 8$), not seen in SHARK until much lower redshifts; and the implementation of synthetic photometry, with FLARES predicting more JWST-detected galaxies ($\sim 90\%$) than SHARK ($\sim 70\%$) at $z=10$. JWST observations will distinguish between these models, leading to a significant improvement upon our understanding of the formation of the very first galaxies.
△ Less
Submitted 5 October, 2022; v1 submitted 12 August, 2022;
originally announced August 2022.
-
First Light And Reionisation Epoch Simulations (FLARES) VII: The Star Formation and Metal Enrichment Histories of Galaxies in the early Universe
Authors:
Stephen M. Wilkins,
Aswin P. Vijayan,
Christopher C. Lovell,
William J. Roper,
Erik Zackrisson,
Dimitrios Irodotou,
Louise T. C. Seeyave,
Jussi K. Kuusisto,
Peter A. Thomas,
Joseph Caruana,
Christopher J. Conselice
Abstract:
The star formation and metal enrichment histories of galaxies - at any epoch - constitute one of the key properties of galaxies, and their measurement is a core aim of observational extragalactic astronomy. The lack of deep rest-frame optical coverage at high-redshift has made robust constraints elusive, but this is now changing thanks to the \emph{James Webb Space Telescope (JWST)}. In preparatio…
▽ More
The star formation and metal enrichment histories of galaxies - at any epoch - constitute one of the key properties of galaxies, and their measurement is a core aim of observational extragalactic astronomy. The lack of deep rest-frame optical coverage at high-redshift has made robust constraints elusive, but this is now changing thanks to the \emph{James Webb Space Telescope (JWST)}. In preparation for the constraints provided by \emph{JWST} we explore the star formation and metal enrichment histories of galaxies at $z=5-13$ using the First Light And Reionisation Epoch Simulations (FLARES) suite. Built on the EAGLE model, the unique strategy of FLARES allows us to simulate a wide range of stellar masses (and luminosities) and environments. While we predict significant redshift evolution of average ages and specific star formation rates our core result is a mostly flat relationship of age and specific star formation rate with stellar mass. We also find that galaxies in this epoch predominantly have strongly rising star formation histories, albeit with the magnitude dropping with redshift and stellar mass. In terms of chemical enrichment we predict a strong stellar mass - metallicity relation present at $z=10$ and beyond alongside significant $α$-enhancement. Finally, we find no environmental dependence of the relationship between age, specific star formation rate, or metallicity with stellar mass.
△ Less
Submitted 1 August, 2022;
originally announced August 2022.
-
Few-Shot Class-Incremental Learning from an Open-Set Perspective
Authors:
Can Peng,
Kun Zhao,
Tianren Wang,
Meng Li,
Brian C. Lovell
Abstract:
The continual appearance of new objects in the visual world poses considerable challenges for current deep learning methods in real-world deployments. The challenge of new task learning is often exacerbated by the scarcity of data for the new categories due to rarity or cost. Here we explore the important task of Few-Shot Class-Incremental Learning (FSCIL) and its extreme data scarcity condition o…
▽ More
The continual appearance of new objects in the visual world poses considerable challenges for current deep learning methods in real-world deployments. The challenge of new task learning is often exacerbated by the scarcity of data for the new categories due to rarity or cost. Here we explore the important task of Few-Shot Class-Incremental Learning (FSCIL) and its extreme data scarcity condition of one-shot. An ideal FSCIL model needs to perform well on all classes, regardless of their presentation order or paucity of data. It also needs to be robust to open-set real-world conditions and be easily adapted to the new tasks that always arise in the field. In this paper, we first reevaluate the current task setting and propose a more comprehensive and practical setting for the FSCIL task. Then, inspired by the similarity of the goals for FSCIL and modern face recognition systems, we propose our method -- Augmented Angular Loss Incremental Classification or ALICE. In ALICE, instead of the commonly used cross-entropy loss, we propose to use the angular penalty loss to obtain well-clustered features. As the obtained features not only need to be compactly clustered but also diverse enough to maintain generalization for future incremental classes, we further discuss how class augmentation, data augmentation, and data balancing affect classification performance. Experiments on benchmark datasets, including CIFAR100, miniImageNet, and CUB200, demonstrate the improved performance of ALICE over the state-of-the-art FSCIL methods.
△ Less
Submitted 30 July, 2022;
originally announced August 2022.
-
Seeing sharper and deeper: JWST's first glimpse of the photometric and spectroscopic properties of galaxies in the epoch of reionisation
Authors:
James A. A. Trussler,
Nathan J. Adams,
Christopher J. Conselice,
Leonardo Ferreira,
Duncan Austin,
Rachana Bhatawdekar,
Joseph Caruana,
Brenda L. Frye,
Tom Harvey,
Christopher C. Lovell,
Massimo Pascale,
William J. Roper,
Aprajita Verma,
Aswin P. Vijayan,
Stephen M. Wilkins
Abstract:
We analyse the photometric and spectroscopic properties of four galaxies in the epoch of reionisation (EoR) within the SMACS 0723 JWST Early Release Observations field. Given the known spectroscopic redshifts of these sources, we investigated the accuracy with which photometric redshifts can be derived using NIRCam photometry alone, finding that F115W imaging is essential to distinguish between z~…
▽ More
We analyse the photometric and spectroscopic properties of four galaxies in the epoch of reionisation (EoR) within the SMACS 0723 JWST Early Release Observations field. Given the known spectroscopic redshifts of these sources, we investigated the accuracy with which photometric redshifts can be derived using NIRCam photometry alone, finding that F115W imaging is essential to distinguish between z~8 galaxies with high equivalent width (EW) [O III] λ5007 emission and z~10 Balmer break galaxies. We find that all four sources exhibit strong (\geq 0.6 mag) F356W-F444W colours, which sit at the extreme end of theoretical predictions from numerical simulations. We find that these galaxies deviate (by roughly 0.5 dex) from the local correlation between [O III] λ5007/Hβand [Ne III] λ3869/[O II], which is consistent with the predictions from simulations of high-redshift galaxies having elevated line excitation ratios. We measure the [O III] λ5007 rest-frame equivalent widths both directly from the spectroscopy, and indirectly as inferred from the strong F356W-F444W colours, finding large [O III] λ5007 EWs of 225-1740 Å. The [O III] λ5007 and HβEWs are consistent with those seen in extreme, intensely star-forming dwarf galaxies in the local Universe. Our structural analysis indicates that these galaxies are resolved, exhibiting irregular shapes with bright clumps. In line with the predictions from the FLARES hydrodynamic simulations, such intense star formation and extreme nebular conditions are likely the norm, rather than the exception, in the EoR.
△ Less
Submitted 30 August, 2023; v1 submitted 28 July, 2022;
originally announced July 2022.
-
First Light And Reionisation Epoch Simulations (FLARES) VI: The colour evolution of galaxies $z=5-15$
Authors:
Stephen M. Wilkins,
Aswin P. Vijayan,
Christopher C. Lovell,
William J. Roper,
Dimitrios Irodotou,
Joseph Caruana,
Louise T. C. Seeyave,
Jussi K. Kuusisto,
Peter A. Thomas
Abstract:
With its exquisite sensitivity, wavelength coverage, and spatial and spectral resolution, the James Webb Space Telescope is poised to revolutionise our view of the distant, high-redshift ($z>5$) Universe. While Webb's spectroscopic observations will be transformative for the field, photometric observations play a key role in identifying distant objects and providing more comprehensive samples than…
▽ More
With its exquisite sensitivity, wavelength coverage, and spatial and spectral resolution, the James Webb Space Telescope is poised to revolutionise our view of the distant, high-redshift ($z>5$) Universe. While Webb's spectroscopic observations will be transformative for the field, photometric observations play a key role in identifying distant objects and providing more comprehensive samples than accessible to spectroscopy alone. In addition to identifying objects, photometric observations can also be used to infer physical properties and thus be used to constrain galaxy formation models. However, inferred physical properties from broadband photometric observations, particularly in the absence of spectroscopic redshifts, often have large uncertainties. With the development of new tools for forward modelling simulations it is now routinely possible to predict observational quantities, enabling a direct comparison with observations. With this in mind, in this work, we make predictions for the colour evolution of galaxies at $z=5-15$ using the FLARES: First Light And Reionisation Epoch Simulations cosmological hydrodynamical simulation suite. We predict a complex evolution, driven predominantly by strong nebular line emission passing through individual bands. These predictions are in good agreement with existing constraints from Hubble and Spitzer as well as some of the first results from Webb. We also contrast our predictions with other models in the literature: while the general trends are similar we find key differences, particularly in the strength of features associated with strong nebular line emission. This suggests photometric observations alone should provide useful discriminating power between different models.
△ Less
Submitted 6 September, 2022; v1 submitted 22 July, 2022;
originally announced July 2022.
-
First Light And Reionisation Epoch Simulations (FLARES) V: The redshift frontier
Authors:
Stephen M. Wilkins,
Aswin P. Vijayan,
Christopher C. Lovell,
William J. Roper,
Dimitrios Irodotou,
Joseph Caruana,
Louise T. C. Seeyave,
Jussi K. Kuusisto,
Peter A. Thomas,
Shedeur A. K. Parris
Abstract:
The James Webb Space Telescope (JWST) is set to transform many areas of astronomy, one of the most exciting is the expansion of the redshift frontier to $z>10$. In its first year alone JWST should discover hundreds of galaxies, dwarfing the handful currently known. To prepare for these powerful observational constraints, we use the First Light And Reionisation Epoch (FLARES) simulations to predict…
▽ More
The James Webb Space Telescope (JWST) is set to transform many areas of astronomy, one of the most exciting is the expansion of the redshift frontier to $z>10$. In its first year alone JWST should discover hundreds of galaxies, dwarfing the handful currently known. To prepare for these powerful observational constraints, we use the First Light And Reionisation Epoch (FLARES) simulations to predict the physical and observational properties of the $z>10$ population of galaxies accessible to JWST. This is the first time such predictions have been made using a hydrodynamical model validated at low redshift. Our predictions at $z=10$ are broadly in agreement with current observational constraints on the far-UV luminosity function and UV continuum slope $β$, though the observational uncertainties are large. We note tension with recent constraints $z\sim 13$ from Harikane et al. 2022 - compared to these constraints, FLARES predicts objects with the same space density should have an order of magnitude lower luminosity, though this is mitigated slightly if dust attenuation is negligible in these systems. Our predictions suggest that in JWST's first cycle alone, around $600$ galaxies should be identified at $z>10$, with the first small samples available at $z>13$.
△ Less
Submitted 20 April, 2022;
originally announced April 2022.
-
Chaotic and Clumpy Galaxy Formation in an Extremely Massive Reionization-Era Halo
Authors:
Justin S. Spilker,
Christopher C. Hayward,
Daniel P. Marrone,
Manuel Aravena,
Matthieu Bethermin,
James Burgoyne,
Scott C. Chapman,
Thomas R. Greve,
Gayathri Gururajan,
Yashar D. Hezaveh,
Ryley Hill,
Katrina C. Litke,
Christopher C. Lovell,
Matthew A. Malkan,
Eric J. Murphy,
Desika Narayanan,
Kedar A. Phadke,
Cassie Reuter,
Antony A. Stark,
Nikolaus Sulzenauer,
Joaquin D. Vieira,
David Vizgan,
Axel Weiss
Abstract:
The SPT0311-58 system at z=6.900 is an extremely massive structure within the reionization epoch, and offers a chance to understand the formation of galaxies in an extreme peak in the primordial density field. We present 70mas Atacama Large Millimeter/submillimeter Array observations of the dust continuum and CII 158um emission in the central pair of galaxies and reach physical resolution ~100-350…
▽ More
The SPT0311-58 system at z=6.900 is an extremely massive structure within the reionization epoch, and offers a chance to understand the formation of galaxies in an extreme peak in the primordial density field. We present 70mas Atacama Large Millimeter/submillimeter Array observations of the dust continuum and CII 158um emission in the central pair of galaxies and reach physical resolution ~100-350pc, among the most detailed views of any reionization-era system to date. The observations resolve the source into at least a dozen kiloparsec-size clumps. The global kinematics and high turbulent velocity dispersion within the galaxies present a striking contrast to recent claims of dynamically cold thin-disk kinematics in some dusty galaxies just 800Myr later at z~4. We speculate that both gravitational interactions and fragmentation from massive parent disks have likely played a role in the overall dynamics and formation of clumps in the system. Each clump individually is comparable in mass to other 6<z<8 galaxies identified in rest-UV/optical deep field surveys, but with star formation rates elevated by ~3-5x. Internally, the clumps themselves bear close resemblance to greatly scaled-up versions of virialized cloud-scale structures identified in low-redshift galaxies. Our observations are qualitatively similar to the chaotic and clumpy assembly within massive halos seen in simulations of high-redshift galaxies.
△ Less
Submitted 28 March, 2022;
originally announced March 2022.
-
First Light And Reionisation Epoch Simulations (FLARES) IV: The size evolution of galaxies at $z\geq5$
Authors:
William J. Roper,
Christopher C. Lovell,
Aswin P. Vijayan,
Madeline A. Marshall,
Dimitrios Irodotou,
Jussi K. Kuusisto,
Peter A. Thomas,
Stephen M. Wilkins
Abstract:
We present the intrinsic and observed sizes of galaxies at $z\geq5$ in the First Light And Reionisation Epoch Simulations (FLARES). We employ the large effective volume of FLARES to produce a sizeable sample of high redshift galaxies with intrinsic and observed luminosities and half light radii in a range of rest frame UV and visual photometric bands. This sample contains a significant number of i…
▽ More
We present the intrinsic and observed sizes of galaxies at $z\geq5$ in the First Light And Reionisation Epoch Simulations (FLARES). We employ the large effective volume of FLARES to produce a sizeable sample of high redshift galaxies with intrinsic and observed luminosities and half light radii in a range of rest frame UV and visual photometric bands. This sample contains a significant number of intrinsically ultra-compact galaxies in the far-UV (1500 angstrom), leading to a negative intrinsic far-UV size-luminosity relation. However, after the inclusion of the effects of dust these same compact galaxies exhibit observed sizes that are as much as 50 times larger than those measured from the intrinsic emission, and broadly agree with a range of observational samples. This increase in size is driven by the concentration of dust in the core of galaxies, heavily attenuating the intrinsically brightest regions. At fixed luminosity we find a galaxy size redshift evolution with a slope of $m=1.21-1.87$ depending on the luminosity sample in question, and we demonstrate the wavelength dependence of the size-luminosity relation which will soon be probed by the Webb Space Telescope.
△ Less
Submitted 23 March, 2022;
originally announced March 2022.
-
The BPT Diagram in Cosmological Galaxy Formation Simulations: Understanding the Physics Driving Offsets at High-Redshift
Authors:
Prerak Garg,
Desika Narayanan,
Nell Byler,
Ryan L. Sanders,
Alice E. Shapley,
Allison L. Strom,
Romeel Davé,
Michaela Hirschmann,
Christopher C. Lovell,
Justin Otter,
Gergö Popping,
George C. Privon
Abstract:
The Baldwin, Philips, & Terlevich diagram of [O III]/H$β$ vs. [N II]/H$α$ (hereafter N2-BPT) has long been used as a tool for classifying galaxies based on the dominant source of ionizing radiation. Recent observations have demonstrated that galaxies at $z\sim2$ reside offset from local galaxies in the N2-BPT space. In this paper, we conduct a series of controlled numerical experiments to understa…
▽ More
The Baldwin, Philips, & Terlevich diagram of [O III]/H$β$ vs. [N II]/H$α$ (hereafter N2-BPT) has long been used as a tool for classifying galaxies based on the dominant source of ionizing radiation. Recent observations have demonstrated that galaxies at $z\sim2$ reside offset from local galaxies in the N2-BPT space. In this paper, we conduct a series of controlled numerical experiments to understand the potential physical processes driving this offset. We model nebular line emission in a large sample of galaxies, taken from the SIMBA cosmological hydrodynamic galaxy formation simulation, using the CLOUDY photoionization code to compute the nebular line luminosities from H II regions. We find that the observed shift toward higher [O III]/H$β$ and [N II]/H$α$ values at high redshift arises from sample selection: when we consider only the most massive galaxies $M_* \sim 10^{10-11} M_\odot$, the offset naturally appears, due to their high metallicities. We predict that deeper observations that probe lower-mass galaxies will reveal galaxies that lie on a locus comparable to $z\sim 0$ observations. Even when accounting for sample selection effects, we find that there is a subtle mismatch between simulations and observations. To resolve this discrepancy, we investigate the impact of varying ionization parameters, H II region densities, gas-phase abundance patterns, and increasing radiation field hardness on N2-BPT diagrams. We find that either decreasing the ionization parameter or increasing the N/O ratio of galaxies at fixed O/H can move galaxies along a self-similar arc in N2-BPT space that is occupied by high-redshift galaxies.
△ Less
Submitted 10 January, 2022;
originally announced January 2022.
-
DIODE: Dilatable Incremental Object Detection
Authors:
Can Peng,
Kun Zhao,
Sam Maksoud,
Tianren Wang,
Brian C. Lovell
Abstract:
To accommodate rapid changes in the real world, the cognition system of humans is capable of continually learning concepts. On the contrary, conventional deep learning models lack this capability of preserving previously learned knowledge. When a neural network is fine-tuned to learn new tasks, its performance on previously trained tasks will significantly deteriorate. Many recent works on increme…
▽ More
To accommodate rapid changes in the real world, the cognition system of humans is capable of continually learning concepts. On the contrary, conventional deep learning models lack this capability of preserving previously learned knowledge. When a neural network is fine-tuned to learn new tasks, its performance on previously trained tasks will significantly deteriorate. Many recent works on incremental object detection tackle this problem by introducing advanced regularization. Although these methods have shown promising results, the benefits are often short-lived after the first incremental step. Under multi-step incremental learning, the trade-off between old knowledge preserving and new task learning becomes progressively more severe. Thus, the performance of regularization-based incremental object detectors gradually decays for subsequent learning steps. In this paper, we aim to alleviate this performance decay on multi-step incremental detection tasks by proposing a dilatable incremental object detector (DIODE). For the task-shared parameters, our method adaptively penalizes the changes of important weights for previous tasks. At the same time, the structure of the model is dilated or expanded by a limited number of task-specific parameters to promote new task learning. Extensive experiments on PASCAL VOC and COCO datasets demonstrate substantial improvements over the state-of-the-art methods. Notably, compared with the state-of-the-art methods, our method achieves up to 6.0% performance improvement by increasing the number of parameters by just 1.2% for each newly learned task.
△ Less
Submitted 12 August, 2021;
originally announced August 2021.
-
First Light And Reionisation Epoch Simulations (FLARES) III: The properties of massive dusty galaxies at cosmic dawn
Authors:
Aswin P. Vijayan,
Stephen M. Wilkins,
Christopher C. Lovell,
Peter A. Thomas,
Peter Camps,
Maarten Baes,
James Trayford,
Jussi Kuusisto,
William J. Roper
Abstract:
Using the First Light And Reionisation Epoch Simulations (\textsc{Flares}) we explore the dust driven properties of massive high-redshift galaxies at $z\in[5,10]$. By post-processing the galaxy sample using the radiative transfer code \textsc{skirt} we obtain the full spectral energy distribution. We explore the resultant luminosity functions, IRX-$β$ relations as well as the luminosity-weighted d…
▽ More
Using the First Light And Reionisation Epoch Simulations (\textsc{Flares}) we explore the dust driven properties of massive high-redshift galaxies at $z\in[5,10]$. By post-processing the galaxy sample using the radiative transfer code \textsc{skirt} we obtain the full spectral energy distribution. We explore the resultant luminosity functions, IRX-$β$ relations as well as the luminosity-weighted dust temperatures in the Epoch of Reionisation (EoR). We find that most of our results are in agreement with the current set of observations, but under-predict the number densities of bright IR galaxies, which are extremely biased towards the most overdense regions. We see that the \textsc{Flares} IRX-$β$ relation (for $5\le z\le8$) predominantly follows the local starburst relation. The IRX shows an increase with stellar mass, plateauing at the high-mass end ($\sim10^{10}$M$_{\odot}$) and shows no evolution in the median normalisation with redshift. We also look at the dependence of the peak dust temperature ($T_{\mathrm{peak}}$) on various galaxy properties including the stellar mass, IR luminosity and sSFR, finding the correlation to be strongest with sSFR. The luminosity-weighted dust temperatures increase towards higher redshifts, with the slope of the $T_{\mathrm{peak}}$ - redshift relation showing a higher slope than the lower redshift relations obtained from previous observational and theoretical works. The results from \textsc{Flares}, which is able to provide a better statistical sample of high-redshift galaxies compared to other simulations, provides a distinct vantage point for the high-redshift Universe.
△ Less
Submitted 10 March, 2022; v1 submitted 2 August, 2021;
originally announced August 2021.
-
An Orientation Bias in Observations of Submillimetre Galaxies
Authors:
C. C. Lovell,
J. E. Geach,
R. Davé,
D. Narayanan,
K. E. K. Coppin,
Q. Li,
M. Franco,
G. C. Privon
Abstract:
Recent high-resolution interferometric images of submillimetre galaxies (SMGs) reveal fascinatingly complex morphologies. This raises a number of questions: how does the relative orientation of a galaxy affect its observed submillimetre emission, and does this result in an `orientation bias' in the selection and analysis of such galaxies in flux-limited cosmological surveys? We investigated these…
▽ More
Recent high-resolution interferometric images of submillimetre galaxies (SMGs) reveal fascinatingly complex morphologies. This raises a number of questions: how does the relative orientation of a galaxy affect its observed submillimetre emission, and does this result in an `orientation bias' in the selection and analysis of such galaxies in flux-limited cosmological surveys? We investigated these questions using the \textsc{Simba} cosmological simulation paired with the dust radiative transfer code \textsc{Powderday}. We selected eight simulated SMGs ($S_{850}\gtrsim2$ mJy) at $z = 2$, and measured the variance of their `observed' emission over 50 random orientations. Each galaxy exhibits significant scatter in its emission close to the peak of the thermal dust emission, with variation in flux density of up to a factor of 2.7. This results in an appreciable dispersion in the inferred dust temperatures and infrared luminosities ($16^{\mathrm{th}}-84^{\mathrm{th}}$ percentile ranges of 5\,K and 0.1\,dex, respectively) and therefore a fundamental uncertainty in derived parameters such as dust mass and star formation rate ($\sim$30% for the latter using simple calibrations). Using a Monte Carlo simulation we also assessed the impact of orientation on flux-limited surveys, finding a bias in the selection of SMGs towards those with face--on orientations, as well as those at lower redshifts. We predict that the orientation bias will affect flux-limited single-dish surveys, most significantly at THz frequencies, and this bias should be taken into account when placing the results of targeted follow--up studies in a statistical context.
△ Less
Submitted 30 August, 2022; v1 submitted 22 June, 2021;
originally announced June 2021.
-
A machine learning approach to mapping baryons onto dark matter haloes using the EAGLE and C-EAGLE simulations
Authors:
Christopher C. Lovell,
Stephen M. Wilkins,
Peter A. Thomas,
Matthieu Schaller,
Carlton M. Baugh,
Giulio Fabbian,
Yannick Bahé
Abstract:
High-resolution cosmological hydrodynamic simulations are currently limited to relatively small volumes due to their computational expense. However, much larger volumes are required to probe rare, overdense environments, and measure clustering statistics of the large scale structure. Typically, zoom simulations of individual regions are used to study rare environments, and semi-analytic models and…
▽ More
High-resolution cosmological hydrodynamic simulations are currently limited to relatively small volumes due to their computational expense. However, much larger volumes are required to probe rare, overdense environments, and measure clustering statistics of the large scale structure. Typically, zoom simulations of individual regions are used to study rare environments, and semi-analytic models and halo occupation models applied to dark matter only (DMO) simulations are used to study the Universe in the large-volume regime. We propose a new approach, using a machine learning framework to explore the halo-galaxy relationship in the periodic EAGLE simulations, and zoom C-EAGLE simulations of galaxy clusters. We train a tree based machine learning method to predict the baryonic properties of galaxies based on their host dark matter halo properties. The trained model successfully reproduces a number of key distribution functions for an infinitesimal fraction of the computational cost of a full hydrodynamic simulation. By training on both periodic simulations as well as zooms of overdense environments, we learn the bias of galaxy evolution in differing environments. This allows us to apply the trained model to a larger DMO volume than would be possible if we only trained on a periodic simulation. We demonstrate this application using the $(800 \; \mathrm{Mpc})^3$ P-Millennium simulation, and present predictions for key baryonic distribution functions and clustering statistics from the EAGLE model in this large volume.
△ Less
Submitted 2 May, 2023; v1 submitted 9 June, 2021;
originally announced June 2021.
-
Scalable Bayesian Deep Learning with Kernel Seed Networks
Authors:
Sam Maksoud,
Kun Zhao,
Can Peng,
Brian C. Lovell
Abstract:
This paper addresses the scalability problem of Bayesian deep neural networks. The performance of deep neural networks is undermined by the fact that these algorithms have poorly calibrated measures of uncertainty. This restricts their application in high risk domains such as computer aided diagnosis and autonomous vehicle navigation. Bayesian Deep Learning (BDL) offers a promising method for repr…
▽ More
This paper addresses the scalability problem of Bayesian deep neural networks. The performance of deep neural networks is undermined by the fact that these algorithms have poorly calibrated measures of uncertainty. This restricts their application in high risk domains such as computer aided diagnosis and autonomous vehicle navigation. Bayesian Deep Learning (BDL) offers a promising method for representing uncertainty in neural network. However, BDL requires a separate set of parameters to store the mean and standard deviation of model weights to learn a distribution. This results in a prohibitive 2-fold increase in the number of model parameters. To address this problem we present a method for performing BDL, namely Kernel Seed Networks (KSN), which does not require a 2-fold increase in the number of parameters. KSNs use 1x1 Convolution operations to learn a compressed latent space representation of the parameter distribution. In this paper we show how this allows KSNs to outperform conventional BDL methods while reducing the number of required parameters by up to a factor of 6.6.
△ Less
Submitted 18 April, 2021;
originally announced April 2021.
-
Cosmic evolution of the H2 mass density and the epoch of molecular gas
Authors:
T. K. Garratt,
K. E. K. Coppin,
J. E. Geach,
O. Almaini,
W. G. Hartley,
D. T. Maltby,
C. J. Simpson,
A. Wilkinson,
C. J. Conselice,
M. Franco,
R. J. Ivison,
M. P. Koprowski,
C. C. Lovell,
A. Pope,
D. Scott,
P. van der Werf
Abstract:
We present new empirical constraints on the evolution of $ρ_{\rm H_2}$, the cosmological mass density of molecular hydrogen, back to $z\approx2.5$. We employ a statistical approach measuring the average observed $850μ{\rm m}$ flux density of near-infrared selected galaxies as a function of redshift. The redshift range considered corresponds to a span where the $850μ{\rm m}$ band probes the Rayleig…
▽ More
We present new empirical constraints on the evolution of $ρ_{\rm H_2}$, the cosmological mass density of molecular hydrogen, back to $z\approx2.5$. We employ a statistical approach measuring the average observed $850μ{\rm m}$ flux density of near-infrared selected galaxies as a function of redshift. The redshift range considered corresponds to a span where the $850μ{\rm m}$ band probes the Rayleigh-Jeans tail of thermal dust emission in the rest-frame, and can therefore be used as an estimate of the mass of the interstellar medium (ISM). Our sample comprises of ${\approx}150,000$ galaxies in the UKIDSS-UDS field with near-infrared magnitudes $K_{\rm AB}\leq25$ mag and photometric redshifts with corresponding probability distribution functions derived from deep 12-band photometry. With a sample approximately 2 orders of magnitude larger than in previous works we significantly reduce statistical uncertainties on $ρ_{\rm H_2}$ to $z\approx2.5$. Our measurements are in broad agreement with recent direct estimates from blank field molecular gas surveys, finding that the epoch of molecular gas coincides with the peak epoch of star formation with $ρ_{\rm H_2}\approx2\times10^7\,{\rm M_\odot}\,{\rm Mpc^{-3}}$ at $z\approx2$. We demonstrate that $ρ_{\rm H_2}$ can be broadly modelled by inverting the star-formation rate density with a fixed or weakly evolving star-formation efficiency. This 'constant efficiency' model shows a similar evolution to our statistically derived $ρ_{\rm H_2}$, indicating that the dominant factor driving the peak star formation history at $z\approx2$ is a larger supply of molecular gas in galaxies rather than a significant evolution of the star-formation rate efficiency within individual galaxies.
△ Less
Submitted 15 March, 2021;
originally announced March 2021.
-
SID: Incremental Learning for Anchor-Free Object Detection via Selective and Inter-Related Distillation
Authors:
Can Peng,
Kun Zhao,
Sam Maksoud,
Meng Li,
Brian C. Lovell
Abstract:
Incremental learning requires a model to continually learn new tasks from streaming data. However, traditional fine-tuning of a well-trained deep neural network on a new task will dramatically degrade performance on the old task -- a problem known as catastrophic forgetting. In this paper, we address this issue in the context of anchor-free object detection, which is a new trend in computer vision…
▽ More
Incremental learning requires a model to continually learn new tasks from streaming data. However, traditional fine-tuning of a well-trained deep neural network on a new task will dramatically degrade performance on the old task -- a problem known as catastrophic forgetting. In this paper, we address this issue in the context of anchor-free object detection, which is a new trend in computer vision as it is simple, fast, and flexible. Simply adapting current incremental learning strategies fails on these anchor-free detectors due to lack of consideration of their specific model structures. To deal with the challenges of incremental learning on anchor-free object detectors, we propose a novel incremental learning paradigm called Selective and Inter-related Distillation (SID). In addition, a novel evaluation metric is proposed to better assess the performance of detectors under incremental learning conditions. By selective distilling at the proper locations and further transferring additional instance relation knowledge, our method demonstrates significant advantages on the benchmark datasets PASCAL VOC and COCO.
△ Less
Submitted 30 December, 2020;
originally announced December 2020.
-
Debunking Generalization Error or: How I Learned to Stop Worrying and Love My Training Set
Authors:
Viviana Acquaviva,
Chistopher Lovell,
Emille Ishida
Abstract:
We aim to determine some physical properties of distant galaxies (for example, stellar mass, star formation history, or chemical enrichment history) from their observed spectra, using supervised machine learning methods. We know that different astrophysical processes leave their imprint in various regions of the spectra with characteristic signatures. Unfortunately, identifying a training set for…
▽ More
We aim to determine some physical properties of distant galaxies (for example, stellar mass, star formation history, or chemical enrichment history) from their observed spectra, using supervised machine learning methods. We know that different astrophysical processes leave their imprint in various regions of the spectra with characteristic signatures. Unfortunately, identifying a training set for this problem is very hard, because labels are not readily available - we have no way of knowing the true history of how galaxies have formed. One possible approach to this problem is to train machine learning models on state-of-the-art cosmological simulations. However, when algorithms are trained on the simulations, it is unclear how well they will perform once applied to real data. In this paper, we attempt to model the generalization error as a function of an appropriate measure of distance between the source domain and the application domain. Our goal is to obtain a reliable estimate of how a model trained on simulations might behave on data.
△ Less
Submitted 30 November, 2020;
originally announced December 2020.
-
The emergence of passive galaxies in the early Universe
Authors:
P. Santini,
M. Castellano,
E. Merlin,
A. Fontana,
F. Fortuni,
D. Kodra,
B. Magnelli,
N. Menci,
A. Calabrò,
C. C. Lovell,
L. Pentericci,
V. Testa,
S. M. Wilkins
Abstract:
The emergence of passive galaxies in the early Universe results from the interplay among the processes responsible for their rapid assembly and for the abrupt shut-down of their SF. Investigating the individual properties and demographics of early passive galaxies will improve our understanding of these mechanisms. In this work we present a follow-up analysis of the z>3 passive galaxy candidates s…
▽ More
The emergence of passive galaxies in the early Universe results from the interplay among the processes responsible for their rapid assembly and for the abrupt shut-down of their SF. Investigating the individual properties and demographics of early passive galaxies will improve our understanding of these mechanisms. In this work we present a follow-up analysis of the z>3 passive galaxy candidates selected by Merlin et al. (2019) in the CANDELS fields. We begin by first confirming the accuracy of their passive classification by exploiting their sub-mm emission to demonstrate the lack of ongoing SF. Using archival ALMA observations we are able to confirm at least 61% of the observed candidates as passive. While the remainder lack sufficiently deep data for confirmation, we are able to validate the entire sample in a statistical sense. We then estimate the Stellar Mass Function (SMF) of all 101 passive candidates in three redshift bins from z=5 to z=3. We adopt a stepwise approach that has the advantage of taking into account photometric errors, observational incompleteness, and the Eddington bias without any a-posteriori correction. We observe a pronounced evolution in the SMF around z~4, indicating that we are witnessing the emergence of the passive population at this epoch. Massive (M>10^11Msun) passive galaxies, only accounting for a small (<10%) fraction of galaxies at z>4, become dominant at later epochs. Thanks to a combination of photometric quality, sample selection and methodology, we overall find a higher density of passive galaxies than previous works. The comparison with theoretical predictions, despite a qualitative agreement, denotes a still incomplete understanding of the physical processes responsible for the formation of these galaxies. Finally, we extrapolate our results to predict the number of early passive galaxies expected in surveys carried out with future facilities.
△ Less
Submitted 11 May, 2021; v1 submitted 20 November, 2020;
originally announced November 2020.
-
First Light And Reionisation Epoch Simulations (FLARES) II: The Photometric Properties of High-Redshift Galaxies
Authors:
Aswin P. Vijayan,
Christopher C. Lovell,
Stephen M. Wilkins,
Peter A. Thomas,
David J. Barnes,
Dimitrios Irodotou,
Jussi Kuusisto,
Will Roper
Abstract:
We present the photometric properties of galaxies in the First Light and Reionisation Epoch Simulations (FLARES). The simulations trace the evolution of galaxies in a range of overdensities through the Epoch of Reionistion (EoR). With a novel weighting scheme we combine these overdensities, extending significantly the dynamic range of observed composite distribution functions compared to periodic…
▽ More
We present the photometric properties of galaxies in the First Light and Reionisation Epoch Simulations (FLARES). The simulations trace the evolution of galaxies in a range of overdensities through the Epoch of Reionistion (EoR). With a novel weighting scheme we combine these overdensities, extending significantly the dynamic range of observed composite distribution functions compared to periodic simulation boxes. FLARES predicts a significantly larger number of intrinsically bright galaxies, which can be explained through a simple model linking dust-attenuation to the metal content of the interstellar medium, using a line-of-sight (LOS) extinction model. With this model we present the photometric properties of the FLARES galaxies for $z \in [5,10]$. We show that the ultraviolet (UV) luminosity function (LF) matches the observations at all redshifts. The function is fit by Schechter and double power-law forms, with the latter being favoured at these redshifts by the FLARES composite UV LF. We also present predictions for the UV continuum slope as well as the attenuation in the UV. The impact of environment on the UV LF is also explored, with the brightest galaxies forming in the densest environments. We then present the line luminosity and equivalent widths of some prominent nebular emission lines arising from the galaxies, finding rough agreement with available observations. We also look at the relative contribution of obscured and unobscured star formation, finding comparable contributions at these redshifts.
△ Less
Submitted 14 December, 2020; v1 submitted 13 August, 2020;
originally announced August 2020.
-
Reproducing sub-millimetre galaxy number counts with cosmological hydrodynamic simulations
Authors:
Christopher C. Lovell,
James E. Geach,
Romeel Davé,
Desika Narayanan,
Qi Li
Abstract:
Matching the number counts of high-$z$ sub-millimetre-selected galaxies (SMGs) has been a long standing problem for galaxy formation models. In this paper, we use 3D dust radiative transfer to model the sub-mm emission from galaxies in the SIMBA cosmological hydrodynamic simulations, and compare predictions to the latest single-dish observational constraints on the abundance of 850$\mathrm{μm}$-se…
▽ More
Matching the number counts of high-$z$ sub-millimetre-selected galaxies (SMGs) has been a long standing problem for galaxy formation models. In this paper, we use 3D dust radiative transfer to model the sub-mm emission from galaxies in the SIMBA cosmological hydrodynamic simulations, and compare predictions to the latest single-dish observational constraints on the abundance of 850$\mathrm{μm}$-selected sources. We find good agreement with the shape of the integrated 850$\mathrm{μm}$ luminosity function, and the normalisation is within 0.25 dex at $> 3 \; \mathrm{mJy}$, unprecedented for a fully cosmological hydrodynamic simulation, along with good agreement in the redshift distribution of bright SMGs. The agreement is driven primarily by SIMBA's good match to infrared measures of the star formation rate (SFR) function between $z = 2-4$ at high SFRs. Also important is the self-consistent on-the-fly dust model in SIMBA, which predicts, on average, higher dust masses (by up to a factor of 2.5) compared to using a fixed dust-to-metals ratio of 0.3. We construct a lightcone to investigate the effect of far-field blending, and find that 52% of sources are blends of multiple components, which makes a small contribution to the normalisation of the bright-end of the number counts. We provide new fits to the 850$\mathrm{μm}$ luminosity as a function of SFR and dust mass. Our results demonstrate that exotic solutions to the discrepancy between sub-mm counts in simulations and observations, such as a top-heavy IMF, are unnecessary, and that sub-millimetre-bright phases are a natural consequence of massive galaxy evolution.
△ Less
Submitted 11 January, 2021; v1 submitted 26 June, 2020;
originally announced June 2020.
-
Powderday: Dust Radiative Transfer for Galaxy Simulations
Authors:
Desika Narayanan,
Matthew J. Turk,
Thomas Robitaille,
Ashley J. Kelly,
B. Connor McClellan,
Ray S. Sharma,
Prerak Garg,
Matthew Abruzzo,
Ena Choi,
Charlie Conroy,
Benjamin D. Johnson,
Benjamin Kimock,
Qi Li,
Christopher C. Lovell,
Sidney Lower,
George C. Privon,
Jonathan Roberts,
Snigdaa Sethuram,
Gregory F. Snyder,
Robert Thompson,
John H. Wise
Abstract:
We present Powderday, a flexible, fast, open-source dust radiative transfer package designed to interface with galaxy formation simulations. Powderday builds on FSPS population synthesis models, Hyperion dust radiative transfer, and employs yt to interface between different software packages. We include our stellar population synthesis modeling on the fly, which allows for significant run-time fle…
▽ More
We present Powderday, a flexible, fast, open-source dust radiative transfer package designed to interface with galaxy formation simulations. Powderday builds on FSPS population synthesis models, Hyperion dust radiative transfer, and employs yt to interface between different software packages. We include our stellar population synthesis modeling on the fly, which allows for significant run-time flexibility in the assumed stellar physics. We include a model for nebular line emission that can employ either precomputed Cloudy lookup tables (for efficiency), or direct photoionization calculations for all young stars (for flexibility). The dust content follows either observationally-motivated prescriptions, direct modeling from galaxy formation simulations, or a novel approach that includes the dust content via learning-based algorithms from the SIMBA cosmological galaxy formation simulation. AGN can additionally be included via a range of prescriptions. The output of these models are broadband SEDs, as well as filter-convolved images. Powderday is designed to eliminate last-mile efforts by researchers that employ different hydrodynamic galaxy formation models, and seamlessly interfaces with GIZMO, AREPO, GASOLINE, CHANGA, and ENZO. We demonstrate the capabilities of the code via three applications: a model for the star formation rate (SFR) - infrared luminosity relation in galaxies (including the impact of AGN); the impact of circumstellar dust around AGB stars on the mid-infrared emission from galaxy SEDs; and the impact of galaxy inclination angle on dust attenuation laws.
△ Less
Submitted 18 June, 2020;
originally announced June 2020.
-
First Light And Reionisation Epoch Simulations (FLARES) I: Environmental Dependence of High-Redshift Galaxy Evolution
Authors:
Christopher C. Lovell,
Aswin P. Vijayan,
Peter A. Thomas,
Stephen M. Wilkins,
David J. Barnes,
Dimitrios Irodotou,
Will Roper
Abstract:
We introduce the First Light And Reionisation Epoch Simulations (FLARES), a suite of zoom simulations using the EAGLE model. We resimulate a range of overdensities during the Epoch of Reionisation (EoR) in order to build composite distribution functions, as well as explore the environmental dependence of galaxy formation and evolution during this critical period of galaxy assembly. The regions are…
▽ More
We introduce the First Light And Reionisation Epoch Simulations (FLARES), a suite of zoom simulations using the EAGLE model. We resimulate a range of overdensities during the Epoch of Reionisation (EoR) in order to build composite distribution functions, as well as explore the environmental dependence of galaxy formation and evolution during this critical period of galaxy assembly. The regions are selected from a large $(3.2 \;\mathrm{cGpc})^{3}$ parent volume, based on their overdensity within a sphere of radius $14\,h^{-1}\;\mathrm{cMpc}$. We then resimulate with full hydrodynamics, and employ a novel weighting scheme that allows the construction of composite distribution functions that are representative of the full parent volume. This significantly extends the dynamic range compared to smaller volume periodic simulations. We present an analysis of the galaxy stellar mass function (GSMF), the star formation rate distribution function (SFRF) and the star forming sequence (SFS) predicted by \flares, and compare to a number of observational and model constraints. We also analyse the environmental dependence over an unprecedented range of overdensity. Both the GSMF and the SFRF exhibit a clear double-Schechter form, up to the highest redshifts ($z = 10$). We also find no environmental dependence of the SFS normalisation. The increased dynamic range probed by FLARES will allow us to make predictions for a number of large area surveys that will probe the EoR in coming years, such as WFIRST and Euclid.
△ Less
Submitted 2 September, 2020; v1 submitted 15 April, 2020;
originally announced April 2020.
-
Faster ILOD: Incremental Learning for Object Detectors based on Faster RCNN
Authors:
Can Peng,
Kun Zhao,
Brian C. Lovell
Abstract:
The human vision and perception system is inherently incremental where new knowledge is continually learned over time whilst existing knowledge is retained. On the other hand, deep learning networks are ill-equipped for incremental learning. When a well-trained network is adapted to new categories, its performance on the old categories will dramatically degrade. To address this problem, incrementa…
▽ More
The human vision and perception system is inherently incremental where new knowledge is continually learned over time whilst existing knowledge is retained. On the other hand, deep learning networks are ill-equipped for incremental learning. When a well-trained network is adapted to new categories, its performance on the old categories will dramatically degrade. To address this problem, incremental learning methods have been explored which preserve the old knowledge of deep learning models. However, the state-of-the-art incremental object detector employs an external fixed region proposal method that increases overall computation time and reduces accuracy comparing to Region Proposal Network (RPN) based object detectors such as Faster RCNN. The purpose of this paper is to design an efficient end-to-end incremental object detector using knowledge distillation. We first evaluate and analyze the performance of the RPN-based detector with classic distillation on incremental detection tasks. Then, we introduce multi-network adaptive distillation that properly retains knowledge from the old categories when fine-tuning the model for new task. Experiments on the benchmark datasets, PASCAL VOC and COCO, demonstrate that the proposed incremental detector based on Faster RCNN is more accurate as well as being 13 times faster than the baseline detector.
△ Less
Submitted 6 October, 2020; v1 submitted 8 March, 2020;
originally announced March 2020.
-
Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation
Authors:
Siqi Yang,
Lin Wu,
Arnold Wiliem,
Brian C. Lovell
Abstract:
We present a novel approach to perform the unsupervised domain adaptation for object detection through forward-backward cyclic (FBC) training. Recent adversarial training based domain adaptation methods have shown their effectiveness on minimizing domain discrepancy via marginal feature distributions alignment. However, aligning the marginal feature distributions does not guarantee the alignment o…
▽ More
We present a novel approach to perform the unsupervised domain adaptation for object detection through forward-backward cyclic (FBC) training. Recent adversarial training based domain adaptation methods have shown their effectiveness on minimizing domain discrepancy via marginal feature distributions alignment. However, aligning the marginal feature distributions does not guarantee the alignment of class conditional distributions. This limitation is more evident when adapting object detectors as the domain discrepancy is larger compared to the image classification task, e.g. various number of objects exist in one image and the majority of content in an image is the background. This motivates us to learn domain invariance for category level semantics via gradient alignment. Intuitively, if the gradients of two domains point in similar directions, then the learning of one domain can improve that of another domain. To achieve gradient alignment, we propose Forward-Backward Cyclic Adaptation, which iteratively computes adaptation from source to target via backward hopping and from target to source via forward passing. In addition, we align low-level features for adapting holistic color/texture via adversarial training. However, the detector performs well on both domains is not ideal for target domain. As such, in each cycle, domain diversity is enforced by maximum entropy regularization on the source domain to penalize confident source-specific learning and minimum entropy regularization on target domain to intrigue target-specific learning. Theoretical analysis of the training process is provided, and extensive experiments on challenging cross-domain object detection datasets have shown the superiority of our approach over the state-of-the-art.
△ Less
Submitted 3 February, 2020;
originally announced February 2020.
-
Sengi: a small, fast, interactive viewer for spectral outputs from stellar population synthesis models
Authors:
Christopher C. Lovell
Abstract:
We present Sengi, https://christopherlovell.github.io/sengi , an online tool for viewing the spectral outputs of stellar population synthesis (SPS) codes. Typical SPS codes require significant disk space or computing resources to produce spectra for simple stellar populations with arbitrary parameters. This makes it difficult to present their results in an interactive, web-friendly format. Sengi u…
▽ More
We present Sengi, https://christopherlovell.github.io/sengi , an online tool for viewing the spectral outputs of stellar population synthesis (SPS) codes. Typical SPS codes require significant disk space or computing resources to produce spectra for simple stellar populations with arbitrary parameters. This makes it difficult to present their results in an interactive, web-friendly format. Sengi uses Non-negative Matrix Factorisation (NMF) and bilinear interpolation to estimate output spectra for arbitrary values of stellar age and metallicity. The reduced disk requirements and computational expense allows the result to be served as a client-based Javascript application. In this paper we present the method for generating grids of spectra, fitting those grids with NMF, bilinear interpolation across the fitted coefficients, and finally provide estimates of the prediction and interpolation errors.
△ Less
Submitted 16 December, 2020; v1 submitted 28 November, 2019;
originally announced November 2019.
-
Recalibrating the Cosmic Star Formation History
Authors:
Stephen M. Wilkins,
Christopher C. Lovell,
Elizabeth R. Stanway
Abstract:
The calibrations linking observed luminosities to the star formation rate depend on the assumed stellar population synthesis model, initial mass function, star formation and metal enrichment history, and whether reprocessing by dust and gas is included. Consequently the shape and normalisation of the inferred cosmic star formation history is sensitive to these assumptions. Using v2.2.1 of the Bina…
▽ More
The calibrations linking observed luminosities to the star formation rate depend on the assumed stellar population synthesis model, initial mass function, star formation and metal enrichment history, and whether reprocessing by dust and gas is included. Consequently the shape and normalisation of the inferred cosmic star formation history is sensitive to these assumptions. Using v2.2.1 of the Binary Population and Spectral Synthesis (\bpass) model we determine a new set of calibration coefficients for the ultraviolet, thermal-infrared, and, hydrogen recombination lines. These ultraviolet and thermal infrared coefficients are 0.15-0.2 dex higher than those widely utilised in the literature while the H$α$ coefficient is $\sim 0.35$ dex larger. These differences arise in part due to the inclusion binary evolution pathways but predominantly reflect an extension in the IMF to 300 $M_{\odot}$ and a change in the choice of reference metallicity. We use these new coefficients to recalibrate the cosmic star formation history, and find improved agreement between the integrated cosmic star formation history and the in-situ measured stellar mass density as a function of redshift. However, these coefficients produce new tension between star formation rate densities inferred from the ultraviolet and thermal-infrared and those from H$α$.
△ Less
Submitted 11 October, 2019;
originally announced October 2019.
-
To What Extent Does Downsampling, Compression, and Data Scarcity Impact Renal Image Analysis?
Authors:
Can Peng,
Kun Zhao,
Arnold Wiliem,
Teng Zhang,
Peter Hobson,
Anthony Jennings,
Brian C. Lovell
Abstract:
The condition of the Glomeruli, or filter sacks, in renal Direct Immunofluorescence (DIF) specimens is a critical indicator for diagnosing kidney diseases. A digital pathology system which digitizes a glass histology slide into a Whole Slide Image (WSI) and then automatically detects and zooms in on the glomeruli with a higher magnification objective will be extremely helpful for pathologists. In…
▽ More
The condition of the Glomeruli, or filter sacks, in renal Direct Immunofluorescence (DIF) specimens is a critical indicator for diagnosing kidney diseases. A digital pathology system which digitizes a glass histology slide into a Whole Slide Image (WSI) and then automatically detects and zooms in on the glomeruli with a higher magnification objective will be extremely helpful for pathologists. In this paper, using glomerulus detection as the study case, we provide analysis and observations on several important issues to help with the development of Computer Aided Diagnostic (CAD) systems to process WSIs. Large image resolution, large file size, and data scarcity are always challenging to deal with. To this end, we first examine image downsampling rates in terms of their effect on detection accuracy. Second, we examine the impact of image compression. Third, we examine the relationship between the size of the training set and detection accuracy. To understand the above issues, experiments are performed on the state-of-the-art detectors: Faster R-CNN, R-FCN, Mask R-CNN and SSD. Critical findings are observed: (1) The best balance between detection accuracy, detection speed and file size is achieved at 8 times downsampling captured with a $40\times$ objective; (2) compression which reduces the file size dramatically, does not necessarily have an adverse effect on overall accuracy; (3) reducing the amount of training data to some extents causes a drop in precision but has a negligible impact on the recall; (4) in most cases, Faster R-CNN achieves the best accuracy in the glomerulus detection task. We show that the image file size of $40\times$ WSI images can be reduced by a factor of over 6000 with negligible loss of glomerulus detection accuracy.
△ Less
Submitted 22 September, 2019;
originally announced September 2019.
-
Deep Instance-Level Hard Negative Mining Model for Histopathology Images
Authors:
Meng Li,
Lin Wu,
Arnold Wiliem,
Kun Zhao,
Teng Zhang,
Brian C. Lovell
Abstract:
Histopathology image analysis can be considered as a Multiple instance learning (MIL) problem, where the whole slide histopathology image (WSI) is regarded as a bag of instances (i.e, patches) and the task is to predict a single class label to the WSI. However, in many real-life applications such as computational pathology, discovering the key instances that trigger the bag label is of great inter…
▽ More
Histopathology image analysis can be considered as a Multiple instance learning (MIL) problem, where the whole slide histopathology image (WSI) is regarded as a bag of instances (i.e, patches) and the task is to predict a single class label to the WSI. However, in many real-life applications such as computational pathology, discovering the key instances that trigger the bag label is of great interest because it provides reasons for the decision made by the system. In this paper, we propose a deep convolutional neural network (CNN) model that addresses the primary task of a bag classification on a WSI and also learns to identify the response of each instance to provide interpretable results to the final prediction. We incorporate the attention mechanism into the proposed model to operate the transformation of instances and learn attention weights to allow us to find key patches. To perform a balanced training, we introduce adaptive weighing in each training bag to explicitly adjust the weight distribution in order to concentrate more on the contribution of hard samples. Based on the learned attention weights, we further develop a solution to boost the classification performance by generating the bags with hard negative instances. We conduct extensive experiments on colon and breast cancer histopathology data and show that our framework achieves state-of-the-art performance.
△ Less
Submitted 26 June, 2019; v1 submitted 23 June, 2019;
originally announced June 2019.
-
CORAL8: Concurrent Object Regression for Area Localization in Medical Image Panels
Authors:
Sam Maksoud,
Arnold Wiliem,
Kun Zhao,
Teng Zhang,
Lin Wu,
Brian C. Lovell
Abstract:
This work tackles the problem of generating a medical report for multi-image panels. We apply our solution to the Renal Direct Immunofluorescence (RDIF) assay which requires a pathologist to generate a report based on observations across the eight different WSI in concert with existing clinical features. To this end, we propose a novel attention-based multi-modal generative recurrent neural networ…
▽ More
This work tackles the problem of generating a medical report for multi-image panels. We apply our solution to the Renal Direct Immunofluorescence (RDIF) assay which requires a pathologist to generate a report based on observations across the eight different WSI in concert with existing clinical features. To this end, we propose a novel attention-based multi-modal generative recurrent neural network (RNN) architecture capable of dynamically sampling image data concurrently across the RDIF panel. The proposed methodology incorporates text from the clinical notes of the requesting physician to regulate the output of the network to align with the overall clinical context. In addition, we found the importance of regularizing the attention weights for word generation processes. This is because the system can ignore the attention mechanism by assigning equal weights for all members. Thus, we propose two regularizations which force the system to utilize the attention mechanism. Experiments on our novel collection of RDIF WSIs provided by a large clinical laboratory demonstrate that our framework offers significant improvements over existing methods.
△ Less
Submitted 23 June, 2019;
originally announced June 2019.
-
Nebular Line Emission During the Epoch of Reionization
Authors:
Stephen M. Wilkins,
Christopher C. Lovell,
Ciaran Fairhurst,
Yu Feng,
Tiziana Di Matteo,
Rupert Croft,
Jussi Kuusisto,
Aswin P. Vijayan,
Peter Thomas
Abstract:
Nebular emission lines associated with galactic HII regions carry information about both physical properties of the ionised gas and the source of ionising photons as well as providing the opportunity of measuring accurate redshifts and thus distances once a cosmological model is assumed. While nebular line emission has been extensively studied at lower redshift there are currently only few constra…
▽ More
Nebular emission lines associated with galactic HII regions carry information about both physical properties of the ionised gas and the source of ionising photons as well as providing the opportunity of measuring accurate redshifts and thus distances once a cosmological model is assumed. While nebular line emission has been extensively studied at lower redshift there are currently only few constraints within the epoch of reionisation (EoR, $z>6$), chiefly due to the lack of sensitive near-IR spectrographs. However, this will soon change with the arrival of the Webb Telescope providing sensitive near-IR spectroscopy covering the rest-frame UV and optical emission of galaxies in the EoR. In anticipation of Webb we combine the large cosmological hydrodynamical simulation Bluetides with photoionisation modelling to predict the nebular emission line properties of galaxies at $z=8\to 13$. We find good agreement with the, albeit limited, existing direct and indirect observational constraints on equivalent widths though poorer agreement with luminosity function constraints.
△ Less
Submitted 26 March, 2020; v1 submitted 16 April, 2019;
originally announced April 2019.
-
Learning the Relationship between Galaxies Spectra and their Star Formation Histories using Convolutional Neural Networks and Cosmological Simulations
Authors:
Christopher C. Lovell,
Viviana Acquaviva,
Peter A. Thomas,
Kartheik G. Iyer,
Eric Gawiser,
Stephen M. Wilkins
Abstract:
We present a new method for inferring galaxy star formation histories (SFH) using machine learning methods coupled with two cosmological hydrodynamic simulations. We train Convolutional Neural Networks to learn the relationship between synthetic galaxy spectra and high resolution SFHs from the EAGLE and Illustris models. To evaluate our SFH reconstruction we use Symmetric Mean Absolute Percentage…
▽ More
We present a new method for inferring galaxy star formation histories (SFH) using machine learning methods coupled with two cosmological hydrodynamic simulations. We train Convolutional Neural Networks to learn the relationship between synthetic galaxy spectra and high resolution SFHs from the EAGLE and Illustris models. To evaluate our SFH reconstruction we use Symmetric Mean Absolute Percentage Error (SMAPE), which acts as a true percentage error in the low-error regime. On dust-attenuated spectra we achieve high test accuracy (median SMAPE $= 10.5\%$). Including the effects of simulated observational noise increases the error ($12.5\%$), however this is alleviated by including multiple realisations of the noise, which increases the training set size and reduces overfitting ($10.9\%$). We also make estimates for the observational and modelling errors. To further evaluate the generalisation properties we apply models trained on one simulation to spectra from the other, which leads to only a small increase in the error (median SMAPE $\sim 15\%$). We apply each trained model to SDSS DR7 spectra, and find smoother histories than in the VESPA catalogue. This new approach complements the results of existing SED fitting techniques, providing star formation histories directly motivated by the results of the latest cosmological simulations.
△ Less
Submitted 9 October, 2019; v1 submitted 25 March, 2019;
originally announced March 2019.
-
Convex Class Model on Symmetric Positive Definite Manifolds
Authors:
Kun Zhao,
Arnold Wiliem,
Shaokang Chen,
Brian C. Lovell
Abstract:
The effectiveness of Symmetric Positive Definite (SPD) manifold features has been proven in various computer vision tasks. However, due to the non-Euclidean geometry of these features, existing Euclidean machineries cannot be directly used. In this paper, we tackle the classification tasks with limited training data on SPD manifolds. Our proposed framework, named Manifold Convex Class Model, repre…
▽ More
The effectiveness of Symmetric Positive Definite (SPD) manifold features has been proven in various computer vision tasks. However, due to the non-Euclidean geometry of these features, existing Euclidean machineries cannot be directly used. In this paper, we tackle the classification tasks with limited training data on SPD manifolds. Our proposed framework, named Manifold Convex Class Model, represents each class on SPD manifolds using a convex model, and classification can be performed by computing distances to the convex models. We provide three methods based on different metrics to address the optimization problem of the smallest distance of a point to the convex model on SPD manifold. The efficacy of our proposed framework is demonstrated both on synthetic data and several computer vision tasks including object recognition, texture classification, person re-identification and traffic scene classification.
△ Less
Submitted 29 May, 2019; v1 submitted 13 June, 2018;
originally announced June 2018.
-
SlideNet: Fast and Accurate Slide Quality Assessment Based on Deep Neural Networks
Authors:
Teng Zhang,
Johanna Carvajal,
Daniel F. Smith,
Kun Zhao,
Arnold Wiliem,
Peter Hobson,
Anthony Jennings,
Brian C. Lovell
Abstract:
This work tackles the automatic fine-grained slide quality assessment problem for digitized direct smears test using the Gram staining protocol. Automatic quality assessment can provide useful information for the pathologists and the whole digital pathology workflow. For instance, if the system found a slide to have a low staining quality, it could send a request to the automatic slide preparation…
▽ More
This work tackles the automatic fine-grained slide quality assessment problem for digitized direct smears test using the Gram staining protocol. Automatic quality assessment can provide useful information for the pathologists and the whole digital pathology workflow. For instance, if the system found a slide to have a low staining quality, it could send a request to the automatic slide preparation system to remake the slide. If the system detects severe damage in the slides, it could notify the experts that manual microscope reading may be required. In order to address the quality assessment problem, we propose a deep neural network based framework to automatically assess the slide quality in a semantic way. Specifically, the first step of our framework is to perform dense fine-grained region classification on the whole slide and calculate the region distribution histogram. Next, our framework will generate assessments of the slide quality from various perspectives: staining quality, information density, damage level and which regions are more valuable for subsequent high-magnification analysis. To make the information more accessible, we present our results in the form of a heat map and text summaries. Additionally, in order to stimulate research in this direction, we propose a novel dataset for slide quality assessment. Experiments show that the proposed framework outperforms recent related works.
△ Less
Submitted 19 March, 2018;
originally announced March 2018.
-
Using LIP to Gloss Over Faces in Single-Stage Face Detection Networks
Authors:
Siqi Yang,
Arnold Wiliem,
Shaokang Chen,
Brian C. Lovell
Abstract:
This work shows that it is possible to fool/attack recent state-of-the-art face detectors which are based on the single-stage networks. Successfully attacking face detectors could be a serious malware vulnerability when deploying a smart surveillance system utilizing face detectors. We show that existing adversarial perturbation methods are not effective to perform such an attack, especially when…
▽ More
This work shows that it is possible to fool/attack recent state-of-the-art face detectors which are based on the single-stage networks. Successfully attacking face detectors could be a serious malware vulnerability when deploying a smart surveillance system utilizing face detectors. We show that existing adversarial perturbation methods are not effective to perform such an attack, especially when there are multiple faces in the input image. This is because the adversarial perturbation specifically generated for one face may disrupt the adversarial perturbation for another face. In this paper, we call this problem the Instance Perturbation Interference (IPI) problem. This IPI problem is addressed by studying the relationship between the deep neural network receptive field and the adversarial perturbation. As such, we propose the Localized Instance Perturbation (LIP) that uses adversarial perturbation constrained to the Effective Receptive Field (ERF) of a target to perform the attack. Experiment results show the LIP method massively outperforms existing adversarial perturbation generation methods -- often by a factor of 2 to 10.
△ Less
Submitted 4 July, 2018; v1 submitted 21 December, 2017;
originally announced December 2017.
-
TV-GAN: Generative Adversarial Network Based Thermal to Visible Face Recognition
Authors:
Teng Zhang,
Arnold Wiliem,
Siqi Yang,
Brian C. Lovell
Abstract:
This work tackles the face recognition task on images captured using thermal camera sensors which can operate in the non-light environment. While it can greatly increase the scope and benefits of the current security surveillance systems, performing such a task using thermal images is a challenging problem compared to face recognition task in the Visible Light Domain (VLD). This is partly due to t…
▽ More
This work tackles the face recognition task on images captured using thermal camera sensors which can operate in the non-light environment. While it can greatly increase the scope and benefits of the current security surveillance systems, performing such a task using thermal images is a challenging problem compared to face recognition task in the Visible Light Domain (VLD). This is partly due to the much smaller amount number of thermal imagery data collected compared to the VLD data. Unfortunately, direct application of the existing very strong face recognition models trained using VLD data into the thermal imagery data will not produce a satisfactory performance. This is due to the existence of the domain gap between the thermal and VLD images. To this end, we propose a Thermal-to-Visible Generative Adversarial Network (TV-GAN) that is able to transform thermal face images into their corresponding VLD images whilst maintaining identity information which is sufficient enough for the existing VLD face recognition models to perform recognition. Some examples are presented in Figure 1. Unlike the previous methods, our proposed TV-GAN uses an explicit closed-set face recognition loss to regularize the discriminator network training. This information will then be conveyed into the generator network in the forms of gradient loss. In the experiment, we show that by using this additional explicit regularization for the discriminator network, the TV-GAN is able to preserve more identity information when translating a thermal image of a person which is not seen before by the TV-GAN.
△ Less
Submitted 7 December, 2017;
originally announced December 2017.
-
Characterising and Identifying Galaxy Protoclusters
Authors:
Christopher C. Lovell,
Peter A. Thomas,
Stephen M. Wilkins
Abstract:
We study the characteristics of galaxy protoclusters using the latest L-galaxies semi-analytic model. Searching for protoclusters on a scale of $\sim 10 \, \mathrm{cMpc}$ gives an excellent compromise between the completeness and purity of their galaxy populations, leads to high distinction from the field in overdensity space, and allows accurate determination of the descendant cluster mass. This…
▽ More
We study the characteristics of galaxy protoclusters using the latest L-galaxies semi-analytic model. Searching for protoclusters on a scale of $\sim 10 \, \mathrm{cMpc}$ gives an excellent compromise between the completeness and purity of their galaxy populations, leads to high distinction from the field in overdensity space, and allows accurate determination of the descendant cluster mass. This scale is valid over a range of redshifts and selection criteria. We present a procedure for estimating, given a measured galaxy overdensity, the protocluster probability and its descendant cluster mass for a range of modelling assumptions, particularly taking into account the shape of the measurement aperture. This procedure produces lower protocluster probabilities compared to previous estimates using fixed size apertures. The relationship between AGN and protoclusters is also investigated, and shows significant evolution with redshift; at $z \sim 2$ the fraction of protoclusters traced by AGN is high, but the fraction of all AGN in protoclusters is low, whereas at $z \geqslant 5$ the fraction of protoclusters containing AGN is low, but most AGN are in protoclusters. We also find indirect evidence for the emergence of a passive sequence in protoclusters at $z \sim 2$, and note that a significant fraction of all galaxies reside in protoclusters at $z \geqslant 2$, particularly the most massive.
△ Less
Submitted 1 December, 2017; v1 submitted 5 October, 2017;
originally announced October 2017.
-
Dust Obscured Star Forming Galaxies in the Early Universe
Authors:
Stephen M. Wilkins,
Yu Feng,
Tiziana Di Matteo,
Rupert Croft,
Christopher C. Lovell,
Peter Thomas
Abstract:
Motivated by recent observational constraints on dust reprocessed emission in star forming galaxies at $z\sim 6$ and above we use the very-large cosmological hydrodynamical simulation \bluetides\ to explore predictions for the amount of dust obscured star formation in the early Universe ($z>8$). \bluetides\ matches current observational constraints on both the UV luminosity function and galaxy ste…
▽ More
Motivated by recent observational constraints on dust reprocessed emission in star forming galaxies at $z\sim 6$ and above we use the very-large cosmological hydrodynamical simulation \bluetides\ to explore predictions for the amount of dust obscured star formation in the early Universe ($z>8$). \bluetides\ matches current observational constraints on both the UV luminosity function and galaxy stellar mass function and predicts that approximately $90\%$ of the star formation in high-mass ($M_{*}>10^{10}\,{\rm M_{\odot}}$) galaxies at $z=8$ is already obscured by dust. The relationship between dust attenuation and stellar mass predicted by \bluetides\ is consistent with that observed at lower redshift. However, observations of several individual objects at $z>6$ are discrepant with the predictions, though it is possible their uncertainties may have been underestimated. We find that the predicted surface density of $z\ge 8$ sub-mm sources is below that accessible to current {\em Herschel}, SCUBA-2, and ALMA sub-mm surveys. However, as ALMA continues to accrue additional surface area the population of $z>8$ dust-obscured galaxies may become accessible in the near future.
△ Less
Submitted 5 October, 2017;
originally announced October 2017.
-
The properties of the first galaxies in the BLUETIDES simulation
Authors:
Stephen M. Wilkins,
Yu Feng,
Tiziana Di-Matteo,
Rupert Croft,
Christopher C. Lovell,
Dacen Waters
Abstract:
We employ the very large cosmological hydrodynamical simulation BLUETIDES to investigate the predicted properties of the galaxy population during the epoch of reionisation ($z>8$). BLUETIDES has a resolution and volume ($(400/h\approx 577)^{3}\,{\rm cMpc^3}$) providing a population of galaxies which is well matched to depth and area of current observational surveys targeting the high-redshift Univ…
▽ More
We employ the very large cosmological hydrodynamical simulation BLUETIDES to investigate the predicted properties of the galaxy population during the epoch of reionisation ($z>8$). BLUETIDES has a resolution and volume ($(400/h\approx 577)^{3}\,{\rm cMpc^3}$) providing a population of galaxies which is well matched to depth and area of current observational surveys targeting the high-redshift Universe. At $z=8$ BLUETIDES includes almost 160,000 galaxies with stellar masses $>10^{8}\,{\rm M_{\odot}}$. The population of galaxies predicted by BLUETIDES closely matches observational constraints on both the galaxy stellar mass function and far-UV ($150\,{\rm nm}$) luminosity function. Galaxies in BLUETIDES are characterised by rapidly increasing star formation histories. Specific star formation rates decrease with redshift though remain largely insensitive to stellar mass. As a result of the enhanced surface density of metals more massive galaxies are predicted to have higher dust attenuation resulting in a significant steepening of the observed far-UV luminosity function at high luminosities. The contribution of active SMBHs to the UV luminosities of galaxies with stellar masses $10^{9-10}\,{\rm M_{\odot}}$ is around $3\%$ on average. Approximately $25\%$ of galaxies with $M_{*}\approx 10^{10}\,{\rm M_{\odot}}$ are predicted to have active SMBH which contribute $>10\%$ of the total UV luminosity.
△ Less
Submitted 4 April, 2017;
originally announced April 2017.
-
What is the Best Way for Extracting Meaningful Attributes from Pictures?
Authors:
Liangchen Liu,
Arnold Wiliem,
Shaokang Chen,
Brian C. Lovell
Abstract:
Automatic attribute discovery methods have gained in popularity to extract sets of visual attributes from images or videos for various tasks. Despite their good performance in some classification tasks, it is difficult to evaluate whether the attributes discovered by these methods are meaningful and which methods are the most appropriate to discover attributes for visual descriptions. In its simpl…
▽ More
Automatic attribute discovery methods have gained in popularity to extract sets of visual attributes from images or videos for various tasks. Despite their good performance in some classification tasks, it is difficult to evaluate whether the attributes discovered by these methods are meaningful and which methods are the most appropriate to discover attributes for visual descriptions. In its simplest form, such an evaluation can be performed by manually verifying whether there is any consistent identifiable visual concept distinguishing between positive and negative exemplars labelled by an attribute. This manual checking is tedious, expensive and labour intensive. In addition, comparisons between different methods could also be problematic as it is not clear how one could quantitatively decide which attribute is more meaningful than the others. In this paper, we propose a novel attribute meaningfulness metric to address this challenging problem. With this metric, automatic quantitative evaluation can be performed on the attribute sets; thus, reducing the enormous effort to perform manual evaluation. The proposed metric is applied to some recent automatic attribute discovery and hashing methods on four attribute-labelled datasets. To further validate the efficacy of the proposed method, we conducted a user study. In addition, we also compared our metric with a semi-supervised attribute discover method using the mixture of probabilistic PCA. In our evaluation, we gleaned several insights that could be beneficial in developing new automatic attribute discovery methods.
△ Less
Submitted 16 October, 2016;
originally announced October 2016.
-
The Photometric Properties of Galaxies in the Early Universe
Authors:
Stephen M. Wilkins,
Yu Feng,
Tiziana Di-Matteo,
Rupert Croft,
Elizabeth R. Stanway,
Andrew Bunker,
Dacen Waters,
Christopher Lovell
Abstract:
We use the large cosmological hydro-dynamic simulation BlueTides to predict the photometric properties of galaxies during the epoch of reionisation ($z=8-15$). These properties include the rest-frame UV to near-IR broadband spectral energy distributions, the Lyman continuum photon production, the UV star formation rate calibration, and intrinsic UV continuum slope. In particular we focus on explor…
▽ More
We use the large cosmological hydro-dynamic simulation BlueTides to predict the photometric properties of galaxies during the epoch of reionisation ($z=8-15$). These properties include the rest-frame UV to near-IR broadband spectral energy distributions, the Lyman continuum photon production, the UV star formation rate calibration, and intrinsic UV continuum slope. In particular we focus on exploring the effect of various modelling assumptions, including the assumed choice of stellar population synthesis model, initial mass function, and the escape fraction of Lyman continuum photons, upon these quantities. We find that these modelling assumptions can have a dramatic effect on photometric properties leading to consequences for the accurate determination of physical properties from observations. For example, at $z=8$ we predict that nebular emission can account for up-to $50\%$ of the rest-frame $R$-band luminosity, while the choice of stellar population synthesis model can change the Lyman continuum production rate up to a factor of $\times 2$.
△ Less
Submitted 17 May, 2016;
originally announced May 2016.
-
Determining the best attributes for surveillance video keywords generation
Authors:
Liangchen Liu,
Arnold Wiliem,
Shaokang Chen,
Kun Zhao,
Brian C. Lovell
Abstract:
Automatic video keyword generation is one of the key ingredients in reducing the burden of security officers in analyzing surveillance videos. Keywords or attributes are generally chosen manually based on expert knowledge of surveillance. Most existing works primarily aim at either supervised learning approaches relying on extensive manual labelling or hierarchical probabilistic models that assume…
▽ More
Automatic video keyword generation is one of the key ingredients in reducing the burden of security officers in analyzing surveillance videos. Keywords or attributes are generally chosen manually based on expert knowledge of surveillance. Most existing works primarily aim at either supervised learning approaches relying on extensive manual labelling or hierarchical probabilistic models that assume the features are extracted using the bag-of-words approach; thus limiting the utilization of the other features. To address this, we turn our attention to automatic attribute discovery approaches. However, it is not clear which automatic discovery approach can discover the most meaningful attributes. Furthermore, little research has been done on how to compare and choose the best automatic attribute discovery methods. In this paper, we propose a novel approach, based on the shared structure exhibited amongst meaningful attributes, that enables us to compare between different automatic attribute discovery approaches.We then validate our approach by comparing various attribute discovery methods such as PiCoDeS on two attribute datasets. The evaluation shows that our approach is able to select the automatic discovery approach that discovers the most meaningful attributes. We then employ the best discovery approach to generate keywords for videos recorded from a surveillance system. This work shows it is possible to massively reduce the amount of manual work in generating video keywords without limiting ourselves to a particular video feature descriptor.
△ Less
Submitted 21 February, 2016;
originally announced February 2016.
-
Automatic and Quantitative evaluation of attribute discovery methods
Authors:
Liangchen Liu,
Arnold Wiliem,
Shaokang Chen,
Brian C. Lovell
Abstract:
Many automatic attribute discovery methods have been developed to extract a set of visual attributes from images for various tasks. However, despite good performance in some image classification tasks, it is difficult to evaluate whether these methods discover meaningful attributes and which one is the best to find the attributes for image descriptions. An intuitive way to evaluate this is to manu…
▽ More
Many automatic attribute discovery methods have been developed to extract a set of visual attributes from images for various tasks. However, despite good performance in some image classification tasks, it is difficult to evaluate whether these methods discover meaningful attributes and which one is the best to find the attributes for image descriptions. An intuitive way to evaluate this is to manually verify whether consistent identifiable visual concepts exist to distinguish between positive and negative images of an attribute. This manual checking is tedious, labor intensive and expensive and it is very hard to get quantitative comparisons between different methods. In this work, we tackle this problem by proposing an attribute meaningfulness metric, that can perform automatic evaluation on the meaningfulness of attribute sets as well as achieving quantitative comparisons. We apply our proposed metric to recent automatic attribute discovery methods and popular hashing methods on three attribute datasets. A user study is also conducted to validate the effectiveness of the metric. In our evaluation, we gleaned some insights that could be beneficial in developing automatic attribute discovery methods to generate meaningful attributes. To the best of our knowledge, this is the first work to quantitatively measure the semantic content of automatically discovered attributes.
△ Less
Submitted 5 February, 2016;
originally announced February 2016.
-
Efficient Clustering on Riemannian Manifolds: A Kernelised Random Projection Approach
Authors:
Kun Zhao,
Azadeh Alavi,
Arnold Wiliem,
Brian C. Lovell
Abstract:
Reformulating computer vision problems over Riemannian manifolds has demonstrated superior performance in various computer vision applications. This is because visual data often forms a special structure lying on a lower dimensional space embedded in a higher dimensional space. However, since these manifolds belong to non-Euclidean topological spaces, exploiting their structures is computationally…
▽ More
Reformulating computer vision problems over Riemannian manifolds has demonstrated superior performance in various computer vision applications. This is because visual data often forms a special structure lying on a lower dimensional space embedded in a higher dimensional space. However, since these manifolds belong to non-Euclidean topological spaces, exploiting their structures is computationally expensive, especially when one considers the clustering analysis of massive amounts of data. To this end, we propose an efficient framework to address the clustering problem on Riemannian manifolds. This framework implements random projections for manifold points via kernel space, which can preserve the geometric structure of the original space, but is computationally efficient. Here, we introduce three methods that follow our framework. We then validate our framework on several computer vision applications by comparing against popular clustering methods on Riemannian manifolds. Experimental results demonstrate that our framework maintains the performance of the clustering whilst massively reducing computational complexity by over two orders of magnitude in some cases.
△ Less
Submitted 18 September, 2015;
originally announced September 2015.
-
Multi-Action Recognition via Stochastic Modelling of Optical Flow and Gradients
Authors:
Johanna Carvajal,
Conrad Sanderson,
Chris McCool,
Brian C. Lovell
Abstract:
In this paper we propose a novel approach to multi-action recognition that performs joint segmentation and classification. This approach models each action using a Gaussian mixture using robust low-dimensional action features. Segmentation is achieved by performing classification on overlapping temporal windows, which are then merged to produce the final result. This approach is considerably less…
▽ More
In this paper we propose a novel approach to multi-action recognition that performs joint segmentation and classification. This approach models each action using a Gaussian mixture using robust low-dimensional action features. Segmentation is achieved by performing classification on overlapping temporal windows, which are then merged to produce the final result. This approach is considerably less complicated than previous methods which use dynamic programming or computationally expensive hidden Markov models (HMMs). Initial experiments on a stitched version of the KTH dataset show that the proposed approach achieves an accuracy of 78.3%, outperforming a recent HMM-based approach which obtained 71.2%.
△ Less
Submitted 5 February, 2015;
originally announced February 2015.
-
Discovering Discriminative Cell Attributes for HEp-2 Specimen Image Classification
Authors:
Arnold Wiliem,
Peter Hobson,
Brian C. Lovell
Abstract:
Recently, there has been a growing interest in developing Computer Aided Diagnostic (CAD) systems for improving the reliability and consistency of pathology test results. This paper describes a novel CAD system for the Anti-Nuclear Antibody (ANA) test via Indirect Immunofluorescence protocol on Human Epithelial Type 2 (HEp-2) cells. While prior works have primarily focused on classifying cell imag…
▽ More
Recently, there has been a growing interest in developing Computer Aided Diagnostic (CAD) systems for improving the reliability and consistency of pathology test results. This paper describes a novel CAD system for the Anti-Nuclear Antibody (ANA) test via Indirect Immunofluorescence protocol on Human Epithelial Type 2 (HEp-2) cells. While prior works have primarily focused on classifying cell images extracted from ANA specimen images, this work takes a further step by focussing on the specimen image classification problem itself. Our system is able to efficiently classify specimen images as well as producing meaningful descriptions of ANA pattern class which helps physicians to understand the differences between various ANA patterns. We achieve this goal by designing a specimen-level image descriptor that: (1) is highly discriminative; (2) has small descriptor length and (3) is semantically meaningful at the cell level. In our work, a specimen image descriptor is represented by its overall cell attribute descriptors. As such, we propose two max-margin based learning schemes to discover cell attributes whilst still maintaining the discrimination of the specimen image descriptor. Our learning schemes differ from the existing discriminative attribute learning approaches as they primarily focus on discovering image-level attributes. Comparative evaluations were undertaken to contrast the proposed approach to various state-of-the-art approaches on a novel HEp-2 cell dataset which was specifically proposed for the specimen-level classification. Finally, we showcase the ability of the proposed approach to provide textual descriptions to explain ANA patterns.
△ Less
Submitted 28 July, 2014;
originally announced July 2014.
-
MRF-based Background Initialisation for Improved Foreground Detection in Cluttered Surveillance Videos
Authors:
Vikas Reddy,
Conrad Sanderson,
Andres Sanin,
Brian C. Lovell
Abstract:
Robust foreground object segmentation via background modelling is a difficult problem in cluttered environments, where obtaining a clear view of the background to model is almost impossible. In this paper, we propose a method capable of robustly estimating the background and detecting regions of interest in such environments. In particular, we propose to extend the background initialisation compon…
▽ More
Robust foreground object segmentation via background modelling is a difficult problem in cluttered environments, where obtaining a clear view of the background to model is almost impossible. In this paper, we propose a method capable of robustly estimating the background and detecting regions of interest in such environments. In particular, we propose to extend the background initialisation component of a recent patch-based foreground detection algorithm with an elaborate technique based on Markov Random Fields, where the optimal labelling solution is computed using iterated conditional modes. Rather than relying purely on local temporal statistics, the proposed technique takes into account the spatial continuity of the entire background. Experiments with several tracking algorithms on the CAVIAR dataset indicate that the proposed method leads to considerable improvements in object tracking accuracy, when compared to methods based on Gaussian mixture models and feature histograms.
△ Less
Submitted 19 June, 2014;
originally announced June 2014.
-
Automatic Classification of Human Epithelial Type 2 Cell Indirect Immunofluorescence Images using Cell Pyramid Matching
Authors:
Arnold Wiliem,
Conrad Sanderson,
Yongkang Wong,
Peter Hobson,
Rodney F. Minchin,
Brian C. Lovell
Abstract:
This paper describes a novel system for automatic classification of images obtained from Anti-Nuclear Antibody (ANA) pathology tests on Human Epithelial type 2 (HEp-2) cells using the Indirect Immunofluorescence (IIF) protocol. The IIF protocol on HEp-2 cells has been the hallmark method to identify the presence of ANAs, due to its high sensitivity and the large range of antigens that can be detec…
▽ More
This paper describes a novel system for automatic classification of images obtained from Anti-Nuclear Antibody (ANA) pathology tests on Human Epithelial type 2 (HEp-2) cells using the Indirect Immunofluorescence (IIF) protocol. The IIF protocol on HEp-2 cells has been the hallmark method to identify the presence of ANAs, due to its high sensitivity and the large range of antigens that can be detected. However, it suffers from numerous shortcomings, such as being subjective as well as time and labour intensive. Computer Aided Diagnostic (CAD) systems have been developed to address these problems, which automatically classify a HEp-2 cell image into one of its known patterns (eg. speckled, homogeneous). Most of the existing CAD systems use handpicked features to represent a HEp-2 cell image, which may only work in limited scenarios. We propose a novel automatic cell image classification method termed Cell Pyramid Matching (CPM), which is comprised of regional histograms of visual words coupled with the Multiple Kernel Learning framework. We present a study of several variations of generating histograms and show the efficacy of the system on two publicly available datasets: the ICPR HEp-2 cell classification contest dataset and the SNPHEp-2 dataset.
△ Less
Submitted 15 March, 2014;
originally announced March 2014.
-
K-Tangent Spaces on Riemannian Manifolds for Improved Pedestrian Detection
Authors:
Andres Sanin,
Conrad Sanderson,
Mehrtash T. Harandi,
Brian C. Lovell
Abstract:
For covariance-based image descriptors, taking into account the curvature of the corresponding feature space has been shown to improve discrimination performance. This is often done through representing the descriptors as points on Riemannian manifolds, with the discrimination accomplished on a tangent space. However, such treatment is restrictive as distances between arbitrary points on the tange…
▽ More
For covariance-based image descriptors, taking into account the curvature of the corresponding feature space has been shown to improve discrimination performance. This is often done through representing the descriptors as points on Riemannian manifolds, with the discrimination accomplished on a tangent space. However, such treatment is restrictive as distances between arbitrary points on the tangent space do not represent true geodesic distances, and hence do not represent the manifold structure accurately. In this paper we propose a general discriminative model based on the combination of several tangent spaces, in order to preserve more details of the structure. The model can be used as a weak learner in a boosting-based pedestrian detection framework. Experiments on the challenging INRIA and DaimlerChrysler datasets show that the proposed model leads to considerably higher performance than methods based on histograms of oriented gradients as well as previous Riemannian-based techniques.
△ Less
Submitted 5 March, 2014;
originally announced March 2014.
-
Random Projections on Manifolds of Symmetric Positive Definite Matrices for Image Classification
Authors:
Azadeh Alavi,
Arnold Wiliem,
Kun Zhao,
Brian C. Lovell,
Conrad Sanderson
Abstract:
Recent advances suggest that encoding images through Symmetric Positive Definite (SPD) matrices and then interpreting such matrices as points on Riemannian manifolds can lead to increased classification performance. Taking into account manifold geometry is typically done via (1) embedding the manifolds in tangent spaces, or (2) embedding into Reproducing Kernel Hilbert Spaces (RKHS). While embeddi…
▽ More
Recent advances suggest that encoding images through Symmetric Positive Definite (SPD) matrices and then interpreting such matrices as points on Riemannian manifolds can lead to increased classification performance. Taking into account manifold geometry is typically done via (1) embedding the manifolds in tangent spaces, or (2) embedding into Reproducing Kernel Hilbert Spaces (RKHS). While embedding into tangent spaces allows the use of existing Euclidean-based learning algorithms, manifold shape is only approximated which can cause loss of discriminatory information. The RKHS approach retains more of the manifold structure, but may require non-trivial effort to kernelise Euclidean-based learning algorithms. In contrast to the above approaches, in this paper we offer a novel solution that allows SPD matrices to be used with unmodified Euclidean-based learning algorithms, with the true manifold shape well-preserved. Specifically, we propose to project SPD matrices using a set of random projection hyperplanes over RKHS into a random projection space, which leads to representing each matrix as a vector of projection coefficients. Experiments on face recognition, person re-identification and texture classification show that the proposed approach outperforms several recent methods, such as Tensor Sparse Coding, Histogram Plus Epitome, Riemannian Locality Preserving Projection and Relational Divergence Classification.
△ Less
Submitted 4 March, 2014;
originally announced March 2014.
-
Matching Image Sets via Adaptive Multi Convex Hull
Authors:
Shaokang Chen,
Arnold Wiliem,
Conrad Sanderson,
Brian C. Lovell
Abstract:
Traditional nearest points methods use all the samples in an image set to construct a single convex or affine hull model for classification. However, strong artificial features and noisy data may be generated from combinations of training samples when significant intra-class variations and/or noise occur in the image set. Existing multi-model approaches extract local models by clustering each imag…
▽ More
Traditional nearest points methods use all the samples in an image set to construct a single convex or affine hull model for classification. However, strong artificial features and noisy data may be generated from combinations of training samples when significant intra-class variations and/or noise occur in the image set. Existing multi-model approaches extract local models by clustering each image set individually only once, with fixed clusters used for matching with various image sets. This may not be optimal for discrimination, as undesirable environmental conditions (eg. illumination and pose variations) may result in the two closest clusters representing different characteristics of an object (eg. frontal face being compared to non-frontal face). To address the above problem, we propose a novel approach to enhance nearest points based methods by integrating affine/convex hull classification with an adapted multi-model approach. We first extract multiple local convex hulls from a query image set via maximum margin clustering to diminish the artificial variations and constrain the noise in local convex hulls. We then propose adaptive reference clustering (ARC) to constrain the clustering of each gallery image set by forcing the clusters to have resemblance to the clusters in the query image set. By applying ARC, noisy clusters in the query set can be discarded. Experiments on Honda, MoBo and ETH-80 datasets show that the proposed method outperforms single model approaches and other recent techniques, such as Sparse Approximated Nearest Points, Mutual Subspace Method and Manifold Discriminant Analysis.
△ Less
Submitted 3 March, 2014;
originally announced March 2014.