-
The kangaroo's first hop: the early fast cooling phase of EP250108a/SN 2025kg
Authors:
Rob A. J. Eyles-Ferris,
Peter G. Jonker,
Andrew J. Levan,
Daniele Bjørn Malesani,
Nikhil Sarin,
Christopher L. Fryer,
Jillian C. Rastinejad,
Eric Burns,
Nial R. Tanvir,
Paul T. O'Brien,
Wen-fai Fong,
Ilya Mandel,
Benjamin P. Gompertz,
Charles D. Kilpatrick,
Steven Bloemen,
Joe S. Bright,
Francesco Carotenuto,
Gregory Corcoran,
Laura Cotter,
Paul J. Groot,
Luca Izzo,
Tanmoy Laskar,
Antonio Martin-Carrillo,
Jesse Palmerio,
Maria E. Ravasio
, et al. (30 additional authors not shown)
Abstract:
Fast X-ray transients (FXTs) are a rare and poorly understood population of events. Previously difficult to detect in real time, the launch of the Einstein Probe with its wide field X-ray telescope has led to a rapid expansion in the sample and allowed the exploration of these transients across the electromagnetic spectrum. EP250108a is a recently detected example linked to an optical counterpart,…
▽ More
Fast X-ray transients (FXTs) are a rare and poorly understood population of events. Previously difficult to detect in real time, the launch of the Einstein Probe with its wide field X-ray telescope has led to a rapid expansion in the sample and allowed the exploration of these transients across the electromagnetic spectrum. EP250108a is a recently detected example linked to an optical counterpart, SN 2025kg, or 'the kangaroo'. Together with a companion paper (Rastinejad et al. 2025), we present our observing campaign and analysis of this event. In this letter, we focus on the early evolution of the optical counterpart over the first six days, including our measurement of the redshift of $z=0.17641$. We find that the source is well-modelled by a rapidly expanding cooling blackbody. We show the observed X-ray and radio properties are consistent with a collapsar-powered jet that is low energy ($\lesssim10^{51}$ erg) and/or fails to break out of the dense material surrounding it. The optical emission therefore likely arises from a shocked cocoon resulting from the trapped jet; however, we also examine the possibility that it emerges from the shock produced as the supernova ejecta expand into a dense shell of circumstellar material. We compare to other supernovae and fast transients showing similar features, finding significant similarities with SN 2006aj and SN 2020bvc. This suggests trapped jets could be more common than previously thought and SN 2025kg may herald a larger sample of similar transients.
△ Less
Submitted 7 May, 2025; v1 submitted 11 April, 2025;
originally announced April 2025.
-
Velocity evolution of broad-line Ic supernovae with and without gamma-ray bursts
Authors:
Gabriel Finneran,
Laura Cotter,
Antonio Martin-Carrillo
Abstract:
There are more than 60 broad-line Ic (Ic-BL) supernovae (SNe) which are associated with a long Gamma-ray Burst (GRB). A large population of `ordinary' Ic-BLs for which no GRB component is detected also exists. On average, the expansion velocities of GRB-associated Ic-BLs exceed those of ordinary Ic-BLs. This work presents the largest spectroscopic sample of Ic-BL SNe with and without GRBs to date.…
▽ More
There are more than 60 broad-line Ic (Ic-BL) supernovae (SNe) which are associated with a long Gamma-ray Burst (GRB). A large population of `ordinary' Ic-BLs for which no GRB component is detected also exists. On average, the expansion velocities of GRB-associated Ic-BLs exceed those of ordinary Ic-BLs. This work presents the largest spectroscopic sample of Ic-BL SNe with and without GRBs to date. The goal of this work is to investigate how the expansion velocities evolve in cases where an ultra-relativistic jet has been launched (GRB-SN cases) and compare these to Ic-BL SNe without a GRB detection. We measured the expansion velocities of the Fe II, Si II and Ca II lines observed in the spectra of Ic-BL SNe using a spline fitting method. We fit the expansion velocity evolution with single and broken power-laws. The expansion velocities of the Fe II and Si II features reveal considerable overlap between the two populations. It is not clear that GRB-associated supernovae expand more rapidly. Broken power-law evolution appears to be more common for the Si II feature, which always follows a shallow-steep decay, while the broken power-law Fe II decays are predominantly steep-shallow. The power-law indices for both samples were compared for both Fe II and Si II, and suggest that GRB-SNe decline at a similar rate to non-GRB Ic-BL supernovae. Neither the velocities nor their evolution can be used to distinguish between Ic-BLs with and without GRBs. Expansion velocities consistent with broken power-law evolution may indicate the presence of two velocity components, which may be evidence for a jet in some of these explosions. However, it is not possible to rule in or out the presence of a jet in any Ic-BL supernova purely based on the velocities. These results suggest that GRB-SNe and Ic-BLs are drawn from the same underlying population of events.
△ Less
Submitted 18 November, 2024;
originally announced November 2024.
-
The GRBSN webtool: An open-source repository for gamma-ray burst-supernova associations
Authors:
Gabriel Finneran,
Laura Cotter,
Antonio Martin-Carrillo
Abstract:
This paper presents the GRBSN webtool, an open-source data repository coupled to a web interface that hosts the most complete dataset of GRB-SN associations to date. In contrast to repositories of supernova (SN) or gamma-ray burst (GRB) data, this tool provides a multi-wavelength view of each GRB-SN association. GRBSN allows users to view and interact with plots of the data; search and filter the…
▽ More
This paper presents the GRBSN webtool, an open-source data repository coupled to a web interface that hosts the most complete dataset of GRB-SN associations to date. In contrast to repositories of supernova (SN) or gamma-ray burst (GRB) data, this tool provides a multi-wavelength view of each GRB-SN association. GRBSN allows users to view and interact with plots of the data; search and filter the whole database; and download radio, X-ray, optical/NIR photometric and spectroscopic data related to a GRB-SN association. The web interface code and GRB-SN data are hosted on a public GitHub repository, allowing users to upload their own data, flag missing data and suggest improvements. The GRBSN webtool will be maintained by the Space Science group at University College Dublin, Ireland. As the number of confirmed GRB-SN associations increases in the coming years, the GRBSN webtool will provide a robust framework in which to catalogue these associations and their associated data. The web interface is available at: https://grbsn.watchertelescope.ie.
△ Less
Submitted 27 April, 2025; v1 submitted 13 November, 2024;
originally announced November 2024.
-
A randomized multi-index sequential Monte Carlo method
Authors:
Xinzhu Liang,
Shangda Yang,
Simon L. Cotter,
Kody J. H. Law
Abstract:
We consider the problem of estimating expectations with respect to a target distribution with an unknown normalizing constant, and where even the unnormalized target needs to be approximated at finite resolution. Under such an assumption, this work builds upon a recently introduced multi-index Sequential Monte Carlo (SMC) ratio estimator, which provably enjoys the complexity improvements of multi-…
▽ More
We consider the problem of estimating expectations with respect to a target distribution with an unknown normalizing constant, and where even the unnormalized target needs to be approximated at finite resolution. Under such an assumption, this work builds upon a recently introduced multi-index Sequential Monte Carlo (SMC) ratio estimator, which provably enjoys the complexity improvements of multi-index Monte Carlo (MIMC) and the efficiency of SMC for inference. The present work leverages a randomization strategy to remove bias entirely, which simplifies estimation substantially, particularly in the MIMC context, where the choice of index set is otherwise important. Under reasonable assumptions, the proposed method provably achieves the same canonical complexity of MSE$^{-1}$ as the original method (where MSE is mean squared error), but without discretization bias. It is illustrated on examples of Bayesian inverse and spatial statistics problems.
△ Less
Submitted 28 June, 2023; v1 submitted 27 October, 2022;
originally announced October 2022.
-
Batch Bayesian Optimization via Particle Gradient Flows
Authors:
Enrico Crovini,
Simon L. Cotter,
Konstantinos Zygalakis,
Andrew B. Duncan
Abstract:
Bayesian Optimisation (BO) methods seek to find global optima of objective functions which are only available as a black-box or are expensive to evaluate. Such methods construct a surrogate model for the objective function, quantifying the uncertainty in that surrogate through Bayesian inference. Objective evaluations are sequentially determined by maximising an acquisition function at each step.…
▽ More
Bayesian Optimisation (BO) methods seek to find global optima of objective functions which are only available as a black-box or are expensive to evaluate. Such methods construct a surrogate model for the objective function, quantifying the uncertainty in that surrogate through Bayesian inference. Objective evaluations are sequentially determined by maximising an acquisition function at each step. However, this ancilliary optimisation problem can be highly non-trivial to solve, due to the non-convexity of the acquisition function, particularly in the case of batch Bayesian optimisation, where multiple points are selected in every step. In this work we reformulate batch BO as an optimisation problem over the space of probability measures. We construct a new acquisition function based on multipoint expected improvement which is convex over the space of probability measures. Practical schemes for solving this `inner' optimisation problem arise naturally as gradient flows of this objective function. We demonstrate the efficacy of this new method on different benchmark functions and compare with state-of-the-art batch BO methods.
△ Less
Submitted 9 January, 2023; v1 submitted 10 September, 2022;
originally announced September 2022.
-
Hierarchical Bayesian data selection
Authors:
Simon L. Cotter
Abstract:
There are many issues that can cause problems when attempting to infer model parameters from data. Data and models are both imperfect, and as such there are multiple scenarios in which standard methods of inference will lead to misleading conclusions; corrupted data, models which are only representative of subsets of the data, or multiple regions in which the model is best fit using different para…
▽ More
There are many issues that can cause problems when attempting to infer model parameters from data. Data and models are both imperfect, and as such there are multiple scenarios in which standard methods of inference will lead to misleading conclusions; corrupted data, models which are only representative of subsets of the data, or multiple regions in which the model is best fit using different parameters. Methods exist for the exclusion of some anomalous types of data, but in practice, data cleaning is often undertaken by hand before attempting to fit models to data. In this work, we will employ hierarchical Bayesian data selection; the simultaneous inference of both model parameters, and parameters which represent our belief that each observation within the data should be included in the inference. The aim, within a Bayesian setting, is to find the regions of observation space for which the model can well-represent the data, and to find the corresponding model parameters for those regions. A number of approaches will be explored, and applied to test problems in linear regression, and to the problem of fitting an ODE model, approximated by a finite difference method. The approaches are simple to implement, can aid mixing of Markov chains designed to sample from the arising densities, and are broadly applicable to many inferential problems.
△ Less
Submitted 30 April, 2024; v1 submitted 5 August, 2022;
originally announced August 2022.
-
Unlabelled landmark matching via Bayesian data selection, and application to cell matching across imaging modalities
Authors:
Jessica E. Forsyth,
Ali H. Al-Anbaki,
Berenika Plusa,
Simon L. Cotter
Abstract:
We consider the problem of landmark matching between two unlabelled point sets, in particular where the number of points in each cloud may differ, and where points in each cloud may not have a corresponding match. We invoke a Bayesian framework to identify the transformation of coordinates that maps one cloud to the other, alongside correspondence of the points. This problem necessitates a novel m…
▽ More
We consider the problem of landmark matching between two unlabelled point sets, in particular where the number of points in each cloud may differ, and where points in each cloud may not have a corresponding match. We invoke a Bayesian framework to identify the transformation of coordinates that maps one cloud to the other, alongside correspondence of the points. This problem necessitates a novel methodology for Bayesian data selection; simultaneous inference of model parameters, and selection of the data which leads to the best fit of the model to the majority of the data. We apply this to a problem in developmental biology where the landmarks correspond to segmented cell centres, where potential death or division of cells can lead to discrepancies between the point-sets from each image. We validate the efficacy of our approach using in silico tests and a microinjected fluorescent marker experiment. Subsequently we apply our approach to the matching of cells between real time imaging and immunostaining experiments, facilitating the combination of single-cell data between imaging modalities. Furthermore our approach to Bayesian data selection is broadly applicable across data science, and has the potential to change the way we think about fitting models to data.
△ Less
Submitted 31 May, 2022; v1 submitted 30 May, 2022;
originally announced May 2022.
-
Bayesian inference on a microstructural, hyperelastic model of tendon deformation
Authors:
James Haughton,
Simon L. Cotter,
William J. Parnell,
Tom Shearer
Abstract:
Microstructural models of soft tissue deformation are important in applications including artificial tissue design and surgical planning. The basis of these models, and their advantage over their phenomenological counterparts, is that they incorporate parameters that are directly linked to the tissue's microscale structure and constitutive behaviour and can therefore be used to predict the effects…
▽ More
Microstructural models of soft tissue deformation are important in applications including artificial tissue design and surgical planning. The basis of these models, and their advantage over their phenomenological counterparts, is that they incorporate parameters that are directly linked to the tissue's microscale structure and constitutive behaviour and can therefore be used to predict the effects of structural changes to the tissue. Although studies have attempted to determine such parameters using diverse, state-of-the-art, experimental techniques, values ranging over several orders of magnitude have been reported, leading to uncertainty in the true parameter values and creating a need for models that can handle such uncertainty. We derive a microstructural, hyperelastic model for transversely isotropic soft tissues and use it to model the mechanical behaviour of tendons. To account for parameter uncertainty, we employ a Bayesian approach and apply an adaptive Markov chain Monte Carlo algorithm to determine posterior probability distributions for the model parameters. The obtained posterior distributions are consistent with parameter measurements previously reported and enable us to quantify the uncertainty in their values for each tendon sample that was modelled. This approach could serve as a prototype for quantifying parameter uncertainty in other soft tissues.
△ Less
Submitted 5 May, 2022; v1 submitted 13 January, 2022;
originally announced January 2022.
-
Transport map accelerated adaptive importance sampling, and application to inverse problems arising from multiscale stochastic reaction networks
Authors:
Simon L. Cotter,
Ioannis G. Kevrekidis,
Paul Russell
Abstract:
In many applications, Bayesian inverse problems can give rise to probability distributions which contain complexities due to the Hessian varying greatly across parameter space. This complexity often manifests itself as lower dimensional manifolds on which the likelihood function is invariant, or varies very little. This can be due to trying to infer unobservable parameters, or due to sloppiness in…
▽ More
In many applications, Bayesian inverse problems can give rise to probability distributions which contain complexities due to the Hessian varying greatly across parameter space. This complexity often manifests itself as lower dimensional manifolds on which the likelihood function is invariant, or varies very little. This can be due to trying to infer unobservable parameters, or due to sloppiness in the model which is being used to describe the data. In such a situation, standard sampling methods for characterising the posterior distribution, which do not incorporate information about this structure, will be highly inefficient.
In this paper, we seek to develop an approach to tackle this problem when using adaptive importance sampling methods, by using optimal transport maps to simplify posterior distributions which are concentrated on lower dimensional manifolds. This approach is applicable to a whole range of problems for which Monte Carlo Markov chain (MCMC) methods mix slowly.
We demonstrate the approach by considering inverse problems arising from partially observed stochastic reaction networks. In particular, we consider systems which exhibit multiscale behaviour, but for which only the slow variables in the system are observable. We demonstrate that certain multiscale approximations lead to more consistent approximations of the posterior than others. The use of optimal transport maps stabilises the ensemble transform adaptive importance sampling (ETAIS) method, and allows for efficient sampling with smaller ensemble sizes. This approach allows us to take advantage of the large increases of efficiency when using adaptive importance sampling methods for previously intractable Bayesian inverse problems with complex posterior structure.
△ Less
Submitted 27 July, 2020; v1 submitted 31 January, 2019;
originally announced January 2019.
-
Product-form stationary distributions for deficiency zero networks with non-mass action kinetics
Authors:
David F. Anderson,
Simon L. Cotter
Abstract:
In many applications, for example when computing statistics of fast subsystems in a multiscale setting, we wish to find the stationary distributions of systems of continuous time Markov chains. Here we present a class of models that appears naturally in certain averaging approaches whose stationary distributions can be computed explicitly. In particular, we study continuous time Markov chain model…
▽ More
In many applications, for example when computing statistics of fast subsystems in a multiscale setting, we wish to find the stationary distributions of systems of continuous time Markov chains. Here we present a class of models that appears naturally in certain averaging approaches whose stationary distributions can be computed explicitly. In particular, we study continuous time Markov chain models for biochemical interaction systems with non-mass action kinetics whose network satisfies a certain constraint. Analogous with previous related results, the distributions can be written in product form.
△ Less
Submitted 17 September, 2016; v1 submitted 23 May, 2016;
originally announced May 2016.
-
Bayesian data assimilation in shape registration
Authors:
C. J. Cotter,
S. L. Cotter,
F. -X. Vialard
Abstract:
In this paper we apply a Bayesian framework to the problem of geodesic curve matching. Given a template curve, the geodesic equations provide a mapping from initial conditions for the conjugate momentum onto topologically equivalent shapes. Here, we aim to recover the well-defined posterior distribution on the initial momentum which gives rise to observed points on the target curve; this is achiev…
▽ More
In this paper we apply a Bayesian framework to the problem of geodesic curve matching. Given a template curve, the geodesic equations provide a mapping from initial conditions for the conjugate momentum onto topologically equivalent shapes. Here, we aim to recover the well-defined posterior distribution on the initial momentum which gives rise to observed points on the target curve; this is achieved by explicitly including a reparameterisation in the formulation. Appropriate priors are chosen for the functions which together determine this field and the positions of the observation points, the initial momentum $p_0$ and the reparameterisation vector field $ν$, informed by regularity results about the forward model. Having done this, we illustrate how Maximum Likelihood Estimators (MLEs) can be used to find regions of high posterior density, but also how we can apply recently developed \SLC{Markov chain Monte Carlo (MCMC)} methods on function spaces to characterise the whole of the posterior density. These illustrative examples also include scenarios where the posterior distribution is multimodal and irregular, leading us to the conclusion that knowledge of a state of global maximal posterior density does not always give us the whole picture, and full posterior sampling can give better quantification of likely states and the overall uncertainty inherent in the problem.
△ Less
Submitted 20 December, 2012;
originally announced December 2012.
-
MCMC Methods for Functions: Modifying Old Algorithms to Make Them Faster
Authors:
S. L. Cotter,
G. O. Roberts,
A. M. Stuart,
D. White
Abstract:
Many problems arising in applications result in the need to probe a probability distribution for functions. Examples include Bayesian nonparametric statistics and conditioned diffusion processes. Standard MCMC algorithms typically become arbitrarily slow under the mesh refinement dictated by nonparametric description of the unknown function. We describe an approach to modifying a whole range of MC…
▽ More
Many problems arising in applications result in the need to probe a probability distribution for functions. Examples include Bayesian nonparametric statistics and conditioned diffusion processes. Standard MCMC algorithms typically become arbitrarily slow under the mesh refinement dictated by nonparametric description of the unknown function. We describe an approach to modifying a whole range of MCMC methods, applicable whenever the target measure has density with respect to a Gaussian process or Gaussian random field reference measure, which ensures that their speed of convergence is robust under mesh refinement. Gaussian processes or random fields are fields whose marginal distributions, when evaluated at any finite set of $N$ points, are $\mathbb{R}^N$-valued Gaussians. The algorithmic approach that we describe is applicable not only when the desired probability measure has density with respect to a Gaussian process or Gaussian random field reference measure, but also to some useful non-Gaussian reference measures constructed through random truncation. In the applications of interest the data is often sparse and the prior specification is an essential part of the overall modelling strategy. These Gaussian-based reference measures are a very flexible modelling tool, finding wide-ranging application. Examples are shown in density estimation, data assimilation in fluid mechanics, subsurface geophysics and image registration. The key design principle is to formulate the MCMC method so that it is, in principle, applicable for functions; this may be achieved by use of proposals based on carefully chosen time-discretizations of stochastic dynamical systems which exactly preserve the Gaussian reference measure. Taking this approach leads to many new algorithms which can be implemented via minor modification of existing algorithms, yet which show enormous speed-up on a wide range of applied problems.
△ Less
Submitted 10 October, 2013; v1 submitted 3 February, 2012;
originally announced February 2012.
-
Adaptive Finite Element Method Assisted by Stochastic Simulation of Chemical Systems
Authors:
Simon L. Cotter,
Tomas Vejchodsky,
Radek Erban
Abstract:
Stochastic models of chemical systems are often analysed by solving the corresponding Fokker-Planck equation which is a drift-diffusion partial differential equation for the probability distribution function. Efficient numerical solution of the Fokker-Planck equation requires adaptive mesh refinements. In this paper, we present a mesh refinement approach which makes use of a stochastic simulation…
▽ More
Stochastic models of chemical systems are often analysed by solving the corresponding Fokker-Planck equation which is a drift-diffusion partial differential equation for the probability distribution function. Efficient numerical solution of the Fokker-Planck equation requires adaptive mesh refinements. In this paper, we present a mesh refinement approach which makes use of a stochastic simulation of the underlying chemical system. By observing the stochastic trajectory for a relatively short amount of time, the areas of the state space with non-negligible probability density are identified. By refining the finite element mesh in these areas, and coarsening elsewhere, a suitable mesh is constructed and used for the computation of the probability density.
△ Less
Submitted 9 November, 2011;
originally announced November 2011.
-
Approximation of Bayesian Inverse Problems for PDEs
Authors:
S. L. Cotter,
M. Dashti,
A. M. Stuart
Abstract:
Inverse problems are often ill-posed, with solutions that depend sensitively on data. In any numerical approach to the solution of such problems, regularization of some form is needed to counteract the resulting instability. This paper is based on an approach to regularization, employing a Bayesian formulation of the problem, which leads to a notion of well-posedness for inverse problems, at the…
▽ More
Inverse problems are often ill-posed, with solutions that depend sensitively on data. In any numerical approach to the solution of such problems, regularization of some form is needed to counteract the resulting instability. This paper is based on an approach to regularization, employing a Bayesian formulation of the problem, which leads to a notion of well-posedness for inverse problems, at the level of probability measures.
The stability which results from this well-posedness may be used as the basis for quantifying the approximation, in finite dimensional spaces, of inverse problems for functions. This paper contains a theory which utilizes the stability to estimate the distance between the true and approximate posterior distributions, in the Hellinger metric, in terms of error estimates for approximation of the underlying forward problem. This is potentially useful as it allows for the transfer of estimates from the numerical analysis of forward problems into estimates for the solution of the related inverse problem. In particular controlling differences in the Hellinger metric leads to control on the differences between expected values of polynomially bounded functions and operators, including the mean and covariance operator.
The ideas are illustrated with the classical inverse problem for the heat equation, and then applied to some more complicated non-Gaussian inverse problems arising in data assimilation, involving determination of the initial condition for the Stokes or Navier-Stokes equation from Lagrangian and Eulerian observations respectively.
△ Less
Submitted 11 September, 2009;
originally announced September 2009.