Search | arXiv e-print repository

Structure based SAT dataset for analysing GNN generalisation

Authors: Yi Fu, Anthony Tompkins, Yang Song, Maurice Pagnucco

Abstract: Satisfiability (SAT) solvers based on techniques such as conflict driven clause learning (CDCL) have produced excellent performance on both synthetic and real world industrial problems. While these CDCL solvers only operate on a per-problem basis, graph neural network (GNN) based solvers bring new benefits to the field by allowing practitioners to exploit knowledge gained from solved problems to e… ▽ More Satisfiability (SAT) solvers based on techniques such as conflict driven clause learning (CDCL) have produced excellent performance on both synthetic and real world industrial problems. While these CDCL solvers only operate on a per-problem basis, graph neural network (GNN) based solvers bring new benefits to the field by allowing practitioners to exploit knowledge gained from solved problems to expedite solving of new SAT problems. However, one specific area that is often studied in the context of CDCL solvers, but largely overlooked in GNN solvers, is the relationship between graph theoretic measure of structure in SAT problems and the generalisation ability of GNN solvers. To bridge the gap between structural graph properties (e.g., modularity, self-similarity) and the generalisability (or lack thereof) of GNN based SAT solvers, we present StructureSAT: a curated dataset, along with code to further generate novel examples, containing a diverse set of SAT problems from well known problem domains. Furthermore, we utilise a novel splitting method that focuses on deconstructing the families into more detailed hierarchies based on their structural properties. With the new dataset, we aim to help explain problematic generalisation in existing GNN SAT solvers by exploiting knowledge of structural graph properties. We conclude with multiple future directions that can help researchers in GNN based SAT solving develop more effective and generalisable SAT solvers. △ Less

Submitted 16 February, 2025; originally announced February 2025.

Comments: to be published in 28th International Conference on Artificial Intelligence and Statistics (AISTATS) 2025

arXiv:2405.05586 [pdf, other]

Modelling the galaxy radio continuum from star formation and active galactic nuclei in the Shark semi-analytic model

Authors: Samuel P. Hansen, Claudia D. P. Lagos, Matteo Bonato, Robin H. W. Cook, Luke J. M. Davies, Ivan Delvecchio, Scott A. Tompkins

Abstract: We present a model of radio continuum emission associated with star formation (SF) and active galactic nuclei (AGN) implemented in the Shark semi-analytic model of galaxy formation. SF emission includes free-free and synchrotron emission, which depend on the free-electron density and the rate of core-collapse supernovae with a minor contribution from supernova remnants, respectively. AGN emission… ▽ More We present a model of radio continuum emission associated with star formation (SF) and active galactic nuclei (AGN) implemented in the Shark semi-analytic model of galaxy formation. SF emission includes free-free and synchrotron emission, which depend on the free-electron density and the rate of core-collapse supernovae with a minor contribution from supernova remnants, respectively. AGN emission is modelled based on the jet production rate, which depends on the black hole mass, accretion rate and spin, and includes synchrotron self-absorption. Shark reproduces radio luminosity functions (RLFs) at 1.4 GHz and 150 MHz for 0 $\leq$ z $\leq$ 4, and scaling relations between radio luminosity, star formation rate and infrared luminosity of galaxies in the local and distant universe in good agreement with observations. The model also reproduces observed number counts of radio sources from 150 MHz to 8.4 GHz to within a factor of two on average, though larger discrepancies are seen at the very bright fluxes at higher frequencies. We use this model to understand how the radio continuum emission from radio-quiet AGNs can affect the measured RLFs of galaxies. We find current methods to exclude AGNs from observational samples result in large fractions of radio-quiet AGNs contaminating the "star-forming galaxies" selection and a brighter end to the resulting RLFs. We investigate how this effects the infrared-radio correlation (IRRC) and show that AGN contamination can lead to evolution of the IRRC with redshift. Without this contamination our model predicts a redshift- and stellar mass-independent IRRC, except at the dwarf-galaxy regime. △ Less

Submitted 9 May, 2024; originally announced May 2024.

Comments: Accepted for publication in MNRAS. 17 pages, 7 figures

arXiv:2312.05123 [pdf, other]

Auto-tuning capabilities of the ACTS track reconstruction suite

Authors: Corentin Allaire, Rocky Bala Garg, Hadrien Benjamin Grasland, Elyssa Frances Hofgard, David Rousseau, Rama Salahat, Andreas Salzburger, Lauren Alexandra Tompkins

Abstract: The reconstruction of charged particle trajectories is a crucial challenge of particle physics experiments as it directly impacts particle reconstruction and physics performances. To reconstruct these trajectories, different reconstruction algorithms are used sequentially. Each of these algorithms uses many configuration parameters that must be fine-tuned to properly account for the detector/exper… ▽ More The reconstruction of charged particle trajectories is a crucial challenge of particle physics experiments as it directly impacts particle reconstruction and physics performances. To reconstruct these trajectories, different reconstruction algorithms are used sequentially. Each of these algorithms uses many configuration parameters that must be fine-tuned to properly account for the detector/experimental setup, the available CPU budget and the desired physics performance. Examples of such parameters are cut values limiting the algorithm's search space, approximations accounting for complex phenomenons, or parameters controlling algorithm performance. Until now, these parameters had to be optimised by human experts, which is inefficient and raises issues for the long-term maintainability of such algorithms. Previous experience using machine learning for particle reconstruction (such as the TrackML challenge) has shown that they can be easily adapted to different experiments by learning directly from the data. We propose to bring the same approach to the classic track reconstruction algorithms by connecting them to an agent-driven optimiser, allowing us to find the best input parameters using an iterative tuning approach. We have so far demonstrated this method on different track reconstruction algorithms within A Common Tracking Software (ACTS) framework using the Open Data Detector (ODD). These algorithms include the trajectory seed reconstruction and selection, the particle vertex reconstruction and the generation of simplified material maps used for trajectory reconstruction. △ Less

Submitted 8 December, 2023; originally announced December 2023.

Comments: 6 pages, 2 figures, Talk presented at the 21st International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2022)

arXiv:2301.03699 [pdf, other]

doi 10.1093/mnras/stad116

The cosmic radio background from 150 MHz--8.4 GHz, and its division into AGN and star-forming galaxy flux

Authors: Scott A. Tompkins, Simon P. Driver, Aaron S. G. Robotham, Rogier A. Windhorst, Claudia del P. Lagos, T. Vernstrom, Andrew M. Hopkins

Abstract: We present a revised measurement of the extra-galactic background light (EBL) at radio frequencies based on a near complete compendium of radio source counts. We present the radio-EBL at 150 MHz, 325 MHz, 610 MHz, 1.4 GHz, 3 GHz, 5 GHz, and 8.4 GHz. In all cases the contribution to the radio-EBL, per decade of flux, exhibits a two-humped distribution well matched to the AGN and star-forming galaxy… ▽ More We present a revised measurement of the extra-galactic background light (EBL) at radio frequencies based on a near complete compendium of radio source counts. We present the radio-EBL at 150 MHz, 325 MHz, 610 MHz, 1.4 GHz, 3 GHz, 5 GHz, and 8.4 GHz. In all cases the contribution to the radio-EBL, per decade of flux, exhibits a two-humped distribution well matched to the AGN and star-forming galaxy (SFG) populations, and with each population contributing roughly equal energy. Only at 3 GHz are the source count contributions to the EBL fully convergent, and hence we report empirical lower limits to the radio-EBL in the remaining bands. Adopting predictions from the SHARK semi-analytic model for the form of the SFG population, we can fit the fainter source counts providing measurements of the total contribution to the radio-EBL for the SFG and the AGN populations separately. This constitutes an empirically constrained model-dependent measurement for the SFG contribution, but a fully empirical measurement of the AGN contribution. Using the {\sc ProSpect} spectral energy distribution code we can model the UV-optical-infrared-mm-radio SFG EBL at all frequencies from the cosmic star-formation history and the adoption of a Chabrier initial mass function. However, significant discrepancy remains ($5\times$) between our source-count estimates of the radio-EBL and the direct measurements reported from the ARCADE-2 experiment. We can rule out a significant missing discrete source radio population and suggest that the cause of the high ARCADE-2 radio-EBL values may need to be sought either in the foreground subtraction or as a yet unknown diffuse component in the radio sky. △ Less

Submitted 9 January, 2023; originally announced January 2023.

arXiv:2010.04315 [pdf, other]

Sparse Spectrum Warped Input Measures for Nonstationary Kernel Learning

Authors: Anthony Tompkins, Rafael Oliveira, Fabio Ramos

Abstract: We establish a general form of explicit, input-dependent, measure-valued warpings for learning nonstationary kernels. While stationary kernels are ubiquitous and simple to use, they struggle to adapt to functions that vary in smoothness with respect to the input. The proposed learning algorithm warps inputs as conditional Gaussian measures that control the smoothness of a standard stationary kerne… ▽ More We establish a general form of explicit, input-dependent, measure-valued warpings for learning nonstationary kernels. While stationary kernels are ubiquitous and simple to use, they struggle to adapt to functions that vary in smoothness with respect to the input. The proposed learning algorithm warps inputs as conditional Gaussian measures that control the smoothness of a standard stationary kernel. This construction allows us to capture non-stationary patterns in the data and provides intuitive inductive bias. The resulting method is based on sparse spectrum Gaussian processes, enabling closed-form solutions, and is extensible to a stacked construction to capture more complex patterns. The method is extensively validated alongside related algorithms on synthetic and real world datasets. We demonstrate a remarkable efficiency in the number of parameters of the warping functions in learning problems with both small and large data regimes. △ Less

Submitted 8 October, 2020; originally announced October 2020.

Comments: Accepted version for NeurIPS 2020

arXiv:2007.00164 [pdf, other]

Online Domain Adaptation for Occupancy Mapping

Authors: Anthony Tompkins, Ransalu Senanayake, Fabio Ramos

Abstract: Creating accurate spatial representations that take into account uncertainty is critical for autonomous robots to safely navigate in unstructured environments. Although recent LIDAR based mapping techniques can produce robust occupancy maps, learning the parameters of such models demand considerable computational time, discouraging them from being used in real-time and large-scale applications suc… ▽ More Creating accurate spatial representations that take into account uncertainty is critical for autonomous robots to safely navigate in unstructured environments. Although recent LIDAR based mapping techniques can produce robust occupancy maps, learning the parameters of such models demand considerable computational time, discouraging them from being used in real-time and large-scale applications such as autonomous driving. Recognizing the fact that real-world structures exhibit similar geometric features across a variety of urban environments, in this paper, we argue that it is redundant to learn all geometry dependent parameters from scratch. Instead, we propose a theoretical framework building upon the theory of optimal transport to adapt model parameters to account for changes in the environment, significantly amortizing the training cost. Further, with the use of high-fidelity driving simulators and real-world datasets, we demonstrate how parameters of 2D and 3D occupancy maps can be automatically adapted to accord with local spatial changes. We validate various domain adaptation paradigms through a series of experiments, ranging from inter-domain feature transfer to simulation-to-real-world feature transfer. Experiments verified the possibility of estimating parameters with a negligible computational and memory cost, enabling large-scale probabilistic mapping in urban environments. △ Less

Submitted 30 June, 2020; originally announced July 2020.

Comments: Robotics: Science and Systems (RSS) 2020 conference

MSC Class: 90C27 ACM Class: G.3

arXiv:1805.04982 [pdf, other]

Index Set Fourier Series Features for Approximating Multi-dimensional Periodic Kernels

Authors: Anthony Tompkins, Fabio Ramos

Abstract: Periodicity is often studied in timeseries modelling with autoregressive methods but is less popular in the kernel literature, particularly for higher dimensional problems such as in textures, crystallography, and quantum mechanics. Large datasets often make modelling periodicity untenable for otherwise powerful non-parametric methods like Gaussian Processes (GPs) which typically incur an… ▽ More Periodicity is often studied in timeseries modelling with autoregressive methods but is less popular in the kernel literature, particularly for higher dimensional problems such as in textures, crystallography, and quantum mechanics. Large datasets often make modelling periodicity untenable for otherwise powerful non-parametric methods like Gaussian Processes (GPs) which typically incur an $\mathcal{O}(N^3)$ computational burden and, consequently, are unable to scale to larger datasets. To this end we introduce a method termed \emph{Index Set Fourier Series Features} to tractably exploit multivariate Fourier series and efficiently decompose periodic kernels on higher-dimensional data into a series of basis functions. We show that our approximation produces significantly less predictive error than alternative approaches such as those based on random Fourier features and achieves better generalisation on regression problems with periodic data. △ Less

Submitted 13 May, 2018; originally announced May 2018.

Showing 1–7 of 7 results for author: Tompkins, A