Physical Symbolic Optimization

Authors: Wassim Tenachi, Rodrigo Ibata, Foivos I. Diakogiannis

Abstract: We present a framework for constraining the automatic sequential generation of equations to obey the rules of dimensional analysis by construction. Combining this approach with reinforcement learning, we built $Φ$-SO, a Physical Symbolic Optimization method for recovering analytical functions from physical data leveraging units constraints. Our symbolic regression algorithm achieves state-of-the-art results in contexts in which variables and constants have known physical units, outperforming all other methods on SRBench's Feynman benchmark in the presence of noise (exceeding 0.1%) and showing resilience even in the presence of significant (10%) levels of noise.

Submitted 6 December, 2023; originally announced December 2023.

Comments: 6 pages, 2 figures, 1 table. Accepted to NeurIPS 2023, Machine Learning for Physical Sciences workshop

arXiv:2312.01816

Class Symbolic Regression: Gotta Fit 'Em All

Authors: Wassim Tenachi, Rodrigo Ibata, Thibaut L. François, Foivos I. Diakogiannis

Abstract: We introduce 'Class Symbolic Regression' (Class SR), a first framework for automatically finding a single analytical functional form that accurately fits multiple datasets, each realization being governed by its own (possibly) unique set of fitting parameters. This hierarchical framework leverages the common constraint that all the members of a single class of physical phenomena follow a common governing law. Our approach extends the capabilities of our earlier Physical Symbolic Optimization ($Φ$-SO) framework for Symbolic Regression, which integrates dimensional analysis constraints and deep reinforcement learning for unsupervised symbolic analytical function discovery from data. Additionally, we introduce the first Class SR benchmark, comprising a series of synthetic physical challenges specifically designed to evaluate such algorithms. We demonstrate the efficacy of our novel approach by applying it to these benchmark challenges and showcase its practical utility for astrophysics by successfully extracting an analytic galaxy potential from a set of simulated orbits approximating stellar streams.

Submitted 17 June, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

Comments: 10 pages, 4 figures, 1 table. Accepted for publication at ApJL

arXiv:2305.16845

An end-to-end strategy for recovering a free-form potential from a snapshot of stellar coordinates

Authors: Wassim Tenachi, Rodrigo Ibata, Foivos I. Diakogiannis

Abstract: New large observational surveys such as Gaia are leading us into an era of data abundance, offering unprecedented opportunities to discover new physical laws through the power of machine learning. Here we present an end-to-end strategy for recovering a free-form analytical potential from a mere snapshot of stellar positions and velocities. First we show how auto-differentiation can be used to capture an agnostic map of the gravitational potential and its underlying dark matter distribution in the form of a neural network. However, in the context of physics, neural networks are both a plague and a blessing as they are extremely flexible for modeling physical systems but largely consist in non-interpretable black boxes. Therefore, in addition, we show how a complementary symbolic regression approach can be used to open up this neural network into a physically meaningful expression. We demonstrate our strategy by recovering the potential of a toy isochrone system.

Submitted 26 May, 2023; originally announced May 2023.

Comments: 4 pages, 2 figures. Accepted for publication in the International Astronomical Union Proceedings Series

arXiv:2303.03192

doi 10.3847/1538-4357/ad014c

Deep symbolic regression for physics guided by units constraints: toward the automated discovery of physical laws

Authors: Wassim Tenachi, Rodrigo Ibata, Foivos I. Diakogiannis

Abstract: Symbolic Regression is the study of algorithms that automate the search for analytic expressions that fit data. While recent advances in deep learning have generated renewed interest in such approaches, the development of symbolic regression methods has not been focused on physics, where we have important additional constraints due to the units associated with our data. Here we present $Φ$-SO, a Physical Symbolic Optimization framework for recovering analytical symbolic expressions from physics data using deep reinforcement learning techniques by learning units constraints. Our system is built, from the ground up, to propose solutions where the physical units are consistent by construction. This is useful not only in eliminating physically impossible solutions, but because the "grammatical" rules of dimensional analysis restrict enormously the freedom of the equation generator, thus vastly improving performance. The algorithm can be used to fit noiseless data, which can be useful for instance when attempting to derive an analytical property of a physical model, and it can also be used to obtain analytical approximations to noisy data. We test our machinery on a standard benchmark of equations from the Feynman Lectures on Physics and other physics textbooks, achieving state-of-the-art performance in the presence of noise (exceeding 0.1%) and show that it is robust even in the presence of substantial (10%) noise. We showcase its abilities on a panel of examples from astrophysics.

Submitted 9 October, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

Comments: 29 pages, 9 figures, 11 tables. Accepted for publication at ApJ

Journal ref: ApJ 959 99 (2023)

arXiv:2204.01958

Radio Transient Detection with Closure Products and Machine Learning

Authors: Xia Zhang, Foivos I. Diakogiannis, Richard Dodson, Andreas Wicenec

Abstract: For transient sources with timescales of 1-100 seconds, standardized imaging for all observations at each time step becomes impossible, as large modern interferometers produce very large data volumes in this observation time frame. Here we propose a method based on machine learning and using interferometric closure products as input features to detect transient source candidates directly from the spatial frequency domain without imaging. We train a simple neural network classifier on a synthetic dataset of Noise/Transient/RFI events, which we construct to tackle the lack of labelled data. We also use the dropout rate hyper-parameter of the model to allow the model to approximate Bayesian inference, and select the optimal dropout rate to match the posterior prediction to the actual underlying probability distribution of the detected events. The overall F1-score of the classifier on the simulated dataset is greater than 85%, with the signal-to-noise at 7$σ$. The performance of the trained neural network with Monte Carlo dropout is evaluated on semi-real data, which includes a simulated transient source and real noise. This classifier accurately identifies the presence of transient signals in the detectable signal-to-noise levels (above 4$σ$) with the optimal variance. Our findings suggest that a feasible radio transient classifier can be built with only simulated data and applied to the prediction of real observations, even in the absence of annotated real samples for training.

Submitted 4 April, 2022; originally announced April 2022.

Comments: 16 pages, 8 figures, to appear in AJ

arXiv:1810.11375

doi 10.1093/mnras/sty2931

Reliable mass calculation in spherical gravitating systems

Authors: Foivos I. Diakogiannis, Geraint F. Lewis, Rodrigo A. Ibata, Magda Guglielmo, Mark I. Wilkinson, Chris Power

Abstract: We present an innovative approach to the methodology of dynamical modelling, allowing practical reconstruction of the underlying dark matter mass without assuming both the density and anisotropy functions. With this, the mass-anisotropy degeneracy is reduced to simple model inference, incorporating the uncertainties inherent with observational data, statistically circumventing the mass-anisotropy degeneracy in spherical collisionless systems. We also tackle the inadequacy that the Jeans method of moments has on small datasets, with the aid of Generative Adversarial Networks: we leverage the power of artificial intelligence to reconstruct non-parametrically the projected line-of-sight velocity distribution. We show with realistic numerical simulations of dwarf spheroidal galaxies that we can distinguish between competing dark matter distributions and recover the anisotropy and mass profile of the system.

Submitted 26 October, 2018; originally announced October 2018.

Comments: 19 pages, 13 figures, accepted for publication in MNRAS

arXiv:1805.12008

doi 10.1093/mnras/sty2646

Radio Galaxy Zoo: ClaRAN - A Deep Learning Classifier for Radio Morphologies

Authors: Chen Wu, O. Ivy Wong, Lawrence Rudnick, Stanislav S. Shabala, Matthew J. Alger, Julie K. Banfield, Cheng Soon Ong, Sarah V. White, Avery F. Garon, Ray P. Norris, Heinz Andernach, Jean Tate, Vesna Lukic, Hongming Tang, Kevin Schawinski, Foivos I. Diakogiannis

Abstract: The upcoming next-generation large area radio continuum surveys can expect tens of millions of radio sources, rendering the traditional method for radio morphology classification through visual inspection unfeasible. We present ClaRAN - Classifying Radio sources Automatically with Neural networks - a proof-of-concept radio source morphology classifier based upon the Faster Region-based Convolutional Neural Networks (Faster R-CNN) method. Specifically, we train and test ClaRAN on the FIRST and WISE images from the Radio Galaxy Zoo Data Release 1 catalogue. ClaRAN provides end users with automated identification of radio source morphology classifications from a simple input of a radio image and a counterpart infrared image of the same region. ClaRAN is the first open-source, end-to-end radio source morphology classifier that is capable of locating and associating discrete and extended components of radio sources in a fast (< 200 milliseconds per image) and accurate (>= 90 %) fashion. Future work will improve ClaRAN's relatively lower success rates in dealing with multi-source fields and will enable ClaRAN to identify sources on much larger fields without loss in classification accuracy.

Submitted 29 October, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

Comments: 22 pages, 16 figures, Accepted in Monthly Notices of the Royal Astronomical Society

arXiv:1705.05724

doi 10.1093/mnras/stx1219

A novel JEAnS analysis of the Fornax dwarf using evolutionary algorithms: mass follows light with signs of an off-centre merger

Authors: Foivos I. Diakogiannis, Geraint F. Lewis, Rodrigo A. Ibata, Magda Guglielmo, Prajwal R. Kafle, Mark I. Wilkinson, Chris Power

Abstract: Dwarf galaxies, among the most dark matter dominated structures of our universe, are excellent test-beds for dark matter theories. Unfortunately, mass modelling of these systems suffers from the well documented mass-velocity anisotropy degeneracy. For the case of spherically symmetric systems, we describe a method for non-parametric modelling of the radial and tangential velocity moments. The method is a numerical velocity anisotropy "inversion", with parametric mass models, where the radial velocity dispersion profile, $σ_{\mathrm{rr}}^2$, is modeled as a B-spline, and the optimization is a three step process that consists of: (i) an Evolutionary modelling to determine the mass model form and the best B-spline basis to represent $σ_{\mathrm{rr}}^2$; (ii) an optimization of the smoothing parameters; (iii) a Markov chain Monte Carlo analysis to determine the physical parameters. The mass-anisotropy degeneracy is reduced into mass model inference, irrespective of kinematics. We test our method using synthetic data. Our algorithm constructs the best kinematic profile and discriminates between competing dark matter models. We apply our method to the Fornax dwarf spheroidal galaxy. Using a King brightness profile and testing various dark matter mass models, our model inference favours a simple mass-follows-light system. We find that the anisotropy profile of Fornax is tangential ($β(r) < 0$) and we estimate a total mass of $M_{\text{tot}} = 1.613 ^{+0.050}_{-0.075} \times 10^8 \, \text{M}_{\odot}$, and a mass-to-light ratio of $Υ_V = 8.93 ^{+0.32}_{-0.47} \, (\text{M}_{\odot}/\text{L}_{\odot})$. The algorithm we present is a robust and computationally inexpensive method for non-parametric modelling of spherical clusters independent of the mass-anisotropy degeneracy.

Submitted 16 May, 2017; originally announced May 2017.

Comments: 22 pages, 17 figures, accepted for publication in MNRAS

arXiv:1505.07926

doi 10.1088/0004-637X/805/2/189

Selecting Sagittarius: Identification and Chemical Characterization of the Sagittarius Stream

Authors: E. A. Hyde, S. Keller, D. B. Zucker, R. Ibata, A. Siebert, G. F. Lewis, J. Penarrubia, M. Irwin, G. Gilmore, R. R. Lane, A. Koch, A. R. Conn, F. I. Diakogiannis, S. Martell

Abstract: Wrapping around the Milky Way, the Sagittarius stream is the dominant substructure in the halo. Our statistical selection method has allowed us to identify 106 highly likely members of the Sagittarius stream. Spectroscopic analysis of metallicity and kinematics of all members provides us with a new mapping of the Sagittarius stream. We find correspondence between the velocity distribution of stream stars and those computed for a triaxial model of the Milky Way dark matter halo. The Sagittarius trailing arm exhibits a metallicity gradient, ranging from $-0.59$ dex to $-0.97$ dex over 142$^{\circ}$. This is consistent with the scenario of tidal disruption from a progenitor dwarf galaxy that possessed an internal metallicity gradient. We note high metallicity dispersion in the leading arm, causing a lack of detectable gradient and possibly indicating orbital phase mixing. We additionally report on a potential detection of the Sextans dwarf spheroidal in our data.

Submitted 29 May, 2015; originally announced May 2015.

Journal ref: E.A. Hyde (2015) ApJ, 805, 189

arXiv:1406.2546

doi 10.1093/mnras/stu1154

Resolving the mass--anisotropy degeneracy of the spherically symmetric Jeans equation II: optimum smoothing and model validation

Authors: Foivos I. Diakogiannis, Geraint F. Lewis, Rodrigo A. Ibata

Abstract: The spherical Jeans equation is widely used to estimate the mass content of stellar systems with apparent spherical symmetry. However, this method suffers from a degeneracy between the assumed mass density and the kinematic anisotropy profile, $β(r)$. In a previous work, we laid the theoretical foundations for an algorithm that combines smoothing B-splines with equations from dynamics to remove this degeneracy. Specifically, our method reconstructs a unique kinematic profile of $σ_{rr}^2$ and $σ_{tt}^2$ for an assumed free functional form of the potential and mass density $(Φ,ρ)$ and given a set of observed line-of-sight velocity dispersion measurements, $σ_{los}^2$. In Paper I (submitted to MNRAS: MN-14-0101-MJ) we demonstrated the efficiency of our algorithm with a very simple example and we commented on the need for optimum smoothing of the B-spline representation; this is in order to avoid unphysical variational behaviour when we have large uncertainty in our data. In the current contribution we present a process of finding the optimum smoothing for a given data set by using information of the behaviour from known ideal theoretical models. Markov Chain Monte Carlo methods are used to explore the degeneracy in the dynamical modelling process. We validate our model through applications to synthetic data for systems with constant or variable mass-to-light ratio $Υ$. In all cases we recover excellent fits of theoretical functions to observables and unique solutions. Our algorithm is a robust method for the removal of the mass-anisotropy degeneracy of the spherically symmetric Jeans equation for an assumed functional form of the mass density.

Submitted 10 June, 2014; originally announced June 2014.

Comments: 15 pages, 10 figures, Accepted for publication in MNRAS

arXiv:1406.2542

doi 10.1093/mnras/stu1153

Resolving the mass--anisotropy degeneracy of the spherically symmetric Jeans equation I: theoretical foundation

Authors: Foivos I. Diakogiannis, Geraint F. Lewis, Rodrigo A. Ibata

Abstract: A widely employed method for estimating the mass of stellar systems with apparent spherical symmetry is dynamical modelling using the spherically symmetric Jeans equation. Unfortunately this approach suffers from a degeneracy between the assumed mass density and the second order velocity moments. This degeneracy can lead to significantly different predictions for the mass content of the system under investigation, and thus poses a barrier for accurate estimates of the dark matter content of astrophysical systems. In a series of papers we describe an algorithm that removes this degeneracy and allows for unbiased mass estimates of systems of constant or variable mass-to-light ratio. The present contribution sets the theoretical foundation of the method that reconstructs a unique kinematic profile for some assumed free functional form of the mass density. The essence of our method lies in using flexible B-spline functions for the representation of the radial velocity dispersion in the spherically symmetric Jeans equation. We demonstrate our algorithm through an application to synthetic data for the case of an isotropic King model with fixed mass-to-light ratio, recovering excellent fits of theoretical functions to observables and a unique solution. The mass-anisotropy degeneracy is removed to the extent that, for an assumed functional form of the potential and mass density pair $(Φ,ρ)$, and a given set of line-of-sight velocity dispersion $σ_{los}^2$ observables, we recover a unique profile for $σ_{rr}^2$ and $σ_{tt}^2$. Our algorithm is simple, easy to apply and provides an efficient means to reconstruct the kinematic profile.

Submitted 10 June, 2014; originally announced June 2014.

Comments: 14 pages, 6 figures, Accepted for publication in MNRAS

arXiv:1310.8096

doi 10.1093/mnras/stt2093

Dynamical Modeling of NGC 6809: Selecting the best model using Bayesian Inference

Authors: Foivos I. Diakogiannis, Geraint F. Lewis, Rodrigo A. Ibata

Abstract: The precise cosmological origin of globular clusters remains uncertain, a situation hampered by the struggle of observational approaches in conclusively identifying the presence, or not, of dark matter in these systems. In this paper, we address this question through an analysis of the particular case of NGC 6809. While previous studies have performed dynamical modeling of this globular cluster using a small number of available kinematic data, they did not perform appropriate statistical inference tests for the choice of best model description; such statistical inference for model selection is important since, in general, different models can result in significantly different inferred quantities. With the latest kinematic data, we use Bayesian inference tests for model selection and thus obtain the best fitting models, as well as mass and dynamic mass-to-light ratio estimates. For this, we introduce a new likelihood function that provides more constrained distributions for the defining parameters of dynamical models. Initially we consider models with a known distribution function, and then model the cluster using solutions of the spherically symmetric Jeans equation; this latter approach depends upon the mass density profile and anisotropy $β$ parameter. In order to find the best description for the cluster we compare these models by calculating their Bayesian evidence. We find smaller mass and dynamic mass-to-light ratio values than previous studies, with the best fitting Michie model for a constant mass-to-light ratio of $Υ= 0.90^{+0.14}_{-0.14}$ and $M_{\text{dyn}}=6.10^{+0.51}_{-0.88} \times 10^4 M_{\odot}$. We exclude the significant presence of dark matter throughout the cluster, showing that no physically motivated distribution of dark matter can be present away from the cluster core.

Submitted 31 October, 2013; v1 submitted 30 October, 2013; originally announced October 2013.

Comments: 12 pages, 10 figures, accepted for publication in MNRAS

Report number: GFL-001

Showing 1–12 of 12 results for author: Diakogiannis, F I