Search | arXiv e-print repository

Savage-Dickey density ratio estimation with normalizing flows for Bayesian model comparison

Authors: Kiyam Lin, Alicja Polanska, Davide Piras, Alessio Spurio Mancini, Jason D. McEwen

Abstract: A core motivation of science is to evaluate which scientific model best explains observed data. Bayesian model comparison provides a principled statistical approach to comparing scientific models and has found widespread application within cosmology and astrophysics. Calculating the Bayesian evidence is computationally challenging, especially as we continue to explore increasingly more complex mod… ▽ More A core motivation of science is to evaluate which scientific model best explains observed data. Bayesian model comparison provides a principled statistical approach to comparing scientific models and has found widespread application within cosmology and astrophysics. Calculating the Bayesian evidence is computationally challenging, especially as we continue to explore increasingly more complex models. The Savage-Dickey density ratio (SDDR) provides a method to calculate the Bayes factor (evidence ratio) between two nested models using only posterior samples from the super model. The SDDR requires the calculation of a normalised marginal distribution over the extra parameters of the super model, which has typically been performed using classical density estimators, such as histograms. Classical density estimators, however, can struggle to scale to high-dimensional settings. We introduce a neural SDDR approach using normalizing flows that can scale to settings where the super model contains a large number of extra parameters. We demonstrate the effectiveness of this neural SDDR methodology applied to both toy and realistic cosmological examples. For a field-level inference setting, we show that Bayes factors computed for a Bayesian hierarchical model (BHM) and simulation-based inference (SBI) approach are consistent, providing further validation that SBI extracts as much cosmological information from the field as the BHM approach. The SDDR estimator with normalizing flows is implemented in the open-source harmonic Python package. △ Less

Submitted 4 June, 2025; originally announced June 2025.

Comments: 9 pages, 1 figure. Submitted to the Open Journal of Astrophysics. Codes available at https://github.com/astro-informatics/harmonic

arXiv:2505.21215 [pdf, ps, other]

Transfer learning for multifidelity simulation-based inference in cosmology

Authors: Alex A. Saoulis, Davide Piras, Niall Jeffrey, Alessio Spurio Mancini, Ana M. G. Ferreira, Benjamin Joachimi

Abstract: Simulation-based inference (SBI) enables cosmological parameter estimation when closed-form likelihoods or models are unavailable. However, SBI relies on machine learning for neural compression and density estimation. This requires large training datasets which are prohibitively expensive for high-quality simulations. We overcome this limitation with multifidelity transfer learning, combining less… ▽ More Simulation-based inference (SBI) enables cosmological parameter estimation when closed-form likelihoods or models are unavailable. However, SBI relies on machine learning for neural compression and density estimation. This requires large training datasets which are prohibitively expensive for high-quality simulations. We overcome this limitation with multifidelity transfer learning, combining less expensive, lower-fidelity simulations with a limited number of high-fidelity simulations. We demonstrate our methodology on dark matter density maps from two separate simulation suites in the hydrodynamical CAMELS Multifield Dataset. Pre-training on dark-matter-only $N$-body simulations reduces the required number of high-fidelity hydrodynamical simulations by a factor between $8$ and $15$, depending on the model complexity, posterior dimensionality, and performance metrics used. By leveraging cheaper simulations, our approach enables performant and accurate inference on high-fidelity models while substantially reducing computational costs. △ Less

Submitted 27 May, 2025; originally announced May 2025.

Comments: 9+4 pages, 8+5 figures

arXiv:2504.10453 [pdf, other]

Anchors no more: Using peculiar velocities to constrain $H_0$ and the primordial Universe without calibrators

Authors: Davide Piras, Francesco Sorrenti, Ruth Durrer, Martin Kunz

Abstract: We develop a novel approach to constrain the Hubble parameter $H_0$ and the primordial power spectrum amplitude $A_\mathrm{s}$ using supernovae type Ia (SNIa) data. By considering SNIa as tracers of the peculiar velocity field, we can model their distance and their covariance as a function of cosmological parameters without the need of calibrators like Cepheids; this yields a new independent probe… ▽ More We develop a novel approach to constrain the Hubble parameter $H_0$ and the primordial power spectrum amplitude $A_\mathrm{s}$ using supernovae type Ia (SNIa) data. By considering SNIa as tracers of the peculiar velocity field, we can model their distance and their covariance as a function of cosmological parameters without the need of calibrators like Cepheids; this yields a new independent probe of the large-scale structure based on SNIa data without distance anchors. Crucially, we implement a differentiable pipeline in JAX, including efficient emulators and affine sampling, reducing inference time from years to hours on a single GPU. We first validate our method on mock datasets, demonstrating that we can constrain $H_0$ and $\log 10^{10}A_\mathrm{s}$ within $\sim10\%$ using $\sim10^3$ SNIa. We then test our pipeline with SNIa from an $N$-body simulation, obtaining $7\%$-level unbiased constraints on $H_0$ with a moderate noise level. We finally apply our method to Pantheon+ data, constraining $H_0$ at the $10\%$ level without Cepheids when fixing $A_\mathrm{s}$ to its $\it{Planck}$ value. On the other hand, we obtain $15\%$-level constraints on $\log 10^{10}A_\mathrm{s}$ in agreement with $\it{Planck}$ when including Cepheids in the analysis. In light of upcoming observations of low redshift SNIa from the Zwicky Transient Facility and the Vera Rubin Legacy Survey of Space and Time, surveys for which our method will develop its full potential, we make our code publicly available. △ Less

Submitted 14 April, 2025; originally announced April 2025.

Comments: 22 pages, 5 figures, comments welcome. Code available at https://github.com/dpiras/veloce

arXiv:2503.00108 [pdf, other]

Constraining the primordial power spectrum using a differentiable likelihood

Authors: Subarna Chaki, Andrina Nicola, Alessio Spurio Mancini, Davide Piras, Robert Reischke

Abstract: The simplest inflationary models predict the primordial power spectrum (PPS) of curvature perturbations to be nearly scale-invariant. However, various other models of inflation predict deviations from this behaviour, motivating a data-driven approach to reconstruct the PPS and constrain its shape. In this work, we present a novel method that employs a fully differentiable pipeline to reconstruct t… ▽ More The simplest inflationary models predict the primordial power spectrum (PPS) of curvature perturbations to be nearly scale-invariant. However, various other models of inflation predict deviations from this behaviour, motivating a data-driven approach to reconstruct the PPS and constrain its shape. In this work, we present a novel method that employs a fully differentiable pipeline to reconstruct the PPS using Gaussian Processes and uses neural network emulators for fast and differentiable theoretical predictions. By leveraging gradient-based sampling techniques, such as Hamiltonian Monte Carlo, our approach efficiently samples the high-dimensional parameter space of cosmological parameters and the free-form PPS, enabling joint constraints on both. Applying this framework to Planck 2018 Cosmic Microwave Background (CMB) temperature anisotropy data we find our reconstructed PPS to be consistent with near scale-invariance on small scales, while exhibiting large uncertainties at large scales, driven mostly by cosmic variance. Our results show an overestimation of the PPS amplitude compared to $Λ$CDM predictions from the Planck 2018 analysis, which we attribute to our choice of a conservative prior on the optical depth $τ$ based on Planck 2015 measurements. Adopting a prior consistent with Planck 2018 measurements brings our results into full agreement with previous work. To ensure robustness of our results, we validate our differentiable pipeline against a non-differentiable framework, and also demonstrate that our results are insensitive to the choice of Gaussian process hyperparameters. These promising results and the flexibility of our pipeline make it ideally suited for application to additional data sets such as CMB polarisation as well as Large-Scale Structure probes, thus moving towards multi-probe primordial power spectrum reconstruction. △ Less

Submitted 28 February, 2025; originally announced March 2025.

Comments: 29 pages, 15 figures, 2 tables, to be submitted to JCAP

arXiv:2502.09810 [pdf, other]

doi 10.1103/PhysRevD.111.083537

$Λ$CDM and early dark energy in latent space: a data-driven parametrization of the CMB temperature power spectrum

Authors: Davide Piras, Laura Herold, Luisa Lucie-Smith, Eiichiro Komatsu

Abstract: Finding the best parametrization for cosmological models in the absence of first-principle theories is an open question. We propose a data-driven parametrization of cosmological models given by the disentangled 'latent' representation of a variational autoencoder (VAE) trained to compress cosmic microwave background (CMB) temperature power spectra. We consider a broad range of $Λ$CDM and beyond-… ▽ More Finding the best parametrization for cosmological models in the absence of first-principle theories is an open question. We propose a data-driven parametrization of cosmological models given by the disentangled 'latent' representation of a variational autoencoder (VAE) trained to compress cosmic microwave background (CMB) temperature power spectra. We consider a broad range of $Λ$CDM and beyond-$Λ$CDM cosmologies with an additional early dark energy (EDE) component. We show that these spectra can be compressed into 5 ($Λ$CDM) or 8 (EDE) independent latent parameters, as expected when using temperature power spectra alone, and which reconstruct spectra at an accuracy well within the Planck errors. These latent parameters have a physical interpretation in terms of well-known features of the CMB temperature spectrum: these include the position, height and even-odd modulation of the acoustic peaks, as well as the gravitational lensing effect. The VAE also discovers one latent parameter which entirely isolates the EDE effects from those related to $Λ$CDM parameters, thus revealing a previously unknown degree of freedom in the CMB temperature power spectrum. We further showcase how to place constraints on the latent parameters using Planck data as typically done for cosmological parameters, obtaining latent values consistent with previous $Λ$CDM and EDE cosmological constraints. Our work demonstrates the potential of a data-driven reformulation of current beyond-$Λ$CDM phenomenological models into the independent degrees of freedom to which the data observables are sensitive. △ Less

Submitted 28 March, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

Comments: 18 pages, 12 figures. Minor changes to match version published in PRD

arXiv:2410.23238 [pdf, other]

doi 10.1093/gji/ggaf112

Full-waveform earthquake source inversion using simulation-based inference

Authors: A. A. Saoulis, D. Piras, A. Spurio Mancini, B. Joachimi, A. M. G. Ferreira

Abstract: This paper presents a novel framework for full-waveform seismic source inversion using simulation-based inference (SBI). Traditional probabilistic approaches often rely on simplifying assumptions about data errors, which we show can lead to inaccurate uncertainty quantification. SBI addresses this limitation by building an empirical probabilistic model of the data errors using machine learning mod… ▽ More This paper presents a novel framework for full-waveform seismic source inversion using simulation-based inference (SBI). Traditional probabilistic approaches often rely on simplifying assumptions about data errors, which we show can lead to inaccurate uncertainty quantification. SBI addresses this limitation by building an empirical probabilistic model of the data errors using machine learning models, known as neural density estimators, which can then be integrated into the Bayesian inference framework. We apply the SBI framework to point-source moment tensor inversions as well as joint moment tensor and time-location inversions. We construct a range of synthetic examples to explore the quality of the SBI solutions, as well as to compare the SBI results with standard Gaussian likelihood-based Bayesian inversions. We then demonstrate that under real seismic noise, common Gaussian likelihood assumptions for treating full-waveform data yield overconfident posterior distributions that underestimate the moment tensor component uncertainties by up to a factor of 3. We contrast this with SBI, which produces well-calibrated posteriors that generally agree with the true seismic source parameters, and offers an order-of-magnitude reduction in the number of simulations required to perform inference compared to standard Monte Carlo techniques. Finally, we apply our methodology to a pair of moderate magnitude earthquakes in the North Atlantic. We utilise seismic waveforms recorded by the recent UPFLOW ocean bottom seismometer array as well as by regional land stations in the Azores, comparing full moment tensor and source-time location posteriors between SBI and a Gaussian likelihood approach. We find that our adaptation of SBI can be directly applied to real earthquake sources to efficiently produce high quality posterior distributions that significantly improve upon Gaussian likelihood approaches. △ Less

Submitted 14 May, 2025; v1 submitted 30 October, 2024; originally announced October 2024.

Comments: 22 + 11 pages, 11 + 11 figures. Now published in GJI

Journal ref: Geophysical Journal International 241.3 (2025): 1740-1761

arXiv:2410.10603 [pdf, other]

Testing interacting dark energy with Stage IV cosmic shear surveys through differentiable neural emulators

Authors: Karim Carrion, Alessio Spurio Mancini, Davide Piras, Juan Carlos Hidalgo

Abstract: We employ a novel framework for accelerated cosmological inference, based on neural emulators and gradient-based sampling methods, to forecast constraints on dark energy models from Stage IV cosmic shear surveys. We focus on dark scattering (DS), an interacting dark energy model with pure momentum exchange in the dark sector, and train COSMOPOWER emulators to accurately and efficiently model the D… ▽ More We employ a novel framework for accelerated cosmological inference, based on neural emulators and gradient-based sampling methods, to forecast constraints on dark energy models from Stage IV cosmic shear surveys. We focus on dark scattering (DS), an interacting dark energy model with pure momentum exchange in the dark sector, and train COSMOPOWER emulators to accurately and efficiently model the DS non-linear matter power spectrum produced by the halo model reaction framework, including the effects of baryon feedback and massive neutrinos. We embed the emulators within a fully-differentiable pipeline for gradient-based cosmological inference for which the batch likelihood call is up to $O(10^5)$ times faster than with traditional approaches, producing parameter constraints from simulated Stage IV cosmic shear data running on a single graphics processing unit (GPU). We also perform model comparison on the output chains from the inference process, employing the learnt harmonic mean estimator implemented in the software HARMONIC. We investigate degeneracies between dark energy and systematics parameters and assess the impact of scale cuts on the final constraints. Assuming a DS model for the mock data vector, we find that a Stage IV survey cosmic shear analysis can constrain the DS amplitude parameter $A_{\mathrm{ds}}$ with an uncertainty roughly an order of magnitude smaller than current constraints from Stage III surveys, even after marginalising over baryonic feedback, intrinsic alignments and redshift distribution uncertainties. These results show great promise for constraining DS with Stage IV data; furthermore, our methodology can be straightforwardly extended to a wide range of dark energy and modified gravity models. △ Less

Submitted 22 April, 2025; v1 submitted 14 October, 2024; originally announced October 2024.

Comments: 9 pages, 6 figures. COSMOPOWER available at https://github.com/alessiospuriomancini/cosmopower, COSMOPOWER-JAX available at https://github.com/dpiras/cosmopower-jax, emulators for Dark Scattering available at https://github.com/karimpsi22/DS-emulators. V2: Minor corrections. Version accepted for publication at MNRAS

arXiv:2410.07349 [pdf, other]

doi 10.1093/mnras/stae2810

Psi-GAN: A power-spectrum-informed generative adversarial network for the emulation of large-scale structure maps across cosmologies and redshifts

Authors: Prabh Bhambra, Benjamin Joachimi, Ofer Lahav, Davide Piras

Abstract: Simulations of the dark matter distribution throughout the Universe are essential in order to analyse data from cosmological surveys. $N$-body simulations are computationally expensive, and many cheaper alternatives (such as lognormal random fields) fail to reproduce accurate statistics of the smaller, non-linear scales. In this work, we present \textsc{Psi-GAN} (\textbf{P}ower-\textbf{s}pectrum-\… ▽ More Simulations of the dark matter distribution throughout the Universe are essential in order to analyse data from cosmological surveys. $N$-body simulations are computationally expensive, and many cheaper alternatives (such as lognormal random fields) fail to reproduce accurate statistics of the smaller, non-linear scales. In this work, we present \textsc{Psi-GAN} (\textbf{P}ower-\textbf{s}pectrum-\textbf{i}nformed \textbf{G}enerative \textbf{A}dversarial \textbf{N}etwork), a machine learning model which takes a two-dimensional lognormal dark matter density field and transforms it into a more realistic field. We construct \textsc{Psi-GAN} so that it is continuously conditional, and can therefore generate realistic realisations of the dark matter density field across a range of cosmologies and redshifts in $z \in [0, 3]$. We train \textsc{Psi-GAN} as a generative adversarial network on $2\,000$ simulation boxes from the Quijote simulation suite. We use a novel critic architecture that utilises the power spectrum as the basis for discrimination between real and generated samples. \textsc{Psi-GAN} shows agreement with $N$-body simulations over a range of redshifts and cosmologies, consistently outperforming the lognormal approximation on all tests of non-linear structure, such as being able to reproduce both the power spectrum up to wavenumbers of $1~h~\mathrm{Mpc}^{-1}$, and the bispectra of target $N$-body simulations to within ${\sim}5$ per cent. Our improved ability to model non-linear structure should allow more robust constraints on cosmological parameters when used in techniques such as simulation-based inference. △ Less

Submitted 6 January, 2025; v1 submitted 9 October, 2024; originally announced October 2024.

Comments: 20 pages, 11 figures, 3 tables, 1 appendix. Accepted for publication by Monthly Notices of the Royal Astronomical Society

arXiv:2409.11175 [pdf, other]

Bridging the Gap: Examining Vision Foundation Models for Optical and Radio Astronomy Applications

Authors: E. Lastufka, O. Bait, M. Drozdova, V. Kinakh, D. Piras, M. Audard, M. Dessauges-Zavadsky, T. Holotyak, D. Schaerer, S. Voloshynovskiy

Abstract: Vision foundation models, which have demonstrated significant potential in many multimedia applications, are often underutilized in the natural sciences. This is primarily due to mismatches between the nature of domain-specific scientific data and the typical training data used for foundation models, leading to distribution shifts. Scientific data often differ substantially in structure and charac… ▽ More Vision foundation models, which have demonstrated significant potential in many multimedia applications, are often underutilized in the natural sciences. This is primarily due to mismatches between the nature of domain-specific scientific data and the typical training data used for foundation models, leading to distribution shifts. Scientific data often differ substantially in structure and characteristics, and researchers frequently face the challenge of optimizing model performance with limited labeled data of only a few hundred or thousand images. This work evaluates the performance of vision foundation models in astrophysics, with a focus on identifying the best practices for adapting them to domain-specific datasets. We aim to establish a framework for selecting, fine-tuning, and optimizing these models for common tasks in optical and radio astronomy. We compared multiple foundation models, including self-supervised, weakly supervised, and distillation-based architectures, across two representative optical and radio datasets. Experiments involved different fine-tuning strategies, projector heads, and data preprocessing techniques, with performance evaluated on classification and detection metrics. Features extracted by specific foundation models improved classification accuracy for optical galaxy images compared to conventional supervised training. Similarly, these models achieved equivalent or superior performance in object detection tasks with radio images. However, classification performance for radio galaxy images was generally poor, often falling short of supervised approaches. These findings demonstrate that vision foundation models can be effectively adapted to astrophysical applications, provided practitioners iterate on model selection, training strategies, and data handling. △ Less

Submitted 9 January, 2025; v1 submitted 17 September, 2024; originally announced September 2024.

Comments: 12 pages, 5 figures, submitted to Astronomy and Astrophysics. A previous version of this work was accepted to the Foundation Models for Science Workshop at NeurIPS 2024

arXiv:2408.06147 [pdf, other]

doi 10.1051/0004-6361/202449964

Self-Supervised Learning on MeerKAT Wide-Field Continuum Images

Authors: Erica Lastufka, Omkar Bait, Olga Taran, Mariia Drozdova, Vitaliy Kinakh, Davide Piras, Marc Audard, Miroslava Dessauges-Zavadsky, Taras Holotyak, Daniel Schaerer, Svyatoslav Voloshynovskiy

Abstract: Self-supervised learning (SSL) applied to natural images has demonstrated a remarkable ability to learn meaningful, low-dimension representations without labels, resulting in models that are adaptable to many different tasks. Until now, applications of SSL to astronomical images have been limited to Galaxy Zoo datasets, which require a significant amount of pre-processing to prepare sparse images… ▽ More Self-supervised learning (SSL) applied to natural images has demonstrated a remarkable ability to learn meaningful, low-dimension representations without labels, resulting in models that are adaptable to many different tasks. Until now, applications of SSL to astronomical images have been limited to Galaxy Zoo datasets, which require a significant amount of pre-processing to prepare sparse images centered on a single galaxy. With wide-field survey instruments at the forefront of the Square Kilometer Array (SKA) era, this approach to gathering training data is impractical. We demonstrate that continuum images from surveys like the MeerKAT Galactic Cluster Legacy Survey (MGCLS) can be successfully used with SSL, without extracting single-galaxy cutouts. Using the SSL framework DINO, we experiment with various preprocessing steps, augmentations, and architectures to determine the optimal approach for this data. We train both ResNet50 and Vision Transformer (ViT) backbones. Our models match state-of-the-art results (trained on Radio Galaxy Zoo) for FRI/FRII morphology classification. Furthermore, they predict the number of compact sources via linear regression with much higher accuracy. However, fine-tuning results in similar performance between our models, the state-of-the-art, and open-source models on multi-class morphology classification. Using source-rich crops from wide-field images to train multi-purpose models is an easily scalable approach that significantly reduces data preparation time. For the tasks evaluated in this work, twenty thousand crops is sufficient training data for models that produce results similar to state-of-the-art. In the future, complex tasks like source detection and characterization, together with domain-specific tasks, ought to demonstrate the true advantages of training models with radio astronomy data over natural-image foundation models. △ Less

Submitted 12 August, 2024; originally announced August 2024.

Comments: 16 pages, 11 figures. A version of this paper has been accepted for publication in Astronomy and Astrophysics. All datasets used in this work are public, and the code and pre-trained checkpoints are available at https://github.com/elastufka/mgcls_dino

Journal ref: A&A 690, A310 (2024)

arXiv:2405.15850 [pdf, other]

doi 10.1093/mnras/stae1696

Deep learning insights into non-universality in the halo mass function

Authors: Ningyuan Guo, Luisa Lucie-Smith, Hiranya V. Peiris, Andrew Pontzen, Davide Piras

Abstract: The abundance of dark matter haloes is a key cosmological probe in forthcoming galaxy surveys. The theoretical understanding of the halo mass function (HMF) is limited by our incomplete knowledge of the origin of non-universality and its cosmological parameter dependence. We present a deep learning model which compresses the linear matter power spectrum into three independent factors which are nec… ▽ More The abundance of dark matter haloes is a key cosmological probe in forthcoming galaxy surveys. The theoretical understanding of the halo mass function (HMF) is limited by our incomplete knowledge of the origin of non-universality and its cosmological parameter dependence. We present a deep learning model which compresses the linear matter power spectrum into three independent factors which are necessary and sufficient to describe the $z=0$ HMF from the state-of-the-art AEMULUS emulator to sub-per cent accuracy in a $w$CDM$+N_\mathrm{eff}$ parameter space. Additional information about growth history does not improve the accuracy of HMF predictions if the matter power spectrum is already provided as input, because required aspects of the former can be inferred from the latter. The three factors carry information about the universal and non-universal aspects of the HMF, which we interrogate via the information-theoretic measure of mutual information. We find that non-universality is captured by recent growth history after matter-dark-energy equality and $N_\mathrm{eff}$ for $M\sim 10^{13} \, \mathrm{M_\odot}\, h^{-1}$ haloes, and by $Ω_{\rm m}$ for $M\sim 10^{15} \, \mathrm{M_\odot}\, h^{-1}$. The compact representation learnt by our model can inform the design of emulator training sets to achieve high emulator accuracy with fewer simulations. △ Less

Submitted 9 July, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

Comments: 17 pages, 13 figures. Minor changes to match version accepted for publication in MNRAS

arXiv:2405.12965 [pdf, other]

doi 10.33232/001c.123368

The future of cosmological likelihood-based inference: accelerated high-dimensional parameter estimation and model comparison

Authors: Davide Piras, Alicja Polanska, Alessio Spurio Mancini, Matthew A. Price, Jason D. McEwen

Abstract: We advocate for a new paradigm of cosmological likelihood-based inference, leveraging recent developments in machine learning and its underlying technology, to accelerate Bayesian inference in high-dimensional settings. Specifically, we combine (i) emulation, where a machine learning model is trained to mimic cosmological observables, e.g. CosmoPower-JAX; (ii) differentiable and probabilistic prog… ▽ More We advocate for a new paradigm of cosmological likelihood-based inference, leveraging recent developments in machine learning and its underlying technology, to accelerate Bayesian inference in high-dimensional settings. Specifically, we combine (i) emulation, where a machine learning model is trained to mimic cosmological observables, e.g. CosmoPower-JAX; (ii) differentiable and probabilistic programming, e.g. JAX and NumPyro, respectively; (iii) scalable Markov chain Monte Carlo (MCMC) sampling techniques that exploit gradients, e.g. Hamiltonian Monte Carlo; and (iv) decoupled and scalable Bayesian model selection techniques that compute the Bayesian evidence purely from posterior samples, e.g. the learned harmonic mean implemented in harmonic. This paradigm allows us to carry out a complete Bayesian analysis, including both parameter estimation and model selection, in a fraction of the time of traditional approaches. First, we demonstrate the application of this paradigm on a simulated cosmic shear analysis for a Stage IV survey in 37- and 39-dimensional parameter spaces, comparing $Λ$CDM and a dynamical dark energy model ($w_0w_a$CDM). We recover posterior contours and evidence estimates that are in excellent agreement with those computed by the traditional nested sampling approach while reducing the computational cost from 8 months on 48 CPU cores to 2 days on 12 GPUs. Second, we consider a joint analysis between three simulated next-generation surveys, each performing a 3x2pt analysis, resulting in 157- and 159-dimensional parameter spaces. Standard nested sampling techniques are simply unlikely to be feasible in this high-dimensional setting, requiring a projected 12 years of compute time on 48 CPU cores; on the other hand, the proposed approach only requires 8 days of compute time on 24 GPUs. All packages used in our analyses are publicly available. △ Less

Submitted 4 September, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

Comments: 14 pages, 6 figures. Accepted for publication in the Open Journal of Astrophysics. Codes available at https://github.com/alessiospuriomancini/cosmopower, https://github.com/dpiras/cosmopower-jax, https://github.com/astro-informatics/harmonic/

Journal ref: Open Journal of Astrophysics, Vol. 7, September 5th 2024

arXiv:2405.05969 [pdf, ps, other]

Learned harmonic mean estimation of the Bayesian evidence with normalizing flows

Authors: Alicja Polanska, Matthew A. Price, Davide Piras, Alessio Spurio Mancini, Jason D. McEwen

Abstract: We present the learned harmonic mean estimator with normalizing flows - a robust, scalable and flexible estimator of the Bayesian evidence for model comparison. Since the estimator is agnostic to sampling strategy and simply requires posterior samples, it can be applied to compute the evidence using any Markov chain Monte Carlo (MCMC) sampling technique, including saved down MCMC chains, or any va… ▽ More We present the learned harmonic mean estimator with normalizing flows - a robust, scalable and flexible estimator of the Bayesian evidence for model comparison. Since the estimator is agnostic to sampling strategy and simply requires posterior samples, it can be applied to compute the evidence using any Markov chain Monte Carlo (MCMC) sampling technique, including saved down MCMC chains, or any variational inference approach. The learned harmonic mean estimator was recently introduced, where machine learning techniques were developed to learn a suitable internal importance sampling target distribution to solve the issue of exploding variance of the original harmonic mean estimator. In this article we present the use of normalizing flows as the internal machine learning technique within the learned harmonic mean estimator. Normalizing flows can be elegantly coupled with the learned harmonic mean to provide an approach that is more robust, flexible and scalable than the machine learning models considered previously. We perform a series of numerical experiments, applying our method to benchmark problems and to a cosmological example in up to 21 dimensions. We find the learned harmonic mean estimator is in agreement with ground truth values and nested sampling estimates. The open-source harmonic Python package implementing the learned harmonic mean, now with normalizing flows included, is publicly available. △ Less

Submitted 5 June, 2025; v1 submitted 9 May, 2024; originally announced May 2024.

Comments: 14 pages, 8 figures, harmonic code available at https://github.com/astro-informatics/harmonic

arXiv:2310.10717 [pdf, other]

doi 10.1103/PhysRevD.110.023514

A representation learning approach to probe for dynamical dark energy in matter power spectra

Authors: Davide Piras, Lucas Lombriser

Abstract: We present DE-VAE, a variational autoencoder (VAE) architecture to search for a compressed representation of dynamical dark energy (DE) models in observational studies of the cosmic large-scale structure. DE-VAE is trained on matter power spectra boosts generated at wavenumbers $k\in(0.01-2.5) \ h/\rm{Mpc}$ and at four redshift values $z\in(0.1,0.48,0.78,1.5)$ for the most typical dynamical DE par… ▽ More We present DE-VAE, a variational autoencoder (VAE) architecture to search for a compressed representation of dynamical dark energy (DE) models in observational studies of the cosmic large-scale structure. DE-VAE is trained on matter power spectra boosts generated at wavenumbers $k\in(0.01-2.5) \ h/\rm{Mpc}$ and at four redshift values $z\in(0.1,0.48,0.78,1.5)$ for the most typical dynamical DE parametrization with two extra parameters describing an evolving DE equation of state. The boosts are compressed to a lower-dimensional representation, which is concatenated with standard cold dark matter (CDM) parameters and then mapped back to reconstructed boosts; both the compression and the reconstruction components are parametrized as neural networks. Remarkably, we find that a single latent parameter is sufficient to predict 95% (99%) of DE power spectra generated over a broad range of cosmological parameters within $1σ$ ($2σ$) of a Gaussian error which includes cosmic variance, shot noise and systematic effects for a Stage IV-like survey. This single parameter shows a high mutual information with the two DE parameters, and these three variables can be linked together with an explicit equation through symbolic regression. Considering a model with two latent variables only marginally improves the accuracy of the predictions, and adding a third latent variable has no significant impact on the model's performance. We discuss how the DE-VAE architecture can be extended from a proof of concept to a general framework to be employed in the search for a common lower-dimensional parametrization of a wide range of beyond-$Λ$CDM models and for different cosmological datasets. Such a framework could then both inform the development of cosmological surveys by targeting optimal probes, and provide theoretical insight into the common phenomenological aspects of beyond-$Λ$CDM models. △ Less

Submitted 9 July, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

Comments: 11 pages, 5 figures. Minor changes to match version published in PRD

Journal ref: Phys. Rev. D 110, 023514, July 2024

arXiv:2305.06347 [pdf, other]

doi 10.21105/astro.2305.06347

CosmoPower-JAX: high-dimensional Bayesian inference with differentiable cosmological emulators

Authors: D. Piras, A. Spurio Mancini

Abstract: We present CosmoPower-JAX, a JAX-based implementation of the CosmoPower framework, which accelerates cosmological inference by building neural emulators of cosmological power spectra. We show how, using the automatic differentiation, batch evaluation and just-in-time compilation features of JAX, and running the inference pipeline on graphics processing units (GPUs), parameter estimation can be acc… ▽ More We present CosmoPower-JAX, a JAX-based implementation of the CosmoPower framework, which accelerates cosmological inference by building neural emulators of cosmological power spectra. We show how, using the automatic differentiation, batch evaluation and just-in-time compilation features of JAX, and running the inference pipeline on graphics processing units (GPUs), parameter estimation can be accelerated by orders of magnitude with advanced gradient-based sampling techniques. These can be used to efficiently explore high-dimensional parameter spaces, such as those needed for the analysis of next-generation cosmological surveys. We showcase the accuracy and computational efficiency of CosmoPower-JAX on two simulated Stage IV configurations. We first consider a single survey performing a cosmic shear analysis totalling 37 model parameters. We validate the contours derived with CosmoPower-JAX and a Hamiltonian Monte Carlo sampler against those derived with a nested sampler and without emulators, obtaining a speed-up factor of $\mathcal{O}(10^3)$. We then consider a combination of three Stage IV surveys, each performing a joint cosmic shear and galaxy clustering (3x2pt) analysis, for a total of 157 model parameters. Even with such a high-dimensional parameter space, CosmoPower-JAX provides converged posterior contours in 3 days, as opposed to the estimated 6 years required by standard methods. CosmoPower-JAX is fully written in Python, and we make it publicly available to help the cosmological community meet the accuracy requirements set by next-generation surveys. △ Less

Submitted 22 June, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

Comments: 12 pages, 5 figures. Accepted for publication in The Open Journal of Astrophysics. CosmoPower-JAX is available at https://github.com/dpiras/cosmopower-jax

Journal ref: Volume 6 (2023)

arXiv:2211.00024 [pdf, other]

doi 10.1088/2632-2153/acc444

A robust estimator of mutual information for deep learning interpretability

Authors: Davide Piras, Hiranya V. Peiris, Andrew Pontzen, Luisa Lucie-Smith, Ningyuan Guo, Brian Nord

Abstract: We develop the use of mutual information (MI), a well-established metric in information theory, to interpret the inner workings of deep learning models. To accurately estimate MI from a finite number of samples, we present GMM-MI (pronounced $``$Jimmie$"$), an algorithm based on Gaussian mixture models that can be applied to both discrete and continuous settings. GMM-MI is computationally efficien… ▽ More We develop the use of mutual information (MI), a well-established metric in information theory, to interpret the inner workings of deep learning models. To accurately estimate MI from a finite number of samples, we present GMM-MI (pronounced $``$Jimmie$"$), an algorithm based on Gaussian mixture models that can be applied to both discrete and continuous settings. GMM-MI is computationally efficient, robust to the choice of hyperparameters and provides the uncertainty on the MI estimate due to the finite sample size. We extensively validate GMM-MI on toy data for which the ground truth MI is known, comparing its performance against established mutual information estimators. We then demonstrate the use of our MI estimator in the context of representation learning, working with synthetic data and physical datasets describing highly non-linear processes. We train deep learning models to encode high-dimensional data within a meaningful compressed (latent) representation, and use GMM-MI to quantify both the level of disentanglement between the latent variables, and their association with relevant physical quantities, thus unlocking the interpretability of the latent representation. We make GMM-MI publicly available. △ Less

Submitted 23 March, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

Comments: 30 pages, 8 figures. Minor changes to match version accepted for publication in Machine Learning: Science and Technology. GMM-MI available at https://github.com/dpiras/GMM-MI

Journal ref: Machine Learning: Science and Technology, Volume 4, Number 2, 025006, April 2023

arXiv:2205.07898 [pdf, other]

doi 10.1093/mnras/stad052

Fast and realistic large-scale structure from machine-learning-augmented random field simulations

Authors: Davide Piras, Benjamin Joachimi, Francisco Villaescusa-Navarro

Abstract: Producing thousands of simulations of the dark matter distribution in the Universe with increasing precision is a challenging but critical task to facilitate the exploitation of current and forthcoming cosmological surveys. Many inexpensive substitutes to full $N$-body simulations have been proposed, even though they often fail to reproduce the statistics of the smaller, non-linear scales. Among t… ▽ More Producing thousands of simulations of the dark matter distribution in the Universe with increasing precision is a challenging but critical task to facilitate the exploitation of current and forthcoming cosmological surveys. Many inexpensive substitutes to full $N$-body simulations have been proposed, even though they often fail to reproduce the statistics of the smaller, non-linear scales. Among these alternatives, a common approximation is represented by the lognormal distribution, which comes with its own limitations as well, while being extremely fast to compute even for high-resolution density fields. In this work, we train a generative deep learning model, mainly made of convolutional layers, to transform projected lognormal dark matter density fields to more realistic dark matter maps, as obtained from full $N$-body simulations. We detail the procedure that we follow to generate highly correlated pairs of lognormal and simulated maps, which we use as our training data, exploiting the information of the Fourier phases. We demonstrate the performance of our model comparing various statistical tests with different field resolutions, redshifts and cosmological parameters, proving its robustness and explaining its current limitations. When evaluated on 100 test maps, the augmented lognormal random fields reproduce the power spectrum up to wavenumbers of $1 \ h \ \rm{Mpc}^{-1}$, and the bispectrum within 10%, and always within the error bars, of the fiducial target simulations. Finally, we describe how we plan to integrate our proposed model with existing tools to yield more accurate spherical random fields for weak lensing analysis. △ Less

Submitted 1 February, 2023; v1 submitted 16 May, 2022; originally announced May 2022.

Comments: 16 pages, 11 figures. Matches MNRAS published version, which includes more tests with e.g. varying cosmological parameters

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 520, Issue 1, March 2023, Pages 668-683

arXiv:2203.08827 [pdf, other]

doi 10.1103/PhysRevD.105.103533

Discovering the building blocks of dark matter halo density profiles with neural networks

Authors: Luisa Lucie-Smith, Hiranya V. Peiris, Andrew Pontzen, Brian Nord, Jeyan Thiyagalingam, Davide Piras

Abstract: The density profiles of dark matter halos are typically modeled using empirical formulae fitted to the density profiles of relaxed halo populations. We present a neural network model that is trained to learn the mapping from the raw density field containing each halo to the dark matter density profile. We show that the model recovers the widely-used Navarro-Frenk-White (NFW) profile out to the vir… ▽ More The density profiles of dark matter halos are typically modeled using empirical formulae fitted to the density profiles of relaxed halo populations. We present a neural network model that is trained to learn the mapping from the raw density field containing each halo to the dark matter density profile. We show that the model recovers the widely-used Navarro-Frenk-White (NFW) profile out to the virial radius, and can additionally describe the variability in the outer profile of the halos. The neural network architecture consists of a supervised encoder-decoder framework, which first compresses the density inputs into a low-dimensional latent representation, and then outputs $ρ(r)$ for any desired value of radius $r$. The latent representation contains all the information used by the model to predict the density profiles. This allows us to interpret the latent representation by quantifying the mutual information between the representation and the halos' ground-truth density profiles. A two-dimensional representation is sufficient to accurately model the density profiles up to the virial radius; however, a three-dimensional representation is required to describe the outer profiles beyond the virial radius. The additional dimension in the representation contains information about the infalling material in the outer profiles of dark matter halos, thus discovering the splashback boundary of halos without prior knowledge of the halos' dynamical history. △ Less

Submitted 13 May, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

Comments: 12 pages, 6 figures. Minor changes to match version accepted for publication in PRD

arXiv:2107.00665 [pdf, other]

Towards Machine Learning-Based Meta-Studies: Applications to Cosmological Parameters

Authors: Tom Crossland, Pontus Stenetorp, Daisuke Kawata, Sebastian Riedel, Thomas D. Kitching, Anurag Deshpande, Tom Kimpson, Choong Ling Liew-Cain, Christian Pedersen, Davide Piras, Monu Sharma

Abstract: We develop a new model for automatic extraction of reported measurement values from the astrophysical literature, utilising modern Natural Language Processing techniques. We use this model to extract measurements present in the abstracts of the approximately 248,000 astrophysics articles from the arXiv repository, yielding a database containing over 231,000 astrophysical numerical measurements. Fu… ▽ More We develop a new model for automatic extraction of reported measurement values from the astrophysical literature, utilising modern Natural Language Processing techniques. We use this model to extract measurements present in the abstracts of the approximately 248,000 astrophysics articles from the arXiv repository, yielding a database containing over 231,000 astrophysical numerical measurements. Furthermore, we present an online interface (Numerical Atlas) to allow users to query and explore this database, based on parameter names and symbolic representations, and download the resulting datasets for their own research uses. To illustrate potential use cases we then collect values for nine different cosmological parameters using this tool. From these results we can clearly observe the historical trends in the reported values of these quantities over the past two decades, and see the impacts of landmark publications on our understanding of cosmology. △ Less

Submitted 1 July, 2021; originally announced July 2021.

Comments: 23 pages, 14 figures. Submitted to Monthly Notices of the Royal Astronomical Society. Astronomical measurement database available at http://numericalatlas.cs.ucl.ac.uk/

arXiv:2106.03846 [pdf, other]

doi 10.1093/mnras/stac064

COSMOPOWER: emulating cosmological power spectra for accelerated Bayesian inference from next-generation surveys

Authors: A. Spurio Mancini, D. Piras, J. Alsing, B. Joachimi, M. P. Hobson

Abstract: We present $\it{CosmoPower}$, a suite of neural cosmological power spectrum emulators providing orders-of-magnitude acceleration for parameter estimation from two-point statistics analyses of Large-Scale Structure (LSS) and Cosmic Microwave Background (CMB) surveys. The emulators replace the computation of matter and CMB power spectra from Boltzmann codes; thus, they do not need to be re-trained f… ▽ More We present $\it{CosmoPower}$, a suite of neural cosmological power spectrum emulators providing orders-of-magnitude acceleration for parameter estimation from two-point statistics analyses of Large-Scale Structure (LSS) and Cosmic Microwave Background (CMB) surveys. The emulators replace the computation of matter and CMB power spectra from Boltzmann codes; thus, they do not need to be re-trained for different choices of astrophysical nuisance parameters or redshift distributions. The matter power spectrum emulation error is less than $0.4\%$ in the wavenumber range $k \in [10^{-5}, 10] \, \mathrm{Mpc}^{-1}$, for redshift $z \in [0, 5]$. $\it{CosmoPower}$ emulates CMB temperature, polarisation and lensing potential power spectra in the $5σ$ region of parameter space around the $\it{Planck}$ best fit values with an error $\lesssim 10\%$ of the expected shot noise for the forthcoming Simons Observatory. $\it{CosmoPower}$ is showcased on a joint cosmic shear and galaxy clustering analysis from the Kilo-Degree Survey, as well as on a Stage IV $\it{Euclid}$-like simulated cosmic shear analysis. For the CMB case, $\it{CosmoPower}$ is tested on a $\it{Planck}$ 2018 CMB temperature and polarisation analysis. The emulators always recover the fiducial cosmological constraints with differences in the posteriors smaller than sampling noise, while providing a speed-up factor up to $O(10^4)$ to the complete inference pipeline. This acceleration allows posterior distributions to be recovered in just a few seconds, as we demonstrate in the $\it{Planck}$ likelihood case. $\it{CosmoPower}$ is written entirely in Python, can be interfaced with all commonly used cosmological samplers and is publicly available at https://github.com/alessiospuriomancini/cosmopower . △ Less

Submitted 31 January, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

Comments: 13+6 pages, 6+3 figures. Matches MNRAS published version. COSMOPOWER available at https://github.com/alessiospuriomancini/cosmopower

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 511, Issue 2, April 2022, Pages 1771-1788

arXiv:2101.04724 [pdf, other]

doi 10.1093/gji/ggac385

Towards fast machine-learning-assisted Bayesian posterior inference of microseismic event location and source mechanism

Authors: Davide Piras, Alessio Spurio Mancini, Ana M. G. Ferreira, Benjamin Joachimi, Michael P. Hobson

Abstract: Bayesian inference applied to microseismic activity monitoring allows the accurate location of microseismic events from recorded seismograms and the estimation of the associated uncertainties. However, the forward modelling of these microseismic events, which is necessary to perform Bayesian source inversion, can be prohibitively expensive in terms of computational resources. A viable solution is… ▽ More Bayesian inference applied to microseismic activity monitoring allows the accurate location of microseismic events from recorded seismograms and the estimation of the associated uncertainties. However, the forward modelling of these microseismic events, which is necessary to perform Bayesian source inversion, can be prohibitively expensive in terms of computational resources. A viable solution is to train a surrogate model based on machine learning techniques, to emulate the forward model and thus accelerate Bayesian inference. In this paper, we substantially enhance previous work, which considered only sources with isotropic moment tensors. We train a machine learning algorithm on the power spectrum of the recorded pressure wave and show that the trained emulator allows complete and fast event locations for $\textit{any}$ source mechanism. Moreover, we show that our approach is computationally inexpensive, as it can be run in less than 1 hour on a commercial laptop, while yielding accurate results using less than $10^4$ training seismograms. We additionally demonstrate how the trained emulators can be used to identify the source mechanism through the estimation of the Bayesian evidence. Finally, we demonstrate that our approach is robust to real noise as measured in field data. This work lays the foundations for efficient, accurate future joint determinations of event location and moment tensor, and associated uncertainties, which are ultimately key for accurately characterising human-induced and natural earthquakes, and for enhanced quantitative seismic hazard assessments. △ Less

Submitted 28 October, 2022; v1 submitted 12 January, 2021; originally announced January 2021.

Comments: 17+4 pages, 13+3 figures, 2 tables. Matches version published in GJI, including extra tests with realistic noise and network configuration. Code available at https://github.com/alessiospuriomancini/seismoML/tree/main/Piras_2022

Journal ref: Geophysical Journal International, Volume 232, Issue 2, February 2023, Pages 1219-1235

arXiv:2010.12464 [pdf, other]

Representation Learning for High-Dimensional Data Collection under Local Differential Privacy

Authors: Alex Mansbridge, Gregory Barbour, Davide Piras, Michael Murray, Christopher Frye, Ilya Feige, David Barber

Abstract: The collection of individuals' data has become commonplace in many industries. Local differential privacy (LDP) offers a rigorous approach to preserving privacy whereby the individual privatises their data locally, allowing only their perturbed datum to leave their possession. LDP thus provides a provable privacy guarantee to the individual against both adversaries and database administrators. Exi… ▽ More The collection of individuals' data has become commonplace in many industries. Local differential privacy (LDP) offers a rigorous approach to preserving privacy whereby the individual privatises their data locally, allowing only their perturbed datum to leave their possession. LDP thus provides a provable privacy guarantee to the individual against both adversaries and database administrators. Existing LDP mechanisms have successfully been applied to low-dimensional data, but in high dimensions the privacy-inducing noise largely destroys the utility of the data. In this work, our contributions are two-fold: first, by adapting state-of-the-art techniques from representation learning, we introduce a novel approach to learning LDP mechanisms. These mechanisms add noise to powerful representations on the low-dimensional manifold underlying the data, thereby overcoming the prohibitive noise requirements of LDP in high dimensions. Second, we introduce a novel denoising approach for downstream model learning. The training of performant machine learning models using collected LDP data is a common goal for data collectors, and downstream model performance forms a proxy for the LDP data utility. Our approach significantly outperforms current state-of-the-art LDP mechanisms. △ Less

Submitted 14 May, 2022; v1 submitted 23 October, 2020; originally announced October 2020.

arXiv:2009.06758 [pdf, other]

doi 10.5194/se-12-1683-2021

Accelerating Bayesian microseismic event location with deep learning

Authors: A. Spurio Mancini, D. Piras, A. M. G. Ferreira, M. P. Hobson, B. Joachimi

Abstract: We present a series of new open source deep learning algorithms to accelerate Bayesian full waveform point source inversion of microseismic events. Inferring the joint posterior probability distribution of moment tensor components and source location is key for rigorous uncertainty quantification. However, the inference process requires forward modelling of microseismic traces for each set of para… ▽ More We present a series of new open source deep learning algorithms to accelerate Bayesian full waveform point source inversion of microseismic events. Inferring the joint posterior probability distribution of moment tensor components and source location is key for rigorous uncertainty quantification. However, the inference process requires forward modelling of microseismic traces for each set of parameters explored by the sampling algorithm, which makes the inference very computationally intensive. In this paper we focus on accelerating this process by training deep learning models to learn the mapping between source location and seismic traces, for a given 3D heterogeneous velocity model, and a fixed isotropic moment tensor for the sources. These trained emulators replace the expensive solution of the elastic wave equation in the inference process. We compare our results with a previous study that used emulators based on Gaussian Processes to invert microseismic events. We show that all of our models provide more accurate predictions and $\sim 100$ times faster predictions than the method based on Gaussian Processes, and a $\mathcal{O}(10^5)$ speed-up factor over a pseudo-spectral method for waveform generation. For example, a 2-s long synthetic trace can be generated in $\sim 10$ ms on a common laptop processor, instead of $\sim$ 1 hr using a pseudo-spectral method on a high-profile Graphics Processing Units card. We also show that our inference results are in excellent agreement with those obtained from traditional location methods based on travel time estimates. The speed, accuracy and scalability of our open source deep learning models pave the way for extensions of these emulators to generic source mechanisms and application to joint Bayesian inversion of moment tensor components and source location using full waveforms. △ Less

Submitted 2 August, 2021; v1 submitted 14 September, 2020; originally announced September 2020.

Comments: matches version published in Solid Earth. Code available at http://github.com/alessiospuriomancini/seismoML

Journal ref: Solid Earth 12 (2021) 1683-1705

arXiv:2007.01662 [pdf, other]

Scattering contrast in GHz frequency ultrasound subsurface atomic force microscopy for detection of deeply buried features

Authors: Maarten H. van Es, Benoit A. J. Quesson, Abbas Mohtashami, Daniele Piras, Kodai Hatakeyama, Laurent Fillinger, Paul L. M. J. van Neer

Abstract: While Atomic Force Microscopy is mostly used to investigate surface properties, people have almost since its invention sought to apply its high resolution capability to image also structures buried within samples. One of the earliest techniques for this was based on using ultrasound excitations to visualize local differences in effective tip-sample stiffness caused by the presence of buried struct… ▽ More While Atomic Force Microscopy is mostly used to investigate surface properties, people have almost since its invention sought to apply its high resolution capability to image also structures buried within samples. One of the earliest techniques for this was based on using ultrasound excitations to visualize local differences in effective tip-sample stiffness caused by the presence of buried structures with different visco-elasticity from their surroundings. While the use of ultrasound has often triggered discussions on the contribution of diffraction or scattering of acoustic waves in visualizing buried structures, no conclusive papers on this topic have been published. Here we demonstrate and discuss how such acoustical effects can be unambiguously recognized and can be used with Atomic Force Microscopy to visualize deeply buried structures. △ Less

Submitted 3 July, 2020; originally announced July 2020.

Comments: 18 pages, 5 figures

arXiv:1707.06559 [pdf, ps, other]

doi 10.1093/mnras/stx2846

The mass dependence of dark matter halo alignments with large-scale structure

Authors: Davide Piras, Benjamin Joachimi, Björn Malte Schäfer, Mario Bonamigo, Stefan Hilbert, Edo van Uitert

Abstract: Tidal gravitational forces can modify the shape of galaxies and clusters of galaxies, thus correlating their orientation with the surrounding matter density field. We study the dependence of this phenomenon, known as intrinsic alignment (IA), on the mass of the dark matter haloes that host these bright structures, analysing the Millennium and Millennium-XXL $N$-body simulations. We closely follow… ▽ More Tidal gravitational forces can modify the shape of galaxies and clusters of galaxies, thus correlating their orientation with the surrounding matter density field. We study the dependence of this phenomenon, known as intrinsic alignment (IA), on the mass of the dark matter haloes that host these bright structures, analysing the Millennium and Millennium-XXL $N$-body simulations. We closely follow the observational approach, measuring the halo position-halo shape alignment and subsequently dividing out the dependence on halo bias. We derive a theoretical scaling of the IA amplitude with mass in a dark matter universe, and predict a power-law with slope $β_{\mathrm{M}}$ in the range $1/3$ to $1/2$, depending on mass scale. We find that the simulation data agree with each other and with the theoretical prediction remarkably well over three orders of magnitude in mass, with the joint analysis yielding an estimate of $β_{\mathrm{M}} = 0.36^{+0.01}_{-0.01}$. This result does not depend on redshift or on the details of the halo shape measurement. The analysis is repeated on observational data, obtaining a significantly higher value, $β_{\mathrm{M}} = 0.56^{+0.05}_{-0.05}$. There are also small but significant deviations from our simple model in the simulation signals at both the high- and low-mass end. We discuss possible reasons for these discrepancies, and argue that they can be attributed to physical processes not captured in the model or in the dark matter-only simulations. △ Less

Submitted 31 October, 2017; v1 submitted 20 July, 2017; originally announced July 2017.

Comments: 12 pages, 6 figures; accepted for publication in MNRAS

arXiv:1701.07357 [pdf, ps, other]

doi 10.1088/1361-6463/aa7024

Analysis of contact stiffness in Ultrasound Atomic Force Microscopy: Three-dimensional time-dependent ultrasound modeling

Authors: Daniele Piras, Hamed Sadeghian

Abstract: Ultrasound Atomic Force Microscopy (US-AFM) has been used for subsurface imaging of nanostructures. The contact stiffness variations have been suggested as the origin of the image contrast. Therefore, to analyze the image contrast, the local changes in the contact stiffness due to the presence of subsurface features should be calculated. So far, only static simulations have been conducted to analy… ▽ More Ultrasound Atomic Force Microscopy (US-AFM) has been used for subsurface imaging of nanostructures. The contact stiffness variations have been suggested as the origin of the image contrast. Therefore, to analyze the image contrast, the local changes in the contact stiffness due to the presence of subsurface features should be calculated. So far, only static simulations have been conducted to analyze the local changes in the contact stiffness and, consequently, the contrast in US-AFM. Such a static approach does not fully represent the real US-AFM experiment, where an ultrasound wave is launched either into the sample or at the tip, which modulates the contact stiffness. This is a time-dependent nonlinear dynamic problem rather than a static and stationary one. This letter presents dynamic 3D ultrasound analysis of contact stiffness in US-AFM (in contrast to static analysis) to realistically predict the changes in contact stiffness and thus the changes in the subsurface image contrast. The modulation frequency also influences the contact stiffness variations and, thus, the image contrast. The three-dimensional time-dependent ultrasound analysis will greatly aid in the contrast optimization of subsurface nanoimaging with US-AFM. △ Less

Submitted 25 January, 2017; originally announced January 2017.

Comments: 11 pages, 5 figures

arXiv:1610.01836 [pdf, ps, other]

doi 10.1142/S201032631750006X

Spectrum of large random Markov chains: heavy-tailed weights on the oriented complete graph

Authors: Charles Bordenave, Pietro Caputo, Djalil Chafaï, Daniele Piras

Abstract: We consider the random Markov matrix obtained by assigning i.i.d. non-negative weights to each edge of the complete oriented graph. In this study, the weights have unbounded first moment and belong to the domain of attraction of an alpha-stable law. We prove that as the dimension tends to infinity, the empirical measure of the singular values tends to a probability measure which depends only on al… ▽ More We consider the random Markov matrix obtained by assigning i.i.d. non-negative weights to each edge of the complete oriented graph. In this study, the weights have unbounded first moment and belong to the domain of attraction of an alpha-stable law. We prove that as the dimension tends to infinity, the empirical measure of the singular values tends to a probability measure which depends only on alpha, characterized as the expected value of the spectral measure at the root of a weighted random tree. The latter is a generalized two-stage version of the Poisson weighted infinite tree (PWIT) introduced by David Aldous. Under an additional smoothness assumption, we show that the empirical measure of the eigenvalues tends to a non-degenerate isotropic probability measure depending only on alpha and supported on the unit disc of the complex plane. We conjecture that the limiting support is actually formed by a strictly smaller disc. △ Less

Submitted 7 June, 2017; v1 submitted 6 October, 2016; originally announced October 2016.

Comments: Minor corrections

Journal ref: Random Matrices: Theory and Applications, World Scientific, 6 (2), pp.1750006 (2017)

arXiv:1305.4062 [pdf, ps, other]

A new acoustic lens material for large area detectors in photoacoustic breast tomography

Authors: Wenfeng Xia, Daniele Piras, Johan C. G. van Hespen, Wiendelt Steenbergen, Srirang Manohar

Abstract: Acoustic lenses made of acrylic plastic (PMMA) have been used to enlarge the acceptance angle of sensitive large surface area detectors and improve lateral resolution. However, PMMA lenses introduce image artifacts due to ultrasound internal reflections within the lenses. In this work we investigated this issue proposing a new lens material Stycast 1090SI. We characterized the acoustic properties… ▽ More Acoustic lenses made of acrylic plastic (PMMA) have been used to enlarge the acceptance angle of sensitive large surface area detectors and improve lateral resolution. However, PMMA lenses introduce image artifacts due to ultrasound internal reflections within the lenses. In this work we investigated this issue proposing a new lens material Stycast 1090SI. We characterized the acoustic properties of the proposed material in comparison with PMMA. Detector performance using negative lenses with the two materials, was tested using finite element simulation and experiment. Further the image quality of a photoacoustic tomography system was studied using k-Wave simulation and experiment. Our acoustic characterization showed that Stycast 1090SI has tissue-like acoustic impedance, high speed of sound and low acoustic attenuation. Both acoustic lenses show significant enlargement of detector acceptance angle and lateral resolution improvement. However, image artifacts induced by acoustic lenses are reduced using the proposed lens compared to PMMA lens. △ Less

Submitted 17 May, 2013; originally announced May 2013.

Comments: Accepted by Photoacoustics, Elsevier

arXiv:1212.2642 [pdf, ps, other]

doi 10.1118/1.4792462

An optimized ultrasound detector for photoacoustic breast tomography

Authors: Wenfeng Xia, Daniele Piras, Johan Van Hespen, Spiridon Van Veldhoven, Christian Prins, Ton Van Leeuwen, Wiendelt Steenbergen, Srirang Manohar

Abstract: Photoacoustic imaging has proven to be able to detect vascularization-driven optical absorption contrast associated with tumors. In order to detect breast tumors located a few centimeter deep in tissue, a sensitive ultrasound detector is of crucial importance for photoacoustic mammography. Further, because the expected photoacoustic frequency bandwidth (a few MHz to tens of kHz) is inversely propo… ▽ More Photoacoustic imaging has proven to be able to detect vascularization-driven optical absorption contrast associated with tumors. In order to detect breast tumors located a few centimeter deep in tissue, a sensitive ultrasound detector is of crucial importance for photoacoustic mammography. Further, because the expected photoacoustic frequency bandwidth (a few MHz to tens of kHz) is inversely proportional to the dimensions of light absorbing structures (0.5 to 10+ mm), proper choices of materials and their geometries, and proper considerations in design have to be made for optimal photoacoustic detectors. In this study, we design and evaluate a specialized ultrasound detector for photoacoustic mammography. Based on the required detector sensitivity and its frequency response, a selection of active material and matching layers and their geometries is made leading to a functional detector models. By iteration between simulation of detector performances, fabrication and experimental characterization of functional models an optimized implementation is made and evaluated. The experimental results of the designed first and second functional detectors matched with the simulations. In subsequent bare piezoelectric samples the effect of lateral resonances was addressed and their influence minimized by sub-dicing the samples. Consequently, using simulations, the final optimized detector could be designed, with a center frequency of 1 MHz and a -6 dB bandwidth of ~80%. The minimum detectable pressure was measured to be 0.5 Pa, which will facilitate deeper imaging compared to the currrent systems. The detector should be capable of detecting vascularized tumors with resolution of 1-2 mm. Further improvements by proper electrical grounding and shielding and implementation of this design into an arrayed detector will pave the way for clinical applications of photoacoustic mammography. △ Less

Submitted 13 February, 2013; v1 submitted 11 December, 2012; originally announced December 2012.

Comments: Accepted for publication in Medical Physics (American Association of Physicists in Medicine)

Journal ref: Med. Phys. 40, 032901 (2013)

Showing 1–29 of 29 results for author: Piras, D