-
On Soft Clustering For Correlation Estimators: Model Uncertainty, Differentiability, and Surrogates
Authors:
Edward Berman,
Sneh Pandya,
Jacqueline McCleary,
Marko Shuntov,
Caitlin Casey,
Nicole Drakos,
Andreas Faisst,
Steven Gillman,
Ghassem Gozaliasl,
Natalie Hogg,
Jeyhan Kartaltepe,
Anton Koekemoer,
Wilfried Mercier,
Diana Scognamiglio,
COSMOS-Web,
:,
The JWST Cosmic Origins Survey
Abstract:
Properly estimating correlations between objects at different spatial scales necessitates $\mathcal{O}(n^2)$ distance calculations. For this reason, most widely adopted packages for estimating correlations use clustering algorithms to approximate local trends. However, methods for quantifying the error introduced by this clustering have been understudied. In response, we present an algorithm for e…
▽ More
Properly estimating correlations between objects at different spatial scales necessitates $\mathcal{O}(n^2)$ distance calculations. For this reason, most widely adopted packages for estimating correlations use clustering algorithms to approximate local trends. However, methods for quantifying the error introduced by this clustering have been understudied. In response, we present an algorithm for estimating correlations that is probabilistic in the way that it clusters objects, enabling us to quantify the uncertainty caused by clustering simply through model inference. These soft clustering assignments enable correlation estimators that are theoretically differentiable with respect to their input catalogs. Thus, we also build a theoretical framework for differentiable correlation functions and describe their utility in comparison to existing surrogate models. Notably, we find that repeated normalization and distance function calls slow gradient calculations and that sparse Jacobians destabilize precision, pointing towards either approximate or surrogate methods as a necessary solution to exact gradients from correlation functions. To that end, we close with a discussion of surrogate models as proxies for correlation functions. We provide an example that demonstrates the efficacy of surrogate models to enable gradient-based optimization of astrophysical model parameters, successfully minimizing a correlation function output. Our numerical experiments cover science cases across cosmology, from point spread function (PSF) modeling efforts to gravitational simulations to galaxy intrinsic alignment (IA).
△ Less
Submitted 8 April, 2025;
originally announced April 2025.
-
IAEmu: Learning Galaxy Intrinsic Alignment Correlations
Authors:
Sneh Pandya,
Yuanyuan Yang,
Nicholas Van Alfen,
Jonathan Blazek,
Robin Walters
Abstract:
The intrinsic alignments (IA) of galaxies, a key contaminant in weak lensing analyses, arise from correlations in galaxy shapes driven by tidal interactions and galaxy formation processes. Accurate IA modeling is essential for robust cosmological inference, but current approaches rely on perturbative methods that break down on nonlinear scales or on expensive simulations. We introduce IAEmu, a neu…
▽ More
The intrinsic alignments (IA) of galaxies, a key contaminant in weak lensing analyses, arise from correlations in galaxy shapes driven by tidal interactions and galaxy formation processes. Accurate IA modeling is essential for robust cosmological inference, but current approaches rely on perturbative methods that break down on nonlinear scales or on expensive simulations. We introduce IAEmu, a neural network-based emulator that predicts the galaxy position-position ($ξ$), position-orientation ($ω$), and orientation-orientation ($η$) correlation functions and their uncertainties using mock catalogs based on the halo occupation distribution (HOD) framework. Compared to simulations, IAEmu achieves ~3% average error for $ξ$ and ~5% for $ω$, while capturing the stochasticity of $η$ without overfitting. The emulator provides both aleatoric and epistemic uncertainties, helping identify regions where predictions may be less reliable. We also demonstrate generalization to non-HOD alignment signals by fitting to IllustrisTNG hydrodynamical simulation data. As a fully differentiable neural network, IAEmu enables $\sim$10,000$\times$ speed-ups in mapping HOD parameters to correlation functions on GPUs, compared to CPU-based simulations. This acceleration facilitates inverse modeling via gradient-based sampling, making IAEmu a powerful surrogate model for galaxy bias and IA studies with direct applications to Stage IV weak lensing surveys.
△ Less
Submitted 14 May, 2025; v1 submitted 7 April, 2025;
originally announced April 2025.
-
Building Machine Learning Challenges for Anomaly Detection in Science
Authors:
Elizabeth G. Campolongo,
Yuan-Tang Chou,
Ekaterina Govorkova,
Wahid Bhimji,
Wei-Lun Chao,
Chris Harris,
Shih-Chieh Hsu,
Hilmar Lapp,
Mark S. Neubauer,
Josephine Namayanja,
Aneesh Subramanian,
Philip Harris,
Advaith Anand,
David E. Carlyn,
Subhankar Ghosh,
Christopher Lawrence,
Eric Moreno,
Ryan Raikman,
Jiaman Wu,
Ziheng Zhang,
Bayu Adhi,
Mohammad Ahmadi Gharehtoragh,
Saúl Alonso Monsalve,
Marta Babicz,
Furqan Baig
, et al. (125 additional authors not shown)
Abstract:
Scientific discoveries are often made by finding a pattern or object that was not predicted by the known rules of science. Oftentimes, these anomalous events or objects that do not conform to the norms are an indication that the rules of science governing the data are incomplete, and something new needs to be present to explain these unexpected outliers. The challenge of finding anomalies can be c…
▽ More
Scientific discoveries are often made by finding a pattern or object that was not predicted by the known rules of science. Oftentimes, these anomalous events or objects that do not conform to the norms are an indication that the rules of science governing the data are incomplete, and something new needs to be present to explain these unexpected outliers. The challenge of finding anomalies can be confounding since it requires codifying a complete knowledge of the known scientific behaviors and then projecting these known behaviors on the data to look for deviations. When utilizing machine learning, this presents a particular challenge since we require that the model not only understands scientific data perfectly but also recognizes when the data is inconsistent and out of the scope of its trained behavior. In this paper, we present three datasets aimed at developing machine learning-based anomaly detection for disparate scientific domains covering astrophysics, genomics, and polar science. We present the different datasets along with a scheme to make machine learning challenges around the three datasets findable, accessible, interoperable, and reusable (FAIR). Furthermore, we present an approach that generalizes to future machine learning challenges, enabling the possibility of large, more compute-intensive challenges that can ultimately lead to scientific discovery.
△ Less
Submitted 29 March, 2025; v1 submitted 3 March, 2025;
originally announced March 2025.
-
SIDDA: SInkhorn Dynamic Domain Adaptation for Image Classification with Equivariant Neural Networks
Authors:
Sneh Pandya,
Purvik Patel,
Brian D. Nord,
Mike Walmsley,
Aleksandra Ćiprijanović
Abstract:
Modern neural networks (NNs) often do not generalize well in the presence of a "covariate shift"; that is, in situations where the training and test data distributions differ, but the conditional distribution of classification labels remains unchanged. In such cases, NN generalization can be reduced to a problem of learning more domain-invariant features. Domain adaptation (DA) methods include a r…
▽ More
Modern neural networks (NNs) often do not generalize well in the presence of a "covariate shift"; that is, in situations where the training and test data distributions differ, but the conditional distribution of classification labels remains unchanged. In such cases, NN generalization can be reduced to a problem of learning more domain-invariant features. Domain adaptation (DA) methods include a range of techniques aimed at achieving this; however, these methods have struggled with the need for extensive hyperparameter tuning, which then incurs significant computational costs. In this work, we introduce SIDDA, an out-of-the-box DA training algorithm built upon the Sinkhorn divergence, that can achieve effective domain alignment with minimal hyperparameter tuning and computational overhead. We demonstrate the efficacy of our method on multiple simulated and real datasets of varying complexity, including simple shapes, handwritten digits, and real astronomical observations. SIDDA is compatible with a variety of NN architectures, and it works particularly well in improving classification accuracy and model calibration when paired with equivariant neural networks (ENNs). We find that SIDDA enhances the generalization capabilities of NNs, achieving up to a $\approx40\%$ improvement in classification accuracy on unlabeled target data. We also study the efficacy of DA on ENNs with respect to the varying group orders of the dihedral group $D_N$, and find that the model performance improves as the degree of equivariance increases. Finally, we find that SIDDA enhances model calibration on both source and target data--achieving over an order of magnitude improvement in the ECE and Brier score. SIDDA's versatility, combined with its automated approach to domain alignment, has the potential to advance multi-dataset studies by enabling the development of highly generalizable models.
△ Less
Submitted 23 January, 2025;
originally announced January 2025.
-
On the Generality and Persistence of Cosmological Stasis
Authors:
James Halverson,
Sneh Pandya
Abstract:
Hierarchical decays of $N$ matter species to radiation may balance against Hubble expansion to yield stasis, a new phase of cosmological evolution with constant matter and radiation abundances. We analyze stasis with various machine learning techniques on the full $2N$-dimensional space of decay rates and abundances, which serve as inputs to the system of Boltzmann equations that governs the dynam…
▽ More
Hierarchical decays of $N$ matter species to radiation may balance against Hubble expansion to yield stasis, a new phase of cosmological evolution with constant matter and radiation abundances. We analyze stasis with various machine learning techniques on the full $2N$-dimensional space of decay rates and abundances, which serve as inputs to the system of Boltzmann equations that governs the dynamics. We construct a differentiable Boltzmann solver to maximize the number of stasis $e$-folds $\mathcal{N}$. High-stasis configurations obtained by gradient ascent motivate log-uniform distributions on rates and abundances to accompany power-law distributions of previous works. We demonstrate that random configurations drawn from these families of distributions regularly exhibit many $e$-folds of stasis. We additionally use them as priors in a Bayesian analysis conditioned on stasis, using stochastic variational inference with normalizing flows to model the posterior. All three numerical analyses demonstrate the generality of stasis and point to a new model in which the rates and abundances are exponential in the species index. We show that the exponential model solves the exact stasis equations, is an attractor, and satisfies $\mathcal{N}\propto N$, exhibiting inflation-level $e$-folding with a relatively low number of species. This is contrasted with the $\mathcal{N}\propto \log(N)$ scaling of power-law models. Finally, we discuss implications for the emergent string conjecture and string axiverse.
△ Less
Submitted 4 January, 2025; v1 submitted 1 August, 2024;
originally announced August 2024.
-
Learning Galaxy Intrinsic Alignment Correlations
Authors:
Sneh Pandya,
Yuanyuan Yang,
Nicholas Van Alfen,
Jonathan Blazek,
Robin Walters
Abstract:
The intrinsic alignments (IA) of galaxies, regarded as a contaminant in weak lensing analyses, represents the correlation of galaxy shapes due to gravitational tidal interactions and galaxy formation processes. As such, understanding IA is paramount for accurate cosmological inferences from weak lensing surveys; however, one limitation to our understanding and mitigation of IA is expensive simulat…
▽ More
The intrinsic alignments (IA) of galaxies, regarded as a contaminant in weak lensing analyses, represents the correlation of galaxy shapes due to gravitational tidal interactions and galaxy formation processes. As such, understanding IA is paramount for accurate cosmological inferences from weak lensing surveys; however, one limitation to our understanding and mitigation of IA is expensive simulation-based modeling. In this work, we present a deep learning approach to emulate galaxy position-position ($ξ$), position-orientation ($ω$), and orientation-orientation ($η$) correlation function measurements and uncertainties from halo occupation distribution-based mock galaxy catalogs. We find strong Pearson correlation values with the model across all three correlation functions and further predict aleatoric uncertainties through a mean-variance estimation training procedure. $ξ(r)$ predictions are generally accurate to $\leq10\%$. Our model also successfully captures the underlying signal of the noisier correlations $ω(r)$ and $η(r)$, although with a lower average accuracy. We find that the model performance is inhibited by the stochasticity of the data, and will benefit from correlations averaged over multiple data realizations. Our code will be made open source upon journal publication.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
E(2) Equivariant Neural Networks for Robust Galaxy Morphology Classification
Authors:
Sneh Pandya,
Purvik Patel,
Franc O,
Jonathan Blazek
Abstract:
We propose the use of group convolutional neural network architectures (GCNNs) equivariant to the 2D Euclidean group, $E(2)$, for the task of galaxy morphology classification by utilizing symmetries of the data present in galaxy images as an inductive bias in the architecture. We conduct robustness studies by introducing artificial perturbations via Poisson noise insertion and one-pixel adversaria…
▽ More
We propose the use of group convolutional neural network architectures (GCNNs) equivariant to the 2D Euclidean group, $E(2)$, for the task of galaxy morphology classification by utilizing symmetries of the data present in galaxy images as an inductive bias in the architecture. We conduct robustness studies by introducing artificial perturbations via Poisson noise insertion and one-pixel adversarial attacks to simulate the effects of limited observational capabilities. We train, validate, and test GCNNs equivariant to discrete subgroups of $E(2)$ - the cyclic and dihedral groups of order $N$ - on the Galaxy10 DECals dataset and find that GCNNs achieve higher classification accuracy and are consistently more robust than their non-equivariant counterparts, with an architecture equivariant to the group $D_{16}$ achieving a $95.52 \pm 0.18\%$ test-set accuracy. We also find that the model loses $<6\%$ accuracy on a $50\%$-noise dataset and all GCNNs are less susceptible to one-pixel perturbations than an identically constructed CNN. Our code is publicly available at https://github.com/snehjp2/GCNNMorphology.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
AGNet: Weighing Black Holes with Deep Learning
Authors:
Joshua Yao-Yu Lin,
Sneh Pandya,
Devanshi Pratap,
Xin Liu,
Matias Carrasco Kind,
Volodymyr Kindratenko
Abstract:
Supermassive black holes (SMBHs) are ubiquitously found at the centers of most massive galaxies. Measuring SMBH mass is important for understanding the origin and evolution of SMBHs. However, traditional methods require spectroscopic data which is expensive to gather. We present an algorithm that weighs SMBHs using quasar light time series, circumventing the need for expensive spectra. We train, v…
▽ More
Supermassive black holes (SMBHs) are ubiquitously found at the centers of most massive galaxies. Measuring SMBH mass is important for understanding the origin and evolution of SMBHs. However, traditional methods require spectroscopic data which is expensive to gather. We present an algorithm that weighs SMBHs using quasar light time series, circumventing the need for expensive spectra. We train, validate, and test neural networks that directly learn from the Sloan Digital Sky Survey (SDSS) Stripe 82 light curves for a sample of $38,939$ spectroscopically confirmed quasars to map out the nonlinear encoding between SMBH mass and multi-color optical light curves. We find a 1$σ$ scatter of 0.37 dex between the predicted SMBH mass and the fiducial virial mass estimate based on SDSS single-epoch spectra, which is comparable to the systematic uncertainty in the virial mass estimate. Our results have direct implications for more efficient applications with future observations from the Vera C. Rubin Observatory. Our code, \textsf{AGNet}, is publicly available at \url{https://github.com/snehjp2/AGNet}.
△ Less
Submitted 21 November, 2022; v1 submitted 17 August, 2021;
originally announced August 2021.
-
AGNet: Weighing Black Holes with Machine Learning
Authors:
Joshua Yao-Yu Lin,
Sneh Pandya,
Devanshi Pratap,
Xin Liu,
Matias Carrasco Kind
Abstract:
Supermassive black holes (SMBHs) are ubiquitously found at the centers of most galaxies. Measuring SMBH mass is important for understanding the origin and evolution of SMBHs. However, traditional methods require spectral data which is expensive to gather. To solve this problem, we present an algorithm that weighs SMBHs using quasar light time series, circumventing the need for expensive spectra. W…
▽ More
Supermassive black holes (SMBHs) are ubiquitously found at the centers of most galaxies. Measuring SMBH mass is important for understanding the origin and evolution of SMBHs. However, traditional methods require spectral data which is expensive to gather. To solve this problem, we present an algorithm that weighs SMBHs using quasar light time series, circumventing the need for expensive spectra. We train, validate, and test neural networks that directly learn from the Sloan Digital Sky Survey (SDSS) Stripe 82 data for a sample of $9,038$ spectroscopically confirmed quasars to map out the nonlinear encoding between black hole mass and multi-color optical light curves. We find a 1$σ$ scatter of 0.35 dex between the predicted mass and the fiducial virial mass based on SDSS single-epoch spectra. Our results have direct implications for efficient applications with future observations from the Vera Rubin Observatory.
△ Less
Submitted 1 December, 2020; v1 submitted 30 November, 2020;
originally announced November 2020.
-
Large Area X-ray Proportional Counter (LAXPC) Instrument on AstroSat and Some Preliminary Results from its performance in the orbit
Authors:
P. C. Agrawal,
J. S. Yadav,
H. M. Antia,
Dhiraj Dedhia,
P. Shah,
Jai Verdhan Chauhan,
R. K. Manchanda,
V. R. Chitnis,
V. M. Gujar,
Tilak Katoch,
V. N. Kurhade,
P. Madhwani,
T. K. Manojkumar,
V. A. Nikam,
A. S. Pandya,
J. V. Parmar,
D. M. Pawar,
Jayashree Roy,
B. Paul,
Mayukh Pahari,
Ranjeev Misra,
M. H. Ravichandran,
K. Anilkumar,
C. C. Joseph,
K. H. Navalgund
, et al. (3 additional authors not shown)
Abstract:
Large Area X-ray Propositional Counter (LAXPC) instrument on AstroSat is aimed at providing high time resolution X-ray observations in 3 to 80 keV energy band with moderate energy resolution. To achieve large collecting area, a cluster of three co-aligned identical LAXPC detectors, is used to realize an effective area in access of about 6000 cm2 at 15 keV. The large detection volume of the LAXPC d…
▽ More
Large Area X-ray Propositional Counter (LAXPC) instrument on AstroSat is aimed at providing high time resolution X-ray observations in 3 to 80 keV energy band with moderate energy resolution. To achieve large collecting area, a cluster of three co-aligned identical LAXPC detectors, is used to realize an effective area in access of about 6000 cm2 at 15 keV. The large detection volume of the LAXPC detectors, filled with xenon gas at about 2 atmosphere pressure, results in detection efficiency greater than 50%, above 30 keV. In this article, we present salient features of the LAXPC detectors, their testing and characterization in the laboratory prior to launch and calibration in the orbit. Some preliminary results on timing and spectral characteristics of a few X-ray binaries and other type of sources, are briefly discussed to demonstrate that the LAXPC instrument is performing as planned in the orbit.
△ Less
Submitted 25 May, 2017; v1 submitted 18 May, 2017;
originally announced May 2017.
-
Calibration of the Large Area X-ray Proportional Counter (LAXPC) instrument on-board AstroSat
Authors:
H. M. Antia,
J. S. Yadav,
P. C. Agrawal,
Jai Verdhan Chauhan,
R. K. Manchanda,
Varsha Chitnis,
Biswajit Paul,
Dhiraj Dedhia,
Parag Shah,
V. M. Gujar,
Tilak Katoch,
V. N. Kurhade,
Pankaj Madhwani,
T. K. Manojkumar,
V. A. Nikam,
A. S. Pandya,
J. V. Parmar,
D. M. Pawar,
Mayukh Pahari,
Ranjeev Misra,
K. H. Navalgund,
R. Pandiyan,
K. S. Sharma,
K. Subbarao
Abstract:
We present the calibration and background model for the Large Area X-ray Proportional Counter (LAXPC) detectors on-board AstroSat. LAXPC instrument has three nominally identical detectors to achieve large collecting area. These detectors are independent of each other and in the event analysis mode, they record the arrival time and energy of each photon that is detected. The detectors have a time-r…
▽ More
We present the calibration and background model for the Large Area X-ray Proportional Counter (LAXPC) detectors on-board AstroSat. LAXPC instrument has three nominally identical detectors to achieve large collecting area. These detectors are independent of each other and in the event analysis mode, they record the arrival time and energy of each photon that is detected. The detectors have a time-resolution of 10 $μ$s and a dead-time of about 42 $μ$s. This makes LAXPC ideal for timing studies. The energy resolution and peak channel to energy mapping were obtained from calibration on ground using radioactive sources coupled with GEANT4 simulations of the detectors. The response matrix was further refined from observations of the Crab X-ray source after launch. At around 20 keV the energy resolution of detector is about 10--15\%, while the combined effective area of the 3 detectors is about 6000 cm$^2$.
△ Less
Submitted 18 June, 2017; v1 submitted 27 February, 2017;
originally announced February 2017.