-
A Search for "New Physics'' "Beyond the Standard Model'' in Open Data with Machine Learning
Authors:
Rikab Gambhir
Abstract:
In this new era of large data, it is important to make sure we do not miss any signs of new physics. Using the publicly-available open data collected by the arXiv.org experiment in the \texttt{hep-ph} channel, corresponding to a raw total integrated $\mathcal{L}$iterature of 65,276 papers, we perform a search for ``New Physics'' and related signals. In the worst-case, we are able to detect ``New P…
▽ More
In this new era of large data, it is important to make sure we do not miss any signs of new physics. Using the publicly-available open data collected by the arXiv.org experiment in the \texttt{hep-ph} channel, corresponding to a raw total integrated $\mathcal{L}$iterature of 65,276 papers, we perform a search for ``New Physics'' and related signals. In the worst-case, we are able to detect ``New Physics'' with ``the LHC'' at a significance level of at least $6.5σ$. This ``New Physics'' signature is primarily ``Dark'' in nature, and is potentially axion(-like) dark matter. We also show the potential for further improvement in the future, and that ``New Physics'' can be found with ``a Future Collider'' at at least $8.9σ$, as well as the potential to find ``New Physics'' without any collider at all. This search is performed using code that was $80\%$ written by Machine Learning methods.
△ Less
Submitted 28 March, 2025;
originally announced March 2025.
-
Isolating Unisolated Upsilons with Anomaly Detection in CMS Open Data
Authors:
Rikab Gambhir,
Radha Mastandrea,
Benjamin Nachman,
Jesse Thaler
Abstract:
We present the first study of anti-isolated Upsilon decays to two muons ($Υ\to μ^+ μ^-$) in proton-proton collisions at the Large Hadron Collider. Using a machine learning (ML)-based anomaly detection strategy, we "rediscover" the $Υ$ in 13 TeV CMS Open Data from 2016, despite overwhelming anti-isolated backgrounds. We elevate the signal significance to $6.4 σ$ using these methods, starting from…
▽ More
We present the first study of anti-isolated Upsilon decays to two muons ($Υ\to μ^+ μ^-$) in proton-proton collisions at the Large Hadron Collider. Using a machine learning (ML)-based anomaly detection strategy, we "rediscover" the $Υ$ in 13 TeV CMS Open Data from 2016, despite overwhelming anti-isolated backgrounds. We elevate the signal significance to $6.4 σ$ using these methods, starting from $1.6 σ$ using the dimuon mass spectrum alone. Moreover, we demonstrate improved sensitivity from using an ML-based estimate of the multi-feature likelihood compared to traditional "cut-and-count" methods. Our work demonstrates that it is possible and practical to find real signals in experimental collider data using ML-based anomaly detection, and we distill a readily-accessible benchmark dataset from the CMS Open Data to facilitate future anomaly detection developments.
△ Less
Submitted 27 February, 2025; v1 submitted 19 February, 2025;
originally announced February 2025.
-
SPECTER: Efficient Evaluation of the Spectral EMD
Authors:
Rikab Gambhir,
Andrew J. Larkoski,
Jesse Thaler
Abstract:
The Energy Mover's Distance (EMD) has seen use in collider physics as a metric between events and as a geometric method of defining infrared and collinear safe observables. Recently, the Spectral Energy Mover's Distance (SEMD) has been proposed as a more analytically tractable alternative to the EMD. In this work, we obtain a closed-form expression for the Riemannian-like p = 2 SEMD metric between…
▽ More
The Energy Mover's Distance (EMD) has seen use in collider physics as a metric between events and as a geometric method of defining infrared and collinear safe observables. Recently, the Spectral Energy Mover's Distance (SEMD) has been proposed as a more analytically tractable alternative to the EMD. In this work, we obtain a closed-form expression for the Riemannian-like p = 2 SEMD metric between events, eliminating the need to numerically solve an optimal transport problem. Additionally, we show how the SEMD can be used to define event and jet shape observables by minimizing the distance between events and parameterized energy flows (similar to the EMD), and we obtain closed-form expressions for several of these observables. We also present the SPECTER framework, an efficient and highly parallelized implementation of the SEMD metric and SEMD-derived shape observables as an analogue of the previously-introduced SHAPER for EMD-based computations. We demonstrate that computing the SEMD with SPECTER can be up to a thousand times faster than computing the EMD with standard optimal transport libraries.
△ Less
Submitted 10 January, 2025; v1 submitted 7 October, 2024;
originally announced October 2024.
-
Moments of Clarity: Streamlining Latent Spaces in Machine Learning using Moment Pooling
Authors:
Rikab Gambhir,
Athis Osathapan,
Jesse Thaler
Abstract:
Many machine learning applications involve learning a latent representation of data, which is often high-dimensional and difficult to directly interpret. In this work, we propose "Moment Pooling", a natural extension of Deep Sets networks which drastically decrease latent space dimensionality of these networks while maintaining or even improving performance. Moment Pooling generalizes the summatio…
▽ More
Many machine learning applications involve learning a latent representation of data, which is often high-dimensional and difficult to directly interpret. In this work, we propose "Moment Pooling", a natural extension of Deep Sets networks which drastically decrease latent space dimensionality of these networks while maintaining or even improving performance. Moment Pooling generalizes the summation in Deep Sets to arbitrary multivariate moments, which enables the model to achieve a much higher effective latent dimensionality for a fixed latent dimension. We demonstrate Moment Pooling on the collider physics task of quark/gluon jet classification by extending Energy Flow Networks (EFNs) to Moment EFNs. We find that Moment EFNs with latent dimensions as small as 1 perform similarly to ordinary EFNs with higher latent dimension. This small latent dimension allows for the internal representation to be directly visualized and interpreted, which in turn enables the learned internal jet representation to be extracted in closed form.
△ Less
Submitted 17 October, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Seeing Double: Calibrating Two Jets at Once
Authors:
Rikab Gambhir,
Benjamin Nachman
Abstract:
Jet energy calibration is an important aspect of many measurements and searches at the LHC. Currently, these calibrations are performed on a per-jet basis, i.e. agnostic to the properties of other jets in the same event. In this work, we propose taking advantage of the correlations induced by momentum conservation between jets in order to improve their jet energy calibration. By fitting the $p_T$…
▽ More
Jet energy calibration is an important aspect of many measurements and searches at the LHC. Currently, these calibrations are performed on a per-jet basis, i.e. agnostic to the properties of other jets in the same event. In this work, we propose taking advantage of the correlations induced by momentum conservation between jets in order to improve their jet energy calibration. By fitting the $p_T$ asymmetry of dijet events in simulation, while remaining agnostic to the $p_T$ spectra themselves, we are able to obtain correlation-improved maximum likelihood estimates. This approach is demonstrated with simulated jets from the CMS Detector, yielding a $3$-$5\%$ relative improvement in the jet energy resolution, corresponding to a quadrature improvement of approximately 35\%.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
The New Physics Case for Beam-Dump Experiments with Accelerated Muon Beams
Authors:
Cari Cesarotti,
Rikab Gambhir
Abstract:
As the field examines a future muon collider as a possible successor to the LHC, we must consider how to fully utilize not only the high-energy particle collisions, but also any lower-energy staging facilities necessary in the R&D process. An economical and efficient possibility is to use the accelerated muon beam from either the full experiment or from cooling and acceleration tests in beam-dump…
▽ More
As the field examines a future muon collider as a possible successor to the LHC, we must consider how to fully utilize not only the high-energy particle collisions, but also any lower-energy staging facilities necessary in the R&D process. An economical and efficient possibility is to use the accelerated muon beam from either the full experiment or from cooling and acceleration tests in beam-dump experiments.Beam-dump experiments are complementary to the main collider as they achieve sensitivity to very small couplings with minimal instrumentation. We demonstrate the utility of muon beam-dump experiments for new physics searches at energies from 10 GeV to 5 TeV. We find that, even at low energies like those accessible at staging or demonstrator facilities, it is possible to probe new regions of parameter space for a variety of generic BSM models, including muonphilic, leptophilic, $L_μ- L_τ$, and dark photon scenarios. Such experiments could therefore provide opportunities for discovery of new physics well before the completion of the full multi-TeV collider.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
SHAPER: Can You Hear the Shape of a Jet?
Authors:
Demba Ba,
Akshunna S. Dogra,
Rikab Gambhir,
Abiy Tasissa,
Jesse Thaler
Abstract:
The identification of interesting substructures within jets is an important tool for searching for new physics and probing the Standard Model at colliders. Many of these substructure tools have previously been shown to take the form of optimal transport problems, in particular the Energy Mover's Distance (EMD). In this work, we show that the EMD is in fact the natural structure for comparing colli…
▽ More
The identification of interesting substructures within jets is an important tool for searching for new physics and probing the Standard Model at colliders. Many of these substructure tools have previously been shown to take the form of optimal transport problems, in particular the Energy Mover's Distance (EMD). In this work, we show that the EMD is in fact the natural structure for comparing collider events, which accounts for its recent success in understanding event and jet substructure. We then present a Shape Hunting Algorithm using Parameterized Energy Reconstruction (SHAPER), which is a general framework for defining and computing shape-based observables. SHAPER generalizes N-jettiness from point clusters to any extended, parametrizable shape. This is accomplished by efficiently minimizing the EMD between events and parameterized manifolds of energy flows representing idealized shapes, implemented using the dual-potential Sinkhorn approximation of the Wasserstein metric. We show how the geometric language of observables as manifolds can be used to define novel observables with built-in infrared-and-collinear safety. We demonstrate the efficacy of the SHAPER framework by performing empirical jet substructure studies using several examples of new shape-based observables.
△ Less
Submitted 20 July, 2023; v1 submitted 23 February, 2023;
originally announced February 2023.
-
Bias and Priors in Machine Learning Calibrations for High Energy Physics
Authors:
Rikab Gambhir,
Benjamin Nachman,
Jesse Thaler
Abstract:
Machine learning offers an exciting opportunity to improve the calibration of nearly all reconstructed objects in high-energy physics detectors. However, machine learning approaches often depend on the spectra of examples used during training, an issue known as prior dependence. This is an undesirable property of a calibration, which needs to be applicable in a variety of environments. The purpose…
▽ More
Machine learning offers an exciting opportunity to improve the calibration of nearly all reconstructed objects in high-energy physics detectors. However, machine learning approaches often depend on the spectra of examples used during training, an issue known as prior dependence. This is an undesirable property of a calibration, which needs to be applicable in a variety of environments. The purpose of this paper is to explicitly highlight the prior dependence of some machine learning-based calibration strategies. We demonstrate how some recent proposals for both simulation-based and data-based calibrations inherit properties of the sample used for training, which can result in biases for downstream analyses. In the case of simulation-based calibration, we argue that our recently proposed Gaussian Ansatz approach can avoid some of the pitfalls of prior dependence, whereas prior-independent data-based calibration remains an open problem.
△ Less
Submitted 31 August, 2022; v1 submitted 10 May, 2022;
originally announced May 2022.
-
Learning Uncertainties the Frequentist Way: Calibration and Correlation in High Energy Physics
Authors:
Rikab Gambhir,
Benjamin Nachman,
Jesse Thaler
Abstract:
Calibration is a common experimental physics problem, whose goal is to infer the value and uncertainty of an unobservable quantity Z given a measured quantity X. Additionally, one would like to quantify the extent to which X and Z are correlated. In this paper, we present a machine learning framework for performing frequentist maximum likelihood inference with Gaussian uncertainty estimation, whic…
▽ More
Calibration is a common experimental physics problem, whose goal is to infer the value and uncertainty of an unobservable quantity Z given a measured quantity X. Additionally, one would like to quantify the extent to which X and Z are correlated. In this paper, we present a machine learning framework for performing frequentist maximum likelihood inference with Gaussian uncertainty estimation, which also quantifies the mutual information between the unobservable and measured quantities. This framework uses the Donsker-Varadhan representation of the Kullback-Leibler divergence -- parametrized with a novel Gaussian Ansatz -- to enable a simultaneous extraction of the maximum likelihood values, uncertainties, and mutual information in a single training. We demonstrate our framework by extracting jet energy corrections and resolution factors from a simulation of the CMS detector at the Large Hadron Collider. By leveraging the high-dimensional feature space inside jets, we improve upon the nominal CMS jet resolution by upwards of 15%.
△ Less
Submitted 24 September, 2023; v1 submitted 6 May, 2022;
originally announced May 2022.