Skip to main content

Showing 1–26 of 26 results for author: Raine, J A

.
  1. arXiv:2410.22074  [pdf, other

    hep-ph cs.LG

    Variational inference for pile-up removal at hadron colliders with diffusion models

    Authors: Malte Algren, Christopher Pollard, John Andrew Raine, Tobias Golling

    Abstract: In this paper, we present a novel method for pile-up removal of pp interactions using variational inference with diffusion models, called Vipr. Instead of using classification methods to identify which particles are from the primary collision, a generative model is trained to predict the constituents of the hard-scatter particle jets with pile-up removed. This results in an estimate of the full po… ▽ More

    Submitted 29 October, 2024; originally announced October 2024.

    Comments: 19 pages, 13 figures

  2. arXiv:2410.21611  [pdf, other

    physics.ins-det cs.LG hep-ex hep-ph

    CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation

    Authors: Claudius Krause, Michele Faucci Giannelli, Gregor Kasieczka, Benjamin Nachman, Dalila Salamani, David Shih, Anna Zaborowska, Oz Amram, Kerstin Borras, Matthew R. Buckley, Erik Buhmann, Thorsten Buss, Renato Paulo Da Costa Cardoso, Anthony L. Caterini, Nadezda Chernyavskaya, Federico A. G. Corchia, Jesse C. Cresswell, Sascha Diefenbacher, Etienne Dreyer, Vijay Ekambaram, Engin Eren, Florian Ernst, Luigi Favaro, Matteo Franchini, Frank Gaede , et al. (44 additional authors not shown)

    Abstract: We present the results of the "Fast Calorimeter Simulation Challenge 2022" - the CaloChallenge. We study state-of-the-art generative models on four calorimeter shower datasets of increasing dimensionality, ranging from a few hundred voxels to a few tens of thousand voxels. The 31 individual submissions span a wide range of current popular generative architectures, including Variational AutoEncoder… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: 204 pages, 100+ figures, 30+ tables

    Report number: HEPHY-ML-24-05, FERMILAB-PUB-24-0728-CMS, TTK-24-43

  3. arXiv:2408.11616  [pdf, other

    hep-ph

    RODEM Jet Datasets

    Authors: Knut Zoch, John Andrew Raine, Debajyoti Sengupta, Tobias Golling

    Abstract: We present the RODEM Jet Datasets, a comprehensive collection of simulated large-radius jets designed to support the development and evaluation of machine-learning algorithms in particle physics. These datasets encompass a diverse range of jet sources, including quark/gluon jets, jets from the decay of W bosons, top quarks, and heavy new-physics particles. The datasets provide detailed substructur… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: The datasets are available on Zenodo at https://doi.org/10.5281/zenodo.12793616

  4. arXiv:2406.13074  [pdf, other

    hep-ph cs.LG hep-ex stat.ML

    PIPPIN: Generating variable length full events from partons

    Authors: Guillaume Quétant, John Andrew Raine, Matthew Leigh, Debajyoti Sengupta, Tobias Golling

    Abstract: This paper presents a novel approach for directly generating full events at detector-level from parton-level information, leveraging cutting-edge machine learning techniques. To address the challenge of multiplicity variations between parton and reconstructed object spaces, we employ transformers, score-based models and normalizing flows. Our method tackles the inherent complexities of the stochas… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Journal ref: Phys. Rev. D 110, 076023 (Published 21 October 2024)

  5. arXiv:2405.12131  [pdf, other

    astro-ph.GA hep-ph physics.data-an

    SkyCURTAINs: Model agnostic search for Stellar Streams with Gaia data

    Authors: Debajyoti Sengupta, Stephen Mulligan, David Shih, John Andrew Raine, Tobias Golling

    Abstract: We present SkyCURTAINs, a data driven and model agnostic method to search for stellar streams in the Milky Way galaxy using data from the Gaia telescope. SkyCURTAINs is a weakly supervised machine learning algorithm that builds a background enriched template in the signal region by leveraging the correlation of the source's characterising features with their proper motion in the sky. This allows f… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  6. arXiv:2404.07258  [pdf, other

    hep-ph hep-ex physics.data-an

    Complete Optimal Non-Resonant Anomaly Detection

    Authors: Gregor Kasieczka, John Andrew Raine, David Shih, Aman Upadhyay

    Abstract: We propose the first-ever complete, model-agnostic search strategy based on the optimal anomaly score, for new physics on the tails of distributions. Signal sensitivity is achieved via a classifier trained on auxiliary features in a weakly-supervised fashion, and backgrounds are predicted using the ABCD method in the classifier output and the primary tail feature. The independence between the clas… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 9 pages, 9 figures

  7. arXiv:2402.17714  [pdf, other

    hep-ph hep-ex physics.data-an

    Cluster Scanning: a novel approach to resonance searches

    Authors: Ivan Oleksiyuk, John Andrew Raine, Michael Krämer, Svyatoslav Voloshynovskiy, Tobias Golling

    Abstract: We propose a new model-independent method for new physics searches called Cluster Scanning. It uses the k-means algorithm to perform clustering in the space of low-level event or jet observables, and separates potentially anomalous clusters to construct a signal-enriched region. The spectra of a selected observable (e.g. invariant mass) in these two regions are then used to determine whether a res… ▽ More

    Submitted 21 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 33 pages, 11 figures

  8. arXiv:2401.13537  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    Masked Particle Modeling on Sets: Towards Self-Supervised High Energy Physics Foundation Models

    Authors: Tobias Golling, Lukas Heinrich, Michael Kagan, Samuel Klein, Matthew Leigh, Margarita Osadchy, John Andrew Raine

    Abstract: We propose masked particle modeling (MPM) as a self-supervised method for learning generic, transferable, and reusable representations on unordered sets of inputs for use in high energy physics (HEP) scientific data. This work provides a novel scheme to perform masked modeling based pre-training to learn permutation invariant functions on sets. More generally, this work provides a step towards bui… ▽ More

    Submitted 11 July, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  9. arXiv:2312.10130  [pdf, other

    physics.data-an cs.LG hep-ex hep-ph

    Improving new physics searches with diffusion models for event observables and jet constituents

    Authors: Debajyoti Sengupta, Matthew Leigh, John Andrew Raine, Samuel Klein, Tobias Golling

    Abstract: We introduce a new technique called Drapes to enhance the sensitivity in searches for new physics at the LHC. By training diffusion models on side-band data, we show how background templates for the signal region can be generated either directly from noise, or by partially applying the diffusion process to existing data. In the partial diffusion case, data can be drawn from side-band regions, with… ▽ More

    Submitted 19 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: 34 pages, 19 figures

  10. arXiv:2310.00049  [pdf, other

    hep-ph cs.LG

    EPiC-ly Fast Particle Cloud Generation with Flow-Matching and Diffusion

    Authors: Erik Buhmann, Cedric Ewen, Darius A. Faroughy, Tobias Golling, Gregor Kasieczka, Matthew Leigh, Guillaume Quétant, John Andrew Raine, Debajyoti Sengupta, David Shih

    Abstract: Jets at the LHC, typically consisting of a large number of highly correlated particles, are a fascinating laboratory for deep generative modeling. In this paper, we present two novel methods that generate LHC jets as point clouds efficiently and accurately. We introduce \epcjedi, which combines score-matching diffusion models with the Equivariant Point Cloud (EPiC) architecture based on the deep s… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: 21 pages, 8 figures

  11. arXiv:2309.06472  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    Flows for Flows: Morphing one Dataset into another with Maximum Likelihood Estimation

    Authors: Tobias Golling, Samuel Klein, Radha Mastandrea, Benjamin Nachman, John Andrew Raine

    Abstract: Many components of data analysis in high energy physics and beyond require morphing one dataset into another. This is commonly solved via reweighting, but there are many advantages of preserving weights and shifting the data points instead. Normalizing flows are machine learning models with impressive precision on a variety of particle physics tasks. Naively, normalizing flows cannot be used for m… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 15 pages, 17 figures. This work is a merger of arXiv:2211.02487 and arXiv:2212.06155

  12. arXiv:2308.11700  [pdf, other

    physics.ins-det cs.LG hep-ex hep-ph physics.data-an

    Calorimeter shower superresolution

    Authors: Ian Pang, John Andrew Raine, David Shih

    Abstract: Calorimeter shower simulation is a major bottleneck in the Large Hadron Collider computational pipeline. There have been recent efforts to employ deep-generative surrogate models to overcome this challenge. However, many of best performing models have training and generation times that do not scale well to high-dimensional calorimeter showers. In this work, we introduce SuperCalo, a flow-based sup… ▽ More

    Submitted 15 May, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: 16 pages, 13 figures, v3: title changed, matches published version

    Journal ref: Phys. Rev. D 109, 092009 (2024)

  13. arXiv:2307.11157  [pdf, other

    hep-ph hep-ex physics.data-an

    The Interplay of Machine Learning--based Resonant Anomaly Detection Methods

    Authors: Tobias Golling, Gregor Kasieczka, Claudius Krause, Radha Mastandrea, Benjamin Nachman, John Andrew Raine, Debajyoti Sengupta, David Shih, Manuel Sommerhalder

    Abstract: Machine learning--based anomaly detection (AD) methods are promising tools for extending the coverage of searches for physics beyond the Standard Model (BSM). One class of AD methods that has received significant attention is resonant anomaly detection, where the BSM is assumed to be localized in at least one known variable. While there have been many methods proposed to identify such a BSM signal… ▽ More

    Submitted 14 March, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 27 pages, 21 figures. Updated with revisions for journal acceptance

  14. arXiv:2307.06836  [pdf, other

    hep-ex cs.LG hep-ph

    PC-Droid: Faster diffusion and improved quality for particle cloud generation

    Authors: Matthew Leigh, Debajyoti Sengupta, John Andrew Raine, Guillaume Quétant, Tobias Golling

    Abstract: Building on the success of PC-JeDi we introduce PC-Droid, a substantially improved diffusion model for the generation of jet particle clouds. By leveraging a new diffusion formulation, studying more recent integration solvers, and training on all jet types simultaneously, we are able to achieve state-of-the-art performance for all types of jets across all evaluation metrics. We study the trade-off… ▽ More

    Submitted 18 August, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: 21 pages, 8 tables, 13 figures

  15. Decorrelation using Optimal Transport

    Authors: Malte Algren, John Andrew Raine, Tobias Golling

    Abstract: Being able to decorrelate a feature space from protected attributes is an area of active research and study in ethics, fairness, and also natural sciences. We introduce a novel decorrelation method using Convex Neural Optimal Transport Solvers (Cnots) that is able to decorrelate a continuous feature space against protected attributes with optimal transport. We demonstrate how well it performs in t… ▽ More

    Submitted 14 July, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Journal ref: Eur. Phys. J. C 84, 579 (2024)

  16. arXiv:2307.02405  [pdf, other

    hep-ph cs.LG hep-ex

    $ν^2$-Flows: Fast and improved neutrino reconstruction in multi-neutrino final states with conditional normalizing flows

    Authors: John Andrew Raine, Matthew Leigh, Knut Zoch, Tobias Golling

    Abstract: In this work we introduce $ν^2$-Flows, an extension of the $ν$-Flows method to final states containing multiple neutrinos. The architecture can natively scale for all combinations of object types and multiplicities in the final state for any desired neutrino multiplicities. In $t\bar{t}$ dilepton events, the momenta of both neutrinos and correlations between them are reconstructed more accurately… ▽ More

    Submitted 15 December, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: 24 pages, 19 figures, 6 tables

  17. arXiv:2305.04646  [pdf, other

    hep-ph cs.LG hep-ex

    CURTAINs Flows For Flows: Constructing Unobserved Regions with Maximum Likelihood Estimation

    Authors: Debajyoti Sengupta, Samuel Klein, John Andrew Raine, Tobias Golling

    Abstract: Model independent techniques for constructing background data templates using generative models have shown great promise for use in searches for new physics processes at the LHC. We introduce a major improvement to the CURTAINs method by training the conditional normalizing flow between two side-band regions using maximum likelihood estimation instead of an optimal transport loss. The new training… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 19 pages, 10 figures, 4 tables

  18. arXiv:2304.14963  [pdf, other

    hep-ph cs.LG

    Flow Away your Differences: Conditional Normalizing Flows as an Improvement to Reweighting

    Authors: Malte Algren, Tobias Golling, Manuel Guth, Chris Pollard, John Andrew Raine

    Abstract: We present an alternative to reweighting techniques for modifying distributions to account for a desired change in an underlying conditional distribution, as is often needed to correct for mis-modelling in a simulated sample. We employ conditional normalizing flows to learn the full conditional probability distribution from which we sample new events for conditional values drawn from the target di… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: 21 pages, 9 figures

  19. arXiv:2303.14134  [pdf, other

    hep-ph

    The Mass-ive Issue: Anomaly Detection in Jet Physics

    Authors: Tobias Golling, Takuya Nobe, Dimitrios Proios, John Andrew Raine, Debajyoti Sengupta, Slava Voloshynovskiy, Jean-Francois Arguin, Julien Leissner Martin, Jacinthe Pilette, Debottam Bakshi Gupta, Amir Farbin

    Abstract: In the hunt for new and unobserved phenomena in particle physics, attention has turned in recent years to using advanced machine learning techniques for model independent searches. In this paper we highlight the main challenge of applying anomaly detection to jet physics, where preserving an unbiased estimator of the jet mass remains a critical piece of any model independent search. Using Variatio… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: 6 pages, 5 figures. Accepted at Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020)

  20. arXiv:2303.13937  [pdf, other

    hep-ph cs.LG hep-ex

    Topological Reconstruction of Particle Physics Processes using Graph Neural Networks

    Authors: Lukas Ehrke, John Andrew Raine, Knut Zoch, Manuel Guth, Tobias Golling

    Abstract: We present a new approach, the Topograph, which reconstructs underlying physics processes, including the intermediary particles, by leveraging underlying priors from the nature of particle physics decays and the flexibility of message passing graph neural networks. The Topograph not only solves the combinatoric assignment of observed final state objects, associating them to their original mother p… ▽ More

    Submitted 13 October, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: 25 pages, 24 figures, 8 tables

    Journal ref: Phys. Rev. D 107 (2023) 116019

  21. PC-JeDi: Diffusion for Particle Cloud Generation in High Energy Physics

    Authors: Matthew Leigh, Debajyoti Sengupta, Guillaume Quétant, John Andrew Raine, Knut Zoch, Tobias Golling

    Abstract: In this paper, we present a new method to efficiently generate jets in High Energy Physics called PC-JeDi. This method utilises score-based diffusion models in conjunction with transformers which are well suited to the task of generating jets as particle clouds due to their permutation equivariance. PC-JeDi achieves competitive performance with current state-of-the-art methods across several metri… ▽ More

    Submitted 21 February, 2024; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: 30 pages, 25 figures, 5 tables

    Journal ref: SciPost Phys. 16, 018 (2024)

  22. arXiv:2211.02487  [pdf, other

    cs.LG

    Flows for Flows: Training Normalizing Flows Between Arbitrary Distributions with Maximum Likelihood Estimation

    Authors: Samuel Klein, John Andrew Raine, Tobias Golling

    Abstract: Normalizing flows are constructed from a base distribution with a known density and a diffeomorphism with a tractable Jacobian. The base density of a normalizing flow can be parameterised by a different normalizing flow, thus allowing maps to be found between arbitrary distributions. We demonstrate and explore the utility of this approach and show it is particularly interesting in the case of cond… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  23. ν-Flows: Conditional Neutrino Regression

    Authors: Matthew Leigh, John Andrew Raine, Knut Zoch, Tobias Golling

    Abstract: We present $ν$-Flows, a novel method for restricting the likelihood space of neutrino kinematics in high energy collider experiments using conditional normalizing flows and deep invertible neural networks. This method allows the recovery of the full neutrino momentum which is usually left as a free parameter and permits one to sample neutrino values under a learned conditional likelihood given eve… ▽ More

    Submitted 22 June, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: 26 pages, 15 figures

    Journal ref: SciPost Phys. 14 (2023) 159

  24. CURTAINs for your Sliding Window: Constructing Unobserved Regions by Transforming Adjacent Intervals

    Authors: John Andrew Raine, Samuel Klein, Debajyoti Sengupta, Tobias Golling

    Abstract: We propose a new model independent technique for constructing background data templates for use in searches for new physics processes at the LHC. This method, called CURTAINs, uses invertible neural networks to parametrise the distribution of side band data as a function of the resonant observable. The network learns a transformation to map any data point from its value of the resonant observable… ▽ More

    Submitted 10 February, 2023; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: 31 pages, 18 figures, 2 tables

  25. arXiv:2202.05012  [pdf, other

    physics.data-an astro-ph.IM cs.LG hep-ex physics.acc-ph

    SUPA: A Lightweight Diagnostic Simulator for Machine Learning in Particle Physics

    Authors: Atul Kumar Sinha, Daniele Paliotta, Bálint Máté, Sebastian Pina-Otey, John A. Raine, Tobias Golling, François Fleuret

    Abstract: Deep learning methods have gained popularity in high energy physics for fast modeling of particle showers in detectors. Detailed simulation frameworks such as the gold standard Geant4 are computationally intensive, and current deep generative architectures work on discretized, lower resolution versions of the detailed simulation. The development of models that work at higher spatial resolutions is… ▽ More

    Submitted 21 October, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

  26. arXiv:2112.08069  [pdf, other

    cs.LG stat.ML

    Funnels: Exact maximum likelihood with dimensionality reduction

    Authors: Samuel Klein, John A. Raine, Sebastian Pina-Otey, Slava Voloshynovskiy, Tobias Golling

    Abstract: Normalizing flows are diffeomorphic, typically dimension-preserving, models trained using the likelihood of the model. We use the SurVAE framework to construct dimension reducing surjective flows via a new layer, known as the funnel. We demonstrate its efficacy on a variety of datasets, and show it improves upon or matches the performance of existing flows while having a reduced latent space size.… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 16 pages, 5 figures, 8 tables