-
Transforming Simulation to Data Without Pairing
Authors:
Eli Gendreau-Distler,
Luc Le Pottier,
Haichen Wang
Abstract:
We explore a generative machine learning-based approach for estimating multidimensional probability density functions (PDFs) in a target sample using a statistically independent but related control sample, a common challenge in particle physics data analysis. The generative model must accurately reproduce individual observable distributions while preserving the correlations between them, based on the input multidimensional distribution from the control sample. Here we present a conditional normalizing flow (CNF) model based on a chain of bijectors that learns to transform unpaired simulation events to data events. We assess the performance of the CNF model in the context of an LHC Higgs-to-diphoton analysis, where we use the CNF model to convert a Monte Carlo diphoton sample to one that models data. We show that the CNF model can accurately model complex data distributions and correlations. We also leverage the recently popularized Modified Differential Multiplier Method (MDMM) to improve the convergence of our model and to assign physical meaning to otherwise arbitrary loss-function parameters.
Submitted 15 April, 2025;
originally announced April 2025.
-
United States Muon Collider Community White Paper for the European Strategy for Particle Physics Update
Authors:
A. Abdelhamid,
D. Acosta,
P. Affleck,
G. Agarwal,
K. Agashe,
P. Agrawal,
R. Alharthy,
B. Allmond,
D. Ally,
G. Ambrosio,
O. Amram,
A. Apresyan,
A. Apyan,
C. Aruta,
C. Arzate,
P. Asadi,
J. Ashley,
A. Avasthi,
J. Backus,
R. Bartek,
A. Batz,
L. Bauerdick,
C. Bell,
S. Belomestnykh,
J. S. Berg, et al. (280 additional authors not shown)
Abstract:
This document is being submitted to the 2024-2026 European Strategy for Particle Physics Update (ESPPU) process on behalf of the US Muon Collider community, with its preparation coordinated by the interim US Muon Collider Coordination Group. The US Muon Collider Community comprises a few hundred American scientists. The purpose of the document is to inform ESPPU about the US plans for Muon Collider research and development (R&D), explain how these efforts align with the broader international R&D initiatives, and present the US community vision for the future realization of this transformative project.
Submitted 15 April, 2025; v1 submitted 30 March, 2025;
originally announced March 2025.
-
Autoencoders for Semivisible Jet Detection
Authors:
Florencia Canelli,
Annapaola de Cosa,
Luc Le Pottier,
Jeremi Niedziela,
Kevin Pedro,
Maurizio Pierini
Abstract:
The production of dark matter particles from confining dark sectors may lead to many novel experimental signatures. Depending on the details of the theory, dark quark production in proton-proton collisions could result in semivisible jets of particles: collimated sprays of dark hadrons of which only some are detectable by particle collider experiments. The experimental signature is characterised by the presence of reconstructed missing momentum collinear with the visible components of the jets. This complex topology is sensitive to detector inefficiencies and mis-reconstruction that generate artificial missing momentum. With this work, we propose a signal-agnostic strategy to reject ordinary jets and identify semivisible jets via anomaly detection techniques. A deep neural autoencoder network with jet substructure variables as input proves highly useful for analyzing anomalous jets. The study focuses on the semivisible jet signature; however, the technique can apply to any new physics model that predicts signatures with anomalous jets from non-SM particles.
Submitted 18 January, 2022; v1 submitted 6 December, 2021;
originally announced December 2021.
-
The LHC Olympics 2020: A Community Challenge for Anomaly Detection in High Energy Physics
Authors:
Gregor Kasieczka,
Benjamin Nachman,
David Shih,
Oz Amram,
Anders Andreassen,
Kees Benkendorfer,
Blaz Bortolato,
Gustaaf Brooijmans,
Florencia Canelli,
Jack H. Collins,
Biwei Dai,
Felipe F. De Freitas,
Barry M. Dillon,
Ioan-Mihail Dinu,
Zhongtian Dong,
Julien Donini,
Javier Duarte,
D. A. Faroughy,
Julia Gonski,
Philip Harris,
Alan Kahn,
Jernej F. Kamenik,
Charanjit K. Khosa,
Patrick Komiske,
Luc Le Pottier, et al. (22 additional authors not shown)
Abstract:
A new paradigm for data-driven, model-agnostic new physics searches at colliders is emerging, and aims to leverage recent breakthroughs in anomaly detection and machine learning. In order to develop and benchmark new anomaly detection methods within this framework, it is essential to have standard datasets. To this end, we have created the LHC Olympics 2020, a community challenge accompanied by a set of simulated collider events. Participants in these Olympics have developed their methods using an R&D dataset and then tested them on black boxes: datasets with an unknown anomaly (or not). This paper will review the LHC Olympics 2020 challenge, including an overview of the competition, a description of methods deployed in the competition, lessons learned from the experience, and implications for data analyses with future datasets as well as future colliders.
Submitted 20 January, 2021;
originally announced January 2021.
-
Simulation-Assisted Decorrelation for Resonant Anomaly Detection
Authors:
Kees Benkendorfer,
Luc Le Pottier,
Benjamin Nachman
Abstract:
A growing number of weak- and unsupervised machine learning approaches to anomaly detection are being proposed to significantly extend the search program at the Large Hadron Collider and elsewhere. One of the prototypical examples for these methods is the search for resonant new physics, where a bump hunt can be performed in an invariant mass spectrum. A significant challenge to methods that rely entirely on data is that they are susceptible to sculpting artificial bumps from the dependence of the machine learning classifier on the invariant mass. We explore two solutions to this challenge by minimally incorporating simulation into the learning. First, we study the robustness of Simulation Assisted Likelihood-free Anomaly Detection (SALAD) to correlations between the classifier and the invariant mass. Next, we propose a new approach that uses the simulation only for decorrelation and relies on the Classification without Labels (CWoLa) approach for signal sensitivity. Both methods are compared using a full background fit analysis on simulated data from the LHC Olympics and are robust to correlations in the data.
Submitted 4 September, 2020;
originally announced September 2020.