Skip to main content

Showing 1–50 of 68 results for author: Kasieczka, G

Searching in archive hep-ph. Search in all archives.
.
  1. arXiv:2506.21720  [pdf, ps, other

    physics.ins-det cs.LG hep-ex hep-ph physics.data-an

    CaloHadronic: a diffusion model for the generation of hadronic showers

    Authors: Thorsten Buss, Frank Gaede, Gregor Kasieczka, Anatolii Korol, Katja Krüger, Peter McKeown, Martina Mozzanica

    Abstract: Simulating showers of particles in highly-granular calorimeters is a key frontier in the application of machine learning to particle physics. Achieving high accuracy and speed with generative machine learning models can enable them to augment traditional simulations and alleviate a major computing constraint. Recent developments have shown how diffusion based generative shower simulation approache… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  2. arXiv:2506.11192  [pdf, ps, other

    hep-ph

    Quirk SUEP

    Authors: David Curtin, Sascha Dreyer, Max Fusté Costa, Sarah Heim, Gregor Kasieczka, Louis Moureaux, David Rousso, David Shih, Manuel Sommerhalder

    Abstract: We propose searching for physics beyond the Standard Model in the low-transverse-momentum tracks accompanying hard-scatter events at the LHC. TeV-scale resonances connected to a dark QCD sector could be enhanced by selecting events with anomalies in the track distributions. As a benchmark, a quirk model with microscopic string lengths is developed, including a setup for event simulation. For this… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Report number: DESY-25-081-0

  3. arXiv:2501.05534  [pdf, ps, other

    hep-ph cs.LG hep-ex physics.ins-det

    OmniJet-$α_C$: Learning point cloud calorimeter simulations using generative transformers

    Authors: Joschka Birk, Frank Gaede, Anna Hallin, Gregor Kasieczka, Martina Mozzanica, Henning Rose

    Abstract: We show the first use of generative transformers for generating calorimeter showers as point clouds in a high-granularity calorimeter. Using the tokenizer and generative part of the OmniJet-$α$ model, we represent the hits in the detector as sequences of integers. This model allows variable-length sequences, which means that it supports realistic shower development and does not need to be conditio… ▽ More

    Submitted 11 June, 2025; v1 submitted 9 January, 2025; originally announced January 2025.

  4. arXiv:2501.05382  [pdf, other

    physics.data-an cs.AI hep-ph physics.comp-ph physics.hist-ph

    Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models

    Authors: Kristian G. Barman, Sascha Caron, Emily Sullivan, Henk W. de Regt, Roberto Ruiz de Austri, Mieke Boon, Michael Färber, Stefan Fröse, Faegheh Hasibi, Andreas Ipp, Rukshak Kapoor, Gregor Kasieczka, Daniel Kostić, Michael Krämer, Tobias Golling, Luis G. Lopez, Jesus Marco, Sydney Otten, Pawel Pawlowski, Pietro Vischia, Erik Weber, Christoph Weniger

    Abstract: This paper explores ideas and provides a potential roadmap for the development and evaluation of physics-specific large-scale AI models, which we call Large Physics Models (LPMs). These models, based on foundation models such as Large Language Models (LLMs) - trained on broad data - are tailored to address the demands of physics research. LPMs can function independently or as part of an integrated… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

  5. arXiv:2412.18773  [pdf

    hep-ph cs.LG hep-ex

    Learning Broken Symmetries with Approximate Invariance

    Authors: Seth Nabat, Aishik Ghosh, Edmund Witkowski, Gregor Kasieczka, Daniel Whiteson

    Abstract: Recognizing symmetries in data allows for significant boosts in neural network training, which is especially important where training data are limited. In many cases, however, the exact underlying symmetry is present only in an idealized dataset, and is broken in actual data, due to asymmetries in the detector, or varying response resolution as a function of particle momentum. Standard approaches,… ▽ More

    Submitted 3 April, 2025; v1 submitted 24 December, 2024; originally announced December 2024.

    Comments: 7 pages, 8 figures

    Journal ref: Phys. Rev. D 111 (2025) 072002

  6. arXiv:2412.10504  [pdf, other

    hep-ph cs.LG hep-ex stat.ML

    Aspen Open Jets: Unlocking LHC Data for Foundation Models in Particle Physics

    Authors: Oz Amram, Luca Anzalone, Joschka Birk, Darius A. Faroughy, Anna Hallin, Gregor Kasieczka, Michael Krämer, Ian Pang, Humberto Reyes-Gonzalez, David Shih

    Abstract: Foundation models are deep learning models pre-trained on large amounts of data which are capable of generalizing to multiple datasets and/or downstream tasks. This work demonstrates how data collected by the CMS experiment at the Large Hadron Collider can be useful in pre-training foundation models for HEP. Specifically, we introduce the AspenOpenJets dataset, consisting of approximately 180M hig… ▽ More

    Submitted 13 December, 2024; originally announced December 2024.

    Comments: 11 pages, 4 figures, the AspenOpenJets dataset can be found at http://doi.org/10.25592/uhhfdm.16505

  7. arXiv:2411.00085  [pdf, other

    hep-ph hep-ex physics.data-an

    Accurate and robust methods for direct background estimation in resonant anomaly detection

    Authors: Ranit Das, Thorben Finke, Marie Hein, Gregor Kasieczka, Michael Krämer, Alexander Mück, David Shih

    Abstract: Resonant anomaly detection methods have great potential for enhancing the sensitivity of traditional bump hunt searches. A key component of these methods is a high quality background template used to produce an anomaly score. Using the LHC Olympics R&D dataset, we demonstrate that this background template can also be repurposed to directly estimate the background expectation in a simple cut and co… ▽ More

    Submitted 31 October, 2024; originally announced November 2024.

    Comments: 26 pages, 9 figures, 2 tables

    Report number: P3H-24-077, TTK-24-45

  8. arXiv:2410.21611  [pdf, other

    physics.ins-det cs.LG hep-ex hep-ph

    CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation

    Authors: Claudius Krause, Michele Faucci Giannelli, Gregor Kasieczka, Benjamin Nachman, Dalila Salamani, David Shih, Anna Zaborowska, Oz Amram, Kerstin Borras, Matthew R. Buckley, Erik Buhmann, Thorsten Buss, Renato Paulo Da Costa Cardoso, Anthony L. Caterini, Nadezda Chernyavskaya, Federico A. G. Corchia, Jesse C. Cresswell, Sascha Diefenbacher, Etienne Dreyer, Vijay Ekambaram, Engin Eren, Florian Ernst, Luigi Favaro, Matteo Franchini, Frank Gaede , et al. (44 additional authors not shown)

    Abstract: We present the results of the "Fast Calorimeter Simulation Challenge 2022" - the CaloChallenge. We study state-of-the-art generative models on four calorimeter shower datasets of increasing dimensionality, ranging from a few hundred voxels to a few tens of thousand voxels. The 31 individual submissions span a wide range of current popular generative architectures, including Variational AutoEncoder… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: 204 pages, 100+ figures, 30+ tables

    Report number: HEPHY-ML-24-05, FERMILAB-PUB-24-0728-CMS, TTK-24-43

  9. arXiv:2408.00838  [pdf, other

    cs.LG cs.AI hep-ph

    Calibrating Bayesian Generative Machine Learning for Bayesiamplification

    Authors: Sebastian Bieringer, Sascha Diefenbacher, Gregor Kasieczka, Mathias Trabs

    Abstract: Recently, combinations of generative and Bayesian machine learning have been introduced in particle physics for both fast detector simulation and inference tasks. These neural networks aim to quantify the uncertainty on the generated distribution originating from limited training statistics. The interpretation of a distribution-wide uncertainty however remains ill-defined. We show a clear scheme f… ▽ More

    Submitted 13 November, 2024; v1 submitted 1 August, 2024; originally announced August 2024.

    Comments: 15 pages, 6 figures, updated references, fixed typo

    Journal ref: 2024 Mach. Learn.: Sci. Technol. 5 045044

  10. arXiv:2407.20315  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    Universal New Physics Latent Space

    Authors: Anna Hallin, Gregor Kasieczka, Sabine Kraml, André Lessa, Louis Moureaux, Tore von Schwartz, David Shih

    Abstract: We develop a machine learning method for mapping data originating from both Standard Model processes and various theories beyond the Standard Model into a unified representation (latent) space while conserving information about the relationship between the underlying theories. We apply our method to three examples of new physics at the LHC of increasing complexity, showing that models can be clust… ▽ More

    Submitted 22 January, 2025; v1 submitted 29 July, 2024; originally announced July 2024.

    Comments: 25 pages, 17 figures

    Journal ref: Phys. Rev. D 111, 016006 (2025)

  11. arXiv:2405.20407  [pdf, other

    physics.ins-det cs.LG hep-ex hep-ph physics.data-an

    Convolutional L2LFlows: Generating Accurate Showers in Highly Granular Calorimeters Using Convolutional Normalizing Flows

    Authors: Thorsten Buss, Frank Gaede, Gregor Kasieczka, Claudius Krause, David Shih

    Abstract: In the quest to build generative surrogate models as computationally efficient alternatives to rule-based simulations, the quality of the generated samples remains a crucial frontier. So far, normalizing flows have been among the models with the best fidelity. However, as the latent space in such models is required to have the same dimensionality as the data space, scaling up normalizing flows to… ▽ More

    Submitted 4 September, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Report number: HEPHY-ML-24-02

    Journal ref: 2024 JINST 19 P09003

  12. arXiv:2405.12972  [pdf, other

    hep-ph hep-ex physics.data-an

    Accelerating Resonance Searches via Signature-Oriented Pre-training

    Authors: Congqiao Li, Antonios Agapitos, Jovin Drews, Javier Duarte, Dawei Fu, Leyun Gao, Raghav Kansal, Gregor Kasieczka, Louis Moureaux, Huilin Qu, Cristina Mantilla Suarez, Qiang Li

    Abstract: The search for heavy resonances beyond the Standard Model (BSM) is a key objective at the LHC. While the recent use of advanced deep neural networks for boosted-jet tagging significantly enhances the sensitivity of dedicated searches, it is limited to specific final states, leaving vast potential BSM phase space underexplored. We introduce a novel experimental method, Signature-Oriented Pre-traini… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 14 pages, 5 figures

  13. arXiv:2404.07258  [pdf, other

    hep-ph hep-ex physics.data-an

    Complete Optimal Non-Resonant Anomaly Detection

    Authors: Gregor Kasieczka, John Andrew Raine, David Shih, Aman Upadhyay

    Abstract: We propose the first-ever complete, model-agnostic search strategy based on the optimal anomaly score, for new physics on the tails of distributions. Signal sensitivity is achieved via a classifier trained on auxiliary features in a weakly-supervised fashion, and backgrounds are predicted using the ABCD method in the classifier output and the primary tail feature. The independence between the clas… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 9 pages, 9 figures

  14. arXiv:2403.05618  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    OmniJet-$α$: The first cross-task foundation model for particle physics

    Authors: Joschka Birk, Anna Hallin, Gregor Kasieczka

    Abstract: Foundation models are multi-dataset and multi-task machine learning methods that once pre-trained can be fine-tuned for a large variety of downstream applications. The successful development of such general-purpose models for physics data would be a major breakthrough as they could improve the achievable physics performance while at the same time drastically reduce the required amount of training… ▽ More

    Submitted 7 September, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Journal ref: Mach. Learn.: Sci. Technol. 5 035031 (2024)

  15. arXiv:2402.15558  [pdf, other

    hep-ph hep-ex physics.data-an

    Classifier Surrogates: Sharing AI-based Searches with the World

    Authors: Sebastian Bieringer, Gregor Kasieczka, Jan Kieseler, Mathias Trabs

    Abstract: In recent years, neural network-based classification has been used to improve data analysis at collider experiments. While this strategy proves to be hugely successful, the underlying models are not commonly shared with the public and rely on experiment-internal data as well as full detector simulations. We show a concrete implementation of a newly proposed strategy, so-called Classifier Surrogate… ▽ More

    Submitted 2 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 10 pages, 6 Figures, 1 Table

    Journal ref: Eur. Phys. J. C 84, 972 (2024)

  16. Les Houches guide to reusable ML models in LHC analyses

    Authors: Jack Y. Araz, Andy Buckley, Gregor Kasieczka, Jan Kieseler, Sabine Kraml, Anders Kvellestad, Andre Lessa, Tomasz Procter, Are Raklev, Humberto Reyes-Gonzalez, Krzysztof Rolbiecki, Sezen Sekmen, Gokhan Unel

    Abstract: With the increasing usage of machine-learning in high-energy physics analyses, the publication of the trained models in a reusable form has become a crucial question for analysis preservation and reuse. The complexity of these models creates practical issues for both reporting them accurately and for ensuring the stability of their behaviours in different environments and over extended timescales.… ▽ More

    Submitted 11 September, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 12 pages; v2: added funding acknowledgement; v3 update in response to referee comments

    Journal ref: SciPost Phys. Comm. Rep. 3 (2024)

  17. arXiv:2312.14027  [pdf, other

    stat.ML cs.LG hep-ph stat.CO

    AdamMCMC: Combining Metropolis Adjusted Langevin with Momentum-based Optimization

    Authors: Sebastian Bieringer, Gregor Kasieczka, Maximilian F. Steffen, Mathias Trabs

    Abstract: Uncertainty estimation is a key issue when considering the application of deep neural network methods in science and engineering. In this work, we introduce a novel algorithm that quantifies epistemic uncertainty via Monte Carlo sampling from a tempered posterior distribution. It combines the well established Metropolis Adjusted Langevin Algorithm (MALA) with momentum-based optimization using Adam… ▽ More

    Submitted 5 December, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 16 pages, 5 figures; adapted Theorem 2

  18. arXiv:2312.11629  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    Residual ANODE

    Authors: Ranit Das, Gregor Kasieczka, David Shih

    Abstract: We present R-ANODE, a new method for data-driven, model-agnostic resonant anomaly detection that raises the bar for both performance and interpretability. The key to R-ANODE is to enhance the inductive bias of the anomaly detection task by fitting a normalizing flow directly to the small and unknown signal component, while holding fixed a background model (also a normalizing flow) learned from sid… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 9 pages, 6 figures

  19. arXiv:2312.00123  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    Flow Matching Beyond Kinematics: Generating Jets with Particle-ID and Trajectory Displacement Information

    Authors: Joschka Birk, Erik Buhmann, Cedric Ewen, Gregor Kasieczka, David Shih

    Abstract: We introduce the first generative model trained on the JetClass dataset. Our model generates jets at the constituent level, and it is a permutation-equivariant continuous normalizing flow (CNF) trained with the flow matching technique. It is conditioned on the jet type, so that a single model can be used to generate the ten different jet types of JetClass. For the first time, we also introduce a g… ▽ More

    Submitted 26 March, 2025; v1 submitted 30 November, 2023; originally announced December 2023.

    Journal ref: Phys. Rev. D 111, 052008 (2025)

  20. arXiv:2310.06897  [pdf, other

    hep-ph hep-ex physics.data-an

    Full Phase Space Resonant Anomaly Detection

    Authors: Erik Buhmann, Cedric Ewen, Gregor Kasieczka, Vinicius Mikuni, Benjamin Nachman, David Shih

    Abstract: Physics beyond the Standard Model that is resonant in one or more dimensions has been a longstanding focus of countless searches at colliders and beyond. Recently, many new strategies for resonant anomaly detection have been developed, where sideband information can be used in conjunction with modern machine learning, in order to generate synthetic datasets representing the Standard Model backgrou… ▽ More

    Submitted 9 February, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: 10 pages, 7 figures

    Journal ref: Phys. Rev. D 109, 055015 (2024)

  21. arXiv:2310.00049  [pdf, other

    hep-ph cs.LG

    EPiC-ly Fast Particle Cloud Generation with Flow-Matching and Diffusion

    Authors: Erik Buhmann, Cedric Ewen, Darius A. Faroughy, Tobias Golling, Gregor Kasieczka, Matthew Leigh, Guillaume Quétant, John Andrew Raine, Debajyoti Sengupta, David Shih

    Abstract: Jets at the LHC, typically consisting of a large number of highly correlated particles, are a fascinating laboratory for deep generative modeling. In this paper, we present two novel methods that generate LHC jets as point clouds efficiently and accurately. We introduce \epcjedi, which combines score-matching diffusion models with the Equivariant Point Cloud (EPiC) architecture based on the deep s… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: 21 pages, 8 figures

  22. arXiv:2309.13111  [pdf, other

    hep-ph hep-ex physics.data-an

    Back To The Roots: Tree-Based Algorithms for Weakly Supervised Anomaly Detection

    Authors: Thorben Finke, Marie Hein, Gregor Kasieczka, Michael Krämer, Alexander Mück, Parada Prangchaikul, Tobias Quadfasel, David Shih, Manuel Sommerhalder

    Abstract: Weakly supervised methods have emerged as a powerful tool for model-agnostic anomaly detection at the Large Hadron Collider (LHC). While these methods have shown remarkable performance on specific signatures such as di-jet resonances, their application in a more model-agnostic manner requires dealing with a larger number of potentially noisy input features. In this paper, we show that using booste… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: 11 pages, 9 figures

    Report number: TTK-23-26

  23. Combining Resonant and Tail-based Anomaly Detection

    Authors: Gerrit Bickendorf, Manuel Drees, Gregor Kasieczka, Claudius Krause, David Shih

    Abstract: In many well-motivated models of the electroweak scale, cascade decays of new particles can result in highly boosted hadronic resonances (e.g. $Z/W/h$). This can make these models rich and promising targets for recently developed resonant anomaly detection methods powered by modern machine learning. We demonstrate this using the state-of-the-art CATHODE method applied to supersymmetry scenarios wi… ▽ More

    Submitted 28 May, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 13 pages, 15 figures

  24. arXiv:2309.05704  [pdf, other

    physics.ins-det cs.LG hep-ex hep-ph physics.data-an

    CaloClouds II: Ultra-Fast Geometry-Independent Highly-Granular Calorimeter Simulation

    Authors: Erik Buhmann, Frank Gaede, Gregor Kasieczka, Anatolii Korol, William Korcari, Katja Krüger, Peter McKeown

    Abstract: Fast simulation of the energy depositions in high-granular detectors is needed for future collider experiments with ever-increasing luminosities. Generative machine learning (ML) models have been shown to speed up and augment the traditional simulation chain in physics analysis. However, the majority of previous efforts were limited to models relying on fixed, regular detector readout geometries.… ▽ More

    Submitted 26 February, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: 30 pages, 7 figures, 3 tables, code available at https://github.com/FLC-QU-hep/CaloClouds-2

    Report number: DESY-23-130

  25. arXiv:2307.11157  [pdf, other

    hep-ph hep-ex physics.data-an

    The Interplay of Machine Learning--based Resonant Anomaly Detection Methods

    Authors: Tobias Golling, Gregor Kasieczka, Claudius Krause, Radha Mastandrea, Benjamin Nachman, John Andrew Raine, Debajyoti Sengupta, David Shih, Manuel Sommerhalder

    Abstract: Machine learning--based anomaly detection (AD) methods are promising tools for extending the coverage of searches for physics beyond the Standard Model (BSM). One class of AD methods that has received significant attention is resonant anomaly detection, where the BSM is assumed to be localized in at least one known variable. While there have been many methods proposed to identify such a BSM signal… ▽ More

    Submitted 14 March, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 27 pages, 21 figures. Updated with revisions for journal acceptance

  26. arXiv:2305.04847  [pdf, other

    physics.ins-det cs.LG hep-ex hep-ph physics.data-an

    CaloClouds: Fast Geometry-Independent Highly-Granular Calorimeter Simulation

    Authors: Erik Buhmann, Sascha Diefenbacher, Engin Eren, Frank Gaede, Gregor Kasieczka, Anatolii Korol, William Korcari, Katja Krüger, Peter McKeown

    Abstract: Simulating showers of particles in highly-granular detectors is a key frontier in the application of machine learning to particle physics. Achieving high accuracy and speed with generative machine learning models would enable them to augment traditional simulations and alleviate a major computing constraint. This work achieves a major breakthrough in this task by, for the first time, directly gene… ▽ More

    Submitted 26 February, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: 25 pages, 11 figures

    Report number: DESY-23-061

    Journal ref: JINST 18 (2023) 11, P11025

  27. arXiv:2303.18150  [pdf, other

    physics.ins-det hep-ex hep-ph physics.data-an

    New Angles on Fast Calorimeter Shower Simulation

    Authors: Sascha Diefenbacher, Engin Eren, Frank Gaede, Gregor Kasieczka, Anatolii Korol, Katja Krüger, Peter McKeown, Lennart Rustige

    Abstract: The demands placed on computational resources by the simulation requirements of high energy physics experiments motivate the development of novel simulation tools. Machine learning based generative models offer a solution that is both fast and accurate. In this work we extend the Bounded Information Bottleneck Autoencoder (BIB-AE) architecture, designed for the simulation of particle showers in hi… ▽ More

    Submitted 31 March, 2023; originally announced March 2023.

    Comments: 26 pages, 19 figures

    Report number: DESY-23-039

  28. arXiv:2302.11594  [pdf, other

    physics.ins-det hep-ex hep-ph physics.data-an

    L2LFlows: Generating High-Fidelity 3D Calorimeter Images

    Authors: Sascha Diefenbacher, Engin Eren, Frank Gaede, Gregor Kasieczka, Claudius Krause, Imahn Shekhzadeh, David Shih

    Abstract: We explore the use of normalizing flows to emulate Monte Carlo detector simulations of photon showers in a high-granularity electromagnetic calorimeter prototype for the International Large Detector (ILD). Our proposed method -- which we refer to as "Layer-to-Layer-Flows" (L$2$LFlows) -- is an evolution of the CaloFlow architecture adapted to a higher-dimensional setting (30 layers of… ▽ More

    Submitted 20 October, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: v2: 28 pages, 13 figures; matches version accepted for publication in JINST. Neither SISSA Medialab Srl nor IOP Publishing Ltd is responsible for any errors or omissions in this version of the manuscript or any version derived from it. Published version available via DOI

    Journal ref: 2023 JINST 18 P10017

  29. arXiv:2301.08128  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    EPiC-GAN: Equivariant Point Cloud Generation for Particle Jets

    Authors: Erik Buhmann, Gregor Kasieczka, Jesse Thaler

    Abstract: With the vast data-collecting capabilities of current and future high-energy collider experiments, there is an increasing demand for computationally efficient simulations. Generative machine learning models enable fast event generation, yet so far these approaches are largely constrained to fixed data structures and rigid detector geometries. In this paper, we introduce EPiC-GAN - equivariant poin… ▽ More

    Submitted 12 July, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: 18 pages, 8 figures, 3 tables, code available at: https://github.com/uhh-pd-ml/EPiC-GAN

    Report number: MIT-CTP 5519

    Journal ref: SciPost Phys. 15, 130 (2023)

  30. arXiv:2212.00046  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    Feature Selection with Distance Correlation

    Authors: Ranit Das, Gregor Kasieczka, David Shih

    Abstract: Choosing which properties of the data to use as input to multivariate decision algorithms -- a.k.a. feature selection -- is an important step in solving any problem with machine learning. While there is a clear trend towards training sophisticated deep networks on large numbers of relatively unprocessed inputs (so-called automated feature engineering), for many tasks in physics, sets of theoretica… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: 14 pages, 8 figures, 3 tables

  31. arXiv:2210.14924  [pdf, other

    hep-ph hep-ex physics.data-an

    Resonant anomaly detection without background sculpting

    Authors: Anna Hallin, Gregor Kasieczka, Tobias Quadfasel, David Shih, Manuel Sommerhalder

    Abstract: We introduce a new technique named Latent CATHODE (LaCATHODE) for performing "enhanced bump hunts", a type of resonant anomaly search that combines conventional one-dimensional bump hunts with a model-agnostic anomaly score in an auxiliary feature space where potential signals could also be localized. The main advantage of LaCATHODE over existing methods is that it provides an anomaly score that i… ▽ More

    Submitted 10 July, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: 11 pages, 8 figures; v2 (published version): referencing code and minor style updates

    Journal ref: Phys. Rev. D 107, 114012 (2023)

  32. arXiv:2209.06225  [pdf, other

    hep-ph hep-ex physics.data-an

    Anomaly Detection under Coordinate Transformations

    Authors: Gregor Kasieczka, Radha Mastandrea, Vinicius Mikuni, Benjamin Nachman, Mariel Pettee, David Shih

    Abstract: There is a growing need for machine learning-based anomaly detection strategies to broaden the search for Beyond-the-Standard-Model (BSM) physics at the Large Hadron Collider (LHC) and elsewhere. The first step of any anomaly detection approach is to specify observables and then use them to decide on a set of anomalous events. One common choice is to select events that have low probability density… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 10 pages, 6 figures

  33. arXiv:2203.08806  [pdf, other

    hep-ph cs.LG hep-ex physics.comp-ph physics.ins-det

    New directions for surrogate models and differentiable programming for High Energy Physics detector simulation

    Authors: Andreas Adelmann, Walter Hopkins, Evangelos Kourlitis, Michael Kagan, Gregor Kasieczka, Claudius Krause, David Shih, Vinicius Mikuni, Benjamin Nachman, Kevin Pedro, Daniel Winklehner

    Abstract: The computational cost for high energy physics detector simulation in future experimental facilities is going to exceed the current available resources. To overcome this challenge, new ideas on surrogate models using machine learning methods are being explored to replace computationally expensive components. Additionally, differentiable programming has been proposed as a complementary approach, pr… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: contribution to Snowmass 2021

    Report number: FERMILAB-CONF-22-199-SCD

  34. Machine Learning and LHC Event Generation

    Authors: Anja Butter, Tilman Plehn, Steffen Schumann, Simon Badger, Sascha Caron, Kyle Cranmer, Francesco Armando Di Bello, Etienne Dreyer, Stefano Forte, Sanmay Ganguly, Dorival Gonçalves, Eilam Gross, Theo Heimel, Gudrun Heinrich, Lukas Heinrich, Alexander Held, Stefan Höche, Jessica N. Howard, Philip Ilten, Joshua Isaacson, Timo Janßen, Stephen Jones, Marumi Kado, Michael Kagan, Gregor Kasieczka , et al. (26 additional authors not shown)

    Abstract: First-principle simulations are at the heart of the high-energy physics research program. They link the vast data output of multi-purpose detectors with fundamental theory predictions and interpretation. This review illustrates a wide range of applications of modern machine learning to event generation and simulation-based inference, including conceptional developments driven by the specific requi… ▽ More

    Submitted 28 December, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Review article based on a Snowmass 2021 contribution

    Journal ref: SciPost Phys. 14, 079 (2023)

  35. arXiv:2202.09375  [pdf, other

    hep-ph hep-ex physics.data-an

    Ephemeral Learning -- Augmenting Triggers with Online-Trained Normalizing Flows

    Authors: Anja Butter, Sascha Diefenbacher, Gregor Kasieczka, Benjamin Nachman, Tilman Plehn, David Shih, Ramon Winterhalder

    Abstract: The large data rates at the LHC require an online trigger system to select relevant collisions. Rather than compressing individual events, we propose to compress an entire data set at once. We use a normalizing flow as a deep generative model to learn the probability density of the data online. The events are then represented by the generative neural network and can be inspected offline for anomal… ▽ More

    Submitted 28 June, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: 17 pages, 9 figures, minor changes to text, addressed referee comments

    Report number: CP3-22-10

    Journal ref: SciPost Phys. 13, 087 (2022)

  36. Calomplification -- The Power of Generative Calorimeter Models

    Authors: Sebastian Bieringer, Anja Butter, Sascha Diefenbacher, Engin Eren, Frank Gaede, Daniel Hundhausen, Gregor Kasieczka, Benjamin Nachman, Tilman Plehn, Mathias Trabs

    Abstract: Motivated by the high computational costs of classical simulations, machine-learned generative models can be extremely useful in particle physics and elsewhere. They become especially attractive when surrogate models can efficiently learn the underlying distribution, such that a generated sample outperforms a training sample of limited size. This kind of GANplification has been observed for simple… ▽ More

    Submitted 25 January, 2023; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: 17 pages, 10 figures

    Report number: DESY-22-031

    Journal ref: JINST 17 P09028 (2022)

  37. arXiv:2112.09709  [pdf, other

    physics.ins-det hep-ex hep-ph physics.data-an

    Hadrons, Better, Faster, Stronger

    Authors: Erik Buhmann, Sascha Diefenbacher, Engin Eren, Frank Gaede, Daniel Hundhausen, Gregor Kasieczka, William Korcari, Katja Krüger, Peter McKeown, Lennart Rustige

    Abstract: Motivated by the computational limitations of simulating interactions of particles in highly-granular detectors, there exists a concerted effort to build fast and exact machine-learning-based shower simulators. This work reports progress on two important fronts. First, the previously investigated WGAN and BIB-AE generative models are improved and successful learning of hadronic showers initiated b… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 20 pages, 8 figures

  38. arXiv:2112.03769  [pdf, other

    hep-ph hep-ex physics.data-an stat.ML

    Machine Learning in the Search for New Fundamental Physics

    Authors: Georgia Karagiorgi, Gregor Kasieczka, Scott Kravitz, Benjamin Nachman, David Shih

    Abstract: Machine learning plays a crucial role in enhancing and accelerating the search for new fundamental physics. We review the state of machine learning methods and applications for new physics searches in the context of terrestrial high energy physics experiments, including the Large Hadron Collider, rare event searches, and neutrino experiments. While machine learning has a long history in these fiel… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Preprint of article submitted to Nature Reviews Physics, 19 pages, 1 figure

  39. arXiv:2109.00546  [pdf, other

    hep-ph hep-ex physics.data-an

    Classifying Anomalies THrough Outer Density Estimation (CATHODE)

    Authors: Anna Hallin, Joshua Isaacson, Gregor Kasieczka, Claudius Krause, Benjamin Nachman, Tobias Quadfasel, Matthias Schlaffer, David Shih, Manuel Sommerhalder

    Abstract: We propose a new model-agnostic search strategy for physics beyond the standard model (BSM) at the LHC, based on a novel application of neural density estimation to anomaly detection. Our approach, which we call Classifying Anomalies THrough Outer Density Estimation (CATHODE), assumes the BSM signal is localized in a signal region (defined e.g. using invariant mass). By training a conditional dens… ▽ More

    Submitted 11 September, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: 17 pages, 12 figures; v2: minor updates; v3 (published version): added study of background sculpting and minor fixes

    Report number: EFI-20-5, FERMILAB-PUB-21-389-T

    Journal ref: Phys. Rev. D 106, 055006 (2022)

  40. Symmetries, Safety, and Self-Supervision

    Authors: Barry M. Dillon, Gregor Kasieczka, Hans Olischlager, Tilman Plehn, Peter Sorrenson, Lorenz Vogel

    Abstract: Collider searches face the challenge of defining a representation of high-dimensional data such that physical symmetries are manifest, the discriminating features are retained, and the choice of representation is new-physics agnostic. We introduce JetCLR to solve the mapping from low-level data to optimized observables though self-supervised contrastive learning. As an example, we construct a data… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Journal ref: SciPost Phys. 12, 188 (2022)

  41. Unsupervised Hadronic SUEP at the LHC

    Authors: Jared Barron, David Curtin, Gregor Kasieczka, Tilman Plehn, Aris Spourdalakis

    Abstract: Confining dark sectors with pseudo-conformal dynamics produce SUEP, or Soft Unclustered Energy Patterns, at colliders: isotropic dark hadrons with soft and democratic energies. We target the experimental nightmare scenario, SUEPs in exotic Higgs decays, where all dark hadrons decay promptly to SM hadrons. First, we identify three promising observables, the charged particle multiplicity, the event… ▽ More

    Submitted 4 November, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: 10 pages, 7 figures + references and appendix v2: Added graph to appendix and fixed typos

  42. arXiv:2107.02821  [pdf, other

    stat.ML cs.LG hep-ex hep-ph

    New Methods and Datasets for Group Anomaly Detection From Fundamental Physics

    Authors: Gregor Kasieczka, Benjamin Nachman, David Shih

    Abstract: The identification of anomalous overdensities in data - group or collective anomaly detection - is a rich problem with a large number of real world applications. However, it has received relatively little attention in the broader ML community, as compared to point anomalies or other types of single instance outliers. One reason for this is the lack of powerful benchmark datasets. In this paper, we… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: Accepted for ANDEA (Anomaly and Novelty Detection, Explanation and Accommodation) Workshop at KDD 2021

  43. arXiv:2107.00656  [pdf, other

    cs.LG astro-ph.IM hep-ph nucl-th physics.data-an stat.ML

    Shared Data and Algorithms for Deep Learning in Fundamental Physics

    Authors: Lisa Benato, Erik Buhmann, Martin Erdmann, Peter Fackeldey, Jonas Glombitza, Nikolai Hartmann, Gregor Kasieczka, William Korcari, Thomas Kuhr, Jan Steinheimer, Horst Stöcker, Tilman Plehn, Kai Zhou

    Abstract: We introduce a Python package that provides simply and unified access to a collection of datasets from fundamental physics research - including particle physics, astroparticle physics, and hadron- and nuclear physics - for supervised machine learning studies. The datasets contain hadronic top quarks, cosmic-ray induced air showers, phase transitions in hadronic matter, and generator-level historie… ▽ More

    Submitted 24 March, 2022; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: 14 pages, 3 figures, 5 tables - Version accepted by Computing and Software for Big Science

    Journal ref: Comput Softw Big Sci 6, 9 (2022)

  44. arXiv:2102.12491  [pdf, other

    physics.ins-det hep-ex hep-ph physics.data-an

    Decoding Photons: Physics in the Latent Space of a BIB-AE Generative Network

    Authors: Erik Buhmann, Sascha Diefenbacher, Engin Eren, Frank Gaede, Gregor Kasieczka, Anatolii Korol, Katja Krüger

    Abstract: Given the increasing data collection capabilities and limited computing resources of future collider experiments, interest in using generative neural networks for the fast simulation of collider events is growing. In our previous study, the Bounded Information Bottleneck Autoencoder (BIB-AE) architecture for generating photon showers in a high-granularity calorimeter showed a high accuracy modelin… ▽ More

    Submitted 29 June, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: 13 pages, 9 figures, 2 tables, accepted by vCHEP 2021

    Report number: DESY 21-029

    Journal ref: EPJ Web of Conferences 251, 03003 (2021)

  45. arXiv:2101.08320  [pdf, other

    hep-ph hep-ex physics.data-an

    The LHC Olympics 2020: A Community Challenge for Anomaly Detection in High Energy Physics

    Authors: Gregor Kasieczka, Benjamin Nachman, David Shih, Oz Amram, Anders Andreassen, Kees Benkendorfer, Blaz Bortolato, Gustaaf Brooijmans, Florencia Canelli, Jack H. Collins, Biwei Dai, Felipe F. De Freitas, Barry M. Dillon, Ioan-Mihail Dinu, Zhongtian Dong, Julien Donini, Javier Duarte, D. A. Faroughy, Julia Gonski, Philip Harris, Alan Kahn, Jernej F. Kamenik, Charanjit K. Khosa, Patrick Komiske, Luc Le Pottier , et al. (22 additional authors not shown)

    Abstract: A new paradigm for data-driven, model-agnostic new physics searches at colliders is emerging, and aims to leverage recent breakthroughs in anomaly detection and machine learning. In order to develop and benchmark new anomaly detection methods within this framework, it is essential to have standard datasets. To this end, we have created the LHC Olympics 2020, a community challenge accompanied by a… ▽ More

    Submitted 20 January, 2021; originally announced January 2021.

    Comments: 108 pages, 53 figures, 3 tables

  46. arXiv:2012.11944  [pdf, other

    hep-ph

    How to GAN Higher Jet Resolution

    Authors: Pierre Baldi, Lukas Blecher, Anja Butter, Julian Collado, Jessica N. Howard, Fabian Keilbach, Tilman Plehn, Gregor Kasieczka, Daniel Whiteson

    Abstract: QCD-jets at the LHC are described by simple physics principles. We show how super-resolution generative networks can learn the underlying structures and use them to improve the resolution of jet images. We test this approach on massless QCD-jets and on fat top-jets and find that the network reproduces their main features even without training on pure samples. In addition, we show how a slim networ… ▽ More

    Submitted 2 December, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: 25 pages, 11 figures; implemented SciPost reviewer comments, clarified definitions and expanded discussion in high-level observable benchmarking subsection (section 3.3 and Fig. 7)

  47. arXiv:2009.03796  [pdf, other

    hep-ph hep-ex physics.data-an physics.ins-det stat.ML

    DCTRGAN: Improving the Precision of Generative Models with Reweighting

    Authors: Sascha Diefenbacher, Engin Eren, Gregor Kasieczka, Anatolii Korol, Benjamin Nachman, David Shih

    Abstract: Significant advances in deep learning have led to more widely used and precise neural network-based generative models such as Generative Adversarial Networks (GANs). We introduce a post-hoc correction to deep generative models to further improve their fidelity, based on the Deep neural networks using the Classification for Tuning and Reweighting (DCTR) protocol. The correction takes the form of a… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: 14 pages, 8 figures

  48. arXiv:2008.06545  [pdf, other

    hep-ph hep-ex physics.data-an stat.ML

    GANplifying Event Samples

    Authors: Anja Butter, Sascha Diefenbacher, Gregor Kasieczka, Benjamin Nachman, Tilman Plehn

    Abstract: A critical question concerning generative networks applied to event generation in particle physics is if the generated events add statistical precision beyond the training sample. We show for a simple example with increasing dimensionality how generative networks indeed amplify the training statistics. We quantify their impact through an amplification factor or equivalent numbers of sampled events… ▽ More

    Submitted 25 March, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

    Comments: 15 pages, 7 figures, fixed two equations, extended acknowledgments, addressed referee comments, improved figure readability

    Journal ref: SciPost Phys. 10, 139 (2021)

  49. arXiv:2007.14400  [pdf, other

    hep-ph hep-ex physics.data-an

    ABCDisCo: Automating the ABCD Method with Machine Learning

    Authors: Gregor Kasieczka, Benjamin Nachman, Matthew D. Schwartz, David Shih

    Abstract: The ABCD method is one of the most widely used data-driven background estimation techniques in high energy physics. Cuts on two statistically-independent classifiers separate signal and background into four regions, so that background in the signal region can be estimated simply using the other three control regions. Typically, the independent classifiers are chosen "by hand" to be intuitive and p… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: 37 pages, 12 figures

    Journal ref: Phys. Rev. D 103, 035021 (2021)

  50. Towards Machine Learning Analytics for Jet Substructure

    Authors: Gregor Kasieczka, Simone Marzani, Gregory Soyez, Giovanni Stagnitto

    Abstract: The past few years have seen a rapid development of machine-learning algorithms. While surely augmenting performance, these complex tools are often treated as black-boxes and may impair our understanding of the physical processes under study. The aim of this paper is to move a first step into the direction of applying expert-knowledge in particle physics to calculate the optimal decision function… ▽ More

    Submitted 22 September, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: 32 pages, 9 figures; v2 brings extra clarifications, as accepted by JHEP

    Report number: TIF-UNIMI-2020-12

    Journal ref: JHEP 09 (2020) 195