
Showing 1–7 of 7 results for author: Pang, I

  1. arXiv:2412.10504

    hep-ph cs.LG hep-ex stat.ML

    Aspen Open Jets: Unlocking LHC Data for Foundation Models in Particle Physics

    Authors: Oz Amram, Luca Anzalone, Joschka Birk, Darius A. Faroughy, Anna Hallin, Gregor Kasieczka, Michael Krämer, Ian Pang, Humberto Reyes-Gonzalez, David Shih

    Abstract: Foundation models are deep learning models pre-trained on large amounts of data which are capable of generalizing to multiple datasets and/or downstream tasks. This work demonstrates how data collected by the CMS experiment at the Large Hadron Collider can be useful in pre-training foundation models for HEP. Specifically, we introduce the AspenOpenJets dataset, consisting of approximately 180M hig…

    Submitted 13 December, 2024; originally announced December 2024.

    Comments: 11 pages, 4 figures, the AspenOpenJets dataset can be found at http://doi.org/10.25592/uhhfdm.16505

  2. arXiv:2410.21611

    physics.ins-det cs.LG hep-ex hep-ph

    CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation

    Authors: Claudius Krause, Michele Faucci Giannelli, Gregor Kasieczka, Benjamin Nachman, Dalila Salamani, David Shih, Anna Zaborowska, Oz Amram, Kerstin Borras, Matthew R. Buckley, Erik Buhmann, Thorsten Buss, Renato Paulo Da Costa Cardoso, Anthony L. Caterini, Nadezda Chernyavskaya, Federico A. G. Corchia, Jesse C. Cresswell, Sascha Diefenbacher, Etienne Dreyer, Vijay Ekambaram, Engin Eren, Florian Ernst, Luigi Favaro, Matteo Franchini, Frank Gaede, et al. (44 additional authors not shown)

    Abstract: We present the results of the "Fast Calorimeter Simulation Challenge 2022" - the CaloChallenge. We study state-of-the-art generative models on four calorimeter shower datasets of increasing dimensionality, ranging from a few hundred voxels to a few tens of thousands of voxels. The 31 individual submissions span a wide range of current popular generative architectures, including Variational AutoEncoder…

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: 204 pages, 100+ figures, 30+ tables

    Report number: HEPHY-ML-24-05, FERMILAB-PUB-24-0728-CMS, TTK-24-43

  3. arXiv:2404.18992

    hep-ph hep-ex physics.data-an physics.ins-det stat.ML

    Unifying Simulation and Inference with Normalizing Flows

    Authors: Haoxing Du, Claudius Krause, Vinicius Mikuni, Benjamin Nachman, Ian Pang, David Shih

    Abstract: There have been many applications of deep neural networks to detector calibrations and a growing number of studies that propose deep generative models as automated fast detector simulators. We show that these two tasks can be unified by using maximum likelihood estimation (MLE) from conditional generative models for energy regression. Unlike direct regression techniques, the MLE approach is prior-…

    Submitted 11 April, 2025; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: 13 pages, 7 figures; v3: matches published version

    Report number: HEPHY-ML-24-01

    Journal ref: Phys. Rev. D 111, 076004 (2025)

  4. arXiv:2312.11618

    hep-ph astro-ph.IM hep-ex physics.data-an physics.ins-det

    Anomaly detection with flow-based fast calorimeter simulators

    Authors: Claudius Krause, Benjamin Nachman, Ian Pang, David Shih, Yunhao Zhu

    Abstract: Recently, several normalizing flow-based deep generative models have been proposed to accelerate the simulation of calorimeter showers. Using CaloFlow as an example, we show that these models can simultaneously perform unsupervised anomaly detection with no additional training cost. As a demonstration, we consider electromagnetic showers initiated by one (background) or multiple (signal) photons.…

    Submitted 29 August, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 14 pages, 8 figures

    Report number: HEPHY-ML-23-03

  5. arXiv:2308.11700

    physics.ins-det cs.LG hep-ex hep-ph physics.data-an

    Calorimeter shower superresolution

    Authors: Ian Pang, John Andrew Raine, David Shih

    Abstract: Calorimeter shower simulation is a major bottleneck in the Large Hadron Collider computational pipeline. There have been recent efforts to employ deep-generative surrogate models to overcome this challenge. However, many of the best-performing models have training and generation times that do not scale well to high-dimensional calorimeter showers. In this work, we introduce SuperCalo, a flow-based sup…

    Submitted 15 May, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: 16 pages, 13 figures, v3: title changed, matches published version

    Journal ref: Phys. Rev. D 109, 092009 (2024)

  6. arXiv:2305.11934

    physics.ins-det cs.LG hep-ex hep-ph physics.data-an

    Inductive Simulation of Calorimeter Showers with Normalizing Flows

    Authors: Matthew R. Buckley, Claudius Krause, Ian Pang, David Shih

    Abstract: Simulating particle detector response is the single most expensive step in the Large Hadron Collider computational pipeline. Recently it was shown that normalizing flows can accelerate this process while achieving unprecedented levels of accuracy, but scaling this approach up to higher resolutions relevant for future detector upgrades leads to prohibitive memory constraints. To overcome this probl…

    Submitted 13 February, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: 19 pages, 15 figures; v2: title changed, matches published version

    Journal ref: Phys. Rev. D 109, 033006 (2024)

  7. arXiv:2210.14245

    physics.ins-det cs.LG hep-ex hep-ph physics.data-an

    CaloFlow for CaloChallenge Dataset 1

    Authors: Claudius Krause, Ian Pang, David Shih

    Abstract: CaloFlow is a new and promising approach to fast calorimeter simulation based on normalizing flows. Applying CaloFlow to the photon and charged pion Geant4 showers of Dataset 1 of the Fast Calorimeter Simulation Challenge 2022, we show how it can produce high-fidelity samples with a sampling time that is several orders of magnitude faster than Geant4. We demonstrate the fidelity of the samples usi…

    Submitted 15 May, 2024; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: 36 pages, 21 figures, v3: matches published version

    Journal ref: SciPost Phys. 16, 126 (2024)