Skip to main content

Showing 1–5 of 5 results for author: Birk, J

Searching in archive hep-ph. Search in all archives.
.
  1. arXiv:2503.19165  [pdf, other

    hep-ex hep-ph

    Reconstructing hadronically decaying tau leptons with a jet foundation model

    Authors: Laurits Tani, Joosep Pata, Joschka Birk

    Abstract: The limited availability and accuracy of simulated data has motivated the use of foundation models in high energy physics, with the idea to first train a task-agnostic model on large and potentially unlabeled datasets. This enables the subsequent fine-tuning of the learned representation for specific downstream tasks, potentially requiring much smaller dataset sizes to reach the performance of mod… ▽ More

    Submitted 23 May, 2025; v1 submitted 24 March, 2025; originally announced March 2025.

    Comments: Submission to SciPost

  2. arXiv:2501.05534  [pdf, other

    hep-ph cs.LG hep-ex physics.ins-det

    OmniJet-${α_{ C}}$: Learning point cloud calorimeter simulations using generative transformers

    Authors: Joschka Birk, Frank Gaede, Anna Hallin, Gregor Kasieczka, Martina Mozzanica, Henning Rose

    Abstract: We show the first use of generative transformers for generating calorimeter showers as point clouds in a high-granularity calorimeter. Using the tokenizer and generative part of the OmniJet-$α$ model, we represent the hits in the detector as sequences of integers. This model allows variable-length sequences, which means that it supports realistic shower development and does not need to be conditio… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

  3. arXiv:2412.10504  [pdf, other

    hep-ph cs.LG hep-ex stat.ML

    Aspen Open Jets: Unlocking LHC Data for Foundation Models in Particle Physics

    Authors: Oz Amram, Luca Anzalone, Joschka Birk, Darius A. Faroughy, Anna Hallin, Gregor Kasieczka, Michael Krämer, Ian Pang, Humberto Reyes-Gonzalez, David Shih

    Abstract: Foundation models are deep learning models pre-trained on large amounts of data which are capable of generalizing to multiple datasets and/or downstream tasks. This work demonstrates how data collected by the CMS experiment at the Large Hadron Collider can be useful in pre-training foundation models for HEP. Specifically, we introduce the AspenOpenJets dataset, consisting of approximately 180M hig… ▽ More

    Submitted 13 December, 2024; originally announced December 2024.

    Comments: 11 pages, 4 figures, the AspenOpenJets dataset can be found at http://doi.org/10.25592/uhhfdm.16505

  4. arXiv:2403.05618  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    OmniJet-$α$: The first cross-task foundation model for particle physics

    Authors: Joschka Birk, Anna Hallin, Gregor Kasieczka

    Abstract: Foundation models are multi-dataset and multi-task machine learning methods that once pre-trained can be fine-tuned for a large variety of downstream applications. The successful development of such general-purpose models for physics data would be a major breakthrough as they could improve the achievable physics performance while at the same time drastically reduce the required amount of training… ▽ More

    Submitted 7 September, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Journal ref: Mach. Learn.: Sci. Technol. 5 035031 (2024)

  5. arXiv:2312.00123  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    Flow Matching Beyond Kinematics: Generating Jets with Particle-ID and Trajectory Displacement Information

    Authors: Joschka Birk, Erik Buhmann, Cedric Ewen, Gregor Kasieczka, David Shih

    Abstract: We introduce the first generative model trained on the JetClass dataset. Our model generates jets at the constituent level, and it is a permutation-equivariant continuous normalizing flow (CNF) trained with the flow matching technique. It is conditioned on the jet type, so that a single model can be used to generate the ten different jet types of JetClass. For the first time, we also introduce a g… ▽ More

    Submitted 26 March, 2025; v1 submitted 30 November, 2023; originally announced December 2023.

    Journal ref: Phys. Rev. D 111, 052008 (2025)