-
CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation
Authors:
Claudius Krause,
Michele Faucci Giannelli,
Gregor Kasieczka,
Benjamin Nachman,
Dalila Salamani,
David Shih,
Anna Zaborowska,
Oz Amram,
Kerstin Borras,
Matthew R. Buckley,
Erik Buhmann,
Thorsten Buss,
Renato Paulo Da Costa Cardoso,
Anthony L. Caterini,
Nadezda Chernyavskaya,
Federico A. G. Corchia,
Jesse C. Cresswell,
Sascha Diefenbacher,
Etienne Dreyer,
Vijay Ekambaram,
Engin Eren,
Florian Ernst,
Luigi Favaro,
Matteo Franchini,
Frank Gaede,
et al. (44 additional authors not shown)
Abstract:
We present the results of the "Fast Calorimeter Simulation Challenge 2022" - the CaloChallenge. We study state-of-the-art generative models on four calorimeter shower datasets of increasing dimensionality, ranging from a few hundred voxels to a few tens of thousands of voxels. The 31 individual submissions span a wide range of currently popular generative architectures, including Variational AutoEncoders (VAEs), Generative Adversarial Networks (GANs), Normalizing Flows, Diffusion models, and models based on Conditional Flow Matching. We compare all submissions in terms of the quality of the generated calorimeter showers, as well as shower generation time and model size. To assess the quality, we use a broad range of metrics, including differences in 1-dimensional histograms of observables, KPD/FPD scores, AUCs of binary classifiers, and the log-posterior of a multiclass classifier. The results of the CaloChallenge provide the most complete and comprehensive survey of cutting-edge approaches to calorimeter fast simulation to date. In addition, our work provides a uniquely detailed perspective on the important problem of how to evaluate generative models. As such, the results presented here should be applicable to other domains that use generative AI and require fast and faithful generation of samples in a large phase space.
Submitted 28 October, 2024;
originally announced October 2024.
-
Multiple testing for signal-agnostic searches of new physics with machine learning
Authors:
Gaia Grosso,
Marco Letizia
Abstract:
In this work, we address the question of how to enhance signal-agnostic searches by leveraging multiple testing strategies. Specifically, we consider hypothesis tests relying on machine learning, where model selection can introduce a bias towards specific families of new physics signals. We show that it is beneficial to combine different tests, characterised by distinct choices of hyperparameters, and that performance comparable to that of the best available test is generally achieved while providing a more uniform response to various types of anomalies. Focusing on the New Physics Learning Machine, a methodology to perform a signal-agnostic likelihood-ratio test, we explore a number of approaches to multiple testing, such as combining p-values and aggregating test statistics.
Submitted 22 August, 2024;
originally announced August 2024.
-
CaloMan: Fast generation of calorimeter showers with density estimation on learned manifolds
Authors:
Jesse C. Cresswell,
Brendan Leigh Ross,
Gabriel Loaiza-Ganem,
Humberto Reyes-Gonzalez,
Marco Letizia,
Anthony L. Caterini
Abstract:
Precision measurements and new physics searches at the Large Hadron Collider require efficient simulations of particle propagation and interactions within the detectors. The most computationally expensive simulations involve calorimeter showers. Advances in deep generative modelling - particularly in the realm of high-dimensional data - have opened the possibility of generating realistic calorimeter showers orders of magnitude more quickly than physics-based simulation. However, the high-dimensional representation of showers belies the relative simplicity and structure of the underlying physical laws. This phenomenon is yet another example of the manifold hypothesis from machine learning, which states that high-dimensional data is supported on low-dimensional manifolds. We thus propose modelling calorimeter showers first by learning their manifold structure, and then estimating the density of data across this manifold. Learning manifold structure reduces the dimensionality of the data, which enables fast training and generation when compared with competing methods.
Submitted 23 November, 2022;
originally announced November 2022.
-
3D printed microchannels for sub-nL NMR spectroscopy
Authors:
E. Montinaro,
M. Grisi,
M. C. Letizia,
L. Pethö,
M. A. M. Gijs,
R. Guidetti,
J. Michler,
J. Brugger,
G. Boero
Abstract:
Nuclear magnetic resonance (NMR) experiments on subnanoliter (sub-nL) volumes are hindered by the limited sensitivity of the detector and the difficulties in positioning and holding such small samples in proximity to the detector. Here, we report on NMR experiments on liquids and biological entities immersed in liquids, with volumes down to 100 pL. These measurements are enabled by the fabrication of high spatial resolution 3D printed microfluidic structures, specifically conceived to guide and confine sub-nL samples in the most sensitive sub-nL volume of a single-chip integrated NMR probe. The microfluidic structures are fabricated using two-photon polymerization 3D printing. This technique has a resolution better than 1 $μ$m$^3$ and allows rapid fabrication of complex microfluidic structures tailored to position, hold, and feed biological samples, with a design that maximizes the NMR signal amplitude and minimizes the static magnetic field inhomogeneities. The NMR probe consists of an electronic transceiver and a 150 $μ$m diameter excitation/detection microcoil, co-integrated on a single silicon chip of about 1 mm$^2$. To demonstrate the potential of this approach, we report NMR experiments on sub-nL intact biological entities in liquid media, specifically ova of the tardigrade Richtersius coronifer and sections of Caenorhabditis elegans nematodes. We show a sensitivity of 2.5 $\times$ 10$^{13}$ spins/Hz$^{1/2}$ on $^1$H nuclei at 7 T, sufficient to detect highly concentrated endogenous compounds in active volumes down to 100 pL in a measurement time of 3 hours. Spectral resolutions of 0.01 ppm in liquids and of 0.1 ppm in the investigated biological entities are demonstrated. The obtained results indicate a promising route for NMR studies at the single-unit level of important sub-nL biological entities, such as living microscopic organisms and eggs of several mammals, including humans.
Submitted 18 July, 2017;
originally announced July 2017.