-
Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning
Authors:
Cheng Chen,
Olmo Cerri,
Thong Q. Nguyen,
Jean-Roch Vlimant,
Maurizio Pierini
Abstract:
We present a fast simulation application based on a Deep Neural Network, designed to create large analysis-specific datasets. Taking as an example the generation of W+jet events produced in $\sqrt{s}=13$ TeV proton-proton collisions, we train a neural network to model detector resolution effects as a transfer function acting on an analysis-specific set of relevant features, computed at generation level, i.e., in the absence of detector effects. Based on this model, we propose a novel fast-simulation workflow that starts from a large amount of generator-level events to deliver large analysis-specific samples. The adoption of this approach would result in about an order-of-magnitude reduction in computing and storage requirements for the collision simulation workflow. This strategy could help the high energy physics community to face the computing challenges of the future High-Luminosity LHC.
Submitted 5 October, 2020;
originally announced October 2020.
-
Adversarially Learned Anomaly Detection on CMS Open Data: re-discovering the top quark
Authors:
Oliver Knapp,
Guenther Dissertori,
Olmo Cerri,
Thong Q. Nguyen,
Jean-Roch Vlimant,
Maurizio Pierini
Abstract:
We apply an Adversarially Learned Anomaly Detection (ALAD) algorithm to the problem of detecting new physics processes in proton-proton collisions at the Large Hadron Collider. Anomaly detection based on ALAD matches performances reached by Variational Autoencoders, with a substantial improvement in some cases. Training the ALAD algorithm on 4.4 fb$^{-1}$ of 8 TeV CMS Open Data, we show how a data-driven anomaly detection and characterization would work in real life, re-discovering the top quark by identifying the main features of the $t\bar{t}$ experimental signature at the LHC.
Submitted 3 October, 2020; v1 submitted 4 May, 2020;
originally announced May 2020.
-
Interaction networks for the identification of boosted $H\to b\overline{b}$ decays
Authors:
Eric A. Moreno,
Thong Q. Nguyen,
Jean-Roch Vlimant,
Olmo Cerri,
Harvey B. Newman,
Avikar Periwal,
Maria Spiropulu,
Javier M. Duarte,
Maurizio Pierini
Abstract:
We develop an algorithm based on an interaction network to identify high-transverse-momentum Higgs bosons decaying to bottom quark-antiquark pairs and distinguish them from ordinary jets that reflect the configurations of quarks and gluons at short distances. The algorithm's inputs are features of the reconstructed charged particles in a jet and the secondary vertices associated with them. Describing the jet shower as a combination of particle-to-particle and particle-to-vertex interactions, the model is trained to learn a jet representation on which the classification problem is optimized. The algorithm is trained on simulated samples of realistic LHC collisions, released by the CMS Collaboration on the CERN Open Data Portal. The interaction network achieves a drastic improvement in the identification performance with respect to state-of-the-art algorithms.
Submitted 28 July, 2020; v1 submitted 26 September, 2019;
originally announced September 2019.
-
JEDI-net: a jet identification algorithm based on interaction networks
Authors:
Eric A. Moreno,
Olmo Cerri,
Javier M. Duarte,
Harvey B. Newman,
Thong Q. Nguyen,
Avikar Periwal,
Maurizio Pierini,
Aidana Serikova,
Maria Spiropulu,
Jean-Roch Vlimant
Abstract:
We investigate the performance of a jet identification algorithm based on interaction networks (JEDI-net) to identify all-hadronic decays of high-momentum heavy particles produced at the LHC and distinguish them from ordinary jets originating from the hadronization of quarks and gluons. The jet dynamics are described as a set of one-to-one interactions between the jet constituents. Based on a representation learned from these interactions, the jet is associated with one of the considered categories. Unlike other architectures, the JEDI-net models achieve their performance without special handling of the sparse input jet representation, extensive pre-processing, particle ordering, or specific assumptions regarding the underlying detector geometry. The presented models give better results with fewer model parameters, offering interesting prospects for LHC applications.
Submitted 27 January, 2020; v1 submitted 14 August, 2019;
originally announced August 2019.
-
Variational Autoencoders for New Physics Mining at the Large Hadron Collider
Authors:
Olmo Cerri,
Thong Q. Nguyen,
Maurizio Pierini,
Maria Spiropulu,
Jean-Roch Vlimant
Abstract:
Using variational autoencoders trained on known physics processes, we develop a one-sided threshold test to isolate previously unseen processes as outlier events. Since the autoencoder training does not depend on any specific new physics signature, the proposed procedure doesn't make specific assumptions on the nature of new physics. An event selection based on this algorithm would be complementary to classic LHC searches, typically based on model-dependent hypothesis testing. Such an algorithm would deliver a list of anomalous events that the experimental collaborations could further scrutinize and even release as a catalog, similarly to what is typically done in other scientific domains. Event topologies repeating in this dataset could inspire new-physics model building and new experimental searches. Running in the trigger system of the LHC experiments, such an application could identify anomalous events that would be otherwise lost, extending the scientific reach of the LHC.
Submitted 13 June, 2019; v1 submitted 26 November, 2018;
originally announced November 2018.
-
Pileup mitigation at the Large Hadron Collider with Graph Neural Networks
Authors:
Jesus Arjona Martinez,
Olmo Cerri,
Maurizio Pierini,
Maria Spiropulu,
Jean-Roch Vlimant
Abstract:
At the Large Hadron Collider, the high transverse-momentum events studied by experimental collaborations occur in coincidence with parasitic low transverse-momentum collisions, usually referred to as pileup. Pileup mitigation is a key ingredient of the online and offline event reconstruction as pileup affects the reconstruction accuracy of many physics observables. We present a classifier based on Graph Neural Networks, trained to retain particles coming from high-transverse-momentum collisions, while rejecting those coming from pileup collisions. This model is designed as a refinement of the PUPPI algorithm, employed in many LHC data analyses since 2015. Thanks to an extended basis of input information and the learning capabilities of the considered network architecture, we show an improvement in pileup-rejection performances with respect to state-of-the-art solutions.
Submitted 13 June, 2019; v1 submitted 18 October, 2018;
originally announced October 2018.
-
Identification of Long-lived Charged Particles using Time-Of-Flight Systems at the Upgraded LHC detectors
Authors:
O. Cerri,
S. Xie,
Cristian Peña,
Maria Spiropulu
Abstract:
We study the impact of picosecond precision timing detection systems on the LHC experiments' long-lived particle search program during the HL-LHC era. We develop algorithms that allow us to reconstruct the mass of such charged particles and perform particle identification using the time-of-flight measurement. We investigate the reach for benchmark scenarios as a function of the timing resolution, and find sensitivity improvement of up to a factor of ten, depending on the new heavy particle mass.
Submitted 10 January, 2019; v1 submitted 14 July, 2018;
originally announced July 2018.
-
Topology classification with deep learning to improve real-time event selection at the LHC
Authors:
Thong Q. Nguyen,
Daniel Weitekamp III,
Dustin Anderson,
Roberto Castello,
Olmo Cerri,
Maurizio Pierini,
Maria Spiropulu,
Jean-Roch Vlimant
Abstract:
We show how event topology classification based on deep learning could be used to improve the purity of data samples selected in real time at the Large Hadron Collider. We consider different data representations, on which different kinds of multi-class classifiers are trained. Both raw data and high-level features are utilized. In the considered examples, a filter based on the classifier's score can be trained to retain ~99% of the interesting events and reduce the false-positive rate by as much as one order of magnitude for certain background processes. By operating such a filter as part of the online event selection infrastructure of the LHC experiments, one could benefit from a more flexible and inclusive selection strategy while reducing the amount of downstream resources wasted in processing false positives. The saved resources could be translated into a reduction of the detector operation cost or into an effective increase of storage and processing capabilities, which could be reinvested to extend the physics reach of the LHC experiments.
Submitted 2 September, 2019; v1 submitted 29 June, 2018;
originally announced July 2018.
-
About the rapidity and helicity distributions of the W bosons produced at LHC
Authors:
Elisabetta Manca,
Olmo Cerri,
Nicolo Foppiani,
Gigi Rolandi
Abstract:
$W$ bosons are produced at the LHC from a forward-backward symmetric initial state. Their decay to a charged lepton and a neutrino has a strong spin analysing power. The combination of these effects results in characteristic distributions of the pseudorapidity of the leptons decaying from $W^+$ and $W^-$ of different helicity. This observation may open the possibility to measure precisely the $W^+$ and $W^-$ rapidity distributions for the two transverse polarisation states of $W$ bosons produced at small transverse momentum.
Submitted 14 December, 2017; v1 submitted 28 July, 2017;
originally announced July 2017.
-
Study the effect of beam energy spread and detector resolution on the search for Higgs boson decays to invisible particles at a future e$^+$e$^-$ circular collider
Authors:
Olmo Cerri,
Michele de Gruttola,
Maurizio Pierini,
Alessandro Podo,
Gigi Rolandi
Abstract:
We study the expected sensitivity to measure the branching ratio of Higgs boson decays to invisible particles at a future circular $e^+e^-$ collider (FCC-ee) in the process $e^+e^-\to HZ$ with $Z\to \ell^+\ell^-$ ($\ell=e$ or $\mu$) using an integrated luminosity of 3.5 ab$^{-1}$ at a center-of-mass energy $\sqrt{s}=240$ GeV. The impact of the energy spread of the FCC-ee beam and of the resolution in the reconstruction of the leptons is discussed. The minimum branching ratio for a $5\sigma$ observation after 3.5 ab$^{-1}$ of data taking is $1.7 \pm 0.1\%$ (stat+syst). The branching ratio exclusion limit at 95% CL is $0.63 \pm 0.22\%$ (stat+syst).
Submitted 6 February, 2017; v1 submitted 30 April, 2016;
originally announced May 2016.