-
A Precise Determination of $α_s$ from the Heavy Jet Mass Distribution
Authors:
Miguel A. Benitez,
Arindam Bhattacharya,
Andre H. Hoang,
Vicent Mateu,
Matthew D. Schwartz,
Iain W. Stewart,
Xiaoyuan Zhang
Abstract:
A global fit for $α_s(m_Z)$ is performed on available $e^+e^-$ data for the heavy jet mass distribution. The state-of-the-art theory prediction includes $\mathcal{O}(α_s^3)$ fixed-order results, N$^3$LL$^\prime$ dijet resummation, N$^2$LL Sudakov shoulder resummation, and a first-principles treatment of power corrections in the dijet region. Theoretical correlations are incorporated through a flat…
▽ More
A global fit for $α_s(m_Z)$ is performed on available $e^+e^-$ data for the heavy jet mass distribution. The state-of-the-art theory prediction includes $\mathcal{O}(α_s^3)$ fixed-order results, N$^3$LL$^\prime$ dijet resummation, N$^2$LL Sudakov shoulder resummation, and a first-principles treatment of power corrections in the dijet region. Theoretical correlations are incorporated through a flat random-scan covariance matrix. The global fit results in $0.1145^{+0.0021}_{-0.0019}$, compatible with similar determinations from thrust and $C$-parameter. Dijet resummation is essential for a robust fit, as it engenders insensitivity to the fit-range lower cutoff; without resummation the fit-range sensitivity is overwhelming. In addition, we find evidence for a negative power correction in the trijet region if and only if Sudakov shoulder resummation is included.
△ Less
Submitted 17 February, 2025;
originally announced February 2025.
-
Challenges for Unsupervised Anomaly Detection in Particle Physics
Authors:
Katherine Fraser,
Samuel Homiller,
Rashmish K. Mishra,
Bryan Ostdiek,
Matthew D. Schwartz
Abstract:
Anomaly detection relies on designing a score to determine whether a particular event is uncharacteristic of a given background distribution. One way to define a score is to use autoencoders, which rely on the ability to reconstruct certain types of data (background) but not others (signals). In this paper, we study some challenges associated with variational autoencoders, such as the dependence o…
▽ More
Anomaly detection relies on designing a score to determine whether a particular event is uncharacteristic of a given background distribution. One way to define a score is to use autoencoders, which rely on the ability to reconstruct certain types of data (background) but not others (signals). In this paper, we study some challenges associated with variational autoencoders, such as the dependence on hyperparameters and the metric used, in the context of anomalous signal (top and $W$) jets in a QCD background. We find that the hyperparameter choices strongly affect the network performance and that the optimal parameters for one signal are non-optimal for another. In exploring the networks, we uncover a connection between the latent space of a variational autoencoder trained using mean-squared-error and the optimal transport distances within the dataset. We then show that optimal transport distances to representative events in the background dataset can be used directly for anomaly detection, with performance comparable to the autoencoders. Whether using autoencoders or optimal transport distances for anomaly detection, we find that the choices that best represent the background are not necessarily best for signal identification. These challenges with unsupervised anomaly detection bolster the case for additional exploration of semi-supervised or alternative approaches.
△ Less
Submitted 13 October, 2021;
originally announced October 2021.
-
ABCDisCo: Automating the ABCD Method with Machine Learning
Authors:
Gregor Kasieczka,
Benjamin Nachman,
Matthew D. Schwartz,
David Shih
Abstract:
The ABCD method is one of the most widely used data-driven background estimation techniques in high energy physics. Cuts on two statistically-independent classifiers separate signal and background into four regions, so that background in the signal region can be estimated simply using the other three control regions. Typically, the independent classifiers are chosen "by hand" to be intuitive and p…
▽ More
The ABCD method is one of the most widely used data-driven background estimation techniques in high energy physics. Cuts on two statistically-independent classifiers separate signal and background into four regions, so that background in the signal region can be estimated simply using the other three control regions. Typically, the independent classifiers are chosen "by hand" to be intuitive and physically motivated variables. Here, we explore the possibility of automating the design of one or both of these classifiers using machine learning. We show how to use state-of-the-art decorrelation methods to construct powerful yet independent discriminators. Along the way, we uncover a previously unappreciated aspect of the ABCD method: its accuracy hinges on having low signal contamination in control regions not just overall, but relative to the signal fraction in the signal region. We demonstrate the method with three examples: a simple model consisting of three-dimensional Gaussians; boosted hadronic top jet tagging; and a recasted search for paired dijet resonances. In all cases, automating the ABCD method with machine learning significantly improves performance in terms of ABCD closure, background rejection and signal contamination.
△ Less
Submitted 28 July, 2020;
originally announced July 2020.
-
Learning to Classify from Impure Samples with High-Dimensional Data
Authors:
Patrick T. Komiske,
Eric M. Metodiev,
Benjamin Nachman,
Matthew D. Schwartz
Abstract:
A persistent challenge in practical classification tasks is that labeled training sets are not always available. In particle physics, this challenge is surmounted by the use of simulations. These simulations accurately reproduce most features of data, but cannot be trusted to capture all of the complex correlations exploitable by modern machine learning methods. Recent work in weakly supervised le…
▽ More
A persistent challenge in practical classification tasks is that labeled training sets are not always available. In particle physics, this challenge is surmounted by the use of simulations. These simulations accurately reproduce most features of data, but cannot be trusted to capture all of the complex correlations exploitable by modern machine learning methods. Recent work in weakly supervised learning has shown that simple, low-dimensional classifiers can be trained using only the impure mixtures present in data. Here, we demonstrate that complex, high-dimensional classifiers can also be trained on impure mixtures using weak supervision techniques, with performance comparable to what could be achieved with pure samples. Using weak supervision will therefore allow us to avoid relying exclusively on simulations for high-dimensional classification. This work opens the door to a new regime whereby complex models are trained directly on data, providing direct access to probe the underlying physics.
△ Less
Submitted 24 July, 2018; v1 submitted 30 January, 2018;
originally announced January 2018.
-
Pileup Mitigation with Machine Learning (PUMML)
Authors:
Patrick T. Komiske,
Eric M. Metodiev,
Benjamin Nachman,
Matthew D. Schwartz
Abstract:
Pileup involves the contamination of the energy distribution arising from the primary collision of interest (leading vertex) by radiation from soft collisions (pileup). We develop a new technique for removing this contamination using machine learning and convolutional neural networks. The network takes as input the energy distribution of charged leading vertex particles, charged pileup particles,…
▽ More
Pileup involves the contamination of the energy distribution arising from the primary collision of interest (leading vertex) by radiation from soft collisions (pileup). We develop a new technique for removing this contamination using machine learning and convolutional neural networks. The network takes as input the energy distribution of charged leading vertex particles, charged pileup particles, and all neutral particles and outputs the energy distribution of particles coming from leading vertex alone. The PUMML algorithm performs remarkably well at eliminating pileup distortion on a wide range of simple and complex jet observables. We test the robustness of the algorithm in a number of ways and discuss how the network can be trained directly on data.
△ Less
Submitted 8 January, 2018; v1 submitted 26 July, 2017;
originally announced July 2017.
-
Precision physics with pile-up insensitive observables
Authors:
Christopher Frye,
Andrew J. Larkoski,
Matthew D. Schwartz,
Kai Yan
Abstract:
To deepen the search for beyond the Standard Model physics, the Large Hadron Collider is pushing to higher and higher luminosity. At high luminosity, precision physics becomes increasingly difficult due to contamination from additional proton collisions per bunch crossing called pile-up. In recent years, many methods have been developed to cull this excess mostly low-energy radiation away from imp…
▽ More
To deepen the search for beyond the Standard Model physics, the Large Hadron Collider is pushing to higher and higher luminosity. At high luminosity, precision physics becomes increasingly difficult due to contamination from additional proton collisions per bunch crossing called pile-up. In recent years, many methods have been developed to cull this excess mostly low-energy radiation away from important signal regions, but it has been unclear if these methods were amenable to systematically-improvable theoretical understanding. In this paper, it is shown that one such method, soft drop jet grooming, has excellent theoretical properties: it is ultra-local, depending on only radiation within a jet, and it is free of non-global logarithms. Calculations of the soft drop jet mass and related observables are presented at next-to-next-to-leading logarithmic accuracy matched to next-to-next-to-leading fixed-order in perturbative Quantum Chromodynamics. Once measured at the Large Hadron Collider, precision comparisons between theory and data can be made, essentially independent of the amount of pile-up contamination.
△ Less
Submitted 22 March, 2016; v1 submitted 21 March, 2016;
originally announced March 2016.
-
Towards an Understanding of the Correlations in Jet Substructure
Authors:
D. Adams,
A. Arce,
L. Asquith,
M. Backovic,
T. Barillari,
P. Berta,
D. Bertolini,
A. Buckley,
J. Butterworth,
R. C. Camacho Toro,
J. Caudron,
Y. -T. Chien,
J. Cogan,
B. Cooper,
D. Curtin,
C. Debenedetti,
J. Dolen,
M. Eklund,
S. El Hedri,
S. D. Ellis,
T. Embry,
D. Ferencek,
J. Ferrando,
S. Fleischmann,
M. Freytsis
, et al. (61 additional authors not shown)
Abstract:
Over the past decade, a large number of jet substructure observables have been proposed in the literature, and explored at the LHC experiments. Such observables attempt to utilize the internal structure of jets in order to distinguish those initiated by quarks, gluons, or by boosted heavy objects, such as top quarks and W bosons. This report, originating from and motivated by the BOOST2013 worksho…
▽ More
Over the past decade, a large number of jet substructure observables have been proposed in the literature, and explored at the LHC experiments. Such observables attempt to utilize the internal structure of jets in order to distinguish those initiated by quarks, gluons, or by boosted heavy objects, such as top quarks and W bosons. This report, originating from and motivated by the BOOST2013 workshop, presents original particle-level studies that aim to improve our understanding of the relationships between jet substructure observables, their complementarity, and their dependence on the underlying jet properties, particularly the jet radius and jet transverse momentum. This is explored in the context of quark/gluon discrimination, boosted W boson tagging and boosted top quark tagging.
△ Less
Submitted 18 August, 2015; v1 submitted 2 April, 2015;
originally announced April 2015.
-
Boosted objects and jet substructure at the LHC
Authors:
BOOST2012 participants- A. Altheimer,
A. Arce,
L. Asquith,
J. Backus Mayes,
E. Bergeaas Kuutmann,
J. Berger,
D. Bjergaard,
L. Bryngemark,
A. Buckley,
J. Butterworth,
M. Cacciari,
M. Campanelli,
T. Carli,
M. Chala,
B. Chapleau,
C. Chen,
J. P. Chou,
Th. Cornelissen,
D. Curtin,
M. Dasgupta,
A. Davison,
F. de Almeida Dias,
A. de Cosa,
A. de Roeck,
C. Debenedetti
, et al. (62 additional authors not shown)
Abstract:
This report of the BOOST2012 workshop presents the results of four working groups that studied key aspects of jet substructure. We discuss the potential of the description of jet substructure in first-principle QCD calculations and study the accuracy of state-of-the-art Monte Carlo tools. Experimental limitations of the ability to resolve substructure are evaluated, with a focus on the impact of a…
▽ More
This report of the BOOST2012 workshop presents the results of four working groups that studied key aspects of jet substructure. We discuss the potential of the description of jet substructure in first-principle QCD calculations and study the accuracy of state-of-the-art Monte Carlo tools. Experimental limitations of the ability to resolve substructure are evaluated, with a focus on the impact of additional proton proton collisions on jet substructure performance in future LHC operating scenarios. A final section summarizes the lessons learnt during the deployment of substructure analyses in searches for new physics in the production of boosted top quarks.
△ Less
Submitted 4 December, 2013; v1 submitted 12 November, 2013;
originally announced November 2013.
-
Jet Cleansing: Pileup Removal at High Luminosity
Authors:
David Krohn,
Matthew Low,
Matthew D. Schwartz,
Lian-Tao Wang
Abstract:
One of the greatest impediments to extracting useful information from high luminosity hadron-collider data is radiation from secondary collisions (i.e. pileup) which can overlap with that of the primary interaction. In this paper we introduce a simple jet-substructure technique termed cleansing which can consistently correct for large amounts of pileup in an observable independent way. Cleansing w…
▽ More
One of the greatest impediments to extracting useful information from high luminosity hadron-collider data is radiation from secondary collisions (i.e. pileup) which can overlap with that of the primary interaction. In this paper we introduce a simple jet-substructure technique termed cleansing which can consistently correct for large amounts of pileup in an observable independent way. Cleansing works at the subjet level, combining tracker and calorimeter-based data to reconstruct the pileup-free primary interaction. The technique can be used on its own, with various degrees of sophistication, or in concert with jet grooming. We apply cleansing to both kinematic and jet shape reconstruction, finding in all cases a marked improvement over previous methods both in the correlation of the cleansed data with uncontaminated results and in measures like S/rt(B). Cleansing should improve the sensitivity of new-physics searches at high luminosity and could also aid in the comparison of precision QCD calculations to collider data.
△ Less
Submitted 26 September, 2014; v1 submitted 18 September, 2013;
originally announced September 2013.
-
Quark and Gluon Jet Substructure
Authors:
Jason Gallicchio,
Matthew D. Schwartz
Abstract:
Distinguishing quark-initiated jets from gluon-initiated jets has the potential to significantly improve the reach of many beyond-the-standard model searches at the Large Hadron Collider and to provide additional tests of QCD. To explore whether quark and gluon jets could possibly be distinguished on an event-by-event basis, we perform a comprehensive simulation-based study. We explore a variety o…
▽ More
Distinguishing quark-initiated jets from gluon-initiated jets has the potential to significantly improve the reach of many beyond-the-standard model searches at the Large Hadron Collider and to provide additional tests of QCD. To explore whether quark and gluon jets could possibly be distinguished on an event-by-event basis, we perform a comprehensive simulation-based study. We explore a variety of motivated and unmotivated variables with a semi-automated multivariate approach. General conclusions are that at 50% quark jet acceptance efficiency, around 80%-90% of gluon jets can be rejected. Some benefit is gained by combining variables. Different event generators are compared, as are the effects of using only charged tracks to avoid pileup. Additional information, including interactive distributions of most variables and their cut efficiencies, can be at http://jets.physics.harvard.edu/qvg.
△ Less
Submitted 11 April, 2013; v1 submitted 29 November, 2012;
originally announced November 2012.
-
Qjets: A Non-Deterministic Approach to Tree-Based Jet Substructure
Authors:
Stephen D. Ellis,
Andrew Hornig,
David Krohn,
Tuhin S. Roy,
Matthew D. Schwartz
Abstract:
Jet substructure is typically studied using clustering algorithms, such as kT, which arrange the jets' constituents into trees. Instead of considering a single tree per jet, we propose that multiple trees should be considered, weighted by an appropriate metric. Then each jet in each event produces a distribution for an observable, rather than a single value. Advantages of this approach include: 1)…
▽ More
Jet substructure is typically studied using clustering algorithms, such as kT, which arrange the jets' constituents into trees. Instead of considering a single tree per jet, we propose that multiple trees should be considered, weighted by an appropriate metric. Then each jet in each event produces a distribution for an observable, rather than a single value. Advantages of this approach include: 1) observables have significantly increased statistical stability; and, 2) new observables, such as the variance of the distribution, provide new handles for signal and background discrimination. For example, we find that employing a set of trees substantially reduces the observed fluctuations in the pruned mass distribution, enhancing the likelihood of new particle discovery for a given integrated luminosity. Furthermore, the resulting pruned mass distributions for (background) QCD jets are found to be substantially wider than that for (signal) jets with intrinsic mass scales, e.g. jets containing a W decay. A cut on this width yields a substantial enhancement in significance relative to a cut on the standard pruned jet mass alone. In particular the luminosity needed for a given significance requirement decreases by a factor of two relative to standard pruning.
△ Less
Submitted 20 June, 2012; v1 submitted 9 January, 2012;
originally announced January 2012.
-
Jet Substructure at the Tevatron and LHC: New results, new tools, new benchmarks
Authors:
A. Altheimer,
S. Arora,
L. Asquith,
G. Brooijmans,
J. Butterworth,
M. Campanelli,
B. Chapleau,
A. E. Cholakian,
J. P. Chou,
M. Dasgupta,
A. Davison,
J. Dolen,
S. D. Ellis,
R. Essig,
J. J. Fan,
R. Field,
A. Fregoso,
J. Gallicchio,
Y. Gershtein,
A. Gomes,
A. Haas,
E. Halkiadakis,
V. Halyo,
S. Hoeche,
A. Hook
, et al. (46 additional authors not shown)
Abstract:
In this report we review recent theoretical progress and the latest experimental results in jet substructure from the Tevatron and the LHC. We review the status of and outlook for calculation and simulation tools for studying jet substructure. Following up on the report of the Boost 2010 workshop, we present a new set of benchmark comparisons of substructure techniques, focusing on the set of vari…
▽ More
In this report we review recent theoretical progress and the latest experimental results in jet substructure from the Tevatron and the LHC. We review the status of and outlook for calculation and simulation tools for studying jet substructure. Following up on the report of the Boost 2010 workshop, we present a new set of benchmark comparisons of substructure techniques, focusing on the set of variables and grooming methods that are collectively known as "top taggers". To facilitate further exploration, we have attempted to collect, harmonise, and publish software implementations of these techniques.
△ Less
Submitted 25 May, 2012; v1 submitted 29 December, 2011;
originally announced January 2012.
-
Resummation for W and Z production at large pT
Authors:
Thomas Becher,
Christian Lorentzen,
Matthew D. Schwartz
Abstract:
Soft-Collinear Effective theory is used to perform threshold resummation for W and Z production at large transverse momentum to next-to-next-to-leading logarithmic accuracy including matching to next-to-leading fixed-order results. The results agree very well with data from the Tevatron, and predictions are made for the high-pT spectra at the LHC. While the higher-log terms are of moderate size, t…
▽ More
Soft-Collinear Effective theory is used to perform threshold resummation for W and Z production at large transverse momentum to next-to-next-to-leading logarithmic accuracy including matching to next-to-leading fixed-order results. The results agree very well with data from the Tevatron, and predictions are made for the high-pT spectra at the LHC. While the higher-log terms are of moderate size, their inclusion leads to a substantial reduction of the perturbative uncertainty. With these improvements, the PDF uncertainties now dominate the error on the predicted cross section.
△ Less
Submitted 21 June, 2011;
originally announced June 2011.
-
Top condensation as a motivated explanation of the top forward-backward asymmetry
Authors:
Yanou Cui,
Zhenyu Han,
Matthew D. Schwartz
Abstract:
Models of top condensation can provide both a compelling solution to the hierarchy problem as well as an explanation of why the top-quark mass is large. The spectrum of such models, in particular topcolor-assisted technicolor, includes top-pions, top-rhos and the top-Higgs, all of which can easily have large top-charm or top-up couplings. Large top-up couplings in particular would lead to a top fo…
▽ More
Models of top condensation can provide both a compelling solution to the hierarchy problem as well as an explanation of why the top-quark mass is large. The spectrum of such models, in particular topcolor-assisted technicolor, includes top-pions, top-rhos and the top-Higgs, all of which can easily have large top-charm or top-up couplings. Large top-up couplings in particular would lead to a top forward-backward asymmetry through $t$-channel exchange, easily consistent with the Tevatron measurements. Intriguingly, there is destructive interference between the top-mesons and the standard model which conspire to make the overall top pair production rate consistent with the standard model. The rate for same-sign top production is also small due to destructive interference between the neutral top-pion and the top-Higgs. Flavor physics is under control because new physics is mostly confined to the top quark. In this way, top condensation can explain the asymmetry and be consistent with all experimental bounds. There are many additional signatures of topcolor with large tu mixing, such as top(s)+jet(s) events, in which a top and a jet reconstruct a resonance mass, which make these models easily testable at the LHC.
△ Less
Submitted 15 June, 2011;
originally announced June 2011.
-
Quark and Gluon Tagging at the LHC
Authors:
Jason Gallicchio,
Matthew D. Schwartz
Abstract:
Being able to distinguish light-quark jets from gluon jets on an event-by-event basis could significantly enhance the reach for many new physics searches at the Large Hadron Collider. Through an exhaustive search of existing and novel jet substructure observables, we find that a multivariate approach can filter out over 95% of the gluon jets while keeping more than half of the light-quark jets. Mo…
▽ More
Being able to distinguish light-quark jets from gluon jets on an event-by-event basis could significantly enhance the reach for many new physics searches at the Large Hadron Collider. Through an exhaustive search of existing and novel jet substructure observables, we find that a multivariate approach can filter out over 95% of the gluon jets while keeping more than half of the light-quark jets. Moreover, a combination of two simple variables, the charge track multiplicity and the $p_T$-weighted linear radial moment (girth), can achieve similar results. While this pair appears very promising, our study is only Monte Carlo based, and other discriminants may work better with real data in a realistic experimental environment. To that end, we explore many other observables constructed using different jet sizes and parameters, and highlight those that deserve further theoretical and experimental scrutiny. Additional information, including distributions of around 10,000 variables, can be found on this website http://jets.physics.harvard.edu/qvg .
△ Less
Submitted 19 October, 2011; v1 submitted 15 June, 2011;
originally announced June 2011.
-
Simplified Models for LHC New Physics Searches
Authors:
Daniele Alves,
Nima Arkani-Hamed,
Sanjay Arora,
Yang Bai,
Matthew Baumgart,
Joshua Berger,
Matthew Buckley,
Bart Butler,
Spencer Chang,
Hsin-Chia Cheng,
Clifford Cheung,
R. Sekhar Chivukula,
Won Sang Cho,
Randy Cotta,
Mariarosaria D'Alfonso,
Sonia El Hedri,
Rouven Essig,
Jared A. Evans,
Liam Fitzpatrick,
Patrick Fox,
Roberto Franceschini,
Ayres Freitas,
James S. Gainer,
Yuri Gershtein,
Richard Gray
, et al. (70 additional authors not shown)
Abstract:
This document proposes a collection of simplified models relevant to the design of new-physics searches at the LHC and the characterization of their results. Both ATLAS and CMS have already presented some results in terms of simplified models, and we encourage them to continue and expand this effort, which supplements both signature-based results and benchmark model interpretations. A simplified m…
▽ More
This document proposes a collection of simplified models relevant to the design of new-physics searches at the LHC and the characterization of their results. Both ATLAS and CMS have already presented some results in terms of simplified models, and we encourage them to continue and expand this effort, which supplements both signature-based results and benchmark model interpretations. A simplified model is defined by an effective Lagrangian describing the interactions of a small number of new particles. Simplified models can equally well be described by a small number of masses and cross-sections. These parameters are directly related to collider physics observables, making simplified models a particularly effective framework for evaluating searches and a useful starting point for characterizing positive signals of new physics. This document serves as an official summary of the results from the "Topologies for Early LHC Searches" workshop, held at SLAC in September of 2010, the purpose of which was to develop a set of representative models that can be used to cover all relevant phase space in experimental searches. Particular emphasis is placed on searches relevant for the first ~50-500 pb-1 of data and those motivated by supersymmetric models. This note largely summarizes material posted at http://lhcnewphysics.org/, which includes simplified model definitions, Monte Carlo material, and supporting contacts within the theory community. We also comment on future developments that may be useful as more data is gathered and analyzed by the experiments.
△ Less
Submitted 13 May, 2011;
originally announced May 2011.
-
Pure Samples of Quark and Gluon Jets at the LHC
Authors:
Jason Gallicchio,
Matthew D. Schwartz
Abstract:
Having pure samples of quark and gluon jets would greatly facilitate the study of jet properties and substructure, with many potential standard model and new physics applications. To this end, we consider multijet and jets+X samples, to determine the purity that can be achieved by simple kinematic cuts leaving reasonable production cross sections. We find, for example, that at the 7 TeV LHC, the p…
▽ More
Having pure samples of quark and gluon jets would greatly facilitate the study of jet properties and substructure, with many potential standard model and new physics applications. To this end, we consider multijet and jets+X samples, to determine the purity that can be achieved by simple kinematic cuts leaving reasonable production cross sections. We find, for example, that at the 7 TeV LHC, the pp {\to} γ+2jets sample can provide 98% pure quark jets with 200 GeV of transverse momentum and a cross section of 5 pb. To get 10 pb of 200 GeV jets with 90% gluon purity, the pp {\to} 3jets sample can be used. b+2jets is also useful for gluons, but only if the b-tagging is very efficient.
△ Less
Submitted 19 October, 2011; v1 submitted 6 April, 2011;
originally announced April 2011.
-
W-jet Tagging: Optimizing the Identification of Boosted Hadronically-Decaying W Bosons
Authors:
Yanou Cui,
Zhenyu Han,
Matthew D. Schwartz
Abstract:
A method is proposed for distinguishing highly boosted hadronically decaying W's (W-jets) from QCD-jets using jet substructure. Previous methods, such as the filtering/mass-drop method, can give a factor of ~2 improvement in S/sqrt(B) for jet pT > 200 GeV. In contrast, a multivariate approach including new discriminants such as R-cores, which characterize the shape of the W-jet, subjet planar flow…
▽ More
A method is proposed for distinguishing highly boosted hadronically decaying W's (W-jets) from QCD-jets using jet substructure. Previous methods, such as the filtering/mass-drop method, can give a factor of ~2 improvement in S/sqrt(B) for jet pT > 200 GeV. In contrast, a multivariate approach including new discriminants such as R-cores, which characterize the shape of the W-jet, subjet planar flow, and grooming-sensitivities is shown to provide a much larger factor of ~5 improvement in S/sqrt(B). For longitudinally polarized W's, such as those coming from many new physics models, the discrimination is even better. Comparing different Monte Carlo simulations, we observe a sensitivity of some variables to the underlying event; however, even with a conservative estimates, the multivariate approach is very powerful. Applications to semileptonic WW resonance searches and all-hadronic W+jet searches at the LHC are also discussed. Code implementing our W-jet tagging algorithm is publicly available at http://jets.physics.harvard.edu/wtag
△ Less
Submitted 10 May, 2011; v1 submitted 9 December, 2010;
originally announced December 2010.
-
Multivariate discrimination and the Higgs + W/Z search
Authors:
Kevin Black,
Jason Gallicchio,
John Huth,
Michael Kagan,
Matthew D. Schwartz,
Brock Tweedie
Abstract:
A systematic method for optimizing multivariate discriminants is developed and applied to the important example of a light Higgs boson search at the Tevatron and the LHC. The Significance Improvement Characteristic (SIC), defined as the signal efficiency of a cut or multivariate discriminant divided by the square root of the background efficiency, is shown to be an extremely powerful visualization…
▽ More
A systematic method for optimizing multivariate discriminants is developed and applied to the important example of a light Higgs boson search at the Tevatron and the LHC. The Significance Improvement Characteristic (SIC), defined as the signal efficiency of a cut or multivariate discriminant divided by the square root of the background efficiency, is shown to be an extremely powerful visualization tool. SIC curves demonstrate numerical instabilities in the multivariate discriminants, show convergence as the number of variables is increased, and display the sensitivity to the optimal cut values. For our application, we concentrate on Higgs boson production in association with a W or Z boson with H -> bb and compare to the irreducible standard model background, Z/W + bb. We explore thousands of experimentally motivated, physically motivated, and unmotivated single variable discriminants. Along with the standard kinematic variables, a number of new ones, such as twist, are described which should have applicability to many processes. We find that some single variables, such as the pull angle, are weak discriminants, but when combined with others they provide important marginal improvement. We also find that multiple Higgs boson-candidate mass measures, such as from mild and aggressively trimmed jets, when combined may provide additional discriminating power. Comparing the significance improvement from our variables to those used in recent CDF and DZero searches, we find that a 10-20% improvement in significance against Z/W + bb is possible. Our analysis also suggests that the H + W/Z channel with H -> bb is also viable at the LHC, without requiring a hard cut on the W/Z transverse momentum.
△ Less
Submitted 21 June, 2011; v1 submitted 18 October, 2010;
originally announced October 2010.
-
THE TOOLS AND MONTE CARLO WORKING GROUP Summary Report from the Les Houches 2009 Workshop on TeV Colliders
Authors:
J. M. Butterworth,
F. Maltoni,
F. Moortgat,
P. Richardson,
S. Schumann,
P. Skands,
J. Alwall,
A. Arbey,
L. Basso,
S. Belov,
A. Bharucha,
F. Braam,
A. Buckley,
M. Campanelli,
R. Chierici,
A. Djouadi,
L. Dudko,
C. Duhr,
F. Febres Cordero,
P. Francavilla,
B. Fuks,
L. Garren,
T. Goto,
M. Grazzini,
T. Hahn
, et al. (47 additional authors not shown)
Abstract:
This is the summary and introduction to the proceedings contributions for the Les Houches 2009 "Tools and Monte Carlo" working group.
This is the summary and introduction to the proceedings contributions for the Les Houches 2009 "Tools and Monte Carlo" working group.
△ Less
Submitted 8 March, 2010;
originally announced March 2010.