-
Communicating Likelihoods with Normalising Flows
Authors:
Jack Y. Araz,
Anja Beck,
Méril Reboud,
Michael Spannowsky,
Danny van Dyk
Abstract:
We present a machine-learning-based workflow to model an unbinned likelihood from its samples. A key advancement over existing approaches is the validation of the learned likelihood using rigorous statistical tests of the joint distribution, such as the Kolmogorov-Smirnov test of the joint distribution. Our method enables the reliable communication of experimental and phenomenological likelihoods…
▽ More
We present a machine-learning-based workflow to model an unbinned likelihood from its samples. A key advancement over existing approaches is the validation of the learned likelihood using rigorous statistical tests of the joint distribution, such as the Kolmogorov-Smirnov test of the joint distribution. Our method enables the reliable communication of experimental and phenomenological likelihoods for subsequent analyses. We demonstrate its effectiveness through three case studies in high-energy physics. To support broader adoption, we provide an open-source reference implementation, nabu.
△ Less
Submitted 13 February, 2025;
originally announced February 2025.
-
Optimal Equivariant Architectures from the Symmetries of Matrix-Element Likelihoods
Authors:
Daniel Maître,
Vishal S. Ngairangbam,
Michael Spannowsky
Abstract:
The Matrix-Element Method (MEM) has long been a cornerstone of data analysis in high-energy physics. It leverages theoretical knowledge of parton-level processes and symmetries to evaluate the likelihood of observed events. In parallel, the advent of geometric deep learning has enabled neural network architectures that incorporate known symmetries directly into their design, leading to more effici…
▽ More
The Matrix-Element Method (MEM) has long been a cornerstone of data analysis in high-energy physics. It leverages theoretical knowledge of parton-level processes and symmetries to evaluate the likelihood of observed events. In parallel, the advent of geometric deep learning has enabled neural network architectures that incorporate known symmetries directly into their design, leading to more efficient learning. This paper presents a novel approach that combines MEM-inspired symmetry considerations with equivariant neural network design for particle physics analysis. Even though Lorentz invariance and permutation invariance overall reconstructed objects are the largest and most natural symmetry in the input domain, we find that they are sub-optimal in most practical search scenarios. We propose a longitudinal boost-equivariant message-passing neural network architecture that preserves relevant discrete symmetries. We present numerical studies demonstrating MEM-inspired architectures achieve new state-of-the-art performance in distinguishing di-Higgs decays to four bottom quarks from the QCD background, with enhanced sample and parameter efficiencies. This synergy between MEM and equivariant deep learning opens new directions for physics-informed architecture design, promising more powerful tools for probing physics beyond the Standard Model.
△ Less
Submitted 24 October, 2024;
originally announced October 2024.
-
Collective variables of neural networks: empirical time evolution and scaling laws
Authors:
Samuel Tovey,
Sven Krippendorf,
Michael Spannowsky,
Konstantin Nikolaou,
Christian Holm
Abstract:
This work presents a novel means for understanding learning dynamics and scaling relations in neural networks. We show that certain measures on the spectrum of the empirical neural tangent kernel, specifically entropy and trace, yield insight into the representations learned by a neural network and how these can be improved through architecture scaling. These results are demonstrated first on test…
▽ More
This work presents a novel means for understanding learning dynamics and scaling relations in neural networks. We show that certain measures on the spectrum of the empirical neural tangent kernel, specifically entropy and trace, yield insight into the representations learned by a neural network and how these can be improved through architecture scaling. These results are demonstrated first on test cases before being shown on more complex networks, including transformers, auto-encoders, graph neural networks, and reinforcement learning studies. In testing on a wide range of architectures, we highlight the universal nature of training dynamics and further discuss how it can be used to understand the mechanisms behind learning in neural networks. We identify two such dominant mechanisms present throughout machine learning training. The first, information compression, is seen through a reduction in the entropy of the NTK spectrum during training, and occurs predominantly in small neural networks. The second, coined structure formation, is seen through an increasing entropy and thus, the creation of structure in the neural network representations beyond the prior established by the network at initialization. Due to the ubiquity of the latter in deep neural network architectures and its flexibility in the creation of feature-rich representations, we argue that this form of evolution of the network's entropy be considered the onset of a deep learning regime.
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
The role of data embedding in quantum autoencoders for improved anomaly detection
Authors:
Jack Y. Araz,
Michael Spannowsky
Abstract:
The performance of Quantum Autoencoders (QAEs) in anomaly detection tasks is critically dependent on the choice of data embedding and ansatz design. This study explores the effects of three data embedding techniques, data re-uploading, parallel embedding, and alternate embedding, on the representability and effectiveness of QAEs in detecting anomalies. Our findings reveal that even with relatively…
▽ More
The performance of Quantum Autoencoders (QAEs) in anomaly detection tasks is critically dependent on the choice of data embedding and ansatz design. This study explores the effects of three data embedding techniques, data re-uploading, parallel embedding, and alternate embedding, on the representability and effectiveness of QAEs in detecting anomalies. Our findings reveal that even with relatively simple variational circuits, enhanced data embedding strategies can substantially improve anomaly detection accuracy and the representability of underlying data across different datasets. Starting with toy examples featuring low-dimensional data, we visually demonstrate the effect of different embedding techniques on the representability of the model. We then extend our analysis to complex, higher-dimensional datasets, highlighting the significant impact of embedding methods on QAE performance.
△ Less
Submitted 6 September, 2024;
originally announced September 2024.
-
Optimal Symmetries in Binary Classification
Authors:
Vishal S. Ngairangbam,
Michael Spannowsky
Abstract:
We explore the role of group symmetries in binary classification tasks, presenting a novel framework that leverages the principles of Neyman-Pearson optimality. Contrary to the common intuition that larger symmetry groups lead to improved classification performance, our findings show that selecting the appropriate group symmetries is crucial for optimising generalisation and sample efficiency. We…
▽ More
We explore the role of group symmetries in binary classification tasks, presenting a novel framework that leverages the principles of Neyman-Pearson optimality. Contrary to the common intuition that larger symmetry groups lead to improved classification performance, our findings show that selecting the appropriate group symmetries is crucial for optimising generalisation and sample efficiency. We develop a theoretical foundation for designing group equivariant neural networks that align the choice of symmetries with the underlying probability distributions of the data. Our approach provides a unified methodology for improving classification accuracy across a broad range of applications by carefully tailoring the symmetry group to the specific characteristics of the problem. Theoretical analysis and experimental results demonstrate that optimal classification performance is not always associated with the largest equivariant groups possible in the domain, even when the likelihood ratio is invariant under one of its proper subgroups, but rather with those subgroups themselves. This work offers insights and practical guidelines for constructing more effective group equivariant architectures in diverse machine-learning contexts.
△ Less
Submitted 16 August, 2024;
originally announced August 2024.
-
Generating Quantum Reservoir State Representations with Random Matrices
Authors:
Samuel Tovey,
Tobias Fellner,
Christian Holm,
Michael Spannowsky
Abstract:
We demonstrate a novel approach to reservoir computation measurements using random matrices. We do so to motivate how atomic-scale devices could be used for real-world computational applications. Our approach uses random matrices to construct reservoir measurements, introducing a simple, scalable means of generating state representations. In our studies, two reservoirs, a five-atom Heisenberg spin…
▽ More
We demonstrate a novel approach to reservoir computation measurements using random matrices. We do so to motivate how atomic-scale devices could be used for real-world computational applications. Our approach uses random matrices to construct reservoir measurements, introducing a simple, scalable means of generating state representations. In our studies, two reservoirs, a five-atom Heisenberg spin chain and a five-qubit quantum circuit, perform time series prediction and data interpolation. The performance of the measurement technique and current limitations are discussed in detail, along with an exploration of the diversity of measurements provided by the random matrices. In addition, we explore the role of reservoir parameters such as coupling strength and measurement dimension, providing insight into how these learning machines could be automatically tuned for different problems. This research highlights the use of random matrices to measure simple quantum reservoirs for natural learning devices, and outlines a path forward for improving their performance and experimental realization.
△ Less
Submitted 4 March, 2025; v1 submitted 10 April, 2024;
originally announced April 2024.
-
Deuterium spectroscopy for enhanced bounds on physics beyond the Standard Model
Authors:
Robert M. Potvliege,
Adair Nicolson,
Matthew P. A. Jones,
Michael Spannowsky
Abstract:
We consider the impact of combining precision spectroscopic measurements made in atomic hydrogen with similar measurements made in atomic deuterium on the search for physics beyond the Standard Model. Specifically we consider the wide class of models that can be described by an effective Yukawa-type interaction between the nucleus and the electron. We find that it is possible to set bounds on new…
▽ More
We consider the impact of combining precision spectroscopic measurements made in atomic hydrogen with similar measurements made in atomic deuterium on the search for physics beyond the Standard Model. Specifically we consider the wide class of models that can be described by an effective Yukawa-type interaction between the nucleus and the electron. We find that it is possible to set bounds on new light-mass bosons that are orders of magnitude more sensitive than those set using a single isotope only, provided the interaction couples differently to the deuteron and proton. Further enhancements of these bounds by an order of magnitude or more would be made possible by extending the current measurements of the isotope shift of the 1s$_{1/2}$-2s$_{1/2}$ transition frequency to that of a transition between the 2s$_{1/2}$ state and a Rydberg s-state.
△ Less
Submitted 31 October, 2023; v1 submitted 7 September, 2023;
originally announced September 2023.
-
Training Neural Networks with Universal Adiabatic Quantum Computing
Authors:
Steve Abel,
Juan Carlos Criado,
Michael Spannowsky
Abstract:
The training of neural networks (NNs) is a computationally intensive task requiring significant time and resources. This paper presents a novel approach to NN training using Adiabatic Quantum Computing (AQC), a paradigm that leverages the principles of adiabatic evolution to solve optimisation problems. We propose a universal AQC method that can be implemented on gate quantum computers, allowing f…
▽ More
The training of neural networks (NNs) is a computationally intensive task requiring significant time and resources. This paper presents a novel approach to NN training using Adiabatic Quantum Computing (AQC), a paradigm that leverages the principles of adiabatic evolution to solve optimisation problems. We propose a universal AQC method that can be implemented on gate quantum computers, allowing for a broad range of Hamiltonians and thus enabling the training of expressive neural networks. We apply this approach to various neural networks with continuous, discrete, and binary weights. Our results indicate that AQC can very efficiently find the global minimum of the loss function, offering a promising alternative to classical training methods.
△ Less
Submitted 24 August, 2023;
originally announced August 2023.
-
Quantum-probabilistic Hamiltonian learning for generative modelling & anomaly detection
Authors:
Jack Y. Araz,
Michael Spannowsky
Abstract:
The Hamiltonian of an isolated quantum mechanical system determines its dynamics and physical behaviour. This study investigates the possibility of learning and utilising a system's Hamiltonian and its variational thermal state estimation for data analysis techniques. For this purpose, we employ the method of Quantum Hamiltonian-based models for the generative modelling of simulated Large Hadron C…
▽ More
The Hamiltonian of an isolated quantum mechanical system determines its dynamics and physical behaviour. This study investigates the possibility of learning and utilising a system's Hamiltonian and its variational thermal state estimation for data analysis techniques. For this purpose, we employ the method of Quantum Hamiltonian-based models for the generative modelling of simulated Large Hadron Collider data and demonstrate the representability of such data as a mixed state. In a further step, we use the learned Hamiltonian for anomaly detection, showing that different sample types can form distinct dynamical behaviours once treated as a quantum many-body system. We exploit these characteristics to quantify the difference between sample types. Our findings show that the methodologies designed for field theory computations can be utilised in machine learning applications to employ theoretical approaches in data analysis techniques.
△ Less
Submitted 28 November, 2023; v1 submitted 7 November, 2022;
originally announced November 2022.
-
Quantum-inspired event reconstruction with Tensor Networks: Matrix Product States
Authors:
Jack Y. Araz,
Michael Spannowsky
Abstract:
Tensor Networks are non-trivial representations of high-dimensional tensors, originally designed to describe quantum many-body systems. We show that Tensor Networks are ideal vehicles to connect quantum mechanical concepts to machine learning techniques, thereby facilitating an improved interpretability of neural networks. This study presents the discrimination of top quark signal over QCD backgro…
▽ More
Tensor Networks are non-trivial representations of high-dimensional tensors, originally designed to describe quantum many-body systems. We show that Tensor Networks are ideal vehicles to connect quantum mechanical concepts to machine learning techniques, thereby facilitating an improved interpretability of neural networks. This study presents the discrimination of top quark signal over QCD background processes using a Matrix Product State classifier. We show that entanglement entropy can be used to interpret what a network learns, which can be used to reduce the complexity of the network and feature space without loss of generality or performance. For the optimisation of the network, we compare the Density Matrix Renormalization Group (DMRG) algorithm to stochastic gradient descent (SGD) and propose a joined training algorithm to harness the explainability of DMRG with the efficiency of SGD.
△ Less
Submitted 6 August, 2021; v1 submitted 15 June, 2021;
originally announced June 2021.
-
Quantum Optimisation of Complex Systems with a Quantum Annealer
Authors:
Steve Abel,
Andrew Blance,
Michael Spannowsky
Abstract:
We perform an in-depth comparison of quantum annealing with several classical optimisation techniques, namely thermal annealing, Nelder-Mead, and gradient descent. We begin with a direct study of the 2D Ising model on a quantum annealer, and compare its properties directly with those of the thermal 2D Ising model. These properties include an Ising-like phase transition that can be induced by eithe…
▽ More
We perform an in-depth comparison of quantum annealing with several classical optimisation techniques, namely thermal annealing, Nelder-Mead, and gradient descent. We begin with a direct study of the 2D Ising model on a quantum annealer, and compare its properties directly with those of the thermal 2D Ising model. These properties include an Ising-like phase transition that can be induced by either a change in 'quantum-ness' of the theory, or by a scaling the Ising couplings up or down. This behaviour is in accord with what is expected from the physical understanding of the quantum system. We then go on to demonstrate the efficacy of the quantum annealer at minimising several increasingly hard two dimensional potentials. For all the potentials we find the general behaviour that Nelder-Mead and gradient descent methods are very susceptible to becoming trapped in false minima, while the thermal anneal method is somewhat better at discovering the true minimum. However, and despite current limitations on its size, the quantum annealer performs a minimisation very markedly better than any of these classical techniques. A quantum anneal can be designed so that the system almost never gets trapped in a false minimum, and rapidly and successfully minimises the potentials.
△ Less
Submitted 21 June, 2021; v1 submitted 28 May, 2021;
originally announced May 2021.
-
Unsupervised Event Classification with Graphs on Classical and Photonic Quantum Computers
Authors:
Andrew Blance,
Michael Spannowsky
Abstract:
Photonic Quantum Computers provides several benefits over the discrete qubit-based paradigm of quantum computing. By using the power of continuous-variable computing we build an anomaly detection model to use on searches for New Physics. Our model uses Gaussian Boson Sampling, a $\#$P-hard problem and thus not efficiently accessible to classical devices. This is used to create feature vectors from…
▽ More
Photonic Quantum Computers provides several benefits over the discrete qubit-based paradigm of quantum computing. By using the power of continuous-variable computing we build an anomaly detection model to use on searches for New Physics. Our model uses Gaussian Boson Sampling, a $\#$P-hard problem and thus not efficiently accessible to classical devices. This is used to create feature vectors from graph data, a natural format for representing data of high-energy collision events. A simple K-means clustering algorithm is used to provide a baseline method of classification. We then present a novel method of anomaly detection, combining the use of Gaussian Boson Sampling and a quantum extension to K-means known as Q-means. This is found to give equivalent results compared to the classical clustering version while also reducing the $\mathcal{O}$ complexity, with respect to the sample's feature-vector length, from $\mathcal{O}(N)$ to $\mathcal{O}(\mbox{log}(N))$. Due to the speed of the sampling algorithm and the feasibility of near-term photonic quantum devices, anomaly detection at the trigger level can become practical in future LHC runs.
△ Less
Submitted 5 March, 2021;
originally announced March 2021.
-
Probing new physics using Rydberg states of atomic hydrogen
Authors:
Matthew P. A. Jones,
Robert M. Potvliege,
Michael Spannowsky
Abstract:
We consider the role of high-lying Rydberg states of simple atomic systems such as $^1$H in setting constraints on physics beyond the Standard Model. We obtain highly accurate bound states energies for a hydrogen atom in the presence of an additional force carrier (the energy levels of the Hellmann potential). These results show that varying the size and shape of the Rydberg state by varying the q…
▽ More
We consider the role of high-lying Rydberg states of simple atomic systems such as $^1$H in setting constraints on physics beyond the Standard Model. We obtain highly accurate bound states energies for a hydrogen atom in the presence of an additional force carrier (the energy levels of the Hellmann potential). These results show that varying the size and shape of the Rydberg state by varying the quantum numbers provides a way to probe the range of new forces. By combining these results with the current state-of-the-art QED corrections, we determine a robust global constraint on new physics that includes all current spectroscopic data in hydrogen. Lastly we show that improved measurements that fully exploit modern cooling and trapping methods as well as higher-lying states could lead to a strong, statistically robust global constraint on new physics based on laboratory measurements only.
△ Less
Submitted 6 February, 2020; v1 submitted 19 September, 2019;
originally announced September 2019.
-
The Compact Linear Collider (CLIC) - 2018 Summary Report
Authors:
The CLIC,
CLICdp collaborations,
:,
T. K. Charles,
P. J. Giansiracusa,
T. G. Lucas,
R. P. Rassool,
M. Volpi,
C. Balazs,
K. Afanaciev,
V. Makarenko,
A. Patapenka,
I. Zhuk,
C. Collette,
M. J. Boland,
A. C. Abusleme Hoffman,
M. A. Diaz,
F. Garay,
Y. Chi,
X. He,
G. Pei,
S. Pei,
G. Shu,
X. Wang,
J. Zhang
, et al. (671 additional authors not shown)
Abstract:
The Compact Linear Collider (CLIC) is a TeV-scale high-luminosity linear $e^+e^-$ collider under development at CERN. Following the CLIC conceptual design published in 2012, this report provides an overview of the CLIC project, its current status, and future developments. It presents the CLIC physics potential and reports on design, technology, and implementation aspects of the accelerator and the…
▽ More
The Compact Linear Collider (CLIC) is a TeV-scale high-luminosity linear $e^+e^-$ collider under development at CERN. Following the CLIC conceptual design published in 2012, this report provides an overview of the CLIC project, its current status, and future developments. It presents the CLIC physics potential and reports on design, technology, and implementation aspects of the accelerator and the detector. CLIC is foreseen to be built and operated in stages, at centre-of-mass energies of 380 GeV, 1.5 TeV and 3 TeV, respectively. CLIC uses a two-beam acceleration scheme, in which 12 GHz accelerating structures are powered via a high-current drive beam. For the first stage, an alternative with X-band klystron powering is also considered. CLIC accelerator optimisation, technical developments and system tests have resulted in an increased energy efficiency (power around 170 MW) for the 380 GeV stage, together with a reduced cost estimate at the level of 6 billion CHF. The detector concept has been refined using improved software tools. Significant progress has been made on detector technology developments for the tracking and calorimetry systems. A wide range of CLIC physics studies has been conducted, both through full detector simulations and parametric studies, together providing a broad overview of the CLIC physics potential. Each of the three energy stages adds cornerstones of the full CLIC physics programme, such as Higgs width and couplings, top-quark properties, Higgs self-coupling, direct searches, and many precision electroweak measurements. The interpretation of the combined results gives crucial and accurate insight into new physics, largely complementary to LHC and HL-LHC. The construction of the first CLIC energy stage could start by 2026. First beams would be available by 2035, marking the beginning of a broad CLIC physics programme spanning 25-30 years.
△ Less
Submitted 6 May, 2019; v1 submitted 14 December, 2018;
originally announced December 2018.