Search | arXiv e-print repository

Communicating Likelihoods with Normalising Flows

Authors: Jack Y. Araz, Anja Beck, Méril Reboud, Michael Spannowsky, Danny van Dyk

Abstract: We present a machine-learning-based workflow to model an unbinned likelihood from its samples. A key advancement over existing approaches is the validation of the learned likelihood using rigorous statistical tests of the joint distribution, such as the Kolmogorov-Smirnov test of the joint distribution. Our method enables the reliable communication of experimental and phenomenological likelihoods… ▽ More We present a machine-learning-based workflow to model an unbinned likelihood from its samples. A key advancement over existing approaches is the validation of the learned likelihood using rigorous statistical tests of the joint distribution, such as the Kolmogorov-Smirnov test of the joint distribution. Our method enables the reliable communication of experimental and phenomenological likelihoods for subsequent analyses. We demonstrate its effectiveness through three case studies in high-energy physics. To support broader adoption, we provide an open-source reference implementation, nabu. △ Less

Submitted 13 February, 2025; originally announced February 2025.

Comments: 4 pages + references, 1 figure

Report number: IPPP/25/07

arXiv:2410.18553 [pdf, other]

Optimal Equivariant Architectures from the Symmetries of Matrix-Element Likelihoods

Authors: Daniel Maître, Vishal S. Ngairangbam, Michael Spannowsky

Abstract: The Matrix-Element Method (MEM) has long been a cornerstone of data analysis in high-energy physics. It leverages theoretical knowledge of parton-level processes and symmetries to evaluate the likelihood of observed events. In parallel, the advent of geometric deep learning has enabled neural network architectures that incorporate known symmetries directly into their design, leading to more effici… ▽ More The Matrix-Element Method (MEM) has long been a cornerstone of data analysis in high-energy physics. It leverages theoretical knowledge of parton-level processes and symmetries to evaluate the likelihood of observed events. In parallel, the advent of geometric deep learning has enabled neural network architectures that incorporate known symmetries directly into their design, leading to more efficient learning. This paper presents a novel approach that combines MEM-inspired symmetry considerations with equivariant neural network design for particle physics analysis. Even though Lorentz invariance and permutation invariance overall reconstructed objects are the largest and most natural symmetry in the input domain, we find that they are sub-optimal in most practical search scenarios. We propose a longitudinal boost-equivariant message-passing neural network architecture that preserves relevant discrete symmetries. We present numerical studies demonstrating MEM-inspired architectures achieve new state-of-the-art performance in distinguishing di-Higgs decays to four bottom quarks from the QCD background, with enhanced sample and parameter efficiencies. This synergy between MEM and equivariant deep learning opens new directions for physics-informed architecture design, promising more powerful tools for probing physics beyond the Standard Model. △ Less

Submitted 24 October, 2024; originally announced October 2024.

Comments: 31 pages, 6 figures, 3 tables

Report number: IPPP/24/69

arXiv:2410.07451 [pdf, other]

Collective variables of neural networks: empirical time evolution and scaling laws

Authors: Samuel Tovey, Sven Krippendorf, Michael Spannowsky, Konstantin Nikolaou, Christian Holm

Abstract: This work presents a novel means for understanding learning dynamics and scaling relations in neural networks. We show that certain measures on the spectrum of the empirical neural tangent kernel, specifically entropy and trace, yield insight into the representations learned by a neural network and how these can be improved through architecture scaling. These results are demonstrated first on test… ▽ More This work presents a novel means for understanding learning dynamics and scaling relations in neural networks. We show that certain measures on the spectrum of the empirical neural tangent kernel, specifically entropy and trace, yield insight into the representations learned by a neural network and how these can be improved through architecture scaling. These results are demonstrated first on test cases before being shown on more complex networks, including transformers, auto-encoders, graph neural networks, and reinforcement learning studies. In testing on a wide range of architectures, we highlight the universal nature of training dynamics and further discuss how it can be used to understand the mechanisms behind learning in neural networks. We identify two such dominant mechanisms present throughout machine learning training. The first, information compression, is seen through a reduction in the entropy of the NTK spectrum during training, and occurs predominantly in small neural networks. The second, coined structure formation, is seen through an increasing entropy and thus, the creation of structure in the neural network representations beyond the prior established by the network at initialization. Due to the ubiquity of the latter in deep neural network architectures and its flexibility in the creation of feature-rich representations, we argue that this form of evolution of the network's entropy be considered the onset of a deep learning regime. △ Less

Submitted 9 October, 2024; originally announced October 2024.

Comments: 11 pages, 3 figures

Report number: IPPP/24/66

arXiv:2409.04519 [pdf, other]

The role of data embedding in quantum autoencoders for improved anomaly detection

Authors: Jack Y. Araz, Michael Spannowsky

Abstract: The performance of Quantum Autoencoders (QAEs) in anomaly detection tasks is critically dependent on the choice of data embedding and ansatz design. This study explores the effects of three data embedding techniques, data re-uploading, parallel embedding, and alternate embedding, on the representability and effectiveness of QAEs in detecting anomalies. Our findings reveal that even with relatively… ▽ More The performance of Quantum Autoencoders (QAEs) in anomaly detection tasks is critically dependent on the choice of data embedding and ansatz design. This study explores the effects of three data embedding techniques, data re-uploading, parallel embedding, and alternate embedding, on the representability and effectiveness of QAEs in detecting anomalies. Our findings reveal that even with relatively simple variational circuits, enhanced data embedding strategies can substantially improve anomaly detection accuracy and the representability of underlying data across different datasets. Starting with toy examples featuring low-dimensional data, we visually demonstrate the effect of different embedding techniques on the representability of the model. We then extend our analysis to complex, higher-dimensional datasets, highlighting the significant impact of embedding methods on QAE performance. △ Less

Submitted 6 September, 2024; originally announced September 2024.

Comments: 8 pages, 5 figures, 4 tables

Report number: JLAB-THY-24-4170, IPPP/24/60

arXiv:2408.08823 [pdf, other]

Optimal Symmetries in Binary Classification

Authors: Vishal S. Ngairangbam, Michael Spannowsky

Abstract: We explore the role of group symmetries in binary classification tasks, presenting a novel framework that leverages the principles of Neyman-Pearson optimality. Contrary to the common intuition that larger symmetry groups lead to improved classification performance, our findings show that selecting the appropriate group symmetries is crucial for optimising generalisation and sample efficiency. We… ▽ More We explore the role of group symmetries in binary classification tasks, presenting a novel framework that leverages the principles of Neyman-Pearson optimality. Contrary to the common intuition that larger symmetry groups lead to improved classification performance, our findings show that selecting the appropriate group symmetries is crucial for optimising generalisation and sample efficiency. We develop a theoretical foundation for designing group equivariant neural networks that align the choice of symmetries with the underlying probability distributions of the data. Our approach provides a unified methodology for improving classification accuracy across a broad range of applications by carefully tailoring the symmetry group to the specific characteristics of the problem. Theoretical analysis and experimental results demonstrate that optimal classification performance is not always associated with the largest equivariant groups possible in the domain, even when the likelihood ratio is invariant under one of its proper subgroups, but rather with those subgroups themselves. This work offers insights and practical guidelines for constructing more effective group equivariant architectures in diverse machine-learning contexts. △ Less

Submitted 16 August, 2024; originally announced August 2024.

Comments: 13 pages, 1 figure, 2 tables

Report number: IPPP/24/57

arXiv:2404.07278 [pdf, other]

Generating Quantum Reservoir State Representations with Random Matrices

Authors: Samuel Tovey, Tobias Fellner, Christian Holm, Michael Spannowsky

Abstract: We demonstrate a novel approach to reservoir computation measurements using random matrices. We do so to motivate how atomic-scale devices could be used for real-world computational applications. Our approach uses random matrices to construct reservoir measurements, introducing a simple, scalable means of generating state representations. In our studies, two reservoirs, a five-atom Heisenberg spin… ▽ More We demonstrate a novel approach to reservoir computation measurements using random matrices. We do so to motivate how atomic-scale devices could be used for real-world computational applications. Our approach uses random matrices to construct reservoir measurements, introducing a simple, scalable means of generating state representations. In our studies, two reservoirs, a five-atom Heisenberg spin chain and a five-qubit quantum circuit, perform time series prediction and data interpolation. The performance of the measurement technique and current limitations are discussed in detail, along with an exploration of the diversity of measurements provided by the random matrices. In addition, we explore the role of reservoir parameters such as coupling strength and measurement dimension, providing insight into how these learning machines could be automatically tuned for different problems. This research highlights the use of random matrices to measure simple quantum reservoirs for natural learning devices, and outlines a path forward for improving their performance and experimental realization. △ Less

Submitted 4 March, 2025; v1 submitted 10 April, 2024; originally announced April 2024.

Comments: 12 pages, 5 figures

Report number: IPPP/24/18

arXiv:2309.03732 [pdf, other]

doi 10.1103/PhysRevA.108.052825

Deuterium spectroscopy for enhanced bounds on physics beyond the Standard Model

Authors: Robert M. Potvliege, Adair Nicolson, Matthew P. A. Jones, Michael Spannowsky

Abstract: We consider the impact of combining precision spectroscopic measurements made in atomic hydrogen with similar measurements made in atomic deuterium on the search for physics beyond the Standard Model. Specifically we consider the wide class of models that can be described by an effective Yukawa-type interaction between the nucleus and the electron. We find that it is possible to set bounds on new… ▽ More We consider the impact of combining precision spectroscopic measurements made in atomic hydrogen with similar measurements made in atomic deuterium on the search for physics beyond the Standard Model. Specifically we consider the wide class of models that can be described by an effective Yukawa-type interaction between the nucleus and the electron. We find that it is possible to set bounds on new light-mass bosons that are orders of magnitude more sensitive than those set using a single isotope only, provided the interaction couples differently to the deuteron and proton. Further enhancements of these bounds by an order of magnitude or more would be made possible by extending the current measurements of the isotope shift of the 1s$_{1/2}$-2s$_{1/2}$ transition frequency to that of a transition between the 2s$_{1/2}$ state and a Rydberg s-state. △ Less

Submitted 31 October, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

Comments: 16 pages, 9 figures

Report number: IPPP/23/50

Journal ref: Phys. Rev. A 108, 052825 (2023)

arXiv:2308.13028 [pdf, other]

Training Neural Networks with Universal Adiabatic Quantum Computing

Authors: Steve Abel, Juan Carlos Criado, Michael Spannowsky

Abstract: The training of neural networks (NNs) is a computationally intensive task requiring significant time and resources. This paper presents a novel approach to NN training using Adiabatic Quantum Computing (AQC), a paradigm that leverages the principles of adiabatic evolution to solve optimisation problems. We propose a universal AQC method that can be implemented on gate quantum computers, allowing f… ▽ More The training of neural networks (NNs) is a computationally intensive task requiring significant time and resources. This paper presents a novel approach to NN training using Adiabatic Quantum Computing (AQC), a paradigm that leverages the principles of adiabatic evolution to solve optimisation problems. We propose a universal AQC method that can be implemented on gate quantum computers, allowing for a broad range of Hamiltonians and thus enabling the training of expressive neural networks. We apply this approach to various neural networks with continuous, discrete, and binary weights. Our results indicate that AQC can very efficiently find the global minimum of the loss function, offering a promising alternative to classical training methods. △ Less

Submitted 24 August, 2023; originally announced August 2023.

Comments: 14 pages

Report number: IPPP/23/46, CERN-TH-2023-162

arXiv:2211.03803 [pdf, other]

doi 10.1103/PhysRevA.108.062422

Quantum-probabilistic Hamiltonian learning for generative modelling & anomaly detection

Authors: Jack Y. Araz, Michael Spannowsky

Abstract: The Hamiltonian of an isolated quantum mechanical system determines its dynamics and physical behaviour. This study investigates the possibility of learning and utilising a system's Hamiltonian and its variational thermal state estimation for data analysis techniques. For this purpose, we employ the method of Quantum Hamiltonian-based models for the generative modelling of simulated Large Hadron C… ▽ More The Hamiltonian of an isolated quantum mechanical system determines its dynamics and physical behaviour. This study investigates the possibility of learning and utilising a system's Hamiltonian and its variational thermal state estimation for data analysis techniques. For this purpose, we employ the method of Quantum Hamiltonian-based models for the generative modelling of simulated Large Hadron Collider data and demonstrate the representability of such data as a mixed state. In a further step, we use the learned Hamiltonian for anomaly detection, showing that different sample types can form distinct dynamical behaviours once treated as a quantum many-body system. We exploit these characteristics to quantify the difference between sample types. Our findings show that the methodologies designed for field theory computations can be utilised in machine learning applications to employ theoretical approaches in data analysis techniques. △ Less

Submitted 28 November, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

Comments: 14 pages, 7 figures. Accepted version for publication

Report number: IPPP/22/77

Journal ref: Phys.Rev.A 108 (2023) 6, 062422

arXiv:2106.08334 [pdf, other]

doi 10.1007/JHEP08(2021)112

Quantum-inspired event reconstruction with Tensor Networks: Matrix Product States

Authors: Jack Y. Araz, Michael Spannowsky

Abstract: Tensor Networks are non-trivial representations of high-dimensional tensors, originally designed to describe quantum many-body systems. We show that Tensor Networks are ideal vehicles to connect quantum mechanical concepts to machine learning techniques, thereby facilitating an improved interpretability of neural networks. This study presents the discrimination of top quark signal over QCD backgro… ▽ More Tensor Networks are non-trivial representations of high-dimensional tensors, originally designed to describe quantum many-body systems. We show that Tensor Networks are ideal vehicles to connect quantum mechanical concepts to machine learning techniques, thereby facilitating an improved interpretability of neural networks. This study presents the discrimination of top quark signal over QCD background processes using a Matrix Product State classifier. We show that entanglement entropy can be used to interpret what a network learns, which can be used to reduce the complexity of the network and feature space without loss of generality or performance. For the optimisation of the network, we compare the Density Matrix Renormalization Group (DMRG) algorithm to stochastic gradient descent (SGD) and propose a joined training algorithm to harness the explainability of DMRG with the efficiency of SGD. △ Less

Submitted 6 August, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

Comments: 29 pages, 15 figures. Accepted version for publication in JHEP

Report number: IPPP/20/114

Journal ref: JHEP 08 (2021) 112

arXiv:2105.13945 [pdf, other]

doi 10.1103/PhysRevA.106.042607

Quantum Optimisation of Complex Systems with a Quantum Annealer

Authors: Steve Abel, Andrew Blance, Michael Spannowsky

Abstract: We perform an in-depth comparison of quantum annealing with several classical optimisation techniques, namely thermal annealing, Nelder-Mead, and gradient descent. We begin with a direct study of the 2D Ising model on a quantum annealer, and compare its properties directly with those of the thermal 2D Ising model. These properties include an Ising-like phase transition that can be induced by eithe… ▽ More We perform an in-depth comparison of quantum annealing with several classical optimisation techniques, namely thermal annealing, Nelder-Mead, and gradient descent. We begin with a direct study of the 2D Ising model on a quantum annealer, and compare its properties directly with those of the thermal 2D Ising model. These properties include an Ising-like phase transition that can be induced by either a change in 'quantum-ness' of the theory, or by a scaling the Ising couplings up or down. This behaviour is in accord with what is expected from the physical understanding of the quantum system. We then go on to demonstrate the efficacy of the quantum annealer at minimising several increasingly hard two dimensional potentials. For all the potentials we find the general behaviour that Nelder-Mead and gradient descent methods are very susceptible to becoming trapped in false minima, while the thermal anneal method is somewhat better at discovering the true minimum. However, and despite current limitations on its size, the quantum annealer performs a minimisation very markedly better than any of these classical techniques. A quantum anneal can be designed so that the system almost never gets trapped in a false minimum, and rapidly and successfully minimises the potentials. △ Less

Submitted 21 June, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

Comments: 24 pages, 19 figures, V3 (fixed typo on page 5)

Report number: IPPP/20/106

arXiv:2103.03897 [pdf, other]

doi 10.1007/JHEP08(2021)170

Unsupervised Event Classification with Graphs on Classical and Photonic Quantum Computers

Authors: Andrew Blance, Michael Spannowsky

Abstract: Photonic Quantum Computers provides several benefits over the discrete qubit-based paradigm of quantum computing. By using the power of continuous-variable computing we build an anomaly detection model to use on searches for New Physics. Our model uses Gaussian Boson Sampling, a $\#$P-hard problem and thus not efficiently accessible to classical devices. This is used to create feature vectors from… ▽ More Photonic Quantum Computers provides several benefits over the discrete qubit-based paradigm of quantum computing. By using the power of continuous-variable computing we build an anomaly detection model to use on searches for New Physics. Our model uses Gaussian Boson Sampling, a $\#$P-hard problem and thus not efficiently accessible to classical devices. This is used to create feature vectors from graph data, a natural format for representing data of high-energy collision events. A simple K-means clustering algorithm is used to provide a baseline method of classification. We then present a novel method of anomaly detection, combining the use of Gaussian Boson Sampling and a quantum extension to K-means known as Q-means. This is found to give equivalent results compared to the classical clustering version while also reducing the $\mathcal{O}$ complexity, with respect to the sample's feature-vector length, from $\mathcal{O}(N)$ to $\mathcal{O}(\mbox{log}(N))$. Due to the speed of the sampling algorithm and the feasibility of near-term photonic quantum devices, anomaly detection at the trigger level can become practical in future LHC runs. △ Less

Submitted 5 March, 2021; originally announced March 2021.

Comments: 24 pages, 7 figures

Journal ref: J. High Energ. Phys. 2021, 170

arXiv:1909.09194 [pdf, other]

doi 10.1103/PhysRevResearch.2.013244

Probing new physics using Rydberg states of atomic hydrogen

Authors: Matthew P. A. Jones, Robert M. Potvliege, Michael Spannowsky

Abstract: We consider the role of high-lying Rydberg states of simple atomic systems such as $^1$H in setting constraints on physics beyond the Standard Model. We obtain highly accurate bound states energies for a hydrogen atom in the presence of an additional force carrier (the energy levels of the Hellmann potential). These results show that varying the size and shape of the Rydberg state by varying the q… ▽ More We consider the role of high-lying Rydberg states of simple atomic systems such as $^1$H in setting constraints on physics beyond the Standard Model. We obtain highly accurate bound states energies for a hydrogen atom in the presence of an additional force carrier (the energy levels of the Hellmann potential). These results show that varying the size and shape of the Rydberg state by varying the quantum numbers provides a way to probe the range of new forces. By combining these results with the current state-of-the-art QED corrections, we determine a robust global constraint on new physics that includes all current spectroscopic data in hydrogen. Lastly we show that improved measurements that fully exploit modern cooling and trapping methods as well as higher-lying states could lead to a strong, statistically robust global constraint on new physics based on laboratory measurements only. △ Less

Submitted 6 February, 2020; v1 submitted 19 September, 2019; originally announced September 2019.

Comments: Accepted for publication in Physical Review Research

Report number: IPPP/19/76

Journal ref: Phys. Rev. Research 2, 013244 (2020)

arXiv:1812.06018 [pdf, other]

doi 10.23731/CYRM-2018-002

The Compact Linear Collider (CLIC) - 2018 Summary Report

Authors: The CLIC, CLICdp collaborations, :, T. K. Charles, P. J. Giansiracusa, T. G. Lucas, R. P. Rassool, M. Volpi, C. Balazs, K. Afanaciev, V. Makarenko, A. Patapenka, I. Zhuk, C. Collette, M. J. Boland, A. C. Abusleme Hoffman, M. A. Diaz, F. Garay, Y. Chi, X. He, G. Pei, S. Pei, G. Shu, X. Wang, J. Zhang , et al. (671 additional authors not shown)

Abstract: The Compact Linear Collider (CLIC) is a TeV-scale high-luminosity linear $e^+e^-$ collider under development at CERN. Following the CLIC conceptual design published in 2012, this report provides an overview of the CLIC project, its current status, and future developments. It presents the CLIC physics potential and reports on design, technology, and implementation aspects of the accelerator and the… ▽ More The Compact Linear Collider (CLIC) is a TeV-scale high-luminosity linear $e^+e^-$ collider under development at CERN. Following the CLIC conceptual design published in 2012, this report provides an overview of the CLIC project, its current status, and future developments. It presents the CLIC physics potential and reports on design, technology, and implementation aspects of the accelerator and the detector. CLIC is foreseen to be built and operated in stages, at centre-of-mass energies of 380 GeV, 1.5 TeV and 3 TeV, respectively. CLIC uses a two-beam acceleration scheme, in which 12 GHz accelerating structures are powered via a high-current drive beam. For the first stage, an alternative with X-band klystron powering is also considered. CLIC accelerator optimisation, technical developments and system tests have resulted in an increased energy efficiency (power around 170 MW) for the 380 GeV stage, together with a reduced cost estimate at the level of 6 billion CHF. The detector concept has been refined using improved software tools. Significant progress has been made on detector technology developments for the tracking and calorimetry systems. A wide range of CLIC physics studies has been conducted, both through full detector simulations and parametric studies, together providing a broad overview of the CLIC physics potential. Each of the three energy stages adds cornerstones of the full CLIC physics programme, such as Higgs width and couplings, top-quark properties, Higgs self-coupling, direct searches, and many precision electroweak measurements. The interpretation of the combined results gives crucial and accurate insight into new physics, largely complementary to LHC and HL-LHC. The construction of the first CLIC energy stage could start by 2026. First beams would be available by 2035, marking the beginning of a broad CLIC physics programme spanning 25-30 years. △ Less

Submitted 6 May, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

Comments: 112 pages, 59 figures; published as CERN Yellow Report Monograph Vol. 2/2018; corresponding editors: Philip N. Burrows, Nuria Catalan Lasheras, Lucie Linssen, Marko Petrič, Aidan Robson, Daniel Schulte, Eva Sicking, Steinar Stapnes

Report number: CERN-2018-005-M

Showing 1–14 of 14 results for author: Spannowsky, M