-
A Neural Network-Based Search for Unmodeled Transients in LIGO-Virgo-KAGRA's Third Observing Run
Authors:
Ryan Raikman,
Eric A. Moreno,
Katya Govorkova,
Siddharth Soni,
Ethan Marx,
William Benoit,
Alec Gunny,
Deep Chatterjee,
Christina Reissel,
Malina M. Desai,
Rafia Omer,
Muhammed Saleem,
Philip Harris,
Erik Katsavounidis,
Michael W. Coughlin,
Dylan Rankin
Abstract:
This paper presents the results of a Neural Network (NN)-based search for short-duration gravitational-wave transients in data from the third observing run of LIGO, Virgo, and KAGRA. The search targets unmodeled transients with durations of milliseconds to a few seconds in the 30-1500 Hz frequency band, without assumptions about the incoming signal direction, polarization, or morphology. Using the…
▽ More
This paper presents the results of a Neural Network (NN)-based search for short-duration gravitational-wave transients in data from the third observing run of LIGO, Virgo, and KAGRA. The search targets unmodeled transients with durations of milliseconds to a few seconds in the 30-1500 Hz frequency band, without assumptions about the incoming signal direction, polarization, or morphology. Using the Gravitational Wave Anomalous Knowledge (GWAK) method, three compact binary coalescences (CBCs) identified by existing pipelines are successfully detected, along with a range of detector glitches. The algorithm constructs a low-dimensional embedded space to capture the physical features of signals, enabling the detection of CBCs, detector glitches, and unmodeled transients. This study demonstrates GWAK's ability to enhance gravitational-wave searches beyond the limits of existing pipelines, laying the groundwork for future detection strategies.
△ Less
Submitted 27 December, 2024;
originally announced December 2024.
-
Low Latency Transformer Inference on FPGAs for Physics Applications with hls4ml
Authors:
Zhixing Jiang,
Dennis Yin,
Yihui Chen,
Elham E Khoda,
Scott Hauck,
Shih-Chieh Hsu,
Ekaterina Govorkova,
Philip Harris,
Vladimir Loncar,
Eric A. Moreno
Abstract:
This study presents an efficient implementation of transformer architectures in Field-Programmable Gate Arrays(FPGAs) using hls4ml. We demonstrate the strategy for implementing the multi-head attention, softmax, and normalization layer and evaluate three distinct models. Their deployment on VU13P FPGA chip achieved latency less than 2us, demonstrating the potential for real-time applications. HLS4…
▽ More
This study presents an efficient implementation of transformer architectures in Field-Programmable Gate Arrays(FPGAs) using hls4ml. We demonstrate the strategy for implementing the multi-head attention, softmax, and normalization layer and evaluate three distinct models. Their deployment on VU13P FPGA chip achieved latency less than 2us, demonstrating the potential for real-time applications. HLS4ML compatibility with any TensorFlow-built transformer model further enhances the scalability and applicability of this work. Index Terms: FPGAs, machine learning, transformers, high energy physics, LIGO
△ Less
Submitted 8 September, 2024;
originally announced September 2024.
-
GWAK: Gravitational-Wave Anomalous Knowledge with Recurrent Autoencoders
Authors:
Ryan Raikman,
Eric A. Moreno,
Ekaterina Govorkova,
Ethan J Marx,
Alec Gunny,
William Benoit,
Deep Chatterjee,
Rafia Omer,
Muhammed Saleem,
Dylan S Rankin,
Michael W Coughlin,
Philip C Harris,
Erik Katsavounidis
Abstract:
Matched-filtering detection techniques for gravitational-wave (GW) signals in ground-based interferometers rely on having well-modeled templates of the GW emission. Such techniques have been traditionally used in searches for compact binary coalescences (CBCs), and have been employed in all known GW detections so far. However, interesting science cases aside from compact mergers do not yet have ac…
▽ More
Matched-filtering detection techniques for gravitational-wave (GW) signals in ground-based interferometers rely on having well-modeled templates of the GW emission. Such techniques have been traditionally used in searches for compact binary coalescences (CBCs), and have been employed in all known GW detections so far. However, interesting science cases aside from compact mergers do not yet have accurate enough modeling to make matched filtering possible, including core-collapse supernovae and sources where stochasticity may be involved. Therefore the development of techniques to identify sources of these types is of significant interest. In this paper, we present a method of anomaly detection based on deep recurrent autoencoders to enhance the search region to unmodeled transients. We use a semi-supervised strategy that we name Gravitational Wave Anomalous Knowledge (GWAK). While the semi-supervised nature of the problem comes with a cost in terms of accuracy as compared to supervised techniques, there is a qualitative advantage in generalizing experimental sensitivity beyond pre-computed signal templates. We construct a low-dimensional embedded space using the GWAK method, capturing the physical signatures of distinct signals on each axis of the space. By introducing signal priors that capture some of the salient features of GW signals, we allow for the recovery of sensitivity even when an unmodeled anomaly is encountered. We show that regions of the GWAK space can identify CBCs, detector glitches and also a variety of unmodeled astrophysical sources.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
Applications of Deep Learning to physics workflows
Authors:
Manan Agarwal,
Jay Alameda,
Jeroen Audenaert,
Will Benoit,
Damon Beveridge,
Meghna Bhattacharya,
Chayan Chatterjee,
Deep Chatterjee,
Andy Chen,
Muhammed Saleem Cholayil,
Chia-Jui Chou,
Sunil Choudhary,
Michael Coughlin,
Maximilian Dax,
Aman Desai,
Andrea Di Luca,
Javier Mauricio Duarte,
Steven Farrell,
Yongbin Feng,
Pooyan Goodarzi,
Ekaterina Govorkova,
Matthew Graham,
Jonathan Guiang,
Alec Gunny,
Weichangfeng Guo
, et al. (43 additional authors not shown)
Abstract:
Modern large-scale physics experiments create datasets with sizes and streaming rates that can exceed those from industry leaders such as Google Cloud and Netflix. Fully processing these datasets requires both sufficient compute power and efficient workflows. Recent advances in Machine Learning (ML) and Artificial Intelligence (AI) can either improve or replace existing domain-specific algorithms…
▽ More
Modern large-scale physics experiments create datasets with sizes and streaming rates that can exceed those from industry leaders such as Google Cloud and Netflix. Fully processing these datasets requires both sufficient compute power and efficient workflows. Recent advances in Machine Learning (ML) and Artificial Intelligence (AI) can either improve or replace existing domain-specific algorithms to increase workflow efficiency. Not only can these algorithms improve the physics performance of current algorithms, but they can often be executed more quickly, especially when run on coprocessors such as GPUs or FPGAs. In the winter of 2023, MIT hosted the Accelerating Physics with ML at MIT workshop, which brought together researchers from gravitational-wave physics, multi-messenger astrophysics, and particle physics to discuss and share current efforts to integrate ML tools into their workflows. The following white paper highlights examples of algorithms and computing frameworks discussed during this workshop and summarizes the expected computing needs for the immediate future of the involved fields.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
DarkQuest: A dark sector upgrade to SpinQuest at the 120 GeV Fermilab Main Injector
Authors:
Aram Apyan,
Brian Batell,
Asher Berlin,
Nikita Blinov,
Caspian Chaharom,
Sergio Cuadra,
Zeynep Demiragli,
Adam Duran,
Yongbin Feng,
I. P. Fernando,
Stefania Gori,
Philip Harris,
Duc Hoang,
Dustin Keller,
Elizabeth Kowalczyk,
Monica Leys,
Kun Liu,
Ming Liu,
Wolfgang Lorenzon,
Petar Maksimovic,
Cristina Mantilla Suarez,
Hrachya Marukyan,
Amitav Mitra,
Yoshiyuki Miyachi,
Patrick McCormack
, et al. (14 additional authors not shown)
Abstract:
Expanding the mass range and techniques by which we search for dark matter is an important part of the worldwide particle physics program. Accelerator-based searches for dark matter and dark sector particles are a uniquely compelling part of this program as a way to both create and detect dark matter in the laboratory and explore the dark sector by searching for mediators and excited dark matter p…
▽ More
Expanding the mass range and techniques by which we search for dark matter is an important part of the worldwide particle physics program. Accelerator-based searches for dark matter and dark sector particles are a uniquely compelling part of this program as a way to both create and detect dark matter in the laboratory and explore the dark sector by searching for mediators and excited dark matter particles. This paper focuses on developing the DarkQuest experimental concept and gives an outlook on related enhancements collectively referred to as LongQuest. DarkQuest is a proton fixed-target experiment with leading sensitivity to an array of visible dark sector signatures in the MeV-GeV mass range. Because it builds off of existing accelerator and detector infrastructure, it offers a powerful but modest-cost experimental initiative that can be realized on a short timescale.
△ Less
Submitted 15 March, 2022;
originally announced March 2022.
-
Source-Agnostic Gravitational-Wave Detection with Recurrent Autoencoders
Authors:
Eric A. Moreno,
Jean-Roch Vlimant,
Maria Spiropulu,
Bartlomiej Borzyszkowski,
Maurizio Pierini
Abstract:
We present an application of anomaly detection techniques based on deep recurrent autoencoders to the problem of detecting gravitational wave signals in laser interferometers. Trained on noise data, this class of algorithms could detect signals using an unsupervised strategy, i.e., without targeting a specific kind of source. We develop a custom architecture to analyze the data from two interferom…
▽ More
We present an application of anomaly detection techniques based on deep recurrent autoencoders to the problem of detecting gravitational wave signals in laser interferometers. Trained on noise data, this class of algorithms could detect signals using an unsupervised strategy, i.e., without targeting a specific kind of source. We develop a custom architecture to analyze the data from two interferometers. We compare the obtained performance to that obtained with other autoencoder architectures and with a convolutional classifier. The unsupervised nature of the proposed strategy comes with a cost in terms of accuracy, when compared to more traditional supervised techniques. On the other hand, there is a qualitative gain in generalizing the experimental sensitivity beyond the ensemble of pre-computed signal templates. The recurrent autoencoder outperforms other autoencoders based on different architectures. The class of recurrent autoencoders presented in this paper could complement the search strategy employed for gravitational wave detection and extend the reach of the ongoing detection campaigns.
△ Less
Submitted 14 December, 2021; v1 submitted 27 July, 2021;
originally announced July 2021.
-
Interaction networks for the identification of boosted $H\to b\overline{b}$ decays
Authors:
Eric A. Moreno,
Thong Q. Nguyen,
Jean-Roch Vlimant,
Olmo Cerri,
Harvey B. Newman,
Avikar Periwal,
Maria Spiropulu,
Javier M. Duarte,
Maurizio Pierini
Abstract:
We develop an algorithm based on an interaction network to identify high-transverse-momentum Higgs bosons decaying to bottom quark-antiquark pairs and distinguish them from ordinary jets that reflect the configurations of quarks and gluons at short distances. The algorithm's inputs are features of the reconstructed charged particles in a jet and the secondary vertices associated with them. Describ…
▽ More
We develop an algorithm based on an interaction network to identify high-transverse-momentum Higgs bosons decaying to bottom quark-antiquark pairs and distinguish them from ordinary jets that reflect the configurations of quarks and gluons at short distances. The algorithm's inputs are features of the reconstructed charged particles in a jet and the secondary vertices associated with them. Describing the jet shower as a combination of particle-to-particle and particle-to-vertex interactions, the model is trained to learn a jet representation on which the classification problem is optimized. The algorithm is trained on simulated samples of realistic LHC collisions, released by the CMS Collaboration on the CERN Open Data Portal. The interaction network achieves a drastic improvement in the identification performance with respect to state-of-the-art algorithms.
△ Less
Submitted 28 July, 2020; v1 submitted 26 September, 2019;
originally announced September 2019.
-
JEDI-net: a jet identification algorithm based on interaction networks
Authors:
Eric A. Moreno,
Olmo Cerri,
Javier M. Duarte,
Harvey B. Newman,
Thong Q. Nguyen,
Avikar Periwal,
Maurizio Pierini,
Aidana Serikova,
Maria Spiropulu,
Jean-Roch Vlimant
Abstract:
We investigate the performance of a jet identification algorithm based on interaction networks (JEDI-net) to identify all-hadronic decays of high-momentum heavy particles produced at the LHC and distinguish them from ordinary jets originating from the hadronization of quarks and gluons. The jet dynamics are described as a set of one-to-one interactions between the jet constituents. Based on a repr…
▽ More
We investigate the performance of a jet identification algorithm based on interaction networks (JEDI-net) to identify all-hadronic decays of high-momentum heavy particles produced at the LHC and distinguish them from ordinary jets originating from the hadronization of quarks and gluons. The jet dynamics are described as a set of one-to-one interactions between the jet constituents. Based on a representation learned from these interactions, the jet is associated to one of the considered categories. Unlike other architectures, the JEDI-net models achieve their performance without special handling of the sparse input jet representation, extensive pre-processing, particle ordering, or specific assumptions regarding the underlying detector geometry. The presented models give better results with less model parameters, offering interesting prospects for LHC applications.
△ Less
Submitted 27 January, 2020; v1 submitted 14 August, 2019;
originally announced August 2019.