-
Real-Time Active Learning for optimised spectroscopic follow-up: Enhancing early SN Ia classification with the Fink broker
Authors:
A. Möller,
E. E. O. Ishida,
J. Peloton,
O. Vidal Velázquez,
J. Soon,
B. Martin,
M. Cluver,
M. Leoni,
E. Taylor
Abstract:
Current and future surveys rely on machine learning classification to obtain large and complete samples of transients. Many of these algorithms are restricted by training samples that contain a limited number of spectroscopically confirmed events. Here, we present the first real-time application of Active Learning (AL) to optimise spectroscopic follow-up with the goal of improving training sets of early type Ia supernovae (SNe Ia) classifiers. Using a photometric classifier for early SN Ia, we apply an Active Learning strategy for follow-up optimisation using the real-time FINK broker processing of the ZTF public stream. We perform follow-up observations at the ANU 2.3m telescope in Australia and obtain 92 spectroscopically classified events that are incorporated into our training set. We show that our follow-up strategy yields a training set that, with 25% fewer spectra, improves classification metrics when compared to publicly reported spectra. Our strategy selects, on average, fainter events and includes not only supernova types but also microlensing events and flaring stars, which are usually not incorporated in training sets. Our results confirm the effectiveness of active learning strategies to construct optimal training samples for astronomical classifiers. With the Rubin Observatory LSST soon online, we propose improvements to obtain earlier candidates and optimise follow-up. This work paves the way to the deployment of real-time AL follow-up strategies in the era of large surveys.
Submitted 12 March, 2025; v1 submitted 26 February, 2025;
originally announced February 2025.
-
Transient Classifiers for Fink: Benchmarks for LSST
Authors:
B. M. O. Fraga,
C. R. Bom,
A. Santos,
E. Russeil,
M. Leoni,
J. Peloton,
E. E. O. Ishida,
A. Möller,
S. Blondin
Abstract:
The upcoming Legacy Survey of Space and Time (LSST) is expected to detect a few million transients per night, which will generate a live alert stream during the entire ten years of the survey. This stream will be distributed via community brokers whose task is to select subsets of the stream and direct them to scientific communities. Given the volume and complexity of the anticipated data, machine learning algorithms will be paramount for this task. We present the infrastructure tests and classification methods developed within the Fink broker in preparation for LSST. This work aims to provide detailed information regarding the underlying assumptions and methods behind each classifier and enable users to make informed follow-up decisions from Fink photometric classifications. Using simulated data from ELAsTiCC, we showcase the performance of binary and multi-class ML classifiers available in Fink. These include tree-based classifiers coupled with tailored feature extraction strategies as well as deep learning algorithms. Moreover, we introduce CATS, a deep learning architecture specifically designed for this task. Our results show that Fink classifiers are able to handle the extra complexity that is expected from LSST data. CATS achieved $\geq 93\%$ precision for all classes except `long' (for which it achieved $\sim 83\%$), while our best performing binary classifier achieves $\geq 98\%$ precision and $\geq 99\%$ completeness when classifying the periodic class. ELAsTiCC was an important milestone in preparing the Fink infrastructure to deal with LSST-like data. Our results demonstrate that Fink classifiers are well prepared for the arrival of the new stream, but this work also highlights that transitioning from the current infrastructures to Rubin will require significant adaptation of the currently available tools. This work was the first step in the right direction.
Submitted 29 November, 2024; v1 submitted 12 April, 2024;
originally announced April 2024.
-
Fink: early supernovae Ia classification using active learning
Authors:
Marco Leoni,
Emille E. O. Ishida,
Julien Peloton,
Anais Möller
Abstract:
We describe how the Fink broker early supernova Ia classifier optimizes its ML classifications by employing an active learning (AL) strategy. We demonstrate the feasibility of implementing such strategies in the current Zwicky Transient Facility (ZTF) public alert data stream. We compare the performance of two AL strategies: uncertainty sampling and random sampling. Our pipeline consists of three stages: feature extraction, classification and learning strategy. Starting from an initial sample of 10 alerts (5 SN Ia and 5 non-Ia), we let the algorithm identify which alert should be added to the training sample. The system is allowed to evolve through 300 iterations. Our data set consists of 23 840 alerts from the ZTF with confirmed classification via cross-match with the SIMBAD database and the Transient Name Server (TNS), 1 600 of which were SNe Ia (1 021 unique objects). The data configuration, after the learning cycle was completed, consists of 310 alerts for training and 23 530 for testing. Averaging over 100 realizations, the classifier achieved 89% purity and 54% efficiency. From 1 November 2020 to 31 October 2021, Fink applied its early supernova Ia module to the ZTF stream and communicated promising SN Ia candidates to the TNS. Of the 535 spectroscopically classified Fink candidates, 459 (86%) were proven to be SNe Ia. Our results confirm the effectiveness of active learning strategies for guiding the construction of optimal training samples for astronomical classifiers. They demonstrate on real data that the performance of learning algorithms can be substantially improved without the need for extra computational resources or overwhelmingly large training samples. This is, to our knowledge, the first application of AL to real alert data.
Submitted 20 April, 2022; v1 submitted 22 November, 2021;
originally announced November 2021.
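The learning cycle described in this abstract (10 initial alerts, one query per iteration, 300 iterations, uncertainty versus random sampling) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the data are synthetic one-dimensional features, the nearest-centroid classifier is a toy stand-in rather than the classifier used in Fink, and all function names are hypothetical. Purity and efficiency are taken here in their usual sense of precision and recall, which the abstract does not define explicitly.

```python
import math
import random


def purity(tp, fp):
    """Fraction of objects classified as Ia that are truly Ia (precision)."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def efficiency(tp, fn):
    """Fraction of true Ia that are recovered (recall)."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def make_data(n, rng):
    """Synthetic (feature, label) pairs; label 1 = Ia, 0 = non-Ia (assumed)."""
    data = []
    for _ in range(n):
        y = 1 if rng.random() < 0.3 else 0
        data.append((rng.gauss(1.0 if y else -1.0, 1.0), y))
    return data


def fit_centroids(train):
    """Toy classifier: mean feature value of each class."""
    means = {}
    for label in (0, 1):
        values = [x for x, y in train if y == label]
        means[label] = sum(values) / len(values)
    return means


def prob_ia(means, x):
    """Pseudo-probability of class 1 from the distances to both centroids."""
    d0, d1 = abs(x - means[0]), abs(x - means[1])
    return 1.0 / (1.0 + math.exp(d1 - d0))


def active_learning(pool, initial, n_iter, strategy, rng):
    """Move one object per iteration from the pool to the training sample."""
    train, pool = list(initial), list(pool)
    for _ in range(n_iter):
        means = fit_centroids(train)
        if strategy == "uncertainty":
            # Query the object whose predicted probability is closest to 0.5.
            idx = min(range(len(pool)),
                      key=lambda i: abs(prob_ia(means, pool[i][0]) - 0.5))
        else:
            # Random sampling baseline.
            idx = rng.randrange(len(pool))
        train.append(pool.pop(idx))
    return train, pool


rng = random.Random(0)
data = make_data(2000, rng)

# Initial sample: 5 Ia and 5 non-Ia, as in the abstract.
ia_idx = [i for i, d in enumerate(data) if d[1] == 1][:5]
non_idx = [i for i, d in enumerate(data) if d[1] == 0][:5]
seed_idx = set(ia_idx + non_idx)
initial = [data[i] for i in sorted(seed_idx)]
pool = [d for i, d in enumerate(data) if i not in seed_idx]

train, test = active_learning(pool, initial, 300, "uncertainty", rng)

# Evaluate on whatever was never queried.
means = fit_centroids(train)
tp = sum(1 for x, y in test if prob_ia(means, x) > 0.5 and y == 1)
fp = sum(1 for x, y in test if prob_ia(means, x) > 0.5 and y == 0)
fn = sum(1 for x, y in test if prob_ia(means, x) <= 0.5 and y == 1)
print(len(train), len(test), purity(tp, fp), efficiency(tp, fn))
```

With 10 initial objects and 300 queries the training sample ends with 310 objects, mirroring the 310/23 530 split quoted above. Under the same definition of purity, the reported follow-up numbers give 459 / 535 ≈ 0.86. Passing `"random"` as the strategy runs the baseline the abstract compares against.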
-
Fink, a new generation of broker for the LSST community
Authors:
Anais Möller,
Julien Peloton,
Emille E. O. Ishida,
Chris Arnault,
Etienne Bachelet,
Tristan Blaineau,
Dominique Boutigny,
Abhishek Chauhan,
Emmanuel Gangler,
Fabio Hernandez,
Julius Hrivnac,
Marco Leoni,
Nicolas Leroy,
Marc Moniez,
Sacha Pateyron,
Adrien Ramparison,
Damien Turpin,
Réza Ansari,
Tarek Allam Jr.,
Armelle Bajat,
Biswajit Biswas,
Alexandre Boucaud,
Johan Bregeon,
Jean-Eric Campagne,
Johann Cohen-Tanugi, et al. (11 additional authors not shown)
Abstract:
Fink is a broker designed to enable science with large time-domain alert streams such as the one from the upcoming Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). It exhibits traditional astronomy broker features such as automatised ingestion, annotation, selection and redistribution of promising alerts for transient science. It is also designed to go beyond traditional broker features by providing real-time transient classification, which is continuously improved by using state-of-the-art Deep Learning and Adaptive Learning techniques. These evolving added values will enable more accurate scientific output from LSST photometric data for diverse science cases, while also leading to a higher incidence of new discoveries as the survey evolves. In this paper we introduce Fink, its science motivation, architecture and current status, including first science verification cases using the Zwicky Transient Facility alert stream.
Submitted 16 December, 2020; v1 submitted 21 September, 2020;
originally announced September 2020.