-
Probabilistic solar flare forecasting using historical magnetogram data
Authors:
Kiera van der Sande,
Andrés Muñoz-Jaramillo,
Subhamoy Chatterjee
Abstract:
Solar flare forecasting research using machine learning (ML) has focused on high resolution magnetogram data from the SDO/HMI era covering Solar Cycle 24 and the start of Solar Cycle 25, with some efforts looking back to SOHO/MDI for data from Solar Cycle 23. In this paper, we consider over 4 solar cycles of daily historical magnetogram data from multiple instruments. This is the first attempt to take advantage of this historical data for ML-based flare forecasting. We apply a convolutional neural network (CNN) to extract features from full-disk magnetograms together with a logistic regression model to incorporate scalar features based on magnetograms and flaring history. We use an ensemble approach to generate calibrated probabilistic forecasts of M-class or larger flares in the next 24 hours. Overall, we find that including historical data improves forecasting skill and reliability. We show that single frame magnetograms do not contain significantly more relevant information than can be summarized in a small number of scalar features, and that flaring history has greater predictive power than our CNN-extracted features. This indicates the importance of including temporal information in flare forecasting models.
Submitted 29 August, 2023;
originally announced August 2023.
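The abstract above refers to calibrated probabilistic forecasts and to forecasting skill and reliability. As a rough illustration of how such quantities are commonly scored, the sketch below computes a Brier score, a Brier skill score against a climatological baseline, and a simple reliability table. The function names and the sample numbers are hypothetical and are not taken from the paper; the paper's own evaluation may use different metrics or binning.

```python
import numpy as np

def brier_score(prob, outcome):
    """Mean squared difference between forecast probability and 0/1 outcome."""
    prob = np.asarray(prob, dtype=float)
    outcome = np.asarray(outcome, dtype=float)
    return float(np.mean((prob - outcome) ** 2))

def brier_skill_score(prob, outcome):
    """Skill relative to always forecasting the observed event rate (climatology)."""
    outcome = np.asarray(outcome, dtype=float)
    climatology = np.full_like(outcome, outcome.mean())
    return 1.0 - brier_score(prob, outcome) / brier_score(climatology, outcome)

def reliability_table(prob, outcome, n_bins=5):
    """Per non-empty bin: (mean forecast probability, observed frequency, count)."""
    prob = np.asarray(prob, dtype=float)
    outcome = np.asarray(outcome, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    idx = np.clip(np.digitize(prob, edges) - 1, 0, n_bins - 1)
    rows = []
    for b in range(n_bins):
        mask = idx == b
        if mask.any():
            rows.append((float(prob[mask].mean()),
                         float(outcome[mask].mean()),
                         int(mask.sum())))
    return rows

# Made-up forecasts and outcomes, for illustration only.
example_prob = [0.1, 0.3, 0.9, 0.05, 0.7]
example_outcome = [0, 0, 1, 0, 1]
print(brier_score(example_prob, example_outcome))
print(brier_skill_score(example_prob, example_outcome))
print(reliability_table(example_prob, example_outcome))
```

A well-calibrated forecast has observed frequencies close to the mean forecast probability in every bin; a positive Brier skill score means the forecast beats the climatological baseline.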
-
Decreasing False Alarm Rates in ML-based Solar Flare Prediction using SDO/HMI Data
Authors:
Varad Deshmukh,
Natasha Flyer,
Kiera Van Der Sande,
Thomas Berger
Abstract:
A hybrid two-stage machine learning architecture that addresses the problem of excessive false positives (false alarms) in solar flare prediction systems is investigated. The first stage is a convolutional neural network (CNN) model based on the VGG-16 architecture that extracts features from a temporal stack of consecutive Solar Dynamics Observatory (SDO) Helioseismic and Magnetic Imager (HMI) magnetogram images to produce a flaring probability. The probability of flaring is added to a feature vector derived from the magnetograms to train an extremely randomized trees (ERT) model in the second stage to produce a binary deterministic prediction (flare/no flare) in a 12-hour forecast window. To tune the hyperparameters of the architecture, a new evaluation metric is introduced, the "scaled True Skill Statistic". It specifically addresses the large discrepancy between the true positive rate and the false positive rate in the highly unbalanced solar flare event training datasets. Through hyperparameter tuning to maximize this new metric, our two-stage architecture drastically reduces false positives by $\approx 48\%$ without significantly affecting the true positives (reduction by $\approx 12\%$), when compared with predictions from the first-stage CNN alone. This, in turn, improves various traditional binary classification metrics sensitive to false positives, such as the precision, F1 and the Heidke Skill Score. The end result is a more robust 12-hour flare prediction system that could be combined with current operational flare forecasting methods. Additionally, using the ERT-based feature ranking mechanism, we show that the CNN output probability is highly ranked in terms of flare prediction relevance.
Submitted 20 November, 2021;
originally announced November 2021.
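The abstract above names several binary classification metrics (precision, F1, the Heidke Skill Score and the True Skill Statistic). The sketch below computes their standard textbook forms from confusion-matrix counts. The "scaled True Skill Statistic" is not defined in the abstract, so it is deliberately not reproduced here. The counts are made up: they only mimic a 48% drop in false positives and a 12% drop in true positives, and are not results from the paper.

```python
def binary_metrics(tp, fn, fp, tn):
    """Standard metrics from confusion-matrix counts (all counts assumed > 0)."""
    tpr = tp / (tp + fn)                      # true positive rate (recall)
    fpr = fp / (fp + tn)                      # false positive rate
    precision = tp / (tp + fp)
    f1 = 2 * tp / (2 * tp + fp + fn)
    tss = tpr - fpr                           # True Skill Statistic
    hss = (2 * (tp * tn - fn * fp)            # Heidke Skill Score
           / ((tp + fn) * (fn + tn) + (tp + fp) * (fp + tn)))
    return {"precision": precision, "f1": f1, "tss": tss, "hss": hss}

# Hypothetical counts on an imbalanced set of 10,000 samples with 120 flares.
before = binary_metrics(tp=100, fn=20, fp=500, tn=9380)
# Same set after removing 48% of false positives and 12% of true positives.
after = binary_metrics(tp=88, fn=32, fp=260, tn=9620)

for name in before:
    print(f"{name}: {before[name]:.3f} -> {after[name]:.3f}")
```

With these invented counts, precision, F1 and the Heidke Skill Score rise while the plain True Skill Statistic falls slightly, because under heavy class imbalance the false positive rate is small and barely registers in the TSS. That is one way to see why a metric that weights the two rates differently might be wanted.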
-
Fourier Continuation Discontinuous Galerkin Methods for Linear Hyperbolic Problems
Authors:
Daniel Appelo,
Kiera van der Sande,
Nathan Albin
Abstract:
Fourier continuation is an approach used to create periodic extensions of non-periodic functions in order to obtain highly accurate Fourier expansions. These methods have been used in PDE solvers and have demonstrated high-order convergence and spectrally accurate dispersion relations in numerical experiments. Discontinuous Galerkin (DG) methods are increasingly used for solving PDEs and, like all Galerkin formulations, come with a strong framework for proving stability and convergence. Here we propose the use of Fourier continuation in forming a new basis for the DG framework.
Submitted 30 April, 2021;
originally announced May 2021.
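To make the idea of a periodic extension of a non-periodic function concrete, the sketch below fits a Fourier series of period 2 to samples of exp(x) on [0, 1] by regularized least squares (often called a least-squares Fourier extension). This is a generic illustration and is not claimed to be the continuation procedure or the DG basis used in the paper; the function names, the period, the number of modes and the regularization threshold are all assumptions chosen for the example.

```python
import numpy as np

def fourier_extension_fit(f_vals, x, period=2.0, n_modes=20, rcond=1e-12):
    """Least-squares fit of sum_k c_k exp(2*pi*i*k*x/period) to samples on x."""
    k = np.arange(-n_modes, n_modes + 1)
    A = np.exp(2j * np.pi * np.outer(x, k) / period)
    # The system is ill-conditioned; rcond truncates tiny singular values.
    coeffs, *_ = np.linalg.lstsq(A, np.asarray(f_vals, dtype=complex), rcond=rcond)
    return k, coeffs

def fourier_extension_eval(k, coeffs, x, period=2.0):
    """Evaluate the fitted series; the data are real, so keep the real part."""
    A = np.exp(2j * np.pi * np.outer(x, k) / period)
    return (A @ coeffs).real

# exp(x) is not periodic on [0, 1], so a plain Fourier series there converges slowly.
x_fit = np.linspace(0.0, 1.0, 200)
k, c = fourier_extension_fit(np.exp(x_fit), x_fit)

# Check the approximation on a finer grid inside the original interval.
x_test = np.linspace(0.0, 1.0, 1001)
err = np.max(np.abs(fourier_extension_eval(k, c, x_test) - np.exp(x_test)))
print(f"max error on [0, 1]: {err:.2e}")
```

The fitted series is periodic on the longer interval [0, 2] but is only required to match the function on [0, 1], which is what allows rapid convergence without Gibbs oscillations at the endpoints.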
-
Dynamic soliton-mean flow interaction with nonconvex flux
Authors:
Kiera van der Sande,
Gennady A. El,
Mark A. Hoefer
Abstract:
The interaction of localised solitary waves with large-scale, time-varying dispersive mean flows subject to nonconvex flux is studied in the framework of the modified Korteweg-de Vries (mKdV) equation, a canonical model for nonlinear internal gravity wave propagation in stratified fluids. The principal feature of the studied interaction is that both the solitary wave and the large-scale mean flow -- a rarefaction wave or a dispersive shock wave (undular bore) -- are described by the same dispersive hydrodynamic equation. A recent theoretical and experimental study of this new type of dynamic soliton-mean flow interaction has revealed two main scenarios: the solitary wave either tunnels through the varying mean flow that connects two constant asymptotic states, or remains trapped inside it. While the previous work considered convex systems, in this paper it is demonstrated that the presence of a nonconvex hydrodynamic flux introduces significant modifications to the scenarios for transmission and trapping. A reduced set of Whitham modulation equations, termed the solitonic modulation system, is used to formulate a general, approximate mathematical framework for solitary wave-mean flow interaction with nonconvex flux. Solitary wave trapping is conveniently stated in terms of crossing characteristics for the solitonic system. Numerical simulations of the mKdV equation agree with the predictions of modulation theory. The developed theory draws upon general properties of dispersive hydrodynamic partial differential equations, not on the complete integrability of the mKdV equation. As such, the mathematical framework developed here enables application to other fluid dynamic contexts subject to nonconvex flux.
Submitted 28 February, 2021;
originally announced March 2021.
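For orientation on what "nonconvex flux" means in the abstract above, one common normalization of the mKdV equation can be written in conservation-law form with a cubic flux. Sign and scaling conventions vary between references, so the paper's own normalization may differ from the one assumed here.

```latex
u_t + \bigl(f(u)\bigr)_x + u_{xxx} = 0, \qquad f(u) = 2u^3, \qquad f''(u) = 12u .
```

Because $f''(u)$ changes sign at $u = 0$, the flux is nonconvex. By contrast, the KdV equation in the normalization $u_t + 6uu_x + u_{xxx} = 0$ has flux $f(u) = 3u^2$ with $f''(u) = 6 > 0$, which is convex everywhere.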
-
Fast variable density 3-D node generation
Authors:
Kiera van der Sande,
Bengt Fornberg
Abstract:
Mesh-free solvers for partial differential equations perform best on scattered quasi-uniform nodes. Computational efficiency can be improved by using nodes with greater spacing in regions of less activity. We present an advancing front type method to generate variable density nodes in 2-D and 3-D with clear generalization to higher dimensions. The exhibited cost of generating a node set of size $N$ in 2-D and 3-D with the present method is $O(N)$.
Submitted 9 September, 2020; v1 submitted 3 June, 2019;
originally announced June 2019.
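To illustrate what a variable density node set looks like, the sketch below uses naive rejection sampling (dart throwing) in the unit square: a random candidate is kept only if it lies at least a prescribed local spacing away from every node accepted so far. This is explicitly not the advancing front method of the paper, and its cost grows roughly quadratically with the number of nodes rather than linearly. The spacing function and all names are hypothetical.

```python
import numpy as np

def spacing(p):
    """Hypothetical target spacing: finer near x = 0, coarser near x = 1."""
    return 0.02 + 0.08 * p[0]

def dart_throwing_nodes(radius_fn, n_candidates=5000, seed=0):
    """Accept random candidates that respect the local spacing (naive O(N^2))."""
    rng = np.random.default_rng(seed)
    nodes = np.empty((n_candidates, 2))
    count = 0
    for cand in rng.random((n_candidates, 2)):
        # Distances from the candidate to all nodes accepted so far.
        dist = np.linalg.norm(nodes[:count] - cand, axis=1)
        if np.all(dist >= radius_fn(cand)):   # vacuously true when count == 0
            nodes[count] = cand
            count += 1
    return nodes[:count]

nodes = dart_throwing_nodes(spacing)
left = int(np.sum(nodes[:, 0] < 0.5))
right = int(np.sum(nodes[:, 0] >= 0.5))
print(f"{len(nodes)} nodes: {left} in the left half, {right} in the right half")
```

The left half of the square, where the requested spacing is smaller, ends up with more nodes than the right half. A random-candidate scheme like this gives no guarantee of filling the domain and checks every candidate against all accepted nodes, which are the kinds of costs an advancing front approach is designed to avoid.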