-
Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach
Authors:
Francesco Pio Ramunno,
Paolo Massa,
Vitaliy Kinakh,
Brandon Panos,
André Csillaghy,
Slava Voloshynovskiy
Abstract:
The spatial properties of the solar magnetic field are crucial to decoding the physical processes in the solar interior and their interplanetary effects. However, observations from older instruments, such as the Michelson Doppler Imager (MDI), have limited spatial or temporal resolution, which hinders the ability to study small-scale solar features in detail. Super resolving these older datasets i…
▽ More
The spatial properties of the solar magnetic field are crucial to decoding the physical processes in the solar interior and their interplanetary effects. However, observations from older instruments, such as the Michelson Doppler Imager (MDI), have limited spatial or temporal resolution, which hinders the ability to study small-scale solar features in detail. Super resolving these older datasets is essential for uniform analysis across different solar cycles, enabling better characterization of solar flares, active regions, and magnetic network dynamics. In this work, we introduce a novel diffusion model approach for Super-Resolution and we apply it to MDI magnetograms to match the higher-resolution capabilities of the Helioseismic and Magnetic Imager (HMI). By training a Latent Diffusion Model (LDM) with residuals on downscaled HMI data and fine-tuning it with paired MDI/HMI data, we can enhance the resolution of MDI observations from 2"/pixel to 0.5"/pixel. We evaluate the quality of the reconstructed images by means of classical metrics (e.g., PSNR, SSIM, FID and LPIPS) and we check if physical properties, such as the unsigned magnetic flux or the size of an active region, are preserved. We compare our model with different variations of LDM and Denoising Diffusion Probabilistic models (DDPMs), but also with two deterministic architectures already used in the past for performing the Super-Resolution task. Furthermore, we show with an analysis in the Fourier domain that the LDM with residuals can resolve features smaller than 2", and due to the probabilistic nature of the LDM, we can asses their reliability, in contrast with the deterministic models. Future studies aim to super-resolve the temporal scale of the solar MDI instrument so that we can also have a better overview of the dynamics of the old events.
△ Less
Submitted 31 March, 2025;
originally announced March 2025.
-
Bridging the Gap: Examining Vision Foundation Models for Optical and Radio Astronomy Applications
Authors:
E. Lastufka,
O. Bait,
M. Drozdova,
V. Kinakh,
D. Piras,
M. Audard,
M. Dessauges-Zavadsky,
T. Holotyak,
D. Schaerer,
S. Voloshynovskiy
Abstract:
Vision foundation models, which have demonstrated significant potential in many multimedia applications, are often underutilized in the natural sciences. This is primarily due to mismatches between the nature of domain-specific scientific data and the typical training data used for foundation models, leading to distribution shifts. Scientific data often differ substantially in structure and charac…
▽ More
Vision foundation models, which have demonstrated significant potential in many multimedia applications, are often underutilized in the natural sciences. This is primarily due to mismatches between the nature of domain-specific scientific data and the typical training data used for foundation models, leading to distribution shifts. Scientific data often differ substantially in structure and characteristics, and researchers frequently face the challenge of optimizing model performance with limited labeled data of only a few hundred or thousand images. This work evaluates the performance of vision foundation models in astrophysics, with a focus on identifying the best practices for adapting them to domain-specific datasets. We aim to establish a framework for selecting, fine-tuning, and optimizing these models for common tasks in optical and radio astronomy. We compared multiple foundation models, including self-supervised, weakly supervised, and distillation-based architectures, across two representative optical and radio datasets. Experiments involved different fine-tuning strategies, projector heads, and data preprocessing techniques, with performance evaluated on classification and detection metrics. Features extracted by specific foundation models improved classification accuracy for optical galaxy images compared to conventional supervised training. Similarly, these models achieved equivalent or superior performance in object detection tasks with radio images. However, classification performance for radio galaxy images was generally poor, often falling short of supervised approaches. These findings demonstrate that vision foundation models can be effectively adapted to astrophysical applications, provided practitioners iterate on model selection, training strategies, and data handling.
△ Less
Submitted 9 January, 2025; v1 submitted 17 September, 2024;
originally announced September 2024.
-
Self-Supervised Learning on MeerKAT Wide-Field Continuum Images
Authors:
Erica Lastufka,
Omkar Bait,
Olga Taran,
Mariia Drozdova,
Vitaliy Kinakh,
Davide Piras,
Marc Audard,
Miroslava Dessauges-Zavadsky,
Taras Holotyak,
Daniel Schaerer,
Svyatoslav Voloshynovskiy
Abstract:
Self-supervised learning (SSL) applied to natural images has demonstrated a remarkable ability to learn meaningful, low-dimension representations without labels, resulting in models that are adaptable to many different tasks. Until now, applications of SSL to astronomical images have been limited to Galaxy Zoo datasets, which require a significant amount of pre-processing to prepare sparse images…
▽ More
Self-supervised learning (SSL) applied to natural images has demonstrated a remarkable ability to learn meaningful, low-dimension representations without labels, resulting in models that are adaptable to many different tasks. Until now, applications of SSL to astronomical images have been limited to Galaxy Zoo datasets, which require a significant amount of pre-processing to prepare sparse images centered on a single galaxy. With wide-field survey instruments at the forefront of the Square Kilometer Array (SKA) era, this approach to gathering training data is impractical. We demonstrate that continuum images from surveys like the MeerKAT Galactic Cluster Legacy Survey (MGCLS) can be successfully used with SSL, without extracting single-galaxy cutouts. Using the SSL framework DINO, we experiment with various preprocessing steps, augmentations, and architectures to determine the optimal approach for this data. We train both ResNet50 and Vision Transformer (ViT) backbones. Our models match state-of-the-art results (trained on Radio Galaxy Zoo) for FRI/FRII morphology classification. Furthermore, they predict the number of compact sources via linear regression with much higher accuracy. However, fine-tuning results in similar performance between our models, the state-of-the-art, and open-source models on multi-class morphology classification. Using source-rich crops from wide-field images to train multi-purpose models is an easily scalable approach that significantly reduces data preparation time. For the tasks evaluated in this work, twenty thousand crops is sufficient training data for models that produce results similar to state-of-the-art. In the future, complex tasks like source detection and characterization, together with domain-specific tasks, ought to demonstrate the true advantages of training models with radio astronomy data over natural-image foundation models.
△ Less
Submitted 12 August, 2024;
originally announced August 2024.
-
Magnetogram-to-Magnetogram: Generative Forecasting of Solar Evolution
Authors:
Francesco Pio Ramunno,
Hyun-Jin Jeong,
Stefan Hackstein,
André Csillaghy,
Svyatoslav Voloshynovskiy,
Manolis K. Georgoulis
Abstract:
Investigating the solar magnetic field is crucial to understand the physical processes in the solar interior as well as their effects on the interplanetary environment. We introduce a novel method to predict the evolution of the solar line-of-sight (LoS) magnetogram using image-to-image translation with Denoising Diffusion Probabilistic Models (DDPMs). Our approach combines "computer science metri…
▽ More
Investigating the solar magnetic field is crucial to understand the physical processes in the solar interior as well as their effects on the interplanetary environment. We introduce a novel method to predict the evolution of the solar line-of-sight (LoS) magnetogram using image-to-image translation with Denoising Diffusion Probabilistic Models (DDPMs). Our approach combines "computer science metrics" for image quality and "physics metrics" for physical accuracy to evaluate model performance. The results indicate that DDPMs are effective in maintaining the structural integrity, the dynamic range of solar magnetic fields, the magnetic flux and other physical features such as the size of the active regions, surpassing traditional persistence models, also in flaring situation. We aim to use deep learning not only for visualisation but as an integrative and interactive tool for telescopes, enhancing our understanding of unexpected physical events like solar flares. Future studies will aim to integrate more diverse solar data to refine the accuracy and applicability of our generative model.
△ Less
Submitted 16 July, 2024;
originally announced July 2024.
-
Solar synthetic imaging: Introducing denoising diffusion probabilistic models on SDO/AIA data
Authors:
Francesco P. Ramunno,
S. Hackstein,
V. Kinakh,
M. Drozdova,
G. Quetant,
A. Csillaghy,
S. Voloshynovskiy
Abstract:
Given the rarity of significant solar flares compared to smaller ones, training effective machine learning models for solar activity forecasting is challenging due to insufficient data. This study proposes using generative deep learning models, specifically a Denoising Diffusion Probabilistic Model (DDPM), to create synthetic images of solar phenomena, including flares of varying intensities. By e…
▽ More
Given the rarity of significant solar flares compared to smaller ones, training effective machine learning models for solar activity forecasting is challenging due to insufficient data. This study proposes using generative deep learning models, specifically a Denoising Diffusion Probabilistic Model (DDPM), to create synthetic images of solar phenomena, including flares of varying intensities. By employing a dataset from the AIA instrument aboard the SDO spacecraft, focusing on the 171 Å band that captures various solar activities, and classifying images with GOES X-ray measurements based on flare intensity, we aim to address the data scarcity issue. The DDPM's performance is evaluated using cluster metrics, Frechet Inception Distance (FID), and F1-score, showcasing promising results in generating realistic solar imagery. We conduct two experiments: one to train a supervised classifier for event identification and another for basic flare prediction, demonstrating the value of synthetic data in managing imbalanced datasets. This research underscores the potential of DDPMs in solar data analysis and forecasting, suggesting further exploration into their capabilities for solar flare prediction and application in other deep learning and physical tasks.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Radio-astronomical Image Reconstruction with Conditional Denoising Diffusion Model
Authors:
Mariia Drozdova,
Vitaliy Kinakh,
Omkar Bait,
Olga Taran,
Erica Lastufka,
Miroslava Dessauges-Zavadsky,
Taras Holotyak,
Daniel Schaerer,
Slava Voloshynovskiy
Abstract:
Reconstructing sky models from dirty radio images for accurate source localization and flux estimation is crucial for studying galaxy evolution at high redshift, especially in deep fields using instruments like the Atacama Large Millimetre Array (ALMA). With new projects like the Square Kilometre Array (SKA), there's a growing need for better source extraction methods. Current techniques, such as…
▽ More
Reconstructing sky models from dirty radio images for accurate source localization and flux estimation is crucial for studying galaxy evolution at high redshift, especially in deep fields using instruments like the Atacama Large Millimetre Array (ALMA). With new projects like the Square Kilometre Array (SKA), there's a growing need for better source extraction methods. Current techniques, such as CLEAN and PyBDSF, often fail to detect faint sources, highlighting the need for more accurate methods. This study proposes using stochastic neural networks to rebuild sky models directly from dirty images. This method can pinpoint radio sources and measure their fluxes with related uncertainties, marking a potential improvement in radio source characterization. We tested this approach on 10164 images simulated with the CASA tool simalma, based on ALMA's Cycle 5.3 antenna setup. We applied conditional Denoising Diffusion Probabilistic Models (DDPMs) for sky models reconstruction, then used Photutils to determine source coordinates and fluxes, assessing the model's performance across different water vapor levels. Our method showed excellence in source localization, achieving more than 90% completeness at a signal-to-noise ratio (SNR) as low as 2. It also surpassed PyBDSF in flux estimation, accurately identifying fluxes for 96% of sources in the test set, a significant improvement over CLEAN+ PyBDSF's 57%. Conditional DDPMs is a powerful tool for image-to-image translation, yielding accurate and robust characterisation of radio sources, and outperforming existing methodologies. While this study underscores its significant potential for applications in radio astronomy, we also acknowledge certain limitations that accompany its usage, suggesting directions for further refinement and research.
△ Less
Submitted 20 February, 2024; v1 submitted 15 February, 2024;
originally announced February 2024.
-
Challenging interferometric imaging: Machine learning-based source localization from uv-plane observations
Authors:
O. Taran,
O. Bait,
M. Dessauges-Zavadsky,
T. Holotyak,
D. Schaerer,
S. Voloshynovskiy
Abstract:
In our work, we examine, for the first time, the possibility of fast and efficient source localization directly from the uvobservations, omitting the recovering of the dirty or clean images. We propose a deep neural network-based framework that takes as its input a low-dimensional vector of sampled uvdata and outputs source positions on the sky. We investigated a representation of the complex-valu…
▽ More
In our work, we examine, for the first time, the possibility of fast and efficient source localization directly from the uvobservations, omitting the recovering of the dirty or clean images. We propose a deep neural network-based framework that takes as its input a low-dimensional vector of sampled uvdata and outputs source positions on the sky. We investigated a representation of the complex-valued input uv-data via the real and imaginary and the magnitude and phase components. We provided a comparison of the efficiency of the proposed framework with the traditional source localization pipeline based on the state-of-the-art Python Blob Detection and Source Finder (PyBDSF) method. The investigation was performed on a data set of 9164 sky models simulated using the Common Astronomy Software Applications (CASA) tool for the Atacama Large Millimeter Array (ALMA) Cycle 5.3 antenna configuration. We investigated two scenarios: (i) noise-free as an ideal case and (ii) sky simulations including noise representative of typical extra-galactic millimeter observations. In the noise-free case, the proposed localization framework demonstrates the same high performance as the state-of-the-art PyBDSF method. For noisy data, however, our new method demonstrates significantly better performance, achieving a completeness level that is three times higher for sources with uniform signal-to-noise (S/N) ratios between 1 and 10, and a high increase in completeness in the low S/N regime. Furthermore, the execution time of the proposed framework is significantly reduced (by factors about 30) as compared to traditional methods that include image reconstructions from the uv-plane and subsequent source detections.
△ Less
Submitted 5 May, 2023;
originally announced May 2023.
-
Exploring mutual information between IRIS spectral lines. I. Correlations between spectral lines during solar flares and within the quiet Sun
Authors:
Brandon Panos,
Lucia Kleint,
Sviatoslav Voloshynovskiy
Abstract:
Spectral lines allow us to probe the thermodynamics of the solar atmosphere, but the shape of a single spectral line may be similar for different thermodynamic solutions. Multiline analyses are therefore crucial, but computationally cumbersome. We investigate correlations between several chromospheric and transition region lines to restrain the thermodynamic solutions of the solar atmosphere durin…
▽ More
Spectral lines allow us to probe the thermodynamics of the solar atmosphere, but the shape of a single spectral line may be similar for different thermodynamic solutions. Multiline analyses are therefore crucial, but computationally cumbersome. We investigate correlations between several chromospheric and transition region lines to restrain the thermodynamic solutions of the solar atmosphere during flares. We used machine-learning methods to capture the statistical dependencies between 6 spectral lines sourced from 21 large solar flares observed by NASA's Interface Region Imaging Spectrograph (IRIS). The techniques are based on an information-theoretic quantity called mutual information (MI), which captures both linear and nonlinear correlations between spectral lines. The MI is estimated using both a categorical and numeric method, and performed separately for a collection of quiet Sun and flaring observations. Both approaches return consistent results, indicating weak correlations between spectral lines under quiet Sun conditions, and substantially enhanced correlations under flaring conditions, with some line-pairs such as Mg II and C II having a normalized MI score as high as 0.5. We find that certain spectral lines couple more readily than others, indicating a coherence in the solar atmosphere over many scale heights during flares, and that all line-pairs are correlated to the GOES derivative, indicating a positive relationship between correlation strength and energy input. Our methods provide a highly stable and flexible framework for quantifying dependencies between the physical quantities of the solar atmosphere, allowing us to obtain a three-dimensional picture of its state.
△ Less
Submitted 25 April, 2021;
originally announced April 2021.
-
Solar activity classification based on Mg II spectra: towards classification on compressed data
Authors:
Sergey Ivanov,
Maksym Tsizh,
Denis Ullmann,
Brandon Panos,
Slava Voloshynovskiy
Abstract:
Although large volumes of solar data are available for study, the vast majority of these data remain unlabeled and are therefore not amenable to supervised machine learning methods. Having a way to accurately and automatically classify spectra into categories related to solar activity is highly desirable and will assist and speed up future research efforts in solar physics. At the same time, the l…
▽ More
Although large volumes of solar data are available for study, the vast majority of these data remain unlabeled and are therefore not amenable to supervised machine learning methods. Having a way to accurately and automatically classify spectra into categories related to solar activity is highly desirable and will assist and speed up future research efforts in solar physics. At the same time, the large volume of raw observational data is a serious bottleneck for machine learning, requiring powerful computational means that are not at the disposal of many laboratories. Besides, the raw data communication imposes restrictions on real time data observations and requires considerable bandwidth and energy for the onboard solar observation systems. To solve these issues, we propose a framework to classify solar activity on compressed data. For this, we used a labeling scheme from a pre-existing vector quantization technique in conjunction with different machine learning algorithms to categorize spectra of singly-ionized magnesium Mg II measured by NASA's Interface Region Imaging Spectrograph satellite (IRIS) into five types of solar activity. Our training dataset is a human annotated list of 85 IRIS observations containing 29097 frames. The annotated types of Solar activities are active region, pre-flare activity, Solar flare, Sunspot, and quiet Sun. We compress these data and reduce its complexity before training classifiers. We found that the XGBoost classifier produces the most accurate results on the compressed data, yielding over a 95\% prediction rate, and outperforming other ML methods like convolution neural networks, K-nearest neighbors, naive Bayes classifiers, and SVM. We find that the classification performance on compressed and uncompressed data is comparable, implying the possibility of large compression rates for relatively low degrees of information loss.
△ Less
Submitted 22 June, 2021; v1 submitted 15 September, 2020;
originally announced September 2020.
-
Identifying typical Mg II flare spectra using machine learning
Authors:
B. Panos,
L. Kleint,
C. Huwyler,
S. Krucker,
M. Melchior,
D. Ullmann,
S. Voloshynovskiy
Abstract:
IRIS performs solar observations over a large range of atmospheric heights, including the chromosphere where the majority of flare energy is dissipated. The strong Mg II h&k spectral lines are capable of providing excellent atmospheric diagnostics, but have not been fully utilized for flaring atmospheres. We aim to investigate whether the physics of the chromosphere is identical for all flare obse…
▽ More
IRIS performs solar observations over a large range of atmospheric heights, including the chromosphere where the majority of flare energy is dissipated. The strong Mg II h&k spectral lines are capable of providing excellent atmospheric diagnostics, but have not been fully utilized for flaring atmospheres. We aim to investigate whether the physics of the chromosphere is identical for all flare observations by analyzing if there are certain spectra that occur in all flares. To achieve this, we automatically analyze hundreds of thousands of Mg II h&k line profiles from a set of 33 flares, and use a machine learning technique which we call supervised hierarchical k-means, to cluster all profile shapes. We identify a single peaked Mg II profile, in contrast to the double-peaked quiet Sun profiles, appearing in every flare. Additionally, we find extremely broad profiles with characteristic blue shifted central reversals appearing at the front of fast-moving flare ribbons. These profiles occur during the impulsive phase of the flare, and we present results of their temporal and spatial correlation with non-thermal hard X-ray signatures, suggesting that flare-accelerated electrons play an important role in the formation of these profiles. The ratio of the integrated Mg II h&k lines can also serve as an opacity diagnostic, and we find higher opacities during each flare maximum. Our study shows that machine learning is a powerful tool for large scale statistical solar analyses.
△ Less
Submitted 26 May, 2018;
originally announced May 2018.